Abstract

The paper presents data mining methods applied to gene selection for recognition of a particular type of prostate cancer on the basis of gene expression arrays. Several chosen methods of gene selection, including the Fisher method, correlation of gene with a class, application of the support vector machine and statistical hypotheses, are compared on the basis of clustering measures. The results of applying these individual selection methods are combined together to identify the most often selected genes forming the required pattern, best associated with the cancerous cases. This resulting pattern of selected gene lists is treated as the input data to the classifier, performing the task of the final recognition of the patterns. The numerical results of the recognition of prostate cancer from normal (reference) cases using the selected genes and the support vector machine confirm the good performance of the proposed gene selection approach

Details

Title
Data mining methods for gene selection on the basis of gene expression arrays
Author
Muszynski, Michal; Osowski, Stanislaw
Pages
657-668
Publication year
2014
Publication date
Sep 2014
Publisher
De Gruyter Brill Sp. z o.o., Paradigm Publishing Services
ISSN
1641876X
e-ISSN
20838492
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1625229649
Copyright
Copyright De Gruyter Open Sp. z o.o. Sep 2014