Content area

Abstract

Mammography is a widely used screening tool and is the gold standard for the early detection of breast cancer. The classification of breast masses into the benign and malignant categories is an important problem in the area of computer-aided diagnosis of breast cancer. A small dataset of 57 breast mass images, each with 22 features computed, was used in this investigation; the same dataset has been previously used in other studies. The extracted features relate to edge-sharpness, shape, and texture. The novelty of this paper is the adaptation and application of the classification technique called genetic programming (GP), which possesses feature selection implicitly. To refine the pool of features available to the GP classifier, we used feature-selection methods, including the introduction of three statistical measures--Student's t test, Kolmogorov-Smirnov test, and Kullback-Leibler divergence. Both the training and test accuracies obtained were high: above 99.5% for training and typically above 98% for test experiments. A leave-one-out experiment showed 97.3% success in the classification of benign masses and 95.0% success in the classification of malignant tumors. A shape feature known as fractional concavity was found to be the most important among those tested, since it was automatically selected by the GP classifier in almost every experiment.[PUBLICATION ABSTRACT]

Details

Title
Classification of breast masses in mammograms using genetic programming and feature selection
Author
Nandi, R J; Nandi, A K; Rangayyan, R M; Scutt, D
Pages
683-94
Publication year
2006
Publication date
Aug 2006
Publisher
Springer Nature B.V.
ISSN
01400118
e-ISSN
17410444
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
661178990
Copyright
International Federation for Medical and Biological Engineering 2006