About the Authors:
Qian Zhang
Roles Conceptualization, Investigation, Methodology
Affiliation: College of Mathematics and Statistics, Shaanxi Normal University, Xi’an, 710119, PR China
XiaoLi Yang
Roles Supervision, Writing – review & editing
* E-mail: [email protected]
Affiliation: College of Mathematics and Statistics, Shaanxi Normal University, Xi’an, 710119, PR China
ORCID logo https://orcid.org/0000-0002-6676-8384
ZhongKui Sun
Roles Supervision, Writing – review & editing
Affiliation: Department of Applied Mathematics, Northwestern Polytechnical University, Xi’an, 710129, PR China
Introduction
Alzheimer’s disease (AD) is an irreversible progressive neurodegenerative disease in the brain. Mild Cognitive Impairment (MCI), as a prodromal stage of AD, is the most challenging group, which can be divided into Early Mild Cognitive Impairment (EMCI) and Late Mild Cognitive Impairment (LMCI). It is well known that MCI has a high risk of developing into AD, thus identifying MCI is key to early detection and diagnosis of AD [1].
In recent years, there has been great effort to identify pathological markers of AD and MCI so as to better computer-aided diagnosis or prognosis of them. With the rapid development of neuroimaging technology, various kinds of neuroimaging modalities including magnetic resonance imaging (MRI), positron emission tomography (PET), electroencephalogram (EEG) and cerebrospinal fluid (CSF) have been widely applied to the auxiliary prediction and diagnosis of AD [2–5]. Particularly, structural MRI (sMRI), for its non-invasive and convenient to provide visual brain atrophy information of macroscopic tissue and structure, has become more and more popular in clinical diagnosis of AD [6–8]. Some researchers have devoted to distinguishing AD and MCI with sMRI data using different features and achieved some good classification results. For example, based on the structural image data of the subjects, Beheshti et al. have made the classification accuracy of AD versus NC reach 84.07% by extracting the feature of the gray matter similarity matrix. Then the Mini-mental State Examination (MMSE) scale has been further considered, and the classification of patients with AD versus NC has reached the accuracy of 97.01% [9]. Tang et al. have selected shape differences and heterosexual fraction values of hippocampus and amygdala as features, and achieved the accuracy of 96.4% in AD versus NC classification experiments with SVM classifier [10]. By selecting Automated Anatomical Labeling (AAL) template as Region of Interest (ROI), previous works have taken functional connectivity as feature and achieved the classification accuracy of 82% and 91% for AD versus non-AD and NC versus MCI subjects, respectively [11,12]. Visser et al. have found that patients with MCI have medial temporal atrophy and memory impairment, and the feature of memory impairment is a better predictor of dementia than that of medial temporal lobe, moreover, the combination of them can improve the prediction accuracy of AD [13].
As well known that loss of nerve cells and synapses is common in MCI and AD patients, which results in the phenomenon of brain atrophy. The degree of brain atrophy can be effectively measured by gray matter (GM) volume. Some works in the literature have validated that gray matter in global and local brain regions is lost for MCI and AD subjects [14–16]. Through the follow-up of NC or MCI subjects and the measurement of the volume of gray matter in hippocampus, amygdala or temporal lobe, previous works have reported that the elderly will have physiological brain atrophy with age [17,18]. Butto et al. have found that the temporal lobe gray matter is decreased in AD patients by voxel-based morphometric (VBM) method [19]. Thus, gray matter volume has been employed as a kind of anatomical feature to effectively assist early prediction of AD. By classifying the gray matter volume of some brain structures, Schmitter et al. have improved the classification accuracy of AD versus MCI group and early AD versus late AD group, respectively [20]. By combing the gray matter volume with average intensity and CSF, Suk et al. have confirmed the classification accuracy of AD is greatly improved based on MRI and PET [21].
On the other hand, there exist left and right asymmetry in brain structure and function which is termed as the phenomenon of lateralization. Ratnarajah et al. have studied the asymmetry of brain structural connection in normal newborns [22]. Iturria-Medina et al. have explored the asymmetry of hemispheric structures between human and nonhuman primates [23]. By introducing the lateralization index to quantify the asymmetry, Kim et al. have found the cortical asymmetry between NC, MCI and AD patients [24]. Though the asymmetry is of significance in brain structure and function, this characteristic of asymmetry has rarely been attempted to recognize AD, especially combining the lateralization index with some anatomical features to assist prediction and early diagnosis of AD. On the same time, previous works have indicated that the classification accuracy of AD and MCI is usually influenced by various factors including subject data, feature type and experimental algorithms [25,26]. Therefore, in this work we devote to selecting the lateralization index and the GM volume as features, and then establishing several classifier models including Random Forest (RF), Decision Tree (DT), K-Nearest Neighbor(KNN) and Support vector machine (SVM) with RBF kernel, Linear kernel or Polynomial kernel, so as to realize binary classification prediction among NC, EMCI, LMCI and AD subjects from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. The main purpose of this work is to explore how the classification features and classifier models affect the classification accuracy.
The structure of this work is as follows. Section 2 presents preparatory work and classification algorithm used in this work. The classification performances are illustrated in Section 3 in detail. Finally, conclusion and discussion are address in Section 4.
Preparatory work and classification algorithm
In this section, some preparatory works for the classification experiment are firstly present, then the detailed experimental algorithm is illustrated.
Data acquisition
Experimental data employed in this work are obtained from the ADNI database (ADNI, http://adni.loni.usc.edu/). The primary purpose of ADNI is to collect, validate and utilize data, including MRI and PET images, genetics, cognitive tests, CSF and blood biomarkers, so as to define the progression of AD [2,5,27]. ADNI results from combined efforts of numerous co-investigators from an extensive range of academic institutions and private corporations. Subjects recruiting in ADNI include Alzheimer’s disease patients, mild cognitive impairment subjects and elderly controls.
In the present work, the data sample contains 203 structural MRI images from 94 subjects ranged from 55 to 90 years old from the ADNI dataset. For each subject multiple images are collected at different time points. As indicated by the ADNI dataset, the cognitive function outcomes of MMSE (Mini-Mental State Examination) and CDR (Clinical Dementia Rating) for the same subjects vary in different ages. Previous works reported that in the follow-up of NC or MCI subjects the elderly will have physiological brain atrophy with age by measuring the volume of gray matter in hippocampus, amygdala or temporal lobe [17,18]. These findings indicates that the MRI scanning status of the same subject may change over time. Among these structural images, 67 images are for 27 NC subjects (12 males and 15 females), 58 images are for 27 EMCI subjects (18 males and 9 females), 39 images are for 18 LMCI subjects (10 males and 8 females) and 39 images are for 22 AD subjects (12 males and 10 females). The three dimensional T1-weighted anatomical magnetization prepared rapid gradient echo (MPRAGE) imaging parameters are as follows: the three dimension of sMRI images is 256×256×170, the flip angle is 9°, the slice thickness is1.2mm, the TR/TE is 6.8/3.1 and the voxel size is 1mm×1mm×1.2mm.
Image pre-processing
Pre-processing of MRI samples is performed in MATLAB 2016b environment by Computational Anatomy Toolbox version 12 (CAT12, http://dbm.neuro.uni-jena.de/cat) [28]. CAT 12 is a popular neuroimaging analysis software that can realize the Voxel-based Morphometric (VBM) pipeline, which runs in the Statistical Parametric Mapping toolbox version12 (SPM12). It is commonly applied to investigate the distribution of GM and calculate the volume of GM. The detailed processing steps include format conversion, skull removal, tissue segmentation, dartel registration and normalization. Then based on the AAL atlas the GM volume of 90 brain anatomical are acquired.
Lateralization index
As stated in the Introduction, the phenomenon of lateralization exists in brain for its left and right asymmetry of structure and function. The asymmetry is quantified by lateralization index. The Lateralization Index (LI) for gray matter volume is defined as follows [24]Where L and R represent the values of gray matter volume of the corresponding brain regions in the left and right hemispheres, respectively. This LI can quantitatively measure the difference of GM volume between the left and right hemispheres of the brain. A positive LI means that the GM volume for the left hemisphere is more prominent than the right hemisphere. The brain region with positive LI is called the left asymmetric region, otherwise referred to as the right asymmetric region.
Feature extraction
It’s necessary to extract feature for image classification. In this work, two hypothesis testing methods, i.e., one way ANOVA and the independent sample non-parametric test of Kruskal-Wallis, are employed to perform features extraction using 203 structural MRI scans [29]. For the samples of GM volume and LI in every brain region for four groups of NC, LMCI, EMCI and AD subjects, the condition of normality and variance homogeneity are firstly examined. If this condition is met, the ANOVA as well as the LSD-t multiple comparisons are carried out for the samples within one brain region so as to check whether there is significant difference between the four groups. Otherwise, the Kruskal-Wallis test is conducted. The threshold p value is set to 0.05 in the above hypothesis test. On the basis of IBM SPSS Statistics 25, the features including 64 indicators for the GM volume and 15 indicators for the LI are obtained.
Classification algorithm
As indicated in the algorithm flow chart of Fig 1, the detailed classification algorithm for the considered four test groups are as follows. Just as stated in the above, the data samples containing 203 structural MRI images from 94 different subjects are firstly obtained from the ADNI dataset. Then, the MRI samples are pre-processed. After processing, calculate the GM volume on the AAL brain atlas, and then calculate the LI feature through the GM volume. Statistical analysis of ANOVA and the Kruskal-Wallis test is followed to extract classification feature. The obtained features of the GM volume in 64 different brain regions and LI for 15 different brain regions are normalized. Next, three classifier algorithms including RF, DT, KNN and SVM with RBF kernel, Linear kernel or Polynomial kernel are employed to realize binary classification among NC, EMCI, LMCI and AD groups. Using the 10-fold cross-validation method, the classification accuracy for each classifier algorithm is evaluated over 50 realizations of the result of 10-fold cross-validation. At last, the classification accuracies using the single feature of GM volume or LI are compared with that using two combined features of GM volume and LI.
[Figure omitted. See PDF.]
Fig 1. The classification algorithm flow chart for binary classification among NC, EMCI, LMCI and AD groups.
https://doi.org/10.1371/journal.pone.0262722.g001
In this work, the turning parameters for these employed classifier models are as follows. The turning parameters of SVM classifier with RBF kernel are optimized by grid search technique. The regularization parameter—C and the kernel coefficient—gamma are respectively optimized in the range of [2^Cmin, 2^Cmax] and [2^Gmin, 2^Gmax], in which the default values are Cmin = -8, Cmax = 8, Gmin = - 8 and Gmax = 8. Besides the SVM classifier with RBF kernel, the tuning parameters for other classifier algorithms are selected to optimize the classifier accuracy. For SVM classifier with polynomial kernel, the turning parameters of degree, regularization parameter—C and the kernel coefficient–gamma are chosen as 3, 1and 0.5, respectively. The tuning parameter of minleaf for DT classifier is 3. For RF classifier, the number of trees is 500, the max depth of each tree is default for the small sample size. The tuning parameter k for KNN classifier is 1.
Classification performances
In this section, the classification performances are presented in detail. They include binary classification results across six test groups, comparison of classification results between different subject groups and different classification algorithms, and comparison with other works in the literature.
Binary classification results
In this section, several classifier algorithms including RF, DT, KNN and SVM with RBF kernel, Linear kernel or Polynomial kernel are used to realize binary classification among NC, EMCI, LMCI and AD groups. The classification accuracies are outlined in Tables 1–6. The above two lines denotes the accuracies in the case that the single feature of GM volume in 64 different brain regions and the single feature of LI for 15 different brain regions are respectively considered, whereas the below line denotes the accuracies in the case that the newly introduced feature of LI is integrated into the original feature of GM volume.
[Figure omitted. See PDF.]
Table 1. Classification results in NC versus EMCI under three kinds of features(%).
https://doi.org/10.1371/journal.pone.0262722.t001
[Figure omitted. See PDF.]
Table 2. Classification results in NC versus LMCI under three kinds of features(%).
https://doi.org/10.1371/journal.pone.0262722.t002
[Figure omitted. See PDF.]
Table 3. Classification results in EMCI versus LMCI under three kinds of features(%).
https://doi.org/10.1371/journal.pone.0262722.t003
[Figure omitted. See PDF.]
Table 4. Classification results in EMCI versus AD under three kinds of features(%).
https://doi.org/10.1371/journal.pone.0262722.t004
[Figure omitted. See PDF.]
Table 5. Classification results in LMCI versus AD under three kinds of features(%).
https://doi.org/10.1371/journal.pone.0262722.t005
[Figure omitted. See PDF.]
Table 6. Classification results in AD versus NC under three kinds of features(%).
https://doi.org/10.1371/journal.pone.0262722.t006
Classification results in NC versus EMCI.
Table 1 shows the classification accuracies of NC group versus EMCI group under different classification algorithms. In the case of single feature of GM volume, the results indicate that the KNN classifier has the highest accuracy of 95.4269%. In the case of single feature LI, the results indicate that the SVM classifier with Polynomial kernel has the highest accuracy of 91.2821%. In the two cases, the lowest accuracies of 75.0244% and 73.4821% are all obtained by DT classifier. In the case of the combined feature of GM volume and LI, the highest accuracy and the lowest one are still achieved by KNN classifier and DT classifier, which are 95.491% and 74.6538%, respectively. What’s more, with the introduction of LI feature into GM volume feature, the classification accuracies under SVM classifiers with RBF kernel, Linear kernel or Polynomial kernel as well as RF classifier are improved significantly compared to those in the feature of GM, among which the accuracy under SVM classifier with Linear kernel has the most enhancement of 2.3718%, whereas the prediction accuracy under DT classifier is slightly decreased, with a decrease of 0.3706%.
Classification results in NC versus LMCI.
The classification accuracies of NC group versus LMCI group under different classification models are displayed in Table 2. Under the circumstance of single feature of GM volume and single feature of LI, it turns out that the KNN classifier has the highest accuracy of 93.8364% and 96.1564% respectively, and the DT classifier has the lowest accuracy of 75.5800% and 77.0636% respectively. Under the circumstance of the combined feature of GM volume and LI, the DT classifier still has the lowest accuracy of 77.1491%, while the highest accuracy is achieved by SVM classifier with RBF kernel has the accuracy of 96.0909%. Furthermore, with the introduction of LI feature into the GM volume feature, the classification accuracies under RF classifier, DT classifier, KNN classifier as well as SVM classifiers with RBF kernel, Linear kernel or Polynomial kernel are all improved remarkably compared to those in the feature of GM. Especially the SVM classifier with RBF kernel has the most enhancement of 5.6364%.
Classification results in EMCI versus LMCI.
Table 3 illustrates the classification accuracies of EMCI group versus LMCI group under three classification algorithms. Under the single feature of GM volume, the experimental results indicate that the SVM classifier with Polynomial kernel has the highest accuracy of 94.7778%, and the DT classifier has the lowest accuracy of 76.0711%. In the case of LI feature, KNN classifier has the highest classification accuracy of 91.2711% and DT has the lowest classification accuracy of 69.48%. Under the combined feature of GM volume and LI, the SVM classifier with Polynomial kernel and the DT classifier can still achieve the highest accuracy and the lowest one, which are 94.7778% and 74.0289%, respectively. With the addition of LI feature to GM volume, compared to those in the feature of GM the prediction accuracies of under SVM classifier with Polynomial kernel is unchanged, the prediction accuracies under RF classifier and KNN classifier are slightly improved, while the accuracies of DT classifier, SVM classifiers with Linear kernel and RBF kernel are decreased by 0.8889%, 2.1111% and 2.0422% respectively.
Classification results in EMCI versus AD.
The classification accuracies of EMCI group versus AD group are depicted in Table 4 under different classification models. In the situation of single feature of GM volume, the results show that the KNN classifier has the highest accuracy of 92.8822%, and the SVM classifier with RBF kernel has the lowest accuracy of 85.6667%. In the case of LI feature, SVM classifier with RBF kernel has the highest classification accuracy and DT has the lowest classification accuracy, which are 91% and 73.6178%, respectively. In the situation of the combined feature of GM volume and LI, the SVM classifier with RBF kernel still has the lowest accuracy of 85.8889%, while the highest accuracy of 94.2244% is achieved by KNN classifier. As the LI feature is introduced into the GM volume feature, all the experimental accuracies of these classifiers are improved compared to those in the feature of GM, among which the most enhancement of 5.2222% is achieved by SVM classifier with Linear kernel.
Classification results in LMCI versus AD.
The classification accuracies of LMCI group versus AD group are presented in Table 5 under different classifier algorithms. In the case of single feature of GM volume, the results suggest that the SVM classifier with Polynomial kernel has the highest accuracy of 92.3214%, and the SVM classifier with RBF kernel has the lowest accuracy of 84.4643%. Under the circumstance of single feature of LI, SVM with RBF kernel has the highest classification accuracy of 89.8214%, while SVM classifier with Linear kernel has the lowest accuracy of 58.0357%. With the introduction of LI feature into the GM volume feature, the highest accuracy is 93.7500% and the lowest one is 87.3179%, which are obtained by SVM with Linear kernel and DT classifier, respectively. The accuracies under RF classifier and SVM classifier with Polynomial kernel remain basically unchanged, the accuracy under KNN classifier is slightly decreased, however the classification accuracies under DT classifier, SVM classifier with RBF kernel or Linear kernel are improved significantly, which are 0.9858%, 3.8% and 5.1786%, respectively.
Classification results in AD versus NC.
Table 6 depicts the classification accuracies of AD group versus NC group under different classifier algorithms. Under the single feature of GM volume, the results suggest that the KNN classifier has the highest accuracy of 98.08%, and DT classifier have the lowest accuracy of 90.9709%. Under the single feature of LI, the results show that the accuracy of SVM with RBF kernel has the highest accuracy of 94.3636%, while DT classifier have the lowest accuracy of 74.7873%. Moreover, when the LI feature is introduced into the GM volume feature, the highest accuracy and the lowest one are achieved by KNN classifier and DT classifier, which are 98.2564% and 91.1018% respectively. All the accuracies under different classification algorithms have been enhanced, with the most enhancement of 2% under SVM classifier with Linear kernel. In addition, when the LI feature is introduced into the GM volume feature, all the accuracies under five classification algorithms have been enhanced, with the most enhancement of 2% under SVM classifier with RBF kernel.
Comparison of classification performances
In the following, firstly the classification performances between different subject groups are compared when fixing each classifier model. Then the classification performances between different classification algorithms are compared when fixing each binary subject groups. At last, the current classification performances are compared with the previous results in the literature.
Comparison between different subject groups.
When fixing the SVM classifier with RBF kernel, linear kernel and polynomial kernel, RF classifier, DT classifier and KNN classifier, the classification accuracies for binary subject groups NC versus EMCI, NC versus LMCI, EMCI versus LMCI, EMCI versus AD, LMCI versus AD, AD versus NC are depicted in Fig 2A–2F, respectively. Comparing with these figures, some results can be obtained as follows. (1) For each classifier algorithm, the accuracies between the six subject groups are obviously different. Interestingly, the accuracies under the KNN classifier and the SVM classifier with polynomial kernel and with the RBF kernel are relatively stable among different groups, while the accuracies for different groups under SVM classifier with Liner kernel and DT classifier fluctuate severely. (2) For all the classification algorithms, the classification accuracy for AD versus NC group is almost the best, reaching the highest accuracy of 98.0909%. (3) By introducing the LI feature into the GM volume, the classification accuracies for most test groups are improved except for EMCI versus LMCI and NC versus EMCI groups. The highest improvement is 5.6364% under the SVM classifier model with RBF kernel. For NC versus EMCI group under DT classifier as well as EMCI versus LMCI group under SVM classifier with RBF kernel and linear kernel, the classification accuracies are decreased slightly, with the maximum decrease of 2.1111%.
[Figure omitted. See PDF.]
Fig 2. For each classification algorithm, the classification accuracies for binary subject groups NC versus EMCI, NC versus LEMC, EMCI versus LEMC, EMCI versus AD, LMCI versus AD, AD versus NC.
https://doi.org/10.1371/journal.pone.0262722.g002
Comparison between different classification algorithms.
For each binary subject groups NC versus EMCI, NC versus LEMC, EMCI versus LMCI, EMCI versus AD, LMCI versus AD, AD versus NC, Fig 3A–3F illustrates the classification accuracies under SVM classifier with RBF kernel, linear kernel and polynomial kernel, RF classifier, DT classifier and KNN classifier, respectively. From these figures, some results can be summarized as follows. (1) For each subject groups, the accuracies under the different classification algorithms are distinctly diverse. Note that among the six binary subject groups, the accuracies for EMCI versus AD and NC versus AD groups are relatively stable and the classification accuracies for the rest subject groups fluctuate obviously, especially when the classification performance in the feature of LI is not included. (2) For all the six subject groups, the classification accuracies under the SVM classifier with polynomial kernel and the KNN classifier are relatively higher, while the accuracies under the DT classifier are mostly lowest. (3) By combining the LI feature with the GM volume feature, the classification accuracies for most test groups are enhanced, with the most enhancement 5.6364% for NC versus LMCI group under the SVM classifier model with RBF kernel. While for NC versus EMCI group under DT classifier as well as EMCI versus LMCI group under SVM classifier with RBF kernel and linear kernel, the classification accuracies are a little degraded, with the most reduction of 2.1111%.
[Figure omitted. See PDF.]
Fig 3. For each subject group, the classification accuracy under SVM classifier with RBF kernel, linear kernel and polynomial kernel, RF classifier, DT classifier and KNN classifier.
https://doi.org/10.1371/journal.pone.0262722.g003
Comparison with other works
In the literature, there have been some results that identifying AD based on structural MRI. By extracting CSF and hippocampus volume on the AAL atlas, Ben et al. classified AD, MCI and NC group and obtained the accuracy of 87% for AD versus NC group [26]. On the basis of the volume of the hippocampus and entorhinal cortex, Fan et al. enhanced the classification accuracy of NC versus AD to 94.3% through a high-dimensional classification model [30]. Based on the T1-weighted MR scan GM segment, Klöppel et al. employed SVM classifier with Linear kernel to classify pathological AD patients and the elderly with normal cognition, where 96% of AD patients were classified correctly [31]. In our work, combing two features including GM volume and LI, the classification accuracy for AD versus NC is 98.2564% under the KNN classifier, which is also higher than the best classification accuracy of 92.4% for AD versus NC only using low-level feature from MRI [21].
Conclusion and discussion
This work focuses on computer-aided diagnosis of AD and its prodromal stage including EMCI and LMCI. Employing the MRI images from ADNI dataset, the GM volume and LI based on AAL atlas are obtained. The classification feature containing GM volume in 64 different brain regions and LI for 15 different brain regions are firstly extracted by ANOVA and the Kruskal-Wallis test. Then, several classifier algorithms including RF, DT, KNN and SVM with RBF kernel, Linear kernel or Polynomial kernel are employed to realize binary classification among NC, EMCI, LMCI and AD groups. How the classification features and experimental algorithms influence the classification performances are explored. Using the 10-fold cross-validation method, the accuracies for binary classification are illustrated in Tables 1–6, Figs 2 and 3, from which some conclusions can be summarized as follows.
1. For different classifier models, the classification accuracy of NC group versus AD group is relatively stable especially in the feature of GM volume and the combined feature of LI and GM volume, which implies that the classification accuracy for AD versus NC is not dependent on these classifier algorithms. Moreover, no matter what classifier model is employed, the accuracy of AD versus NC is almost the highest across these binary classifications, with a best accuracy of 98.2564%. This indicates that the classification accuracies are affected by test groups, and a greater disease difference between groups is helpful to enhance classification accuracies.
2. For different subject groups, the SVM classifier algorithm with Polynomial kernel and the KNN classifier algorithm shows relatively stable and high classification accuracies, while DT classifier algorithm shows relatively unstable and lower classification accuracies. This implies that the classification accuracies are influenced by classifier model.
3. The classification performance in the feature of LI is poor compared with that in the case of GM feature i.e., the classification accuracies in the feature of LI are relatively low and unstable for most classifier models and subject groups. As the lateralization information is introduced into the original feature of GM volume, the classification accuracy for most binary subject groups is significantly improved, with a maximum increase of 5.6364%. However, the classification accuracies for some binary test groups are slightly decreased, including EMCI versus LMCI under DT classifier, SVM classifier with RBF kernel and Linear kernel, as well as NC versus EMCI under DT classifier, with a maximum decrease of 2.1111%. It can be seen that the feature of LI can effectively enhance the classification results for most test groups, which is helpful for early prediction of AD. When the disease difference between test groups is not striking (EMCI versus LMCI, NC versus EMCI), the prediction accuracy is slightly decreased by incorporating the LI. This may be due to the fact that when the boundary of the test groups is ambiguous, there is no distinct difference in brain lateralization, which makes the feature of LI can not play a positive role in classification prediction.
The above results indicate that various factors of subject samples, feature types and experimental algorithms influence classification performance remarkably, especially the newly introduced feature of LI is beneficial to improve the classification performances when the disease difference between test groups is distinct. As known that MCI patients are a high risk population for AD. Early diagnosis and intervention can greatly reduce the risk of AD. This work proposes a technique to classify AD progression based on comprehensive samples including NC, EMCI, LMCI and AD groups. Here MCI is divided into fine stages of early MCI and late MCI, compared to a general stage of MCI in most previous works. In addition, one may wonder how is the classification performance if the four groups of NC, EMCI, LMCI and AD are simultaneously classified. Using the combined feature of GM volume and LI, our further experimental results illustrate that the classification accuracies for the four groups under different classifier of SVM with RBF, Linear, Linear kernel, RF, DT and KNN are 84.8714%, 85.2648%, 88.109%, 84.2595%, 84.2595% and 71.3448%, respectively. The classification accuracies of four group classification are relatively lower than those obtained in binary classification. As known that four group prediction will be much more clinically important and more exciting, meanwhile, two group classification might be easier and more powerful. Thus, these obtained results are believed to shed new perspective on neuroimaging data analysis, which provides diverse theoretical basis for clinical recognition and diagnosis of AD in different situations of two subject groups or four subject groups.
At last, we want to point that besides the evaluation metric of classification accuracy there are some other metrics such as AUC, TPR and TNR to reflect the classification performance. Due to the various considered factors including three kinds of classification features, several kinds of classification algorithms and six groups of binary classification, this work only uses the evaluation metric of classification accuracy to study how classification features and experimental algorithms influence classification performance for simplicity. In addition, the data sample in this work consists of 203 structural MRI scans from 94 subjects. For the same subject, multiple images are collected at different time points. Although they may change over time, it is hard to control the dependence of different samples from the same subjects. Regretfully, multiple scans from the same subjects are treated as independent samples in this work. For the features selection, we extract the classification features using the 203 structural MRI scans by hypothesis testing methods (ANOVA and Kruskal-Wallis test). Then these 203 samples are split into training and validation sets by random divisions in the 10-fold cross-validation. Due to the randomness of sample selection, if one subject has multiple scans, some scans maybe divided into the training data and others maybe divided into the validation data. For the above two occasions there might be some shared information between training and testing dataset, which might lead to the over fitted classification results. These statements might be the limitation of the present results. In the future, we will increase the sample size and try to avoid the issue of overfitting by directly dividing different subjects into training and testing data with a ratio, performing variable selection and classification purely on training samples, and then test the results on the testing samples. On the same time, several evaluation metrics including AUC, TPR and TNR would be employed in combination to reflect the classification performance more thoroughly.
Citation: Zhang Q, Yang X, Sun Z (2022) Classification of Alzheimer’s disease progression based on sMRI using gray matter volume and lateralization index. PLoS ONE 17(3): e0262722. https://doi.org/10.1371/journal.pone.0262722
1. Reitz C, Mayeux R. Alzheimer disease: Epidemiology, Diagnostic Criteria, Risk Factors and Biomarkers. Biochem Pharmacol. 2014; 88: 640–651. pmid:24398425
2. Yang X, Tan MZ, Qiu A. CSF and Brain Structural Imaging Markers of the Alzheimer’s Pathological Cascade. PLoS ONE. 2012; 7: e47406. pmid:23284610
3. Daliri MR. Automated Diagnosis of Alzheimer Disease using the Scale-Invariant Feature Transforms in Magnetic Resonance Images. J Med Syst. 2012; 36: 995–1000. pmid:21584770
4. Nimmy John T, Subha D Puthankattil, Menon R. Analysis of long range dependence inthe EEG signals of Alzheimer patients. Cogn Neurodyn. 2018; 12: 183–199. pmid:29564027
5. Wenlu Y, Fangyu H, Xinyun C, Xudong H. ICA-Based Automatic Classification of PET Images from ADNI Database. International Conference on Neural Information Processing. Springer, Berlin, Heidelberg. 2011; 265–272.
6. Budson AE, Solomon PR. New diagnostic criteria for Alzheimer’s disease and mild cognitive impairment for the practical neurologist. Pract Neurol. 2012; 12: 88–96. pmid:22450454
7. Fayed N, Modrego PJ, Garcia-Marti G, Sanz-Requena R, Marti-Bonmati L. Magnetic resonance spectroscopy and brain volumetry in mild cognitive impairment. A prospective study. Magn Reson Imaging. 2017; 38: 27–32. pmid:27964994
8. Young JM. Probabilistic prediction of Alzheimer’s disease from multimodal image data with Gaussian processes. University of London. 2015.
9. Beheshti I, Maikusa N, Matsuda H, Demirel H, Anbarjafari G.Histogram-Based Feature Extraction from Individual Gray Matter Similarity-Matrix for Alzheimer’s Disease Classification. J Alzheimers Dis. 2017; 55: 1571–1582. pmid:27886012
10. Tang X, Qin Y, Wu J, Zhang M, Zhu W, Miller MI. Shape and diffusion tensor imaging based integrative analysis of the hippocampus and the amygdala in Alzheimer’s disease. Magn Reson Imaging. 2016; 34: 1087–1099. pmid:27211255
11. Tzourio-Mazoyer N, Landeau B, Papathanassiou D, Crivello F, Etard O, Delcroix N, et al. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage. 2002; 15: 273–289. pmid:11771995
12. Chen G, Ward BD, Xie C, Li W, Wu Z, Jones JL, et al. Classification of Alzheimer disease, mild cognitive impairment, and normal cognitive status with large-scale network analysis based on resting-state functional MR imaging. Radiology. 2011; 259: 213–221. pmid:21248238
13. Visser PJ, Scheltens P, Verhey FR, Schmand B, Launer LJ, Jolles J, et al. Medial temporal lobe atrophy and memory dysfunction as predictors for dementia in subjects with mild congnitive impairment. J Neurol. 1999; 246: 477–185. pmid:10431775
14. Chételat G, Desgranges B, De La Sayette V, Viader F, Eustache F, Baron JC. Mapping gray matter loss with voxel-based morphometry in mild cognitive impairment. Neuroreport. 2002; 13: 1939–1943. pmid:12395096
15. Karas GB, Scheltens P, Rombouts SA, Visser PJ, van Schijndel RA, Fox NC, et al. Global and local gray matter loss in mild cognitive impairment and Alzheimer’s disease. Neuroimage. 2004; 23: 708–716. pmid:15488420
16. Frisoni GB, Testa C, Zorzan A, Sabattoli F, Beltramello A, Soininen H, et al. Detection of grey matter loss in mild Alzheimer’s disease with voxel based morphometry. J Neurol Neurosurg Psychiatry. 2002; 73: 657–664. pmid:12438466
17. Smith et al, Chebrolu H, Wekstein DR. Brain structural alterations before mild cognitive impairment. Neurology. 2007; 68: 1268–1273. pmid:17438217
18. Den Heijer T, Geerlings MI, Hoebeek FE, Hofman A, Koudstaal PJ, Breteler MM. Use of hippocampal and amygdalar volumes on magnetic resonance imaging to predict dementia in cognitively intact elderly people. Arch Gen Psychiatry. 2006; 63: 57–62. pmid:16389197
19. Busatto GF, Garrido GE, Almeida OP, Castro CC, Camargo CH, Cid CG, et al. A voxel-based morphometry study of temporal lobe gray matter reductions in Alzheimer’s disease. Neurobiol Aging. 2003; 24: 221–231. pmid:12498956
20. Schmitter D, Roche A, Maréchal B, Ribes D, Abdulkadir A, Bach-Cuadra M, et al, Alzheimer’s Disease Neuroimaging Initiative. An evaluation of volume-base morphometry for prediction of mild cognitive impairment and Alzheimer’s disease. Neuroimage. 2014; Clin 7: 7–17. pmid:25429357
21. Suk HI, Lee SW, Shen D, Alzheimer’s Disease Neuroimaging Initiative. Latent feature representation with stacked auto-encoder for AD/MCI diagnosis. Brain Struct Funct. 2015; 220: 841–859. pmid:24363140
22. Ratnarajah N, Rifkin-Graboi A, Fortier MV, Chong YS, Kwek K, Saw SM, et al. Structural connectivity asymmetry in the neonatal brain. Neuroimage. 2013; 75: 187–194. pmid:23501049
23. Iturria-Medina Y, Pérez Fernández A, Morris DM, Canales-Rodríguez EJ, Haroon HA, García Pentón L, et al. Brain hemispheric structural efficiency and interconnectivity rightward asymmetry in human and nonhuman primates. Cereb Cortex. 2011; 21: 56–67. pmid:20382642
24. Kim JH, Lee JW, Kim GH, Roh JH, Kim MJ, Seo SW, et al. Cortical asymmetries in normal, mild cognitive impairment, and Alzheimer’s disease. Neurobiol Aging. 2012; 33: 1959–1966. pmid:21907459
25. Nasrolahzadeh M, Mohammadpoory Z, Haddadnia J. Higher-order spectral analysis of spontaneous speech signals in Alzheimer’s disease. Cogn Neurodyn. 2018; 12: 583–596. pmid:30483366
26. Ben Ahmed O, Benois-Pineau J, Allard M, Ben Amar C, Catheline G. Classification of Alzheimer’s disease subjects from MRI using hippocampal visual features. Multimedia Tools and Applications. 2014; 74: 1249–1266.
27. Wang J, Peng G, Liu P, Tan X, Luo B. Alzheimer’s Disease Neuroimaging Initiative. Regulating effect of CBF on memory in cognitively normal older adults with different ApoE genotype: the Alzheimer’s Disease Neuroimaging Initiative (ADNI). Cogn Neurodyn. 2019; 13: 513–518. pmid:31741688
28. Gaser C, Dahnke R, Kurth K, Luders E, Alzheimer’s Disease Neuroimaging Initiative. CAT-A Computational Anatomy Toolbox for the Analysis of Structural MRI Data. HBM. 2016; 336–348.
29. Canditiis DD. Statistical Inference Techniques. Encyclopedia of bioinformation and Computational Biology. 2019; 1: 698–705.
30. Fan Y, Batmanghelich N, Clark CM, Davatzikos C, Alzheimer’s Disease Neuroimaging Initiative. Spatial patterns of brain atrophy in MCI patients, identified via high-dimensional pattern classification, predict subsequent cognitive decline. Neuroimage. 2008; 39: 1731–1743. pmid:18053747
31. Klöppel S, Stonnington CM, Chu C, Draganski B, Scahill RI, Rohrer JD, et al. Automatic classification of MR scans in Alzheimer’s disease. Brain. 2008; 131: 681–689. pmid:18202106
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
© 2022 Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Abstract
Note that identifying Mild Cognitive Impairment (MCI) is crucial to early detection and diagnosis of Alzheimer’s disease (AD). This work explores how classification features and experimental algorithms influence classification performances on the ADNI database. Based on structural Magnetic Resonance Images (sMRI), two features including gray matter (GM) volume and lateralization index (LI) are firstly extracted through hypothesis testing. Afterward, several classifier algorithms including Random Forest (RF), Decision Tree (DT), K-Nearest Neighbor(KNN) and Support Vector Machine (SVM) with RBF kernel, Linear kernel or Polynomial kernel are established to realize binary classification among Normal Control (NC), Early Mild Cognitive Impairment (EMCI), Late Mild Cognitive Impairment (LMCI) and AD groups. The main experimental results are as follows. (1) The classification performance in the feature of LI is poor compared with those in the feature of GM volume or the combined feature of LI and GM volume, i.e., the classification accuracies in the feature of LI are relatively low and unstable for most classifier models and subject groups. (2) Comparing with the classification performances in the feature of GM volume and the combined feature of LI and GM volume, the classification accuracy of NC group versus AD group is relatively stable for different classifier models, moreover, the accuracy of AD group versus NC group is almost the highest, with the most classification accuracy of 98.0909%. (3) For different subject groups, the SVM classifier algorithm with Polynomial kernel and the KNN classifier algorithm show relatively stable and high classification accuracy, while DT classifier algorithm shows relatively unstable and lower classification accuracy. (4) Except the groups of EMCI versus LMCI and NC versus EMCI, the classification accuracies are significantly enhanced by emerging the LI into the original feature of GM volume, with the maximum accuracy increase of 5.6364%. These results indicate that various factors of subject data, feature types and experimental algorithms influence classification performances remarkably, especially the newly introduced feature of LI into the feature of GM volume is helpful to improve classification results in some certain extent.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer