Enhancing Early Dementia Detection: A Machine Learning Approach Leveraging Cognitive and Neuroimaging Features for Optimal Predictive Performance

Irfan, Muhammad; Shahrestani, Seyed; Elkhodr, Mahmoud

doi:10.3390/app131810470

Open AccessArticle

Enhancing Early Dementia Detection: A Machine Learning Approach Leveraging Cognitive and Neuroimaging Features for Optimal Predictive Performance

by

Muhammad Irfan

¹,

Seyed Shahrestani

¹ and

Mahmoud Elkhodr

^2,*

¹

School of Computer, Data and Mathematical Sciences, Western Sydney University, Penrith, NSW 2751, Australia

²

School of Engineering and Technology, CQUniversity, Sydney, NSW 2000, Australia

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(18), 10470; https://doi.org/10.3390/app131810470

Submission received: 19 July 2023 / Revised: 12 September 2023 / Accepted: 13 September 2023 / Published: 19 September 2023

(This article belongs to the Section Computing and Artificial Intelligence)

Download

Browse Figures

Versions Notes

Abstract

:

Dementia, including Alzheimer’s Disease (AD), is a complex condition, and early detection remains a formidable challenge due to limited patient records and uncertainty in identifying relevant features. This paper proposes a machine learning approach to address this issue, utilizing cognitive and neuroimaging features for training predictive models. This study highlighted the viability of cognitive test scores in dementia detection—a procedure that offers the advantage of simplicity. The AdaBoost Ensemble model, trained on cognitive features, displayed a robust performance with an accuracy rate of approximately 83%. Notably, this model surpassed benchmark models such as the Artificial Neural Network, Support Vector Machine, and Naïve Bayes. This study underscores the potential of cognitive tests and machine learning for early dementia detection.

Keywords:

alzheimer; dementia; cognitive features; neuroimaging features; neighborhood component analysis (NCA); machine learning

1. Introduction

The aging population in numerous countries poses several challenges to already strained healthcare systems [1]. Globally, the population aged 65 and over is growing faster than any other age group [2]. This demographic shift toward older populations is likely to trigger a surge in age-related conditions such as cognitive impairment and Alzheimer’s Disease (AD) [1]. By 2050, it is estimated that around 152 million people worldwide will suffer from dementia [3]. With a new case of dementia occurring every three seconds globally, the rate is truly alarming [3]. The dramatic surge in the number of dementia patients affects caregivers and their families not just psychologically but also physically, socioeconomically, and economically [4]. As a result, early-stage screening of dementia patients is of utmost importance [4]. Early-stage screening can identify dementia symptoms before the condition fully develops [4], allowing treatment plans to be initiated promptly to control dementia’s progression. Hence, cognitive screening plays a critical role in enhancing the dementia healthcare system [4].

Dementia can be caused by a variety of diseases, including Alzheimer’s Disease (AD), cerebrovascular dementia, hypothyroidism, and benign brain tumors [5]. The most common type of dementia is Alzheimer’s disease [4]. The five major symptoms of dementia include memory loss, issues with visual perception, reduced reasoning and judgment, communication and language issues, and an inability to pay attention [6]. Cognitive tests (CT), which are neuropsychological assessments administered by clinical experts, are commonly used to evaluate memory capacity, general cognition, and language issues in patients [7]. As significant cognitive decline is one of the most critical early-stage dementia symptoms, quantifiable measurements through cognitive tests play a vital role in early detection [8]. Neuroimaging tests such as Magnetic Resonance Imaging (MRI) are also used to examine brain activities and diagnose dementia [8]. However, standard, state-of-the-art diagnostic procedures for dementia, such as cerebrospinal fluid analysis tests and neuroimaging tests, are often expensive, time-consuming, and carry risk factors [9].

Therefore, there is growing interest in using Machine Learning (ML) technologies to predict and detect the early stages of dementia. For example, the work reported in [10] proposed using a Machine Learning algorithm to detect the stages of dementia using screening tests. Kruthika et al. [11] detailed how machine learning techniques, including the SVM (Support Vector Machines) and KNN (K-Nearest Neighbor) algorithms, are used for predicting and classifying dementia. Veeramuthu et al. [12] leveraged machine learning to develop a decision-making CAD (Computer-Aided Design) tool for detecting dementia.

In response to these developments, this paper proposes the use of Machine Learning (ML) for the early-stage detection of Alzheimer’s Disease. The paper is an extended version of the paper “Early Detection of Alzheimer’s Disease: A Novel Cognitive Feature Selection Approach Using Machine Learning” published in the proceedings of the 2021 Conference on Advances in Information, Communication and Cybersecurity [13]. Our research blends applied ML methods with a novel feature selection technique. Key objectives include:

Utilizing all available features (cognitive, neuroimaging, and combined) from the ADNI-1, ADNI-2, and ADNI-3 datasets for Alzheimer’s Disease detection
Implementing robust preprocessing techniques, including handling missing values and data normalization.
Proposing and employing the novel NCA-F feature selection method to pinpoint critical and relevant features.

Conducting comparative analyses using various machine learning methods on the selected features building on these objectives, the model proposed in this article implements the AdaBoost Ensemble (adB), Artificial Neural Network (ANN), Support Vector Machine (SVM), and Naïve Bayes (NB) machine learning algorithms with cognitive and neuroimaging features obtained from a public dataset [14] consisting of 13,916 patients’ records. The remainder of this paper is organized as follows: Section 2 discusses the role of machine learning in predicting dementia; Section 3 outlines the research methodology; and Section 4 analyzes the results of the implemented machine learning models and compares them with existing work. Section 6 details the limitations of this work, while Section 5 offers concluding remarks.

2. Related Works

Machine learning (ML) is a promising and emerging technique used for the early detection of dementia. ML has been extensively used in the literature for the prediction of cognitive diseases [15]. The use of an ensemble classification model to identify patients with high and low dementia risks was proposed in [16]. The claimed classification accuracy was 94.7% when trained using paralinguistic features only. However, an increase of 2.5% (97.2% accuracy for combined features) in the model’s accuracy was reported when both paralinguistic and episodic memory features were used. Grassi et al. [17] developed an ensemble of 13 (i.e., SVM with radial basis function, SVM with linear kernel, SVM with polynomial kernel, L1 regularized logistic regression, L2 regularized logistic regression, multilayer perceptron, decision tree, k-NN, random forest, Naïve Bayes, liner regression) machine learning models to predict the conversion from Mild Cognitive Impairment (MCI) to Alzheimer’s disease. The authors reported that the ensemble was able to achieve an Area Under Curve (AUC) of 0.88, a specificity of 79.9%, and a sensitivity of 77.7%. Zhou et al. [18] proposed a novel approach for dementia diagnosis based on a three-stage deep feature learning and fusion system. Yang et al. [19] proposed a novel feature weighting method based on nearest neighbors called Component Feature Selections (NCFS). This method leverages a feature weighting vector to maximize classification accuracy and was reported to outperform other benchmark techniques.

Recently, a shift in research has been observed, and the use of cognitive features for the prediction of Alzheimer’s disease (AD) has been reported [20,21]. Ford et al. [21] used 18 cognitive features for the prediction of dementia. From the results, the authors reported a 0.74 area under the receiver operating characteristic (ROC) curve. Gill et al. [20] used both cognitive and neuroimaging features to predict AD. The authors used only four cognitive features and claimed 81.8% accuracy and an AUC of 0.79 for cognitive features while, for neuroimaging features, 75.7% accuracy and 0.77 AUC.

ML models use a series of steps for the identification, training, and testing of algorithms to find the feature(s) of interest for a given dataset. The extracted features play an important role in the performance of the prediction model for dementia. In machine learning, the process of selecting the appropriate features from a dataset to train the model is known as “feature selection”, which is very important in dataset cleaning [22]. In datasets that contain many features, it is challenging to select the features that are most relevant to the model. As such, removing irrelevant and redundant features from a given dataset improves the overall performance of machine-learning models [23]. Three feature selection approaches are commonly used in the literature, including the filtering approach, the wrapper, and embedded methods [6]. The features are selected based on the multiple statistical test scores and the derived correlation with the target variable. Test scores are based on a correlation coefficient, which defines the statistical relationship between the variables. Other approaches use the correlation coefficient as a feature selector, such as Pearson’s correlation, Linear Discriminant Analysis (LDA), Analysis of Variance (ANOVA), and the Chi-Square approach [24]. Furthermore, AlShboul et al. utilized machine learning to analyze ADNI data, focusing on classifying dementia stages through cognitive and demographic features [25]. Their work, based on the TADPOLE challenge, underscored effective algorithms and highlighted the potential of cognitive tests for non-invasive diagnoses. Their reliance on comprehensive assessments, such as the CDR, supported its application in clinical decision-making and achieved an accuracy rate of 89%.

In a similar vein, Lin et al. employed machine learning to pinpoint gene biomarkers crucial for the prediction of stable MCI patients, boasting an AUC value of 0.841. This research emphasizes the importance of early diagnosis and the potential of precision medicine [26]. Another significant study revealed cognitive and functional markers associated with AD progression [27]. By comparing various cognitive domains, the authors devised a computational method to monitor AD progression. Evaluations using ADNI data shed light on functional components that are closely tied to the disease’s progression. Nonetheless, the rise of deep learning techniques, particularly transfer learning, has shown promise in enhancing AD detection from neuroimaging scans. However, it is worth noting that these methods often demand resource-intensive computational models [28].

Deep learning is also utilized for the biomarker’s prediction of AD stages and progression from neuroimaging data [29,30]. The problem with these methods is that they utilize high-dimensional 3D neuroimaging data such as PET and MRI scans. In contrast, neuroimaging biomarkers are utilized for the progression of AD disease based on TADPOLE challenge data [14,31,32]. A study compares the top methods from the TADPOLE Challenge for predicting AD evolution [31]. Algorithms forecasted clinical diagnosis, ventricular volume, and cognitive scores. Different algorithms were evaluated, showing significant performance improvements over baselines, and interpretability analysis was conducted using SHAP values. The features CDRSB, AV45-PET, and FDG-PET are identified as the best-performing features [31]. Similarly, ML algorithms forecasted clinical diagnosis, ADAS-Cog13, and ventricular volume. No single algorithm excelled in predicting all three outcomes [32]. While some methods outperformed baselines, performance variation and challenges in addressing missing data were observed. In our proposed method, we have utilized cognitive scores and neuroimaging measurements to detect different stages of AD.

3. A Machine Learning Approach for Predicting Dementia

The research proposes a novel approach referred to as the Neighbourhood Component Analysis and Correlation-Based Filtration (NCA-F) method for the selection of the important primary features for the prediction of dementia. The research highlighted the impact of selected features in the early-stage detection of dementia. The ML-based model relies on a technique that enhances the feature reduction and selection processes of the relevant features. The process of combining cognitive and neuroimaging features resulted in the formulation of dementia biomarkers. Figure 1 provides an overview of the three-stage methodology used in this research.

As this research focuses on predicting dementia, neuroimaging measurements, along with cognitive scores, have been employed. Neuroimaging measurements and cognitive scores are considered sensitive features [14]. For the prediction of dementia, a diverse set of ML algorithms was selected based on their compatibility with our data. Specifically:

SVM: This was chosen for its suitability for high-dimensional data.
ANN: An advanced algorithm optimized and parameterized for high feature numbers.
NB: A probabilistic method ideal for smaller datasets due to its assumption of feature independence.
AdBE: An ensemble method that improves upon decision trees by emphasizing corrections from previous iterations.

These algorithms, AdaBoost Ensemble (AdBE), Artificial Neural Network (ANN), Support Vector Machine (SVM), and Naïve Bayes (NB), were trained for multiple combinations of features.

In addition to neuroimaging measurements, we leveraged cognitive test scores as an integral part of our feature set. While neuroimaging provides detailed structural and functional insights into the brain, cognitive tests offer a more accessible and immediate means of assessing an individual’s cognitive functions. These tests are not only easier to administer but also critical in real-world clinical settings where quick and non-invasive evaluations are often necessary. By combining both of these types of data, we aimed to provide a comprehensive and clinically relevant model for dementia prediction. Furthermore, the inclusion of non-imaging cognitive tests from the ADNI dataset ensures that our research remains pertinent to a broader range of clinical scenarios beyond just those with imaging facilities.

3.1. Data Extraction

The dataset used in this research is provided by The Alzheimer’s Disease Prediction of Longitudinal Evolution [14]. This research has merged data from all phases of the Alzheimer’s Disease Neuroimaging Initiative (ADNI)* database (adni.loni.usc.edu) [33,34,35], hence the name ADNIMERGE dataset. *Data used in the preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in the analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf (accessed on 11 January 2023).

The ADNI was launched in 2003 as a public-private partnership led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial MRI, positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of MCI and early AD. It contains both cognitive test scores and neuroimaging measurement values for 13,892 records of 2132 patients. Many patients have visited multiple times, and each visit is recorded as a new record because a cognitive test score changes on each visit. Cognitive Normal (CN) records in the dataset were 4911, while Alzheimer’s Disease (AD) records were 8981, as shown in Figure 2. ADNIMERGE standard datasets contain some or all of the eight biomarkers, including (i) the main cognitive tests (ADAS, MMSE, and RAVLT), (ii) MRI ROIs (volumes, areas, and thicknesses), (iii) FDG PET ROI averages, (iv) AV45 PET ROI averages, (v) AV1451 PET ROI averages, (vi) DTI ROI measures (cell radial diffusivity, axonal diffusivity), (vii) CSF biomarkers, and (viii) some other features such as APOE status, demographic information, and diagnosis.

3.2. Data Pre-Processing

The pre-processing method is implemented to clean the noisy data and avoid underfitting or overfitting problems. The further steps involved are listed below.

3.2.1. Handling Missing Values

Each record of the ADNIMERGE dataset has 113 features, which contain several missing values. Retaining features with a high percentage of missing values can lead to inaccuracies. We set our removal threshold at 40%, which ensures a more robust dataset while allowing for manageable imputation. The remaining missing values are filled using Iterative Imputer, a method that imputes missing values for each feature from all the remaining features in a round-robin manner [36].

3.2.2. Data Normalization

Data normalization is an important process that impacts the overall performance of the model. Normalizing data ensures stable and efficient optimization and better generalization in ML and DL models. The ADNIMERGE dataset has a diverse range of values in different features. Thus, this dataset was normalized between 0 and 1 using the minimax technique. Equation (1) shows the mathematical expression used for dataset normalization.

z_{i} = \frac{x_{i} - \min (x)}{\max (x) - \min (x)}

(1)

where

x

denotes the input data and

z_{i}

denotes the

i_{t h}

normalized data;

\min (x)

and

\max (x)

are the minimum and maximum values, respectively.

3.3. The Feature Selection Approach

It is well established that the removal of redundant and irrelevant features from a dataset improves the model’s performance. as the use of irrelevant features may result in the model becoming underfitted or overfitted. The Filtering and Embedded feature selection methods were adopted. Nonetheless, this research also explored and investigated the benefits of combining the Filter and Neighborhood component feature selection approaches.

3.3.1. The Filtering Method

By using the Filtering method, a correlation heat map is generated by the correlation coefficient. This is a measure of the linear dependency between two or more variables. The Correlation coefficient matrix is defined as the matrix for each of the pairwise variable combinations [1], as expressed mathematically in Equations (2) and (3).

R = (\begin{matrix} p (A, A) & p (A, B) \\ p (B, A) & p (B, B) \end{matrix})

(2)

p (A, B) = \frac{C o v (A, B)}{σ_{A} σ_{B}}

(3)

In Equation (3),

σ

denotes the mean and standard deviation, while

C o v

denotes the covariance function. The Filtering method approach aims to define a threshold and filter only the highly correlated features of the developed model. Only the features with a correlated threshold having an absolute value greater than 0.9 are filtered in this work.

3.3.2. Wrapper Method

In a wrapper method like Principal Component Analysis (PCA) or Neighborhood Component Analysis (NCA), weights are assigned to features based on the clustering and classification performance of individual and combined features, respectively. In the proposed research, NCA with the Stochastic Gradient Decent (SGD) method as a solver is used for assigning weights to the features. SGD is suitable for handling large datasets, and its stochastic nature makes optimization for NCA more efficient and effective. The least-weighted features are excluded based on performance.

3.3.3. The Proposed Method (NCA-F)

This research proposes a new approach referred to as NCA-F (Neighborhood Component Analysis and correlation-based Filtration method). It uses a combination of filtering, Pearson’s correlation coefficient, and a wrapper method in the feature selection process. Firstly, irrelevant features are excluded based on Pearson’s correlation coefficient having an absolute value greater than 0.9. After the filtration process, the selected features are further processed using the NCA method. NCA aims to assign weights to each feature based on its classification performance. NCA-F has used the nearest neighbor classifier for checking the performance of different combinations of features and then assigning weights to each feature. These weighted features are then sorted in descending order and used in the model for the prediction of dementia. This research has separated neuroimaging measures (neuroimaging features) and cognitive test scores (cognitive features) and combined both of these features (combined features) to analyze the impact of both features on dementia detection together and separately. These features are shown in Table 1 and Table 2 after applying the feature filtration method with their weights. Although in a typical ML model, age is treated as a demographic, in AD prediction, age is an important factor and is treated as a cognitive feature by ADNIMERGE. “Combined features” are the combination of these two tables.

4. Results

4.1. Experimental Setup

Before applying the ADNIMERGE dataset along with all features to machine learning models, this research has analyzed the effects of neuroimaging, cognitive, and combined features using the AdaBoost Ensemble classifier using 5-fold validation, and the best number of features is selected for all three features. Performance measures for the N number of features for all three types of features are shown in Figure 3, Figure 4 and Figure 5. A sample correlation matrix for neuroimaging features is shown in Figure 6, which contains only the top five weighted neuroimaging features. Similarly, for cognitive features, only 18 out of 27 are selected after 5-fold cross-validation to achieve the best accuracy for the AdaBoost Ensemble classifier. For combined features, 19 features are selected out of 35 features, which are shown in Table 3.

After the selection of the best features from all three types, four different machine learning models, mentioned in Section 3, were used to predict dementia, and the results were validated with 5-fold cross-validation. Python 3.0 is used for the implementation of this methodology. The results of the implemented machine learning models for predicting dementia using highly weighted combinations of features are analyzed and discussed in the next sections.

4.2. Training and Testing

For the training setup, an 80:20 ratio between the training and test datasets was used. In this stage, 5 neuroimaging, 18 cognitive, and 19 combined highly weighted features from the proposed NCA-F were used to analyze the effect of these features on dementia prediction. AdB, ANN, SVM, and NB models are trained and tested, and the results are reported in Table 3. AdB has outperformed ANN, SVM, and NB models in most cases. For neuroimaging features, the SVM model outperformed the other three models, i.e., the ANN, AdB, and NB, and achieved ~74% classification accuracy, as shown in Table 4. SVM, ANN, and AdB models have performed almost equally, while NB has achieved the lowest performance measures as compared to other models, as shown in Figure 7a, which depicts the Receiver Operating Characteristic (ROC) curve for all four models on neuroimaging features. The Area Under the Curve (AUC) for SVM, ANN, and AdB models is ~80% whereas the NB AUC is ~74% after 10-fold cross-validation. For cognitive features, the AdB model has outperformed the remaining models mentioned and achieved ~83% classification accuracy. Figure 7b also shows that AdB has the optimum results with ~90% AUC. Similarly, for combined features, AdB has outperformed the other three models with ~83% accuracy and ~90% AUC. The combined features contain 7 neuroimaging and 12 cognitive features.

From Table 4, it is evident that only neuroimaging features have not performed well for dementia prediction. Cognitive features are more effective than neuroimaging features for prediction. Combined features have also performed well for the prediction; however, the only problem with combined features is that neuroimaging features are required. Moreover, the AdB model has good performance results for all three features. We have explicitly checked the performance of all four models and concluded that AdB has the optimum results, as shown in Figure 7. For further investigations, the overall performance of all three features is checked for all four models, and the results of the AdB model are shown in Figure 8. Neuro, Cog, and Com are neuroimaging, cognitive, and combined features, respectively. Figure 9 also depicts that the ROC curve of cognitive and combined features has better results than neuroimaging features.

4.3. Comparative Analysis

This section provides a comparison between the proposed NCA-F and the literature on different benchmark ADNI datasets. This article has explored Gill et al. [20] for comparison purposes because this work has also focused on the early detection of dementia using neuroimaging (MRI) features, cognitive (clinical) features, and a combination of both of these features. Gill et al. used the ADNI1 dataset; however, at that time, the number of records in the dataset was only 600, and ADNI1 used during this research has 5013 records, while ADNIMERGE has a total of 13,892 records. Further comparative analysis between the proposed work and Gill et al. [20] has been discussed below.

4.3.1. Comparison of NCA-F on ADNI1 (Updated) with the Literature

Upon the use of 5 neuroimaging features exclusively with the updated ADNI1 dataset, an accuracy of 79.40% is achieved, while a drastic jump of 8.49% is recorded to report an overall accuracy of 87.89% on 12 selected cognitive features, whereas the combination of both features, with 7 and 9 features of neuroimaging and cognitive metrics used, achieves an accuracy of 87.39%. Pertinent to mention, that the record number remains the same for all experimental analyses at 5013.

4.3.2. Comparison of NCA-F on ADNIMERGE with Literature

Three different categories of data are used to test the NCA-F method: Neuroimaging, Cognitive, and a combination of both. NCA-F fares the best when a combination of both features is used, as in the ADNIMERGE dataset, where the 7 and 12 best features are selected from neuroimaging and cognitive features, respectively. A reported accuracy of 83.42% is achieved, in contrast, which drops to 74.33% if only neuroimaging data are used with four features. Similarly, the best 18 cognitive features correspond to an accuracy of 83.15%, a mere drop of 0.27% in terms of accuracy. The frequency of recording remains static for all the experiments, at 13,892. However, it is the tradeoff required to opt between enhanced performance and higher accuracy.

4.3.3. Comparison between NCA-F on ADNI1 (Updated) and ADNIMERGE

A comparative analysis of the two exhibits some surprising yet convincing results. such as the use of mixed features did not yield the expected rise in accuracy, whereas, as evident from the experimental analysis in this paper vis-à-vis others, the result is skewed by the most weighted metrics, most notably the cognitive features instead of neuroimaging features. The accuracy of combinatory features using ADNI1 (updated) and ADDIMERGE datasets are 87.39 and 83.42 percent, respectively. The drop of approximately 4% happens, which perhaps can be attributed to the latter’s abundance of records resulting in over-fitting and hence erroneous classification. Additionally, the accuracy drop takes place regardless of the three extra features used for learning when the ADNIMERGE dataset is used, which validates our presumption: the most weighted (significant) features skew accuracy the greatest.

4.3.4. Benefits of Cognitive and Neuroimaging Features and Their Relevance to DSM-5 Criteria

Cognitive testing establishes a baseline for an individual’s cognitive abilities in the absence of symptomatic indicators, serving as a foundation for future comparative evaluations if cognitive decline is suspected. Cognitive features are crucial for early detection and accurate staging of Alzheimer’s Disease (AD). These features allow for the identification of nuanced cognitive changes before overt symptoms are present, establish benchmarks for monitoring deviations from healthy cognitive patterns, and provide objective measures conducive to accurate diagnoses.

In relation to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) criteria, cognitive tests such as the Mini-Mental State Examination (MMSE) and the Montreal Cognitive Assessment (MoCA) offer valuable insights into several domains, including memory, executive function, and attention. These domains correspond with DSM-5 criteria for diagnosing neurocognitive disorders, including AD. Specifically, the decline in one or more cognitive domains, evident through cognitive testing, is a key criterion for the diagnosis according to DSM-5.

Neuroimaging, on the other hand, contributes critical baseline data about brain structure and activity. Subsequent scans can be compared to these baselines to identify structural and functional changes that might be indicative of AD. When cognitive and neuroimaging features are combined, they significantly improve the sensitivity and specificity of detecting AD in both its early and late stages, as demonstrated in Table 5.

Our research adds a new dimension to the field by identifying a distinct set of cognitive and neuroimaging features critical for AD diagnosis. Unlike existing studies that have emphasized the importance of features such as CDRSB, AV45-PET, FDG-PET, and ADAS-Cog13 [31,32], our model isolated 4 neuroimaging and 18 cognitive features as being more critical in the early and accurate detection of AD.

4.3.5. Overall Comparison

The overall comparison between the proposed NCA-F and Gill et al. [20] is given in Table 5. This table summarizes the results of the performance comparison in terms of the number of features used, the type of features, the dataset type, the number of records, the accuracy achieved, and the Area under the ROC curve (AUC). From Table 5, it can be observed that the proposed approach has achieved better performance on both datasets (ADNIMERGE and ADNI-1) as compared to the existing approach of Gill et al. [20]. We have also validated the performance of Gill et al.’s methodology on the updated ADNI-1 for a fair comparison. Our proposed methodology has outperformed Gill et al.’s. All the results are cross-validated with 5-fold cross-validation. The accuracy of the proposed method for cognitive features of the ADNIMERGE dataset is not the best of all; however, these features are independent of any Magnetic Resonance Imaging (MRI) or neuroimaging tests, and we are interested in the early detection of dementia based on some cognitive tests. The ADNI-1 dataset contains a limited number of records, which creates overfitting. On the other hand, ADNIMERGE has many records that resolve the overfitting problem. Moreover, researchers have identified different features such as CDRSB, AV45-PET, FDG-PET, and ADAS-Cog13 that are important for the progression of AD [31,32]. In contrast, our proposed methods have individually identified 4 neuroimaging and 18 cognitive features as more important for the detection of AD.

When comparing our approach against existing models, particularly the methodology by Gill et al., it is crucial to underscore the diversified feature set we utilized. While Gill et al. predominantly relied on biomarker-derived features, our model uniquely integrated both cognitive and neuroimaging features, leading to better performance across both the updated ADNI-1 and ADNIMERGE datasets. Moreover, our approach showed greater flexibility; it can adapt to resource-limited settings by using only cognitive features, thereby serving a broader clinical spectrum. Furthermore, the higher performance of our model substantiates the benefit of our proposed Neighborhood Component Analysis and Correlation-Based Filtration (NCA-F) in feature selection, which was pivotal in identifying the 4 neuroimaging and 18 cognitive features as most critical for AD detection.

4.4. Additional Observations on Robustness, Generalizability, and Limitations

4.4.1. Robustness and Reliability of AUC Values

To assess the robustness and reliability of the reported AUC (Area Under the Curve) values, multiple validation techniques were employed. Additional analyses were conducted using bootstrapping and stratified K-Fold cross-validation. In the bootstrapping analysis, the AUC ranged from 0.88 to 0.93 across 1000 iterations, with a mean AUC of 0.91 and a 95% confidence interval of [0.89, 0.92]. Similarly, in the stratified K-Fold cross-validation (K = 10), the AUC ranged from 0.87 to 0.92 with a mean of 0.90, further reinforcing the reliability of this metric. These additional analyses consistently indicated high AUC values, comparable to those reported in the main experiments. Furthermore, the experiment was repeated under identical conditions. A consistent pattern of results was observed across both the initial and repeated experiments. The high AUC values obtained suggest the model’s strong ability to distinguish between the classes, even if other performance metrics may appear modest.

4.4.2. Feature Independence and Clinical Usability

Independence between cognitive features and neuroimaging tests was observed, which offers benefits for resource-limited settings where advanced neuroimaging facilities may be unavailable.

4.4.3. Model Stability across Datasets

Stability in the performance of the AdaBoost model was noted when tested across different sizes and types of datasets. This suggests a lower susceptibility to overfitting.

4.4.4. Limitations Regarding Neuroimaging Features

Neuroimaging features alone were found to be less effective than combined cognitive features in predicting dementia. This limitation is noteworthy for clinicians and healthcare systems.

4.4.5. Cross-Validation Reliability

Consistency across different cross-validation folds was observed, reinforcing the reliability of the methodology used.

4.4.6. Recommendations

Our research suggests that for early detection and ongoing monitoring of AD, cognitive evaluations remain a cornerstone, aligning with typical clinical approaches. These assessments can often be conducted more frequently and are less invasive than neuroimaging studies. Cognitive features offer early warning signs and may provide a foundation for longitudinal tracking of cognitive health. When significant cognitive decline is suspected or observed, neuroimaging studies, including MRI and PET scans, should be considered for a more comprehensive understanding.

Regarding the choice of medical methods for data-driven approaches, our results indicate that in the early stages, cognitive assessments such as the Mini-Mental State Examination (MMSE) or the Montreal Cognitive Assessment (MoCA) align well with common clinical practices. As the disease progresses, more sophisticated and comprehensive neuroimaging tests may become increasingly important for understanding the extent and nature of degenerative changes.

It is essential to note that the observations and insights offered here are based on data analysis and should not be construed as clinical advice. We are not medical professionals, and these findings are intended to contribute to a scientific understanding that could inform but not replace professional medical evaluations and treatment plans.

4.5. Summary of Findings

This research explored the influence of neuroimaging, cognitive, and combined features on predicting dementia through the AdaBoost Ensemble classifier. Based on the analysis of the ADNIMERGE dataset, the best number of features was selected for each type: 5 neuroimaging features, 18 cognitive features, and 19 combined features. The best weighted combined features were identified using Neighborhood Component Analysis and Correlation-Based Filtration (NCA-F), and four different machine learning models (AdaBoost (AdB), Artificial Neural Network (ANN), Support Vector Machine (SVM), and Naive Bayes (NB)) were employed to predict dementia. The performance of these models was cross-validated through a 10-fold process.

In terms of feature types, SVM outperformed the other models on neuroimaging features, achieving approximately 74% classification accuracy. On cognitive features, the AdaBoost model had the highest performance, with approximately 83% accuracy. When it comes to combined features, which contain 7 neuroimaging and 12 cognitive features, AdaBoost again had the best performance with approximately 83% accuracy.

Interestingly, neuroimaging features alone did not yield high performance for dementia prediction. This study found that cognitive features are more effective than neuroimaging features for prediction, and combined features also performed well. However, the challenge with combined features is that neuroimaging features are required. Among the four models, AdaBoost showed good performance results for all three feature types.

Comparative analysis was also performed between the proposed NCA-F method and previous work by Gill et al., which also focused on the early detection of dementia using neuroimaging and cognitive features. The proposed NCA-F method achieved better performance on both the updated ADNI-1 and ADNIMERGE datasets when compared to the existing approach by Gill et al.

To this end, in this study, distinct advantages were observed depending on the types of features used for Alzheimer’s Disease (AD) diagnosis. Models built on biomarkers such as CDRSB, AV45-PET, and FDG-PET offered robustness and were particularly effective at capturing advanced stages of the disease, corroborating previous research [31,32]. However, models relying solely on cognitive features, as assessed through tools such as the Mini-Mental State Examination (MMSE) and Montreal Cognitive Assessment (MoCA), exhibited higher sensitivity in the early detection of AD, a crucial requirement for timely intervention. Interestingly, our combined feature models, incorporating both biomarkers and cognitive features, showed the most balanced performance across the early and late stages. This balance is evidenced by an 83% accuracy rate and consistently high AUC values, suggesting that a multi-dimensional approach is often more comprehensive for diagnosing a complex disorder like AD.

5. Limitations

Despite the promising outcomes of this study, some limitations must be acknowledged. Firstly, the datasets used, ADNIMERGE and ADNI-1, despite being extensive and informative, still represent a specific patient population. Results could potentially vary with other datasets or populations. This study also relies heavily on the accuracy of cognitive tests and the feature selection method. Variations in test administration, subject response, and selection algorithms can introduce errors.

Another limitation is the need for neuroimaging in the combined features. While combined features provided slightly better results, the requirement for neuroimaging makes this less feasible in many real-world clinical settings. Cognitive testing is generally easier to administer, cheaper, and more accessible, especially in low-resource settings. Hence, a focus on further improving performance with cognitive features alone would be beneficial.

Additionally, while the AdaBoost model performed well in this study, it might not be the optimal model for all scenarios. The performance of machine learning models can vary based on the specific characteristics of the dataset. Therefore, exploring other potential models could be valuable. Lastly, the issue of overfitting must be considered, particularly when dealing with a large number of records, as seen with the ADNIMERGE dataset. Techniques to mitigate this problem and ensure the model’s generalizability should be considered in future studies.

6. Conclusions

This research explored the use of machine learning algorithms in the early detection of dementia, with a particular focus on the potential of cognitive features derived from a series of cognitive tests, and contrasted these with neuroimaging features in predictive model training. The unique contribution of this research lies in the implementation of the AdaBoost Ensemble model on cognitive features, yielding an enhanced accuracy rate of approximately 83%. The AdaBoost model demonstrated improved performance compared to other benchmark models, including the Artificial Neural Network, Support Vector Machine, and Naïve Bayes. While the performance metrics improved when we combined cognitive and neuroimaging features, we emphasized cognitive features because of their clinical convenience and ease of execution. Furthermore, our work underscores the significance of cognitive assessments in the early detection of dementia, suggesting that clinicians should prioritize evaluating specific cognitive elements during AD screening. By shedding light on which cognitive areas are pivotal, this research informs clinicians about optimal times for cognitive feature assessment and how it complements the biomarker assessments in the diagnosis trajectory.

Future work should consider refining these machine-learning models, exploring other machine-learning algorithms, and enhancing performance using cognitive features alone. Future work should also consider analyzing different datasets or patient populations to validate the general applicability of the model. Overall, this research paves the way for innovative, machine-learning-assisted strategies for early dementia detection, promoting the use of easily accessible cognitive tests.

Author Contributions

Conceptualization, M.E.; Methodology, S.S. and M.E.; Software (Python 3.0), M.I.; Validation, S.S. and M.E.; Investigation, M.I.; Resources, S.S. and M.E.; Writing—original draft, M.I., S.S. and M.E.; Writing—review & editing, M.I. All authors have read and agreed to the published version of the manuscript.

Funding

We gratefully acknowledge MDPI for generously granting us an APC waiver valued at 2070 CHF, thereby facilitating the publication of this research. This kind gesture significantly supports our efforts to disseminate our findings to the wider scientific community.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No primary data were collected. The data used in this study were retrieved from the Alzheimer’s Disease Prediction of Longitudinal Evolution (TADPOLE) dataset [14]. Access to the ADNI-related datasets is contingent on adherence to the ADNI Data Use Agreement.

Acknowledgments

The authors declare that they have not used Artificial Intelligence (AI) tools in the writing of this article. However, ChatGPT 4 was employed for copyediting and proofreading certain sections, including the abstract, introduction, and conclusion. The authors took precautions to ensure that ChatGPT 4 did not introduce any text that was not authored by the original writers. Throughout the process, the authors reviewed and revised the suggestions provided by ChatGPT 4 to maintain the accuracy and integrity of the final manuscript. Moreover, the authors would like to express their sincere gratitude to the Alzheimer’s Disease Neuroimaging Initiative (ADNI) for providing access to the TADPOLE challenge dataset. ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd. and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research and Development, LLC.; Johnson and Johnson Pharmaceutical Research and Development LLC.; Lumosity; Lundbeck; Merck and Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research are providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org accessed on March 2023). The grantee organization is the Northern California Institute for Research and Education, and this study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuroimaging at the University of Southern California.

Conflicts of Interest

The authors declare no conflict of interest.

References

Tong, T.; Chignell, M.; Lam, P.; Tierney, M.C.; Lee, J. Designing serious games for cognitive assessment of the elderly. Proc. Int. Symp. Hum. Factors Ergon. Health Care 2014, 3, 28–35. [Google Scholar] [CrossRef]
Tang, J.; Alelyani, S.; Liu, H. Feature selection for classification: A review. Data Classification: Algorithms and Applications 2014. Available online: https://www.cse.msu.edu/~tangjili/publication/feature_selection_for_classification.pdf (accessed on 12 September 2023).
Prince, M.J. World Alzheimer Report 2015: The Global Impact of Dementia: An Analysis of Prevalence, Incidence, Cost and Trends; Alzheimer’s Disease International: London, UK, 2015. [Google Scholar]
Australia, D.; Baker, S.; Banerjee, S. Alzheimer’s Disease International: World Alzheimer Report 2019: Attitudes to Dementia; Alzheimer’s Disease International: London, UK, 2019. [Google Scholar]
Bateman, A.; Bennett, H.P. The granulin gene family: From cancer to dementia. Bioassays 2009, 31, 1245–1254. [Google Scholar] [CrossRef] [PubMed]
Cunningham, E.; McGuinness, B.; Herron, B.; Passmore, A. Dementia. Ulster. Med. J. 2015, 84, 79–87. [Google Scholar] [PubMed]
Velayudhan, L.; Ryu, S.H.; Raczek, M.; Philpot, M.; Lindesay, J.; Critchfield, M.; Livingston, G. Review of brief cognitive tests for patients with suspected dementia. Int. Psychogeriatr. 2014, 26, 1247–1262. [Google Scholar] [CrossRef] [PubMed]
Jack, C.R., Jr.; Holtzman, D.M. Biomarker modeling of Alzheimer’s disease. Neuron 2013, 80, 1347–1358. [Google Scholar] [CrossRef] [PubMed]
Guyon, I.; Gunn, S.; Nikravesh, M.; Zadeh, L.A. Feature Extraction: Foundations and Applications; Springer: Berlin/Heidelberg, Germany, 2008; Volume 207. [Google Scholar]
Youn, Y.C.; Choi, S.H.; Shin, H.-W.; Kim, K.W.; Jang, J.-W.; Jung, J.J.; Hsiung, G.-Y.R.; Kim, S.Y. Detection of cognitive impairment using a machine-learning algorithm. Neuropsychiatr. Dis. Treat. 2018, 14, 2939–2945. [Google Scholar] [CrossRef] [PubMed]
Kruthika, K.R.; Maheshappa, H.D. Alzheimer’s Disease Neuroimaging Initiative. Multistage classifier-based approach for Alzheimer’s disease prediction and retrieval. Inform. Med. 2019, 14, 34–42. [Google Scholar]
Veeramuthu, A.; Meenakshi, S.; Manjusha, P.S. A New Approach for Alzheimer Disease Diagnosis by using Association Rule over PET Images. Int. J. Comput. Appl. 2014, 91, 9–14. [Google Scholar] [CrossRef]
Irfan, M.; Shahrestani, S.; Elkhodr, M. Early Detection of the Alzheimer’s Disease: A Novel Cognitive Feature Selection Approach Using Machine Learning. In Advances in Information, Communication and Cybersecurity: Proceedings of ICI2C’21; Springer International Publishing: Cham, Switzerland, 2022; pp. 383–392. [Google Scholar]
Marinescu, R.V.; Oxtoby, N.P.; Young, A.L.; Bron, E.E.; Toga, A.W.; Weiner, M.W.; Barkhof, F.; Fox, N.C.; Golland, P.; Klein, S.; et al. TADPOLE Challenge: Accurate Alzheimer’s disease prediction through crowdsourced forecasting of future data. In International Workshop on Predictive Intelligence in Medicine; Springer: Cham, Switzerland, 2019; pp. 1–10. [Google Scholar]
Bratic’, B.; Kurbalija, V.; Ivanovic, M.; Oder, I.; Bosnic, Z. Machine learning for predicting cognitive diseases: Methods, data sources and risk factors. J. Med. Syst. 2018, 42, 243. [Google Scholar] [CrossRef]
You, Y.; Ahmed, B.; Barr, P.; Ballard, K.; Valenzuela, M. Predicting dementia risk using paralinguistic and memory test features with machine learning models. In Proceedings of the 2019 IEEE Healthcare Innovations and Point of Care Technologies, (HI-POCT), Bethesda, MD, USA, 20–22 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 56–59. [Google Scholar]
Grassi, M.; Rouleaux, N.; Caldirola, D.; Loewenstein, D.; Schruers, K.; Perna, G.; Dumontier, M. A novel ensemble-based machine learning algorithm to predict the conversion from mild cognitive impairment to Alzheimer’s disease using socio-demographic characteristics, clinical information and neuropsychological measures. Front. Neurol. 2019, 10, 756. [Google Scholar] [CrossRef]
Zhou, T.; Thung, K.-H.; Zhu, X.; Shen, D. Effective feature learning and fusion of multimodality data using stage-wise deep neural network for dementia diagnosis. Hum. Brain Mapp. 2019, 40, 1001–1016. [Google Scholar] [CrossRef] [PubMed]
Yang, W.; Wang, K.; Zuo, W. Neighborhood component feature selection for high-dimensional data. JCP 2012, 7, 161–168. [Google Scholar] [CrossRef]
Gill, S.; Mouches, P.; Hu, S.; Rajashekar, D.; MacMaster, F.P.; Smith, E.E.; Forkert, N.D.; Ismail, Z. Alzheimer’s Disease Neuroimaging Initiative. Using machine learning to predict dementia from neuropsychiatric symptom and neuroimaging data. J. Alzheimer’s Dis. 2020, 75, 277–288. [Google Scholar] [CrossRef] [PubMed]
Ford, E.; Rooney, P.; Oliver, S.; Hoile, R.; Hurley, P.; Banerjee, S.; van Marwijk, H.; Cassell, J. Identifying undetected dementia in UK primary care patients: A retrospective case-control study comparing machine learning and standard epidemiological approaches. BMC Med. Inform. Decis. Mak. 2019, 19, 248. [Google Scholar] [CrossRef] [PubMed]
Vespa, J.; Armstrong, D.M.; Medina, L. Demographic Turning Points for the United States: Population Projections for 2020 to 2060; US Department of Commerce, Economics and Statistics Administration, US Census Bureau: Washington, DC, USA, 2018.
Liu, H.; Dougherty, E.R.; Dy, J.G.; Torkkola, K.; Tuv, E.; Peng, H.; Ding, C.; Long, F.; Berens, M.; Parsons, L.; et al. Evolving feature selection. IEEE Intell. Syst. 2005, 20, 64–76. [Google Scholar] [CrossRef]
Fisher, R.A. Statistical methods for research workers. In Breakthroughs in Statistics; Springer: Berlin/Heidelberg, Germany, 1992; pp. 66–70. [Google Scholar]
AlShboul, R.; Thabtah, F.; Walter Scott, A.J.; Wang, Y. The Application of Intelligent Data Models for Dementia Classification. Appl. Sci. 2023, 13, 3612. [Google Scholar] [CrossRef]
Lin, R.H.; Wang, C.C.; Tung, C.W. A machine learning classifier for predicting stable MCI patients using gene biomarkers. Int. J. Environ. Res. Public Health 2022, 19, 4839. [Google Scholar] [CrossRef]
Thabtah, F.; Ong, S.; Peebles, D. Detection of dementia progression from functional activities data using machine learning techniques. Intell. Decis. Technol. 2022, 16, 615–630. [Google Scholar] [CrossRef]
Khan, R.; Akbar, S.; Mehmood, A.; Shahid, F.; Munir, K.; Ilyas, N.; Asif, M.; Zheng, Z. A transfer learning approach for multiclass classification of Alzheimer’s disease using MRI images. Front. Neurosci. 2023, 9, 1050777. [Google Scholar] [CrossRef]
Hao, X.; Bao, Y.; Guo, Y.; Yu, M.; Zhang, D.; Risacher, S.L.; Saykin, A.J.; Yao, X.; Shen, L. Alzheimer’s Disease Neuroimaging Initiative. Multi-modal neuroimaging feature selection with consistent metric constraint for diagnosis of Alzheimer’s disease. Med. Image Anal. 2020, 1, 101625. [Google Scholar] [CrossRef]
Zhang, T.; Shi, M. Multi-modal neuroimaging feature fusion for diagnosis of Alzheimer’s disease. J. Neurosci. Methods 2020, 341, 108795. [Google Scholar] [CrossRef]
Hernandez, M.; Ramon-Julvez, U.; Ferraz, F.; ADNI Consortium. Explainable AI toward understanding the performance of the top three TADPOLE Challenge methods in the forecast of Alzheimer’s disease diagnosis. PLoS ONE 2022, 17, e0264695. [Google Scholar] [CrossRef] [PubMed]
Marinescu, R.V.; Bron, E.E.; Oxtoby, N.P.; Young, A.L.; Toga, A.W.; Weiner, M.W.; Barkhof, F.; Fox, N.C.; Golland, P.; Klein, S.; et al. Predicting Alzheimer’s disease progression: Results from the TADPOLE challenge: Neuroimaging: Neuroimaging predictors of cognitive decline. Alzheimer’s Dement. 2020, 16, e039538. [Google Scholar] [CrossRef]
Weiner, M.W.; Veitch, D.P.; Aisen, P.S.; Beckett, L.A.; Cairns, N.J.; Green, R.C.; Harvey, D.; Jack, C.R., Jr.; Jagust, W.; Morris, J.C.; et al. The Alzheimer’s disease neuroimaging initiative 3: Continued innovation for clinical trial improvement. Alzheimer’s Dement. 2017, 13, 561–571. [Google Scholar] [CrossRef] [PubMed]
Weiner, M.W.; Aisen, P.S.; Jack, C.R., Jr.; Jagust, W.J.; Trojanowski, J.Q.; Shaw, L.; Saykin, A.J.; Morris, J.C.; Cairns, N.; Beckett, L.A.; et al. The Alzheimer’s disease neuroimaging initiative: Progress report and future plans. Alzheimer’s Dement. 2010, 6, 202–211. [Google Scholar] [CrossRef] [PubMed]
Center, Protocol Consultants Coordinating, and Neuropathology Biomarker Core. Alzheimer’s Disease Neuroimaging Initiative 2 (adni2) Protocol (adc-039). Available online: https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=0ca834481ad459fb75546fc04fcb25a57e2b9297 (accessed on 10 February 2023).
Buuren, S.V.; Groothuis-Oudshoorn, K. mice: Multivariate imputation by chained equations in R. J. Stat. Softw. 2010, 45, 1–68. [Google Scholar] [CrossRef]

Figure 1. Overview of the proposed research methodology.

Figure 2. Disease distribution.

Figure 3. Neuroimaging features performance on N number of features.

Figure 4. Cognitive features performance on N number of features.

Figure 5. Combined features performance on N number of features.

Figure 6. A sample correlation matrix for neuroimaging features.

Figure 7. (a–c). ROC curve for all three types of features.

Figure 8. Overall performance for AdB model.

Figure 9. ROC curve of all three features for AdB.

Table 1. The list of filtered neuroimaging features after NCA-F.

S. No.	Feature	Weight
1	ICV	4.981
2	Ventricles	4.1487
3	WholeBrain	3.6149
4	mPACCdigit	2.986
5	Hippocampus	2.9641
6	MidTemp	2.952
7	Fusiform	2.662
8	Entorhinal	1.8558

Table 2. The list of filtered cognitive features after NCA-F.

S. No	Feature	Weight
1	AGE	15.288
2	EcogSPPlan	3.0912
3	PTGENDER	2.9801
4	CDRSB	2.6719
5	EcogSPOrgan	2.345
6	EcogSPLang	2.18
7	ADAS11	2.0056
8	RAVLTimmediate	1.9877
9	ADASQ4	1.7356
10	MMSE	1.565
11	EcogPtOrgan	1.5628
12	EcogSPVisspat	1.5223
13	LDELTOTAL	1.1009
14	EcogPtLang	1.0243
15	FAQ	0.91021
16	MOCA	0.79935
17	EcogPtTotal	0.75844
18	EcogPtDivatt	0.72765
19	EcogSPMem	0.33403
20	EcogPtMem	0.14011
21	TRABSCOR	0.024946
22	EcogSPDivatt	0.0033972
23	EcogPtVisspat	1.07 × 10⁻⁷
24	RAVLTpercforgetting	1.51 × 10⁻¹²
25	EcogPtPlan	7.03 × 10⁻¹⁶
26	RAVLTlearning	8.71 × 10⁻²⁴
27	RAVLTforgetting	9.52 × 10⁻⁴⁷

Table 3. Selected best-weighted combined features after NCA-F and classification.

S. No	Feature	Type	Weight
1	AGE	Cognitive	14.033
2	Ventricles	Neuroimaging	3.3561
3	PTGENDER	Cognitive	2.4853
4	WholeBrain	Neuroimaging	2.3404
5	MidTemp	Neuroimaging	2.207
6	CDRSB	Neuroimaging	2.168
7	ICV	Neuroimaging	1.9832
8	Fusiform	Neuroimaging	1.6853
9	LDELTOTAL	Cognitive	1.4914
10	Hippocampus	Neuroimaging	1.4577
11	EcogPtLang	Cognitive	1.2934
12	MMSE	Cognitive	1.1187
13	Entorhinal	Cognitive	0.90657
14	EcogPtMem	Cognitive	0.20554
15	RAVLTimmediate	Cognitive	0.16261
16	FAQ	Cognitive	0.024644
17	EcogPtPlan	Cognitive	0.01449
18	ADASQ4	Cognitive	0.0027758
19	EcogSPDivatt	Cognitive	0.0018811

Table 4. Features types and ML models comparison for ADNIMERGE dataset.

Features Type	Features Number	Model	Accuracy	Precision	Recall	F1-Score	AUC
Features Type	Features Number	Model	Percent
Neuroimaging	05	AdaBoost	73.22	71.13	67.67	68.48	79.5
		ANN	73.83	71.80	68.44	69.28	80.14
		SVM	74.33	72.54	69.07	69.95	79.06
		NB	63.72	66.94	67.82	63.62	74.72
Cognitive	18	AdaBoost	83.15	82.56	79.76	80.80	90.03
		ANN	82.73	81.90	79.73	80.57	89.15
		SVM	81.49	82.41	76.25	77.91	86.56
		NB	63.58	72.14	70.62	63.49	80.95
Combined	7 Neuro + 12 Cog	AdaBoost	83.42	82.80	80.27	81.23	90.47
		ANN	83.10	82.40	79.90	80.85	89.93
		SVM	81.40	82.07	76.32	77.92	87.32
		NB	66.97	72.38	72.64	66.97	81.42

Table 5. Performance Comparison of the Proposed Approach against the existing approaches.

Features	Method	Dataset	Selected Features	Number of Records	Accuracy znj(%)	AUC
Neuroimaging	Gill et al. [20]	ADNI-1	41	600	75.70	0.77
	Gill et al. [20]	ADNI-1 (Updated)	41	5013	73.28	0.74
	NCA-F	ADNI-1 (Updated)	5	5013	79.4	0.88
	NCA-F	ADNIMERGE	4	13,892	74.33	0.79
Cognitive	Gill et al. [20]	ADNI-1	4	600	81.80	0.79
	Gill et al. [20]	ADNI-1 (Updated)	4	5013	79.42	0.81
	NCA-F	ADNI-1 (Updated)	12	5013	87.89	0.95
	NCA-F	ADNIMERGE	18	13,892	83.15	0.90
Combined (Neuroimaging + Cognitive)	Gill et al. [20]	ADNI-1	41 + 4	600	84.90	0.86
	Gill et al. [20]	ADNI-1 (Updated)	41 + 4	5013	82.17	0.84
	NCA-F	ADNI-1 (Updated)	7 + 9	5013	87.39	0.95
	NCA-F	ADNIMERGE	7 + 12	13,892	83.42	0.90

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Irfan, M.; Shahrestani, S.; Elkhodr, M. Enhancing Early Dementia Detection: A Machine Learning Approach Leveraging Cognitive and Neuroimaging Features for Optimal Predictive Performance. Appl. Sci. 2023, 13, 10470. https://doi.org/10.3390/app131810470

AMA Style

Irfan M, Shahrestani S, Elkhodr M. Enhancing Early Dementia Detection: A Machine Learning Approach Leveraging Cognitive and Neuroimaging Features for Optimal Predictive Performance. Applied Sciences. 2023; 13(18):10470. https://doi.org/10.3390/app131810470

Chicago/Turabian Style

Irfan, Muhammad, Seyed Shahrestani, and Mahmoud Elkhodr. 2023. "Enhancing Early Dementia Detection: A Machine Learning Approach Leveraging Cognitive and Neuroimaging Features for Optimal Predictive Performance" Applied Sciences 13, no. 18: 10470. https://doi.org/10.3390/app131810470

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Enhancing Early Dementia Detection: A Machine Learning Approach Leveraging Cognitive and Neuroimaging Features for Optimal Predictive Performance

Abstract

1. Introduction

2. Related Works

3. A Machine Learning Approach for Predicting Dementia

3.1. Data Extraction

3.2. Data Pre-Processing

3.2.1. Handling Missing Values

3.2.2. Data Normalization

3.3. The Feature Selection Approach

3.3.1. The Filtering Method

3.3.2. Wrapper Method

3.3.3. The Proposed Method (NCA-F)

4. Results

4.1. Experimental Setup

4.2. Training and Testing

4.3. Comparative Analysis

4.3.1. Comparison of NCA-F on ADNI1 (Updated) with the Literature

4.3.2. Comparison of NCA-F on ADNIMERGE with Literature

4.3.3. Comparison between NCA-F on ADNI1 (Updated) and ADNIMERGE

4.3.4. Benefits of Cognitive and Neuroimaging Features and Their Relevance to DSM-5 Criteria

4.3.5. Overall Comparison

4.4. Additional Observations on Robustness, Generalizability, and Limitations

4.4.1. Robustness and Reliability of AUC Values

4.4.2. Feature Independence and Clinical Usability

4.4.3. Model Stability across Datasets

4.4.4. Limitations Regarding Neuroimaging Features

4.4.5. Cross-Validation Reliability

4.4.6. Recommendations

4.5. Summary of Findings

5. Limitations

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI