Machine learning strategies for predicting Alzheimer’s disease progression
Adhinrag Kalarikkal Induchudan , Kevin Curran
Design+ ›› 2025, Vol. 2 ›› Issue (3) : 025270031
Alzheimer’s disease (AD) represents a significant global health challenge, affecting millions of individuals worldwide through progressive cognitive decline and behavioral changes. The burden extends beyond patients to caregivers and healthcare systems. While traditional diagnostic methods pose financial obstacles, emerging non-imaging techniques show promise. Machine learning has emerged as a transformative approach for enhancing both diagnosis and management. This study aims to develop a robust multi-class classification model using random forest (RF) and extreme gradient boosting algorithms on non-imaging data from the Australian AD Neuroimaging Initiative, with emphasis on the Australian Imaging, Biomarkers, and Lifestyle Study of Aging. Extensive data analysis was conducted, including feature importance and selection, to improve interpretability and classification accuracy. Synthetic oversampling was applied to address class imbalance. The findings indicate the superiority of the tuned RF model, achieving 90% in accuracy, precision, recall, and F1 scores. In addition, cost-effective diagnostic variables were explored, with neuropsychology assessment variables demonstrating exceptional accuracy (90%). This research contributes to early AD detection, personalized treatment, and optimized resource allocation.
Alzheimer’s disease / Machine learning / Python classification model / Non-imaging data / Random Forest / Extreme gradient boosting / Australian imaging biomarkers and lifestyle study of aging / Diagnosis
| [1] |
|
| [2] |
|
| [3] |
|
| [4] |
AIBL Study ADNI Non-imaging Data. aibl.csiro.au. Available from: https://aibl.csiro.au/adni/nonimaging.php [Last accessed on 2024 Apr 30]. |
| [5] |
ADNI. About.Available from: https://adni.loni.usc.edu/about [Last accessed on 2024 Apr 30]. |
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
Scikit-Learn. Scikit-learn: Machine Learning in Python. Scikit-Learn; 2019. Available from: https://scikit-learn.org/stable [Last accessed on 2024 Apr 30]. |
| [10] |
|
| [11] |
|
| [12] |
[ 2024 Apr 30] |
| [13] |
|
| [14] |
Pandas.factorize -- Pandas 1.5.3 Documentation. Available from: https://pandas.pydata.org/docs/reference/api/pandas. factorize.html [Last accessed on 2024 Apr 30]. |
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
Scikit-Learn. Sklearn.Model_Selection. RandomizedSearchCV - Scikit-Learn 0.21.3 Documentation. Scikit-Learn; 2019. Available from: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection. RandomizedSearchCV.html [Last accessed on 2024 Apr 30]. |
| [25] |
Scikit. 3.3. Metrics and Scoring: Quantifying the Quality of Predictions - Scikit-Learn 0.23.2 Documentation. Scikit- Learn. Available from: https://scikit-learn.org/stable/modules/model_evaluation.html#classification-report [Last accessed on 2024 Apr 30]. |
| [26] |
|
| [27] |
|
| [28] |
|
| [29] |
|
| [30] |
|
/
| 〈 |
|
〉 |