JMIR formative research
Published

Prediction of Clinically Significant Depressive Symptoms at 2-Year Follow-Up in Older Adults: Machine Learning Study Using the English Longitudinal Study of Ageing

Authors

Bahar Khorram, Ramin Nilforooshan, Payam Barnaghi, Samaneh Kouchaki

Abstract

JMIR Form Res. 2026 Jul 2;10:e84744. doi: 10.2196/84744.

BACKGROUND: Depression in older adults is often underdiagnosed due to atypical symptom presentation and generational stigma, leading to delayed intervention. Early identification of individuals at risk of developing elevated depressive symptoms is therefore critical, but traditional approaches show limited predictive accuracy. To date, no study has applied machine learning (ML) models to predict clinically significant depressive symptoms at 2-year follow-up in older adults in the United Kingdom using data from the English Longitudinal Study of Ageing (ELSA). Moreover, the impact of encoding strategies for categorical health care variables has not been examined.

OBJECTIVE: This study aimed to develop and evaluate ML models to predict clinically significant depressive symptoms at 2-year follow-up in older adults using ELSA data. We further compared ordinal and one-hot encoding strategies across different ML architectures and identified key predictors of depressive symptoms at follow-up.
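
As an illustration of the two encoding strategies being compared, the following is a minimal sketch in plain Python. The variable (a self-rated health item) and its levels are hypothetical and are not taken from ELSA or from the authors' pipeline; the helper names `ordinal_encode` and `one_hot_encode` are likewise assumptions made for this example.

```python
# Hypothetical ordered levels for a self-rated health item (illustrative only,
# not an actual ELSA variable).
LEVELS = ["poor", "fair", "good", "very good", "excellent"]


def ordinal_encode(values, levels):
    """Map each category to its rank in `levels`, so the ordering is preserved."""
    rank = {level: i for i, level in enumerate(levels)}
    return [rank[v] for v in values]


def one_hot_encode(values, levels):
    """Map each category to a 0/1 indicator vector, which discards the ordering."""
    return [[1 if v == level else 0 for level in levels] for v in values]


responses = ["fair", "excellent", "poor"]
ordinal = ordinal_encode(responses, LEVELS)   # one integer column
one_hot = one_hot_encode(responses, LEVELS)   # one column per level

print(ordinal)
print(one_hot)
```

Ordinal encoding keeps a single column whose numeric order matches the category order, whereas one-hot encoding spreads the same item across one indicator column per level.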

METHODS: Data were drawn from 4 consecutive waves of ELSA, including participants aged ≥50 years without significant depressive symptoms at the baseline wave (waves 6-9). Clinically significant depressive symptoms were defined as 8-item Center for Epidemiologic Studies Depression Scale (CES-D 8) scores of ≥4 at the subsequent wave (waves 7-10). Over 120 features spanning sociodemographic, psychological, and health-related domains were analyzed. Eight ML models were applied, including tree-based ensembles, deep learning architectures for tabular data, distance-based methods, probabilistic methods, and linear methods. Model performance was assessed using the area under the receiver operating characteristic curve (AUROC) and F1-score. Model interpretability was examined using Shapley additive explanations (SHAP). Sensitivity analyses assessed the robustness of results across alternative CES-D 8 thresholds (≥3, ≥4, and ≥5) and encoding strategies.
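
The outcome definition and the two evaluation metrics described above can be sketched as follows. This is a plain-Python illustration under stated assumptions, not the authors' code: the follow-up scores and predicted risks are made-up toy values, the 0.5 classification cutoff is an assumption, and the function names are hypothetical.

```python
def label_outcome(cesd8_followup, threshold=4):
    """Binary outcome: 1 if the follow-up CES-D 8 score is >= threshold, else 0."""
    return [1 if score >= threshold else 0 for score in cesd8_followup]


def auroc(y_true, y_score):
    """AUROC as the chance a random positive outranks a random negative (ties = 0.5)."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative case")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def f1_score(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN); returns 0.0 when there are no positives at all."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy data (fabricated for illustration; not ELSA values).
followup_scores = [0, 5, 2, 4, 1, 7, 3, 0]            # CES-D 8 at the next wave
predicted_risk = [0.1, 0.8, 0.4, 0.35, 0.2, 0.9, 0.6, 0.05]

labels = label_outcome(followup_scores, threshold=4)   # primary definition
predictions = [1 if r >= 0.5 else 0 for r in predicted_risk]  # assumed cutoff

model_auroc = auroc(labels, predicted_risk)
model_f1 = f1_score(labels, predictions)

# Sensitivity analysis: prevalence of the outcome under alternative thresholds.
cases_by_threshold = {t: sum(label_outcome(followup_scores, t)) for t in (3, 4, 5)}

print(round(model_auroc, 3), round(model_f1, 3), cases_by_threshold)
```

Changing the threshold changes how many participants count as cases, which is why the sensitivity analyses re-evaluate performance at each alternative cutoff.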

RESULTS: Across waves, the best-performing models achieved mean AUROC scores of 0.72-0.73, with a peak of 0.75 in the highest-performing wave. Ordinal encoding consistently outperformed one-hot encoding across all ML models, yielding improvements in AUROCs and F1-scores, with the greatest increase in tree-based methods. SHAP consistently identified loneliness, sleep disturbances, and low social engagement as strong predictors of elevated depressive symptoms at follow-up. Sensitivity analyses across CES-D 8 thresholds demonstrated robust feature importance, with AUROCs ranging from 0.67 to 0.82. Traditional ML models (random forest, extreme gradient boosting, and support vector machines) generally achieved higher performance than the deep learning models for this task.

CONCLUSIONS: Our findings demonstrate the feasibility of predicting clinically significant depressive symptoms at 2-year follow-up in UK older adults, with moderate accuracy. Ordinal encoding demonstrates superior performance for health care datasets with inherently ordered categorical features. The identification of consistent risk factors highlights opportunities for developing targeted clinical screening tools and preventive interventions. This study provides new evidence on depressive symptom prediction in the UK context, leveraging longitudinal data from ELSA, and contributes to advancing digital mental health research for aging populations.

PMID: 42390822 | DOI: 10.2196/84744

UK DRI Authors

Prof Payam Barnaghi

Group Leader

Combining engineering and technological innovation to produce a secure 'Healthy Home' system
