Psychiatry Investig.  2025 Mar;22(3):267-278. 10.30773/pi.2024.0156.

Comparison of Logistic Regression and Machine Learning Approaches in Predicting Depressive Symptoms: A National-Based Study

Affiliations
  • 1School of Public Health, Suzhou Medical College of Soochow University, Suzhou, China
  • 2Research Center for Psychology and Behavioral Sciences, Soochow University, Suzhou, China
  • 3Graduate School of Interdisciplinary Science and Engineering in Health Systems, Okayama University, Okayama, Japan
  • 4Department of General Medicine, Medical Big Data Center, the Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Nanjing Medical University, Suzhou, China
  • 5School of Public Health, Peking University, Beijing, China
  • 6State Key Laboratory of Radiation Medicine and Protection, School of Radiation Medicine and Protection, Soochow University, Suzhou, China
  • 7Collaborative Innovation Center of Radiological Medicine of Jiangsu Higher Education Institutions, Suzhou, China

Abstract


Objective
Machine learning (ML) has been reported to have better predictive capability than traditional statistical techniques. The aim of this study was to assess the efficacy of ML algorithms and logistic regression (LR) for predicting depressive symptoms during the COVID-19 pandemic.
Methods
Analyses were carried out in a national cross-sectional study involving 21,916 participants. The ML algorithms in this study included random forest (RF), support vector machine (SVM), neural network (NN), and gradient boosting machine (GBM) methods. The performance indices were sensitivity, specificity, accuracy, precision, F1-score, and area under the receiver operating characteristic curve (AUC).
Results
LR and NN had the best performance in terms of AUCs. The risk of overfitting was found to be negligible for most ML models except for RF, and GBM obtained the highest sensitivity, specificity, accuracy, precision, and F1-score. Therefore, LR, NN, and GBM models ranked among the best models.
Conclusion
Compared with ML models, LR model performed comparably to ML models in predicting depressive symptoms and identifying potential risk factors while also exhibiting a lower risk of overfitting.

Keyword

Logistic regression; Machine learning; Depressive symptoms; Risk factors.
Full Text Links
  • PI
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2025 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr