J Biomed Transl Res.  2022 Mar;23(1):17-28. 10.12729/jbtr.2022.23.1.17.

Machine learning-based risk factor analysis for periodontal disease from a Korean National Survey

Affiliations
  • 1Medical Research Institute, School of Medicine, Chungbuk National University, Cheongju 28644, Korea
  • 2Department of Big Data Cooperative Course, Chungbuk National University, Cheongju 28644, Korea
  • 3Department of Management Information Systems, Chungbuk National University, Cheongju 28644, Korea
  • 4Department of Biomedical Engineering, School of Medicine, Chungbuk National University, Cheongju 28644, Korea
  • 5Institute for Trauma Research, College of Medicine, Korea University, Seoul 02841, Korea

Abstract

Periodontal disease is a chronic but treatable condition which often does not cause pain during the initial stages of the illness. Lack of awareness of symptoms can delay initiation of treatment and worsen health. The aim of this study was to develop and compare different risk prediction models for periodontal disease using machine learning algorithms. We obtained information on risk factors for periodontal disease from the Korea National Health and Nutrition Examination Survey (KNHANES) dataset. Principal component analysis and an auto-encoder were used to extract data on risk factors for periodontal disease. A synthetic minority oversampling technique algorithm was used to solve the problem of data imbalance. We used a combination of logistic regression analysis, support vector machine (SVM) learning, random forest, and AdaBoost to classify and compare risk prediction models for periodontal disease. In cases where we used principal component analysis (PCA) to extract risk factors, the recall was higher than the feature selection method in the logistic regression and support-vector machine learning models. AdaBoost’s recall was 0.98, showing the highest performance of both feature selection and PCA. The F1 score showed relatively high performance in AdaBoost, logistic regression, and SVM learning models. By using the risk factors extracted from the research results and the predictive model based on machine learning, it will be able to help in the prevention and diagnosis of periodontal disease, and it will be used to study the relationship with various diseases related to periodontal disease.

Keyword

periodontal disease; risk factors; feature extraction; machine learning; prediction model
Full Text Links
  • JBTR
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2024 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr