Identification of important features in overweight and obesity among Korean adolescents using machine learning

Serim Lee, Jong Serl Chun

Research output: Contribution to journalArticlepeer-review

Abstract

Overweight and obesity in adolescents have been reported as one of the most serious threats worldwide including South Korea. This study aims to investigate the complex factors contributing to overweight and obesity in Korean adolescents using various machine learning methods. The research includes a dataset of 43,268 records from the 16th Korean Youth Risk Behavior Web-based Survey and explores 71 different factors, such as sociodemographic characteristics, dietary habits, health, behavior problems, family, and peer and school-related factors. Our analysis encompassed an array of algorithms, including Logistic Regression, Ridge, LASSO, Elasticnet, Decision tree, Bagging, Random forest, AdaBoost, and XGBoost. A total of nine machine learning models exhibited accuracy levels within the range of 0.7662 to 0.8403. Based on the domains and sub-domains of factors, it was determined that domains including sociodemographic characteristics, dietary habits, physical health, psychological health, behavioral problems, family factor, and peer and school factors were deemed important. Additionally, it is suggested that attention be given to newly-emerged features indicated by machine learning techniques, including oral health, smartphone addiction, smoking, sexual behavior, school violence, and nationality of parents. The current study's findings emphasize the critical need for collective and customized prevention programs considering multi-facet features to prevent overweight and obesity among Korean adolescents.

Original languageEnglish
Article number107644
JournalChildren and Youth Services Review
Volume161
DOIs
StatePublished - Jun 2024

Bibliographical note

Publisher Copyright:
© 2024 Elsevier Ltd

Keywords

  • Feature importance
  • Korean adolescents
  • Machine learning
  • Obesity
  • Overweight

Fingerprint

Dive into the research topics of 'Identification of important features in overweight and obesity among Korean adolescents using machine learning'. Together they form a unique fingerprint.

Cite this