TY - JOUR
T1 - Forecasting of non-accidental, cardiovascular, and respiratory mortality with environmental exposures adopting machine learning approaches
AU - Lee, Woojoo
AU - Lim, Youn Hee
AU - Ha, Eunhee
AU - Kim, Yoenjin
AU - Lee, Won Kyung
N1 - Funding Information:
The study was supported by the Bio & Medical Technology Development Program of the National Research Foundation funded by the Korean government (MSIT) (2019M3E5D1A0206962012). The funders had no role in study design, data collection, analysis, decision to publish, or manuscript preparation.
Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2022/12
Y1 - 2022/12
N2 - Environmental exposure constantly changes with time and various interactions that can affect health outcomes. Machine learning (ML) or deep learning (DL) algorithms have been used to solve complex problems, such as multiple exposures and their interactions. This study developed predictive models for cause-specific mortality using ML and DL algorithms with the daily or hourly measured meteorological and air pollution data. The ML algorithm improved the performance compared to the conventional methods, even though the optimal algorithm depended on the adverse health outcomes. The best algorithms were extreme gradient boosting, ridge, and elastic net, respectively, for non-accidental, cardiovascular, and respiratory mortality with daily measurement; they were superior to the generalized additive model reducing a mean absolute error by 4.7%, 4.9%, and 16.8%, respectively. With hourly measurements, the ML model tended to outperform the conventional models, even though hourly data, instead of daily data, did not enhance the performance in some models. The proposed model allows a better understanding and development of robust predictive models for health outcomes using multiple environmental exposures.
AB - Environmental exposure constantly changes with time and various interactions that can affect health outcomes. Machine learning (ML) or deep learning (DL) algorithms have been used to solve complex problems, such as multiple exposures and their interactions. This study developed predictive models for cause-specific mortality using ML and DL algorithms with the daily or hourly measured meteorological and air pollution data. The ML algorithm improved the performance compared to the conventional methods, even though the optimal algorithm depended on the adverse health outcomes. The best algorithms were extreme gradient boosting, ridge, and elastic net, respectively, for non-accidental, cardiovascular, and respiratory mortality with daily measurement; they were superior to the generalized additive model reducing a mean absolute error by 4.7%, 4.9%, and 16.8%, respectively. With hourly measurements, the ML model tended to outperform the conventional models, even though hourly data, instead of daily data, did not enhance the performance in some models. The proposed model allows a better understanding and development of robust predictive models for health outcomes using multiple environmental exposures.
KW - Cardiovascular diseases
KW - Deep learning
KW - Environmental exposures
KW - Machine learning
KW - Respiratory tract diseases
UR - http://www.scopus.com/inward/record.url?scp=85134354603&partnerID=8YFLogxK
U2 - 10.1007/s11356-022-21768-9
DO - 10.1007/s11356-022-21768-9
M3 - Article
C2 - 35834079
AN - SCOPUS:85134354603
SN - 0944-1344
VL - 29
SP - 88318
EP - 88329
JO - Environmental Science and Pollution Research
JF - Environmental Science and Pollution Research
IS - 58
ER -