TY - JOUR
T1 - Development of Machine‐Learning Model to Predict COVID‐19 Mortality
T2 - Application of Ensemble Model and Regarding Feature Impacts
AU - Baik, Seung Min
AU - Lee, Miae
AU - Hong, Kyung Sook
AU - Park, Dong Jin
N1 - Publisher Copyright:
© 2022 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2022/6
Y1 - 2022/6
N2 - This study was designed to develop machine‐learning models to predict COVID‐19 mortality and identify its key features based on clinical characteristics and laboratory tests. For this, deep‐learning (DL) and machine‐learning (ML) models were developed using receiver operating characteristic (ROC) area under the curve (AUC) and F1 score optimization of 87 parameters. Of the two, the DL model exhibited better performance (AUC 0.8721, accuracy 0.84, and F1 score 0.76). However, we also blended DL with ML, and the ensemble model performed the best (AUC 0.8811, accuracy 0.85, and F1 score 0.77). The DL model is generally unable to extract feature importance; however, we succeeded by using the Shapley Additive exPlanations method for each model. This study demonstrated both the applicability of DL and ML models for classifying COVID‐19 mortality using hospital‐structured data and that the ensemble model had the best predictive ability.
AB - This study was designed to develop machine‐learning models to predict COVID‐19 mortality and identify its key features based on clinical characteristics and laboratory tests. For this, deep‐learning (DL) and machine‐learning (ML) models were developed using receiver operating characteristic (ROC) area under the curve (AUC) and F1 score optimization of 87 parameters. Of the two, the DL model exhibited better performance (AUC 0.8721, accuracy 0.84, and F1 score 0.76). However, we also blended DL with ML, and the ensemble model performed the best (AUC 0.8811, accuracy 0.85, and F1 score 0.77). The DL model is generally unable to extract feature importance; however, we succeeded by using the Shapley Additive exPlanations method for each model. This study demonstrated both the applicability of DL and ML models for classifying COVID‐19 mortality using hospital‐structured data and that the ensemble model had the best predictive ability.
KW - COVID‐19
KW - artificial intelligence
KW - ensemble model
KW - mortality
UR - http://www.scopus.com/inward/record.url?scp=85132564518&partnerID=8YFLogxK
U2 - 10.3390/diagnostics12061464
DO - 10.3390/diagnostics12061464
M3 - Article
AN - SCOPUS:85132564518
SN - 2075-4418
VL - 12
JO - Diagnostics
JF - Diagnostics
IS - 6
M1 - 1464
ER -