Abstract
We aimed to develop a novel prediction model for early neurological deterioration (END) based on an interpretable machine learning (ML) algorithm for atrial fibrillation (AF)-related stroke and to evaluate the prediction accuracy and feature importance of ML models. Data from multicenter prospective stroke registries in South Korea were collected. After stepwise data preprocessing, we utilized logistic regression, support vector machine, extreme gradient boosting, light gradient boosting machine (LightGBM), and multilayer perceptron models. We used the Shapley additive explanation (SHAP) method to evaluate feature importance. Of the 3,213 stroke patients, the 2,363 who had arrived at the hospital within 24 h of symptom onset and had available information regarding END were included. Of these, 318 (13.5%) had END. The LightGBM model showed the highest area under the receiver operating characteristic curve (0.772; 95% confidence interval, 0.715–0.829). The feature importance analysis revealed that fasting glucose level and the National Institute of Health Stroke Scale score were the most influential factors. Among ML algorithms, the LightGBM model was particularly useful for predicting END, as it revealed new and diverse predictors. Additionally, the effects of the features on the predictive power of the model were individualized using the SHAP method.
Original language | English |
---|---|
Article number | 20610 |
Journal | Scientific Reports |
Volume | 11 |
Issue number | 1 |
DOIs | |
State | Published - Dec 2021 |
Bibliographical note
Funding Information:The authors disclose receipt of the following financial support for the research, authorship, and publication of this article: the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT (NRF-2020R1C1C1009294) and Korea University Grant. The funders had no role in the study design; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Funding Information:
The authors declare the following potential conflicts of interest with respect to the research, authorship, and publication of this article: J-M Jung has received lecture honoraria from Pfizer, Sanofi-Aventis, Ostuka, Dong-A, and Hanmi Pharmaceutical Co., Ltd; consulting fees from Daewoong Pharmaceutical Co., Ltd. WK Seo received honoraria for lectures from Pfizer, Sanofi-Aventis, Otsuka Korea, Dong-A Pharmaceutical Co., Ltd., Beyer, Daewoong Pharmaceutical Co. Ltd., Daiichi Sankyo Korea Co., Ltd., and Boryung Pharmaceutical Co., Ltd.; a study grant from Daiichi Sankyo Korea Co., Ltd.; and consulting fees from OBELAB Inc. All other authors have no coompeting interest.
Publisher Copyright:
© 2021, The Author(s).