Abstract
As an alternative to in vivo Draize rabbit eye irritation test, this study aimed to construct an in silico model to predict the complete United Nations (UN) Globally Harmonized System (GHS) for classification and labeling of chemicals for eye irritation category [eye damage (Category 1), irritating to eye (Category 2) and nonirritating (No category)] of liquid chemicals with Integrated approaches to testing and assessment (IATA)-like two-stage random forest approach. Liquid chemicals (n = 219) with 34 physicochemical descriptors and quality in vivo data were collected with no missing values. Seven machine learning algorithms (Naive Bayes, Logistic Regression, First Large Margin, Neural Net, Random Forest (RF), Gradient Boosted Tree, and Support Vector Machine) were examined for the ternary categorization of eye irritation potential at a single run through 10-fold cross-validation. RF, which performed best, was further improved by applying the ‘Bottom-up approach’ concept of IATA, namely, separating No category first, and discriminating Category 1 from 2, thereafter. The best performing training dataset achieved an overall accuracy of 73% and the correct prediction for Category 1, 2, and No category was 80%, 50%, and 77%, respectively for the test dataset. This prediction model was further validated with an external dataset of 28 chemicals, for which an overall accuracy of 71% was achieved.
Original language | English |
---|---|
Pages (from-to) | 960-972 |
Number of pages | 13 |
Journal | Journal of Toxicology and Environmental Health - Part A |
Volume | 84 |
Issue number | 23 |
DOIs | |
State | Published - 2021 |
Bibliographical note
Funding Information:This work was supported by the National Research Foundation (NRF) funded by the Ministry of Science and ICT (MSIT) (2018R1A5A2025286) and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea (Grant No. HP20C0061).
Funding Information:
This work was supported by the Ministry of Science and ICT, South Korea [2018R1A5A2025286] and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea [Grant No. HP20C0061]. This work was supported by the National Research Foundation (NRF) funded by the Ministry of Science and ICT (MSIT) (2018R1A5A2025286) and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea (Grant No. HP20C0061).
Publisher Copyright:
© 2021 Taylor & Francis.
Keywords
- Eye irritation potential
- in silico
- machine-learning
- physicochemical descriptor
- random forest