In silico prediction of the full United Nations Globally Harmonized System eye irritation categories of liquid chemicals by IATA-like bottom-up approach of random forest method

Yeonsoo Kang, Boram Jeong, Doo Hyeon Lim, Donghwan Lee, Kyung Min Lim

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

As an alternative to in vivo Draize rabbit eye irritation test, this study aimed to construct an in silico model to predict the complete United Nations (UN) Globally Harmonized System (GHS) for classification and labeling of chemicals for eye irritation category [eye damage (Category 1), irritating to eye (Category 2) and nonirritating (No category)] of liquid chemicals with Integrated approaches to testing and assessment (IATA)-like two-stage random forest approach. Liquid chemicals (n = 219) with 34 physicochemical descriptors and quality in vivo data were collected with no missing values. Seven machine learning algorithms (Naive Bayes, Logistic Regression, First Large Margin, Neural Net, Random Forest (RF), Gradient Boosted Tree, and Support Vector Machine) were examined for the ternary categorization of eye irritation potential at a single run through 10-fold cross-validation. RF, which performed best, was further improved by applying the ‘Bottom-up approach’ concept of IATA, namely, separating No category first, and discriminating Category 1 from 2, thereafter. The best performing training dataset achieved an overall accuracy of 73% and the correct prediction for Category 1, 2, and No category was 80%, 50%, and 77%, respectively for the test dataset. This prediction model was further validated with an external dataset of 28 chemicals, for which an overall accuracy of 71% was achieved.

Original languageEnglish
Pages (from-to)960-972
Number of pages13
JournalJournal of Toxicology and Environmental Health - Part A
Volume84
Issue number23
DOIs
StatePublished - 2021

Bibliographical note

Funding Information:
This work was supported by the National Research Foundation (NRF) funded by the Ministry of Science and ICT (MSIT) (2018R1A5A2025286) and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea (Grant No. HP20C0061).

Funding Information:
This work was supported by the Ministry of Science and ICT, South Korea [2018R1A5A2025286] and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea [Grant No. HP20C0061]. This work was supported by the National Research Foundation (NRF) funded by the Ministry of Science and ICT (MSIT) (2018R1A5A2025286) and the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health and Welfare, Republic of Korea (Grant No. HP20C0061).

Publisher Copyright:
© 2021 Taylor & Francis.

Keywords

  • Eye irritation potential
  • in silico
  • machine-learning
  • physicochemical descriptor
  • random forest

Fingerprint

Dive into the research topics of 'In silico prediction of the full United Nations Globally Harmonized System eye irritation categories of liquid chemicals by IATA-like bottom-up approach of random forest method'. Together they form a unique fingerprint.

Cite this