In silico prediction of multiple-category classification model for cytochrome P450 inhibitors and non-inhibitors using machine-learning method$

J. H. Lee, S. Basith, M. Cui, B. Kim, S. Choi

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

The cytochrome P450 (CYP) enzyme superfamily is involved in phase I metabolism which chemically modifies a variety of substrates via oxidative reactions to make them more water-soluble and easier to eliminate. Inhibition of these enzymes leads to undesirable effects, including toxic drug accumulations and adverse drug–drug interactions. Hence, it is necessary to develop in silico models that can predict the inhibition potential of compounds for different CYP isoforms. This study focused on five major CYP isoforms, including CYP1A2, 2C9, 2C19, 2D6 and 3A4, that are responsible for more than 90% of the metabolism of clinical drugs. The main aim of this study is to develop a multiple-category classification model (MCM) for the major CYP isoforms using a Laplacian-modified naïve Bayesian method. The dataset composed of more than 4500 compounds was collected from the PubChem Bioassay database. VolSurf+ descriptors and FCFP_8 fingerprint were used as input features to build classification models. The results demonstrated that the developed MCM using Laplacian-modified naïve Bayesian method was successful in classifying inhibitors and non-inhibitors for each CYP isoform. Moreover, the accuracy, sensitivity and specificity values for both training and test sets were above 80% and also yielded satisfactory area under the receiver operating characteristic curve and Matthews correlation coefficient values.

Original languageEnglish
Pages (from-to)863-874
Number of pages12
JournalSAR and QSAR in Environmental Research
Volume28
Issue number10
DOIs
StatePublished - 3 Oct 2017

Keywords

  • classification
  • Cytochrome P450
  • descriptors
  • inhibitors
  • machine-learning
  • multiple-category modelling
  • non-inhibitors

Fingerprint

Dive into the research topics of 'In silico prediction of multiple-category classification model for cytochrome P450 inhibitors and non-inhibitors using machine-learning method$'. Together they form a unique fingerprint.

Cite this