Pair your accounts.

Export articles to Mendeley

Get article recommendations from ACS based on references in your Mendeley library.

Pair your accounts.

Export articles to Mendeley

Get article recommendations from ACS based on references in your Mendeley library.

You’ve supercharged your research process with ACS and Mendeley!

Click to create an ACS ID

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

Your Mendeley pairing has expired. Please reconnect
ACS Publications. Most Trusted. Most Cited. Most Read
My Activity

Figure 1Loading Img

Classification of Cytochrome P450 Activities Using Machine Learning Methods

View Author Information
Department of Gastroenterology & Hepatology, University Hospital Basel, University of Basel, Basel, Switzerland, and Freiburg Center for Data Analysis and Modelling, Albert-Ludwigs-University, Freiburg, Germany
* Corresponding author: Prof. Dr. Juergen Drewe, Department of Gastroenterology & Hepatology, University Hospital of Basel, Petersgraben 4, CH-4031 Basel, Switzerland. E-mail: [email protected]. Phone: +41-61-265 3848. Fax: +41-61-265 8581.
†University of Basel.
Cite this: Mol. Pharmaceutics 2009, 6, 6, 1920–1926
Publication Date (Web):October 8, 2009
Copyright © 2009 American Chemical Society

    Article Views





    Other access options
    Supporting Info (1)»


    Abstract Image

    The cytochrome P450 (CYP) system plays an integral part in the metabolism of drugs and other xenobiotics. Knowledge of the structural features required for interaction with any of the different isoforms of the CYP system is therefore immensely valuable in early drug discovery. In this paper, we focus on three major isoforms (CYP 1A2, CYP 2D6, and CYP 3A4) and present a data set of 335 structurally diverse drug compounds classified for their interaction (as substrate, inhibitor, or any interaction) with these isoforms. We also present machine learning models using a variety of commonly used methods (k-nearest neighbors, decision tree induction using the CHAID and CRT algorithms, random forests, artificial neural networks, and support vector machines using the radial basis function (RBF) and homogeneous polynomials as kernel functions). We discuss the physicochemical features relevant for each end point and compare it to similar studies. Many of these models perform exceptionally well, even with 10-fold cross-validation, yielding corrected classification rates of 81.7 to 91.9% for CYP 1A2, 89.2 to 92.9% for CYP 2D6, and 87.4 to 89.9% for CYP3A4. Our models help in understanding the structural requirements for CYP interactions and can serve as sensitive tools in virtual screenings and lead optimization for toxicological profiles in drug discovery.

    Read this article

    To access this article, please review the available access options below.

    Get instant access

    Purchase Access

    Read this article for 48 hours. Check out below using your ACS ID or as a guest.


    Access through Your Institution

    You may have access to this article through your institution.

    Your institution does not have access to this content. You can change your affiliated institution below.

    Supporting Information

    Jump To

    Table of predictions for 353 compounds. This material is available free of charge via the Internet at

    Terms & Conditions

    Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system:

    Cited By

    This article is cited by 31 publications.

    1. Yusra Sajid Kiani, Ishrat Jabeen. Lipophilic Metabolic Efficiency (LipMetE) and Drug Efficiency Indices to Explore the Metabolic Properties of the Substrates of Selected Cytochrome P450 Isoforms. ACS Omega 2020, 5 (1) , 179-188.
    2. Sabina Podlewska, Wojciech M. Czarnecki, Rafał Kafel, and Andrzej J. Bojarski . Creating the New from the Old: Combinatorial Libraries Generation with Machine-Learning-Based Compound Structure Optimization. Journal of Chemical Information and Modeling 2017, 57 (2) , 133-147.
    3. Yufeng J. Tseng Bo-Han Su Ming-Tsung Hsu Olivia A. Lin . Steps Toward a Virtual Rat: Predictive Absorption, Distribution, Metabolism, and Toxicity Models. 2016, 283-329.
    4. Agata Kurczyk, Dawid Warszycki, Robert Musiol, Rafał Kafel, Andrzej J. Bojarski, and Jaroslaw Polanski . Ligand-Based Virtual Screening in a Search for Novel Anti-HIV-1 Chemotypes. Journal of Chemical Information and Modeling 2015, 55 (10) , 2168-2177.
    5. Bo-Han Su, Yi-shu Tu, Chieh Lin, Chi-Yu Shao, Olivia A. Lin, and Yufeng J. Tseng . Rule-Based Prediction Models of Cytochrome P450 Inhibition. Journal of Chemical Information and Modeling 2015, 55 (7) , 1426-1434.
    6. Sabine Schultes, Albert J. Kooistra, Henry F. Vischer, Saskia Nijmeijer, Eric E. J. Haaksma, Rob Leurs, Iwan J. P. de Esch, and Chris de Graaf . Combinatorial Consensus Scoring for Ligand-Based Virtual Fragment Screening: A Comparative Case Study for Serotonin 5-HT3A, Histamine H1, and Histamine H4 Receptors. Journal of Chemical Information and Modeling 2015, 55 (5) , 1030-1044.
    7. Sean Ekins, Joel S. Freundlich, and Robert C. Reynolds . Fusing Dual-Event Data Sets for Mycobacterium tuberculosis Machine Learning Models and Their Evaluation. Journal of Chemical Information and Modeling 2013, 53 (11) , 3054-3063.
    8. Johannes Kirchmair, Mark J. Williamson, Jonathan D. Tyzack, Lu Tan, Peter J. Bond, Andreas Bender, and Robert C. Glen . Computational Prediction of Metabolism: Sites, Products, SAR, P450 Enzyme Dynamics, and Mechanisms. Journal of Chemical Information and Modeling 2012, 52 (3) , 617-648.
    9. Feixiong Cheng, Yue Yu, Jie Shen, Lei Yang, Weihua Li, Guixia Liu, Philip W. Lee, and Yun Tang . Classification of Cytochrome P450 Inhibitors and Noninhibitors Using Combined Classifiers. Journal of Chemical Information and Modeling 2011, 51 (5) , 996-1011.
    10. Claudia Suenderhauf, Felix Hammann, Andreas Maunz, Christoph Helma, and Jörg Huwyler . Combinatorial QSAR Modeling of Human Intestinal Absorption. Molecular Pharmaceutics 2011, 8 (1) , 213-224.
    11. Lisa Michielan and Stefano Moro. Pharmaceutical Perspectives of Nonlinear QSAR Strategies. Journal of Chemical Information and Modeling 2010, 50 (6) , 961-978.
    12. Harutoshi Kato. Computational prediction of cytochrome P450 inhibition and induction. Drug Metabolism and Pharmacokinetics 2020, 35 (1) , 30-44.
    13. Han Shi, Simin Liu, Junqi Chen, Xuan Li, Qin Ma, Bin Yu. Predicting drug-target interactions using Lasso with random forest based on evolutionary information and chemical structure. Genomics 2019, 111 (6) , 1839-1852.
    14. Yusra Sajid Kiani, Ishrat Jabeen. Exploring the Chemical Space of Cytochrome P450 Inhibitors Using Integrated Physicochemical Parameters, Drug Efficiency Metrics and Decision Tree Models. Computation 2019, 7 (2) , 26.
    15. Yanmin Zhang, Yuchen Wang, Weineng Zhou, Yuanrong Fan, Junnan Zhao, Lu Zhu, Shuai Lu, Tao Lu, Yadong Chen, Haichun Liu. A combined drug discovery strategy based on machine learning and molecular docking. Chemical Biology & Drug Design 2019, 93 (5) , 685-699.
    16. Long Yu, Xinyu Shi, Shengwei Tian, Shuangyin Gao, Li Li. Classification of Cytochrome P450 1A2 Inhibitors and Noninhibitors Based on Deep Belief Network. International Journal of Computational Intelligence and Applications 2017, 16 (01) , 1750002.
    17. Thet Su Win, Aijaz Ahmad Malik, Virapong Prachayasittikul, Jarl E S Wikberg, Chanin Nantasenamat, Watshara Shoombuatong. Hemopred: A Web Server for Predicting the Hemolytic Activity of Peptides. Future Medicinal Chemistry 2017, 9 (3) , 275-291.
    18. Reny Pratiwi, Aijaz Ahmad Malik, Nalini Schaduangrat, Virapong Prachayasittikul, Jarl E. S. Wikberg, Chanin Nantasenamat, Watshara Shoombuatong. CryoProtect: A Web Server for Classifying Antifreeze Proteins from Nonantifreeze Proteins. Journal of Chemistry 2017, 2017 , 1-15.
    19. Xi Chen, Lian-sheng Qiao, Yi-lian Cai, Yan-ling Zhang, Gong-yu Li. Combination Computing of Support Vector Machine, Support Vector Regression and Molecular Docking for Potential Cytochrome P450 1A2 Inhibitors. Chinese Journal of Chemical Physics 2016, 29 (5) , 629-634.
    20. Wojciech Marian Czarnecki. Weighted Tanimoto Extreme Learning Machine with Case Study in Drug Discovery. IEEE Computational Intelligence Magazine 2015, 10 (3) , 19-29.
    21. Chi-Yu Shao, Bo-Han Su, Yi-Shu Tu, Chieh Lin, Olivia A. Lin, Yufeng J. Tseng. CypRules: a rule-based P450 inhibition prediction server. Bioinformatics 2015, 31 (11) , 1869-1871.
    22. Xianchao Pan, Li Chao, Sujun Qu, Shuheng Huang, Li Yang, Hu Mei. An improved large-scale prediction model of CYP1A2 inhibitors by using combined fragment descriptors. RSC Advances 2015, 5 (102) , 84232-84237.
    23. Ahmed E Enayetallah, Dinesh Puppala, Daniel Ziemek, James E Fischer, Sheila Kantesaria, Mathew T Pletcher. Assessing the translatability of In vivo cardiotoxicity mechanisms to In vitro models using causal reasoning. BMC Pharmacology and Toxicology 2013, 14 (1)
    24. A. K. Madan, Sanjay Bajaj, Harish Dureja. Classification Models for Safe Drug Molecules. 2013, 99-124.
    25. Jayalakshmi Sridhar, Jiawang Liu, Maryam Foroozesh, Cheryl L. Klein Stevens. Insights on Cytochrome P450 Enzymes and Inhibitors Obtained Through QSAR Studies. Molecules 2012, 17 (8) , 9283-9305.
    26. Chin Yee Liew, Chuen Pan, Andre Tan, Ke Xin Magneline Ang, Chun Wei Yap. QSAR classification of metabolic activation of chemicals into covalently reactive species. Molecular Diversity 2012, 16 (2) , 389-400.
    27. Felix Hammann, Juergen Drewe. Decision tree models for data mining in hit discovery. Expert Opinion on Drug Discovery 2012, 7 (4) , 341-352.
    28. Svava Ósk Jónsdóttir, Tine Ringsted, Nikolai G. Nikolov, Marianne Dybdahl, Eva Bay Wedebye, Jay R. Niemelä. Identification of cytochrome P450 2D6 and 2C9 substrates and inhibitors by QSAR analysis. Bioorganic & Medicinal Chemistry 2012, 20 (6) , 2042-2053.
    29. Liew Chin Yee, Yap Chun Wei. Current Modeling Methods Used in QSAR/QSPR. 2012, 1-31.
    30. Miriam Carbon‐Mangels, Michael C. Hutter. Selecting Relevant Descriptors for Classification by Bayesian Estimates: A Comparison with Decision Trees and Support Vector Machines Approaches for Disparate Data Sets. Molecular Informatics 2011, 30 (10) , 885-895.
    31. Anthony E Klon. Machine learning algorithms for the prediction of hERG and CYP450 binding in drug development. Expert Opinion on Drug Metabolism & Toxicology 2010, 6 (7) , 821-833.