Computational Prediction and Validation of an Expert’s Evaluation of Chemical ProbesClick to copy article linkArticle link copied!
Abstract
In a decade with over half a billion dollars of investment, more than 300 chemical probes have been identified to have biological activity through NIH funded screening efforts. We have collected the evaluations of an experienced medicinal chemist on the likely chemistry quality of these probes based on a number of criteria including literature related to the probe and potential chemical reactivity. Over 20% of these probes were found to be undesirable. Analysis of the molecular properties of these compounds scored as desirable suggested higher pKa, molecular weight, heavy atom count, and rotatable bond number. We were particularly interested whether the human evaluation aspect of medicinal chemistry due diligence could be computationally predicted. We used a process of sequential Bayesian model building and iterative testing as we included additional probes. Following external validation of these methods and comparing different machine learning methods, we identified Bayesian models with accuracy comparable to other measures of drug-likeness and filtering rules created to date.
Cited By
This article is cited by 21 publications.
- Vadim Korolev, Artem Mitrofanov, Alexandru Korotcov, Valery Tkachenko. Graph Convolutional Neural Networks as “General-Purpose” Property Predictors: The Universality and Limits of Applicability. Journal of Chemical Information and Modeling 2020, 60
(1)
, 22-28. https://doi.org/10.1021/acs.jcim.9b00587
- Alexandru Korotcov, Valery Tkachenko, Daniel P. Russo, and Sean Ekins . Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets. Molecular Pharmaceutics 2017, 14
(12)
, 4462-4475. https://doi.org/10.1021/acs.molpharmaceut.7b00578
- Alex M. Clark, Krishna Dole, and Sean Ekins . Open Source Bayesian Models. 3. Composite Models for Prediction of Binned Responses. Journal of Chemical Information and Modeling 2016, 56
(2)
, 275-285. https://doi.org/10.1021/acs.jcim.5b00555
- Markus Boehm Liying Zhang Nicole Bodycombe Mateusz Maciejewski Anne Mai Wassermann . The Many Facets of Screening Library Design. 2016, 345-364. https://doi.org/10.1021/bk-2016-1222.ch016
- Alex M. Clark, Krishna Dole, Anna Coulon-Spektor, Andrew McNutt, George Grass, Joel S. Freundlich, Robert C. Reynolds, and Sean Ekins . Open Source Bayesian Models. 1. Application to ADME/Tox and Drug Discovery Datasets. Journal of Chemical Information and Modeling 2015, 55
(6)
, 1231-1245. https://doi.org/10.1021/acs.jcim.5b00143
- Alex M. Clark and Sean Ekins . Open Source Bayesian Models. 2. Mining a “Big Dataset” To Create and Validate Models with ChEMBL. Journal of Chemical Information and Modeling 2015, 55
(6)
, 1246-1260. https://doi.org/10.1021/acs.jcim.5b00144
- Christopher A. Lipinski, Nadia K. Litterman, Christopher Southan, Antony J. Williams, Alex M. Clark, and Sean Ekins . Parallel Worlds of Public and Commercial Bioactive Chemistry Data. Journal of Medicinal Chemistry 2015, 58
(5)
, 2068-2076. https://doi.org/10.1021/jm5011308
- Ashima Ahuja, Sonia Singh, Yogesh Murti. Chemical Probes Review: Choosing the Right Path Towards Pharmacological Targets in Drug Discovery, Challenges and Future Perspectives. Combinatorial Chemistry & High Throughput Screening 2024, 27
(17)
, 2544-2564. https://doi.org/10.2174/0113862073283304231118155730
- Ctibor Škuta, Christopher Southan, Petr Bartůněk. Will the chemical probes please stand up?. RSC Medicinal Chemistry 2021, 12
(8)
, 1428-1441. https://doi.org/10.1039/D1MD00138H
- Conor R. Caffrey, Dietmar Steverding, Rafaela S. Ferreira, Renata B. de Oliveira, Anthony J. O'Donoghue, Ludovica Monti, Carlo Ballatore, Kelly A. Bachovchin, Lori Ferrins, Michael P. Pollastri, Kimberley M. Zorn, Daniel H. Foil, Alex M. Clark, Melina Mottin, Carolina H. Andrade, Jair L. de Siqueira‐Neto, Sean Ekins. Drug Discovery and Development for Kinetoplastid Diseases. 2021, 1-79. https://doi.org/10.1002/0471266949.bmc235.pub2
- Cristina D. Cruz, Pauli Wrigstedt, Karina Moslova, Vladimir Iashin, Heidi Mäkkylä, Léo Ghemtio, Sami Heikkinen, Päivi Tammela, Jesus E. Perea-Buceta. Installation of an aryl boronic acid function into the external section of -aryl-oxazolidinones: Synthesis and antimicrobial evaluation. European Journal of Medicinal Chemistry 2021, 211 , 113002. https://doi.org/10.1016/j.ejmech.2020.113002
- Holger Stark. The chemical probe – scopes, limitations and challenges. Expert Opinion on Drug Discovery 2020, 15
(12)
, 1365-1367. https://doi.org/10.1080/17460441.2020.1781086
- Danish Shahzad, Aamer Saeed, Fayaz Ali Larik, Pervaiz Ali Channar, Qamar Abbas, Mohamed F. Alajmi, M. Ifzan Arshad, Mauricio F. Erben, Mubashir Hassan, Hussain Raza, Sung-Yum Seo, Hesham R. El-Seedi. Novel C-2 Symmetric Molecules as α-Glucosidase and α-Amylase Inhibitors: Design, Synthesis, Kinetic Evaluation, Molecular Docking and Pharmacokinetics. Molecules 2019, 24
(8)
, 1511. https://doi.org/10.3390/molecules24081511
- Alex M. Clark, Kimberley M. Zorn, Mary A. Lingerfelt, Sean Ekins. Developing Next Generation Tools for Computational Toxicology. 2018, 363-387. https://doi.org/10.1002/9781119282594.ch14
- Sean Ekins, Alex M. Clark, Krishna Dole, Kellan Gregory, Andrew M. Mcnutt, Anna Coulon Spektor, Charlie Weatherall, Nadia K. Litterman, Barry A. Bunin. Data Mining and Computational Modeling of High-Throughput Screening Datasets. 2018, 197-221. https://doi.org/10.1007/978-1-4939-7724-6_14
- Akshata Gad, Andrew Titus Manuel, Jinuraj K. R., Lijo John, Sajeev R., Shanmuga Priya V. G., Abdul Jaleel U.C.. Virtual screening and repositioning of inconclusive molecules of beta-lactamase Bioassays—A data mining approach. Computational Biology and Chemistry 2017, 70 , 65-88. https://doi.org/10.1016/j.compbiolchem.2017.07.005
- Sean Ekins, Anna Coulon Spektor, Alex M. Clark, Krishna Dole, Barry A. Bunin. Collaborative drug discovery for More Medicines for Tuberculosis (MM4TB). Drug Discovery Today 2017, 22
(3)
, 555-565. https://doi.org/10.1016/j.drudis.2016.10.009
- Alexander L. Perryman, Thomas P. Stratton, Sean Ekins, Joel S. Freundlich. Predicting Mouse Liver Microsomal Stability with “Pruned” Machine Learning Models and Public Data. Pharmaceutical Research 2016, 33
(2)
, 433-449. https://doi.org/10.1007/s11095-015-1800-5
- Isao Nakanishi, Katsumi Murata, Naoya Nagata, Masakuni Kurono, Takayoshi Kinoshita, Misato Yasue, Takako Miyazaki, Yoshinori Takei, Shinya Nakamura, Atsushi Sakurai, Nobuko Iwamoto, Keiji Nishiwaki, Tetsuko Nakaniwa, Yusuke Sekiguchi, Akira Hirasawa, Gozoh Tsujimoto, Kazuo Kitaura. Identification of protein kinase CK2 inhibitors using solvent dipole ordering virtual screening. European Journal of Medicinal Chemistry 2015, 96 , 396-404. https://doi.org/10.1016/j.ejmech.2015.04.032
- Nadia Litterman, Christopher Lipinski, Sean Ekins. Small molecules with antiviral activity against the Ebola virus. F1000Research 2015, 4 , 38. https://doi.org/10.12688/f1000research.6120.1
- Sean Ekins, Nadia K. Litterman, Renée J.G. Arnold, Robert W. Burgess, Joel S. Freundlich, Steven J. Gray, Joseph J. Higgins, Brett Langley, Dianna E. Willis, Lucia Notterpek, David Pleasure, Michael W. Sereda, Allison Moore. A brief review of recent Charcot-Marie-Tooth research and priorities. F1000Research 2015, 4 , 53. https://doi.org/10.12688/f1000research.6160.1
Article Views are the COUNTER-compliant sum of full text article downloads since November 2008 (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.
Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information about Crossref citation counts.
The Altmetric Attention Score is a quantitative measure of the attention that a research article has received online. Clicking on the donut icon will load a page at altmetric.com with additional details about the score and the social media presence for the given article. Find more information on the Altmetric Attention Score and how the score is calculated.