ACS Publications. Most Trusted. Most Cited. Most Read
My Activity
CONTENT TYPES

Figure 1Loading Img

pKa Prediction of Monoprotic Small Molecules the SMARTS Way

View Author Information
Department of Medicinal Chemistry, College of Pharmacy, University of Michigan, Ann Arbor, Michigan 48109
* Corresponding author e-mail: [email protected]
Cite this: J. Chem. Inf. Model. 2008, 48, 10, 2042–2053
Publication Date (Web):October 1, 2008
https://doi.org/10.1021/ci8001815
Copyright © 2008 American Chemical Society

    Article Views

    1762

    Altmetric

    -

    Citations

    LEARN ABOUT THESE METRICS
    Other access options
    Supporting Info (3)»

    Abstract

    Abstract Image

    Realizing favorable absorption, distribution, metabolism, elimination, and toxicity profiles is a necessity due to the high attrition rate of lead compounds in drug development today. The ability to accurately predict bioavailability can help save time and money during the screening and optimization processes. As several robust programs already exist for predicting logP, we have turned our attention to the fast and robust prediction of pKa for small molecules. Using curated data from the Beilstein Database and Lange’s Handbook of Chemistry, we have created a decision tree based on a novel set of SMARTS strings that can accurately predict the pKa for monoprotic compounds with R2 of 0.94 and root mean squared error of 0.68. Leave-some-out (10%) cross-validation achieved Q2 of 0.91 and root mean squared error of 0.80.

    Read this article

    To access this article, please review the available access options below.

    Get instant access

    Purchase Access

    Read this article for 48 hours. Check out below using your ACS ID or as a guest.

    Recommended

    Access through Your Institution

    You may have access to this article through your institution.

    Your institution does not have access to this content. You can change your affiliated institution below.

    Supporting Information

    ARTICLE SECTIONS
    Jump To

    Decision tree including predictions and ranges for each node and SMILES for all compounds and the predicted pKa values for all test compounds from SMARTS pKa, SPARC, MARVIN, Advanced Chemistry Development (ACD)/Labs Online v. 8.03, and ADME Boxes. This material is available free of charge via the Internet at http://pubs.acs.org.

    Terms & Conditions

    Electronic Supporting Information files are available without a subscription to ACS Web Editions. The American Chemical Society holds a copyright ownership interest in any copyrightable Supporting Information. Files available from the ACS website may be downloaded for personal use only. Users are not otherwise permitted to reproduce, republish, redistribute, or sell any Supporting Information from the ACS website, either in whole or in part, in either machine-readable form or any other form without permission from the American Chemical Society. For permission to reproduce, republish and redistribute this material, requesters must process their own requests via the RightsLink permission system. Information about how to use the RightsLink permission system can be found at http://pubs.acs.org/page/copyright/permissions.html.

    Cited By

    This article is cited by 26 publications.

    1. Yipin Lu, Shankara Anand, William Shirley, Peter Gedeck, Brian P. Kelley, Suzanne Skolnik, Stephane Rodde, Mai Nguyen, Mika Lindvall, Weiping Jia. Prediction of pKa Using Machine Learning Methods with Rooted Topological Torsion Fingerprints: Application to Aliphatic Amines. Journal of Chemical Information and Modeling 2019, 59 (11) , 4706-4719. https://doi.org/10.1021/acs.jcim.9b00498
    2. Haoyu S. Yu, Mark A. Watson, and Art D. Bochevarov . Weighted Averaging Scheme and Local Atomic Descriptor for pKa Prediction Based on Density Functional Theory. Journal of Chemical Information and Modeling 2018, 58 (2) , 271-286. https://doi.org/10.1021/acs.jcim.7b00537
    3. Art D. Bochevarov, Mark A. Watson, and Jeremy R. Greenwood , Dean M. Philipp . Multiconformation, Density Functional Theory-Based pKa Prediction in Application to Large, Flexible Organic Molecules with Diverse Functional Groups. Journal of Chemical Theory and Computation 2016, 12 (12) , 6001-6019. https://doi.org/10.1021/acs.jctc.6b00805
    4. John Manchester, Grant Walkup, Olga Rivin and Zhiping You. Evaluation of pKa Estimation Methods on 211 Druglike Compounds. Journal of Chemical Information and Modeling 2010, 50 (4) , 565-571. https://doi.org/10.1021/ci100019p
    5. Yilin Meng and Adrian E. Roitberg. Constant pH Replica Exchange Molecular Dynamics in Biomolecules Using a Discrete Protonation Model. Journal of Chemical Theory and Computation 2010, 6 (4) , 1401-1412. https://doi.org/10.1021/ct900676b
    6. Xiong-Yi Huang, Hua-Jing Wang and Jing Shi. Theoretical Study on Acidities of (S)-Proline Amide Derivatives in DMSO and Its Implications for Organocatalysis. The Journal of Physical Chemistry A 2010, 114 (2) , 1068-1081. https://doi.org/10.1021/jp909043a
    7. Shuming Zhang, Jon Baker and Peter Pulay . A Reliable and Efficient First Principles-Based Method for Predicting pKa Values. 2. Organic Acids. The Journal of Physical Chemistry A 2010, 114 (1) , 432-442. https://doi.org/10.1021/jp9067087
    8. Adam C. Lee and Gordon M. Crippen. Predicting pKa. Journal of Chemical Information and Modeling 2009, 49 (9) , 2013-2033. https://doi.org/10.1021/ci900209w
    9. Jialu Wu, Yu Kang, Peichen Pan, Tingjun Hou. Machine learning methods for pKa prediction of small molecules: Advances and challenges. Drug Discovery Today 2022, 27 (12) , 103372. https://doi.org/10.1016/j.drudis.2022.103372
    10. Patrick J. Ropp, Jesse C. Kaminsky, Sara Yablonski, Jacob D. Durrant. Dimorphite-DL: an open-source program for enumerating the ionization states of drug-like small molecules. Journal of Cheminformatics 2019, 11 (1) https://doi.org/10.1186/s13321-019-0336-9
    11. Tadi Venkata Sivakumar, Anirban Bhaduri, Rajasekhara Reddy Duvvuru Muni, Jin Hwan Park, Tae Yong Kim. SimCAL: a flexible tool to compute biochemical reaction similarity. BMC Bioinformatics 2018, 19 (1) https://doi.org/10.1186/s12859-018-2248-5
    12. Nripendra Madhab Biswas, Amit Shard, Sagarkumar Patel, Pinaki Sengupta. Drug development and bioanalytical method validation for a novel anticancer molecule, 4‐(dimethylamino)‐2‐(p‐tolylamino) thiazole‐5‐carbonitrile. Drug Development Research 2018, 79 (8) , 391-399. https://doi.org/10.1002/ddr.21462
    13. Kavya Sri Nemani, Amit Shard, Pinaki Sengupta. Establishment of a quantitative bioanalytical method for an acetylcholinesterase inhibitor Ethyl 3-(2-(4-fluorophenyl) amino)-4-phenylthiazol-5-yl)-3-oxopropanoate including its physicochemical characterization and in vitro metabolite profiling using Liquid Chromatography-Mass Spectrometry. Journal of Chromatography B 2018, 1096 , 214-222. https://doi.org/10.1016/j.jchromb.2018.08.019
    14. K.R. Przybylak, J.C. Madden, E. Covey-Crump, L. Gibson, C. Barber, M. Patel, M.T.D. Cronin. Characterisation of data resources for in silico modelling: benchmark datasets for ADME properties. Expert Opinion on Drug Metabolism & Toxicology 2018, 14 (2) , 169-181. https://doi.org/10.1080/17425255.2017.1316449
    15. Frank C. Pickard, Gerhard König, Florentina Tofoleanu, Juyong Lee, Andrew C. Simmonett, Yihan Shao, Jay W. Ponder, Bernard R. Brooks. Blind prediction of distribution in the SAMPL5 challenge with QM based protomer and pK a corrections. Journal of Computer-Aided Molecular Design 2016, 30 (11) , 1087-1100. https://doi.org/10.1007/s10822-016-9955-7
    16. Harun M. Patel, Malleshappa N. Noolvi, Poonam Sharma, Varun Jaiswal, Sumit Bansal, Sandeep Lohan, Suthar Sharad Kumar, Vikrant Abbot, Saurabh Dhiman, Varun Bhardwaj. Quantitative structure–activity relationship (QSAR) studies as strategic approach in drug discovery. Medicinal Chemistry Research 2014, 23 (12) , 4991-5007. https://doi.org/10.1007/s00044-014-1072-3
    17. . Nitrogen Acids. 2013, 77-92. https://doi.org/10.1201/b16128-12
    18. R. Fraczkiewicz. In Silico Prediction of Ionization. 2013https://doi.org/10.1016/B978-0-12-409547-2.02610-X
    19. Shuming Zhang. A reliable and efficient first principles‐based method for predicting p K a values. 4. organic bases. Journal of Computational Chemistry 2012, 33 (31) , 2469-2482. https://doi.org/10.1002/jcc.23068
    20. Nina Jeliazkova. Web tools for predictive toxicology model building. Expert Opinion on Drug Metabolism & Toxicology 2012, 8 (7) , 791-801. https://doi.org/10.1517/17425255.2012.685158
    21. John C. Dearden. Prediction of Physicochemical Properties. 2012, 93-138. https://doi.org/10.1007/978-1-62703-050-2_6
    22. Paul G. Seybold. Quantum Chemical‐QSPR Estimation of the Acidities and Basicities of Organic Compounds. 2012, 83-104. https://doi.org/10.1016/B978-0-12-396498-4.00015-6
    23. Nina Jeliazkova, Vedrin Jeliazkov. AMBIT RESTful web services: an implementation of the OpenTox application programming interface. Journal of Cheminformatics 2011, 3 (1) https://doi.org/10.1186/1758-2946-3-18
    24. Nina Jeliazkova, Nikolay Kochev. AMBIT‐SMARTS: Efficient Searching of Chemical Structures and Fragments. Molecular Informatics 2011, 30 (8) , 707-720. https://doi.org/10.1002/minf.201100028
    25. David Lagorce, Christelle Reynes, Anne‐Claude Camproux, Maria A. Miteva, Olivier Sperandio, Bruno O. Villoutreix. In Silico ADME/Tox Predictions. 2011, 29-124. https://doi.org/10.1002/9780470915110.ch2
    26. A. P. Harding, P. L. A. Popelier. pKa Prediction from an ab initio bond length: part 2—phenols. Physical Chemistry Chemical Physics 2011, 13 (23) , 11264. https://doi.org/10.1039/c1cp20379g

    Pair your accounts.

    Export articles to Mendeley

    Get article recommendations from ACS based on references in your Mendeley library.

    Pair your accounts.

    Export articles to Mendeley

    Get article recommendations from ACS based on references in your Mendeley library.

    You’ve supercharged your research process with ACS and Mendeley!

    STEP 1:
    Click to create an ACS ID

    Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

    Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

    Please note: If you switch to a different device, you may be asked to login again with only your ACS ID.

    MENDELEY PAIRING EXPIRED
    Your Mendeley pairing has expired. Please reconnect