A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening
- Jack ScantleburyJack ScantleburyDepartment of Statistics, University of Oxford, Oxford OX1 2JD, United KingdomMore by Jack Scantlebury
- ,
- Lucy VostLucy VostDepartment of Statistics, University of Oxford, Oxford OX1 2JD, United KingdomMore by Lucy Vost
- ,
- Anna CarberyAnna CarberyDepartment of Statistics, University of Oxford, Oxford OX1 2JD, United KingdomDiamond Light Source Ltd., Harwell Science and Innovation Campus, Didcot OX11 0DE, United KingdomMore by Anna Carbery
- ,
- Thomas E. HadfieldThomas E. HadfieldDepartment of Statistics, University of Oxford, Oxford OX1 2JD, United KingdomMore by Thomas E. Hadfield
- ,
- Oliver M. TurnbullOliver M. TurnbullDepartment of Statistics, University of Oxford, Oxford OX1 2JD, United KingdomMore by Oliver M. Turnbull
- ,
- Nathan Brown
- ,
- Vijil ChenthamarakshanVijil ChenthamarakshanIBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598, United StatesMore by Vijil Chenthamarakshan
- ,
- Payel DasPayel DasIBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598, United StatesMore by Payel Das
- ,
- Harold GrosjeanHarold GrosjeanStructural Genomics Consortium, University of Oxford, Oxford OX3 7DQ, United KingdomMore by Harold Grosjean
- ,
- Frank von DelftFrank von DelftDiamond Light Source Ltd., Harwell Science and Innovation Campus, Didcot OX11 0DE, United KingdomCentre for Medicines Discovery, University of Oxford, Oxford OX3 7DQ, United KingdomDepartment of Biochemistry, University of Johannesburg, Johannesburg 2006, South AfricaResearch Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, United KingdomMore by Frank von Delft
- , and
- Charlotte M. Deane*Charlotte M. Deane*E-mail: [email protected]Department of Statistics, University of Oxford, Oxford OX1 2JD, United KingdomMore by Charlotte M. Deane
Abstract

Over the past few years, many machine learning-based scoring functions for predicting the binding of small molecules to proteins have been developed. Their objective is to approximate the distribution which takes two molecules as input and outputs the energy of their interaction. Only a scoring function that accounts for the interatomic interactions involved in binding can accurately predict binding affinity on unseen molecules. However, many scoring functions make predictions based on data set biases rather than an understanding of the physics of binding. These scoring functions perform well when tested on similar targets to those in the training set but fail to generalize to dissimilar targets. To test what a machine learning-based scoring function has learned, input attribution, a technique for learning which features are important to a model when making a prediction on a particular data point, can be applied. If a model successfully learns something beyond data set biases, attribution should give insight into the important binding interactions that are taking place. We built a machine learning-based scoring function that aimed to avoid the influence of bias via thorough train and test data set filtering and show that it achieves comparable performance on the Comparative Assessment of Scoring Functions, 2016 (CASF-2016) benchmark to other leading methods. We then use the CASF-2016 test set to perform attribution and find that the bonds identified as important by PointVS, unlike those extracted from other scoring functions, have a high correlation with those found by a distance-based interaction profiler. We then show that attribution can be used to extract important binding pharmacophores from a given protein target when supplied with a number of bound structures. We use this information to perform fragment elaboration and see improvements in docking scores compared to using structural information from a traditional, data-based approach. This not only provides definitive proof that the scoring function has learned to identify some important binding interactions but also constitutes the first deep learning-based method for extracting structural information from a target for molecule design.
This publication is licensed under
License Summary*
You are free to share (copy and redistribute) this article in any medium or format and to adapt (remix, transform, and build upon) the material for any purpose, even commercially within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share (copy and redistribute) this article in any medium or format and to adapt (remix, transform, and build upon) the material for any purpose, even commercially within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
License Summary*
You are free to share (copy and redistribute) this article in any medium or format and to adapt (remix, transform, and build upon) the material for any purpose, even commercially within the parameters below:
Creative Commons (CC): This is a Creative Commons license.
Attribution (BY): Credit must be given to the creator.
*Disclaimer
This summary highlights only some of the key features and terms of the actual license. It is not a license and has no legal value. Carefully review the actual license before using these materials.
Introduction
Methods
Figure 1

Figure 1. An overall schema of the methods used to debias and test PointVS. We first thoroughly filter the testing and training sets (a) before benchmarking the performance of PointVS on the docking power and scoring power tests (b). We then use attribution to gain insights into important binding regions in the protein pocket (c), which we use for fragment elaboration (d).
Building and Testing PointVS
Model
Figure 2

Figure 2. Architecture and input format of PointVS. Each of the n atoms in the input to the model (left) is given a one-hot encoded feature vector with a single bit added to indicate whether the atom is from the ligand or the receptor, as well as a position . There are m edges, defined by the edge indices ei ∈ {0, ···, n – 1}2×m, which are the indices of connected atoms, and the corresponding edge attributes ea ∈ {0, 1}3×m. These are one-hot encodings representing ligand–ligand, ligand–protein, and protein–protein edges. There are skip connections between each of the EGNN layers, and the linear and global average final pooling (GAP) layers only act upon the node features f.
Data Sets
data set | size | unique PDB IDs | task/split |
---|---|---|---|
Redocked | 780,572 | 19,595 | C/Train |
Redocked\gninaSetPose80 | 633,047 | 17,528 | C/Train |
Redocked\Core80 | 629,963 | 16,900 | C/Train |
General | 19,157 | 19,157 | R/Train |
General\Core80 | 14,555 | 14,555 | R/Train |
gninaSetPose | 20,411 | 411 | C/Test |
Core | 285 | 285 | R/Test |
Redocked includes no structures with the same PDB code as any structures in the gninaSetPose or Core sets, and the General set shares no PDB codes with the Core set. In the task/split column, “C” refers to pose classification and “R” refers to affinity regression.
1. | Tanimoto similarity of the 2048-bit Morgan fingerprint between the ligand and any of the 285 test set ligands greater than 0.8. | ||||
2. | Sequence identity between the protein and any of the 285 test set proteins greater than 0.8. |
Performance Metrics
Attribution
Atom Masking
Bond Masking
Edge Attention
gnina
InteractionGraphNet
Fragment Elaboration
Results
Training and Testing PointVS
Bias in CASF-16
Docking and Scoring Power Tests
Figure 3

Figure 3. Top-N vs N pose ranking performance on the gninaSetPose set for Autodock Vina, PointVS, and gnina. Terms in brackets in the legend refer to the training sets; filtering the training set by protein and ligand similarity results in slightly degraded performance. The Top-1 values for PointVS trained on Redocked\gninaSetPose80 and gnina trained on Redocked are the same (68%), with PointVS trained on Redocked achieving 70%.
Top-1 | ||
---|---|---|
model | crystal pose included | crystal pose not included |
PointVS (Redocked\CoreR) | 90.5 | 84.9 |
PointVS (Redocked\Core80) | 91.2 | 84.2 |
PointVS (Redocked) | 91.3 | 85.3 |
gnina (Redocked) | 91.2 | 83.2 |
Autodock Vina | 90.2 | 84.6 |
The nonmachine learning scoring function is shown in italic. The performance is shown for both the case where the crystal structure of the ligand is included in the set of poses being ranked and the case where it is not included, in which case the native pose is defined as any pose less than 2 Å RMSD away from the crystal pose. The highest Top-1 for each case is shown in bold. Not included in the table are ΔVinaXGB and ΔVinaRF, for which only the performances on the crystal pose included test are provided by the authors as 92 and 90, respectively.
scoring function | PCC | |
---|---|---|
Biased | gnina (Redocked) | 0.753 ± 0.008 |
gnina (General) | 0.816 ± 0.008 | |
PointVS (General) | 0.805 ± 0.010 | |
PointVS (General\CoreR) | 0.803 ± 0.012 | |
ΔVinaXGB | 0.796 | |
ΔVinaRF | 0.732 | |
Debiased | PointVS (General\Core80) | 0.754 ± 0.015 |
X-Score | 0.631 | |
Autodock Vina | 0.601 |
Scoring functions are split into biased and debiased methods according to the overlap between their training sets, which are shown in brackets where applicable, and the Core set. Nonmachine learning scoring functions are shown in italic. The highest PCCs obtained with both biased and debiased methods are shown in bold.
Attribution: Identifying Important Binding Sites
Human Tankyrase-2 Inhibitors
Figure 4

Figure 4. Three Tankyrase-2 inhibitors: 5C5P (a, d, g, j), 4J21 (b, e, h, k), and 4J22 (c, f, i, l). The top row shows the Protein–Ligand Interaction Profiler (PLIP) analysis of each structure, where dark blue lines are hydrogen bonds, dotted gray are hydrophobic interactions, dotted green are π–π interactions, solid green are halogen bonds, and lilac are water bridges. The second row shows the results of performing atom masking with gnina, with green representing a positive attribution score (identified as making a positive contribution to binding) and red, a negative score (making a negative contribution). The third row shows the results of bond masking with IGN, with lines between atoms showing the top five highest-scoring edges for each structure, with darker red representing higher scores. The bottom row shows the results of edge-attention attribution performed with PointVS, with the lines also showing the top five highest-scoring edges, and darker pink representing higher scores.
Large Scale Attribution Tests
ρ5 | ρ10 | |
---|---|---|
PointVS | 0.640 | 0.788 |
gnina | 0.234 | 0.093 |
IGN | 0.209 | –0.071 |
The means were calculated over a subset of 20 randomly selected bound structures from the PDBBind Core set (see SI for PDB IDs).
Hotspot Identification Using PointVS
Figure 5

Figure 5. Donor and acceptor hotspot maps in the binding pocket of Mpro, colored purple and orange, respectively, and numbered according to their rank. The hotspots on the left (a) were obtained with the Hotspots API, the hotspots in the center (b) were obtained with PointVS by performing attribution on 152 structures of bound fragments, and the hotspots on the right (c) were obtained by extracting the most common protein atoms identified by PLIP as involved in hydrogen bonds with ligand atoms across the 152 bound structures (in this case identified over five times).
Impact of Fragment Screen Similarity
Dependency on Fragment Screen Size
Figure 6

Figure 6. Top five scoring hotspot maps in the binding pocket of Mpro found by performing attribution on 40 sets of randomly sampled sets of 10 (a), 30 (b), and 80 (c) bound structures. Both donor and acceptor hotspots are shown in red. The spheres representing the hotspots are transparent and overlaid, so the more opaque a hotspot appears, the more times it was ranked in the top five.
Fragment Elaboration
PointVS | Hotspots API | |||
---|---|---|---|---|
rank | fragments successfully elaborated | elaborations per fragment | fragments successfully elaborated | elaborations per fragment |
1 | 57 | 215 | 26 | 189 |
2 | 82 | 182 | 57 | 183 |
3 | 51 | 202 | 59 | 207 |
4 | 14 | 199 | 9 | 158 |
5 | 21 | 223 | 28 | 179 |
6 | 60 | 183 | 23 | 162 |
7 | 46 | 228 | 3 | 220 |
8 | 78 | 216 | 54 | 197 |
9 | 24 | 215 | 0 | 0 |
10 | 47 | 218 | 28 | 179 |
STRIFE was provided with 109 fragments to elaborate on for each hotspot and asked to generate 250 elaborations for each one.
method | PointVS | API |
---|---|---|
Hotspots 1–5: ΔSLE20 | 0.304 | 0.213 |
Hotspots 6–10: ΔSLE20 | 0.146 | 0.269 |
The highest ΔSLE20 for both ranges of hotspots is shown in bold.
Mpro: 152 | Mac1: 58 | NSP14: 19 | ||||
---|---|---|---|---|---|---|
ΔSLE20 | PointVS | API | PointVS | API | PointVS | API |
Hotspots 1–5 | 0.341 | 0.213 | 1.749 | 1.581 | 1.708 | 1.240 |
Hotspots 6–10 | 0.189 | 0.269 | 1.727 | 1.748 | 1.143 | 1.193 |
The numbers after the target names refer to the number of bound fragment structures available for each target. The list of fragalysis codes corresponding to the structures used is given in SI Table 1. The highest ΔSLE20 for both ranges of hotspots is shown in bold.
Conclusion
Data and Software Availability
Supporting Information
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.3c00322.
A brief description of equivariance and invariance, some more in-depth information about the architecture of PointVS and the data sets used to train and test it, a comparison of the different attribution methods used, a description of the process by which the API hotspots were obtained, an additional assessment of the reliance of PointVS hotspots on fragment screen size, and a case study showing how PointVS could have aided in a real-world fragment-to-lead campaign; also all of the IDs of the structures used, from both fragalysis and the PDB (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.
Acknowledgments
J.S. was supported by funding from the Biotechnology and Biosciences Research Council (BB/S507611/1) and BenevolentAI, and L.V. was supported by funding from the Engineering and Physical Sciences Research Council (EP/W522211/1) and IBM Research. This work was additionally supported by the Rosetrees Trust (ref M940).
References
This article references 56 other publications.
- 1Barnett, E.; Onete, D.; Salekin, A.; Faraone, S. V. Genomic Machine Learning Meta-regression: Insights on Associations of Study Features with Reported Model Performance. medRxiv 2022, DOI: 10.1101/2022.01.10.22268751
- 2Shim, M.; Lee, S.-H.; Hwang, H.-J. Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selection. Sci. Rep. 2021, 11, 7980, DOI: 10.1038/s41598-021-87157-3[Crossref], [PubMed], [CAS], Google Scholar2https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXptFOlsLg%253D&md5=a43ca92128b177b8a67e2437fbe5c556Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selectionShim, Miseon; Lee, Seung-Hwan; Hwang, Han-JeongScientific Reports (2021), 11 (1), 7980CODEN: SRCEC3; ISSN:2045-2322. (Nature Research)In recent years, machine learning techniques have been frequently applied to uncovering neuropsychiatric biomarkers with the aim of accurately diagnosing neuropsychiatric diseases and predicting treatment prognosis. However, many studies did not perform cross validation (CV) when using machine learning techniques, or others performed CV in an incorrect manner, leading to significantly biased results due to overfitting problem. The aim of this study is to investigate the impact of CV on the prediction performance of neuropsychiatric biomarkers, in particular, for feature selection performed with high-dimensional features. To this end, we evaluated prediction performances using both simulation data and actual electroencephalog. (EEG) data. The overall prediction accuracies of the feature selection method performed outside of CV were considerably higher than those of the feature selection method performed within CV for both the simulation and actual EEG data. The differences between the prediction accuracies of the two feature selection approaches can be thought of as the amt. of overfitting due to selection bias. Our results indicate the importance of correctly using CV to avoid biased results of prediction performance of neuropsychiatric biomarkers.
- 3Roberts, M. Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans. Nat. Mach. Intell. 2021, 3, 199– 217, DOI: 10.1038/s42256-021-00307-0
- 4Tu, F.; Zhu, J.; Zheng, Q.; Zhou, M. Be Careful of When: An Empirical Study on Time-Related Misuse of Issue Tracking Data. In Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering , New York, NY, USA, 2018; p 307– 318.
- 5Sieg, J.; Flachsenberg, F.; Rarey, M. In Need of Bias Control: Evaluating Chemical Data for Machine Learning in Structure-Based Virtual Screening. J. Chem. Inf. Model. 2019, 59, 947– 961, DOI: 10.1021/acs.jcim.8b00712[ACS Full Text
], [CAS], Google Scholar
5https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXktVOgsrY%253D&md5=6cb48f2acd541c10c8565f202a6eb9e8In Need of Bias Control: Evaluating Chemical Data for Machine Learning in Structure-Based Virtual ScreeningSieg, Jochen; Flachsenberg, Florian; Rarey, MatthiasJournal of Chemical Information and Modeling (2019), 59 (3), 947-961CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)A review. Reports of successful applications of machine learning (ML) methods in structure-based virtual screening (SBVS) are increasing. ML methods such as convolutional neural networks show promising results and often outperform traditional methods such as empirical scoring functions in retrospective validation. However, trained ML models are often treated as black boxes and are not straightforwardly interpretable. In most cases, it is unknown which features in the data are decisive and whether a model's predictions are right for the right reason. Hence, the authors reevaluated three widely used benchmark data sets in the context of ML methods and came to the conclusion that not every benchmark data set is suitable. Moreover, the authors demonstrate on two examples from current literature that bias is learned implicitly and unnoticed from std. benchmarks. On the basis of these results, the authors conclude that there is a need for eligible validation expts. and benchmark data sets suited to ML for more bias-controlled validation in ML-based SBVS. Therefore, the authors provide guidelines for setting up validation expts. and give a perspective on how new data sets could be generated. - 6Chen, L.; Cruz, A.; Ramsey, S.; Dickson, C. J.; Duca, J. S.; Hornak, V.; Koes, D. R.; Kurtzman, T. Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screening. PLoS One 2019, 14, e0220113 DOI: 10.1371/journal.pone.0220113[Crossref], [PubMed], [CAS], Google Scholar6https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhvFeks7rL&md5=637472aa275cbacf3e926983f3795726Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screeningChen, Lieyang; Cruz, Anthony; Ramsey, Steven; Dickson, Callum J.; Duca, Jose S.; Hornak, Viktor; Koes, David R.; Kurtzman, TomPLoS One (2019), 14 (8), e0220113CODEN: POLNCL; ISSN:1932-6203. (Public Library of Science)Recently much effort has been invested in using convolutional neural network (CNN) models trained on 3D structural images of protein-ligand complexes to distinguish binding from non-binding ligands for virtual screening. However, the dearth of reliable protein-ligand x-ray structures and binding affinity data has required the use of constructed datasets for the training and evaluation of CNN mol. recognition models. Here, we outline various sources of bias in one such widely-used dataset, the Directory of Useful Decoys: Enhanced (DUD-E). We have constructed and performed tests to investigate whether CNN models developed using DUD-E are properly learning the underlying physics of mol. recognition, as intended, or are instead learning biases inherent in the dataset itself. We find that superior enrichment efficiency in CNN models can be attributed to the analog and decoy bias hidden in the DUD-E dataset rather than successful generalization of the pattern of protein-ligand interactions. Comparing addnl. deep learning models trained on PDBbind datasets, we found that their enrichment performances using DUD-E are not superior to the performance of the docking program AutoDock Vina. Together, these results suggest that biases that could be present in constructed datasets should be thoroughly evaluated before applying them to machine learning based methodol. development.
- 7Wallach, I.; Heifets, A. Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization. J. Chem. Inf. Model. 2018, 58, 916– 932, DOI: 10.1021/acs.jcim.7b00403[ACS Full Text
], [CAS], Google Scholar
7https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXotlCgs7s%253D&md5=a59fb9ba04b1e6b37cb83594625081ffMost Ligand-Based Classification Benchmarks Reward Memorization Rather than GeneralizationWallach, Izhar; Heifets, AbrahamJournal of Chemical Information and Modeling (2018), 58 (5), 916-932CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Undetected overfitting can occur when there are significant redundancies between training and validation data. The authors describe AVE, a new measure of training-validation redundancy for ligand-based classification problems, that accounts for the similarity among inactive mols. as well as active ones. The authors investigated seven widely used benchmarks for virtual screening and classification, and the authors show that the amt. of AVE bias strongly correlates with the performance of ligand-based predictive methods irresp. of the predicted property, chem. fingerprint, similarity measure, or previously applied unbiasing techniques. Therefore, it may be the case that the previously reported performance of most ligand-based methods can be explained by overfitting to benchmarks rather than good prospective accuracy. - 8Wouters, O. J.; McKee, M.; Luyten, J. Estimated Research and Development Investment Needed to Bring a New Medicine to Market, 2009–2018. JAMA, J. Am. Med. Assoc. 2020, 323, 844– 853, DOI: 10.1001/jama.2020.1166[Crossref], [PubMed], [CAS], Google Scholar8https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB387oslyhsA%253D%253D&md5=ff63c2f03e59ef0223585a71fb6d5df0Estimated Research and Development Investment Needed to Bring a New Medicine to Market, 2009-2018Wouters Olivier J; McKee Martin; Luyten JeroenJAMA (2020), 323 (9), 844-853 ISSN:.IMPORTANCE: The mean cost of developing a new drug has been the subject of debate, with recent estimates ranging from $314 million to $2.8 billion. OBJECTIVE: To estimate the research and development investment required to bring a new therapeutic agent to market, using publicly available data. DESIGN AND SETTING: Data were analyzed on new therapeutic agents approved by the US Food and Drug Administration (FDA) between 2009 and 2018 to estimate the research and development expenditure required to bring a new medicine to market. Data were accessed from the US Securities and Exchange Commission, Drugs@FDA database, and ClinicalTrials.gov, alongside published data on clinical trial success rates. EXPOSURES: Conduct of preclinical and clinical studies of new therapeutic agents. MAIN OUTCOMES AND MEASURES: Median and mean research and development spending on new therapeutic agents approved by the FDA, capitalized at a real cost of capital rate (the required rate of return for an investor) of 10.5% per year, with bootstrapped CIs. All amounts were reported in 2018 US dollars. RESULTS: The FDA approved 355 new drugs and biologics over the study period. Research and development expenditures were available for 63 (18%) products, developed by 47 different companies. After accounting for the costs of failed trials, the median capitalized research and development investment to bring a new drug to market was estimated at $985.3 million (95% CI, $683.6 million-$1228.9 million), and the mean investment was estimated at $1335.9 million (95% CI, $1042.5 million-$1637.5 million) in the base case analysis. Median estimates by therapeutic area (for areas with ≥5 drugs) ranged from $765.9 million (95% CI, $323.0 million-$1473.5 million) for nervous system agents to $2771.6 million (95% CI, $2051.8 million-$5366.2 million) for antineoplastic and immunomodulating agents. Data were mainly accessible for smaller firms, orphan drugs, products in certain therapeutic areas, first-in-class drugs, therapeutic agents that received accelerated approval, and products approved between 2014 and 2018. Results varied in sensitivity analyses using different estimates of clinical trial success rates, preclinical expenditures, and cost of capital. CONCLUSIONS AND RELEVANCE: This study provides an estimate of research and development costs for new therapeutic agents based on publicly available data. Differences from previous studies may reflect the spectrum of products analyzed, the restricted availability of data in the public domain, and differences in underlying assumptions in the cost calculations.
- 9Morris, G. M.; Huey, R.; Lindstrom, W.; Sanner, M. F.; Belew, R. K.; Goodsell, D. S.; Olson, A. J. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. 2009, 30, 2785– 2791, DOI: 10.1002/jcc.21256[Crossref], [PubMed], [CAS], Google Scholar9https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1MXht1GitrnK&md5=679ce22fc50e9291c9aa16e7a1855845AutoDock and AutoDockTools: Automated docking with selective receptor flexibilityMorris, Garrett M.; Huey, Ruth; Lindstrom, William; Sanner, Michel F.; Belew, Richard K.; Goodsell, David S.; Olson, Arthur J.Journal of Computational Chemistry (2009), 30 (16), 2785-2791CODEN: JCCHDD; ISSN:0192-8651. (John Wiley & Sons, Inc.)We describe the testing and release of AutoDock4 and the accompanying graphical user interface AutoDockTools. AutoDock4 incorporates limited flexibility in the receptor. Several tests are reported here, including a redocking expt. with 188 diverse ligand-protein complexes and a cross-docking expt. using flexible sidechains in 87 HIV protease complexes. We also report its utility in anal. of covalently bound ligands, using both a grid-based docking method and a modification of the flexible sidechain technique. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2009.
- 10Verdonk, M. L.; Cole, J. C.; Hartshorn, M. J.; Murray, C. W.; Taylor, R. D. Improved protein–ligand docking using GOLD. Proteins: Struct., Funct., Bioinf. 2003, 52, 609– 623, DOI: 10.1002/prot.10465[Crossref], [PubMed], [CAS], Google Scholar10https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD3sXmvFGrsLg%253D&md5=73a627d99dab6de1ae364c8e8e5f0feaImproved protein-ligand docking using GOLDVerdonk, Marcel L.; Cole, Jason C.; Hartshorn, Michael J.; Murray, Christopher W.; Taylor, Richard D.Proteins: Structure, Function, and Genetics (2003), 52 (4), 609-623CODEN: PSFGEY; ISSN:0887-3585. (Wiley-Liss, Inc.)The Chemscore function was implemented as a scoring function for the protein-ligand docking program GOLD, and its performance compared to the original Goldscore function and two consensus docking protocols, "Goldscore-CS" and "Chemscore-GS," in terms of docking accuracy, prediction of binding affinities, and speed. In the "Goldscore-CS" protocol, dockings produced with the Goldscore function are scored and ranked with the Chemscore function; in the "Chemscore-GS" protocol, dockings produced with the Chemscore function are scored and ranked with the Goldscore function. Comparisons were made for a "clean" set of 224 protein-ligand complexes, and for two subsets of this set, one for which the ligands are "drug-like," the other for which they are "fragment-like.". For "drug-like" and "fragment-like" ligands, the docking accuracies obtained with Chemscore and Goldscore functions are similar. For larger ligands, Goldscore gives superior results. Docking with the Chemscore function is up to three times faster than docking with the Goldscore function. Both combined docking protocols give significant improvements in docking accuracy over the use of the Goldscore or Chemscore function alone. "Goldscore-CS" gives success rates of up to 81% (top-ranked GOLD soln. within 2.0 Å of the exptl. binding mode) for the "clean list," but at the cost of long search times. For most virtual screening applications, "Chemscore-GS" seems optimal; search settings that give docking speeds of around 0.25-1.3 min/compd. have success rates of about 78% for "drug-like" compds. and 85% for "fragment-like" compds. In terms of producing binding energy ests., the Goldscore function appears to perform better than the Chemscore function and the two consensus protocols, particularly for faster search settings. Even at docking speeds of around 1-2 min/compd., the Goldscore function predicts binding energies with a std. deviation of ∼10.5 kJ/mol.
- 11Hornik, K.; Stinchcombe, M.; White, H. Multilayer feedforward networks are universal approximators. Neural Netw 1989, 2, 359– 366, DOI: 10.1016/0893-6080(89)90020-8
- 12Shen, C.; Ding, J.; Wang, Z.; Cao, D.; Ding, X.; Hou, T. From machine learning to deep learning: Advances in scoring functions for protein–ligand docking. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2020, 10, e1429 DOI: 10.1002/wcms.1429[Crossref], [CAS], Google Scholar12https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXitlyjsbbN&md5=0dd878e211fad64fada03e883a5ab47aFrom machine learning to deep learning: Advances in scoring functions for protein-ligand dockingShen, Chao; Ding, Junjie; Wang, Zhe; Cao, Dongsheng; Ding, Xiaoqin; Hou, TingjunWiley Interdisciplinary Reviews: Computational Molecular Science (2020), 10 (1), e1429CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. Mol. docking has been regarded as a routine tool for drug discovery, but its accuracy highly depends on the reliability of scoring functions (SFs). With the rapid development of machine learning (ML) techniques, ML-based SFs have gradually emerged as a promising alternative for protein-ligand binding affinity prediction and virtual screening, and most of them have shown significantly better performance than a wide range of classical SFs. Emergence of more data-hungry deep learning (DL) approaches in recent years further fascinates the exploitation of more accurate SFs. Here, we summarize the progress of traditional ML-based SFs in the last few years and provide insights into recently developed DL-based SFs. We believe that the continuous improvement in ML-based SFs can surely guide the early-stage drug design and accelerate the discovery of new drugs. This article is categorized under: Computer and Information Science > Chemoinformatics.
- 13Jones, D.; Kim, H.; Zhang, X.; Zemla, A.; Stevenson, G.; Bennett, W. F. D.; Kirshner, D.; Wong, S. E.; Lightstone, F. C.; Allen, J. E. Improved Protein–Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference. J. Chem. Inf. Model. 2021, 61, 1583– 1592, DOI: 10.1021/acs.jcim.0c01306[ACS Full Text
], [CAS], Google Scholar
13https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXmvFaqtrc%253D&md5=11f92ba4bae7dabc75b9e55c1911352bImproved Protein-Ligand Binding Affinity Prediction with Structure-Based Deep Fusion InferenceJones, Derek; Kim, Hyojin; Zhang, Xiaohua; Zemla, Adam; Stevenson, Garrett; Bennett, W. F. Drew; Kirshner, Daniel; Wong, Sergio E.; Lightstone, Felice C.; Allen, Jonathan E.Journal of Chemical Information and Modeling (2021), 61 (4), 1583-1592CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Predicting accurate protein-ligand binding affinities is an important task in drug discovery but remains a challenge even with computationally expensive biophysics-based energy scoring methods and state-of-the-art deep learning approaches. Despite the recent advances in the application of deep convolutional and graph neural network-based approaches, it remains unclear what the relative advantages of each approach are and how they compare with physics-based methodologies that have found more mainstream success in virtual screening pipelines. We present fusion models that combine features and inference from complementary representations to improve binding affinity prediction. This, to our knowledge, is the first comprehensive study that uses a common series of evaluations to directly compare the performance of three-dimensional (3D)-convolutional neural networks (3D-CNNs), spatial graph neural networks (SG-CNNs), and their fusion. We use temporal and structure-based splits to assess performance on novel protein targets. To test the practical applicability of our models, we examine their performance in cases that assume that the crystal structure is not available. In these cases, binding free energies are predicted using docking pose coordinates as the inputs to each model. In addn., we compare these deep learning approaches to predictions based on docking scores and mol. mechanic/generalized Born surface area (MM/GBSA) calcns. Our results show that the fusion models make more accurate predictions than their constituent neural network models as well as docking scoring and MM/GBSA rescoring, with the benefit of greater computational efficiency than the MM/GBSA method. Finally, we provide the code to reproduce our results and the parameter files of the trained models used in this work. The software is available as open source at https://github.com/llnl/fast. Model parameter files are available at ftp://gdo-bioinformatics.ucllnl.org/fast/pdbbind2016_model_checkpoints/. - 14Chenthamarakshan, V.; Das, P.; Hoffman, S.; Strobelt, H.; Padhi, I.; Lim, K. W.; Hoover, B.; Manica, M.; Born, J.; Laino, T.; Mojsilovic, A. CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models. In 34th Conference on Neural Information Processing Systems , 2020; pp 4320– 4332.Google ScholarThere is no corresponding record for this reference.
- 15Hoffman, S. C.; Chenthamarakshan, V.; Wadhawan, K.; Chen, P.-Y.; Das, P. Optimizing molecules using efficient queries from property evaluations. Nat. Mach. Intell. 2022, 4, 21, DOI: 10.1038/s42256-021-00422-y
- 16Boyles, F.; Deane, C. M.; Morris, G. M. Learning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained on Docked Poses. J. Chem. Inf. Model. 2022, 62, 5329– 5341, DOI: 10.1021/acs.jcim.1c00096[ACS Full Text
], [CAS], Google Scholar
16https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhvFWms7rN&md5=2c539500f49ad27ff6682ff11e9360ebLearning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained on Docked PosesBoyles, Fergus; Deane, Charlotte M.; Morris, Garrett M.Journal of Chemical Information and Modeling (2022), 62 (22), 5329-5341CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Machine learning scoring functions for protein-ligand binding affinity have been found to consistently outperform classical scoring functions when trained and tested on crystal structures of bound protein-ligand complexes. However, it is less clear how these methods perform when applied to docked poses of complexes. We explore how the use of docked rather than crystallog. poses for both training and testing affects the performance of machine learning scoring functions. Using the PDBbind Core Sets as benchmarks, we show that the performance of a structure-based machine learning scoring function trained and tested on docked poses is lower than that of the same scoring function trained and tested on crystallog. poses. We construct a hybrid scoring function by combining both structure-based and ligand-based features, and show that its ability to predict binding affinity using docked poses is comparable to that of purely structure-based scoring functions trained and tested on crystal poses. We also present a new, freely available validation set-the Updated DUD-E Diverse Subset-for binding affinity prediction using data from DUD-E and ChEMBL. Despite strong performance on docked poses of the PDBbind Core Sets, we find that our hybrid scoring function sometimes generalizes poorly to a protein target not represented in the training set, demonstrating the need for improved scoring functions and addnl. validation benchmarks. - 17Su, M.; Yang, Q.; Du, Y.; Feng, G.; Liu, Z.; Li, Y.; Wang, R. Comparative Assessment of Scoring Functions: The CASF-2016 Update. J. Chem. Inf. Model. 2019, 59, 895– 913, DOI: 10.1021/acs.jcim.8b00545[ACS Full Text
], [CAS], Google Scholar
17https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXitlWhtLjM&md5=573295cfd2298a2dee933027041c3d9cComparative Assessment of Scoring Functions: The CASF-2016 UpdateSu, Minyi; Yang, Qifan; Du, Yu; Feng, Guoqin; Liu, Zhihai; Li, Yan; Wang, RenxiaoJournal of Chemical Information and Modeling (2019), 59 (2), 895-913CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)In structure-based drug design, scoring functions are often employed to evaluate protein-ligand interactions. A variety of scoring functions have been developed so far, and thus, some objective benchmarks are desired for assessing their strength and weakness. The comparative assessment of scoring functions (CASF) benchmark developed by us provides an answer to this demand. CASF is designed as a "scoring benchmark", where the scoring process is decoupled from the docking process to depict the performance of scoring function more precisely. Here, we describe the latest update of this benchmark, i.e., CASF-2016. Each scoring function is still evaluated by four metrics, including "scoring power", "ranking power", "docking power", and "screening power". Nevertheless, the evaluation methods have been improved considerably in several aspects. A new test set is compiled, which consists of 285 protein-ligand complexes with high-quality crystal structures and reliable binding consts. A panel of 25 scoring functions are tested on CASF-2016 as a demonstration. Our results reveal that the performance of current scoring functions is more promising in terms of docking power than scoring, ranking, and screening power. Scoring power is somewhat correlated with ranking power, so are docking power and screening power. The results obtained on CASF-2016 may provide valuable guidance for the end users to make smart choices among available scoring functions. Moreover, CASF is created as an open-access benchmark so that other researchers can utilize it to test a wider range of scoring functions. The complete CASF-2016 benchmark will be released on the PDBbind-CN web server (http://www.pdbbind-cn.org/casf.asp/) once this article is published. - 18Liu, Z.; Su, M.; Han, L.; Liu, J.; Yang, Q.; Li, Y.; Wang, R. Forging the Basis for Developing Protein–Ligand Interaction Scoring Functions. Acc. Chem. Res. 2017, 50, 302– 309, DOI: 10.1021/acs.accounts.6b00491[ACS Full Text
], [CAS], Google Scholar
18https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXit12qsLc%253D&md5=48d7287ef58c3ed8d9ff7579014fc486Forging the Basis for Developing Protein-Ligand Interaction Scoring FunctionsLiu, Zhihai; Su, Minyi; Han, Li; Liu, Jie; Yang, Qifan; Li, Yan; Wang, RenxiaoAccounts of Chemical Research (2017), 50 (2), 302-309CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)A review. In structure-based drug design, scoring functions are widely used for fast evaluation of protein-ligand interactions. They are often applied in combination with mol. docking and de novo design methods. Since the early 1990s, a whole spectrum of protein-ligand interaction scoring functions have been developed. Regardless of their tech. difference, scoring functions all need data sets combining protein-ligand complex structures and binding affinity data for parametrization and validation. However, data sets of this kind used to be rather limited in terms of size and quality. However, std. metrics for evaluating scoring function used to be ambiguous. Scoring functions are often tested in mol. docking or even virtual screening trials, which are not totally appropriate for reflecting the genuine quality of scoring functions. Collectively, these underlying obstacles have impeded the invention of more advanced scoring functions. In this Account, the authors describe their long-lasting efforts on overcoming these obstacles, which involve two related projects. On the first project, the authors have created the PDBbind database. It is the first database that systematically annotates the protein-ligand complexes in the Protein Data Bank (PDB) with exptl. binding data. This database has been updated annually since its first public release in 2004. The latest release (version 2016) provides binding data for 16,179 biomol. complexes in PDB. Data sets provided by PDBbind have been applied to many computational and statistical studies on protein-ligand interaction and various subjects. In particular, it has become a major data resource for scoring function development. On the second project, the authors established the Comparative Assessment of Scoring Functions (CASF) benchmark for scoring function evaluation. The authors' key idea is to decouple the "scoring" process from the "sampling" process, so scoring functions can be tested in a relatively pure context to reflect their quality. In the authors' latest work on this track, i.e., CASF-2013, the performance of a scoring function was quantified in four aspects, including "scoring power", "ranking power", "docking power", and "screening power". All four performance tests were conducted on a test set contg. 195 high-quality protein-ligand complexes selected from PDBbind. A panel of 20 std. scoring functions were tested as demonstration. Importantly, CASF is designed to be an open-access benchmark, with which scoring functions developed by different researchers can be compared on the same ground. Indeed, it has become a popular choice for scoring function validation in recent years. Despite the considerable progress that has been made so far, the performance of today's scoring functions still does not meet people's expectation in many aspects. There is a const. demand for more advanced scoring functions. The authors' efforts have helped to overcome some obstacles underlying scoring function development so that the researchers in this field can move forward faster. The authors will continue to improve the PDBbind database and the CASF benchmark in the future to keep them as useful community resources. - 19Yosinski, J.; Clune, J.; Nguyen, A.; Fuchs, T.; Lipson, H. Understanding neural networks through deep visualization. arXiv 2015, arXiv:1506.06579, DOI: 10.48550/arXiv.1506.06579
- 20Samek, W.; Binder, A.; Montavon, G.; Bach, S.; Müller, K.-R. Evaluating the visualization of what a Deep Neural Network has learned. arXiv 2015, arXiv:1509.06321, DOI: 10.48550/arXiv.1509.06321
- 21Poelking, C.; Chessari, G.; Murray, C. W.; Hall, R. J.; Colwell, L.; Verdonk, M. Meaningful machine learning models and machine-learned pharmacophores from fragment screening campaigns. arXiv 2022, arXiv:2204.06348, DOI: 10.48550/arXiv.2204.06348
- 22Sharma, B.; Chenthamarakshan, V.; Dhurandhar, A.; Pereira, S.; Hendler, J. A.; Dordick, J. S.; Das, P. Accurate Clinical Toxicity Prediction using Multi-task Deep Neural Nets and Contrastive Molecular Explanations. Sci. Rep. 2023, 13, 4908, DOI: 10.1038/s41598-023-31169-8[Crossref], [PubMed], [CAS], Google Scholar22https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3sXmsV2ks7g%253D&md5=63f1d0798d9ba9b71f3d970b58e752eaAccurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanationsSharma, Bhanushee; Chenthamarakshan, Vijil; Dhurandhar, Amit; Pereira, Shiranee; Hendler, James A.; Dordick, Jonathan S.; Das, PayelScientific Reports (2023), 13 (1), 4908CODEN: SRCEC3; ISSN:2045-2322. (Nature Portfolio)Explainable machine learning for mol. toxicity prediction is a promising approach for efficient drug development and chem. safety. A predictive ML model of toxicity can reduce exptl. cost and time while mitigating ethical concerns by significantly reducing animal and clin. testing. Herein, we use a deep learning framework for simultaneously modeling in vitro, in vivo, and clin. toxicity data. Two different mol. input representations are used; Morgan fingerprints and pre-trained SMILES embeddings. A multi-task deep learning model accurately predicts toxicity for all endpoints, including clin., as indicated by the area under the Receiver Operator Characteristic curve and balanced accuracy. In particular, pre-trained mol. SMILES embeddings as input to the multi-task model improved clin. toxicity predictions compared to existing models in MoleculeNet benchmark. Addnl., our multitask approach is comprehensive in the sense that it is comparable to state-of-the-art approaches for specific endpoints in in vitro, in vivo and clin. platforms. Through both the multi-task model and transfer learning, we were able to indicate the minimal need of in vivo data for clin. toxicity predictions. To provide confidence and explain the model's predictions, we adapt a post-hoc contrastive explanation method that returns pertinent pos. and neg. features, which correspond well to known mutagenic and reactive toxicophores, such as unsubstituted bonded heteroatoms, arom. amines, and Michael receptors. Furthermore, toxicophore recovery by pertinent feature anal. captures more of the in vitro (53%) and in vivo (56%), rather than of the clin. (8%), endpoints, and indeed uncovers a preference in known toxicophore data towards in vitro and in vivo exptl. data. To our knowledge, this is the first contrastive explanation, using both present and absent substructures, for predictions of clin. and in vivo mol. toxicity.
- 23Hadfield, T. E.; Scantlebury, J.; Deane, C. M. Exploring The Ability Of Machine Learning-Based Virtual Screening Models To Identify The Functional Groups Responsible For Binding. bioRxiv 2023, DOI: 10.1101/2023.04.29.538820
- 24McNutt, A. T.; Francoeur, P.; Aggarwal, R.; Masuda, T.; Meli, R.; Ragoza, M.; Sunseri, J.; Koes, D. R. GNINA 1.0: molecular docking with deep learning. J. Cheminf. 2021, 13, 43, DOI: 10.1186/s13321-021-00522-2
- 25Zheng, L.; Fan, J.; Mu, Y. OnionNet: a Multiple-Layer Intermolecular-Contact-Based Convolutional Neural Network for Protein–Ligand Binding Affinity Prediction. ACS Omega 2019, 4, 15956– 15965, DOI: 10.1021/acsomega.9b01997[ACS Full Text
], [CAS], Google Scholar
24https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhslKhtb3J&md5=0eaf1406acc14b7052d62481de5dec78OnionNet: a Multiple-Layer Intermolecular-Contact-Based Convolutional Neural Network for Protein-Ligand Binding Affinity PredictionZheng, Liangzhen; Fan, Jingrong; Mu, YuguangACS Omega (2019), 4 (14), 15956-15965CODEN: ACSODF; ISSN:2470-1343. (American Chemical Society)Computational drug discovery provides an efficient tool for helping large-scale lead mol. screening. One of the major tasks of lead discovery is identifying mols. with promising binding affinities toward a target, a protein in general. The accuracies of current scoring functions that are used to predict the binding affinity are not satisfactory enough. Thus, machine learning or deep learning based methods have been developed recently to improve the scoring functions. In this study, a deep convolutional neural network model (called OnionNet) is introduced; its features are based on rotation-free element-pair-specific contacts between ligands and protein atoms, and the contacts are further grouped into different distance ranges to cover both the local and nonlocal interaction information between the ligand and the protein. The prediction power of the model is evaluated and compared with other scoring functions using the comparative assessment of scoring functions (CASF-2013) benchmark and the v2016 core set of the PDBbind database. The robustness of the model is further explored by predicting the binding affinities of the complexes generated from docking simulations instead of exptl. detd. PDB structures. - 26Li, Y.; Rezaei, M. A.; Li, C.; Li, X. DeepAtom: A Framework for Protein-Ligand Binding Affinity Prediction. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) ; 2019; pp 303– 310.
- 27Feinberg, E. N.; Sur, D.; Wu, Z.; Husic, B. E.; Mai, H.; Li, Y.; Sun, S.; Yang, J.; Ramsundar, B.; Pande, V. S. PotentialNet for molecular property prediction. ACS Cent. Sci. 2018, 4, 1520– 1530, DOI: 10.1021/acscentsci.8b00507[ACS Full Text
], [CAS], Google Scholar
26https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXitVGrs7fF&md5=570999e7f212d7c737b24f71665893b7PotentialNet for Molecular Property PredictionFeinberg, Evan N.; Sur, Debnil; Wu, Zhenqin; Husic, Brooke E.; Mai, Huanghao; Li, Yang; Sun, Saisai; Yang, Jianyi; Ramsundar, Bharath; Pande, Vijay S.ACS Central Science (2018), 4 (11), 1520-1530CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)The arc of drug discovery entails a multiparameter optimization problem spanning vast length scales. The key parameters range from soly. (angstroms) to protein-ligand binding (nanometers) to in vivo toxicity (meters). Through feature learning-instead of feature engineering-deep neural networks promise to outperform both traditional physics-based and knowledge-based machine learning models for predicting mol. properties pertinent to drug discovery. To this end, we present the PotentialNet family of graph convolutions. These models are specifically designed for and achieve state-of-the-art performance for protein-ligand binding affinity. We further validate these deep neural networks by setting new stds. of performance in several ligand-based tasks. In parallel, we introduce a new metric, the Regression Enrichment Factor EF(R)χ, to measure the early enrichment of computational models for chem. data. Finally, we introduce a cross-validation strategy based on structural homol. clustering that can more accurately measure model generalizability, which crucially distinguishes the aims of machine learning for drug discovery from std. machine learning tasks. - 28Morrone, J. A.; Weber, J. K.; Huynh, T.; Luo, H.; Cornell, W. D. Combining Docking Pose Rank and Structure with Deep Learning Improves Protein–Ligand Binding Mode Prediction over a Baseline Docking Approach. J. Chem. Inf. Model. 2020, 60, 4170– 4179, DOI: 10.1021/acs.jcim.9b00927[ACS Full Text
], [CAS], Google Scholar
27https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXjsVOgtbs%253D&md5=ed7f0ba1797757a61d64c817db36cc13Combining Docking Pose Rank and Structure with Deep Learning Improves Protein-Ligand Binding Mode Prediction over a Baseline Docking ApproachMorrone, Joseph A.; Weber, Jeffrey K.; Huynh, Tien; Luo, Heng; Cornell, Wendy D.Journal of Chemical Information and Modeling (2020), 60 (9), 4170-4179CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)The authors present a simple, modular graph-based convolutional neural network that takes structural information from protein-ligand complexes as input to generate models for activity and binding mode prediction. Complex structures are generated by a std. docking procedure and fed into a dual-graph architecture that includes sep. subnetworks for the ligand bonded topol. and the ligand-protein contact map. Recent work has indicated that data set bias drives many past promising results derived from combining deep learning and docking. The dual-graph network allows contributions from ligand identity that give rise to such biases to be distinguished from effects of protein-ligand interactions on classification. The authors show that the neural network is capable of learning from protein structural information when, as in the case of binding mode prediction, an unbiased data set is constructed. The authors next develop a deep learning model for binding mode prediction that uses docking ranking as input in combination with docking structures. This strategy mirrors past consensus models and outperforms a baseline docking program (AutoDock Vina) in a variety of tests, including on cross-docking data sets that mimic real-world docking use cases. Furthermore, the magnitudes of network predictions serve as reliable measures of model confidence. - 29Jiang, D.; Hsieh, C.-Y.; Wu, Z.; Kang, Y.; Wang, J.; Wang, E.; Liao, B.; Shen, C.; Xu, L.; Wu, J.; Cao, D.; Hou, T. InteractionGraphNet: A Novel and Efficient Deep Graph Representation Learning Framework for Accurate Protein–Ligand Interaction Predictions. J. Med. Chem. 2021, 64, 18209– 18232, DOI: 10.1021/acs.jmedchem.1c01830[ACS Full Text
], [CAS], Google Scholar
28https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXislSnsrnE&md5=bbee6eb73a8c6b95c389fe11e45c7723InteractionGraphNet: A Novel and Efficient Deep Graph Representation Learning Framework for Accurate Protein-Ligand Interaction PredictionsJiang, Dejun; Hsieh, Chang-Yu; Wu, Zhenxing; Kang, Yu; Wang, Jike; Wang, Ercheng; Liao, Ben; Shen, Chao; Xu, Lei; Wu, Jian; Cao, Dongsheng; Hou, TingjunJournal of Medicinal Chemistry (2021), 64 (24), 18209-18232CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)Accurate quantification of protein-ligand interactions remains a key challenge to structure-based drug design. However, traditional machine learning (ML)-based methods based on handcrafted descriptors, one-dimensional protein sequences, and/or two-dimensional graph representations limit their capability to learn the generalized mol. interactions in 3D space. Here, we proposed a novel deep graph representation learning framework named InteractionGraphNet (IGN) to learn the protein-ligand interactions from the 3D structures of protein-ligand complexes. In IGN, two independent graph convolution modules were stacked to sequentially learn the intramol. and intermol. interactions, and the learned intermol. interactions can be efficiently used for subsequent tasks. Extensive binding affinity prediction, large-scale structure-based virtual screening, and pose prediction expts. demonstrated that IGN achieved better or competitive performance against other state-of-the-art ML-based baselines and docking programs. More importantly, such state-of-the-art performance was proven from the successful learning of the key features in protein-ligand interactions instead of just memorizing certain biased patterns from data. - 30Hochuli, J.; Helbling, A.; Skaist, T.; Ragoza, M.; Koes, D. R. Visualizing convolutional neural network protein-ligand scoring. J. Mol. Graphics Modell. 2018, 84, 96– 108, DOI: 10.1016/j.jmgm.2018.06.005[Crossref], [PubMed], [CAS], Google Scholar29https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXhtF2ltb3P&md5=db697d859132f02c4a2cef4bfc00ea88Visualizing convolutional neural network protein-ligand scoringHochuli, Joshua; Helbling, Alec; Skaist, Tamar; Ragoza, Matthew; Koes, David RyanJournal of Molecular Graphics & Modelling (2018), 84 (), 96-108CODEN: JMGMFI; ISSN:1093-3263. (Elsevier Ltd.)Protein-ligand scoring is an important step in a structure-based drug design pipeline. Selecting a correct binding pose and predicting the binding affinity of a protein-ligand complex enables effective virtual screening. Machine learning techniques can make use of the increasing amts. of structural data that are becoming publicly available. Convolutional neural network (CNN) scoring functions in particular have shown promise in pose selection and affinity prediction for protein-ligand complexes. Neural networks are known for being difficult to interpret. Understanding the decisions of a particular network can help tune parameters and training data to maximize performance. Visualization of neural networks helps decomp. complex scoring functions into pictures that are more easily parsed by humans. Here we present three methods for visualizing how individual protein-ligand complexes are interpreted by 3D convolutional neural networks. We also present a visualization of the convolutional filters and their wts. We describe how the intuition provided by these visualizations aids in network design.
- 31Lim, J.; Ryu, S.; Park, K.; Choe, Y. J.; Ham, J.; Kim, W. Y. Predicting Drug–Target Interaction Using a Novel Graph Neural Network with 3D Structure-Embedded Graph Representation. J. Chem. Inf. Model. 2019, 59, 3981– 3988, DOI: 10.1021/acs.jcim.9b00387[ACS Full Text
], [CAS], Google Scholar
30https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhs1egur3F&md5=670cd989a369d67c734f6762a5a906cfPredicting Drug-Target Interaction Using a Novel Graph Neural Network with 3D Structure-Embedded Graph RepresentationLim, Jaechang; Ryu, Seongok; Park, Kyubyong; Choe, Yo Joong; Ham, Jiyeon; Kim, Woo YounJournal of Chemical Information and Modeling (2019), 59 (9), 3981-3988CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)We propose a novel deep learning approach for predicting drug-target interaction using a graph neural network. We introduce a distance-aware graph attention algorithm to differentiate various types of intermol. interactions. Furthermore, we ext. the graph feature of intermol. interactions directly from the 3D structural information on the protein-ligand binding pose. Thus, the model can learn key features for accurate predictions of drug-target interaction rather than just memorize certain patterns of ligand mols. As a result, our model shows better performance than docking and other deep learning methods for both virtual screening (AUROC of 0.968 for the DUD-E test set) and pose prediction (AUROC of 0.935 for the PDBbind test set). In addn., it can reproduce the natural population distribution of active mols. and inactive mols. - 32Hadfield, T. E.; Imrie, F.; Merritt, A.; Birchall, K.; Deane, C. M. Incorporating Target-Specific Pharmacophoric Information into Deep Generative Models for Fragment Elaboration. J. Chem. Inf. Model. 2022, 62, 2280– 2292, DOI: 10.1021/acs.jcim.1c01311[ACS Full Text
], [CAS], Google Scholar
31https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XhtFGru7rE&md5=c639b01f0c509bbd2e20308fa044bdb5Incorporating Target-Specific Pharmacophoric Information into Deep Generative Models for Fragment ElaborationHadfield, Thomas E.; Imrie, Fergus; Merritt, Andy; Birchall, Kristian; Deane, Charlotte M.Journal of Chemical Information and Modeling (2022), 62 (10), 2280-2292CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Despite recent interest in deep generative models for scaffold elaboration, their applicability to fragment-to-lead campaigns has so far been limited. This is primarily due to their inability to account for local protein structure or a user's design hypothesis. We propose a novel method for fragment elaboration, STRIFE, that overcomes these issues. STRIFE takes as input fragment hotspot maps (FHMs) extd. from a protein target and processes them to provide meaningful and interpretable structural information to its generative model, which in turn is able to rapidly generate elaborations with complementary pharmacophores to the protein. In a large-scale evaluation, STRIFE outperforms existing, structure-unaware, fragment elaboration methods in proposing highly ligand-efficient elaborations. In addn. to automatically extg. pharmacophoric information from a protein target's FHM, STRIFE optionally allows the user to specify their own design hypotheses. - 33Curran, P. R.; Radoux, C. J.; Smilova, M. D.; Sykes, R. A.; Higueruelo, A. P.; Bradley, A. R.; Marsden, B. D.; Spring, D. R.; Blundell, T. L.; Leach, A. R.; Pitt, W. R.; Cole, J. C. Hotspots API: A Python Package for the Detection of Small Molecule Binding Hotspots and Application to Structure-Based Drug Design. J. Chem. Inf. Model. 2020, 60, 1911– 1916, DOI: 10.1021/acs.jcim.9b00996[ACS Full Text
], [CAS], Google Scholar
32https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXlsFahsbw%253D&md5=9402dded4d637e5b324e329d9e2c9a08Hotspots API: A Python Package for the Detection of Small Molecule Binding Hotspots and Application to Structure-Based Drug DesignCurran, Peter R.; Radoux, Chris J.; Smilova, Mihaela D.; Sykes, Richard A.; Higueruelo, Alicia P.; Bradley, Anthony R.; Marsden, Brian D.; Spring, David R.; Blundell, Tom L.; Leach, Andrew R.; Pitt, William R.; Cole, Jason C.Journal of Chemical Information and Modeling (2020), 60 (4), 1911-1916CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Methods that survey protein surfaces for binding hotspots can help to evaluate target tractability and guide exploration of potential ligand binding regions. Fragment Hotspot Maps builds upon interaction data mined from the CSD (Cambridge Structural Database) and exploits the idea of identifying hotspots using small chem. fragments, which is now widely used to design new drug leads. Prior to this publication, Fragment Hotspot Maps was only publicly available through a web application. To increase the accessibility of this algorithm we present the Hotspots API (application programming interface), a toolkit that offers programmatic access to the core Fragment Hotspot Maps algorithm, thereby facilitating the interpretation and application of 3the anal. To demonstrate the package's utility, we present a workflow which automatically derives protein hydrogen-bond constraints for mol. docking with GOLD. The Hotspots API is available from https://github.com/prcurran/hotspots under the MIT license and is dependent upon the com. CSD Python API. - 34Salentin, S.; Schreiber, S.; Haupt, V. J.; Adasme, M. F.; Schroeder, M. PLIP: fully automated protein–ligand interaction profiler. Nucleic Acids Res. 2015, 43, W443– W447, DOI: 10.1093/nar/gkv315[Crossref], [PubMed], [CAS], Google Scholar33https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXhtVymtbrI&md5=4878881a7bd715e5585682876e102e65PLIP: fully automated protein-ligand interaction profilerSalentin, Sebastian; Schreiber, Sven; Haupt, V. Joachim; Adasme, Melissa F.; Schroeder, MichaelNucleic Acids Research (2015), 43 (W1), W443-W447CODEN: NARHAD; ISSN:0305-1048. (Oxford University Press)The characterization of interactions in protein-ligand complexes is essential for research in structural bioinformatics, drug discovery and biol. However, comprehensive tools are not freely available to the research community. Here, we present the protein-ligand interaction profiler (PLIP), a novel web service for fully automated detection and visualization of relevant non-covalent protein-ligand contacts in 3D structures, freely available at projects.biotec.tu-dresden.de/plip-web. The input is either a Protein Data Bank structure, a protein or ligand name, or a custom protein-ligand complex (e.g. from docking). In contrast to other tools, the rule-based PLIP algorithm does not require any structure prepn. It returns a list of detected interactions on single atom level, covering seven interaction types (hydrogen bonds, hydrophobic contacts, pi-stacking, pi-cation interactions, salt bridges, water bridges and halogen bonds). PLIP stands out by offering publication-ready images, PyMOL sesions files to generate custom images and parsable result files to facilitate successive data processing. The full python source code is available for download on the website. PLIP's command-line mode allows for high-throughput interaction profiling.
- 35Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A. N.; Kaiser, L. u.; Polosukhin, I. Attention is All you Need. arXiv 2017, arXiv:1706.03762, DOI: 10.48550/arXiv.1706.03762
- 36Francoeur, P. G.; Masuda, T.; Sunseri, J.; Jia, A.; Iovanisci, R. B.; Snyder, I.; Koes, D. R. Three-Dimensional Convolutional Neural Networks and a Cross-Docked Data Set for Structure-Based Drug Design. J. Chem. Inf. Model. 2020, 60, 4200– 4215, DOI: 10.1021/acs.jcim.0c00411[ACS Full Text
], [CAS], Google Scholar
35https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhslWqs73N&md5=5a5465187dbdef41d556baddd1a8e4d9Three-Dimensional Convolutional Neural Networks and a Cross-Docked Data Set for Structure-Based Drug DesignFrancoeur, Paul G.; Masuda, Tomohide; Sunseri, Jocelyn; Jia, Andrew; Iovanisci, Richard B.; Snyder, Ian; Koes, David R.Journal of Chemical Information and Modeling (2020), 60 (9), 4200-4215CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)One of the main challenges in drug discovery is predicting protein-ligand binding affinity. Recently, machine learning approaches have made substantial progress on this task. However, current methods of model evaluation are overly optimistic in measuring generalization to new targets, and there does not exist a std. data set of sufficient size to compare performance between models. We present a new data set for structure-based machine learning, the CrossDocked2020 set, with 22.5 million poses of ligands docked into multiple similar binding pockets across the Protein Data Bank, and perform a comprehensive evaluation of grid-based convolutional neural network (CNN) models on this data set. We also demonstrate how the partitioning of the training data and test data can impact the results of models trained with the PDBbind data set, how performance improves by adding more lower-quality training data, and how training with docked poses imparts pose sensitivity to the predicted affinity of a complex. Our best performing model, an ensemble of five densely connected CNNs, achieves a root mean squared error of 1.42 and Pearson R of 0.612 on the affinity prediction task, an AUC of 0.956 at binding pose classification, and a 68.4% accuracy at pose selection on the CrossDocked2020 set. By providing data splits for clustered cross-validation and the raw data for the CrossDocked2020 set, we establish the first standardized data set for training machine learning models to recognize ligands in noncognate target structures while also greatly expanding the no. of poses available for training. In order to facilitate community adoption of this data set for benchmarking protein-ligand binding affinity prediction, we provide our models, wts., and the CrossDocked2020 set at https://github.com/gnina/models. - 37Landrum, G. RDKit: Open-source cheminformatics ; 2016; https://www.bibsonomy.org/bibtex/28d01fceeccd6bf2486e47d7c4207b108/salotz.
- 38O’Boyle, N. M.; Banck, M.; James, C. A.; Morley, C.; Vandermeersch, T.; Hutchison, G. R. Open Babel: An open chemical toolbox. J. Cheminf. 2011, 3, 33, DOI: 10.1186/1758-2946-3-33[Crossref], [PubMed], [CAS], Google Scholar37https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXhsVWjurbF&md5=74e4f19b7f87417f916d57f7abcfb761Open Babel: an open chemical toolboxO'Boyle, Noel M.; Banck, Michael; James, Craig A.; Morley, Chris; Vandermeersch, Tim; Hutchison, Geoffrey R.Journal of Cheminformatics (2011), 3 (), 33CODEN: JCOHB3; ISSN:1758-2946. (Chemistry Central Ltd.)Background: A frequent problem in computational modeling is the interconversion of chem. structures between different formats. While std. interchange formats exist (for example, Chem. Markup Language) and de facto stds. have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chem. data, differences in the data stored by different formats (0D vs. 3D, for example), and competition between software along with a lack of vendor-neutral formats. Results: We discuss, for the first time, Open Babel, an open-source chem. toolbox that speaks the many languages of chem. data. Open Babel version 2.3 interconverts over 110 formats. The need to represent such a wide variety of chem. and mol. data requires a library that implements a wide range of cheminformatics algorithms, from partial charge assignment and aromaticity detection, to bond order perception and canonicalization. We detail the implementation of Open Babel, describe key advances in the 2.3 release, and outline a variety of uses both in terms of software products and scientific research, including applications far beyond simple format interconversion. Conclusions: Open Babel presents a soln. to the proliferation of multiple chem. file formats. In addn., it provides a variety of useful utilities from conformer searching and 2D depiction, to filtering, batch conversion, and substructure and similarity searching. For developers, it can be used as a programming library to handle chem. data in areas such as org. chem., drug design, materials science, and computational chem. It is freely available under an open-source license.
- 39Yang, J.; Shen, C.; Huang, N. Predicting or Pretending: Artificial Intelligence for Protein-Ligand Interactions Lack of Sufficiently Large and Unbiased Datasets. Frontiers in Pharmacology 2020, 11, 1, DOI: 10.3389/fphar.2020.00069
- 40Scantlebury, J.; Brown, N.; Von Delft, F.; Deane, C. M. Data Set Augmentation Allows Deep Learning-Based Virtual Screening to Better Generalize to Unseen Target Classes and Highlight Important Binding Interactions. J. Chem. Inf. Model. 2020, 60, 3722– 3730, DOI: 10.1021/acs.jcim.0c00263[ACS Full Text
], [CAS], Google Scholar
39https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhsVaqt77O&md5=8859d41da975e883cac7c2489befab60Data Set Augmentation Allows Deep Learning-Based Virtual Screening to Better Generalize to Unseen Target Classes and Highlight Important Binding InteractionsScantlebury, Jack; Brown, Nathan; Von Delft, Frank; Deane, Charlotte M.Journal of Chemical Information and Modeling (2020), 60 (8), 3722-3730CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Current deep learning methods for structure-based virtual screening take the structures of both the protein and the ligand as input but make little or no use of the protein structure when predicting ligand binding. Here, a relatively simple method of data set augmentation forces such deep learning methods to take into account information from the protein. Models trained in this way are more generalizable (make better predictions on protein/ligand complexes from a different distribution to the training data). They also assign more meaningful importance to the protein and ligand atoms involved in binding. Overall, the authors' results show that data set augmentation can help deep learning-based virtual screening to learn phys. interactions rather than data set biases. - 41Satorras, V. G.; Hoogeboom, E.; Welling, M. E(n) Equivariant Graph Neural Networks. arXiv 2021, arXiv:2102.09844, DOI: 10.48550/arXiv.2102.09844
- 42Ragoza, M.; Hochuli, J.; Idrobo, E.; Sunseri, J.; Koes, D. R. Protein-Ligand Scoring with Convolutional Neural Networks. J. Chem. Inf. Model. 2017, 57, 942– 957, DOI: 10.1021/acs.jcim.6b00740[ACS Full Text
], [CAS], Google Scholar
41https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXlsVems7Y%253D&md5=9cae97167da0c93e1d896f85339cca7eProtein-Ligand Scoring with Convolutional Neural NetworksRagoza, Matthew; Hochuli, Joshua; Idrobo, Elisa; Sunseri, Jocelyn; Koes, David RyanJournal of Chemical Information and Modeling (2017), 57 (4), 942-957CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Computational approaches to drug discovery can reduce the time and cost assocd. with exptl. assays and enable the screening of novel chemotypes. Structure-based drug design methods rely on scoring functions to rank and predict binding affinities and poses. The ever-expanding amt. of protein-ligand binding and structural data enables the use of deep machine learning techniques for protein-ligand scoring. We describe convolutional neural network (CNN) scoring functions that take as input a comprehensive three-dimensional (3D) representation of a protein-ligand interaction. A CNN scoring function automatically learns the key features of protein-ligand interactions that correlate with binding. We train and optimize our CNN scoring functions to discriminate between correct and incorrect binding poses and known binders and nonbinders. We find that our CNN scoring function outperforms the AutoDock Vina scoring function when ranking poses both for pose prediction and virtual screening. - 43Zhu, H.; Yang, J.; Huang, N. Assessment of the Generalization Abilities of Machine-Learning Scoring Functions for Structure-Based Virtual Screening. J. Chem. Inf. Model. 2022, 62, 5485– 5502, DOI: 10.1021/acs.jcim.2c01149[ACS Full Text
], [CAS], Google Scholar
42https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38Xis1Kru7rK&md5=d6197f1f9a4da0df36b420bb9746b446Assessment of the Generalization Abilities of Machine-Learning Scoring Functions for Structure-Based Virtual ScreeningZhu, Hui; Yang, Jincai; Huang, NiuJournal of Chemical Information and Modeling (2022), 62 (22), 5485-5502CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)In structure-based virtual screening (SBVS), it is crit. that scoring functions capture protein-ligand at. interactions. By focusing on the local domains of ligand binding pockets, a standardized pocket Pfam-based clustering (Pfam-cluster) approach was developed to assess the cross-target generalization ability of machine-learning scoring functions (MLSFs). Subsequently, 12 typical MLSFs were evaluated using random cross-validation (Random-CV), protein sequence similarity-based cross-validation (Seq-CV), and pocket Pfam-based cross-validation (Pfam-CV) methods. Surprisingly, all of the tested models showed decreased performances from Random-CV to Seq-CV to Pfam-CV expts., not showing satisfactory generalization capacity. Our interpretable anal. suggested that the predictions on novel targets by MLSFs were dependent on buried solvent-accessible surface area (SASA)-related features of complex structures, with greater predicted binding affinities on complexes owning larger protein-ligand interfaces. By combining buried SASA-related features with target-specific patterns that were only shared among structurally similar compds. in the same cluster, the random forest (RF)-Score attained a good performance in the Random-CV test. Based on these findings, we strongly advise assessing the generalization ability of MLSFs with the Pfam-cluster approach and being cautious with the features learned by MLSFs. - 44Radoux, C. J.; Olsson, T. S.; Pitt, W. R.; Groom, C. R.; Blundell, T. L. Identifying interactions that determine fragment binding at protein hotspots. J. Med. Chem. 2016, 59, 4314– 4325, DOI: 10.1021/acs.jmedchem.5b01980[ACS Full Text
], [CAS], Google Scholar
43https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28XltlOms7Y%253D&md5=fa25eabfe291f9519f5701bcbc6c52c3Identifying Interactions that Determine Fragment Binding at Protein HotspotsRadoux, Chris J.; Olsson, Tjelvar S. G.; Pitt, Will R.; Groom, Colin R.; Blundell, Tom L.Journal of Medicinal Chemistry (2016), 59 (9), 4314-4325CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)Locating a ligand-binding site is an important first step in structure-guided drug discovery, but current methods do little to suggest which interactions within a pocket are the most important for binding. Here the authors illustrate a method that samples at. hotspots with simple mol. probes to produce fragment hotspot maps. These maps specifically highlight fragment-binding sites and their corresponding pharmacophores. For ligand-bound structures, they provide an intuitive visual guide within the binding site, directing medicinal chemists where to grow the mol. and alerting them to suboptimal interactions within the original hit. The fragment hotspot map calcn. is validated using exptl. binding positions of 21 fragments and subsequent lead mols. The ligands are found in high scoring areas of the fragment hotspot maps, with fragment atoms having a median percentage rank of 97%. Protein kinase B and pantothenate synthetase are examd. in detail. In each case, the fragment hotspot maps are able to rationalize a Free-Wilson anal. of SAR data from a fragment-based drug design project. - 45Skyner, R.; von Delft, F. Xchem, fragalysis; https://fragalysis.diamond.ac.uk/ (accessed: 2022-07-30).Google ScholarThere is no corresponding record for this reference.
- 46Wang, C.; Zhang, Y. Improving scoring-docking-screening powers of protein–ligand scoring functions using random forest. J. Comput. Chem. 2017, 38, 169– 177, DOI: 10.1002/jcc.24667[Crossref], [PubMed], [CAS], Google Scholar45https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28XhvVygurzM&md5=1a3ea0b5f159c9c0edf315fed2110ed0Improving scoring-docking-screening powers of protein-ligand scoring functions using random forestWang, Cheng; Zhang, YingkaiJournal of Computational Chemistry (2017), 38 (3), 169-177CODEN: JCCHDD; ISSN:0192-8651. (John Wiley & Sons, Inc.)The development of new protein-ligand scoring functions using machine learning algorithms, such as random forest, has been of significant interest. By efficiently using expanded feature sets and a large set of exptl. data, random forest based scoring functions (RFbScore) can achieve better correlations to exptl. protein-ligand binding data with known crystal structures; however, more extensive tests indicate that such enhancement in scoring power comes with significant under-performance in docking and screening power tests compared to traditional scoring functions. To improve scoring-docking-screening powers of protein-ligand docking functions simultaneously, the authors have introduced a ΔvinaRF parameterization and feature selection framework based on random forest. The authors' developed scoring function ΔvinaRF20, which employs 20 descriptors in addn. to the AutoDock Vina score, can achieve superior performance in all power tests of both CASF-2013 and CASF-2007 benchmarks compared to classical scoring functions. The ΔvinaRF20 scoring function and its code are freely available on the web at: https://www.nyu.edu/projects/yzhang/DeltaVina.
- 47Lu, J.; Hou, X.; Wang, C.; Zhang, Y. Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring Functions. J. Chem. Inf. Model. 2019, 59, 4540– 4549, DOI: 10.1021/acs.jcim.9b00645[ACS Full Text
], [CAS], Google Scholar
46https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXitVSqs73O&md5=57fea8fa6135a109c3bcfd5ddffb1b33Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring FunctionsLu, Jianing; Hou, Xuben; Wang, Cheng; Zhang, YingkaiJournal of Chemical Information and Modeling (2019), 59 (11), 4540-4549CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Structure-based drug design is critically dependent on accuracy of mol. docking scoring functions, and there is a significant interest to advance scoring functions with machine learning approaches. In this work, by judiciously expanding the training set, exploring new features related to explicit mediating water mols. as well as ligand conformation stability, and applying extreme gradient boosting (XGBoost) with Δ-Vina parametrization, we have improved robustness and applicability of machine-learning scoring functions. The new scoring function ΔvinaXGB can not only perform consistently among the top compared to classical scoring functions for the CASF-2016 benchmark but also achieves significantly better prediction accuracy in different types of structures that mimic real docking applications. - 48Moon, S.; Zhung, W.; Yang, S.; Lim, J.; Kim, W. Y. PIGNet: a physics-informed deep learning model toward generalized drug–target interaction predictions. Chem. Sci. 2022, 13, 3661– 3673, DOI: 10.1039/D1SC06946B[Crossref], [PubMed], [CAS], Google Scholar47https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XmtlKktb0%253D&md5=c7ca118db214dde8563a0945db0a2331PIGNet: a physics-informed deep learning model toward generalized drug-target interaction predictionsMoon, Seokhyun; Zhung, Wonho; Yang, Soojung; Lim, Jaechang; Kim, Woo YounChemical Science (2022), 13 (13), 3661-3673CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Recently, deep neural network (DNN)-based drug-target interaction (DTI) models were highlighted for their high accuracy with affordable computational costs. Yet, the models' insufficient generalization remains a challenging problem in the practice of in silico drug discovery. We propose two key strategies to enhance generalization in the DTI model. The first is to predict the atom-atom pairwise interactions via physics-informed equations parameterized with neural networks and provides the total binding affinity of a protein-ligand complex as their sum. We further improved the model generalization by augmenting a broader range of binding poses and ligands to training data. We validated our model, PIGNet, in the comparative assessment of scoring functions (CASF) 2016, demonstrating the outperforming docking and screening powers than previous methods. Our physics-informing strategy also enables the interpretation of predicted affinities by visualizing the contribution of ligand substructures, providing insights for further ligand optimization.
- 49Shen, C.; Zhang, X.; Deng, Y.; Gao, J.; Wang, D.; Xu, L.; Pan, P.; Hou, T.; Kang, Y. Boosting Protein–Ligand Binding Pose Prediction and Virtual Screening Based on Residue–Atom Distance Likelihood Potential and Graph Transformer. J. Med. Chem. 2022, 65, 10691– 10706, DOI: 10.1021/acs.jmedchem.2c00991[ACS Full Text
], [CAS], Google Scholar
48https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XhvF2htrbO&md5=217e5d659ff8c69c9c9559c99344aef0Boosting Protein-Ligand Binding Pose Prediction and Virtual Screening Based on Residue-Atom Distance Likelihood Potential and Graph TransformerShen, Chao; Zhang, Xujun; Deng, Yafeng; Gao, Junbo; Wang, Dong; Xu, Lei; Pan, Peichen; Hou, Tingjun; Kang, YuJournal of Medicinal Chemistry (2022), 65 (15), 10691-10706CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)The past few years have witnessed enormous progress toward applying machine learning approaches to the development of protein-ligand scoring functions. However, the robust performance and wide applicability of scoring functions remain a big challenge for increasing the success rate of docking-based virtual screening. Herein, a novel scoring function named RTMScore was developed by introducing a tailored residue-based graph representation strategy and several graph transformer layers for the learning of protein and ligand representations, followed by a mixt. d. network to obtain residue-atom distance likelihood potential. Our approach was resolutely validated on the CASF-2016 benchmark, and the results indicate that RTMScore can outperform almost all of the other state-of-the-art methods in terms of both the docking and screening powers. Further evaluation confirms the robustness of our approach that can not only retain its docking power on cross-docked poses but also achieve improved performance as a rescoring tool in larger-scale virtual screening. - 50Liao, Z.; You, R.; Huang, X.; Yao, X.; Huang, T.; Zhu, S. DeepDock: Enhancing Ligand-protein Interaction Prediction by a Combination of Ligand and Structure Information. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) ; 2019; pp 311– 317.
- 51https://fragalysis.diamond.ac.uk/viewer/react/download/tag/5740113a-7603-4af4-9523-7f902186d4a2 (accessed: 2022-10-14).Google ScholarThere is no corresponding record for this reference.
- 52Carbery, A.; Skyner, R.; von Delft, F.; Deane, C. M. Fragment Libraries Designed to Be Functionally Diverse Recover Protein Binding Information More Efficiently Than Standard Structurally Diverse Libraries. J. Med. Chem. 2022, 65, 11404– 11413, DOI: 10.1021/acs.jmedchem.2c01004[ACS Full Text
], [CAS], Google Scholar
51https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XitFWktLnO&md5=20c210891befe38eb6ec658cba2a01b8Fragment Libraries Designed to Be Functionally Diverse Recover Protein Binding Information More Efficiently Than Standard Structurally Diverse LibrariesCarbery, Anna; Skyner, Rachael; von Delft, Frank; Deane, Charlotte M.Journal of Medicinal Chemistry (2022), 65 (16), 11404-11413CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)Current fragment-based drug design relies on the efficient exploration of chem. space by using structurally diverse libraries of small fragments. However, structurally dissimilar compds. can exploit the same interactions and thus be functionally similar. Using three-dimensional structures of many fragments bound to multiple targets, we examd. if a better strategy for selecting fragments for screening libraries exists. We show that structurally diverse fragments can be described as functionally redundant, often making the same interactions. Ranking fragments by the no. of novel interactions they made, we show that functionally diverse selections of fragments substantially increase the amt. of information recovered for unseen targets compared to the amts. recovered by other methods of selection. Using these results, we design small functionally efficient libraries that can give significantly more information about new protein targets than similarly sized structurally diverse libraries. By covering more functional space, we can generate more diverse sets of drug leads. - 53Schuller, M. Fragment binding to the Nsp3 macrodomain of SARS-CoV-2 identified through crystallographic screening and computational docking. Sci. Adv. 2021, 7, eabf8711 DOI: 10.1126/sciadv.abf8711
- 54Imprachim, N.; Yosaatmadja, Y.; Newman, J. A. Crystal structures and fragment screening of SARS-CoV-2 NSP14 reveal details of exoribonuclease activation and mRNA capping and provide starting points for antiviral drug development. Nucleic Acids Research 2023, 51, 475, DOI: 10.1093/nar/gkac1207[Crossref], [PubMed], [CAS], Google Scholar53https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB28rps1ahtg%253D%253D&md5=b3ad340352d17592df9e8d0fffa58af3Crystal structures and fragment screening of SARS-CoV-2 NSP14 reveal details of exoribonuclease activation and mRNA capping and provide starting points for antiviral drug developmentImprachim Nergis; Yosaatmadja Yuliana; Newman Joseph ANucleic acids research (2023), 51 (1), 475-487 ISSN:.NSP14 is a dual function enzyme containing an N-terminal exonuclease domain (ExoN) and C-terminal Guanine-N7-methyltransferase (N7-MTase) domain. Both activities are essential for the viral life cycle and may be targeted for anti-viral therapeutics. NSP14 forms a complex with NSP10, and this interaction enhances the nuclease but not the methyltransferase activity. We have determined the structure of SARS-CoV-2 NSP14 in the absence of NSP10 to 1.7 ÅA resolution. Comparisons with NSP14/NSP10 complexes reveal significant conformational changes that occur within the NSP14 ExoN domain upon binding of NSP10, including helix to coil transitions that facilitate the formation of the ExoN active site and provide an explanation of the stimulation of nuclease activity by NSP10. We have determined the structure of NSP14 in complex with cap analogue 7MeGpppG, and observe conformational changes within a SAM/SAH interacting loop that plays a key role in viral mRNA capping offering new insights into MTase activity. We perform an X-ray fragment screen on NSP14, revealing 72 hits bound to sites of inhibition in the ExoN and MTase domains. These fragments serve as excellent starting point tools for structure guided development of NSP14 inhibitors that may be used to treat COVID-19 and potentially other future viral threats.
- 55https://fragalysis.diamond.ac.uk/viewer/react/download/tag/01f41754-b2bb-4817-acc3-a5ebe820316d (accessed: 2022-10-14).Google ScholarThere is no corresponding record for this reference.
- 56https://fragalysis.diamond.ac.uk/viewer/react/download/tag/3df36d6b-3a5b-400d-97eb-af3b0e0df42d (accessed: 2022-10-14).Google ScholarThere is no corresponding record for this reference.
Cited By
This article has not yet been cited by other publications.
Abstract
Figure 1
Figure 1. An overall schema of the methods used to debias and test PointVS. We first thoroughly filter the testing and training sets (a) before benchmarking the performance of PointVS on the docking power and scoring power tests (b). We then use attribution to gain insights into important binding regions in the protein pocket (c), which we use for fragment elaboration (d).
Figure 2
Figure 2. Architecture and input format of PointVS. Each of the n atoms in the input to the model (left) is given a one-hot encoded feature vector with a single bit added to indicate whether the atom is from the ligand or the receptor, as well as a position . There are m edges, defined by the edge indices ei ∈ {0, ···, n – 1}2×m, which are the indices of connected atoms, and the corresponding edge attributes ea ∈ {0, 1}3×m. These are one-hot encodings representing ligand–ligand, ligand–protein, and protein–protein edges. There are skip connections between each of the EGNN layers, and the linear and global average final pooling (GAP) layers only act upon the node features f.
Figure 3
Figure 3. Top-N vs N pose ranking performance on the gninaSetPose set for Autodock Vina, PointVS, and gnina. Terms in brackets in the legend refer to the training sets; filtering the training set by protein and ligand similarity results in slightly degraded performance. The Top-1 values for PointVS trained on Redocked\gninaSetPose80 and gnina trained on Redocked are the same (68%), with PointVS trained on Redocked achieving 70%.
Figure 4
Figure 4. Three Tankyrase-2 inhibitors: 5C5P (a, d, g, j), 4J21 (b, e, h, k), and 4J22 (c, f, i, l). The top row shows the Protein–Ligand Interaction Profiler (PLIP) analysis of each structure, where dark blue lines are hydrogen bonds, dotted gray are hydrophobic interactions, dotted green are π–π interactions, solid green are halogen bonds, and lilac are water bridges. The second row shows the results of performing atom masking with gnina, with green representing a positive attribution score (identified as making a positive contribution to binding) and red, a negative score (making a negative contribution). The third row shows the results of bond masking with IGN, with lines between atoms showing the top five highest-scoring edges for each structure, with darker red representing higher scores. The bottom row shows the results of edge-attention attribution performed with PointVS, with the lines also showing the top five highest-scoring edges, and darker pink representing higher scores.
Figure 5
Figure 5. Donor and acceptor hotspot maps in the binding pocket of Mpro, colored purple and orange, respectively, and numbered according to their rank. The hotspots on the left (a) were obtained with the Hotspots API, the hotspots in the center (b) were obtained with PointVS by performing attribution on 152 structures of bound fragments, and the hotspots on the right (c) were obtained by extracting the most common protein atoms identified by PLIP as involved in hydrogen bonds with ligand atoms across the 152 bound structures (in this case identified over five times).
Figure 6
Figure 6. Top five scoring hotspot maps in the binding pocket of Mpro found by performing attribution on 40 sets of randomly sampled sets of 10 (a), 30 (b), and 80 (c) bound structures. Both donor and acceptor hotspots are shown in red. The spheres representing the hotspots are transparent and overlaid, so the more opaque a hotspot appears, the more times it was ranked in the top five.
References
ARTICLE SECTIONSThis article references 56 other publications.
- 1Barnett, E.; Onete, D.; Salekin, A.; Faraone, S. V. Genomic Machine Learning Meta-regression: Insights on Associations of Study Features with Reported Model Performance. medRxiv 2022, DOI: 10.1101/2022.01.10.22268751
- 2Shim, M.; Lee, S.-H.; Hwang, H.-J. Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selection. Sci. Rep. 2021, 11, 7980, DOI: 10.1038/s41598-021-87157-3[Crossref], [PubMed], [CAS], Google Scholar2https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXptFOlsLg%253D&md5=a43ca92128b177b8a67e2437fbe5c556Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selectionShim, Miseon; Lee, Seung-Hwan; Hwang, Han-JeongScientific Reports (2021), 11 (1), 7980CODEN: SRCEC3; ISSN:2045-2322. (Nature Research)In recent years, machine learning techniques have been frequently applied to uncovering neuropsychiatric biomarkers with the aim of accurately diagnosing neuropsychiatric diseases and predicting treatment prognosis. However, many studies did not perform cross validation (CV) when using machine learning techniques, or others performed CV in an incorrect manner, leading to significantly biased results due to overfitting problem. The aim of this study is to investigate the impact of CV on the prediction performance of neuropsychiatric biomarkers, in particular, for feature selection performed with high-dimensional features. To this end, we evaluated prediction performances using both simulation data and actual electroencephalog. (EEG) data. The overall prediction accuracies of the feature selection method performed outside of CV were considerably higher than those of the feature selection method performed within CV for both the simulation and actual EEG data. The differences between the prediction accuracies of the two feature selection approaches can be thought of as the amt. of overfitting due to selection bias. Our results indicate the importance of correctly using CV to avoid biased results of prediction performance of neuropsychiatric biomarkers.
- 3Roberts, M. Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans. Nat. Mach. Intell. 2021, 3, 199– 217, DOI: 10.1038/s42256-021-00307-0
- 4Tu, F.; Zhu, J.; Zheng, Q.; Zhou, M. Be Careful of When: An Empirical Study on Time-Related Misuse of Issue Tracking Data. In Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering , New York, NY, USA, 2018; p 307– 318.
- 5Sieg, J.; Flachsenberg, F.; Rarey, M. In Need of Bias Control: Evaluating Chemical Data for Machine Learning in Structure-Based Virtual Screening. J. Chem. Inf. Model. 2019, 59, 947– 961, DOI: 10.1021/acs.jcim.8b00712[ACS Full Text
], [CAS], Google Scholar
5https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXktVOgsrY%253D&md5=6cb48f2acd541c10c8565f202a6eb9e8In Need of Bias Control: Evaluating Chemical Data for Machine Learning in Structure-Based Virtual ScreeningSieg, Jochen; Flachsenberg, Florian; Rarey, MatthiasJournal of Chemical Information and Modeling (2019), 59 (3), 947-961CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)A review. Reports of successful applications of machine learning (ML) methods in structure-based virtual screening (SBVS) are increasing. ML methods such as convolutional neural networks show promising results and often outperform traditional methods such as empirical scoring functions in retrospective validation. However, trained ML models are often treated as black boxes and are not straightforwardly interpretable. In most cases, it is unknown which features in the data are decisive and whether a model's predictions are right for the right reason. Hence, the authors reevaluated three widely used benchmark data sets in the context of ML methods and came to the conclusion that not every benchmark data set is suitable. Moreover, the authors demonstrate on two examples from current literature that bias is learned implicitly and unnoticed from std. benchmarks. On the basis of these results, the authors conclude that there is a need for eligible validation expts. and benchmark data sets suited to ML for more bias-controlled validation in ML-based SBVS. Therefore, the authors provide guidelines for setting up validation expts. and give a perspective on how new data sets could be generated. - 6Chen, L.; Cruz, A.; Ramsey, S.; Dickson, C. J.; Duca, J. S.; Hornak, V.; Koes, D. R.; Kurtzman, T. Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screening. PLoS One 2019, 14, e0220113 DOI: 10.1371/journal.pone.0220113[Crossref], [PubMed], [CAS], Google Scholar6https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhvFeks7rL&md5=637472aa275cbacf3e926983f3795726Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screeningChen, Lieyang; Cruz, Anthony; Ramsey, Steven; Dickson, Callum J.; Duca, Jose S.; Hornak, Viktor; Koes, David R.; Kurtzman, TomPLoS One (2019), 14 (8), e0220113CODEN: POLNCL; ISSN:1932-6203. (Public Library of Science)Recently much effort has been invested in using convolutional neural network (CNN) models trained on 3D structural images of protein-ligand complexes to distinguish binding from non-binding ligands for virtual screening. However, the dearth of reliable protein-ligand x-ray structures and binding affinity data has required the use of constructed datasets for the training and evaluation of CNN mol. recognition models. Here, we outline various sources of bias in one such widely-used dataset, the Directory of Useful Decoys: Enhanced (DUD-E). We have constructed and performed tests to investigate whether CNN models developed using DUD-E are properly learning the underlying physics of mol. recognition, as intended, or are instead learning biases inherent in the dataset itself. We find that superior enrichment efficiency in CNN models can be attributed to the analog and decoy bias hidden in the DUD-E dataset rather than successful generalization of the pattern of protein-ligand interactions. Comparing addnl. deep learning models trained on PDBbind datasets, we found that their enrichment performances using DUD-E are not superior to the performance of the docking program AutoDock Vina. Together, these results suggest that biases that could be present in constructed datasets should be thoroughly evaluated before applying them to machine learning based methodol. development.
- 7Wallach, I.; Heifets, A. Most Ligand-Based Classification Benchmarks Reward Memorization Rather than Generalization. J. Chem. Inf. Model. 2018, 58, 916– 932, DOI: 10.1021/acs.jcim.7b00403[ACS Full Text
], [CAS], Google Scholar
7https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXotlCgs7s%253D&md5=a59fb9ba04b1e6b37cb83594625081ffMost Ligand-Based Classification Benchmarks Reward Memorization Rather than GeneralizationWallach, Izhar; Heifets, AbrahamJournal of Chemical Information and Modeling (2018), 58 (5), 916-932CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Undetected overfitting can occur when there are significant redundancies between training and validation data. The authors describe AVE, a new measure of training-validation redundancy for ligand-based classification problems, that accounts for the similarity among inactive mols. as well as active ones. The authors investigated seven widely used benchmarks for virtual screening and classification, and the authors show that the amt. of AVE bias strongly correlates with the performance of ligand-based predictive methods irresp. of the predicted property, chem. fingerprint, similarity measure, or previously applied unbiasing techniques. Therefore, it may be the case that the previously reported performance of most ligand-based methods can be explained by overfitting to benchmarks rather than good prospective accuracy. - 8Wouters, O. J.; McKee, M.; Luyten, J. Estimated Research and Development Investment Needed to Bring a New Medicine to Market, 2009–2018. JAMA, J. Am. Med. Assoc. 2020, 323, 844– 853, DOI: 10.1001/jama.2020.1166[Crossref], [PubMed], [CAS], Google Scholar8https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB387oslyhsA%253D%253D&md5=ff63c2f03e59ef0223585a71fb6d5df0Estimated Research and Development Investment Needed to Bring a New Medicine to Market, 2009-2018Wouters Olivier J; McKee Martin; Luyten JeroenJAMA (2020), 323 (9), 844-853 ISSN:.IMPORTANCE: The mean cost of developing a new drug has been the subject of debate, with recent estimates ranging from $314 million to $2.8 billion. OBJECTIVE: To estimate the research and development investment required to bring a new therapeutic agent to market, using publicly available data. DESIGN AND SETTING: Data were analyzed on new therapeutic agents approved by the US Food and Drug Administration (FDA) between 2009 and 2018 to estimate the research and development expenditure required to bring a new medicine to market. Data were accessed from the US Securities and Exchange Commission, Drugs@FDA database, and ClinicalTrials.gov, alongside published data on clinical trial success rates. EXPOSURES: Conduct of preclinical and clinical studies of new therapeutic agents. MAIN OUTCOMES AND MEASURES: Median and mean research and development spending on new therapeutic agents approved by the FDA, capitalized at a real cost of capital rate (the required rate of return for an investor) of 10.5% per year, with bootstrapped CIs. All amounts were reported in 2018 US dollars. RESULTS: The FDA approved 355 new drugs and biologics over the study period. Research and development expenditures were available for 63 (18%) products, developed by 47 different companies. After accounting for the costs of failed trials, the median capitalized research and development investment to bring a new drug to market was estimated at $985.3 million (95% CI, $683.6 million-$1228.9 million), and the mean investment was estimated at $1335.9 million (95% CI, $1042.5 million-$1637.5 million) in the base case analysis. Median estimates by therapeutic area (for areas with ≥5 drugs) ranged from $765.9 million (95% CI, $323.0 million-$1473.5 million) for nervous system agents to $2771.6 million (95% CI, $2051.8 million-$5366.2 million) for antineoplastic and immunomodulating agents. Data were mainly accessible for smaller firms, orphan drugs, products in certain therapeutic areas, first-in-class drugs, therapeutic agents that received accelerated approval, and products approved between 2014 and 2018. Results varied in sensitivity analyses using different estimates of clinical trial success rates, preclinical expenditures, and cost of capital. CONCLUSIONS AND RELEVANCE: This study provides an estimate of research and development costs for new therapeutic agents based on publicly available data. Differences from previous studies may reflect the spectrum of products analyzed, the restricted availability of data in the public domain, and differences in underlying assumptions in the cost calculations.
- 9Morris, G. M.; Huey, R.; Lindstrom, W.; Sanner, M. F.; Belew, R. K.; Goodsell, D. S.; Olson, A. J. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. 2009, 30, 2785– 2791, DOI: 10.1002/jcc.21256[Crossref], [PubMed], [CAS], Google Scholar9https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD1MXht1GitrnK&md5=679ce22fc50e9291c9aa16e7a1855845AutoDock and AutoDockTools: Automated docking with selective receptor flexibilityMorris, Garrett M.; Huey, Ruth; Lindstrom, William; Sanner, Michel F.; Belew, Richard K.; Goodsell, David S.; Olson, Arthur J.Journal of Computational Chemistry (2009), 30 (16), 2785-2791CODEN: JCCHDD; ISSN:0192-8651. (John Wiley & Sons, Inc.)We describe the testing and release of AutoDock4 and the accompanying graphical user interface AutoDockTools. AutoDock4 incorporates limited flexibility in the receptor. Several tests are reported here, including a redocking expt. with 188 diverse ligand-protein complexes and a cross-docking expt. using flexible sidechains in 87 HIV protease complexes. We also report its utility in anal. of covalently bound ligands, using both a grid-based docking method and a modification of the flexible sidechain technique. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2009.
- 10Verdonk, M. L.; Cole, J. C.; Hartshorn, M. J.; Murray, C. W.; Taylor, R. D. Improved protein–ligand docking using GOLD. Proteins: Struct., Funct., Bioinf. 2003, 52, 609– 623, DOI: 10.1002/prot.10465[Crossref], [PubMed], [CAS], Google Scholar10https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BD3sXmvFGrsLg%253D&md5=73a627d99dab6de1ae364c8e8e5f0feaImproved protein-ligand docking using GOLDVerdonk, Marcel L.; Cole, Jason C.; Hartshorn, Michael J.; Murray, Christopher W.; Taylor, Richard D.Proteins: Structure, Function, and Genetics (2003), 52 (4), 609-623CODEN: PSFGEY; ISSN:0887-3585. (Wiley-Liss, Inc.)The Chemscore function was implemented as a scoring function for the protein-ligand docking program GOLD, and its performance compared to the original Goldscore function and two consensus docking protocols, "Goldscore-CS" and "Chemscore-GS," in terms of docking accuracy, prediction of binding affinities, and speed. In the "Goldscore-CS" protocol, dockings produced with the Goldscore function are scored and ranked with the Chemscore function; in the "Chemscore-GS" protocol, dockings produced with the Chemscore function are scored and ranked with the Goldscore function. Comparisons were made for a "clean" set of 224 protein-ligand complexes, and for two subsets of this set, one for which the ligands are "drug-like," the other for which they are "fragment-like.". For "drug-like" and "fragment-like" ligands, the docking accuracies obtained with Chemscore and Goldscore functions are similar. For larger ligands, Goldscore gives superior results. Docking with the Chemscore function is up to three times faster than docking with the Goldscore function. Both combined docking protocols give significant improvements in docking accuracy over the use of the Goldscore or Chemscore function alone. "Goldscore-CS" gives success rates of up to 81% (top-ranked GOLD soln. within 2.0 Å of the exptl. binding mode) for the "clean list," but at the cost of long search times. For most virtual screening applications, "Chemscore-GS" seems optimal; search settings that give docking speeds of around 0.25-1.3 min/compd. have success rates of about 78% for "drug-like" compds. and 85% for "fragment-like" compds. In terms of producing binding energy ests., the Goldscore function appears to perform better than the Chemscore function and the two consensus protocols, particularly for faster search settings. Even at docking speeds of around 1-2 min/compd., the Goldscore function predicts binding energies with a std. deviation of ∼10.5 kJ/mol.
- 11Hornik, K.; Stinchcombe, M.; White, H. Multilayer feedforward networks are universal approximators. Neural Netw 1989, 2, 359– 366, DOI: 10.1016/0893-6080(89)90020-8
- 12Shen, C.; Ding, J.; Wang, Z.; Cao, D.; Ding, X.; Hou, T. From machine learning to deep learning: Advances in scoring functions for protein–ligand docking. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2020, 10, e1429 DOI: 10.1002/wcms.1429[Crossref], [CAS], Google Scholar12https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXitlyjsbbN&md5=0dd878e211fad64fada03e883a5ab47aFrom machine learning to deep learning: Advances in scoring functions for protein-ligand dockingShen, Chao; Ding, Junjie; Wang, Zhe; Cao, Dongsheng; Ding, Xiaoqin; Hou, TingjunWiley Interdisciplinary Reviews: Computational Molecular Science (2020), 10 (1), e1429CODEN: WIRCAH; ISSN:1759-0884. (Wiley-Blackwell)A review. Mol. docking has been regarded as a routine tool for drug discovery, but its accuracy highly depends on the reliability of scoring functions (SFs). With the rapid development of machine learning (ML) techniques, ML-based SFs have gradually emerged as a promising alternative for protein-ligand binding affinity prediction and virtual screening, and most of them have shown significantly better performance than a wide range of classical SFs. Emergence of more data-hungry deep learning (DL) approaches in recent years further fascinates the exploitation of more accurate SFs. Here, we summarize the progress of traditional ML-based SFs in the last few years and provide insights into recently developed DL-based SFs. We believe that the continuous improvement in ML-based SFs can surely guide the early-stage drug design and accelerate the discovery of new drugs. This article is categorized under: Computer and Information Science > Chemoinformatics.
- 13Jones, D.; Kim, H.; Zhang, X.; Zemla, A.; Stevenson, G.; Bennett, W. F. D.; Kirshner, D.; Wong, S. E.; Lightstone, F. C.; Allen, J. E. Improved Protein–Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference. J. Chem. Inf. Model. 2021, 61, 1583– 1592, DOI: 10.1021/acs.jcim.0c01306[ACS Full Text
], [CAS], Google Scholar
13https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXmvFaqtrc%253D&md5=11f92ba4bae7dabc75b9e55c1911352bImproved Protein-Ligand Binding Affinity Prediction with Structure-Based Deep Fusion InferenceJones, Derek; Kim, Hyojin; Zhang, Xiaohua; Zemla, Adam; Stevenson, Garrett; Bennett, W. F. Drew; Kirshner, Daniel; Wong, Sergio E.; Lightstone, Felice C.; Allen, Jonathan E.Journal of Chemical Information and Modeling (2021), 61 (4), 1583-1592CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Predicting accurate protein-ligand binding affinities is an important task in drug discovery but remains a challenge even with computationally expensive biophysics-based energy scoring methods and state-of-the-art deep learning approaches. Despite the recent advances in the application of deep convolutional and graph neural network-based approaches, it remains unclear what the relative advantages of each approach are and how they compare with physics-based methodologies that have found more mainstream success in virtual screening pipelines. We present fusion models that combine features and inference from complementary representations to improve binding affinity prediction. This, to our knowledge, is the first comprehensive study that uses a common series of evaluations to directly compare the performance of three-dimensional (3D)-convolutional neural networks (3D-CNNs), spatial graph neural networks (SG-CNNs), and their fusion. We use temporal and structure-based splits to assess performance on novel protein targets. To test the practical applicability of our models, we examine their performance in cases that assume that the crystal structure is not available. In these cases, binding free energies are predicted using docking pose coordinates as the inputs to each model. In addn., we compare these deep learning approaches to predictions based on docking scores and mol. mechanic/generalized Born surface area (MM/GBSA) calcns. Our results show that the fusion models make more accurate predictions than their constituent neural network models as well as docking scoring and MM/GBSA rescoring, with the benefit of greater computational efficiency than the MM/GBSA method. Finally, we provide the code to reproduce our results and the parameter files of the trained models used in this work. The software is available as open source at https://github.com/llnl/fast. Model parameter files are available at ftp://gdo-bioinformatics.ucllnl.org/fast/pdbbind2016_model_checkpoints/. - 14Chenthamarakshan, V.; Das, P.; Hoffman, S.; Strobelt, H.; Padhi, I.; Lim, K. W.; Hoover, B.; Manica, M.; Born, J.; Laino, T.; Mojsilovic, A. CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models. In 34th Conference on Neural Information Processing Systems , 2020; pp 4320– 4332.Google ScholarThere is no corresponding record for this reference.
- 15Hoffman, S. C.; Chenthamarakshan, V.; Wadhawan, K.; Chen, P.-Y.; Das, P. Optimizing molecules using efficient queries from property evaluations. Nat. Mach. Intell. 2022, 4, 21, DOI: 10.1038/s42256-021-00422-y
- 16Boyles, F.; Deane, C. M.; Morris, G. M. Learning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained on Docked Poses. J. Chem. Inf. Model. 2022, 62, 5329– 5341, DOI: 10.1021/acs.jcim.1c00096[ACS Full Text
], [CAS], Google Scholar
16https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXhvFWms7rN&md5=2c539500f49ad27ff6682ff11e9360ebLearning from Docked Ligands: Ligand-Based Features Rescue Structure-Based Scoring Functions When Trained on Docked PosesBoyles, Fergus; Deane, Charlotte M.; Morris, Garrett M.Journal of Chemical Information and Modeling (2022), 62 (22), 5329-5341CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Machine learning scoring functions for protein-ligand binding affinity have been found to consistently outperform classical scoring functions when trained and tested on crystal structures of bound protein-ligand complexes. However, it is less clear how these methods perform when applied to docked poses of complexes. We explore how the use of docked rather than crystallog. poses for both training and testing affects the performance of machine learning scoring functions. Using the PDBbind Core Sets as benchmarks, we show that the performance of a structure-based machine learning scoring function trained and tested on docked poses is lower than that of the same scoring function trained and tested on crystallog. poses. We construct a hybrid scoring function by combining both structure-based and ligand-based features, and show that its ability to predict binding affinity using docked poses is comparable to that of purely structure-based scoring functions trained and tested on crystal poses. We also present a new, freely available validation set-the Updated DUD-E Diverse Subset-for binding affinity prediction using data from DUD-E and ChEMBL. Despite strong performance on docked poses of the PDBbind Core Sets, we find that our hybrid scoring function sometimes generalizes poorly to a protein target not represented in the training set, demonstrating the need for improved scoring functions and addnl. validation benchmarks. - 17Su, M.; Yang, Q.; Du, Y.; Feng, G.; Liu, Z.; Li, Y.; Wang, R. Comparative Assessment of Scoring Functions: The CASF-2016 Update. J. Chem. Inf. Model. 2019, 59, 895– 913, DOI: 10.1021/acs.jcim.8b00545[ACS Full Text
], [CAS], Google Scholar
17https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXitlWhtLjM&md5=573295cfd2298a2dee933027041c3d9cComparative Assessment of Scoring Functions: The CASF-2016 UpdateSu, Minyi; Yang, Qifan; Du, Yu; Feng, Guoqin; Liu, Zhihai; Li, Yan; Wang, RenxiaoJournal of Chemical Information and Modeling (2019), 59 (2), 895-913CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)In structure-based drug design, scoring functions are often employed to evaluate protein-ligand interactions. A variety of scoring functions have been developed so far, and thus, some objective benchmarks are desired for assessing their strength and weakness. The comparative assessment of scoring functions (CASF) benchmark developed by us provides an answer to this demand. CASF is designed as a "scoring benchmark", where the scoring process is decoupled from the docking process to depict the performance of scoring function more precisely. Here, we describe the latest update of this benchmark, i.e., CASF-2016. Each scoring function is still evaluated by four metrics, including "scoring power", "ranking power", "docking power", and "screening power". Nevertheless, the evaluation methods have been improved considerably in several aspects. A new test set is compiled, which consists of 285 protein-ligand complexes with high-quality crystal structures and reliable binding consts. A panel of 25 scoring functions are tested on CASF-2016 as a demonstration. Our results reveal that the performance of current scoring functions is more promising in terms of docking power than scoring, ranking, and screening power. Scoring power is somewhat correlated with ranking power, so are docking power and screening power. The results obtained on CASF-2016 may provide valuable guidance for the end users to make smart choices among available scoring functions. Moreover, CASF is created as an open-access benchmark so that other researchers can utilize it to test a wider range of scoring functions. The complete CASF-2016 benchmark will be released on the PDBbind-CN web server (http://www.pdbbind-cn.org/casf.asp/) once this article is published. - 18Liu, Z.; Su, M.; Han, L.; Liu, J.; Yang, Q.; Li, Y.; Wang, R. Forging the Basis for Developing Protein–Ligand Interaction Scoring Functions. Acc. Chem. Res. 2017, 50, 302– 309, DOI: 10.1021/acs.accounts.6b00491[ACS Full Text
], [CAS], Google Scholar
18https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXit12qsLc%253D&md5=48d7287ef58c3ed8d9ff7579014fc486Forging the Basis for Developing Protein-Ligand Interaction Scoring FunctionsLiu, Zhihai; Su, Minyi; Han, Li; Liu, Jie; Yang, Qifan; Li, Yan; Wang, RenxiaoAccounts of Chemical Research (2017), 50 (2), 302-309CODEN: ACHRE4; ISSN:0001-4842. (American Chemical Society)A review. In structure-based drug design, scoring functions are widely used for fast evaluation of protein-ligand interactions. They are often applied in combination with mol. docking and de novo design methods. Since the early 1990s, a whole spectrum of protein-ligand interaction scoring functions have been developed. Regardless of their tech. difference, scoring functions all need data sets combining protein-ligand complex structures and binding affinity data for parametrization and validation. However, data sets of this kind used to be rather limited in terms of size and quality. However, std. metrics for evaluating scoring function used to be ambiguous. Scoring functions are often tested in mol. docking or even virtual screening trials, which are not totally appropriate for reflecting the genuine quality of scoring functions. Collectively, these underlying obstacles have impeded the invention of more advanced scoring functions. In this Account, the authors describe their long-lasting efforts on overcoming these obstacles, which involve two related projects. On the first project, the authors have created the PDBbind database. It is the first database that systematically annotates the protein-ligand complexes in the Protein Data Bank (PDB) with exptl. binding data. This database has been updated annually since its first public release in 2004. The latest release (version 2016) provides binding data for 16,179 biomol. complexes in PDB. Data sets provided by PDBbind have been applied to many computational and statistical studies on protein-ligand interaction and various subjects. In particular, it has become a major data resource for scoring function development. On the second project, the authors established the Comparative Assessment of Scoring Functions (CASF) benchmark for scoring function evaluation. The authors' key idea is to decouple the "scoring" process from the "sampling" process, so scoring functions can be tested in a relatively pure context to reflect their quality. In the authors' latest work on this track, i.e., CASF-2013, the performance of a scoring function was quantified in four aspects, including "scoring power", "ranking power", "docking power", and "screening power". All four performance tests were conducted on a test set contg. 195 high-quality protein-ligand complexes selected from PDBbind. A panel of 20 std. scoring functions were tested as demonstration. Importantly, CASF is designed to be an open-access benchmark, with which scoring functions developed by different researchers can be compared on the same ground. Indeed, it has become a popular choice for scoring function validation in recent years. Despite the considerable progress that has been made so far, the performance of today's scoring functions still does not meet people's expectation in many aspects. There is a const. demand for more advanced scoring functions. The authors' efforts have helped to overcome some obstacles underlying scoring function development so that the researchers in this field can move forward faster. The authors will continue to improve the PDBbind database and the CASF benchmark in the future to keep them as useful community resources. - 19Yosinski, J.; Clune, J.; Nguyen, A.; Fuchs, T.; Lipson, H. Understanding neural networks through deep visualization. arXiv 2015, arXiv:1506.06579, DOI: 10.48550/arXiv.1506.06579
- 20Samek, W.; Binder, A.; Montavon, G.; Bach, S.; Müller, K.-R. Evaluating the visualization of what a Deep Neural Network has learned. arXiv 2015, arXiv:1509.06321, DOI: 10.48550/arXiv.1509.06321
- 21Poelking, C.; Chessari, G.; Murray, C. W.; Hall, R. J.; Colwell, L.; Verdonk, M. Meaningful machine learning models and machine-learned pharmacophores from fragment screening campaigns. arXiv 2022, arXiv:2204.06348, DOI: 10.48550/arXiv.2204.06348
- 22Sharma, B.; Chenthamarakshan, V.; Dhurandhar, A.; Pereira, S.; Hendler, J. A.; Dordick, J. S.; Das, P. Accurate Clinical Toxicity Prediction using Multi-task Deep Neural Nets and Contrastive Molecular Explanations. Sci. Rep. 2023, 13, 4908, DOI: 10.1038/s41598-023-31169-8[Crossref], [PubMed], [CAS], Google Scholar22https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3sXmsV2ks7g%253D&md5=63f1d0798d9ba9b71f3d970b58e752eaAccurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanationsSharma, Bhanushee; Chenthamarakshan, Vijil; Dhurandhar, Amit; Pereira, Shiranee; Hendler, James A.; Dordick, Jonathan S.; Das, PayelScientific Reports (2023), 13 (1), 4908CODEN: SRCEC3; ISSN:2045-2322. (Nature Portfolio)Explainable machine learning for mol. toxicity prediction is a promising approach for efficient drug development and chem. safety. A predictive ML model of toxicity can reduce exptl. cost and time while mitigating ethical concerns by significantly reducing animal and clin. testing. Herein, we use a deep learning framework for simultaneously modeling in vitro, in vivo, and clin. toxicity data. Two different mol. input representations are used; Morgan fingerprints and pre-trained SMILES embeddings. A multi-task deep learning model accurately predicts toxicity for all endpoints, including clin., as indicated by the area under the Receiver Operator Characteristic curve and balanced accuracy. In particular, pre-trained mol. SMILES embeddings as input to the multi-task model improved clin. toxicity predictions compared to existing models in MoleculeNet benchmark. Addnl., our multitask approach is comprehensive in the sense that it is comparable to state-of-the-art approaches for specific endpoints in in vitro, in vivo and clin. platforms. Through both the multi-task model and transfer learning, we were able to indicate the minimal need of in vivo data for clin. toxicity predictions. To provide confidence and explain the model's predictions, we adapt a post-hoc contrastive explanation method that returns pertinent pos. and neg. features, which correspond well to known mutagenic and reactive toxicophores, such as unsubstituted bonded heteroatoms, arom. amines, and Michael receptors. Furthermore, toxicophore recovery by pertinent feature anal. captures more of the in vitro (53%) and in vivo (56%), rather than of the clin. (8%), endpoints, and indeed uncovers a preference in known toxicophore data towards in vitro and in vivo exptl. data. To our knowledge, this is the first contrastive explanation, using both present and absent substructures, for predictions of clin. and in vivo mol. toxicity.
- 23Hadfield, T. E.; Scantlebury, J.; Deane, C. M. Exploring The Ability Of Machine Learning-Based Virtual Screening Models To Identify The Functional Groups Responsible For Binding. bioRxiv 2023, DOI: 10.1101/2023.04.29.538820
- 24McNutt, A. T.; Francoeur, P.; Aggarwal, R.; Masuda, T.; Meli, R.; Ragoza, M.; Sunseri, J.; Koes, D. R. GNINA 1.0: molecular docking with deep learning. J. Cheminf. 2021, 13, 43, DOI: 10.1186/s13321-021-00522-2
- 25Zheng, L.; Fan, J.; Mu, Y. OnionNet: a Multiple-Layer Intermolecular-Contact-Based Convolutional Neural Network for Protein–Ligand Binding Affinity Prediction. ACS Omega 2019, 4, 15956– 15965, DOI: 10.1021/acsomega.9b01997[ACS Full Text
], [CAS], Google Scholar
24https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhslKhtb3J&md5=0eaf1406acc14b7052d62481de5dec78OnionNet: a Multiple-Layer Intermolecular-Contact-Based Convolutional Neural Network for Protein-Ligand Binding Affinity PredictionZheng, Liangzhen; Fan, Jingrong; Mu, YuguangACS Omega (2019), 4 (14), 15956-15965CODEN: ACSODF; ISSN:2470-1343. (American Chemical Society)Computational drug discovery provides an efficient tool for helping large-scale lead mol. screening. One of the major tasks of lead discovery is identifying mols. with promising binding affinities toward a target, a protein in general. The accuracies of current scoring functions that are used to predict the binding affinity are not satisfactory enough. Thus, machine learning or deep learning based methods have been developed recently to improve the scoring functions. In this study, a deep convolutional neural network model (called OnionNet) is introduced; its features are based on rotation-free element-pair-specific contacts between ligands and protein atoms, and the contacts are further grouped into different distance ranges to cover both the local and nonlocal interaction information between the ligand and the protein. The prediction power of the model is evaluated and compared with other scoring functions using the comparative assessment of scoring functions (CASF-2013) benchmark and the v2016 core set of the PDBbind database. The robustness of the model is further explored by predicting the binding affinities of the complexes generated from docking simulations instead of exptl. detd. PDB structures. - 26Li, Y.; Rezaei, M. A.; Li, C.; Li, X. DeepAtom: A Framework for Protein-Ligand Binding Affinity Prediction. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) ; 2019; pp 303– 310.
- 27Feinberg, E. N.; Sur, D.; Wu, Z.; Husic, B. E.; Mai, H.; Li, Y.; Sun, S.; Yang, J.; Ramsundar, B.; Pande, V. S. PotentialNet for molecular property prediction. ACS Cent. Sci. 2018, 4, 1520– 1530, DOI: 10.1021/acscentsci.8b00507[ACS Full Text
], [CAS], Google Scholar
26https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXitVGrs7fF&md5=570999e7f212d7c737b24f71665893b7PotentialNet for Molecular Property PredictionFeinberg, Evan N.; Sur, Debnil; Wu, Zhenqin; Husic, Brooke E.; Mai, Huanghao; Li, Yang; Sun, Saisai; Yang, Jianyi; Ramsundar, Bharath; Pande, Vijay S.ACS Central Science (2018), 4 (11), 1520-1530CODEN: ACSCII; ISSN:2374-7951. (American Chemical Society)The arc of drug discovery entails a multiparameter optimization problem spanning vast length scales. The key parameters range from soly. (angstroms) to protein-ligand binding (nanometers) to in vivo toxicity (meters). Through feature learning-instead of feature engineering-deep neural networks promise to outperform both traditional physics-based and knowledge-based machine learning models for predicting mol. properties pertinent to drug discovery. To this end, we present the PotentialNet family of graph convolutions. These models are specifically designed for and achieve state-of-the-art performance for protein-ligand binding affinity. We further validate these deep neural networks by setting new stds. of performance in several ligand-based tasks. In parallel, we introduce a new metric, the Regression Enrichment Factor EF(R)χ, to measure the early enrichment of computational models for chem. data. Finally, we introduce a cross-validation strategy based on structural homol. clustering that can more accurately measure model generalizability, which crucially distinguishes the aims of machine learning for drug discovery from std. machine learning tasks. - 28Morrone, J. A.; Weber, J. K.; Huynh, T.; Luo, H.; Cornell, W. D. Combining Docking Pose Rank and Structure with Deep Learning Improves Protein–Ligand Binding Mode Prediction over a Baseline Docking Approach. J. Chem. Inf. Model. 2020, 60, 4170– 4179, DOI: 10.1021/acs.jcim.9b00927[ACS Full Text
], [CAS], Google Scholar
27https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXjsVOgtbs%253D&md5=ed7f0ba1797757a61d64c817db36cc13Combining Docking Pose Rank and Structure with Deep Learning Improves Protein-Ligand Binding Mode Prediction over a Baseline Docking ApproachMorrone, Joseph A.; Weber, Jeffrey K.; Huynh, Tien; Luo, Heng; Cornell, Wendy D.Journal of Chemical Information and Modeling (2020), 60 (9), 4170-4179CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)The authors present a simple, modular graph-based convolutional neural network that takes structural information from protein-ligand complexes as input to generate models for activity and binding mode prediction. Complex structures are generated by a std. docking procedure and fed into a dual-graph architecture that includes sep. subnetworks for the ligand bonded topol. and the ligand-protein contact map. Recent work has indicated that data set bias drives many past promising results derived from combining deep learning and docking. The dual-graph network allows contributions from ligand identity that give rise to such biases to be distinguished from effects of protein-ligand interactions on classification. The authors show that the neural network is capable of learning from protein structural information when, as in the case of binding mode prediction, an unbiased data set is constructed. The authors next develop a deep learning model for binding mode prediction that uses docking ranking as input in combination with docking structures. This strategy mirrors past consensus models and outperforms a baseline docking program (AutoDock Vina) in a variety of tests, including on cross-docking data sets that mimic real-world docking use cases. Furthermore, the magnitudes of network predictions serve as reliable measures of model confidence. - 29Jiang, D.; Hsieh, C.-Y.; Wu, Z.; Kang, Y.; Wang, J.; Wang, E.; Liao, B.; Shen, C.; Xu, L.; Wu, J.; Cao, D.; Hou, T. InteractionGraphNet: A Novel and Efficient Deep Graph Representation Learning Framework for Accurate Protein–Ligand Interaction Predictions. J. Med. Chem. 2021, 64, 18209– 18232, DOI: 10.1021/acs.jmedchem.1c01830[ACS Full Text
], [CAS], Google Scholar
28https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3MXislSnsrnE&md5=bbee6eb73a8c6b95c389fe11e45c7723InteractionGraphNet: A Novel and Efficient Deep Graph Representation Learning Framework for Accurate Protein-Ligand Interaction PredictionsJiang, Dejun; Hsieh, Chang-Yu; Wu, Zhenxing; Kang, Yu; Wang, Jike; Wang, Ercheng; Liao, Ben; Shen, Chao; Xu, Lei; Wu, Jian; Cao, Dongsheng; Hou, TingjunJournal of Medicinal Chemistry (2021), 64 (24), 18209-18232CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)Accurate quantification of protein-ligand interactions remains a key challenge to structure-based drug design. However, traditional machine learning (ML)-based methods based on handcrafted descriptors, one-dimensional protein sequences, and/or two-dimensional graph representations limit their capability to learn the generalized mol. interactions in 3D space. Here, we proposed a novel deep graph representation learning framework named InteractionGraphNet (IGN) to learn the protein-ligand interactions from the 3D structures of protein-ligand complexes. In IGN, two independent graph convolution modules were stacked to sequentially learn the intramol. and intermol. interactions, and the learned intermol. interactions can be efficiently used for subsequent tasks. Extensive binding affinity prediction, large-scale structure-based virtual screening, and pose prediction expts. demonstrated that IGN achieved better or competitive performance against other state-of-the-art ML-based baselines and docking programs. More importantly, such state-of-the-art performance was proven from the successful learning of the key features in protein-ligand interactions instead of just memorizing certain biased patterns from data. - 30Hochuli, J.; Helbling, A.; Skaist, T.; Ragoza, M.; Koes, D. R. Visualizing convolutional neural network protein-ligand scoring. J. Mol. Graphics Modell. 2018, 84, 96– 108, DOI: 10.1016/j.jmgm.2018.06.005[Crossref], [PubMed], [CAS], Google Scholar29https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1cXhtF2ltb3P&md5=db697d859132f02c4a2cef4bfc00ea88Visualizing convolutional neural network protein-ligand scoringHochuli, Joshua; Helbling, Alec; Skaist, Tamar; Ragoza, Matthew; Koes, David RyanJournal of Molecular Graphics & Modelling (2018), 84 (), 96-108CODEN: JMGMFI; ISSN:1093-3263. (Elsevier Ltd.)Protein-ligand scoring is an important step in a structure-based drug design pipeline. Selecting a correct binding pose and predicting the binding affinity of a protein-ligand complex enables effective virtual screening. Machine learning techniques can make use of the increasing amts. of structural data that are becoming publicly available. Convolutional neural network (CNN) scoring functions in particular have shown promise in pose selection and affinity prediction for protein-ligand complexes. Neural networks are known for being difficult to interpret. Understanding the decisions of a particular network can help tune parameters and training data to maximize performance. Visualization of neural networks helps decomp. complex scoring functions into pictures that are more easily parsed by humans. Here we present three methods for visualizing how individual protein-ligand complexes are interpreted by 3D convolutional neural networks. We also present a visualization of the convolutional filters and their wts. We describe how the intuition provided by these visualizations aids in network design.
- 31Lim, J.; Ryu, S.; Park, K.; Choe, Y. J.; Ham, J.; Kim, W. Y. Predicting Drug–Target Interaction Using a Novel Graph Neural Network with 3D Structure-Embedded Graph Representation. J. Chem. Inf. Model. 2019, 59, 3981– 3988, DOI: 10.1021/acs.jcim.9b00387[ACS Full Text
], [CAS], Google Scholar
30https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXhs1egur3F&md5=670cd989a369d67c734f6762a5a906cfPredicting Drug-Target Interaction Using a Novel Graph Neural Network with 3D Structure-Embedded Graph RepresentationLim, Jaechang; Ryu, Seongok; Park, Kyubyong; Choe, Yo Joong; Ham, Jiyeon; Kim, Woo YounJournal of Chemical Information and Modeling (2019), 59 (9), 3981-3988CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)We propose a novel deep learning approach for predicting drug-target interaction using a graph neural network. We introduce a distance-aware graph attention algorithm to differentiate various types of intermol. interactions. Furthermore, we ext. the graph feature of intermol. interactions directly from the 3D structural information on the protein-ligand binding pose. Thus, the model can learn key features for accurate predictions of drug-target interaction rather than just memorize certain patterns of ligand mols. As a result, our model shows better performance than docking and other deep learning methods for both virtual screening (AUROC of 0.968 for the DUD-E test set) and pose prediction (AUROC of 0.935 for the PDBbind test set). In addn., it can reproduce the natural population distribution of active mols. and inactive mols. - 32Hadfield, T. E.; Imrie, F.; Merritt, A.; Birchall, K.; Deane, C. M. Incorporating Target-Specific Pharmacophoric Information into Deep Generative Models for Fragment Elaboration. J. Chem. Inf. Model. 2022, 62, 2280– 2292, DOI: 10.1021/acs.jcim.1c01311[ACS Full Text
], [CAS], Google Scholar
31https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XhtFGru7rE&md5=c639b01f0c509bbd2e20308fa044bdb5Incorporating Target-Specific Pharmacophoric Information into Deep Generative Models for Fragment ElaborationHadfield, Thomas E.; Imrie, Fergus; Merritt, Andy; Birchall, Kristian; Deane, Charlotte M.Journal of Chemical Information and Modeling (2022), 62 (10), 2280-2292CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Despite recent interest in deep generative models for scaffold elaboration, their applicability to fragment-to-lead campaigns has so far been limited. This is primarily due to their inability to account for local protein structure or a user's design hypothesis. We propose a novel method for fragment elaboration, STRIFE, that overcomes these issues. STRIFE takes as input fragment hotspot maps (FHMs) extd. from a protein target and processes them to provide meaningful and interpretable structural information to its generative model, which in turn is able to rapidly generate elaborations with complementary pharmacophores to the protein. In a large-scale evaluation, STRIFE outperforms existing, structure-unaware, fragment elaboration methods in proposing highly ligand-efficient elaborations. In addn. to automatically extg. pharmacophoric information from a protein target's FHM, STRIFE optionally allows the user to specify their own design hypotheses. - 33Curran, P. R.; Radoux, C. J.; Smilova, M. D.; Sykes, R. A.; Higueruelo, A. P.; Bradley, A. R.; Marsden, B. D.; Spring, D. R.; Blundell, T. L.; Leach, A. R.; Pitt, W. R.; Cole, J. C. Hotspots API: A Python Package for the Detection of Small Molecule Binding Hotspots and Application to Structure-Based Drug Design. J. Chem. Inf. Model. 2020, 60, 1911– 1916, DOI: 10.1021/acs.jcim.9b00996[ACS Full Text
], [CAS], Google Scholar
32https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXlsFahsbw%253D&md5=9402dded4d637e5b324e329d9e2c9a08Hotspots API: A Python Package for the Detection of Small Molecule Binding Hotspots and Application to Structure-Based Drug DesignCurran, Peter R.; Radoux, Chris J.; Smilova, Mihaela D.; Sykes, Richard A.; Higueruelo, Alicia P.; Bradley, Anthony R.; Marsden, Brian D.; Spring, David R.; Blundell, Tom L.; Leach, Andrew R.; Pitt, William R.; Cole, Jason C.Journal of Chemical Information and Modeling (2020), 60 (4), 1911-1916CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Methods that survey protein surfaces for binding hotspots can help to evaluate target tractability and guide exploration of potential ligand binding regions. Fragment Hotspot Maps builds upon interaction data mined from the CSD (Cambridge Structural Database) and exploits the idea of identifying hotspots using small chem. fragments, which is now widely used to design new drug leads. Prior to this publication, Fragment Hotspot Maps was only publicly available through a web application. To increase the accessibility of this algorithm we present the Hotspots API (application programming interface), a toolkit that offers programmatic access to the core Fragment Hotspot Maps algorithm, thereby facilitating the interpretation and application of 3the anal. To demonstrate the package's utility, we present a workflow which automatically derives protein hydrogen-bond constraints for mol. docking with GOLD. The Hotspots API is available from https://github.com/prcurran/hotspots under the MIT license and is dependent upon the com. CSD Python API. - 34Salentin, S.; Schreiber, S.; Haupt, V. J.; Adasme, M. F.; Schroeder, M. PLIP: fully automated protein–ligand interaction profiler. Nucleic Acids Res. 2015, 43, W443– W447, DOI: 10.1093/nar/gkv315[Crossref], [PubMed], [CAS], Google Scholar33https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXhtVymtbrI&md5=4878881a7bd715e5585682876e102e65PLIP: fully automated protein-ligand interaction profilerSalentin, Sebastian; Schreiber, Sven; Haupt, V. Joachim; Adasme, Melissa F.; Schroeder, MichaelNucleic Acids Research (2015), 43 (W1), W443-W447CODEN: NARHAD; ISSN:0305-1048. (Oxford University Press)The characterization of interactions in protein-ligand complexes is essential for research in structural bioinformatics, drug discovery and biol. However, comprehensive tools are not freely available to the research community. Here, we present the protein-ligand interaction profiler (PLIP), a novel web service for fully automated detection and visualization of relevant non-covalent protein-ligand contacts in 3D structures, freely available at projects.biotec.tu-dresden.de/plip-web. The input is either a Protein Data Bank structure, a protein or ligand name, or a custom protein-ligand complex (e.g. from docking). In contrast to other tools, the rule-based PLIP algorithm does not require any structure prepn. It returns a list of detected interactions on single atom level, covering seven interaction types (hydrogen bonds, hydrophobic contacts, pi-stacking, pi-cation interactions, salt bridges, water bridges and halogen bonds). PLIP stands out by offering publication-ready images, PyMOL sesions files to generate custom images and parsable result files to facilitate successive data processing. The full python source code is available for download on the website. PLIP's command-line mode allows for high-throughput interaction profiling.
- 35Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A. N.; Kaiser, L. u.; Polosukhin, I. Attention is All you Need. arXiv 2017, arXiv:1706.03762, DOI: 10.48550/arXiv.1706.03762
- 36Francoeur, P. G.; Masuda, T.; Sunseri, J.; Jia, A.; Iovanisci, R. B.; Snyder, I.; Koes, D. R. Three-Dimensional Convolutional Neural Networks and a Cross-Docked Data Set for Structure-Based Drug Design. J. Chem. Inf. Model. 2020, 60, 4200– 4215, DOI: 10.1021/acs.jcim.0c00411[ACS Full Text
], [CAS], Google Scholar
35https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhslWqs73N&md5=5a5465187dbdef41d556baddd1a8e4d9Three-Dimensional Convolutional Neural Networks and a Cross-Docked Data Set for Structure-Based Drug DesignFrancoeur, Paul G.; Masuda, Tomohide; Sunseri, Jocelyn; Jia, Andrew; Iovanisci, Richard B.; Snyder, Ian; Koes, David R.Journal of Chemical Information and Modeling (2020), 60 (9), 4200-4215CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)One of the main challenges in drug discovery is predicting protein-ligand binding affinity. Recently, machine learning approaches have made substantial progress on this task. However, current methods of model evaluation are overly optimistic in measuring generalization to new targets, and there does not exist a std. data set of sufficient size to compare performance between models. We present a new data set for structure-based machine learning, the CrossDocked2020 set, with 22.5 million poses of ligands docked into multiple similar binding pockets across the Protein Data Bank, and perform a comprehensive evaluation of grid-based convolutional neural network (CNN) models on this data set. We also demonstrate how the partitioning of the training data and test data can impact the results of models trained with the PDBbind data set, how performance improves by adding more lower-quality training data, and how training with docked poses imparts pose sensitivity to the predicted affinity of a complex. Our best performing model, an ensemble of five densely connected CNNs, achieves a root mean squared error of 1.42 and Pearson R of 0.612 on the affinity prediction task, an AUC of 0.956 at binding pose classification, and a 68.4% accuracy at pose selection on the CrossDocked2020 set. By providing data splits for clustered cross-validation and the raw data for the CrossDocked2020 set, we establish the first standardized data set for training machine learning models to recognize ligands in noncognate target structures while also greatly expanding the no. of poses available for training. In order to facilitate community adoption of this data set for benchmarking protein-ligand binding affinity prediction, we provide our models, wts., and the CrossDocked2020 set at https://github.com/gnina/models. - 37Landrum, G. RDKit: Open-source cheminformatics ; 2016; https://www.bibsonomy.org/bibtex/28d01fceeccd6bf2486e47d7c4207b108/salotz.
- 38O’Boyle, N. M.; Banck, M.; James, C. A.; Morley, C.; Vandermeersch, T.; Hutchison, G. R. Open Babel: An open chemical toolbox. J. Cheminf. 2011, 3, 33, DOI: 10.1186/1758-2946-3-33[Crossref], [PubMed], [CAS], Google Scholar37https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC3MXhsVWjurbF&md5=74e4f19b7f87417f916d57f7abcfb761Open Babel: an open chemical toolboxO'Boyle, Noel M.; Banck, Michael; James, Craig A.; Morley, Chris; Vandermeersch, Tim; Hutchison, Geoffrey R.Journal of Cheminformatics (2011), 3 (), 33CODEN: JCOHB3; ISSN:1758-2946. (Chemistry Central Ltd.)Background: A frequent problem in computational modeling is the interconversion of chem. structures between different formats. While std. interchange formats exist (for example, Chem. Markup Language) and de facto stds. have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chem. data, differences in the data stored by different formats (0D vs. 3D, for example), and competition between software along with a lack of vendor-neutral formats. Results: We discuss, for the first time, Open Babel, an open-source chem. toolbox that speaks the many languages of chem. data. Open Babel version 2.3 interconverts over 110 formats. The need to represent such a wide variety of chem. and mol. data requires a library that implements a wide range of cheminformatics algorithms, from partial charge assignment and aromaticity detection, to bond order perception and canonicalization. We detail the implementation of Open Babel, describe key advances in the 2.3 release, and outline a variety of uses both in terms of software products and scientific research, including applications far beyond simple format interconversion. Conclusions: Open Babel presents a soln. to the proliferation of multiple chem. file formats. In addn., it provides a variety of useful utilities from conformer searching and 2D depiction, to filtering, batch conversion, and substructure and similarity searching. For developers, it can be used as a programming library to handle chem. data in areas such as org. chem., drug design, materials science, and computational chem. It is freely available under an open-source license.
- 39Yang, J.; Shen, C.; Huang, N. Predicting or Pretending: Artificial Intelligence for Protein-Ligand Interactions Lack of Sufficiently Large and Unbiased Datasets. Frontiers in Pharmacology 2020, 11, 1, DOI: 10.3389/fphar.2020.00069
- 40Scantlebury, J.; Brown, N.; Von Delft, F.; Deane, C. M. Data Set Augmentation Allows Deep Learning-Based Virtual Screening to Better Generalize to Unseen Target Classes and Highlight Important Binding Interactions. J. Chem. Inf. Model. 2020, 60, 3722– 3730, DOI: 10.1021/acs.jcim.0c00263[ACS Full Text
], [CAS], Google Scholar
39https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB3cXhsVaqt77O&md5=8859d41da975e883cac7c2489befab60Data Set Augmentation Allows Deep Learning-Based Virtual Screening to Better Generalize to Unseen Target Classes and Highlight Important Binding InteractionsScantlebury, Jack; Brown, Nathan; Von Delft, Frank; Deane, Charlotte M.Journal of Chemical Information and Modeling (2020), 60 (8), 3722-3730CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Current deep learning methods for structure-based virtual screening take the structures of both the protein and the ligand as input but make little or no use of the protein structure when predicting ligand binding. Here, a relatively simple method of data set augmentation forces such deep learning methods to take into account information from the protein. Models trained in this way are more generalizable (make better predictions on protein/ligand complexes from a different distribution to the training data). They also assign more meaningful importance to the protein and ligand atoms involved in binding. Overall, the authors' results show that data set augmentation can help deep learning-based virtual screening to learn phys. interactions rather than data set biases. - 41Satorras, V. G.; Hoogeboom, E.; Welling, M. E(n) Equivariant Graph Neural Networks. arXiv 2021, arXiv:2102.09844, DOI: 10.48550/arXiv.2102.09844
- 42Ragoza, M.; Hochuli, J.; Idrobo, E.; Sunseri, J.; Koes, D. R. Protein-Ligand Scoring with Convolutional Neural Networks. J. Chem. Inf. Model. 2017, 57, 942– 957, DOI: 10.1021/acs.jcim.6b00740[ACS Full Text
], [CAS], Google Scholar
41https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC2sXlsVems7Y%253D&md5=9cae97167da0c93e1d896f85339cca7eProtein-Ligand Scoring with Convolutional Neural NetworksRagoza, Matthew; Hochuli, Joshua; Idrobo, Elisa; Sunseri, Jocelyn; Koes, David RyanJournal of Chemical Information and Modeling (2017), 57 (4), 942-957CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Computational approaches to drug discovery can reduce the time and cost assocd. with exptl. assays and enable the screening of novel chemotypes. Structure-based drug design methods rely on scoring functions to rank and predict binding affinities and poses. The ever-expanding amt. of protein-ligand binding and structural data enables the use of deep machine learning techniques for protein-ligand scoring. We describe convolutional neural network (CNN) scoring functions that take as input a comprehensive three-dimensional (3D) representation of a protein-ligand interaction. A CNN scoring function automatically learns the key features of protein-ligand interactions that correlate with binding. We train and optimize our CNN scoring functions to discriminate between correct and incorrect binding poses and known binders and nonbinders. We find that our CNN scoring function outperforms the AutoDock Vina scoring function when ranking poses both for pose prediction and virtual screening. - 43Zhu, H.; Yang, J.; Huang, N. Assessment of the Generalization Abilities of Machine-Learning Scoring Functions for Structure-Based Virtual Screening. J. Chem. Inf. Model. 2022, 62, 5485– 5502, DOI: 10.1021/acs.jcim.2c01149[ACS Full Text
], [CAS], Google Scholar
42https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38Xis1Kru7rK&md5=d6197f1f9a4da0df36b420bb9746b446Assessment of the Generalization Abilities of Machine-Learning Scoring Functions for Structure-Based Virtual ScreeningZhu, Hui; Yang, Jincai; Huang, NiuJournal of Chemical Information and Modeling (2022), 62 (22), 5485-5502CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)In structure-based virtual screening (SBVS), it is crit. that scoring functions capture protein-ligand at. interactions. By focusing on the local domains of ligand binding pockets, a standardized pocket Pfam-based clustering (Pfam-cluster) approach was developed to assess the cross-target generalization ability of machine-learning scoring functions (MLSFs). Subsequently, 12 typical MLSFs were evaluated using random cross-validation (Random-CV), protein sequence similarity-based cross-validation (Seq-CV), and pocket Pfam-based cross-validation (Pfam-CV) methods. Surprisingly, all of the tested models showed decreased performances from Random-CV to Seq-CV to Pfam-CV expts., not showing satisfactory generalization capacity. Our interpretable anal. suggested that the predictions on novel targets by MLSFs were dependent on buried solvent-accessible surface area (SASA)-related features of complex structures, with greater predicted binding affinities on complexes owning larger protein-ligand interfaces. By combining buried SASA-related features with target-specific patterns that were only shared among structurally similar compds. in the same cluster, the random forest (RF)-Score attained a good performance in the Random-CV test. Based on these findings, we strongly advise assessing the generalization ability of MLSFs with the Pfam-cluster approach and being cautious with the features learned by MLSFs. - 44Radoux, C. J.; Olsson, T. S.; Pitt, W. R.; Groom, C. R.; Blundell, T. L. Identifying interactions that determine fragment binding at protein hotspots. J. Med. Chem. 2016, 59, 4314– 4325, DOI: 10.1021/acs.jmedchem.5b01980[ACS Full Text
], [CAS], Google Scholar
43https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28XltlOms7Y%253D&md5=fa25eabfe291f9519f5701bcbc6c52c3Identifying Interactions that Determine Fragment Binding at Protein HotspotsRadoux, Chris J.; Olsson, Tjelvar S. G.; Pitt, Will R.; Groom, Colin R.; Blundell, Tom L.Journal of Medicinal Chemistry (2016), 59 (9), 4314-4325CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)Locating a ligand-binding site is an important first step in structure-guided drug discovery, but current methods do little to suggest which interactions within a pocket are the most important for binding. Here the authors illustrate a method that samples at. hotspots with simple mol. probes to produce fragment hotspot maps. These maps specifically highlight fragment-binding sites and their corresponding pharmacophores. For ligand-bound structures, they provide an intuitive visual guide within the binding site, directing medicinal chemists where to grow the mol. and alerting them to suboptimal interactions within the original hit. The fragment hotspot map calcn. is validated using exptl. binding positions of 21 fragments and subsequent lead mols. The ligands are found in high scoring areas of the fragment hotspot maps, with fragment atoms having a median percentage rank of 97%. Protein kinase B and pantothenate synthetase are examd. in detail. In each case, the fragment hotspot maps are able to rationalize a Free-Wilson anal. of SAR data from a fragment-based drug design project. - 45Skyner, R.; von Delft, F. Xchem, fragalysis; https://fragalysis.diamond.ac.uk/ (accessed: 2022-07-30).Google ScholarThere is no corresponding record for this reference.
- 46Wang, C.; Zhang, Y. Improving scoring-docking-screening powers of protein–ligand scoring functions using random forest. J. Comput. Chem. 2017, 38, 169– 177, DOI: 10.1002/jcc.24667[Crossref], [PubMed], [CAS], Google Scholar45https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC28XhvVygurzM&md5=1a3ea0b5f159c9c0edf315fed2110ed0Improving scoring-docking-screening powers of protein-ligand scoring functions using random forestWang, Cheng; Zhang, YingkaiJournal of Computational Chemistry (2017), 38 (3), 169-177CODEN: JCCHDD; ISSN:0192-8651. (John Wiley & Sons, Inc.)The development of new protein-ligand scoring functions using machine learning algorithms, such as random forest, has been of significant interest. By efficiently using expanded feature sets and a large set of exptl. data, random forest based scoring functions (RFbScore) can achieve better correlations to exptl. protein-ligand binding data with known crystal structures; however, more extensive tests indicate that such enhancement in scoring power comes with significant under-performance in docking and screening power tests compared to traditional scoring functions. To improve scoring-docking-screening powers of protein-ligand docking functions simultaneously, the authors have introduced a ΔvinaRF parameterization and feature selection framework based on random forest. The authors' developed scoring function ΔvinaRF20, which employs 20 descriptors in addn. to the AutoDock Vina score, can achieve superior performance in all power tests of both CASF-2013 and CASF-2007 benchmarks compared to classical scoring functions. The ΔvinaRF20 scoring function and its code are freely available on the web at: https://www.nyu.edu/projects/yzhang/DeltaVina.
- 47Lu, J.; Hou, X.; Wang, C.; Zhang, Y. Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring Functions. J. Chem. Inf. Model. 2019, 59, 4540– 4549, DOI: 10.1021/acs.jcim.9b00645[ACS Full Text
], [CAS], Google Scholar
46https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BC1MXitVSqs73O&md5=57fea8fa6135a109c3bcfd5ddffb1b33Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring FunctionsLu, Jianing; Hou, Xuben; Wang, Cheng; Zhang, YingkaiJournal of Chemical Information and Modeling (2019), 59 (11), 4540-4549CODEN: JCISD8; ISSN:1549-9596. (American Chemical Society)Structure-based drug design is critically dependent on accuracy of mol. docking scoring functions, and there is a significant interest to advance scoring functions with machine learning approaches. In this work, by judiciously expanding the training set, exploring new features related to explicit mediating water mols. as well as ligand conformation stability, and applying extreme gradient boosting (XGBoost) with Δ-Vina parametrization, we have improved robustness and applicability of machine-learning scoring functions. The new scoring function ΔvinaXGB can not only perform consistently among the top compared to classical scoring functions for the CASF-2016 benchmark but also achieves significantly better prediction accuracy in different types of structures that mimic real docking applications. - 48Moon, S.; Zhung, W.; Yang, S.; Lim, J.; Kim, W. Y. PIGNet: a physics-informed deep learning model toward generalized drug–target interaction predictions. Chem. Sci. 2022, 13, 3661– 3673, DOI: 10.1039/D1SC06946B[Crossref], [PubMed], [CAS], Google Scholar47https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XmtlKktb0%253D&md5=c7ca118db214dde8563a0945db0a2331PIGNet: a physics-informed deep learning model toward generalized drug-target interaction predictionsMoon, Seokhyun; Zhung, Wonho; Yang, Soojung; Lim, Jaechang; Kim, Woo YounChemical Science (2022), 13 (13), 3661-3673CODEN: CSHCCN; ISSN:2041-6520. (Royal Society of Chemistry)Recently, deep neural network (DNN)-based drug-target interaction (DTI) models were highlighted for their high accuracy with affordable computational costs. Yet, the models' insufficient generalization remains a challenging problem in the practice of in silico drug discovery. We propose two key strategies to enhance generalization in the DTI model. The first is to predict the atom-atom pairwise interactions via physics-informed equations parameterized with neural networks and provides the total binding affinity of a protein-ligand complex as their sum. We further improved the model generalization by augmenting a broader range of binding poses and ligands to training data. We validated our model, PIGNet, in the comparative assessment of scoring functions (CASF) 2016, demonstrating the outperforming docking and screening powers than previous methods. Our physics-informing strategy also enables the interpretation of predicted affinities by visualizing the contribution of ligand substructures, providing insights for further ligand optimization.
- 49Shen, C.; Zhang, X.; Deng, Y.; Gao, J.; Wang, D.; Xu, L.; Pan, P.; Hou, T.; Kang, Y. Boosting Protein–Ligand Binding Pose Prediction and Virtual Screening Based on Residue–Atom Distance Likelihood Potential and Graph Transformer. J. Med. Chem. 2022, 65, 10691– 10706, DOI: 10.1021/acs.jmedchem.2c00991[ACS Full Text
], [CAS], Google Scholar
48https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XhvF2htrbO&md5=217e5d659ff8c69c9c9559c99344aef0Boosting Protein-Ligand Binding Pose Prediction and Virtual Screening Based on Residue-Atom Distance Likelihood Potential and Graph TransformerShen, Chao; Zhang, Xujun; Deng, Yafeng; Gao, Junbo; Wang, Dong; Xu, Lei; Pan, Peichen; Hou, Tingjun; Kang, YuJournal of Medicinal Chemistry (2022), 65 (15), 10691-10706CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)The past few years have witnessed enormous progress toward applying machine learning approaches to the development of protein-ligand scoring functions. However, the robust performance and wide applicability of scoring functions remain a big challenge for increasing the success rate of docking-based virtual screening. Herein, a novel scoring function named RTMScore was developed by introducing a tailored residue-based graph representation strategy and several graph transformer layers for the learning of protein and ligand representations, followed by a mixt. d. network to obtain residue-atom distance likelihood potential. Our approach was resolutely validated on the CASF-2016 benchmark, and the results indicate that RTMScore can outperform almost all of the other state-of-the-art methods in terms of both the docking and screening powers. Further evaluation confirms the robustness of our approach that can not only retain its docking power on cross-docked poses but also achieve improved performance as a rescoring tool in larger-scale virtual screening. - 50Liao, Z.; You, R.; Huang, X.; Yao, X.; Huang, T.; Zhu, S. DeepDock: Enhancing Ligand-protein Interaction Prediction by a Combination of Ligand and Structure Information. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) ; 2019; pp 311– 317.
- 51https://fragalysis.diamond.ac.uk/viewer/react/download/tag/5740113a-7603-4af4-9523-7f902186d4a2 (accessed: 2022-10-14).Google ScholarThere is no corresponding record for this reference.
- 52Carbery, A.; Skyner, R.; von Delft, F.; Deane, C. M. Fragment Libraries Designed to Be Functionally Diverse Recover Protein Binding Information More Efficiently Than Standard Structurally Diverse Libraries. J. Med. Chem. 2022, 65, 11404– 11413, DOI: 10.1021/acs.jmedchem.2c01004[ACS Full Text
], [CAS], Google Scholar
51https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A528%3ADC%252BB38XitFWktLnO&md5=20c210891befe38eb6ec658cba2a01b8Fragment Libraries Designed to Be Functionally Diverse Recover Protein Binding Information More Efficiently Than Standard Structurally Diverse LibrariesCarbery, Anna; Skyner, Rachael; von Delft, Frank; Deane, Charlotte M.Journal of Medicinal Chemistry (2022), 65 (16), 11404-11413CODEN: JMCMAR; ISSN:0022-2623. (American Chemical Society)Current fragment-based drug design relies on the efficient exploration of chem. space by using structurally diverse libraries of small fragments. However, structurally dissimilar compds. can exploit the same interactions and thus be functionally similar. Using three-dimensional structures of many fragments bound to multiple targets, we examd. if a better strategy for selecting fragments for screening libraries exists. We show that structurally diverse fragments can be described as functionally redundant, often making the same interactions. Ranking fragments by the no. of novel interactions they made, we show that functionally diverse selections of fragments substantially increase the amt. of information recovered for unseen targets compared to the amts. recovered by other methods of selection. Using these results, we design small functionally efficient libraries that can give significantly more information about new protein targets than similarly sized structurally diverse libraries. By covering more functional space, we can generate more diverse sets of drug leads. - 53Schuller, M. Fragment binding to the Nsp3 macrodomain of SARS-CoV-2 identified through crystallographic screening and computational docking. Sci. Adv. 2021, 7, eabf8711 DOI: 10.1126/sciadv.abf8711
- 54Imprachim, N.; Yosaatmadja, Y.; Newman, J. A. Crystal structures and fragment screening of SARS-CoV-2 NSP14 reveal details of exoribonuclease activation and mRNA capping and provide starting points for antiviral drug development. Nucleic Acids Research 2023, 51, 475, DOI: 10.1093/nar/gkac1207[Crossref], [PubMed], [CAS], Google Scholar53https://chemport.cas.org/services/resolver?origin=ACS&resolution=options&coi=1%3ACAS%3A280%3ADC%252BB28rps1ahtg%253D%253D&md5=b3ad340352d17592df9e8d0fffa58af3Crystal structures and fragment screening of SARS-CoV-2 NSP14 reveal details of exoribonuclease activation and mRNA capping and provide starting points for antiviral drug developmentImprachim Nergis; Yosaatmadja Yuliana; Newman Joseph ANucleic acids research (2023), 51 (1), 475-487 ISSN:.NSP14 is a dual function enzyme containing an N-terminal exonuclease domain (ExoN) and C-terminal Guanine-N7-methyltransferase (N7-MTase) domain. Both activities are essential for the viral life cycle and may be targeted for anti-viral therapeutics. NSP14 forms a complex with NSP10, and this interaction enhances the nuclease but not the methyltransferase activity. We have determined the structure of SARS-CoV-2 NSP14 in the absence of NSP10 to 1.7 ÅA resolution. Comparisons with NSP14/NSP10 complexes reveal significant conformational changes that occur within the NSP14 ExoN domain upon binding of NSP10, including helix to coil transitions that facilitate the formation of the ExoN active site and provide an explanation of the stimulation of nuclease activity by NSP10. We have determined the structure of NSP14 in complex with cap analogue 7MeGpppG, and observe conformational changes within a SAM/SAH interacting loop that plays a key role in viral mRNA capping offering new insights into MTase activity. We perform an X-ray fragment screen on NSP14, revealing 72 hits bound to sites of inhibition in the ExoN and MTase domains. These fragments serve as excellent starting point tools for structure guided development of NSP14 inhibitors that may be used to treat COVID-19 and potentially other future viral threats.
- 55https://fragalysis.diamond.ac.uk/viewer/react/download/tag/01f41754-b2bb-4817-acc3-a5ebe820316d (accessed: 2022-10-14).Google ScholarThere is no corresponding record for this reference.
- 56https://fragalysis.diamond.ac.uk/viewer/react/download/tag/3df36d6b-3a5b-400d-97eb-af3b0e0df42d (accessed: 2022-10-14).Google ScholarThere is no corresponding record for this reference.
Supporting Information
Supporting Information
ARTICLE SECTIONSThe Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.3c00322.
A brief description of equivariance and invariance, some more in-depth information about the architecture of PointVS and the data sets used to train and test it, a comparison of the different attribution methods used, a description of the process by which the API hotspots were obtained, an additional assessment of the reliance of PointVS hotspots on fragment screen size, and a case study showing how PointVS could have aided in a real-world fragment-to-lead campaign; also all of the IDs of the structures used, from both fragalysis and the PDB (PDF)
Terms & Conditions
Most electronic Supporting Information files are available without a subscription to ACS Web Editions. Such files may be downloaded by article for research use (if there is a public use license linked to the relevant article, that license may permit other uses). Permission may be obtained from ACS for other uses through requests via the RightsLink permission system: http://pubs.acs.org/page/copyright/permissions.html.