Diverse, High-Quality Test Set for the Validation of Protein−Ligand Docking Performance

Michael J. Hartshorn, Marcel L. Verdonk,* Gianni Chessari, Suzanne C. Brewerton, Wijnand T. M. Mooij, Paul N. Mortenson, and Christopher W. Murray
Astex Therapeutics, Ltd., 436 Cambridge Science Park, Milton Road, Cambridge CB4 0QA, United Kingdom
J. Med. Chem., 2007, 50 (4), pp 726–741
DOI: 10.1021/jm061277y
Publication Date (Web): January 19, 2007
Copyright © 2007 American Chemical Society
*

 To whom correspondence should be addressed. Tel.:  +44 1223 226206. Fax:  +44 1223 226201. E-mail:  m.verdonk@astex-therapeutics.com.

Abstract

Abstract Image

A procedure for analyzing and classifying publicly available crystal structures has been developed. It has been used to identify high-resolution protein−ligand complexes that can be assessed by reconstructing the electron density for the ligand using the deposited structure factors. The complexes have been clustered according to the protein sequences, and clusters have been discarded if they do not represent proteins thought to be of direct interest to the pharmaceutical or agrochemical industry. Rules have been used to exclude complexes containing non-drug-like ligands. One complex from each cluster has been selected where a structure of sufficient quality was available. The final Astex diverse set contains 85 diverse, relevant protein−ligand complexes, which have been prepared in a format suitable for docking and are to be made freely available to the entire research community (http://www.ccdc.cam.ac.uk). The performance of the docking program GOLD against the new set is assessed using a variety of protocols. Relatively unbiased protocols give success rates of approximately 80% for redocking into native structures, but it is possible to get success rates of over 90% with some protocols.

Related Content

Other ACS content by these authors: