Application of Nearest-Neighbor and Cluster Analyses in Pharmaceutical Lead Discovery

David T. Stanton, Timothy W. Morris, Siddhartha Roychoudhury, and Christian N. Parker*
Procter & Gamble Pharmaceuticals Health Care Research Center, 8700 Mason-Montgomery Road, Mason, Ohio 45040-9462
J. Chem. Inf. Comput. Sci., 1999, 39 (1), pp 21–27
DOI: 10.1021/ci9801015
Publication Date (Web): January 5, 1999
Copyright © 1999 American Chemical Society
*

 Corresponding author e-mail address:  parker.cn@pg.com.

Abstract

High throughput screening (HTS) programs based on diverse collections of compounds can rapidly identify leads for potential drug candidates. In cases where the compound collection is truly diverse, one may only identify a few compounds of interest. However, where a large number of hits are identified, it becomes necessary to examine the structures to determine the true number of compound classes involved so that follow-up studies may be conducted as efficiently as possible. In this case, cluster analysis is applied to determine the structural relationship among HTS hits. To efficiently expand around the region of the hit (or a class of hits) in chemical space, we have applied nearest neighbors analysis1 to select additional compounds from collections of a large number of commercial vendors, achieving an average hit rate in excess of 15%. Applying these techniques in a number of different cases, we obtained results that are useful for subsequent investigations of hits from HTS and other relevant molecular structures from the literature.

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

History

  • Published In Issue January 25, 1999
  • Received June 6, 1998

Recommend & Share

Related Content

Other ACS content by these authors: