Research Article
OptiSim: An Extended Dissimilarity Selection Method for Finding Diverse Representative Subsets‡
Purchase the full-text
- PDF/HTML,
figures/images,
references and tables,
(where available)
Abstract
Compound selection methods currently available to chemists are based on maximum or minimum dissimilarity selection or on hierarchical clustering. Optimizable K-Dissimilarity Selection (OptiSim) is a novel and efficient stochastic selection algorithm which includes maximum and minimum dissimilarity-based selection as special cases. By adjusting the subsample size parameter K, it is possible to adjust the balance between representativeness and diversity in the compounds selected. The OptiSim algorithm is described, along with some analytical tools for comparing it to other selection methods. Such comparisons indicate that OptiSim can mimic the representativeness of selections based on hierarchical clustering and, at least in some cases, improve upon them.
Citing Articles
Citation data is made available by participants in CrossRef's Cited-by Linking service. For a more comprehensive list of citations to this article, users are encouraged to perform a search in SciFinder.
This article has been cited by 29 ACS Journal articles (5 most recent appear below).

Maximum-Score Diversity Selection for Early Drug Discovery
Thorsten Meinl, Claude Ostermann, and Michael R. BertholdJournal of Chemical Information and Modeling2011 51 (2), 237-247Maximum-Score Diversity Selection for Early Drug Discovery
Thorsten Meinl, Claude Ostermann, and Michael R. BertholdJournal of Chemical Information and Modeling2011 51 (2), 237-247Diversity selection is a common task in early drug discovery. One drawback of current approaches is that usually only the structural diversity is taken into account, therefore, activity information is ignored. In this article, we present a modified ...

Discovery of Novel GSK-3β Inhibitors with Potent in Vitro and in Vivo Activities and Excellent Brain Permeability Using Combined Ligand- and Structure-Based Virtual Screening
Mohammad A. Khanfar, Ronald A. Hill, Amal Kaddoumi, and Khalid A. El SayedJournal of Medicinal Chemistry2010 53 (24), 8534-8545Discovery of Novel GSK-3β Inhibitors with Potent in Vitro and in Vivo Activities and Excellent Brain Permeability Using Combined Ligand- and Structure-Based Virtual Screening
Mohammad A. Khanfar, Ronald A. Hill, Amal Kaddoumi, and Khalid A. El SayedJournal of Medicinal Chemistry2010 53 (24), 8534-8545Dysregulation of glycogen synthase kinase (GSK-3β) is implicated in the pathophysiology of many diseases, including type-2 diabetes, stroke, Alzheimer’s, and others. A multistage virtual screening strategy designed so as to overcome known caveats arising ...

Maximum Unbiased Validation (MUV) Data Sets for Virtual Screening Based on PubChem Bioactivity Data
Sebastian G. Rohrer and Knut BaumannJournal of Chemical Information and Modeling2009 49 (2), 169-184Maximum Unbiased Validation (MUV) Data Sets for Virtual Screening Based on PubChem Bioactivity Data
Sebastian G. Rohrer and Knut BaumannJournal of Chemical Information and Modeling2009 49 (2), 169-184Refined nearest neighbor analysis was recently introduced for the analysis of virtual screening benchmark data sets. It constitutes a technique from the field of spatial statistics and provides a mathematical framework for the nonparametric analysis of ...

Data Mining a Small Molecule Drug Screening Representative Subset from NIH PubChem
Xiang-Qun Xie and Jian-Zhong ChenJournal of Chemical Information and Modeling2008 48 (3), 465-475Data Mining a Small Molecule Drug Screening Representative Subset from NIH PubChem
Xiang-Qun Xie and Jian-Zhong ChenJournal of Chemical Information and Modeling2008 48 (3), 465-475PubChem is a scientific showcase of the NIH Roadmap Initiatives. It is a compound repository created to facilitate information exchange and data sharing among the NIH Roadmap-funded Molecular Library Screening Center Network (MLSCN) and the scientific ...

A Scalable Approach to Combinatorial Library Design for Drug Discovery
Puneet Sharma, Srinivasa Salapaka, and Carolyn BeckJournal of Chemical Information and Modeling2008 48 (1), 27-41A Scalable Approach to Combinatorial Library Design for Drug Discovery
Puneet Sharma, Srinivasa Salapaka, and Carolyn BeckJournal of Chemical Information and Modeling2008 48 (1), 27-41In this paper, we propose an algorithm for the design of lead generation libraries required in combinatorial drug discovery. This algorithm addresses simultaneously the two key criteria of diversity and representativeness of compounds in the resulting ...
Tools
-
Add to Favorites
-
Download Citation
-
Email a Colleague -
Permalink
Order Reprints
Rights & Permissions
Citation Alerts
Cart


ACS
Network






