Evaluating Virtual Screening Methods:  Good and Bad Metrics for the “Early Recognition” Problem

Jean-François Truchon* and Christopher I. Bayly
Department of Medicinal Chemistry, Merck Frosst Centre for Therapeutic Research, 16711 TransCanada Highway, Kirkland, Québec, Canada H9H 3L1
J. Chem. Inf. Model., 2007, 47 (2), pp 488–508
DOI: 10.1021/ci600426e
Publication Date (Web): February 9, 2007
Copyright © 2007 American Chemical Society

Abstract

Abstract Image

Many metrics are currently used to evaluate the performance of ranking methods in virtual screening (VS), for instance, the area under the receiver operating characteristic curve (ROC), the area under the accumulation curve (AUAC), the average rank of actives, the enrichment factor (EF), and the robust initial enhancement (RIE) proposed by Sheridan et al. In this work, we show that the ROC, the AUAC, and the average rank metrics have the same inappropriate behaviors that make them poor metrics for comparing VS methods whose purpose is to rank actives early in an ordered list (the “early recognition problem”). In doing so, we derive mathematical formulas that relate those metrics together. Moreover, we show that the EF metric is not sensitive to ranking performance before and after the cutoff. Instead, we formally generalize the ROC metric to the early recognition problem which leads us to propose a novel metric called the Boltzmann-enhanced discrimination of receiver operating characteristic that turns out to contain the discrimination power of the RIE metric but incorporates the statistical significance from ROC and its well-behaved boundaries. Finally, two major sources of errors, namely, the statistical error and the “saturation effects”, are examined. This leads to practical recommendations for the number of actives, the number of inactives, and the “early recognition” importance parameter that one should use when comparing ranking methods. Although this work is applied specifically to VS, it is general and can be used to analyze any method that needs to segregate actives toward the front of a rank-ordered list.

Citing Articles

View all 87 citing articles

Citation data is made available by participants in CrossRef's Cited-by Linking service. For a more comprehensive list of citations to this article, users are encouraged to perform a search in SciFinder.

This article has been cited by 47 ACS Journal articles (5 most recent appear below).

  • Cover Image

    Drug Effect Prediction by Polypharmacology-Based Interaction Profiling

    Zoltán Simon, Ágnes Peragovics, Margit Vigh-Smeller, Gábor Csukly, László Tombor, Zhenhui Yang, Gergely Zahoránszky-Kőhalmi, László Végner, Balázs Jelinek, Péter Hári, Csaba Hetényi, István Bitter, Pál Czobor, and András Málnási-Csizmadia
    Journal of Chemical Information and Modeling2012 52 (1), 134-145
    • Drug Effect Prediction by Polypharmacology-Based Interaction Profiling

      Zoltán Simon, Ágnes Peragovics, Margit Vigh-Smeller, Gábor Csukly, László Tombor, Zhenhui Yang, Gergely Zahoránszky-Kőhalmi, László Végner, Balázs Jelinek, Péter Hári, Csaba Hetényi, István Bitter, Pál Czobor, and András Málnási-Csizmadia
      Journal of Chemical Information and Modeling2012 52 (1), 134-145

      Most drugs exert their effects via multitarget interactions, as hypothesized by polypharmacology. While these multitarget interactions are responsible for the clinical effect profiles of drugs, current methods have failed to uncover the complex ...

  • Cover Image

    PLS-DA - Docking Optimized Combined Energetic Terms (PLSDA-DOCET) Protocol: A Brief Evaluation

    Sorin Avram, Liliana M. Pacureanu, Edward Seclaman, Alina Bora, and Ludovic Kurunczi
    Journal of Chemical Information and Modeling2011 51 (12), 3169-3179
    • PLS-DA - Docking Optimized Combined Energetic Terms (PLSDA-DOCET) Protocol: A Brief Evaluation

      Sorin Avram, Liliana M. Pacureanu, Edward Seclaman, Alina Bora, and Ludovic Kurunczi
      Journal of Chemical Information and Modeling2011 51 (12), 3169-3179

      Docking studies have become popular approaches in drug design, where the binding energy of the ligand in the active site of the protein is estimated by a scoring function. Many promising techniques were developed to enhance the performance of scoring ...

  • Cover Image

    Jointly Handling Potency and Toxicity of Antimicrobial Peptidomimetics by Simple Rules from Desirability Theory and Chemoinformatics

    Maykel Cruz-Monteagudo, Fernanda Borges, and M. Natália D. S. Cordeiro
    Journal of Chemical Information and Modeling2011 51 (12), 3060-3077
    • Jointly Handling Potency and Toxicity of Antimicrobial Peptidomimetics by Simple Rules from Desirability Theory and Chemoinformatics

      Maykel Cruz-Monteagudo, Fernanda Borges, and M. Natália D. S. Cordeiro
      Journal of Chemical Information and Modeling2011 51 (12), 3060-3077

      Today, emerging and increasing resistance to antibiotics has become a threat to public health worldwide. Antimicrobial peptides have unique action mechanisms making them an attractive therapeutic prospect to be applied against resistant bacteria. However, ...

  • Cover Image

    The Ensemble Performance Index: An Improved Measure for Assessing Ensemble Pose Prediction Performance

    Oliver Korb, Patrick McCabe, and Jason Cole
    Journal of Chemical Information and Modeling2011 51 (11), 2915-2919
    • The Ensemble Performance Index: An Improved Measure for Assessing Ensemble Pose Prediction Performance

      Oliver Korb, Patrick McCabe, and Jason Cole
      Journal of Chemical Information and Modeling2011 51 (11), 2915-2919

      We present a theoretical study on the performance of ensemble docking methodologies considering multiple protein structures. We perform a theoretical analysis of pose prediction experiments which is completely unbiased, as we make no assumptions about ...

  • Cover Image

    Rapid Shape-Based Ligand Alignment and Virtual Screening Method Based on Atom/Feature-Pair Similarities and Volume Overlap Scoring

    G. Madhavi Sastry, Steven L. Dixon, and Woody Sherman
    Journal of Chemical Information and Modeling2011 51 (10), 2455-2466
    • Rapid Shape-Based Ligand Alignment and Virtual Screening Method Based on Atom/Feature-Pair Similarities and Volume Overlap Scoring

      G. Madhavi Sastry, Steven L. Dixon, and Woody Sherman
      Journal of Chemical Information and Modeling2011 51 (10), 2455-2466

      Shape-based methods for aligning and scoring ligands have proven to be valuable in the field of computer-aided drug design. Here, we describe a new shape-based flexible ligand superposition and virtual screening method, Phase Shape, which is shown to ...

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

Explore by:


History

  • Published In Issue March 26, 2007
  • Received October 2, 2006

Recommend & Share

  • Share on ACS NetworkACS Network
  • Add to FacebookFacebook
  • Tweet ThisTweet This
  • Add to CiteULikeCiteULike
  • Add to NewsvineNewsvine
  • Digg ThisDigg This
  • Add to DeliciousDelicious

Related Content

Other ACS content by these authors: