Estimating the Confidence of Peptide Identifications without Decoy Databases

Bernhard Y. Renard, Wiebke Timm§, Marc Kirchner§, Judith A. J. Steen, Fred A. Hamprecht and Hanno Steen*§
Interdisciplinary Center for Scientific Computing, University of Heidelberg, Speyerer Strasse 6, 69115 Heidelberg, Germany, Proteomics Center at Children’s Hospital Boston, 320 Longwood Avenue, Boston, Massachusetts 02115, Department of Pathology, Children’s Hospital Boston and Harvard Medical School, 320 Longwood Avenue, Boston, Massachusetts 02115, and Department of Neurobiology, Harvard Medical School and F. M. Kirby Neurobiology Center, Children’s Hospital Boston, 3 Blackfan Circle, Boston, Massachusetts 02115
Anal. Chem., 2010, 82 (11), pp 4314–4318
DOI: 10.1021/ac902892j
Publication Date (Web): May 10, 2010
Copyright © 2010 American Chemical Society
* Corresponding author. Hanno Steen, Children’s Hospital Boston, Department of Pathology, Enders 1130, 320 Longwood Avenue, Boston, MA 02115. Fax: +1-617-730-0168. E-mail: hanno.steen@childrens.harvard.edu., †

University of Heidelberg.

, ‡

Proteomics Center at Children’s Hospital Boston.

, §

Department of Pathology, Children’s Hospital Boston and Harvard Medical School.

,

Department of Neurobiology, Harvard Medical School and F. M. Kirby Neurobiology Center, Children’s Hospital Boston.

Abstract

Using decoy databases to compute the confidence of peptide identifications has become the standard procedure for mass spectrometry driven proteomics. While decoy databases have numerous advantages, they double the run time and are not applicable to all peptide identification problems such as error-tolerant or de novo searches or the large-scale identification of cross-linked peptides. Instead, we propose a fast, simple and robust mixture modeling approach to estimate the confidence of peptide identifications without the need for decoy database searches, which automatically checks whether its underlying assumptions are fulfilled. This approach is then evaluated on 41 LC/MS data sets of varying complexity and origin. The results are very similar to those of the decoy database strategy at a negligible computational cost. Our approach is applicable not only to standard protein identification workflows, but also to proteomics problems for which meaningful decoy databases cannot be constructed.

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

Explore by:


History

  • Published In Issue June 01, 2010
  • Article ASAPMay 10, 2010
  • Received: December 18, 2009
    Accepted: April 20, 2010

Recommend & Share

  • Share on ACS NetworkACS Network
  • Add to FacebookFacebook
  • Tweet ThisTweet This
  • Add to CiteULikeCiteULike
  • Add to NewsvineNewsvine
  • Digg ThisDigg This
  • Add to DeliciousDelicious

Related Content

Other ACS content by these authors: