Use of Artificial Neural Networks for the Accurate Prediction of Peptide Liquid Chromatography Elution Times in Proteome Analyses

Konstantinos Petritis, Lars J. Kangas, Patrick L. Ferguson, Gordon A. Anderson, Ljiljana Paša-Tolić, Mary S. Lipton, Kenneth J. Auberry, Eric F. Strittmatter, Yufeng Shen, Rui Zhao, and Richard D. Smith*
Biological Sciences Division and Environmental and Molecular Laboratory, Pacific Northwest National Laboratory, P.O. Box 999, Richland, Washington 99352
Anal. Chem., 2003, 75 (5), pp 1039–1048
DOI: 10.1021/ac0205154
Publication Date (Web): January 25, 2003
Copyright © 2003 American Chemical Society
*

 Corresponding author.

Abstract

The use of artificial neural networks (ANNs) is described for predicting the reversed-phase liquid chromatography retention times of peptides enzymatically digested from proteome-wide proteins. To enable the accurate comparison of the numerous LC/MS data sets, a genetic algorithm was developed to normalize the peptide retention data into a range (from 0 to 1), improving the peptide elution time reproducibility to 1%. The network developed in this study was based on amino acid residue composition and consists of 20 input nodes, 2 hidden nodes, and 1 output node. A data set of 7000 confidently identified peptides from the microorganism Deinococcus radiodurans was used for the training of the ANN. The ANN was then used to predict the elution times for another set of 5200 peptides tentatively identified by MS/MS from a different microorganism (Shewanella oneidensis). The model was found to predict the elution times of peptides with up to 54 amino acid residues (the longest peptide identified after tryptic digestion of S. oneidensis) with an average accuracy of 3%. This predictive capability was then used to distinguish with high confidence isobar peptides otherwise indistinguishable by accurate mass measurements as well as to uncover peptide misidentifications. Thus, integration of ANN peptide elution time prediction in the proteomic research will increase both the number of protein identifications and their confidence.

Tools

SciFinder Links

SciFinder subscribers:  Click to sign in | Not a SciFinder subscriber? Learn more at www.cas.org

History

  • Published In Issue March 01, 2003
  • Received for review August 7, 2002. Accepted December 31, 2002.

Recommend & Share

Related Content

Other ACS content by these authors: