A Chromosome-centric Human Proteome Project (C-HPP) to Characterize the Sets of Proteins Encoded in Chromosome 17
- Suli Liu
- Hogune Im
- Amos Bairoch
- Massimo Cristofanilli
- Rui Chen
- Eric W. Deutsch
- Stephen Dalton
- David Fenyo
- Susan Fanayan
- Chris Gates
- Pascale Gaudet
- Marina Hincapie
- Samir Hanash
- Hoguen Kim
- Seul-Ki Jeong
- Emma Lundberg
- George Mias
- Rajasree Menon
- Zhaomei Mu
- Edouard Nice
- Young-Ki Paik
- Mathias Uhlen
- Lance Wells
- Shiaw-Lin Wu
- Fangfei Yan
- Fan Zhang
- Yue Zhang
- Michael Snyder
- Gilbert S. Omenn
- Ronald C. Beavis
- William S. Hancock
Abstract
We report progress in assembling the parts list for chromosome 17 and illustrate the various processes that we have developed to integrate available data from diverse genomic and proteomic knowledge bases. As primary resources, we have used GPMDB, neXtProt, PeptideAtlas, Human Protein Atlas (HPA), and GeneCards. All sites share the common resource of Ensembl for the genome modeling information. We have defined the chromosome 17 parts list with the following information: 1169 protein-coding genes; the numbers of proteins confidently identified by various experimental approaches as documented in GPMDB, neXtProt, PeptideAtlas, and HPA; examples of typical data sets obtained by RNASeq and proteomic studies of epithelial-derived tumor cell lines (disease proteome) and a normal proteome (peripheral mononuclear cells); reported evidence of post-translational modifications (PTMs); and examples of alternative splice variants (ASVs). We have constructed a list of the 59 “missing” proteins as well as 201 proteins that have inconclusive mass spectrometric (MS) identifications. In this report we have defined a process to establish a baseline for the incorporation of new evidence on protein identification and characterization as well as related information from transcriptome analyses. This initial list of “missing” proteins will guide the selection of appropriate samples for discovery studies as well as antibody reagents. We have also illustrated the significant diversity of protein variants (including PTMs) using regions on chromosome 17 that contain important oncogenes. We emphasize the need for mandated deposition of proteomics data in public databases, the further development of improved PTM, ASV, and single nucleotide variant (SNV) databases, and the construction of Web sites that can integrate and regularly update such information.
In addition, we describe the distribution of both clustered and scattered sets of protein families on the chromosome. Since chromosome 17 is rich in cancer-associated genes, we have focused on the clustering of cancer-associated genes in such genomic regions and have used the ERBB2 amplicon as an example of the value of a proteogenomic approach, in which one integrates transcriptomic with proteomic information and captures evidence of coexpression through coordinated regulation.