Identification and characterization of proteins encoded by chromosome 12 as part of chromosome-centric human proteome project

Pinto, Sneha M (2014) Identification and characterization of proteins encoded by chromosome 12 as part of chromosome-centric human proteome project. Journal of Proteome Research, 13 (7). pp. 3166-3177.

Abstract

The Chromosome-centric Human Proteome Project (C-HPP) is a global initiative to comprehensively characterize the proteins encoded by genes across all human chromosomes, with teams focusing on individual chromosomes. Here, we report mass spectrometry-based identification and characterization of proteins encoded by genes on chromosome 12. Our study is based on proteomic profiling of 30 different histologically normal human tissues and cell types using high-resolution mass spectrometry. In our analysis, we identified 1,535 proteins encoded by 836 genes on human chromosome 12. These include 89 genes designated as "missing proteins" by neXtProt because they had no prior evidence from either mass spectrometry or antibody-based detection methods. We identified several variant peptides that reflected coding SNPs annotated in the dbSNP database. We also confirmed the start sites of ∼200 proteins by identifying protein N-terminal acetylated peptides, and we identified alternative start sites for 11 proteins that had not previously been annotated in public databases. Most importantly, we identified 12 novel protein-coding regions on chromosome 12 using our proteogenomics strategy. All 12 regions have been annotated as pseudogenes in public databases. This study demonstrates that there is scope for significantly improving the annotation of protein-coding genes in the human genome using mass spectrometry-derived data. Individual efforts as part of the C-HPP initiative should contribute significantly toward enriching human protein annotation. The data have been deposited to ProteomeXchange with identifier PXD000561.

Item Type: Article
Uncontrolled Keywords: Proteomics; proteogenomics; non-coding RNA; pseudogenes; open reading frame.
Subjects: Research > Research Center - Health Sciences
Depositing User: KMC Manipal
Date Deposited: 13 Feb 2015 06:40
Last Modified: 13 Feb 2015 06:40
URI: http://eprints.manipal.edu/id/eprint/141898
