Cargando…

An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome

BACKGROUND: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool. RESULTS: Using a search against an in silico translation of the entire human genome, combined with a series of annotation f...

Descripción completa

Detalles Bibliográficos
Autores principales: Bitton, Danny A., Smith, Duncan L., Connolly, Yvonne, Scutt, Paul J., Miller, Crispin J.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2812506/
https://www.ncbi.nlm.nih.gov/pubmed/20126623
http://dx.doi.org/10.1371/journal.pone.0008949
Descripción
Sumario:BACKGROUND: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool. RESULTS: Using a search against an in silico translation of the entire human genome, combined with a series of annotation filters, we identified 346 putative novel peptides [False Discovery Rate (FDR)<5%] in a MS dataset derived from two human breast epithelial cell lines. A subset of these were then successfully validated by a different MS technique. Two of these correspond to novel isoforms of Heterogeneous Ribonuclear Proteins, while the rest correspond to novel loci. CONCLUSIONS: MS technology can be used for ab initio gene discovery in human data, which, since it is based on different underlying assumptions, identifies protein-coding genes not found by other techniques. As MS technology continues to evolve, such approaches will become increasingly powerful.