Cargando…

An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome

BACKGROUND: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool. RESULTS: Using a search against an in silico translation of the entire human genome, combined with a series of annotation f...

Descripción completa

Detalles Bibliográficos
Autores principales: Bitton, Danny A., Smith, Duncan L., Connolly, Yvonne, Scutt, Paul J., Miller, Crispin J.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2812506/
https://www.ncbi.nlm.nih.gov/pubmed/20126623
http://dx.doi.org/10.1371/journal.pone.0008949
_version_ 1782176835305472000
author Bitton, Danny A.
Smith, Duncan L.
Connolly, Yvonne
Scutt, Paul J.
Miller, Crispin J.
author_facet Bitton, Danny A.
Smith, Duncan L.
Connolly, Yvonne
Scutt, Paul J.
Miller, Crispin J.
author_sort Bitton, Danny A.
collection PubMed
description BACKGROUND: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool. RESULTS: Using a search against an in silico translation of the entire human genome, combined with a series of annotation filters, we identified 346 putative novel peptides [False Discovery Rate (FDR)<5%] in a MS dataset derived from two human breast epithelial cell lines. A subset of these were then successfully validated by a different MS technique. Two of these correspond to novel isoforms of Heterogeneous Ribonuclear Proteins, while the rest correspond to novel loci. CONCLUSIONS: MS technology can be used for ab initio gene discovery in human data, which, since it is based on different underlying assumptions, identifies protein-coding genes not found by other techniques. As MS technology continues to evolve, such approaches will become increasingly powerful.
format Text
id pubmed-2812506
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28125062010-02-02 An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome Bitton, Danny A. Smith, Duncan L. Connolly, Yvonne Scutt, Paul J. Miller, Crispin J. PLoS One Research Article BACKGROUND: Most protein mass spectrometry (MS) experiments rely on searches against a database of known or predicted proteins, limiting their ability as a gene discovery tool. RESULTS: Using a search against an in silico translation of the entire human genome, combined with a series of annotation filters, we identified 346 putative novel peptides [False Discovery Rate (FDR)<5%] in a MS dataset derived from two human breast epithelial cell lines. A subset of these were then successfully validated by a different MS technique. Two of these correspond to novel isoforms of Heterogeneous Ribonuclear Proteins, while the rest correspond to novel loci. CONCLUSIONS: MS technology can be used for ab initio gene discovery in human data, which, since it is based on different underlying assumptions, identifies protein-coding genes not found by other techniques. As MS technology continues to evolve, such approaches will become increasingly powerful. Public Library of Science 2010-01-28 /pmc/articles/PMC2812506/ /pubmed/20126623 http://dx.doi.org/10.1371/journal.pone.0008949 Text en Bitton et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Bitton, Danny A.
Smith, Duncan L.
Connolly, Yvonne
Scutt, Paul J.
Miller, Crispin J.
An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome
title An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome
title_full An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome
title_fullStr An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome
title_full_unstemmed An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome
title_short An Integrated Mass-Spectrometry Pipeline Identifies Novel Protein Coding-Regions in the Human Genome
title_sort integrated mass-spectrometry pipeline identifies novel protein coding-regions in the human genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2812506/
https://www.ncbi.nlm.nih.gov/pubmed/20126623
http://dx.doi.org/10.1371/journal.pone.0008949
work_keys_str_mv AT bittondannya anintegratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT smithduncanl anintegratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT connollyyvonne anintegratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT scuttpaulj anintegratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT millercrispinj anintegratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT bittondannya integratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT smithduncanl integratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT connollyyvonne integratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT scuttpaulj integratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome
AT millercrispinj integratedmassspectrometrypipelineidentifiesnovelproteincodingregionsinthehumangenome