Cargando…

The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment

BACKGROUND: Most mass spectrometry (MS) based proteomic studies depend on searching acquired tandem mass (MS/MS) spectra against databases of known protein sequences. In these experiments, however, a large number of high quality spectra remain unassigned. These spectra may correspond to novel peptid...

Descripción completa

Detalles Bibliográficos
Autores principales: Ning, Kang, Nesvizhskii, Alexey I
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3024872/
https://www.ncbi.nlm.nih.gov/pubmed/21172049
http://dx.doi.org/10.1186/1471-2105-11-S11-S14
_version_ 1782196825039568896
author Ning, Kang
Nesvizhskii, Alexey I
author_facet Ning, Kang
Nesvizhskii, Alexey I
author_sort Ning, Kang
collection PubMed
description BACKGROUND: Most mass spectrometry (MS) based proteomic studies depend on searching acquired tandem mass (MS/MS) spectra against databases of known protein sequences. In these experiments, however, a large number of high quality spectra remain unassigned. These spectra may correspond to novel peptides not present in the database, especially those corresponding to novel alternative splice (AS) forms. Recently, fast and comprehensive profiling of mammalian genomes using deep sequencing (i.e. RNA-Seq) has become possible. MS-based proteomics can potentially be used as an aid for protein-level validation of novel AS events observed in RNA-Seq data. RESULTS: In this work, we have used publicly available mouse tissue proteomic and RNA-Seq datasets and have examined the feasibility of using MS data for the identification of novel AS forms by searching MS/MS spectra against translated mRNA sequences derived from RNA-Seq data. A significant correlation between the likelihood of identifying a peptide from MS/MS data and the number of reads in RNA-Seq data for the same gene was observed. Based on in silico experiments, it was also observed that only a fraction of novel AS forms identified from RNA-Seq had the corresponding junction peptide compatible with MS/MS sequencing. The number of novel peptides that were actually identified from MS/MS spectra was substantially lower than the number expected based on in silico analysis. CONCLUSIONS: The ability to confirm novel AS forms from MS/MS data in the dataset analyzed was found to be quite limited. This can be explained in part by low abundance of many novel transcripts, with the abundance of their corresponding protein products falling below the limit of detection by MS.
format Text
id pubmed-3024872
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30248722011-01-22 The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment Ning, Kang Nesvizhskii, Alexey I BMC Bioinformatics Research BACKGROUND: Most mass spectrometry (MS) based proteomic studies depend on searching acquired tandem mass (MS/MS) spectra against databases of known protein sequences. In these experiments, however, a large number of high quality spectra remain unassigned. These spectra may correspond to novel peptides not present in the database, especially those corresponding to novel alternative splice (AS) forms. Recently, fast and comprehensive profiling of mammalian genomes using deep sequencing (i.e. RNA-Seq) has become possible. MS-based proteomics can potentially be used as an aid for protein-level validation of novel AS events observed in RNA-Seq data. RESULTS: In this work, we have used publicly available mouse tissue proteomic and RNA-Seq datasets and have examined the feasibility of using MS data for the identification of novel AS forms by searching MS/MS spectra against translated mRNA sequences derived from RNA-Seq data. A significant correlation between the likelihood of identifying a peptide from MS/MS data and the number of reads in RNA-Seq data for the same gene was observed. Based on in silico experiments, it was also observed that only a fraction of novel AS forms identified from RNA-Seq had the corresponding junction peptide compatible with MS/MS sequencing. The number of novel peptides that were actually identified from MS/MS spectra was substantially lower than the number expected based on in silico analysis. CONCLUSIONS: The ability to confirm novel AS forms from MS/MS data in the dataset analyzed was found to be quite limited. This can be explained in part by low abundance of many novel transcripts, with the abundance of their corresponding protein products falling below the limit of detection by MS. BioMed Central 2010-12-14 /pmc/articles/PMC3024872/ /pubmed/21172049 http://dx.doi.org/10.1186/1471-2105-11-S11-S14 Text en Copyright ©2010 Nesvizhskii and Ning et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Ning, Kang
Nesvizhskii, Alexey I
The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment
title The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment
title_full The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment
title_fullStr The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment
title_full_unstemmed The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment
title_short The utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from RNA-Seq data: a preliminary assessment
title_sort utility of mass spectrometry-based proteomic data for validation of novel alternative splice forms reconstructed from rna-seq data: a preliminary assessment
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3024872/
https://www.ncbi.nlm.nih.gov/pubmed/21172049
http://dx.doi.org/10.1186/1471-2105-11-S11-S14
work_keys_str_mv AT ningkang theutilityofmassspectrometrybasedproteomicdataforvalidationofnovelalternativespliceformsreconstructedfromrnaseqdataapreliminaryassessment
AT nesvizhskiialexeyi theutilityofmassspectrometrybasedproteomicdataforvalidationofnovelalternativespliceformsreconstructedfromrnaseqdataapreliminaryassessment
AT ningkang utilityofmassspectrometrybasedproteomicdataforvalidationofnovelalternativespliceformsreconstructedfromrnaseqdataapreliminaryassessment
AT nesvizhskiialexeyi utilityofmassspectrometrybasedproteomicdataforvalidationofnovelalternativespliceformsreconstructedfromrnaseqdataapreliminaryassessment