Cargando…

PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations

Massively parallel or second-generation sequencing-based genomic studies continuously identify new genomic alterations that may lead to novel protein sequences, which are attractive candidates for disease biomarkers and therapeutic targets after proteomic validation. Integrative proteogenomic method...

Descripción completa

Detalles Bibliográficos
Autores principales: Wen, Bo, Wang, Xiaojing, Zhang, Bing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396417/
https://www.ncbi.nlm.nih.gov/pubmed/30610011
http://dx.doi.org/10.1101/gr.235028.118
_version_ 1783399246210269184
author Wen, Bo
Wang, Xiaojing
Zhang, Bing
author_facet Wen, Bo
Wang, Xiaojing
Zhang, Bing
author_sort Wen, Bo
collection PubMed
description Massively parallel or second-generation sequencing-based genomic studies continuously identify new genomic alterations that may lead to novel protein sequences, which are attractive candidates for disease biomarkers and therapeutic targets after proteomic validation. Integrative proteogenomic methods have been developed to use mass spectrometry (MS)-based proteomics data for such validation. These methods replace the reference sequence database in proteomic database searching with a customized protein database that incorporates sample- or disease-specific sequences derived from DNA or RNA sequencing, thus enabling the identification of novel protein sequences. Although useful, this spectrum-centric approach requires a full evaluation of all possible spectrum-peptide pairs, which is time-consuming, error-prone, and difficult to apply. Here, we present PepQuery, a peptide-centric approach that focuses on only novel DNA or protein sequences of interest. PepQuery allows quick and easy proteomic validation of genomic alterations without customized database construction. We demonstrated the sensitivity and specificity of the approach in validating completely novel proteins, novel splice junctions, and single amino acid variants using simulations and experimental data. Notably, enabling unrestricted modification searching in PepQuery reduced false positives by up to 95%. We implemented PepQuery as both web-based and stand-alone applications. The web version provides direct access to more than half a billion MS/MS spectra from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and other cancer proteomic studies. The stand-alone version supports batch analysis and user-provided MS/MS data. PepQuery will increase the usage of proteogenomics beyond the proteomics community and will broaden the application of proteogenomics in personalized medicine.
format Online
Article
Text
id pubmed-6396417
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-63964172019-09-01 PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations Wen, Bo Wang, Xiaojing Zhang, Bing Genome Res Resource Massively parallel or second-generation sequencing-based genomic studies continuously identify new genomic alterations that may lead to novel protein sequences, which are attractive candidates for disease biomarkers and therapeutic targets after proteomic validation. Integrative proteogenomic methods have been developed to use mass spectrometry (MS)-based proteomics data for such validation. These methods replace the reference sequence database in proteomic database searching with a customized protein database that incorporates sample- or disease-specific sequences derived from DNA or RNA sequencing, thus enabling the identification of novel protein sequences. Although useful, this spectrum-centric approach requires a full evaluation of all possible spectrum-peptide pairs, which is time-consuming, error-prone, and difficult to apply. Here, we present PepQuery, a peptide-centric approach that focuses on only novel DNA or protein sequences of interest. PepQuery allows quick and easy proteomic validation of genomic alterations without customized database construction. We demonstrated the sensitivity and specificity of the approach in validating completely novel proteins, novel splice junctions, and single amino acid variants using simulations and experimental data. Notably, enabling unrestricted modification searching in PepQuery reduced false positives by up to 95%. We implemented PepQuery as both web-based and stand-alone applications. The web version provides direct access to more than half a billion MS/MS spectra from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and other cancer proteomic studies. The stand-alone version supports batch analysis and user-provided MS/MS data. PepQuery will increase the usage of proteogenomics beyond the proteomics community and will broaden the application of proteogenomics in personalized medicine. Cold Spring Harbor Laboratory Press 2019-03 /pmc/articles/PMC6396417/ /pubmed/30610011 http://dx.doi.org/10.1101/gr.235028.118 Text en © 2019 Wen et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Resource
Wen, Bo
Wang, Xiaojing
Zhang, Bing
PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations
title PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations
title_full PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations
title_fullStr PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations
title_full_unstemmed PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations
title_short PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations
title_sort pepquery enables fast, accurate, and convenient proteomic validation of novel genomic alterations
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396417/
https://www.ncbi.nlm.nih.gov/pubmed/30610011
http://dx.doi.org/10.1101/gr.235028.118
work_keys_str_mv AT wenbo pepqueryenablesfastaccurateandconvenientproteomicvalidationofnovelgenomicalterations
AT wangxiaojing pepqueryenablesfastaccurateandconvenientproteomicvalidationofnovelgenomicalterations
AT zhangbing pepqueryenablesfastaccurateandconvenientproteomicvalidationofnovelgenomicalterations