Cargando…

Tandem Mass Spectrum Identification via Cascaded Search

[Image: see text] Accurate assignment of peptide sequences to observed fragmentation spectra is hindered by the large number of hypotheses that must be considered for each observed spectrum. A high score assigned to a particular peptide–spectrum match (PSM) may not end up being statistically signifi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kertesz-Farkas, Attila, Keich, Uri, Noble, William Stafford
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Chemical Society 2015
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4533645/ https://www.ncbi.nlm.nih.gov/pubmed/26084232 http://dx.doi.org/10.1021/pr501173s

_version_	1782385367696015360
author	Kertesz-Farkas, Attila Keich, Uri Noble, William Stafford
author_facet	Kertesz-Farkas, Attila Keich, Uri Noble, William Stafford
author_sort	Kertesz-Farkas, Attila
collection	PubMed
description	[Image: see text] Accurate assignment of peptide sequences to observed fragmentation spectra is hindered by the large number of hypotheses that must be considered for each observed spectrum. A high score assigned to a particular peptide–spectrum match (PSM) may not end up being statistically significant after multiple testing correction. Researchers can mitigate this problem by controlling the hypothesis space in various ways: considering only peptides resulting from enzymatic cleavages, ignoring possible post-translational modifications or single nucleotide variants, etc. However, these strategies sacrifice identifications of spectra generated by rarer types of peptides. In this work, we introduce a statistical testing framework, cascade search, that directly addresses this problem. The method requires that the user specify a priori a statistical confidence threshold as well as a series of peptide databases. For instance, such a cascade of databases could include fully tryptic, semitryptic, and nonenzymatic peptides or peptides with increasing numbers of modifications. Cascaded search then gradually expands the list of candidate peptides from more likely peptides toward rare peptides, sequestering at each stage any spectrum that is identified with a specified statistical confidence. We compare cascade search to a standard procedure that lumps all of the peptides into a single database, as well as to a previously described group FDR procedure that computes the FDR separately within each database. We demonstrate, using simulated and real data, that cascade search identifies more spectra at a fixed FDR threshold than with either the ungrouped or grouped approach. Cascade search thus provides a general method for maximizing the number of identified spectra in a statistically rigorous fashion.
format	Online Article Text
id	pubmed-4533645
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	American Chemical Society
record_format	MEDLINE/PubMed
spelling	pubmed-45336452016-06-18 Tandem Mass Spectrum Identification via Cascaded Search Kertesz-Farkas, Attila Keich, Uri Noble, William Stafford J Proteome Res [Image: see text] Accurate assignment of peptide sequences to observed fragmentation spectra is hindered by the large number of hypotheses that must be considered for each observed spectrum. A high score assigned to a particular peptide–spectrum match (PSM) may not end up being statistically significant after multiple testing correction. Researchers can mitigate this problem by controlling the hypothesis space in various ways: considering only peptides resulting from enzymatic cleavages, ignoring possible post-translational modifications or single nucleotide variants, etc. However, these strategies sacrifice identifications of spectra generated by rarer types of peptides. In this work, we introduce a statistical testing framework, cascade search, that directly addresses this problem. The method requires that the user specify a priori a statistical confidence threshold as well as a series of peptide databases. For instance, such a cascade of databases could include fully tryptic, semitryptic, and nonenzymatic peptides or peptides with increasing numbers of modifications. Cascaded search then gradually expands the list of candidate peptides from more likely peptides toward rare peptides, sequestering at each stage any spectrum that is identified with a specified statistical confidence. We compare cascade search to a standard procedure that lumps all of the peptides into a single database, as well as to a previously described group FDR procedure that computes the FDR separately within each database. We demonstrate, using simulated and real data, that cascade search identifies more spectra at a fixed FDR threshold than with either the ungrouped or grouped approach. Cascade search thus provides a general method for maximizing the number of identified spectra in a statistically rigorous fashion. American Chemical Society 2015-06-18 2015-08-07 /pmc/articles/PMC4533645/ /pubmed/26084232 http://dx.doi.org/10.1021/pr501173s Text en Copyright © 2015 American Chemical Society This is an open access article published under an ACS AuthorChoice License (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html) , which permits copying and redistribution of the article or any adaptations for non-commercial purposes.
spellingShingle	Kertesz-Farkas, Attila Keich, Uri Noble, William Stafford Tandem Mass Spectrum Identification via Cascaded Search
title	Tandem Mass Spectrum Identification via Cascaded Search
title_full	Tandem Mass Spectrum Identification via Cascaded Search
title_fullStr	Tandem Mass Spectrum Identification via Cascaded Search
title_full_unstemmed	Tandem Mass Spectrum Identification via Cascaded Search
title_short	Tandem Mass Spectrum Identification via Cascaded Search
title_sort	tandem mass spectrum identification via cascaded search
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4533645/ https://www.ncbi.nlm.nih.gov/pubmed/26084232 http://dx.doi.org/10.1021/pr501173s
work_keys_str_mv	AT kerteszfarkasattila tandemmassspectrumidentificationviacascadedsearch AT keichuri tandemmassspectrumidentificationviacascadedsearch AT noblewilliamstafford tandemmassspectrumidentificationviacascadedsearch

Tandem Mass Spectrum Identification via Cascaded Search

Ejemplares similares