Cargando…

Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library

The unambiguous assignment of tandem mass spectra (MS/MS) to peptide sequences remains a key unsolved problem in proteomics. Spectral library search strategies have emerged as a promising alternative for peptide identification, in which MS/MS spectra are directly compared against a reference library...

Descripción completa

Detalles Bibliográficos
Autores principales: Yen, Chia-Yu, Houel, Stephane, Ahn, Natalie G., Old, William M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The American Society for Biochemistry and Molecular Biology 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3134071/
https://www.ncbi.nlm.nih.gov/pubmed/21532008
http://dx.doi.org/10.1074/mcp.M111.007666
_version_ 1782207955852066816
author Yen, Chia-Yu
Houel, Stephane
Ahn, Natalie G.
Old, William M.
author_facet Yen, Chia-Yu
Houel, Stephane
Ahn, Natalie G.
Old, William M.
author_sort Yen, Chia-Yu
collection PubMed
description The unambiguous assignment of tandem mass spectra (MS/MS) to peptide sequences remains a key unsolved problem in proteomics. Spectral library search strategies have emerged as a promising alternative for peptide identification, in which MS/MS spectra are directly compared against a reference library of confidently assigned spectra. Two problems relate to library size. First, reference spectral libraries are limited to rediscovery of previously identified peptides and are not applicable to new peptides, because of their incomplete coverage of the human proteome. Second, problems arise when searching a spectral library the size of the entire human proteome. We observed that traditional dot product scoring methods do not scale well with spectral library size, showing reduction in sensitivity when library size is increased. We show that this problem can be addressed by optimizing scoring metrics for spectrum-to-spectrum searches with large spectral libraries. MS/MS spectra for the 1.3 million predicted tryptic peptides in the human proteome are simulated using a kinetic fragmentation model (MassAnalyzer version2.1) to create a proteome-wide simulated spectral library. Searches of the simulated library increase MS/MS assignments by 24% compared with Mascot, when using probabilistic and rank based scoring methods. The proteome-wide coverage of the simulated library leads to 11% increase in unique peptide assignments, compared with parallel searches of a reference spectral library. Further improvement is attained when reference spectra and simulated spectra are combined into a hybrid spectral library, yielding 52% increased MS/MS assignments compared with Mascot searches. Our study demonstrates the advantages of using probabilistic and rank based scores to improve performance of spectrum-to-spectrum search strategies.
format Online
Article
Text
id pubmed-3134071
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher The American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-31340712012-01-13 Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library Yen, Chia-Yu Houel, Stephane Ahn, Natalie G. Old, William M. Mol Cell Proteomics Technological Innovation and Resources The unambiguous assignment of tandem mass spectra (MS/MS) to peptide sequences remains a key unsolved problem in proteomics. Spectral library search strategies have emerged as a promising alternative for peptide identification, in which MS/MS spectra are directly compared against a reference library of confidently assigned spectra. Two problems relate to library size. First, reference spectral libraries are limited to rediscovery of previously identified peptides and are not applicable to new peptides, because of their incomplete coverage of the human proteome. Second, problems arise when searching a spectral library the size of the entire human proteome. We observed that traditional dot product scoring methods do not scale well with spectral library size, showing reduction in sensitivity when library size is increased. We show that this problem can be addressed by optimizing scoring metrics for spectrum-to-spectrum searches with large spectral libraries. MS/MS spectra for the 1.3 million predicted tryptic peptides in the human proteome are simulated using a kinetic fragmentation model (MassAnalyzer version2.1) to create a proteome-wide simulated spectral library. Searches of the simulated library increase MS/MS assignments by 24% compared with Mascot, when using probabilistic and rank based scoring methods. The proteome-wide coverage of the simulated library leads to 11% increase in unique peptide assignments, compared with parallel searches of a reference spectral library. Further improvement is attained when reference spectra and simulated spectra are combined into a hybrid spectral library, yielding 52% increased MS/MS assignments compared with Mascot searches. Our study demonstrates the advantages of using probabilistic and rank based scores to improve performance of spectrum-to-spectrum search strategies. The American Society for Biochemistry and Molecular Biology 2011-07 2011-04-30 /pmc/articles/PMC3134071/ /pubmed/21532008 http://dx.doi.org/10.1074/mcp.M111.007666 Text en © 2011 by The American Society for Biochemistry and Molecular Biology, Inc. Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) applies to Author Choice Articles
spellingShingle Technological Innovation and Resources
Yen, Chia-Yu
Houel, Stephane
Ahn, Natalie G.
Old, William M.
Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library
title Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library
title_full Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library
title_fullStr Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library
title_full_unstemmed Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library
title_short Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library
title_sort spectrum-to-spectrum searching using a proteome-wide spectral library
topic Technological Innovation and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3134071/
https://www.ncbi.nlm.nih.gov/pubmed/21532008
http://dx.doi.org/10.1074/mcp.M111.007666
work_keys_str_mv AT yenchiayu spectrumtospectrumsearchingusingaproteomewidespectrallibrary
AT houelstephane spectrumtospectrumsearchingusingaproteomewidespectrallibrary
AT ahnnatalieg spectrumtospectrumsearchingusingaproteomewidespectrallibrary
AT oldwilliamm spectrumtospectrumsearchingusingaproteomewidespectrallibrary