Cargando…
Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library
The unambiguous assignment of tandem mass spectra (MS/MS) to peptide sequences remains a key unsolved problem in proteomics. Spectral library search strategies have emerged as a promising alternative for peptide identification, in which MS/MS spectra are directly compared against a reference library...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The American Society for Biochemistry and Molecular Biology
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3134071/ https://www.ncbi.nlm.nih.gov/pubmed/21532008 http://dx.doi.org/10.1074/mcp.M111.007666 |
_version_ | 1782207955852066816 |
---|---|
author | Yen, Chia-Yu Houel, Stephane Ahn, Natalie G. Old, William M. |
author_facet | Yen, Chia-Yu Houel, Stephane Ahn, Natalie G. Old, William M. |
author_sort | Yen, Chia-Yu |
collection | PubMed |
description | The unambiguous assignment of tandem mass spectra (MS/MS) to peptide sequences remains a key unsolved problem in proteomics. Spectral library search strategies have emerged as a promising alternative for peptide identification, in which MS/MS spectra are directly compared against a reference library of confidently assigned spectra. Two problems relate to library size. First, reference spectral libraries are limited to rediscovery of previously identified peptides and are not applicable to new peptides, because of their incomplete coverage of the human proteome. Second, problems arise when searching a spectral library the size of the entire human proteome. We observed that traditional dot product scoring methods do not scale well with spectral library size, showing reduction in sensitivity when library size is increased. We show that this problem can be addressed by optimizing scoring metrics for spectrum-to-spectrum searches with large spectral libraries. MS/MS spectra for the 1.3 million predicted tryptic peptides in the human proteome are simulated using a kinetic fragmentation model (MassAnalyzer version2.1) to create a proteome-wide simulated spectral library. Searches of the simulated library increase MS/MS assignments by 24% compared with Mascot, when using probabilistic and rank based scoring methods. The proteome-wide coverage of the simulated library leads to 11% increase in unique peptide assignments, compared with parallel searches of a reference spectral library. Further improvement is attained when reference spectra and simulated spectra are combined into a hybrid spectral library, yielding 52% increased MS/MS assignments compared with Mascot searches. Our study demonstrates the advantages of using probabilistic and rank based scores to improve performance of spectrum-to-spectrum search strategies. |
format | Online Article Text |
id | pubmed-3134071 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | The American Society for Biochemistry and Molecular Biology |
record_format | MEDLINE/PubMed |
spelling | pubmed-31340712012-01-13 Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library Yen, Chia-Yu Houel, Stephane Ahn, Natalie G. Old, William M. Mol Cell Proteomics Technological Innovation and Resources The unambiguous assignment of tandem mass spectra (MS/MS) to peptide sequences remains a key unsolved problem in proteomics. Spectral library search strategies have emerged as a promising alternative for peptide identification, in which MS/MS spectra are directly compared against a reference library of confidently assigned spectra. Two problems relate to library size. First, reference spectral libraries are limited to rediscovery of previously identified peptides and are not applicable to new peptides, because of their incomplete coverage of the human proteome. Second, problems arise when searching a spectral library the size of the entire human proteome. We observed that traditional dot product scoring methods do not scale well with spectral library size, showing reduction in sensitivity when library size is increased. We show that this problem can be addressed by optimizing scoring metrics for spectrum-to-spectrum searches with large spectral libraries. MS/MS spectra for the 1.3 million predicted tryptic peptides in the human proteome are simulated using a kinetic fragmentation model (MassAnalyzer version2.1) to create a proteome-wide simulated spectral library. Searches of the simulated library increase MS/MS assignments by 24% compared with Mascot, when using probabilistic and rank based scoring methods. The proteome-wide coverage of the simulated library leads to 11% increase in unique peptide assignments, compared with parallel searches of a reference spectral library. Further improvement is attained when reference spectra and simulated spectra are combined into a hybrid spectral library, yielding 52% increased MS/MS assignments compared with Mascot searches. Our study demonstrates the advantages of using probabilistic and rank based scores to improve performance of spectrum-to-spectrum search strategies. The American Society for Biochemistry and Molecular Biology 2011-07 2011-04-30 /pmc/articles/PMC3134071/ /pubmed/21532008 http://dx.doi.org/10.1074/mcp.M111.007666 Text en © 2011 by The American Society for Biochemistry and Molecular Biology, Inc. Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) applies to Author Choice Articles |
spellingShingle | Technological Innovation and Resources Yen, Chia-Yu Houel, Stephane Ahn, Natalie G. Old, William M. Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library |
title | Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library |
title_full | Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library |
title_fullStr | Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library |
title_full_unstemmed | Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library |
title_short | Spectrum-to-Spectrum Searching Using a Proteome-wide Spectral Library |
title_sort | spectrum-to-spectrum searching using a proteome-wide spectral library |
topic | Technological Innovation and Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3134071/ https://www.ncbi.nlm.nih.gov/pubmed/21532008 http://dx.doi.org/10.1074/mcp.M111.007666 |
work_keys_str_mv | AT yenchiayu spectrumtospectrumsearchingusingaproteomewidespectrallibrary AT houelstephane spectrumtospectrumsearchingusingaproteomewidespectrallibrary AT ahnnatalieg spectrumtospectrumsearchingusingaproteomewidespectrallibrary AT oldwilliamm spectrumtospectrumsearchingusingaproteomewidespectrallibrary |