Cargando…

SAMPI: Protein Identification with Mass Spectra Alignments

BACKGROUND: Mass spectrometry based peptide mass fingerprints (PMFs) offer a fast, efficient, and robust method for protein identification. A protein is digested (usually by trypsin) and its mass spectrum is compared to simulated spectra for protein sequences in a database. However, existing tools f...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaltenbach, Hans-Michael, Wilke, Andreas, Böcker, Sebastian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1851022/
https://www.ncbi.nlm.nih.gov/pubmed/17386090
http://dx.doi.org/10.1186/1471-2105-8-102
_version_ 1782132959621414912
author Kaltenbach, Hans-Michael
Wilke, Andreas
Böcker, Sebastian
author_facet Kaltenbach, Hans-Michael
Wilke, Andreas
Böcker, Sebastian
author_sort Kaltenbach, Hans-Michael
collection PubMed
description BACKGROUND: Mass spectrometry based peptide mass fingerprints (PMFs) offer a fast, efficient, and robust method for protein identification. A protein is digested (usually by trypsin) and its mass spectrum is compared to simulated spectra for protein sequences in a database. However, existing tools for analyzing PMFs often suffer from missing or heuristic analysis of the significance of search results and insufficient handling of missing and additional peaks. RESULTS: We present an unified framework for analyzing Peptide Mass Fingerprints that offers a number of advantages over existing methods: First, comparison of mass spectra is based on a scoring function that can be custom-designed for certain applications and explicitly takes missing and additional peaks into account. The method is able to simulate almost every additive scoring scheme. Second, we present an efficient deterministic method for assessing the significance of a protein hit, independent of the underlying scoring function and sequence database. We prove the applicability of our approach using biological mass spectrometry data and compare our results to the standard software Mascot. CONCLUSION: The proposed framework for analyzing Peptide Mass Fingerprints shows performance comparable to Mascot on small peak lists. Introducing more noise peaks, we are able to keep identification rates at a similar level by using the flexibility introduced by scoring schemes.
format Text
id pubmed-1851022
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18510222007-04-11 SAMPI: Protein Identification with Mass Spectra Alignments Kaltenbach, Hans-Michael Wilke, Andreas Böcker, Sebastian BMC Bioinformatics Research Article BACKGROUND: Mass spectrometry based peptide mass fingerprints (PMFs) offer a fast, efficient, and robust method for protein identification. A protein is digested (usually by trypsin) and its mass spectrum is compared to simulated spectra for protein sequences in a database. However, existing tools for analyzing PMFs often suffer from missing or heuristic analysis of the significance of search results and insufficient handling of missing and additional peaks. RESULTS: We present an unified framework for analyzing Peptide Mass Fingerprints that offers a number of advantages over existing methods: First, comparison of mass spectra is based on a scoring function that can be custom-designed for certain applications and explicitly takes missing and additional peaks into account. The method is able to simulate almost every additive scoring scheme. Second, we present an efficient deterministic method for assessing the significance of a protein hit, independent of the underlying scoring function and sequence database. We prove the applicability of our approach using biological mass spectrometry data and compare our results to the standard software Mascot. CONCLUSION: The proposed framework for analyzing Peptide Mass Fingerprints shows performance comparable to Mascot on small peak lists. Introducing more noise peaks, we are able to keep identification rates at a similar level by using the flexibility introduced by scoring schemes. BioMed Central 2007-03-26 /pmc/articles/PMC1851022/ /pubmed/17386090 http://dx.doi.org/10.1186/1471-2105-8-102 Text en Copyright © 2007 Kaltenbach et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kaltenbach, Hans-Michael
Wilke, Andreas
Böcker, Sebastian
SAMPI: Protein Identification with Mass Spectra Alignments
title SAMPI: Protein Identification with Mass Spectra Alignments
title_full SAMPI: Protein Identification with Mass Spectra Alignments
title_fullStr SAMPI: Protein Identification with Mass Spectra Alignments
title_full_unstemmed SAMPI: Protein Identification with Mass Spectra Alignments
title_short SAMPI: Protein Identification with Mass Spectra Alignments
title_sort sampi: protein identification with mass spectra alignments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1851022/
https://www.ncbi.nlm.nih.gov/pubmed/17386090
http://dx.doi.org/10.1186/1471-2105-8-102
work_keys_str_mv AT kaltenbachhansmichael sampiproteinidentificationwithmassspectraalignments
AT wilkeandreas sampiproteinidentificationwithmassspectraalignments
AT bockersebastian sampiproteinidentificationwithmassspectraalignments