Cargando…
Quality assessment of tandem mass spectra using support vector machine (SVM)
BACKGROUND: Tandem mass spectrometry has become particularly useful for the rapid identification and characterization of protein components of complex biological mixtures. Powerful database search methods have been developed for the peptide identification, such as SEQUEST and MASCOT, which are imple...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648784/ https://www.ncbi.nlm.nih.gov/pubmed/19208151 http://dx.doi.org/10.1186/1471-2105-10-S1-S49 |
_version_ | 1782164987457830912 |
---|---|
author | Zou, An-Min Wu, Fang-Xiang Ding, Jia-Rui Poirier, Guy G |
author_facet | Zou, An-Min Wu, Fang-Xiang Ding, Jia-Rui Poirier, Guy G |
author_sort | Zou, An-Min |
collection | PubMed |
description | BACKGROUND: Tandem mass spectrometry has become particularly useful for the rapid identification and characterization of protein components of complex biological mixtures. Powerful database search methods have been developed for the peptide identification, such as SEQUEST and MASCOT, which are implemented by comparing the mass spectra obtained from unknown proteins or peptides with theoretically predicted spectra derived from protein databases. However, the majority of spectra generated from a mass spectrometry experiment are of too poor quality to be interpreted while some of spectra with high quality cannot be interpreted by one method but perhaps by others. Hence a filtering algorithm that removes those spectra with poor quality prior to the database search is appealing. RESULTS: This paper proposes a support vector machine (SVM) based approach to assess the quality of tandem mass spectra. Each mass spectrum is mapping into the 16 proposed features to describe its quality. Based the results from SEQUEST, four SVM classifiers with the input of the 16 features are trained and tested on ISB data and TOV data, respectively. The superior performance of the proposed SVM classifiers is illustrated both by the comparison with the existing classifiers and by the validation in terms of MASCOT search results. CONCLUSION: The proposed method can be employed to effectively remove the poor quality spectra before the spectral searching, and also to find the more peptides or post-translational peptides from spectra with high quality using different search engines or de novo method. |
format | Text |
id | pubmed-2648784 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26487842009-03-03 Quality assessment of tandem mass spectra using support vector machine (SVM) Zou, An-Min Wu, Fang-Xiang Ding, Jia-Rui Poirier, Guy G BMC Bioinformatics Research BACKGROUND: Tandem mass spectrometry has become particularly useful for the rapid identification and characterization of protein components of complex biological mixtures. Powerful database search methods have been developed for the peptide identification, such as SEQUEST and MASCOT, which are implemented by comparing the mass spectra obtained from unknown proteins or peptides with theoretically predicted spectra derived from protein databases. However, the majority of spectra generated from a mass spectrometry experiment are of too poor quality to be interpreted while some of spectra with high quality cannot be interpreted by one method but perhaps by others. Hence a filtering algorithm that removes those spectra with poor quality prior to the database search is appealing. RESULTS: This paper proposes a support vector machine (SVM) based approach to assess the quality of tandem mass spectra. Each mass spectrum is mapping into the 16 proposed features to describe its quality. Based the results from SEQUEST, four SVM classifiers with the input of the 16 features are trained and tested on ISB data and TOV data, respectively. The superior performance of the proposed SVM classifiers is illustrated both by the comparison with the existing classifiers and by the validation in terms of MASCOT search results. CONCLUSION: The proposed method can be employed to effectively remove the poor quality spectra before the spectral searching, and also to find the more peptides or post-translational peptides from spectra with high quality using different search engines or de novo method. BioMed Central 2009-01-30 /pmc/articles/PMC2648784/ /pubmed/19208151 http://dx.doi.org/10.1186/1471-2105-10-S1-S49 Text en Copyright © 2009 Zou et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Zou, An-Min Wu, Fang-Xiang Ding, Jia-Rui Poirier, Guy G Quality assessment of tandem mass spectra using support vector machine (SVM) |
title | Quality assessment of tandem mass spectra using support vector machine (SVM) |
title_full | Quality assessment of tandem mass spectra using support vector machine (SVM) |
title_fullStr | Quality assessment of tandem mass spectra using support vector machine (SVM) |
title_full_unstemmed | Quality assessment of tandem mass spectra using support vector machine (SVM) |
title_short | Quality assessment of tandem mass spectra using support vector machine (SVM) |
title_sort | quality assessment of tandem mass spectra using support vector machine (svm) |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648784/ https://www.ncbi.nlm.nih.gov/pubmed/19208151 http://dx.doi.org/10.1186/1471-2105-10-S1-S49 |
work_keys_str_mv | AT zouanmin qualityassessmentoftandemmassspectrausingsupportvectormachinesvm AT wufangxiang qualityassessmentoftandemmassspectrausingsupportvectormachinesvm AT dingjiarui qualityassessmentoftandemmassspectrausingsupportvectormachinesvm AT poirierguyg qualityassessmentoftandemmassspectrausingsupportvectormachinesvm |