Cargando…

In silico fragmentation for computer assisted identification of metabolite mass spectra

BACKGROUND: Mass spectrometry has become the analytical method of choice in metabolomics research. The identification of unknown compounds is the main bottleneck. In addition to the precursor mass, tandem MS spectra carry informative fragment peaks, but the coverage of spectral libraries of measured...

Descripción completa

Detalles Bibliográficos
Autores principales: Wolf, Sebastian, Schmidt, Stephan, Müller-Hannemann, Matthias, Neumann, Steffen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853470/
https://www.ncbi.nlm.nih.gov/pubmed/20307295
http://dx.doi.org/10.1186/1471-2105-11-148
_version_ 1782180026945372160
author Wolf, Sebastian
Schmidt, Stephan
Müller-Hannemann, Matthias
Neumann, Steffen
author_facet Wolf, Sebastian
Schmidt, Stephan
Müller-Hannemann, Matthias
Neumann, Steffen
author_sort Wolf, Sebastian
collection PubMed
description BACKGROUND: Mass spectrometry has become the analytical method of choice in metabolomics research. The identification of unknown compounds is the main bottleneck. In addition to the precursor mass, tandem MS spectra carry informative fragment peaks, but the coverage of spectral libraries of measured reference compounds are far from covering the complete chemical space. Compound libraries such as PubChem or KEGG describe a larger number of compounds, which can be used to compare their in silico fragmentation with spectra of unknown metabolites. RESULTS: We created the MetFrag suite to obtain a candidate list from compound libraries based on the precursor mass, subsequently ranked by the agreement between measured and in silico fragments. In the evaluation MetFrag was able to rank most of the correct compounds within the top 3 candidates returned by an exact mass query in KEGG. Compared to a previously published study, MetFrag obtained better results than the commercial MassFrontier software. Especially for large compound libraries, the candidates with a good score show a high structural similarity or just different stereochemistry, a subsequent clustering based on chemical distances reduces this redundancy. The in silico fragmentation requires less than a second to process a molecule, and MetFrag performs a search in KEGG or PubChem on average within 30 to 300 seconds, respectively, on an average desktop PC. CONCLUSIONS: We presented a method that is able to identify small molecules from tandem MS measurements, even without spectral reference data or a large set of fragmentation rules. With today's massive general purpose compound libraries we obtain dozens of very similar candidates, which still allows a confident estimate of the correct compound class. Our tool MetFrag improves the identification of unknown substances from tandem MS spectra and delivers better results than comparable commercial software. MetFrag is available through a web application, web services and as java library. The web frontend allows the end-user to analyse single spectra and browse the results, whereas the web service and console application are aimed to perform batch searches and evaluation.
format Text
id pubmed-2853470
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28534702010-04-13 In silico fragmentation for computer assisted identification of metabolite mass spectra Wolf, Sebastian Schmidt, Stephan Müller-Hannemann, Matthias Neumann, Steffen BMC Bioinformatics Methodology article BACKGROUND: Mass spectrometry has become the analytical method of choice in metabolomics research. The identification of unknown compounds is the main bottleneck. In addition to the precursor mass, tandem MS spectra carry informative fragment peaks, but the coverage of spectral libraries of measured reference compounds are far from covering the complete chemical space. Compound libraries such as PubChem or KEGG describe a larger number of compounds, which can be used to compare their in silico fragmentation with spectra of unknown metabolites. RESULTS: We created the MetFrag suite to obtain a candidate list from compound libraries based on the precursor mass, subsequently ranked by the agreement between measured and in silico fragments. In the evaluation MetFrag was able to rank most of the correct compounds within the top 3 candidates returned by an exact mass query in KEGG. Compared to a previously published study, MetFrag obtained better results than the commercial MassFrontier software. Especially for large compound libraries, the candidates with a good score show a high structural similarity or just different stereochemistry, a subsequent clustering based on chemical distances reduces this redundancy. The in silico fragmentation requires less than a second to process a molecule, and MetFrag performs a search in KEGG or PubChem on average within 30 to 300 seconds, respectively, on an average desktop PC. CONCLUSIONS: We presented a method that is able to identify small molecules from tandem MS measurements, even without spectral reference data or a large set of fragmentation rules. With today's massive general purpose compound libraries we obtain dozens of very similar candidates, which still allows a confident estimate of the correct compound class. Our tool MetFrag improves the identification of unknown substances from tandem MS spectra and delivers better results than comparable commercial software. MetFrag is available through a web application, web services and as java library. The web frontend allows the end-user to analyse single spectra and browse the results, whereas the web service and console application are aimed to perform batch searches and evaluation. BioMed Central 2010-03-22 /pmc/articles/PMC2853470/ /pubmed/20307295 http://dx.doi.org/10.1186/1471-2105-11-148 Text en Copyright ©2010 Wolf et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology article
Wolf, Sebastian
Schmidt, Stephan
Müller-Hannemann, Matthias
Neumann, Steffen
In silico fragmentation for computer assisted identification of metabolite mass spectra
title In silico fragmentation for computer assisted identification of metabolite mass spectra
title_full In silico fragmentation for computer assisted identification of metabolite mass spectra
title_fullStr In silico fragmentation for computer assisted identification of metabolite mass spectra
title_full_unstemmed In silico fragmentation for computer assisted identification of metabolite mass spectra
title_short In silico fragmentation for computer assisted identification of metabolite mass spectra
title_sort in silico fragmentation for computer assisted identification of metabolite mass spectra
topic Methodology article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2853470/
https://www.ncbi.nlm.nih.gov/pubmed/20307295
http://dx.doi.org/10.1186/1471-2105-11-148
work_keys_str_mv AT wolfsebastian insilicofragmentationforcomputerassistedidentificationofmetabolitemassspectra
AT schmidtstephan insilicofragmentationforcomputerassistedidentificationofmetabolitemassspectra
AT mullerhannemannmatthias insilicofragmentationforcomputerassistedidentificationofmetabolitemassspectra
AT neumannsteffen insilicofragmentationforcomputerassistedidentificationofmetabolitemassspectra