Cargando…

Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy

In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to a...

Descripción completa

Detalles Bibliográficos
Autores principales: Blaženović, Ivana, Kind, Tobias, Torbašinović, Hrvoje, Obrenović, Slobodan, Mehta, Sajjan S., Tsugawa, Hiroshi, Wermuth, Tobias, Schauer, Nicolas, Jahn, Martina, Biedendieck, Rebekka, Jahn, Dieter, Fiehn, Oliver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445034/
https://www.ncbi.nlm.nih.gov/pubmed/29086039
http://dx.doi.org/10.1186/s13321-017-0219-x
_version_ 1783238799214510080
author Blaženović, Ivana
Kind, Tobias
Torbašinović, Hrvoje
Obrenović, Slobodan
Mehta, Sajjan S.
Tsugawa, Hiroshi
Wermuth, Tobias
Schauer, Nicolas
Jahn, Martina
Biedendieck, Rebekka
Jahn, Dieter
Fiehn, Oliver
author_facet Blaženović, Ivana
Kind, Tobias
Torbašinović, Hrvoje
Obrenović, Slobodan
Mehta, Sajjan S.
Tsugawa, Hiroshi
Wermuth, Tobias
Schauer, Nicolas
Jahn, Martina
Biedendieck, Rebekka
Jahn, Dieter
Fiehn, Oliver
author_sort Blaženović, Ivana
collection PubMed
description In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to address this discrepancy is to use in silico fragmentation software to identify unknown compounds by comparing and ranking theoretical MS/MS fragmentations from target structures to experimental tandem mass spectra (MS/MS). We compared the performance of four publicly available in silico fragmentation algorithms (MetFragCL, CFM-ID, MAGMa+ and MS-FINDER) that participated in the 2016 CASMI challenge. We found that optimizing the use of metadata, weighting factors and the manner of combining different tools eventually defined the ultimate outcomes of each method. We comprehensively analysed how outcomes of different tools could be combined and reached a final success rate of 93% for the training data, and 87% for the challenge data, using a combination of MAGMa+, CFM-ID and compound importance information along with MS/MS matching. Matching MS/MS spectra against the MS/MS libraries without using any in silico tool yielded 60% correct hits, showing that the use of in silico methods is still important. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-017-0219-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5445034
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-54450342017-06-13 Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy Blaženović, Ivana Kind, Tobias Torbašinović, Hrvoje Obrenović, Slobodan Mehta, Sajjan S. Tsugawa, Hiroshi Wermuth, Tobias Schauer, Nicolas Jahn, Martina Biedendieck, Rebekka Jahn, Dieter Fiehn, Oliver J Cheminform Research Article In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to address this discrepancy is to use in silico fragmentation software to identify unknown compounds by comparing and ranking theoretical MS/MS fragmentations from target structures to experimental tandem mass spectra (MS/MS). We compared the performance of four publicly available in silico fragmentation algorithms (MetFragCL, CFM-ID, MAGMa+ and MS-FINDER) that participated in the 2016 CASMI challenge. We found that optimizing the use of metadata, weighting factors and the manner of combining different tools eventually defined the ultimate outcomes of each method. We comprehensively analysed how outcomes of different tools could be combined and reached a final success rate of 93% for the training data, and 87% for the challenge data, using a combination of MAGMa+, CFM-ID and compound importance information along with MS/MS matching. Matching MS/MS spectra against the MS/MS libraries without using any in silico tool yielded 60% correct hits, showing that the use of in silico methods is still important. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-017-0219-x) contains supplementary material, which is available to authorized users. Springer International Publishing 2017-05-25 /pmc/articles/PMC5445034/ /pubmed/29086039 http://dx.doi.org/10.1186/s13321-017-0219-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Blaženović, Ivana
Kind, Tobias
Torbašinović, Hrvoje
Obrenović, Slobodan
Mehta, Sajjan S.
Tsugawa, Hiroshi
Wermuth, Tobias
Schauer, Nicolas
Jahn, Martina
Biedendieck, Rebekka
Jahn, Dieter
Fiehn, Oliver
Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
title Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
title_full Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
title_fullStr Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
title_full_unstemmed Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
title_short Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
title_sort comprehensive comparison of in silico ms/ms fragmentation tools of the casmi contest: database boosting is needed to achieve 93% accuracy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445034/
https://www.ncbi.nlm.nih.gov/pubmed/29086039
http://dx.doi.org/10.1186/s13321-017-0219-x
work_keys_str_mv AT blazenovicivana comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT kindtobias comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT torbasinovichrvoje comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT obrenovicslobodan comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT mehtasajjans comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT tsugawahiroshi comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT wermuthtobias comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT schauernicolas comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT jahnmartina comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT biedendieckrebekka comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT jahndieter comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy
AT fiehnoliver comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy