Cargando…
Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy
In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to a...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445034/ https://www.ncbi.nlm.nih.gov/pubmed/29086039 http://dx.doi.org/10.1186/s13321-017-0219-x |
_version_ | 1783238799214510080 |
---|---|
author | Blaženović, Ivana Kind, Tobias Torbašinović, Hrvoje Obrenović, Slobodan Mehta, Sajjan S. Tsugawa, Hiroshi Wermuth, Tobias Schauer, Nicolas Jahn, Martina Biedendieck, Rebekka Jahn, Dieter Fiehn, Oliver |
author_facet | Blaženović, Ivana Kind, Tobias Torbašinović, Hrvoje Obrenović, Slobodan Mehta, Sajjan S. Tsugawa, Hiroshi Wermuth, Tobias Schauer, Nicolas Jahn, Martina Biedendieck, Rebekka Jahn, Dieter Fiehn, Oliver |
author_sort | Blaženović, Ivana |
collection | PubMed |
description | In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to address this discrepancy is to use in silico fragmentation software to identify unknown compounds by comparing and ranking theoretical MS/MS fragmentations from target structures to experimental tandem mass spectra (MS/MS). We compared the performance of four publicly available in silico fragmentation algorithms (MetFragCL, CFM-ID, MAGMa+ and MS-FINDER) that participated in the 2016 CASMI challenge. We found that optimizing the use of metadata, weighting factors and the manner of combining different tools eventually defined the ultimate outcomes of each method. We comprehensively analysed how outcomes of different tools could be combined and reached a final success rate of 93% for the training data, and 87% for the challenge data, using a combination of MAGMa+, CFM-ID and compound importance information along with MS/MS matching. Matching MS/MS spectra against the MS/MS libraries without using any in silico tool yielded 60% correct hits, showing that the use of in silico methods is still important. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-017-0219-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5445034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-54450342017-06-13 Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy Blaženović, Ivana Kind, Tobias Torbašinović, Hrvoje Obrenović, Slobodan Mehta, Sajjan S. Tsugawa, Hiroshi Wermuth, Tobias Schauer, Nicolas Jahn, Martina Biedendieck, Rebekka Jahn, Dieter Fiehn, Oliver J Cheminform Research Article In mass spectrometry-based untargeted metabolomics, rarely more than 30% of the compounds are identified. Without the true identity of these molecules it is impossible to draw conclusions about the biological mechanisms, pathway relationships and provenance of compounds. The only way at present to address this discrepancy is to use in silico fragmentation software to identify unknown compounds by comparing and ranking theoretical MS/MS fragmentations from target structures to experimental tandem mass spectra (MS/MS). We compared the performance of four publicly available in silico fragmentation algorithms (MetFragCL, CFM-ID, MAGMa+ and MS-FINDER) that participated in the 2016 CASMI challenge. We found that optimizing the use of metadata, weighting factors and the manner of combining different tools eventually defined the ultimate outcomes of each method. We comprehensively analysed how outcomes of different tools could be combined and reached a final success rate of 93% for the training data, and 87% for the challenge data, using a combination of MAGMa+, CFM-ID and compound importance information along with MS/MS matching. Matching MS/MS spectra against the MS/MS libraries without using any in silico tool yielded 60% correct hits, showing that the use of in silico methods is still important. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-017-0219-x) contains supplementary material, which is available to authorized users. Springer International Publishing 2017-05-25 /pmc/articles/PMC5445034/ /pubmed/29086039 http://dx.doi.org/10.1186/s13321-017-0219-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Blaženović, Ivana Kind, Tobias Torbašinović, Hrvoje Obrenović, Slobodan Mehta, Sajjan S. Tsugawa, Hiroshi Wermuth, Tobias Schauer, Nicolas Jahn, Martina Biedendieck, Rebekka Jahn, Dieter Fiehn, Oliver Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy |
title | Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy |
title_full | Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy |
title_fullStr | Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy |
title_full_unstemmed | Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy |
title_short | Comprehensive comparison of in silico MS/MS fragmentation tools of the CASMI contest: database boosting is needed to achieve 93% accuracy |
title_sort | comprehensive comparison of in silico ms/ms fragmentation tools of the casmi contest: database boosting is needed to achieve 93% accuracy |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5445034/ https://www.ncbi.nlm.nih.gov/pubmed/29086039 http://dx.doi.org/10.1186/s13321-017-0219-x |
work_keys_str_mv | AT blazenovicivana comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT kindtobias comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT torbasinovichrvoje comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT obrenovicslobodan comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT mehtasajjans comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT tsugawahiroshi comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT wermuthtobias comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT schauernicolas comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT jahnmartina comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT biedendieckrebekka comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT jahndieter comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy AT fiehnoliver comprehensivecomparisonofinsilicomsmsfragmentationtoolsofthecasmicontestdatabaseboostingisneededtoachieve93accuracy |