Cargando…

Metabolite identification through multiple kernel learning on fragmentation trees

Motivation: Metabolite identification from tandem mass spectrometric data is a key task in metabolomics. Various computational methods have been proposed for the identification of metabolites from tandem mass spectra. Fragmentation tree methods explore the space of possible ways in which the metabol...

Descripción completa

Detalles Bibliográficos
Autores principales: Shen, Huibin, Dührkop, Kai, Böcker, Sebastian, Rousu, Juho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058957/
https://www.ncbi.nlm.nih.gov/pubmed/24931979
http://dx.doi.org/10.1093/bioinformatics/btu275
_version_ 1782321193997565952
author Shen, Huibin
Dührkop, Kai
Böcker, Sebastian
Rousu, Juho
author_facet Shen, Huibin
Dührkop, Kai
Böcker, Sebastian
Rousu, Juho
author_sort Shen, Huibin
collection PubMed
description Motivation: Metabolite identification from tandem mass spectrometric data is a key task in metabolomics. Various computational methods have been proposed for the identification of metabolites from tandem mass spectra. Fragmentation tree methods explore the space of possible ways in which the metabolite can fragment, and base the metabolite identification on scoring of these fragmentation trees. Machine learning methods have been used to map mass spectra to molecular fingerprints; predicted fingerprints, in turn, can be used to score candidate molecular structures. Results: Here, we combine fragmentation tree computations with kernel-based machine learning to predict molecular fingerprints and identify molecular structures. We introduce a family of kernels capturing the similarity of fragmentation trees, and combine these kernels using recently proposed multiple kernel learning approaches. Experiments on two large reference datasets show that the new methods significantly improve molecular fingerprint prediction accuracy. These improvements result in better metabolite identification, doubling the number of metabolites ranked at the top position of the candidates list. Contact: huibin.shen@aalto.fi Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4058957
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40589572014-06-18 Metabolite identification through multiple kernel learning on fragmentation trees Shen, Huibin Dührkop, Kai Böcker, Sebastian Rousu, Juho Bioinformatics Ismb 2014 Proceedings Papers Committee Motivation: Metabolite identification from tandem mass spectrometric data is a key task in metabolomics. Various computational methods have been proposed for the identification of metabolites from tandem mass spectra. Fragmentation tree methods explore the space of possible ways in which the metabolite can fragment, and base the metabolite identification on scoring of these fragmentation trees. Machine learning methods have been used to map mass spectra to molecular fingerprints; predicted fingerprints, in turn, can be used to score candidate molecular structures. Results: Here, we combine fragmentation tree computations with kernel-based machine learning to predict molecular fingerprints and identify molecular structures. We introduce a family of kernels capturing the similarity of fragmentation trees, and combine these kernels using recently proposed multiple kernel learning approaches. Experiments on two large reference datasets show that the new methods significantly improve molecular fingerprint prediction accuracy. These improvements result in better metabolite identification, doubling the number of metabolites ranked at the top position of the candidates list. Contact: huibin.shen@aalto.fi Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-06-15 2014-06-11 /pmc/articles/PMC4058957/ /pubmed/24931979 http://dx.doi.org/10.1093/bioinformatics/btu275 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Ismb 2014 Proceedings Papers Committee
Shen, Huibin
Dührkop, Kai
Böcker, Sebastian
Rousu, Juho
Metabolite identification through multiple kernel learning on fragmentation trees
title Metabolite identification through multiple kernel learning on fragmentation trees
title_full Metabolite identification through multiple kernel learning on fragmentation trees
title_fullStr Metabolite identification through multiple kernel learning on fragmentation trees
title_full_unstemmed Metabolite identification through multiple kernel learning on fragmentation trees
title_short Metabolite identification through multiple kernel learning on fragmentation trees
title_sort metabolite identification through multiple kernel learning on fragmentation trees
topic Ismb 2014 Proceedings Papers Committee
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058957/
https://www.ncbi.nlm.nih.gov/pubmed/24931979
http://dx.doi.org/10.1093/bioinformatics/btu275
work_keys_str_mv AT shenhuibin metaboliteidentificationthroughmultiplekernellearningonfragmentationtrees
AT duhrkopkai metaboliteidentificationthroughmultiplekernellearningonfragmentationtrees
AT bockersebastian metaboliteidentificationthroughmultiplekernellearningonfragmentationtrees
AT rousujuho metaboliteidentificationthroughmultiplekernellearningonfragmentationtrees