Cargando…

MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach

Motivation: The use of liquid chromatography coupled to mass spectrometry has enabled the high-throughput profiling of the metabolite composition of biological samples. However, the large amount of data obtained can be difficult to analyse and often requires computational processing to understand wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Daly, Rónán, Rogers, Simon, Wandy, Joe, Jankevics, Andris, Burgess, Karl E. V., Breitling, Rainer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173012/
https://www.ncbi.nlm.nih.gov/pubmed/24916385
http://dx.doi.org/10.1093/bioinformatics/btu370
_version_ 1782336121291669504
author Daly, Rónán
Rogers, Simon
Wandy, Joe
Jankevics, Andris
Burgess, Karl E. V.
Breitling, Rainer
author_facet Daly, Rónán
Rogers, Simon
Wandy, Joe
Jankevics, Andris
Burgess, Karl E. V.
Breitling, Rainer
author_sort Daly, Rónán
collection PubMed
description Motivation: The use of liquid chromatography coupled to mass spectrometry has enabled the high-throughput profiling of the metabolite composition of biological samples. However, the large amount of data obtained can be difficult to analyse and often requires computational processing to understand which metabolites are present in a sample. This article looks at the dual problem of annotating peaks in a sample with a metabolite, together with putatively annotating whether a metabolite is present in the sample. The starting point of the approach is a Bayesian clustering of peaks into groups, each corresponding to putative adducts and isotopes of a single metabolite. Results: The Bayesian modelling introduced here combines information from the mass-to-charge ratio, retention time and intensity of each peak, together with a model of the inter-peak dependency structure, to increase the accuracy of peak annotation. The results inherently contain a quantitative estimate of confidence in the peak annotations and allow an accurate trade-off between precision and recall. Extensive validation experiments using authentic chemical standards show that this system is able to produce more accurate putative identifications than other state-of-the-art systems, while at the same time giving a probabilistic measure of confidence in the annotations. Availability and implementation: The software has been implemented as part of the mzMatch metabolomics analysis pipeline, which is available for download at http://mzmatch.sourceforge.net/. Contact: Ronan.Daly@glasgow.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4173012
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41730122014-09-25 MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach Daly, Rónán Rogers, Simon Wandy, Joe Jankevics, Andris Burgess, Karl E. V. Breitling, Rainer Bioinformatics Original Papers Motivation: The use of liquid chromatography coupled to mass spectrometry has enabled the high-throughput profiling of the metabolite composition of biological samples. However, the large amount of data obtained can be difficult to analyse and often requires computational processing to understand which metabolites are present in a sample. This article looks at the dual problem of annotating peaks in a sample with a metabolite, together with putatively annotating whether a metabolite is present in the sample. The starting point of the approach is a Bayesian clustering of peaks into groups, each corresponding to putative adducts and isotopes of a single metabolite. Results: The Bayesian modelling introduced here combines information from the mass-to-charge ratio, retention time and intensity of each peak, together with a model of the inter-peak dependency structure, to increase the accuracy of peak annotation. The results inherently contain a quantitative estimate of confidence in the peak annotations and allow an accurate trade-off between precision and recall. Extensive validation experiments using authentic chemical standards show that this system is able to produce more accurate putative identifications than other state-of-the-art systems, while at the same time giving a probabilistic measure of confidence in the annotations. Availability and implementation: The software has been implemented as part of the mzMatch metabolomics analysis pipeline, which is available for download at http://mzmatch.sourceforge.net/. Contact: Ronan.Daly@glasgow.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-10 2014-06-09 /pmc/articles/PMC4173012/ /pubmed/24916385 http://dx.doi.org/10.1093/bioinformatics/btu370 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Daly, Rónán
Rogers, Simon
Wandy, Joe
Jankevics, Andris
Burgess, Karl E. V.
Breitling, Rainer
MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach
title MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach
title_full MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach
title_fullStr MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach
title_full_unstemmed MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach
title_short MetAssign: probabilistic annotation of metabolites from LC–MS data using a Bayesian clustering approach
title_sort metassign: probabilistic annotation of metabolites from lc–ms data using a bayesian clustering approach
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4173012/
https://www.ncbi.nlm.nih.gov/pubmed/24916385
http://dx.doi.org/10.1093/bioinformatics/btu370
work_keys_str_mv AT dalyronan metassignprobabilisticannotationofmetabolitesfromlcmsdatausingabayesianclusteringapproach
AT rogerssimon metassignprobabilisticannotationofmetabolitesfromlcmsdatausingabayesianclusteringapproach
AT wandyjoe metassignprobabilisticannotationofmetabolitesfromlcmsdatausingabayesianclusteringapproach
AT jankevicsandris metassignprobabilisticannotationofmetabolitesfromlcmsdatausingabayesianclusteringapproach
AT burgesskarlev metassignprobabilisticannotationofmetabolitesfromlcmsdatausingabayesianclusteringapproach
AT breitlingrainer metassignprobabilisticannotationofmetabolitesfromlcmsdatausingabayesianclusteringapproach