Cargando…

Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas

It has long been recognized that estimates of isotopic abundance patterns may be instrumental in identifying the many unknown compounds encountered when conducting untargeted metabolic profiling using liquid chromatography/mass spectrometry. While numerous methods have been developed for assigning h...

Descripción completa

Detalles Bibliográficos
Autores principales: Ipsen, Andreas, Want, Elizabeth J., Ebbels, Timothy M. D.
Formato: Texto
Lenguaje:English
Publicado: American Chemical Society 2010
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2930401/
https://www.ncbi.nlm.nih.gov/pubmed/20690638
http://dx.doi.org/10.1021/ac101278x
_version_ 1782185985963982848
author Ipsen, Andreas
Want, Elizabeth J.
Ebbels, Timothy M. D.
author_facet Ipsen, Andreas
Want, Elizabeth J.
Ebbels, Timothy M. D.
author_sort Ipsen, Andreas
collection PubMed
description It has long been recognized that estimates of isotopic abundance patterns may be instrumental in identifying the many unknown compounds encountered when conducting untargeted metabolic profiling using liquid chromatography/mass spectrometry. While numerous methods have been developed for assigning heuristic scores to rank the degree of fit of the observed abundance patterns with theoretical ones, little work has been done to quantify the errors that are associated with the measurements made. Thus, it is generally not possible to determine, in a statistically meaningful manner, whether a given chemical formula would likely be capable of producing the observed data. In this paper, we present a method for constructing confidence regions for the isotopic abundance patterns based on the fundamental distribution of the ion arrivals. Moreover, we develop a method for doing so that makes use of the information pooled together from the measurements obtained across an entire chromatographic peak, as well as from any adducts, dimers, and fragments observed in the mass spectra. This greatly increases the statistical power, thus enabling the analyst to rule out a potentially much larger number of candidate formulas while explicitly guarding against false positives. In practice, small departures from the model assumptions are possible due to detector saturation and interferences between adjacent isotopologues. While these factors form impediments to statistical rigor, they can to a large extent be overcome by restricting the analysis to moderate ion counts and by applying robust statistical methods. Using real metabolic data, we demonstrate that the method is capable of reducing the number of candidate formulas by a substantial amount, even when no bromine or chlorine atoms are present. We argue that further developments in our ability to characterize the data mathematically could enable much more powerful statistical analyses.
format Text
id pubmed-2930401
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-29304012010-08-31 Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas Ipsen, Andreas Want, Elizabeth J. Ebbels, Timothy M. D. Anal Chem It has long been recognized that estimates of isotopic abundance patterns may be instrumental in identifying the many unknown compounds encountered when conducting untargeted metabolic profiling using liquid chromatography/mass spectrometry. While numerous methods have been developed for assigning heuristic scores to rank the degree of fit of the observed abundance patterns with theoretical ones, little work has been done to quantify the errors that are associated with the measurements made. Thus, it is generally not possible to determine, in a statistically meaningful manner, whether a given chemical formula would likely be capable of producing the observed data. In this paper, we present a method for constructing confidence regions for the isotopic abundance patterns based on the fundamental distribution of the ion arrivals. Moreover, we develop a method for doing so that makes use of the information pooled together from the measurements obtained across an entire chromatographic peak, as well as from any adducts, dimers, and fragments observed in the mass spectra. This greatly increases the statistical power, thus enabling the analyst to rule out a potentially much larger number of candidate formulas while explicitly guarding against false positives. In practice, small departures from the model assumptions are possible due to detector saturation and interferences between adjacent isotopologues. While these factors form impediments to statistical rigor, they can to a large extent be overcome by restricting the analysis to moderate ion counts and by applying robust statistical methods. Using real metabolic data, we demonstrate that the method is capable of reducing the number of candidate formulas by a substantial amount, even when no bromine or chlorine atoms are present. We argue that further developments in our ability to characterize the data mathematically could enable much more powerful statistical analyses. American Chemical Society 2010-08-06 2010-09-01 /pmc/articles/PMC2930401/ /pubmed/20690638 http://dx.doi.org/10.1021/ac101278x Text en Copyright © 2010 American Chemical Society http://pubs.acs.org This is an open-access article distributed under the ACS AuthorChoice Terms & Conditions. Any use of this article, must conform to the terms of that license which are available at http://pubs.acs.org.
spellingShingle Ipsen, Andreas
Want, Elizabeth J.
Ebbels, Timothy M. D.
Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas
title Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas
title_full Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas
title_fullStr Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas
title_full_unstemmed Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas
title_short Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas
title_sort construction of confidence regions for isotopic abundance patterns in lc/ms data sets for rigorous determination of molecular formulas
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2930401/
https://www.ncbi.nlm.nih.gov/pubmed/20690638
http://dx.doi.org/10.1021/ac101278x
work_keys_str_mv AT ipsenandreas constructionofconfidenceregionsforisotopicabundancepatternsinlcmsdatasetsforrigorousdeterminationofmolecularformulas
AT wantelizabethj constructionofconfidenceregionsforisotopicabundancepatternsinlcmsdatasetsforrigorousdeterminationofmolecularformulas
AT ebbelstimothymd constructionofconfidenceregionsforisotopicabundancepatternsinlcmsdatasetsforrigorousdeterminationofmolecularformulas