Cargando…
Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas
It has long been recognized that estimates of isotopic abundance patterns may be instrumental in identifying the many unknown compounds encountered when conducting untargeted metabolic profiling using liquid chromatography/mass spectrometry. While numerous methods have been developed for assigning h...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2010
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2930401/ https://www.ncbi.nlm.nih.gov/pubmed/20690638 http://dx.doi.org/10.1021/ac101278x |
_version_ | 1782185985963982848 |
---|---|
author | Ipsen, Andreas Want, Elizabeth J. Ebbels, Timothy M. D. |
author_facet | Ipsen, Andreas Want, Elizabeth J. Ebbels, Timothy M. D. |
author_sort | Ipsen, Andreas |
collection | PubMed |
description | It has long been recognized that estimates of isotopic abundance patterns may be instrumental in identifying the many unknown compounds encountered when conducting untargeted metabolic profiling using liquid chromatography/mass spectrometry. While numerous methods have been developed for assigning heuristic scores to rank the degree of fit of the observed abundance patterns with theoretical ones, little work has been done to quantify the errors that are associated with the measurements made. Thus, it is generally not possible to determine, in a statistically meaningful manner, whether a given chemical formula would likely be capable of producing the observed data. In this paper, we present a method for constructing confidence regions for the isotopic abundance patterns based on the fundamental distribution of the ion arrivals. Moreover, we develop a method for doing so that makes use of the information pooled together from the measurements obtained across an entire chromatographic peak, as well as from any adducts, dimers, and fragments observed in the mass spectra. This greatly increases the statistical power, thus enabling the analyst to rule out a potentially much larger number of candidate formulas while explicitly guarding against false positives. In practice, small departures from the model assumptions are possible due to detector saturation and interferences between adjacent isotopologues. While these factors form impediments to statistical rigor, they can to a large extent be overcome by restricting the analysis to moderate ion counts and by applying robust statistical methods. Using real metabolic data, we demonstrate that the method is capable of reducing the number of candidate formulas by a substantial amount, even when no bromine or chlorine atoms are present. We argue that further developments in our ability to characterize the data mathematically could enable much more powerful statistical analyses. |
format | Text |
id | pubmed-2930401 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-29304012010-08-31 Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas Ipsen, Andreas Want, Elizabeth J. Ebbels, Timothy M. D. Anal Chem It has long been recognized that estimates of isotopic abundance patterns may be instrumental in identifying the many unknown compounds encountered when conducting untargeted metabolic profiling using liquid chromatography/mass spectrometry. While numerous methods have been developed for assigning heuristic scores to rank the degree of fit of the observed abundance patterns with theoretical ones, little work has been done to quantify the errors that are associated with the measurements made. Thus, it is generally not possible to determine, in a statistically meaningful manner, whether a given chemical formula would likely be capable of producing the observed data. In this paper, we present a method for constructing confidence regions for the isotopic abundance patterns based on the fundamental distribution of the ion arrivals. Moreover, we develop a method for doing so that makes use of the information pooled together from the measurements obtained across an entire chromatographic peak, as well as from any adducts, dimers, and fragments observed in the mass spectra. This greatly increases the statistical power, thus enabling the analyst to rule out a potentially much larger number of candidate formulas while explicitly guarding against false positives. In practice, small departures from the model assumptions are possible due to detector saturation and interferences between adjacent isotopologues. While these factors form impediments to statistical rigor, they can to a large extent be overcome by restricting the analysis to moderate ion counts and by applying robust statistical methods. Using real metabolic data, we demonstrate that the method is capable of reducing the number of candidate formulas by a substantial amount, even when no bromine or chlorine atoms are present. We argue that further developments in our ability to characterize the data mathematically could enable much more powerful statistical analyses. American Chemical Society 2010-08-06 2010-09-01 /pmc/articles/PMC2930401/ /pubmed/20690638 http://dx.doi.org/10.1021/ac101278x Text en Copyright © 2010 American Chemical Society http://pubs.acs.org This is an open-access article distributed under the ACS AuthorChoice Terms & Conditions. Any use of this article, must conform to the terms of that license which are available at http://pubs.acs.org. |
spellingShingle | Ipsen, Andreas Want, Elizabeth J. Ebbels, Timothy M. D. Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas |
title | Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas |
title_full | Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas |
title_fullStr | Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas |
title_full_unstemmed | Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas |
title_short | Construction of Confidence Regions for Isotopic Abundance Patterns in LC/MS Data Sets for Rigorous Determination of Molecular Formulas |
title_sort | construction of confidence regions for isotopic abundance patterns in lc/ms data sets for rigorous determination of molecular formulas |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2930401/ https://www.ncbi.nlm.nih.gov/pubmed/20690638 http://dx.doi.org/10.1021/ac101278x |
work_keys_str_mv | AT ipsenandreas constructionofconfidenceregionsforisotopicabundancepatternsinlcmsdatasetsforrigorousdeterminationofmolecularformulas AT wantelizabethj constructionofconfidenceregionsforisotopicabundancepatternsinlcmsdatasetsforrigorousdeterminationofmolecularformulas AT ebbelstimothymd constructionofconfidenceregionsforisotopicabundancepatternsinlcmsdatasetsforrigorousdeterminationofmolecularformulas |