Cargando…

An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry

BACKGROUND: Mass spectrometry (MS) based metabolite profiling has been increasingly popular for scientific and biomedical studies, primarily due to recent technological development such as comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (GCxGC/TOF-MS). Nevertheless,...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeong, Jaesik, Shi, Xue, Zhang, Xiang, Kim, Seongho, Shen, Changyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228553/
https://www.ncbi.nlm.nih.gov/pubmed/21985394
http://dx.doi.org/10.1186/1471-2105-12-392
_version_ 1782217832342224896
author Jeong, Jaesik
Shi, Xue
Zhang, Xiang
Kim, Seongho
Shen, Changyu
author_facet Jeong, Jaesik
Shi, Xue
Zhang, Xiang
Kim, Seongho
Shen, Changyu
author_sort Jeong, Jaesik
collection PubMed
description BACKGROUND: Mass spectrometry (MS) based metabolite profiling has been increasingly popular for scientific and biomedical studies, primarily due to recent technological development such as comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (GCxGC/TOF-MS). Nevertheless, the identifications of metabolites from complex samples are subject to errors. Statistical/computational approaches to improve the accuracy of the identifications and false positive estimate are in great need. We propose an empirical Bayes model which accounts for a competing score in addition to the similarity score to tackle this problem. The competition score characterizes the propensity of a candidate metabolite of being matched to some spectrum based on the metabolite's similarity score with other spectra in the library searched against. The competition score allows the model to properly assess the evidence on the presence/absence status of a metabolite based on whether or not the metabolite is matched to some sample spectrum. RESULTS: With a mixture of metabolite standards, we demonstrated that our method has better identification accuracy than other four existing methods. Moreover, our method has reliable false discovery rate estimate. We also applied our method to the data collected from the plasma of a rat and identified some metabolites from the plasma under the control of false discovery rate. CONCLUSIONS: We developed an empirical Bayes model for metabolite identification and validated the method through a mixture of metabolite standards and rat plasma. The results show that our hierarchical model improves identification accuracy as compared with methods that do not structurally model the involved variables. The improvement in identification accuracy is likely to facilitate downstream analysis such as peak alignment and biomarker identification. Raw data and result matrices can be found at http://www.biostat.iupui.edu/~ChangyuShen/index.htm TRIAL REGISTRATION: 2123938128573429
format Online
Article
Text
id pubmed-3228553
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32285532011-12-07 An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry Jeong, Jaesik Shi, Xue Zhang, Xiang Kim, Seongho Shen, Changyu BMC Bioinformatics Research Article BACKGROUND: Mass spectrometry (MS) based metabolite profiling has been increasingly popular for scientific and biomedical studies, primarily due to recent technological development such as comprehensive two-dimensional gas chromatography time-of-flight mass spectrometry (GCxGC/TOF-MS). Nevertheless, the identifications of metabolites from complex samples are subject to errors. Statistical/computational approaches to improve the accuracy of the identifications and false positive estimate are in great need. We propose an empirical Bayes model which accounts for a competing score in addition to the similarity score to tackle this problem. The competition score characterizes the propensity of a candidate metabolite of being matched to some spectrum based on the metabolite's similarity score with other spectra in the library searched against. The competition score allows the model to properly assess the evidence on the presence/absence status of a metabolite based on whether or not the metabolite is matched to some sample spectrum. RESULTS: With a mixture of metabolite standards, we demonstrated that our method has better identification accuracy than other four existing methods. Moreover, our method has reliable false discovery rate estimate. We also applied our method to the data collected from the plasma of a rat and identified some metabolites from the plasma under the control of false discovery rate. CONCLUSIONS: We developed an empirical Bayes model for metabolite identification and validated the method through a mixture of metabolite standards and rat plasma. The results show that our hierarchical model improves identification accuracy as compared with methods that do not structurally model the involved variables. The improvement in identification accuracy is likely to facilitate downstream analysis such as peak alignment and biomarker identification. Raw data and result matrices can be found at http://www.biostat.iupui.edu/~ChangyuShen/index.htm TRIAL REGISTRATION: 2123938128573429 BioMed Central 2011-10-10 /pmc/articles/PMC3228553/ /pubmed/21985394 http://dx.doi.org/10.1186/1471-2105-12-392 Text en Copyright ©2011 Jeong et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jeong, Jaesik
Shi, Xue
Zhang, Xiang
Kim, Seongho
Shen, Changyu
An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry
title An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry
title_full An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry
title_fullStr An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry
title_full_unstemmed An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry
title_short An empirical Bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry
title_sort empirical bayes model using a competition score for metabolite identification in gas chromatography mass spectrometry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3228553/
https://www.ncbi.nlm.nih.gov/pubmed/21985394
http://dx.doi.org/10.1186/1471-2105-12-392
work_keys_str_mv AT jeongjaesik anempiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT shixue anempiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT zhangxiang anempiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT kimseongho anempiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT shenchangyu anempiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT jeongjaesik empiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT shixue empiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT zhangxiang empiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT kimseongho empiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry
AT shenchangyu empiricalbayesmodelusingacompetitionscoreformetaboliteidentificationingaschromatographymassspectrometry