Cargando…

Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics

Large-scale identification of metabolites is key to elucidating and modeling metabolism at the systems level. Advances in metabolomics technologies, particularly ultra-high resolution mass spectrometry (MS) enable comprehensive and rapid analysis of metabolites. However, a significant barrier to mea...

Descripción completa

Detalles Bibliográficos
Autores principales: Mitchell, Joshua M., Fan, Teresa W.-M., Lane, Andrew N., Moseley, Hunter N. B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4112935/
https://www.ncbi.nlm.nih.gov/pubmed/25120557
http://dx.doi.org/10.3389/fgene.2014.00237
_version_ 1782328227812868096
author Mitchell, Joshua M.
Fan, Teresa W.-M.
Lane, Andrew N.
Moseley, Hunter N. B.
author_facet Mitchell, Joshua M.
Fan, Teresa W.-M.
Lane, Andrew N.
Moseley, Hunter N. B.
author_sort Mitchell, Joshua M.
collection PubMed
description Large-scale identification of metabolites is key to elucidating and modeling metabolism at the systems level. Advances in metabolomics technologies, particularly ultra-high resolution mass spectrometry (MS) enable comprehensive and rapid analysis of metabolites. However, a significant barrier to meaningful data interpretation is the identification of a wide range of metabolites including unknowns and the determination of their role(s) in various metabolic networks. Chemoselective (CS) probes to tag metabolite functional groups combined with high mass accuracy provide additional structural constraints for metabolite identification and quantification. We have developed a novel algorithm, Chemically Aware Substructure Search (CASS) that efficiently detects functional groups within existing metabolite databases, allowing for combined molecular formula and functional group (from CS tagging) queries to aid in metabolite identification without a priori knowledge. Analysis of the isomeric compounds in both Human Metabolome Database (HMDB) and KEGG Ligand demonstrated a high percentage of isomeric molecular formulae (43 and 28%, respectively), indicating the necessity for techniques such as CS-tagging. Furthermore, these two databases have only moderate overlap in molecular formulae. Thus, it is prudent to use multiple databases in metabolite assignment, since each major metabolite database represents different portions of metabolism within the biosphere. In silico analysis of various CS-tagging strategies under different conditions for adduct formation demonstrate that combined FT-MS derived molecular formulae and CS-tagging can uniquely identify up to 71% of KEGG and 37% of the combined KEGG/HMDB database vs. 41 and 17%, respectively without adduct formation. This difference between database isomer disambiguation highlights the strength of CS-tagging for non-lipid metabolite identification. However, unique identification of complex lipids still needs additional information.
format Online
Article
Text
id pubmed-4112935
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-41129352014-08-12 Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics Mitchell, Joshua M. Fan, Teresa W.-M. Lane, Andrew N. Moseley, Hunter N. B. Front Genet Physiology Large-scale identification of metabolites is key to elucidating and modeling metabolism at the systems level. Advances in metabolomics technologies, particularly ultra-high resolution mass spectrometry (MS) enable comprehensive and rapid analysis of metabolites. However, a significant barrier to meaningful data interpretation is the identification of a wide range of metabolites including unknowns and the determination of their role(s) in various metabolic networks. Chemoselective (CS) probes to tag metabolite functional groups combined with high mass accuracy provide additional structural constraints for metabolite identification and quantification. We have developed a novel algorithm, Chemically Aware Substructure Search (CASS) that efficiently detects functional groups within existing metabolite databases, allowing for combined molecular formula and functional group (from CS tagging) queries to aid in metabolite identification without a priori knowledge. Analysis of the isomeric compounds in both Human Metabolome Database (HMDB) and KEGG Ligand demonstrated a high percentage of isomeric molecular formulae (43 and 28%, respectively), indicating the necessity for techniques such as CS-tagging. Furthermore, these two databases have only moderate overlap in molecular formulae. Thus, it is prudent to use multiple databases in metabolite assignment, since each major metabolite database represents different portions of metabolism within the biosphere. In silico analysis of various CS-tagging strategies under different conditions for adduct formation demonstrate that combined FT-MS derived molecular formulae and CS-tagging can uniquely identify up to 71% of KEGG and 37% of the combined KEGG/HMDB database vs. 41 and 17%, respectively without adduct formation. This difference between database isomer disambiguation highlights the strength of CS-tagging for non-lipid metabolite identification. However, unique identification of complex lipids still needs additional information. Frontiers Media S.A. 2014-07-28 /pmc/articles/PMC4112935/ /pubmed/25120557 http://dx.doi.org/10.3389/fgene.2014.00237 Text en Copyright © 2014 Mitchell, Fan, Lane and Moseley. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Physiology
Mitchell, Joshua M.
Fan, Teresa W.-M.
Lane, Andrew N.
Moseley, Hunter N. B.
Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics
title Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics
title_full Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics
title_fullStr Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics
title_full_unstemmed Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics
title_short Development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics
title_sort development and in silico evaluation of large-scale metabolite identification methods using functional group detection for metabolomics
topic Physiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4112935/
https://www.ncbi.nlm.nih.gov/pubmed/25120557
http://dx.doi.org/10.3389/fgene.2014.00237
work_keys_str_mv AT mitchelljoshuam developmentandinsilicoevaluationoflargescalemetaboliteidentificationmethodsusingfunctionalgroupdetectionformetabolomics
AT fanteresawm developmentandinsilicoevaluationoflargescalemetaboliteidentificationmethodsusingfunctionalgroupdetectionformetabolomics
AT laneandrewn developmentandinsilicoevaluationoflargescalemetaboliteidentificationmethodsusingfunctionalgroupdetectionformetabolomics
AT moseleyhunternb developmentandinsilicoevaluationoflargescalemetaboliteidentificationmethodsusingfunctionalgroupdetectionformetabolomics