Cargando…

Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2

BACKGROUND: An important step in the reconstruction of a metabolic network is annotation of metabolites. Metabolites are generally annotated with various database or structure based identifiers. Metabolite annotations in metabolic reconstructions may be incorrect or incomplete and thus need to be up...

Descripción completa

Detalles Bibliográficos
Autores principales: Haraldsdóttir, Hulda S, Thiele, Ines, Fleming, Ronan MT
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3917611/
https://www.ncbi.nlm.nih.gov/pubmed/24468196
http://dx.doi.org/10.1186/1758-2946-6-2
_version_ 1782302866258526208
author Haraldsdóttir, Hulda S
Thiele, Ines
Fleming, Ronan MT
author_facet Haraldsdóttir, Hulda S
Thiele, Ines
Fleming, Ronan MT
author_sort Haraldsdóttir, Hulda S
collection PubMed
description BACKGROUND: An important step in the reconstruction of a metabolic network is annotation of metabolites. Metabolites are generally annotated with various database or structure based identifiers. Metabolite annotations in metabolic reconstructions may be incorrect or incomplete and thus need to be updated prior to their use. Genome-scale metabolic reconstructions generally include hundreds of metabolites. Manually updating annotations is therefore highly laborious. This prompted us to look for open-source software applications that could facilitate automatic updating of annotations by mapping between available metabolite identifiers. We identified three applications developed for the metabolomics and chemical informatics communities as potential solutions. The applications were MetMask, the Chemical Translation System, and UniChem. The first implements a “metabolite masking” strategy for mapping between identifiers whereas the latter two implement different versions of an InChI based strategy. Here we evaluated the suitability of these applications for the task of mapping between metabolite identifiers in genome-scale metabolic reconstructions. We applied the best suited application to updating identifiers in Recon 2, the latest reconstruction of human metabolism. RESULTS: All three applications enabled partially automatic updating of metabolite identifiers, but significant manual effort was still required to fully update identifiers. We were able to reduce this manual effort by searching for new identifiers using multiple types of information about metabolites. When multiple types of information were combined, the Chemical Translation System enabled us to update over 3,500 metabolite identifiers in Recon 2. All but approximately 200 identifiers were updated automatically. CONCLUSIONS: We found that an InChI based application such as the Chemical Translation System was better suited to the task of mapping between metabolite identifiers in genome-scale metabolic reconstructions. We identified several features, however, that could be added to such an application in order to tailor it to this task.
format Online
Article
Text
id pubmed-3917611
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39176112014-02-24 Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2 Haraldsdóttir, Hulda S Thiele, Ines Fleming, Ronan MT J Cheminform Research Article BACKGROUND: An important step in the reconstruction of a metabolic network is annotation of metabolites. Metabolites are generally annotated with various database or structure based identifiers. Metabolite annotations in metabolic reconstructions may be incorrect or incomplete and thus need to be updated prior to their use. Genome-scale metabolic reconstructions generally include hundreds of metabolites. Manually updating annotations is therefore highly laborious. This prompted us to look for open-source software applications that could facilitate automatic updating of annotations by mapping between available metabolite identifiers. We identified three applications developed for the metabolomics and chemical informatics communities as potential solutions. The applications were MetMask, the Chemical Translation System, and UniChem. The first implements a “metabolite masking” strategy for mapping between identifiers whereas the latter two implement different versions of an InChI based strategy. Here we evaluated the suitability of these applications for the task of mapping between metabolite identifiers in genome-scale metabolic reconstructions. We applied the best suited application to updating identifiers in Recon 2, the latest reconstruction of human metabolism. RESULTS: All three applications enabled partially automatic updating of metabolite identifiers, but significant manual effort was still required to fully update identifiers. We were able to reduce this manual effort by searching for new identifiers using multiple types of information about metabolites. When multiple types of information were combined, the Chemical Translation System enabled us to update over 3,500 metabolite identifiers in Recon 2. All but approximately 200 identifiers were updated automatically. CONCLUSIONS: We found that an InChI based application such as the Chemical Translation System was better suited to the task of mapping between metabolite identifiers in genome-scale metabolic reconstructions. We identified several features, however, that could be added to such an application in order to tailor it to this task. BioMed Central 2014-01-27 /pmc/articles/PMC3917611/ /pubmed/24468196 http://dx.doi.org/10.1186/1758-2946-6-2 Text en Copyright © 2014 Haraldsdóttir et al.; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Haraldsdóttir, Hulda S
Thiele, Ines
Fleming, Ronan MT
Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2
title Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2
title_full Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2
title_fullStr Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2
title_full_unstemmed Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2
title_short Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2
title_sort comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to recon 2
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3917611/
https://www.ncbi.nlm.nih.gov/pubmed/24468196
http://dx.doi.org/10.1186/1758-2946-6-2
work_keys_str_mv AT haraldsdottirhuldas comparativeevaluationofopensourcesoftwareformappingbetweenmetaboliteidentifiersinmetabolicnetworkreconstructionsapplicationtorecon2
AT thieleines comparativeevaluationofopensourcesoftwareformappingbetweenmetaboliteidentifiersinmetabolicnetworkreconstructionsapplicationtorecon2
AT flemingronanmt comparativeevaluationofopensourcesoftwareformappingbetweenmetaboliteidentifiersinmetabolicnetworkreconstructionsapplicationtorecon2