Cargando…

Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling

Genome-scale metabolic models (GEMs) are manually curated repositories describing the metabolic capabilities of an organism. GEMs have been successfully used in different research areas, ranging from systems medicine to biotechnology. However, the different naming conventions (namespaces) of databas...

Descripción completa

Detalles Bibliográficos
Autores principales: Pham, Nhung, van Heck, Ruben G. A., van Dam, Jesse C. J., Schaap, Peter J., Saccenti, Edoardo, Suarez-Diez, Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6409771/
https://www.ncbi.nlm.nih.gov/pubmed/30736318
http://dx.doi.org/10.3390/metabo9020028
_version_ 1783402062086668288
author Pham, Nhung
van Heck, Ruben G. A.
van Dam, Jesse C. J.
Schaap, Peter J.
Saccenti, Edoardo
Suarez-Diez, Maria
author_facet Pham, Nhung
van Heck, Ruben G. A.
van Dam, Jesse C. J.
Schaap, Peter J.
Saccenti, Edoardo
Suarez-Diez, Maria
author_sort Pham, Nhung
collection PubMed
description Genome-scale metabolic models (GEMs) are manually curated repositories describing the metabolic capabilities of an organism. GEMs have been successfully used in different research areas, ranging from systems medicine to biotechnology. However, the different naming conventions (namespaces) of databases used to build GEMs limit model reusability and prevent the integration of existing models. This problem is known in the GEM community, but its extent has not been analyzed in depth. In this study, we investigate the name ambiguity and the multiplicity of non-systematic identifiers and we highlight the (in)consistency in their use in 11 biochemical databases of biochemical reactions and the problems that arise when mapping between different namespaces and databases. We found that such inconsistencies can be as high as 83.1%, thus emphasizing the need for strategies to deal with these issues. Currently, manual verification of the mappings appears to be the only solution to remove inconsistencies when combining models. Finally, we discuss several possible approaches to facilitate (future) unambiguous mapping.
format Online
Article
Text
id pubmed-6409771
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-64097712019-03-22 Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling Pham, Nhung van Heck, Ruben G. A. van Dam, Jesse C. J. Schaap, Peter J. Saccenti, Edoardo Suarez-Diez, Maria Metabolites Article Genome-scale metabolic models (GEMs) are manually curated repositories describing the metabolic capabilities of an organism. GEMs have been successfully used in different research areas, ranging from systems medicine to biotechnology. However, the different naming conventions (namespaces) of databases used to build GEMs limit model reusability and prevent the integration of existing models. This problem is known in the GEM community, but its extent has not been analyzed in depth. In this study, we investigate the name ambiguity and the multiplicity of non-systematic identifiers and we highlight the (in)consistency in their use in 11 biochemical databases of biochemical reactions and the problems that arise when mapping between different namespaces and databases. We found that such inconsistencies can be as high as 83.1%, thus emphasizing the need for strategies to deal with these issues. Currently, manual verification of the mappings appears to be the only solution to remove inconsistencies when combining models. Finally, we discuss several possible approaches to facilitate (future) unambiguous mapping. MDPI 2019-02-06 /pmc/articles/PMC6409771/ /pubmed/30736318 http://dx.doi.org/10.3390/metabo9020028 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pham, Nhung
van Heck, Ruben G. A.
van Dam, Jesse C. J.
Schaap, Peter J.
Saccenti, Edoardo
Suarez-Diez, Maria
Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling
title Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling
title_full Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling
title_fullStr Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling
title_full_unstemmed Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling
title_short Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling
title_sort consistency, inconsistency, and ambiguity of metabolite names in biochemical databases used for genome-scale metabolic modelling
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6409771/
https://www.ncbi.nlm.nih.gov/pubmed/30736318
http://dx.doi.org/10.3390/metabo9020028
work_keys_str_mv AT phamnhung consistencyinconsistencyandambiguityofmetabolitenamesinbiochemicaldatabasesusedforgenomescalemetabolicmodelling
AT vanheckrubenga consistencyinconsistencyandambiguityofmetabolitenamesinbiochemicaldatabasesusedforgenomescalemetabolicmodelling
AT vandamjessecj consistencyinconsistencyandambiguityofmetabolitenamesinbiochemicaldatabasesusedforgenomescalemetabolicmodelling
AT schaappeterj consistencyinconsistencyandambiguityofmetabolitenamesinbiochemicaldatabasesusedforgenomescalemetabolicmodelling
AT saccentiedoardo consistencyinconsistencyandambiguityofmetabolitenamesinbiochemicaldatabasesusedforgenomescalemetabolicmodelling
AT suarezdiezmaria consistencyinconsistencyandambiguityofmetabolitenamesinbiochemicaldatabasesusedforgenomescalemetabolicmodelling