Cargando…
Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases
Metabolic models have been proven to be useful tools in system biology and have been successfully applied to various research fields in a wide range of organisms. A relatively complete metabolic network is a prerequisite for deriving reliable metabolic models. The first step in constructing metaboli...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8307411/ https://www.ncbi.nlm.nih.gov/pubmed/34209357 http://dx.doi.org/10.3390/metabo11070431 |
_version_ | 1783728040102068224 |
---|---|
author | Jin, Huan Moseley, Hunter N. B. |
author_facet | Jin, Huan Moseley, Hunter N. B. |
author_sort | Jin, Huan |
collection | PubMed |
description | Metabolic models have been proven to be useful tools in system biology and have been successfully applied to various research fields in a wide range of organisms. A relatively complete metabolic network is a prerequisite for deriving reliable metabolic models. The first step in constructing metabolic network is to harmonize compounds and reactions across different metabolic databases. However, effectively integrating data from various sources still remains a big challenge. Incomplete and inconsistent atomistic details in compound representations across databases is a very important limiting factor. Here, we optimized a subgraph isomorphism detection algorithm to validate generic compound pairs. Moreover, we defined a set of harmonization relationship types between compounds to deal with inconsistent chemical details while successfully capturing atom-level characteristics, enabling a more complete enabling compound harmonization across metabolic databases. In total, 15,704 compound pairs across KEGG (Kyoto Encyclopedia of Genes and Genomes) and MetaCyc databases were detected. Furthermore, utilizing the classification of compound pairs and EC (Enzyme Commission) numbers of reactions, we established hierarchical relationships between metabolic reactions, enabling the harmonization of 3856 reaction pairs. In addition, we created and used atom-specific identifiers to evaluate the consistency of atom mappings within and between harmonized reactions, detecting some consistency issues between the reaction and compound descriptions in these metabolic databases. |
format | Online Article Text |
id | pubmed-8307411 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83074112021-07-25 Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases Jin, Huan Moseley, Hunter N. B. Metabolites Article Metabolic models have been proven to be useful tools in system biology and have been successfully applied to various research fields in a wide range of organisms. A relatively complete metabolic network is a prerequisite for deriving reliable metabolic models. The first step in constructing metabolic network is to harmonize compounds and reactions across different metabolic databases. However, effectively integrating data from various sources still remains a big challenge. Incomplete and inconsistent atomistic details in compound representations across databases is a very important limiting factor. Here, we optimized a subgraph isomorphism detection algorithm to validate generic compound pairs. Moreover, we defined a set of harmonization relationship types between compounds to deal with inconsistent chemical details while successfully capturing atom-level characteristics, enabling a more complete enabling compound harmonization across metabolic databases. In total, 15,704 compound pairs across KEGG (Kyoto Encyclopedia of Genes and Genomes) and MetaCyc databases were detected. Furthermore, utilizing the classification of compound pairs and EC (Enzyme Commission) numbers of reactions, we established hierarchical relationships between metabolic reactions, enabling the harmonization of 3856 reaction pairs. In addition, we created and used atom-specific identifiers to evaluate the consistency of atom mappings within and between harmonized reactions, detecting some consistency issues between the reaction and compound descriptions in these metabolic databases. MDPI 2021-06-30 /pmc/articles/PMC8307411/ /pubmed/34209357 http://dx.doi.org/10.3390/metabo11070431 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jin, Huan Moseley, Hunter N. B. Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases |
title | Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases |
title_full | Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases |
title_fullStr | Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases |
title_full_unstemmed | Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases |
title_short | Hierarchical Harmonization of Atom-Resolved Metabolic Reactions across Metabolic Databases |
title_sort | hierarchical harmonization of atom-resolved metabolic reactions across metabolic databases |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8307411/ https://www.ncbi.nlm.nih.gov/pubmed/34209357 http://dx.doi.org/10.3390/metabo11070431 |
work_keys_str_mv | AT jinhuan hierarchicalharmonizationofatomresolvedmetabolicreactionsacrossmetabolicdatabases AT moseleyhunternb hierarchicalharmonizationofatomresolvedmetabolicreactionsacrossmetabolicdatabases |