Cargando…

Reactant pairs and reaction organization patterns produced by a new rule-based approach

OBJECTIVES: Improvements in bioinformatics applications for the enzyme identification of biochemical reactions, enzyme classifications, mining for specific inhibitors and pathfinding require the accurate computational detection of reaction similarity. We provide a set of substrate-product pairs, clu...

Descripción completa

Detalles Bibliográficos
Autores principales: Vazquez-Hernandez, Carlos, Loza, Antonio, Gutierrez-Rios, Rosa-Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6109353/
https://www.ncbi.nlm.nih.gov/pubmed/30143048
http://dx.doi.org/10.1186/s13104-018-3724-8
_version_ 1783350314334683136
author Vazquez-Hernandez, Carlos
Loza, Antonio
Gutierrez-Rios, Rosa-Maria
author_facet Vazquez-Hernandez, Carlos
Loza, Antonio
Gutierrez-Rios, Rosa-Maria
author_sort Vazquez-Hernandez, Carlos
collection PubMed
description OBJECTIVES: Improvements in bioinformatics applications for the enzyme identification of biochemical reactions, enzyme classifications, mining for specific inhibitors and pathfinding require the accurate computational detection of reaction similarity. We provide a set of substrate-product pairs, clustered by reactions that share similar chemical transformation patterns, for which accuracy was calculated, comparing this set with manually curated data sets. DATA DESCRIPTION: The data were analyzed by a new method that naturally split each reaction into compound pairs and loner compounds, which we called architectures (Vazquez-Hernandez et al. in BMC Syst Biol 12:63, 2018). The data include a set of 7491 curated reactions from the KEGG-Ligand data set. The data are presented in two formats, a string format and a tree structure, both of which reflect the splitting process and the final architectures of each reaction. We are also reporting sets of reactions that show similar splitting patterns naturally grouped into clusters of tree structures. The compound pairs in each cluster were compared with the reactant pairs proposed by the KEGG-RCLASS data set, and a match precision value is also provided. These data were collected with the aim of providing research with a confident set of reactant pairs that is useful for selecting between alternative substrate-product pairs predicted by pathfinders.
format Online
Article
Text
id pubmed-6109353
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61093532018-08-29 Reactant pairs and reaction organization patterns produced by a new rule-based approach Vazquez-Hernandez, Carlos Loza, Antonio Gutierrez-Rios, Rosa-Maria BMC Res Notes Data Note OBJECTIVES: Improvements in bioinformatics applications for the enzyme identification of biochemical reactions, enzyme classifications, mining for specific inhibitors and pathfinding require the accurate computational detection of reaction similarity. We provide a set of substrate-product pairs, clustered by reactions that share similar chemical transformation patterns, for which accuracy was calculated, comparing this set with manually curated data sets. DATA DESCRIPTION: The data were analyzed by a new method that naturally split each reaction into compound pairs and loner compounds, which we called architectures (Vazquez-Hernandez et al. in BMC Syst Biol 12:63, 2018). The data include a set of 7491 curated reactions from the KEGG-Ligand data set. The data are presented in two formats, a string format and a tree structure, both of which reflect the splitting process and the final architectures of each reaction. We are also reporting sets of reactions that show similar splitting patterns naturally grouped into clusters of tree structures. The compound pairs in each cluster were compared with the reactant pairs proposed by the KEGG-RCLASS data set, and a match precision value is also provided. These data were collected with the aim of providing research with a confident set of reactant pairs that is useful for selecting between alternative substrate-product pairs predicted by pathfinders. BioMed Central 2018-08-24 /pmc/articles/PMC6109353/ /pubmed/30143048 http://dx.doi.org/10.1186/s13104-018-3724-8 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Data Note
Vazquez-Hernandez, Carlos
Loza, Antonio
Gutierrez-Rios, Rosa-Maria
Reactant pairs and reaction organization patterns produced by a new rule-based approach
title Reactant pairs and reaction organization patterns produced by a new rule-based approach
title_full Reactant pairs and reaction organization patterns produced by a new rule-based approach
title_fullStr Reactant pairs and reaction organization patterns produced by a new rule-based approach
title_full_unstemmed Reactant pairs and reaction organization patterns produced by a new rule-based approach
title_short Reactant pairs and reaction organization patterns produced by a new rule-based approach
title_sort reactant pairs and reaction organization patterns produced by a new rule-based approach
topic Data Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6109353/
https://www.ncbi.nlm.nih.gov/pubmed/30143048
http://dx.doi.org/10.1186/s13104-018-3724-8
work_keys_str_mv AT vazquezhernandezcarlos reactantpairsandreactionorganizationpatternsproducedbyanewrulebasedapproach
AT lozaantonio reactantpairsandreactionorganizationpatternsproducedbyanewrulebasedapproach
AT gutierrezriosrosamaria reactantpairsandreactionorganizationpatternsproducedbyanewrulebasedapproach