Cargando…

Automated generation of genome-scale metabolic draft reconstructions based on KEGG

BACKGROUND: Constraint-based modeling is a widely used and powerful methodology to assess the metabolic phenotypes and capabilities of an organism. The starting point and cornerstone of all such modeling is a genome-scale metabolic network reconstruction. The creation, further development, and appli...

Descripción completa

Detalles Bibliográficos
Autores principales: Karlsen, Emil, Schulz, Christian, Almaas, Eivind
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6280343/
https://www.ncbi.nlm.nih.gov/pubmed/30514205
http://dx.doi.org/10.1186/s12859-018-2472-z
_version_ 1783378648440504320
author Karlsen, Emil
Schulz, Christian
Almaas, Eivind
author_facet Karlsen, Emil
Schulz, Christian
Almaas, Eivind
author_sort Karlsen, Emil
collection PubMed
description BACKGROUND: Constraint-based modeling is a widely used and powerful methodology to assess the metabolic phenotypes and capabilities of an organism. The starting point and cornerstone of all such modeling is a genome-scale metabolic network reconstruction. The creation, further development, and application of such networks is a growing field of research thanks to a plethora of readily accessible computational tools. While the majority of studies are focused on single-species analyses, typically of a microbe, the computational study of communities of organisms is gaining attention. Similarly, reconstructions that are unified for a multi-cellular organism have gained in popularity. Consequently, the rapid generation of genome-scale metabolic reconstructed networks is crucial. While multiple web-based or stand-alone tools are available for automated network reconstruction, there is, however, currently no publicly available tool that allows the swift assembly of draft reconstructions of community metabolic networks and consolidated metabolic networks for a specified list of organisms. RESULTS: Here, we present AutoKEGGRec, an automated tool that creates first draft metabolic network reconstructions of single organisms, community reconstructions based on a list of organisms, and finally a consolidated reconstruction for a list of organisms or strains. AutoKEGGRec is developed in Matlab and works seamlessly with the COBRA Toolbox v3, and it is based on only using the KEGG database as external input. The generated first draft reconstructions are stored in SBML files and consist of all reactions for a KEGG organism ID and corresponding linked genes. This provides a comprehensive starting point for further refinement and curation using the host of COBRA toolbox functions or other preferred tools. Through the data structures created, the tool also facilitates a comparative analysis of metabolic content in any given number of organisms present in the KEGG database. CONCLUSION: AutoKEGGRec provides a first step in a metabolic network reconstruction process, filling a gap for tools creating community and consolidated metabolic networks. Based only on KEGG data as external input, the generated reconstructions consist of data with a directly traceable foundation and pedigree. With AutoKEGGRec, this kind of modeling is made accessible to a wider part of the genome-scale metabolic analysis community. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2472-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6280343
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62803432018-12-10 Automated generation of genome-scale metabolic draft reconstructions based on KEGG Karlsen, Emil Schulz, Christian Almaas, Eivind BMC Bioinformatics Software BACKGROUND: Constraint-based modeling is a widely used and powerful methodology to assess the metabolic phenotypes and capabilities of an organism. The starting point and cornerstone of all such modeling is a genome-scale metabolic network reconstruction. The creation, further development, and application of such networks is a growing field of research thanks to a plethora of readily accessible computational tools. While the majority of studies are focused on single-species analyses, typically of a microbe, the computational study of communities of organisms is gaining attention. Similarly, reconstructions that are unified for a multi-cellular organism have gained in popularity. Consequently, the rapid generation of genome-scale metabolic reconstructed networks is crucial. While multiple web-based or stand-alone tools are available for automated network reconstruction, there is, however, currently no publicly available tool that allows the swift assembly of draft reconstructions of community metabolic networks and consolidated metabolic networks for a specified list of organisms. RESULTS: Here, we present AutoKEGGRec, an automated tool that creates first draft metabolic network reconstructions of single organisms, community reconstructions based on a list of organisms, and finally a consolidated reconstruction for a list of organisms or strains. AutoKEGGRec is developed in Matlab and works seamlessly with the COBRA Toolbox v3, and it is based on only using the KEGG database as external input. The generated first draft reconstructions are stored in SBML files and consist of all reactions for a KEGG organism ID and corresponding linked genes. This provides a comprehensive starting point for further refinement and curation using the host of COBRA toolbox functions or other preferred tools. Through the data structures created, the tool also facilitates a comparative analysis of metabolic content in any given number of organisms present in the KEGG database. CONCLUSION: AutoKEGGRec provides a first step in a metabolic network reconstruction process, filling a gap for tools creating community and consolidated metabolic networks. Based only on KEGG data as external input, the generated reconstructions consist of data with a directly traceable foundation and pedigree. With AutoKEGGRec, this kind of modeling is made accessible to a wider part of the genome-scale metabolic analysis community. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2472-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-04 /pmc/articles/PMC6280343/ /pubmed/30514205 http://dx.doi.org/10.1186/s12859-018-2472-z Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Karlsen, Emil
Schulz, Christian
Almaas, Eivind
Automated generation of genome-scale metabolic draft reconstructions based on KEGG
title Automated generation of genome-scale metabolic draft reconstructions based on KEGG
title_full Automated generation of genome-scale metabolic draft reconstructions based on KEGG
title_fullStr Automated generation of genome-scale metabolic draft reconstructions based on KEGG
title_full_unstemmed Automated generation of genome-scale metabolic draft reconstructions based on KEGG
title_short Automated generation of genome-scale metabolic draft reconstructions based on KEGG
title_sort automated generation of genome-scale metabolic draft reconstructions based on kegg
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6280343/
https://www.ncbi.nlm.nih.gov/pubmed/30514205
http://dx.doi.org/10.1186/s12859-018-2472-z
work_keys_str_mv AT karlsenemil automatedgenerationofgenomescalemetabolicdraftreconstructionsbasedonkegg
AT schulzchristian automatedgenerationofgenomescalemetabolicdraftreconstructionsbasedonkegg
AT almaaseivind automatedgenerationofgenomescalemetabolicdraftreconstructionsbasedonkegg