Cargando…
Toward the automated generation of genome-scale metabolic networks in the SEED
BACKGROUND: Current methods for the automated generation of genome-scale metabolic networks focus on genome annotation and preliminary biochemical reaction network assembly, but do not adequately address the process of identifying and filling gaps in the reaction network, and verifying that the netw...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1868769/ https://www.ncbi.nlm.nih.gov/pubmed/17462086 http://dx.doi.org/10.1186/1471-2105-8-139 |
_version_ | 1782133404267970560 |
---|---|
author | DeJongh, Matthew Formsma, Kevin Boillot, Paul Gould, John Rycenga, Matthew Best, Aaron |
author_facet | DeJongh, Matthew Formsma, Kevin Boillot, Paul Gould, John Rycenga, Matthew Best, Aaron |
author_sort | DeJongh, Matthew |
collection | PubMed |
description | BACKGROUND: Current methods for the automated generation of genome-scale metabolic networks focus on genome annotation and preliminary biochemical reaction network assembly, but do not adequately address the process of identifying and filling gaps in the reaction network, and verifying that the network is suitable for systems level analysis. Thus, current methods are only sufficient for generating draft-quality networks, and refinement of the reaction network is still largely a manual, labor-intensive process. RESULTS: We have developed a method for generating genome-scale metabolic networks that produces substantially complete reaction networks, suitable for systems level analysis. Our method partitions the reaction space of central and intermediary metabolism into discrete, interconnected components that can be assembled and verified in isolation from each other, and then integrated and verified at the level of their interconnectivity. We have developed a database of components that are common across organisms, and have created tools for automatically assembling appropriate components for a particular organism based on the metabolic pathways encoded in the organism's genome. This focuses manual efforts on that portion of an organism's metabolism that is not yet represented in the database. We have demonstrated the efficacy of our method by reverse-engineering and automatically regenerating the reaction network from a published genome-scale metabolic model for Staphylococcus aureus. Additionally, we have verified that our method capitalizes on the database of common reaction network components created for S. aureus, by using these components to generate substantially complete reconstructions of the reaction networks from three other published metabolic models (Escherichia coli, Helicobacter pylori, and Lactococcus lactis). We have implemented our tools and database within the SEED, an open-source software environment for comparative genome annotation and analysis. CONCLUSION: Our method sets the stage for the automated generation of substantially complete metabolic networks for over 400 complete genome sequences currently in the SEED. With each genome that is processed using our tools, the database of common components grows to cover more of the diversity of metabolic pathways. This increases the likelihood that components of reaction networks for subsequently processed genomes can be retrieved from the database, rather than assembled and verified manually. |
format | Text |
id | pubmed-1868769 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-18687692007-05-15 Toward the automated generation of genome-scale metabolic networks in the SEED DeJongh, Matthew Formsma, Kevin Boillot, Paul Gould, John Rycenga, Matthew Best, Aaron BMC Bioinformatics Research Article BACKGROUND: Current methods for the automated generation of genome-scale metabolic networks focus on genome annotation and preliminary biochemical reaction network assembly, but do not adequately address the process of identifying and filling gaps in the reaction network, and verifying that the network is suitable for systems level analysis. Thus, current methods are only sufficient for generating draft-quality networks, and refinement of the reaction network is still largely a manual, labor-intensive process. RESULTS: We have developed a method for generating genome-scale metabolic networks that produces substantially complete reaction networks, suitable for systems level analysis. Our method partitions the reaction space of central and intermediary metabolism into discrete, interconnected components that can be assembled and verified in isolation from each other, and then integrated and verified at the level of their interconnectivity. We have developed a database of components that are common across organisms, and have created tools for automatically assembling appropriate components for a particular organism based on the metabolic pathways encoded in the organism's genome. This focuses manual efforts on that portion of an organism's metabolism that is not yet represented in the database. We have demonstrated the efficacy of our method by reverse-engineering and automatically regenerating the reaction network from a published genome-scale metabolic model for Staphylococcus aureus. Additionally, we have verified that our method capitalizes on the database of common reaction network components created for S. aureus, by using these components to generate substantially complete reconstructions of the reaction networks from three other published metabolic models (Escherichia coli, Helicobacter pylori, and Lactococcus lactis). We have implemented our tools and database within the SEED, an open-source software environment for comparative genome annotation and analysis. CONCLUSION: Our method sets the stage for the automated generation of substantially complete metabolic networks for over 400 complete genome sequences currently in the SEED. With each genome that is processed using our tools, the database of common components grows to cover more of the diversity of metabolic pathways. This increases the likelihood that components of reaction networks for subsequently processed genomes can be retrieved from the database, rather than assembled and verified manually. BioMed Central 2007-04-26 /pmc/articles/PMC1868769/ /pubmed/17462086 http://dx.doi.org/10.1186/1471-2105-8-139 Text en Copyright © 2007 DeJongh et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article DeJongh, Matthew Formsma, Kevin Boillot, Paul Gould, John Rycenga, Matthew Best, Aaron Toward the automated generation of genome-scale metabolic networks in the SEED |
title | Toward the automated generation of genome-scale metabolic networks in the SEED |
title_full | Toward the automated generation of genome-scale metabolic networks in the SEED |
title_fullStr | Toward the automated generation of genome-scale metabolic networks in the SEED |
title_full_unstemmed | Toward the automated generation of genome-scale metabolic networks in the SEED |
title_short | Toward the automated generation of genome-scale metabolic networks in the SEED |
title_sort | toward the automated generation of genome-scale metabolic networks in the seed |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1868769/ https://www.ncbi.nlm.nih.gov/pubmed/17462086 http://dx.doi.org/10.1186/1471-2105-8-139 |
work_keys_str_mv | AT dejonghmatthew towardtheautomatedgenerationofgenomescalemetabolicnetworksintheseed AT formsmakevin towardtheautomatedgenerationofgenomescalemetabolicnetworksintheseed AT boillotpaul towardtheautomatedgenerationofgenomescalemetabolicnetworksintheseed AT gouldjohn towardtheautomatedgenerationofgenomescalemetabolicnetworksintheseed AT rycengamatthew towardtheautomatedgenerationofgenomescalemetabolicnetworksintheseed AT bestaaron towardtheautomatedgenerationofgenomescalemetabolicnetworksintheseed |