Cargando…

Combining multiple functional annotation tools increases coverage of metabolic annotation

BACKGROUND: Genome-scale metabolic modeling is a cornerstone of systems biology analysis of microbial organisms and communities, yet these genome-scale modeling efforts are invariably based on incomplete functional annotations. Annotated genomes typically contain 30–50% of genes without functional a...

Descripción completa

Detalles Bibliográficos
Autores principales: Griesemer, Marc, Kimbrel, Jeffrey A., Zhou, Carol E., Navid, Ali, D’haeseleer, Patrik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6299973/
https://www.ncbi.nlm.nih.gov/pubmed/30567498
http://dx.doi.org/10.1186/s12864-018-5221-9
_version_ 1783381598171824128
author Griesemer, Marc
Kimbrel, Jeffrey A.
Zhou, Carol E.
Navid, Ali
D’haeseleer, Patrik
author_facet Griesemer, Marc
Kimbrel, Jeffrey A.
Zhou, Carol E.
Navid, Ali
D’haeseleer, Patrik
author_sort Griesemer, Marc
collection PubMed
description BACKGROUND: Genome-scale metabolic modeling is a cornerstone of systems biology analysis of microbial organisms and communities, yet these genome-scale modeling efforts are invariably based on incomplete functional annotations. Annotated genomes typically contain 30–50% of genes without functional annotation, severely limiting our knowledge of the “parts lists” that the organisms have at their disposal. These incomplete annotations may be sufficient to derive a model of a core set of well-studied metabolic pathways that support growth in pure culture. However, pathways important for growth on unusual metabolites exchanged in complex microbial communities are often less understood, resulting in missing functional annotations in newly sequenced genomes. RESULTS: Here, we present results on a comprehensive reannotation of 27 bacterial reference genomes, focusing on enzymes with EC numbers annotated by KEGG, RAST, EFICAz, and the BRENDA enzyme database, and on membrane transport annotations by TransportDB, KEGG and RAST. Our analysis shows that annotation using multiple tools can result in a drastically larger metabolic network reconstruction, adding on average 40% more EC numbers, 3–8 times more substrate-specific transporters, and 37% more metabolic genes. These results are even more pronounced for bacterial species that are phylogenetically distant from well-studied model organisms such as E. coli. CONCLUSIONS: Metabolic annotations are often incomplete and inconsistent. Combining multiple functional annotation tools can greatly improve genome coverage and metabolic network size, especially for non-model organisms and non-core pathways. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5221-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6299973
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62999732018-12-20 Combining multiple functional annotation tools increases coverage of metabolic annotation Griesemer, Marc Kimbrel, Jeffrey A. Zhou, Carol E. Navid, Ali D’haeseleer, Patrik BMC Genomics Research Article BACKGROUND: Genome-scale metabolic modeling is a cornerstone of systems biology analysis of microbial organisms and communities, yet these genome-scale modeling efforts are invariably based on incomplete functional annotations. Annotated genomes typically contain 30–50% of genes without functional annotation, severely limiting our knowledge of the “parts lists” that the organisms have at their disposal. These incomplete annotations may be sufficient to derive a model of a core set of well-studied metabolic pathways that support growth in pure culture. However, pathways important for growth on unusual metabolites exchanged in complex microbial communities are often less understood, resulting in missing functional annotations in newly sequenced genomes. RESULTS: Here, we present results on a comprehensive reannotation of 27 bacterial reference genomes, focusing on enzymes with EC numbers annotated by KEGG, RAST, EFICAz, and the BRENDA enzyme database, and on membrane transport annotations by TransportDB, KEGG and RAST. Our analysis shows that annotation using multiple tools can result in a drastically larger metabolic network reconstruction, adding on average 40% more EC numbers, 3–8 times more substrate-specific transporters, and 37% more metabolic genes. These results are even more pronounced for bacterial species that are phylogenetically distant from well-studied model organisms such as E. coli. CONCLUSIONS: Metabolic annotations are often incomplete and inconsistent. Combining multiple functional annotation tools can greatly improve genome coverage and metabolic network size, especially for non-model organisms and non-core pathways. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5221-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-12-19 /pmc/articles/PMC6299973/ /pubmed/30567498 http://dx.doi.org/10.1186/s12864-018-5221-9 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Griesemer, Marc
Kimbrel, Jeffrey A.
Zhou, Carol E.
Navid, Ali
D’haeseleer, Patrik
Combining multiple functional annotation tools increases coverage of metabolic annotation
title Combining multiple functional annotation tools increases coverage of metabolic annotation
title_full Combining multiple functional annotation tools increases coverage of metabolic annotation
title_fullStr Combining multiple functional annotation tools increases coverage of metabolic annotation
title_full_unstemmed Combining multiple functional annotation tools increases coverage of metabolic annotation
title_short Combining multiple functional annotation tools increases coverage of metabolic annotation
title_sort combining multiple functional annotation tools increases coverage of metabolic annotation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6299973/
https://www.ncbi.nlm.nih.gov/pubmed/30567498
http://dx.doi.org/10.1186/s12864-018-5221-9
work_keys_str_mv AT griesemermarc combiningmultiplefunctionalannotationtoolsincreasescoverageofmetabolicannotation
AT kimbreljeffreya combiningmultiplefunctionalannotationtoolsincreasescoverageofmetabolicannotation
AT zhoucarole combiningmultiplefunctionalannotationtoolsincreasescoverageofmetabolicannotation
AT navidali combiningmultiplefunctionalannotationtoolsincreasescoverageofmetabolicannotation
AT dhaeseleerpatrik combiningmultiplefunctionalannotationtoolsincreasescoverageofmetabolicannotation