Cargando…

KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion

BACKGROUND: The rapid accumulation of sequencing data from metagenomic studies is enabling the generation of huge collections of microbial genomes, with new challenges for mapping their functional potential. In particular, metagenome-assembled genomes are typically incomplete and harbor partial gene...

Descripción completa

Detalles Bibliográficos
Autores principales: Palù, Matteo, Basile, Arianna, Zampieri, Guido, Treu, Laura, Rossi, Alessandro, Morlino, Maria Silvia, Campanaro, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8976094/
https://www.ncbi.nlm.nih.gov/pubmed/35422973
http://dx.doi.org/10.1016/j.csbj.2022.03.015
_version_ 1784680490915069952
author Palù, Matteo
Basile, Arianna
Zampieri, Guido
Treu, Laura
Rossi, Alessandro
Morlino, Maria Silvia
Campanaro, Stefano
author_facet Palù, Matteo
Basile, Arianna
Zampieri, Guido
Treu, Laura
Rossi, Alessandro
Morlino, Maria Silvia
Campanaro, Stefano
author_sort Palù, Matteo
collection PubMed
description BACKGROUND: The rapid accumulation of sequencing data from metagenomic studies is enabling the generation of huge collections of microbial genomes, with new challenges for mapping their functional potential. In particular, metagenome-assembled genomes are typically incomplete and harbor partial gene sequences that can limit their annotation from traditional tools. New scalable solutions are thus needed to facilitate the evaluation of functional potential in microbial genomes. METHODS: To resolve annotation gaps in microbial genomes, we developed KEMET, an open-source Python library devised for the analysis of Kyoto Encyclopedia of Genes and Genomes (KEGG) functional units. KEMET focuses on the in-depth analysis of metabolic reaction networks to identify missing orthologs through hidden Markov model profiles. RESULTS: We evaluate the potential of KEMET for expanding functional annotations by simulating the effect of assembly issues on real gene sequences and showing that our approach can identify missing KEGG orthologs. Additionally, we show that recovered gene annotations can sensibly increase the quality of draft genome-scale metabolic models obtained from metagenome-assembled genomes, in some cases reaching the accuracy of models generated from complete genomes. CONCLUSIONS: KEMET therefore allows expanding genome annotations by targeted searches for orthologous sequences, enabling a better qualitative and quantitative assessment of metabolic capabilities in novel microbial organisms.
format Online
Article
Text
id pubmed-8976094
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-89760942022-04-13 KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion Palù, Matteo Basile, Arianna Zampieri, Guido Treu, Laura Rossi, Alessandro Morlino, Maria Silvia Campanaro, Stefano Comput Struct Biotechnol J Short Communication BACKGROUND: The rapid accumulation of sequencing data from metagenomic studies is enabling the generation of huge collections of microbial genomes, with new challenges for mapping their functional potential. In particular, metagenome-assembled genomes are typically incomplete and harbor partial gene sequences that can limit their annotation from traditional tools. New scalable solutions are thus needed to facilitate the evaluation of functional potential in microbial genomes. METHODS: To resolve annotation gaps in microbial genomes, we developed KEMET, an open-source Python library devised for the analysis of Kyoto Encyclopedia of Genes and Genomes (KEGG) functional units. KEMET focuses on the in-depth analysis of metabolic reaction networks to identify missing orthologs through hidden Markov model profiles. RESULTS: We evaluate the potential of KEMET for expanding functional annotations by simulating the effect of assembly issues on real gene sequences and showing that our approach can identify missing KEGG orthologs. Additionally, we show that recovered gene annotations can sensibly increase the quality of draft genome-scale metabolic models obtained from metagenome-assembled genomes, in some cases reaching the accuracy of models generated from complete genomes. CONCLUSIONS: KEMET therefore allows expanding genome annotations by targeted searches for orthologous sequences, enabling a better qualitative and quantitative assessment of metabolic capabilities in novel microbial organisms. Research Network of Computational and Structural Biotechnology 2022-03-26 /pmc/articles/PMC8976094/ /pubmed/35422973 http://dx.doi.org/10.1016/j.csbj.2022.03.015 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Short Communication
Palù, Matteo
Basile, Arianna
Zampieri, Guido
Treu, Laura
Rossi, Alessandro
Morlino, Maria Silvia
Campanaro, Stefano
KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion
title KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion
title_full KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion
title_fullStr KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion
title_full_unstemmed KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion
title_short KEMET – A python tool for KEGG Module evaluation and microbial genome annotation expansion
title_sort kemet – a python tool for kegg module evaluation and microbial genome annotation expansion
topic Short Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8976094/
https://www.ncbi.nlm.nih.gov/pubmed/35422973
http://dx.doi.org/10.1016/j.csbj.2022.03.015
work_keys_str_mv AT palumatteo kemetapythontoolforkeggmoduleevaluationandmicrobialgenomeannotationexpansion
AT basilearianna kemetapythontoolforkeggmoduleevaluationandmicrobialgenomeannotationexpansion
AT zampieriguido kemetapythontoolforkeggmoduleevaluationandmicrobialgenomeannotationexpansion
AT treulaura kemetapythontoolforkeggmoduleevaluationandmicrobialgenomeannotationexpansion
AT rossialessandro kemetapythontoolforkeggmoduleevaluationandmicrobialgenomeannotationexpansion
AT morlinomariasilvia kemetapythontoolforkeggmoduleevaluationandmicrobialgenomeannotationexpansion
AT campanarostefano kemetapythontoolforkeggmoduleevaluationandmicrobialgenomeannotationexpansion