Cargando…

EC2KEGG: a command line tool for comparison of metabolic pathways

BACKGROUND: Next-generation sequencing and metagenome projects yield a large number of new genomes that need further annotations, such as identification of enzymes and metabolic pathways, or analysis of metabolic strategies of newly sequenced species in comparison to known organisms. While methods f...

Descripción completa

Detalles Bibliográficos
Autor principal: Porollo, Aleksey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4157228/
https://www.ncbi.nlm.nih.gov/pubmed/25202338
http://dx.doi.org/10.1186/1751-0473-9-19
_version_ 1782333836218073088
author Porollo, Aleksey
author_facet Porollo, Aleksey
author_sort Porollo, Aleksey
collection PubMed
description BACKGROUND: Next-generation sequencing and metagenome projects yield a large number of new genomes that need further annotations, such as identification of enzymes and metabolic pathways, or analysis of metabolic strategies of newly sequenced species in comparison to known organisms. While methods for enzyme identification are available, development of the command line tools for high-throughput comparative analysis and visualization of identified enzymes is lagging. METHODS: A set of perl scripts has been developed to perform automated data retrieval from the KEGG database using its new REST program application interface. Enrichment or depletion in metabolic pathways is evaluated using the two-tailed Fisher exact test followed by Benjamini and Hochberg correction. RESULTS: Comparative analysis of a given set of enzymes with a specified reference organism includes mapping to known metabolic pathways, finding shared and unique enzymes, generating links to visualize maps at KEGG Pathway, computing enrichment of the pathways, listing the non-mapped enzymes. CONCLUSIONS: EC2KEGG provides a platform independent toolkit for automated comparison of identified sets of enzymes from newly sequenced organisms against annotated reference genomes. The tool can be used both for manual annotations of individual species and for high-throughput annotations as part of a computational pipeline. The tool is publicly available at http://sourceforge.net/projects/ec2kegg/.
format Online
Article
Text
id pubmed-4157228
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41572282014-09-09 EC2KEGG: a command line tool for comparison of metabolic pathways Porollo, Aleksey Source Code Biol Med Research BACKGROUND: Next-generation sequencing and metagenome projects yield a large number of new genomes that need further annotations, such as identification of enzymes and metabolic pathways, or analysis of metabolic strategies of newly sequenced species in comparison to known organisms. While methods for enzyme identification are available, development of the command line tools for high-throughput comparative analysis and visualization of identified enzymes is lagging. METHODS: A set of perl scripts has been developed to perform automated data retrieval from the KEGG database using its new REST program application interface. Enrichment or depletion in metabolic pathways is evaluated using the two-tailed Fisher exact test followed by Benjamini and Hochberg correction. RESULTS: Comparative analysis of a given set of enzymes with a specified reference organism includes mapping to known metabolic pathways, finding shared and unique enzymes, generating links to visualize maps at KEGG Pathway, computing enrichment of the pathways, listing the non-mapped enzymes. CONCLUSIONS: EC2KEGG provides a platform independent toolkit for automated comparison of identified sets of enzymes from newly sequenced organisms against annotated reference genomes. The tool can be used both for manual annotations of individual species and for high-throughput annotations as part of a computational pipeline. The tool is publicly available at http://sourceforge.net/projects/ec2kegg/. BioMed Central 2014-09-02 /pmc/articles/PMC4157228/ /pubmed/25202338 http://dx.doi.org/10.1186/1751-0473-9-19 Text en Copyright © 2014 Porollo; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Porollo, Aleksey
EC2KEGG: a command line tool for comparison of metabolic pathways
title EC2KEGG: a command line tool for comparison of metabolic pathways
title_full EC2KEGG: a command line tool for comparison of metabolic pathways
title_fullStr EC2KEGG: a command line tool for comparison of metabolic pathways
title_full_unstemmed EC2KEGG: a command line tool for comparison of metabolic pathways
title_short EC2KEGG: a command line tool for comparison of metabolic pathways
title_sort ec2kegg: a command line tool for comparison of metabolic pathways
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4157228/
https://www.ncbi.nlm.nih.gov/pubmed/25202338
http://dx.doi.org/10.1186/1751-0473-9-19
work_keys_str_mv AT porolloaleksey ec2keggacommandlinetoolforcomparisonofmetabolicpathways