AromaDeg, a novel database for phylogenomics of aerobic bacterial degradation of aromatics

Understanding prokaryotic transformation of recalcitrant pollutants and the in-situ metabolic nets requires the integration of massive amounts of biological data. Decades of biochemical studies together with novel next-generation sequencing data have exponentially increased information on aerobic aro...

Bibliographic Details
Main authors: Duarte, Márcia; Jauregui, Ruy; Vilchez-Vargas, Ramiro; Junca, Howard; Pieper, Dietmar H.
Format: Online Article Text
Language: English
Published: Oxford University Press 2014
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4250580/
https://www.ncbi.nlm.nih.gov/pubmed/25468931
http://dx.doi.org/10.1093/database/bau118
author Duarte, Márcia
Jauregui, Ruy
Vilchez-Vargas, Ramiro
Junca, Howard
Pieper, Dietmar H.
collection PubMed
description Understanding prokaryotic transformation of recalcitrant pollutants and the in-situ metabolic nets requires the integration of massive amounts of biological data. Decades of biochemical studies together with novel next-generation sequencing data have exponentially increased information on aerobic aromatic degradation pathways. However, the majority of protein sequences in public databases have not been experimentally characterized, and homology-based methods are still the most routinely used approach to assign protein function, allowing the propagation of misannotations. AromaDeg is a web-based resource targeting aerobic degradation of aromatics that comprises recently updated (September 2013) and manually curated databases constructed using a phylogenomic approach. Grounded in phylogenetic analyses of protein sequences of key catabolic protein families and of proteins of documented function, AromaDeg allows query and data mining of novel genomic, metagenomic or metatranscriptomic data sets. Essentially, each query sequence that matches a given protein family of AromaDeg is associated with a specific cluster of a given phylogenetic tree, and further function annotation and/or substrate specificity may be inferred from the neighboring cluster members with experimentally validated function. This allows a detailed characterization of individual protein superfamilies as well as high-throughput functional classifications. Thus, AromaDeg addresses the deficiencies of homology-based protein function prediction, combining phylogenetic tree construction and integration of experimental data to obtain more accurate annotations of new biological data related to aerobic aromatic biodegradation pathways. In the future, we plan to expand AromaDeg to other enzyme families involved in aromatic degradation and to update it regularly. Database URL: http://aromadeg.siona.helmholtz-hzi.de
format Online
Article
Text
id pubmed-4250580
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-4250580 2014-12-04 AromaDeg, a novel database for phylogenomics of aerobic bacterial degradation of aromatics Duarte, Márcia; Jauregui, Ruy; Vilchez-Vargas, Ramiro; Junca, Howard; Pieper, Dietmar H. Database (Oxford) Original Article Oxford University Press 2014-12-01 /pmc/articles/PMC4250580/ /pubmed/25468931 http://dx.doi.org/10.1093/database/bau118 Text en © The Author(s) 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title AromaDeg, a novel database for phylogenomics of aerobic bacterial degradation of aromatics
topic Original Article