Ecogenomic Perspectives on Domains of Unknown Function: Correlation-Based Exploration of Marine Metagenomes

BACKGROUND: The proportion of conserved DNA sequences with no clear function is steadily growing in bioinformatics databases. Studies of sequence and structural homology have indicated that many uncharacterized protein domain sequences are variants of functionally described domains. If these variant...

Bibliographic Details
Main authors: Buttigieg, Pier Luigi; Hankeln, Wolfgang; Kostadinov, Ivaylo; Kottmann, Renzo; Yilmaz, Pelin; Duhaime, Melissa Beth; Glöckner, Frank Oliver
Format: Online Article Text
Language: English
Published: Public Library of Science 2013
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3597751/
https://www.ncbi.nlm.nih.gov/pubmed/23516388
http://dx.doi.org/10.1371/journal.pone.0050869
_version_ 1782262689085521920
author Buttigieg, Pier Luigi
Hankeln, Wolfgang
Kostadinov, Ivaylo
Kottmann, Renzo
Yilmaz, Pelin
Duhaime, Melissa Beth
Glöckner, Frank Oliver
collection PubMed
description BACKGROUND: The proportion of conserved DNA sequences with no clear function is steadily growing in bioinformatics databases. Studies of sequence and structural homology have indicated that many uncharacterized protein domain sequences are variants of functionally described domains. If these variants promote an organism's ecological fitness, they are likely to be conserved in the genome of its progeny and the population at large. The genetic composition of microbial communities in their native ecosystems is accessible through metagenomics. We hypothesize that the co-variation of protein domain sequences across metagenomes from similar ecosystems will provide insights into their potential roles and aid further investigation. METHODOLOGY/PRINCIPAL FINDINGS: We calculated the correlation of Pfam protein domain sequences across the Global Ocean Sampling metagenome collection, employing conservative detection and correlation thresholds to limit results to well-supported hits and associations. We then examined intercorrelations between domains of unknown function (DUFs) and domains involved in known metabolic pathways using network visualization and cluster-detection tools. We used a cautious “guilty-by-association” approach, referencing knowledge-level resources to identify and discuss associations that offer insight into DUF function. We observed numerous DUFs associated with photobiologically active domains and prevalent in the Cyanobacteria. Other clusters included DUFs associated with DNA maintenance and repair, inorganic nutrient metabolism, and sodium-translocating transport domains. We also observed a number of clusters reflecting known metabolic associations and cases that predicted functional reclassification of DUFs. CONCLUSION/SIGNIFICANCE: Critically examining domain covariation across metagenomic datasets can grant new perspectives on the roles and associations of DUFs in an ecological setting. Targeted attempts at DUF characterization in the laboratory or in silico may draw from these insights, and opportunities to discover new associations and corroborate existing ones will arise as more large-scale metagenomic datasets emerge.
format Online
Article
Text
id pubmed-3597751
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3597751 2013-03-20 Ecogenomic Perspectives on Domains of Unknown Function: Correlation-Based Exploration of Marine Metagenomes Buttigieg, Pier Luigi Hankeln, Wolfgang Kostadinov, Ivaylo Kottmann, Renzo Yilmaz, Pelin Duhaime, Melissa Beth Glöckner, Frank Oliver PLoS One Research Article Public Library of Science 2013-03-14 /pmc/articles/PMC3597751/ /pubmed/23516388 http://dx.doi.org/10.1371/journal.pone.0050869 Text en © 2013 Buttigieg et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title Ecogenomic Perspectives on Domains of Unknown Function: Correlation-Based Exploration of Marine Metagenomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3597751/
https://www.ncbi.nlm.nih.gov/pubmed/23516388
http://dx.doi.org/10.1371/journal.pone.0050869