Cargando…

Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics

Metagenomic approaches became increasingly popular in the past decades due to decreasing costs of DNA sequencing and bioinformatics development. So far, however, the recovery of long genes coding for secondary metabolites still represents a big challenge. Often, the quality of metagenome assemblies...

Descripción completa

Detalles Bibliográficos
Autores principales: Cuadrat, Rafael R. C., Ionescu, Danny, Dávila, Alberto M. R., Grossart, Hans-Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5826242/
https://www.ncbi.nlm.nih.gov/pubmed/29515540
http://dx.doi.org/10.3389/fmicb.2018.00251
_version_ 1783302309146525696
author Cuadrat, Rafael R. C.
Ionescu, Danny
Dávila, Alberto M. R.
Grossart, Hans-Peter
author_facet Cuadrat, Rafael R. C.
Ionescu, Danny
Dávila, Alberto M. R.
Grossart, Hans-Peter
author_sort Cuadrat, Rafael R. C.
collection PubMed
description Metagenomic approaches became increasingly popular in the past decades due to decreasing costs of DNA sequencing and bioinformatics development. So far, however, the recovery of long genes coding for secondary metabolites still represents a big challenge. Often, the quality of metagenome assemblies is poor, especially in environments with a high microbial diversity where sequence coverage is low and complexity of natural communities high. Recently, new and improved algorithms for binning environmental reads and contigs have been developed to overcome such limitations. Some of these algorithms use a similarity detection approach to classify the obtained reads into taxonomical units and to assemble draft genomes. This approach, however, is quite limited since it can classify exclusively sequences similar to those available (and well classified) in the databases. In this work, we used draft genomes from Lake Stechlin, north-eastern Germany, recovered by MetaBat, an efficient binning tool that integrates empirical probabilistic distances of genome abundance, and tetranucleotide frequency for accurate metagenome binning. These genomes were screened for secondary metabolism genes, such as polyketide synthases (PKS) and non-ribosomal peptide synthases (NRPS), using the Anti-SMASH and NAPDOS workflows. With this approach we were able to identify 243 secondary metabolite clusters from 121 genomes recovered from our lake samples. A total of 18 NRPS, 19 PKS, and 3 hybrid PKS/NRPS clusters were found. In addition, it was possible to predict the partial structure of several secondary metabolite clusters allowing for taxonomical classifications and phylogenetic inferences. Our approach revealed a high potential to recover and study secondary metabolites genes from any aquatic ecosystem.
format Online
Article
Text
id pubmed-5826242
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-58262422018-03-07 Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics Cuadrat, Rafael R. C. Ionescu, Danny Dávila, Alberto M. R. Grossart, Hans-Peter Front Microbiol Microbiology Metagenomic approaches became increasingly popular in the past decades due to decreasing costs of DNA sequencing and bioinformatics development. So far, however, the recovery of long genes coding for secondary metabolites still represents a big challenge. Often, the quality of metagenome assemblies is poor, especially in environments with a high microbial diversity where sequence coverage is low and complexity of natural communities high. Recently, new and improved algorithms for binning environmental reads and contigs have been developed to overcome such limitations. Some of these algorithms use a similarity detection approach to classify the obtained reads into taxonomical units and to assemble draft genomes. This approach, however, is quite limited since it can classify exclusively sequences similar to those available (and well classified) in the databases. In this work, we used draft genomes from Lake Stechlin, north-eastern Germany, recovered by MetaBat, an efficient binning tool that integrates empirical probabilistic distances of genome abundance, and tetranucleotide frequency for accurate metagenome binning. These genomes were screened for secondary metabolism genes, such as polyketide synthases (PKS) and non-ribosomal peptide synthases (NRPS), using the Anti-SMASH and NAPDOS workflows. With this approach we were able to identify 243 secondary metabolite clusters from 121 genomes recovered from our lake samples. A total of 18 NRPS, 19 PKS, and 3 hybrid PKS/NRPS clusters were found. In addition, it was possible to predict the partial structure of several secondary metabolite clusters allowing for taxonomical classifications and phylogenetic inferences. Our approach revealed a high potential to recover and study secondary metabolites genes from any aquatic ecosystem. Frontiers Media S.A. 2018-02-20 /pmc/articles/PMC5826242/ /pubmed/29515540 http://dx.doi.org/10.3389/fmicb.2018.00251 Text en Copyright © 2018 Cuadrat, Ionescu, Dávila and Grossart. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Cuadrat, Rafael R. C.
Ionescu, Danny
Dávila, Alberto M. R.
Grossart, Hans-Peter
Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics
title Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics
title_full Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics
title_fullStr Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics
title_full_unstemmed Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics
title_short Recovering Genomics Clusters of Secondary Metabolites from Lakes Using Genome-Resolved Metagenomics
title_sort recovering genomics clusters of secondary metabolites from lakes using genome-resolved metagenomics
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5826242/
https://www.ncbi.nlm.nih.gov/pubmed/29515540
http://dx.doi.org/10.3389/fmicb.2018.00251
work_keys_str_mv AT cuadratrafaelrc recoveringgenomicsclustersofsecondarymetabolitesfromlakesusinggenomeresolvedmetagenomics
AT ionescudanny recoveringgenomicsclustersofsecondarymetabolitesfromlakesusinggenomeresolvedmetagenomics
AT davilaalbertomr recoveringgenomicsclustersofsecondarymetabolitesfromlakesusinggenomeresolvedmetagenomics
AT grossarthanspeter recoveringgenomicsclustersofsecondarymetabolitesfromlakesusinggenomeresolvedmetagenomics