Cargando…

The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures

So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to...

Descripción completa

Detalles Bibliográficos
Autores principales: Strous, Marc, Kraft, Beate, Bisdorf, Regina, Tegetmeyer, Halina E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3514610/
https://www.ncbi.nlm.nih.gov/pubmed/23227024
http://dx.doi.org/10.3389/fmicb.2012.00410
_version_ 1782252053921267712
author Strous, Marc
Kraft, Beate
Bisdorf, Regina
Tegetmeyer, Halina E.
author_facet Strous, Marc
Kraft, Beate
Bisdorf, Regina
Tegetmeyer, Halina E.
author_sort Strous, Marc
collection PubMed
description So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to characterize such mixed cultures. In the analysis of metagenomic data, well established algorithms exist for the assembly of short reads into contigs and for the annotation of predicted genes. However, the binning of the assembled contigs or unassembled reads is still a major bottleneck and required to understand how the overall metabolism is partitioned over different community members. Binning consists of the clustering of contigs or reads that apparently originate from the same source population. In the present study eight metagenomic samples from the same habitat, a laboratory enrichment culture, were sequenced. Each sample contained 13–23 Mb of assembled contigs and up to eight abundant populations. Binning was attempted with existing methods but they were found to produce poor results, were slow, dependent on non-standard platforms or produced errors. A new binning procedure was developed based on multivariate statistics of tetranucleotide frequencies combined with the use of interpolated Markov models. Its performance was evaluated by comparison of the results between samples with BLAST and in comparison to existing algorithms for four publicly available metagenomes and one previously published artificial metagenome. The accuracy of the new approach was comparable or higher than existing methods. Further, it was up to a 100 times faster. It was implemented in Java Swing as a complete open source graphical binning application available for download and further development (http://sourceforge.net/projects/metawatt).
format Online
Article
Text
id pubmed-3514610
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-35146102012-12-07 The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures Strous, Marc Kraft, Beate Bisdorf, Regina Tegetmeyer, Halina E. Front Microbiol Microbiology So far, microbial physiology has dedicated itself mainly to pure cultures. In nature, cross feeding and competition are important aspects of microbial physiology and these can only be addressed by studying complete communities such as enrichment cultures. Metagenomic sequencing is a powerful tool to characterize such mixed cultures. In the analysis of metagenomic data, well established algorithms exist for the assembly of short reads into contigs and for the annotation of predicted genes. However, the binning of the assembled contigs or unassembled reads is still a major bottleneck and required to understand how the overall metabolism is partitioned over different community members. Binning consists of the clustering of contigs or reads that apparently originate from the same source population. In the present study eight metagenomic samples from the same habitat, a laboratory enrichment culture, were sequenced. Each sample contained 13–23 Mb of assembled contigs and up to eight abundant populations. Binning was attempted with existing methods but they were found to produce poor results, were slow, dependent on non-standard platforms or produced errors. A new binning procedure was developed based on multivariate statistics of tetranucleotide frequencies combined with the use of interpolated Markov models. Its performance was evaluated by comparison of the results between samples with BLAST and in comparison to existing algorithms for four publicly available metagenomes and one previously published artificial metagenome. The accuracy of the new approach was comparable or higher than existing methods. Further, it was up to a 100 times faster. It was implemented in Java Swing as a complete open source graphical binning application available for download and further development (http://sourceforge.net/projects/metawatt). Frontiers Media S.A. 2012-12-05 /pmc/articles/PMC3514610/ /pubmed/23227024 http://dx.doi.org/10.3389/fmicb.2012.00410 Text en Copyright © 2012 Strous, Kraft, Bisdorf and Tegetmeyer. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle Microbiology
Strous, Marc
Kraft, Beate
Bisdorf, Regina
Tegetmeyer, Halina E.
The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures
title The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures
title_full The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures
title_fullStr The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures
title_full_unstemmed The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures
title_short The Binning of Metagenomic Contigs for Microbial Physiology of Mixed Cultures
title_sort binning of metagenomic contigs for microbial physiology of mixed cultures
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3514610/
https://www.ncbi.nlm.nih.gov/pubmed/23227024
http://dx.doi.org/10.3389/fmicb.2012.00410
work_keys_str_mv AT strousmarc thebinningofmetagenomiccontigsformicrobialphysiologyofmixedcultures
AT kraftbeate thebinningofmetagenomiccontigsformicrobialphysiologyofmixedcultures
AT bisdorfregina thebinningofmetagenomiccontigsformicrobialphysiologyofmixedcultures
AT tegetmeyerhalinae thebinningofmetagenomiccontigsformicrobialphysiologyofmixedcultures
AT strousmarc binningofmetagenomiccontigsformicrobialphysiologyofmixedcultures
AT kraftbeate binningofmetagenomiccontigsformicrobialphysiologyofmixedcultures
AT bisdorfregina binningofmetagenomiccontigsformicrobialphysiologyofmixedcultures
AT tegetmeyerhalinae binningofmetagenomiccontigsformicrobialphysiologyofmixedcultures