Cargando…

Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps

Microbial communities consist of mixed populations of organisms, including unknown species in unknown abundances. These communities are often studied through metagenomic shotgun sequencing, but standard library construction methods remove long-range contiguity information; thus, shotgun sequencing a...

Descripción completa

Detalles Bibliográficos
Autores principales: Burton, Joshua N., Liachko, Ivan, Dunham, Maitreya J., Shendure, Jay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4455782/
https://www.ncbi.nlm.nih.gov/pubmed/24855317
http://dx.doi.org/10.1534/g3.114.011825
_version_ 1782374772474118144
author Burton, Joshua N.
Liachko, Ivan
Dunham, Maitreya J.
Shendure, Jay
author_facet Burton, Joshua N.
Liachko, Ivan
Dunham, Maitreya J.
Shendure, Jay
author_sort Burton, Joshua N.
collection PubMed
description Microbial communities consist of mixed populations of organisms, including unknown species in unknown abundances. These communities are often studied through metagenomic shotgun sequencing, but standard library construction methods remove long-range contiguity information; thus, shotgun sequencing and de novo assembly of a metagenome typically yield a collection of contigs that cannot readily be grouped by species. Methods for generating chromatin-level contact probability maps, e.g., as generated by the Hi-C method, provide a signal of contiguity that is completely intracellular and contains both intrachromosomal and interchromosomal information. Here, we demonstrate how this signal can be exploited to reconstruct the individual genomes of microbial species present within a mixed sample. We apply this approach to two synthetic metagenome samples, successfully clustering the genome content of fungal, bacterial, and archaeal species with more than 99% agreement with published reference genomes. We also show that the Hi-C signal can secondarily be used to create scaffolded genome assemblies of individual eukaryotic species present within the microbial community, with higher levels of contiguity than some of the species’ published reference genomes.
format Online
Article
Text
id pubmed-4455782
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-44557822015-06-08 Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps Burton, Joshua N. Liachko, Ivan Dunham, Maitreya J. Shendure, Jay G3 (Bethesda) Investigations Microbial communities consist of mixed populations of organisms, including unknown species in unknown abundances. These communities are often studied through metagenomic shotgun sequencing, but standard library construction methods remove long-range contiguity information; thus, shotgun sequencing and de novo assembly of a metagenome typically yield a collection of contigs that cannot readily be grouped by species. Methods for generating chromatin-level contact probability maps, e.g., as generated by the Hi-C method, provide a signal of contiguity that is completely intracellular and contains both intrachromosomal and interchromosomal information. Here, we demonstrate how this signal can be exploited to reconstruct the individual genomes of microbial species present within a mixed sample. We apply this approach to two synthetic metagenome samples, successfully clustering the genome content of fungal, bacterial, and archaeal species with more than 99% agreement with published reference genomes. We also show that the Hi-C signal can secondarily be used to create scaffolded genome assemblies of individual eukaryotic species present within the microbial community, with higher levels of contiguity than some of the species’ published reference genomes. Genetics Society of America 2014-05-22 /pmc/articles/PMC4455782/ /pubmed/24855317 http://dx.doi.org/10.1534/g3.114.011825 Text en Copyright © 2014 Burton et al http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution Unported License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Burton, Joshua N.
Liachko, Ivan
Dunham, Maitreya J.
Shendure, Jay
Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps
title Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps
title_full Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps
title_fullStr Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps
title_full_unstemmed Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps
title_short Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps
title_sort species-level deconvolution of metagenome assemblies with hi-c–based contact probability maps
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4455782/
https://www.ncbi.nlm.nih.gov/pubmed/24855317
http://dx.doi.org/10.1534/g3.114.011825
work_keys_str_mv AT burtonjoshuan speciesleveldeconvolutionofmetagenomeassemblieswithhicbasedcontactprobabilitymaps
AT liachkoivan speciesleveldeconvolutionofmetagenomeassemblieswithhicbasedcontactprobabilitymaps
AT dunhammaitreyaj speciesleveldeconvolutionofmetagenomeassemblieswithhicbasedcontactprobabilitymaps
AT shendurejay speciesleveldeconvolutionofmetagenomeassemblieswithhicbasedcontactprobabilitymaps