Cargando…

MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries

Characterizing the complete genomic structure of complex microbial communities would represent a key step toward the understanding of their diversity, dynamics, and evolution. Current metagenomics approaches aiming at this goal are typically done by analyzing millions of short DNA sequences directly...

Descripción completa

Detalles Bibliográficos
Autores principales: Baudry, Lyam, Foutel-Rodier, Théo, Thierry, Agnès, Koszul, Romain, Marbouty, Martial
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710406/
https://www.ncbi.nlm.nih.gov/pubmed/31481973
http://dx.doi.org/10.3389/fgene.2019.00753
_version_ 1783446344991506432
author Baudry, Lyam
Foutel-Rodier, Théo
Thierry, Agnès
Koszul, Romain
Marbouty, Martial
author_facet Baudry, Lyam
Foutel-Rodier, Théo
Thierry, Agnès
Koszul, Romain
Marbouty, Martial
author_sort Baudry, Lyam
collection PubMed
description Characterizing the complete genomic structure of complex microbial communities would represent a key step toward the understanding of their diversity, dynamics, and evolution. Current metagenomics approaches aiming at this goal are typically done by analyzing millions of short DNA sequences directly extracted from the environment. New experimental and computational approaches are constantly sought for to improve the analysis and interpretation of such data. We developed MetaTOR, an open-source computational solution that bins DNA contigs into individual genomes according to their 3D contact frequencies. Those contacts are quantified by chromosome conformation capture experiments (3C, Hi-C), also known as proximity-ligation approaches, applied to metagenomics samples (meta3C). MetaTOR was applied on 20 meta3C libraries of mice gut microbiota. We quantified the program ability to recover high-quality metagenome-assembled genomes (MAGs) from metagenomic assemblies generated directly from the meta3C libraries. Whereas nine high-quality MAGs are identified in the 148-Mb assembly generated using a single meta3C library, MetaTOR identifies 82 high-quality MAGs in the 763-Mb assembly generated from the merged 20 meta3C libraries, corresponding to nearly a third of the total assembly. Compared to the hybrid binning softwares MetaBAT or CONCOCT, MetaTOR recovered three times more high-quality MAGs. These results underline the potential of 3C-/Hi-C-based approaches in metagenomic projects.
format Online
Article
Text
id pubmed-6710406
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67104062019-09-03 MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries Baudry, Lyam Foutel-Rodier, Théo Thierry, Agnès Koszul, Romain Marbouty, Martial Front Genet Genetics Characterizing the complete genomic structure of complex microbial communities would represent a key step toward the understanding of their diversity, dynamics, and evolution. Current metagenomics approaches aiming at this goal are typically done by analyzing millions of short DNA sequences directly extracted from the environment. New experimental and computational approaches are constantly sought for to improve the analysis and interpretation of such data. We developed MetaTOR, an open-source computational solution that bins DNA contigs into individual genomes according to their 3D contact frequencies. Those contacts are quantified by chromosome conformation capture experiments (3C, Hi-C), also known as proximity-ligation approaches, applied to metagenomics samples (meta3C). MetaTOR was applied on 20 meta3C libraries of mice gut microbiota. We quantified the program ability to recover high-quality metagenome-assembled genomes (MAGs) from metagenomic assemblies generated directly from the meta3C libraries. Whereas nine high-quality MAGs are identified in the 148-Mb assembly generated using a single meta3C library, MetaTOR identifies 82 high-quality MAGs in the 763-Mb assembly generated from the merged 20 meta3C libraries, corresponding to nearly a third of the total assembly. Compared to the hybrid binning softwares MetaBAT or CONCOCT, MetaTOR recovered three times more high-quality MAGs. These results underline the potential of 3C-/Hi-C-based approaches in metagenomic projects. Frontiers Media S.A. 2019-08-20 /pmc/articles/PMC6710406/ /pubmed/31481973 http://dx.doi.org/10.3389/fgene.2019.00753 Text en Copyright © 2019 Baudry, Foutel-Rodier, Thierry, Koszul and Marbouty http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Baudry, Lyam
Foutel-Rodier, Théo
Thierry, Agnès
Koszul, Romain
Marbouty, Martial
MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries
title MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries
title_full MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries
title_fullStr MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries
title_full_unstemmed MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries
title_short MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries
title_sort metator: a computational pipeline to recover high-quality metagenomic bins from mammalian gut proximity-ligation (meta3c) libraries
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710406/
https://www.ncbi.nlm.nih.gov/pubmed/31481973
http://dx.doi.org/10.3389/fgene.2019.00753
work_keys_str_mv AT baudrylyam metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries
AT foutelrodiertheo metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries
AT thierryagnes metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries
AT koszulromain metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries
AT marboutymartial metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries