Cargando…
MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries
Characterizing the complete genomic structure of complex microbial communities would represent a key step toward the understanding of their diversity, dynamics, and evolution. Current metagenomics approaches aiming at this goal are typically done by analyzing millions of short DNA sequences directly...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710406/ https://www.ncbi.nlm.nih.gov/pubmed/31481973 http://dx.doi.org/10.3389/fgene.2019.00753 |
_version_ | 1783446344991506432 |
---|---|
author | Baudry, Lyam Foutel-Rodier, Théo Thierry, Agnès Koszul, Romain Marbouty, Martial |
author_facet | Baudry, Lyam Foutel-Rodier, Théo Thierry, Agnès Koszul, Romain Marbouty, Martial |
author_sort | Baudry, Lyam |
collection | PubMed |
description | Characterizing the complete genomic structure of complex microbial communities would represent a key step toward the understanding of their diversity, dynamics, and evolution. Current metagenomics approaches aiming at this goal are typically done by analyzing millions of short DNA sequences directly extracted from the environment. New experimental and computational approaches are constantly sought for to improve the analysis and interpretation of such data. We developed MetaTOR, an open-source computational solution that bins DNA contigs into individual genomes according to their 3D contact frequencies. Those contacts are quantified by chromosome conformation capture experiments (3C, Hi-C), also known as proximity-ligation approaches, applied to metagenomics samples (meta3C). MetaTOR was applied on 20 meta3C libraries of mice gut microbiota. We quantified the program ability to recover high-quality metagenome-assembled genomes (MAGs) from metagenomic assemblies generated directly from the meta3C libraries. Whereas nine high-quality MAGs are identified in the 148-Mb assembly generated using a single meta3C library, MetaTOR identifies 82 high-quality MAGs in the 763-Mb assembly generated from the merged 20 meta3C libraries, corresponding to nearly a third of the total assembly. Compared to the hybrid binning softwares MetaBAT or CONCOCT, MetaTOR recovered three times more high-quality MAGs. These results underline the potential of 3C-/Hi-C-based approaches in metagenomic projects. |
format | Online Article Text |
id | pubmed-6710406 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-67104062019-09-03 MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries Baudry, Lyam Foutel-Rodier, Théo Thierry, Agnès Koszul, Romain Marbouty, Martial Front Genet Genetics Characterizing the complete genomic structure of complex microbial communities would represent a key step toward the understanding of their diversity, dynamics, and evolution. Current metagenomics approaches aiming at this goal are typically done by analyzing millions of short DNA sequences directly extracted from the environment. New experimental and computational approaches are constantly sought for to improve the analysis and interpretation of such data. We developed MetaTOR, an open-source computational solution that bins DNA contigs into individual genomes according to their 3D contact frequencies. Those contacts are quantified by chromosome conformation capture experiments (3C, Hi-C), also known as proximity-ligation approaches, applied to metagenomics samples (meta3C). MetaTOR was applied on 20 meta3C libraries of mice gut microbiota. We quantified the program ability to recover high-quality metagenome-assembled genomes (MAGs) from metagenomic assemblies generated directly from the meta3C libraries. Whereas nine high-quality MAGs are identified in the 148-Mb assembly generated using a single meta3C library, MetaTOR identifies 82 high-quality MAGs in the 763-Mb assembly generated from the merged 20 meta3C libraries, corresponding to nearly a third of the total assembly. Compared to the hybrid binning softwares MetaBAT or CONCOCT, MetaTOR recovered three times more high-quality MAGs. These results underline the potential of 3C-/Hi-C-based approaches in metagenomic projects. Frontiers Media S.A. 2019-08-20 /pmc/articles/PMC6710406/ /pubmed/31481973 http://dx.doi.org/10.3389/fgene.2019.00753 Text en Copyright © 2019 Baudry, Foutel-Rodier, Thierry, Koszul and Marbouty http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Baudry, Lyam Foutel-Rodier, Théo Thierry, Agnès Koszul, Romain Marbouty, Martial MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries |
title | MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries |
title_full | MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries |
title_fullStr | MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries |
title_full_unstemmed | MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries |
title_short | MetaTOR: A Computational Pipeline to Recover High-Quality Metagenomic Bins From Mammalian Gut Proximity-Ligation (meta3C) Libraries |
title_sort | metator: a computational pipeline to recover high-quality metagenomic bins from mammalian gut proximity-ligation (meta3c) libraries |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6710406/ https://www.ncbi.nlm.nih.gov/pubmed/31481973 http://dx.doi.org/10.3389/fgene.2019.00753 |
work_keys_str_mv | AT baudrylyam metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries AT foutelrodiertheo metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries AT thierryagnes metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries AT koszulromain metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries AT marboutymartial metatoracomputationalpipelinetorecoverhighqualitymetagenomicbinsfrommammaliangutproximityligationmeta3clibraries |