Cargando…

Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes

Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or ‘bin’ sequences assembled from metagenomic reads into individual groups. Metagenomi...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Hsin-Hung, Liao, Yu-Chieh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4828714/
https://www.ncbi.nlm.nih.gov/pubmed/27067514
http://dx.doi.org/10.1038/srep24175
_version_ 1782426638640742400
author Lin, Hsin-Hung
Liao, Yu-Chieh
author_facet Lin, Hsin-Hung
Liao, Yu-Chieh
author_sort Lin, Hsin-Hung
collection PubMed
description Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or ‘bin’ sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an automated binning tool that combines genomic signatures, marker genes and optional contig coverages within one or multiple samples, in order to visualize the metagenomes and to identify the reconstructed genomic fragments. We demonstrate the superior performance of MyCC compared to other binning tools including CONCOCT, GroopM, MaxBin and MetaBAT on both synthetic and real human gut communities with a small sample size (one to 11 samples), as well as on a large metagenome dataset (over 250 samples). Moreover, we demonstrate the visualization of metagenomes in MyCC to aid in the reconstruction of genomes from distinct bins. MyCC is freely available at http://sourceforge.net/projects/sb2nhri/files/MyCC/.
format Online
Article
Text
id pubmed-4828714
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-48287142016-04-19 Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes Lin, Hsin-Hung Liao, Yu-Chieh Sci Rep Article Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or ‘bin’ sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an automated binning tool that combines genomic signatures, marker genes and optional contig coverages within one or multiple samples, in order to visualize the metagenomes and to identify the reconstructed genomic fragments. We demonstrate the superior performance of MyCC compared to other binning tools including CONCOCT, GroopM, MaxBin and MetaBAT on both synthetic and real human gut communities with a small sample size (one to 11 samples), as well as on a large metagenome dataset (over 250 samples). Moreover, we demonstrate the visualization of metagenomes in MyCC to aid in the reconstruction of genomes from distinct bins. MyCC is freely available at http://sourceforge.net/projects/sb2nhri/files/MyCC/. Nature Publishing Group 2016-04-12 /pmc/articles/PMC4828714/ /pubmed/27067514 http://dx.doi.org/10.1038/srep24175 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Lin, Hsin-Hung
Liao, Yu-Chieh
Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
title Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
title_full Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
title_fullStr Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
title_full_unstemmed Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
title_short Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
title_sort accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4828714/
https://www.ncbi.nlm.nih.gov/pubmed/27067514
http://dx.doi.org/10.1038/srep24175
work_keys_str_mv AT linhsinhung accuratebinningofmetagenomiccontigsviaautomatedclusteringsequencesusinginformationofgenomicsignaturesandmarkergenes
AT liaoyuchieh accuratebinningofmetagenomiccontigsviaautomatedclusteringsequencesusinginformationofgenomicsignaturesandmarkergenes