Cargando…
Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes
Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or ‘bin’ sequences assembled from metagenomic reads into individual groups. Metagenomi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4828714/ https://www.ncbi.nlm.nih.gov/pubmed/27067514 http://dx.doi.org/10.1038/srep24175 |
_version_ | 1782426638640742400 |
---|---|
author | Lin, Hsin-Hung Liao, Yu-Chieh |
author_facet | Lin, Hsin-Hung Liao, Yu-Chieh |
author_sort | Lin, Hsin-Hung |
collection | PubMed |
description | Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or ‘bin’ sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an automated binning tool that combines genomic signatures, marker genes and optional contig coverages within one or multiple samples, in order to visualize the metagenomes and to identify the reconstructed genomic fragments. We demonstrate the superior performance of MyCC compared to other binning tools including CONCOCT, GroopM, MaxBin and MetaBAT on both synthetic and real human gut communities with a small sample size (one to 11 samples), as well as on a large metagenome dataset (over 250 samples). Moreover, we demonstrate the visualization of metagenomes in MyCC to aid in the reconstruction of genomes from distinct bins. MyCC is freely available at http://sourceforge.net/projects/sb2nhri/files/MyCC/. |
format | Online Article Text |
id | pubmed-4828714 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-48287142016-04-19 Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes Lin, Hsin-Hung Liao, Yu-Chieh Sci Rep Article Metagenomics, the application of shotgun sequencing, facilitates the reconstruction of the genomes of individual species from natural environments. A major challenge in the genome recovery domain is to agglomerate or ‘bin’ sequences assembled from metagenomic reads into individual groups. Metagenomic binning without consideration of reference sequences enables the comprehensive discovery of new microbial organisms and aids in the microbial genome reconstruction process. Here we present MyCC, an automated binning tool that combines genomic signatures, marker genes and optional contig coverages within one or multiple samples, in order to visualize the metagenomes and to identify the reconstructed genomic fragments. We demonstrate the superior performance of MyCC compared to other binning tools including CONCOCT, GroopM, MaxBin and MetaBAT on both synthetic and real human gut communities with a small sample size (one to 11 samples), as well as on a large metagenome dataset (over 250 samples). Moreover, we demonstrate the visualization of metagenomes in MyCC to aid in the reconstruction of genomes from distinct bins. MyCC is freely available at http://sourceforge.net/projects/sb2nhri/files/MyCC/. Nature Publishing Group 2016-04-12 /pmc/articles/PMC4828714/ /pubmed/27067514 http://dx.doi.org/10.1038/srep24175 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Lin, Hsin-Hung Liao, Yu-Chieh Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes |
title | Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes |
title_full | Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes |
title_fullStr | Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes |
title_full_unstemmed | Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes |
title_short | Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes |
title_sort | accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4828714/ https://www.ncbi.nlm.nih.gov/pubmed/27067514 http://dx.doi.org/10.1038/srep24175 |
work_keys_str_mv | AT linhsinhung accuratebinningofmetagenomiccontigsviaautomatedclusteringsequencesusinginformationofgenomicsignaturesandmarkergenes AT liaoyuchieh accuratebinningofmetagenomiccontigsviaautomatedclusteringsequencesusinginformationofgenomicsignaturesandmarkergenes |