Cargando…
MetaGen: reference-free learning with multiple metagenomic samples
A major goal of metagenomics is to identify and study the entire collection of microbial species in a set of targeted samples. We describe a statistical metagenomic algorithm that simultaneously identifies microbial species and estimates their abundances without using reference genomes. As a trade-o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627425/ https://www.ncbi.nlm.nih.gov/pubmed/28974263 http://dx.doi.org/10.1186/s13059-017-1323-y |
_version_ | 1783268713395388416 |
---|---|
author | Xing, Xin Liu, Jun S. Zhong, Wenxuan |
author_facet | Xing, Xin Liu, Jun S. Zhong, Wenxuan |
author_sort | Xing, Xin |
collection | PubMed |
description | A major goal of metagenomics is to identify and study the entire collection of microbial species in a set of targeted samples. We describe a statistical metagenomic algorithm that simultaneously identifies microbial species and estimates their abundances without using reference genomes. As a trade-off, we require multiple metagenomic samples, usually ≥10 samples, to get highly accurate binning results. Compared to reference-free methods based primarily on k-mer distributions or coverage information, the proposed approach achieves a higher species binning accuracy and is particularly powerful when sequencing coverage is low. We demonstrated the performance of this new method through both simulation and real metagenomic studies. The MetaGen software is available at https://github.com/BioAlgs/MetaGen. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1323-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5627425 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-56274252017-10-12 MetaGen: reference-free learning with multiple metagenomic samples Xing, Xin Liu, Jun S. Zhong, Wenxuan Genome Biol Method A major goal of metagenomics is to identify and study the entire collection of microbial species in a set of targeted samples. We describe a statistical metagenomic algorithm that simultaneously identifies microbial species and estimates their abundances without using reference genomes. As a trade-off, we require multiple metagenomic samples, usually ≥10 samples, to get highly accurate binning results. Compared to reference-free methods based primarily on k-mer distributions or coverage information, the proposed approach achieves a higher species binning accuracy and is particularly powerful when sequencing coverage is low. We demonstrated the performance of this new method through both simulation and real metagenomic studies. The MetaGen software is available at https://github.com/BioAlgs/MetaGen. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1323-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-10-03 /pmc/articles/PMC5627425/ /pubmed/28974263 http://dx.doi.org/10.1186/s13059-017-1323-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Method Xing, Xin Liu, Jun S. Zhong, Wenxuan MetaGen: reference-free learning with multiple metagenomic samples |
title | MetaGen: reference-free learning with multiple metagenomic samples |
title_full | MetaGen: reference-free learning with multiple metagenomic samples |
title_fullStr | MetaGen: reference-free learning with multiple metagenomic samples |
title_full_unstemmed | MetaGen: reference-free learning with multiple metagenomic samples |
title_short | MetaGen: reference-free learning with multiple metagenomic samples |
title_sort | metagen: reference-free learning with multiple metagenomic samples |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627425/ https://www.ncbi.nlm.nih.gov/pubmed/28974263 http://dx.doi.org/10.1186/s13059-017-1323-y |
work_keys_str_mv | AT xingxin metagenreferencefreelearningwithmultiplemetagenomicsamples AT liujuns metagenreferencefreelearningwithmultiplemetagenomicsamples AT zhongwenxuan metagenreferencefreelearningwithmultiplemetagenomicsamples |