Cargando…

Reads Binning Improves Alignment-Free Metagenome Comparison

Comparing metagenomic samples is a critical step in understanding the relationships among microbial communities. Recently, next-generation sequencing (NGS) technologies have produced a massive amount of short reads data for microbial communities from different environments. The assembly of these sho...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Kai, Ren, Jie, Sun, Fengzhu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6881972/
https://www.ncbi.nlm.nih.gov/pubmed/31824565
http://dx.doi.org/10.3389/fgene.2019.01156
_version_ 1783474052702142464
author Song, Kai
Ren, Jie
Sun, Fengzhu
author_facet Song, Kai
Ren, Jie
Sun, Fengzhu
author_sort Song, Kai
collection PubMed
description Comparing metagenomic samples is a critical step in understanding the relationships among microbial communities. Recently, next-generation sequencing (NGS) technologies have produced a massive amount of short reads data for microbial communities from different environments. The assembly of these short reads can, however, be time-consuming and challenging. In addition, alignment-based methods for metagenome comparison are limited by incomplete genome and/or pathway databases. In contrast, alignment-free methods for metagenome comparison do not depend on the completeness of genome or pathway databases. Still, the existing alignment-free methods, [Formula: see text] and [Formula: see text] , which model k-tuple patterns using only one Markov chain for each sample, neglect the heterogeneity within metagenomic data wherein potentially thousands of types of microorganisms are sequenced. To address this imperfection in [Formula: see text] and [Formula: see text] , we organized NGS sequences into different reads bins and constructed several corresponding Markov models. Next, we modified the definition of our previous alignment-free methods, [Formula: see text] and [Formula: see text] , to make them more compatible with a scheme of analysis which uses the proposed reads bins. We then used two simulated and three real metagenomic datasets to test the effect of the k-tuple size and Markov orders of background sequences on the performance of these de novo alignment-free methods. For dependable comparison of metagenomic samples, our newly developed alignment-free methods with reads binning outperformed alignment-free methods without reads binning in detecting the relationship among microbial communities, including whether they form groups or change according to some environmental gradients.
format Online
Article
Text
id pubmed-6881972
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-68819722019-12-10 Reads Binning Improves Alignment-Free Metagenome Comparison Song, Kai Ren, Jie Sun, Fengzhu Front Genet Genetics Comparing metagenomic samples is a critical step in understanding the relationships among microbial communities. Recently, next-generation sequencing (NGS) technologies have produced a massive amount of short reads data for microbial communities from different environments. The assembly of these short reads can, however, be time-consuming and challenging. In addition, alignment-based methods for metagenome comparison are limited by incomplete genome and/or pathway databases. In contrast, alignment-free methods for metagenome comparison do not depend on the completeness of genome or pathway databases. Still, the existing alignment-free methods, [Formula: see text] and [Formula: see text] , which model k-tuple patterns using only one Markov chain for each sample, neglect the heterogeneity within metagenomic data wherein potentially thousands of types of microorganisms are sequenced. To address this imperfection in [Formula: see text] and [Formula: see text] , we organized NGS sequences into different reads bins and constructed several corresponding Markov models. Next, we modified the definition of our previous alignment-free methods, [Formula: see text] and [Formula: see text] , to make them more compatible with a scheme of analysis which uses the proposed reads bins. We then used two simulated and three real metagenomic datasets to test the effect of the k-tuple size and Markov orders of background sequences on the performance of these de novo alignment-free methods. For dependable comparison of metagenomic samples, our newly developed alignment-free methods with reads binning outperformed alignment-free methods without reads binning in detecting the relationship among microbial communities, including whether they form groups or change according to some environmental gradients. Frontiers Media S.A. 2019-11-21 /pmc/articles/PMC6881972/ /pubmed/31824565 http://dx.doi.org/10.3389/fgene.2019.01156 Text en Copyright © 2019 Song, Ren and Sun http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Song, Kai
Ren, Jie
Sun, Fengzhu
Reads Binning Improves Alignment-Free Metagenome Comparison
title Reads Binning Improves Alignment-Free Metagenome Comparison
title_full Reads Binning Improves Alignment-Free Metagenome Comparison
title_fullStr Reads Binning Improves Alignment-Free Metagenome Comparison
title_full_unstemmed Reads Binning Improves Alignment-Free Metagenome Comparison
title_short Reads Binning Improves Alignment-Free Metagenome Comparison
title_sort reads binning improves alignment-free metagenome comparison
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6881972/
https://www.ncbi.nlm.nih.gov/pubmed/31824565
http://dx.doi.org/10.3389/fgene.2019.01156
work_keys_str_mv AT songkai readsbinningimprovesalignmentfreemetagenomecomparison
AT renjie readsbinningimprovesalignmentfreemetagenomecomparison
AT sunfengzhu readsbinningimprovesalignmentfreemetagenomecomparison