Cargando…

Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene

The diversity of microbiota is best explored by understanding the phylogenetic structure of the microbial communities. Traditionally, sequence alignment has been used for phylogenetic inference. However, alignment-based approaches come with significant challenges and limitations when massive amounts...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yifei, Alekseyenko, Alexander V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5685621/
https://www.ncbi.nlm.nih.gov/pubmed/29136663
http://dx.doi.org/10.1371/journal.pone.0187940
_version_ 1783278655723536384
author Zhang, Yifei
Alekseyenko, Alexander V.
author_facet Zhang, Yifei
Alekseyenko, Alexander V.
author_sort Zhang, Yifei
collection PubMed
description The diversity of microbiota is best explored by understanding the phylogenetic structure of the microbial communities. Traditionally, sequence alignment has been used for phylogenetic inference. However, alignment-based approaches come with significant challenges and limitations when massive amounts of data are analyzed. In the recent decade, alignment-free approaches have enabled genome-scale phylogenetic inference. Here we evaluate three alignment-free methods: ACS, CVTree, and Kr for phylogenetic inference with 16s rRNA gene data. We use a taxonomic gold standard to compare the accuracy of alignment-free phylogenetic inference with that of common microbiome-wide phylogenetic inference pipelines based on PyNAST and MUSCLE alignments with FastTree and RAxML. We re-simulate fecal communities from Human Microbiome Project data to evaluate the performance of the methods on datasets with properties of real data. Our comparisons show that alignment-free methods are not inferior to alignment-based methods in giving accurate and robust phylogenic trees. Moreover, consensus ensembles of alignment-free phylogenies are superior to those built from alignment-based methods in their ability to highlight community differences in low power settings. In addition, the overall running times of alignment-based and alignment-free phylogenetic inference are comparable. Taken together our empirical results suggest that alignment-free methods provide a viable approach for microbiome-wide phylogenetic inference.
format Online
Article
Text
id pubmed-5685621
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56856212017-11-30 Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene Zhang, Yifei Alekseyenko, Alexander V. PLoS One Research Article The diversity of microbiota is best explored by understanding the phylogenetic structure of the microbial communities. Traditionally, sequence alignment has been used for phylogenetic inference. However, alignment-based approaches come with significant challenges and limitations when massive amounts of data are analyzed. In the recent decade, alignment-free approaches have enabled genome-scale phylogenetic inference. Here we evaluate three alignment-free methods: ACS, CVTree, and Kr for phylogenetic inference with 16s rRNA gene data. We use a taxonomic gold standard to compare the accuracy of alignment-free phylogenetic inference with that of common microbiome-wide phylogenetic inference pipelines based on PyNAST and MUSCLE alignments with FastTree and RAxML. We re-simulate fecal communities from Human Microbiome Project data to evaluate the performance of the methods on datasets with properties of real data. Our comparisons show that alignment-free methods are not inferior to alignment-based methods in giving accurate and robust phylogenic trees. Moreover, consensus ensembles of alignment-free phylogenies are superior to those built from alignment-based methods in their ability to highlight community differences in low power settings. In addition, the overall running times of alignment-based and alignment-free phylogenetic inference are comparable. Taken together our empirical results suggest that alignment-free methods provide a viable approach for microbiome-wide phylogenetic inference. Public Library of Science 2017-11-14 /pmc/articles/PMC5685621/ /pubmed/29136663 http://dx.doi.org/10.1371/journal.pone.0187940 Text en © 2017 Zhang, Alekseyenko http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhang, Yifei
Alekseyenko, Alexander V.
Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene
title Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene
title_full Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene
title_fullStr Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene
title_full_unstemmed Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene
title_short Phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rRNA gene
title_sort phylogenic inference using alignment-free methods for applications in microbial community surveys using 16s rrna gene
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5685621/
https://www.ncbi.nlm.nih.gov/pubmed/29136663
http://dx.doi.org/10.1371/journal.pone.0187940
work_keys_str_mv AT zhangyifei phylogenicinferenceusingalignmentfreemethodsforapplicationsinmicrobialcommunitysurveysusing16srrnagene
AT alekseyenkoalexanderv phylogenicinferenceusingalignmentfreemethodsforapplicationsinmicrobialcommunitysurveysusing16srrnagene