Cargando…

Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads

Studies of microbial evolutionary dynamics are being transformed by the availability of affordable high-throughput sequencing technologies, which allow whole-genome sequencing of hundreds of related taxa in a single study. Reconstructing a phylogenetic tree of these taxa is generally a crucial step...

Descripción completa

Detalles Bibliográficos
Autores principales: Bertels, Frederic, Silander, Olin K., Pachkov, Mikhail, Rainey, Paul B., van Nimwegen, Erik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3995342/
https://www.ncbi.nlm.nih.gov/pubmed/24600054
http://dx.doi.org/10.1093/molbev/msu088
_version_ 1782312867429613568
author Bertels, Frederic
Silander, Olin K.
Pachkov, Mikhail
Rainey, Paul B.
van Nimwegen, Erik
author_facet Bertels, Frederic
Silander, Olin K.
Pachkov, Mikhail
Rainey, Paul B.
van Nimwegen, Erik
author_sort Bertels, Frederic
collection PubMed
description Studies of microbial evolutionary dynamics are being transformed by the availability of affordable high-throughput sequencing technologies, which allow whole-genome sequencing of hundreds of related taxa in a single study. Reconstructing a phylogenetic tree of these taxa is generally a crucial step in any evolutionary analysis. Instead of constructing genome assemblies for all taxa, annotating these assemblies, and aligning orthologous genes, many recent studies 1) directly map raw sequencing reads to a single reference sequence, 2) extract single nucleotide polymorphisms (SNPs), and 3) infer the phylogenetic tree using maximum likelihood methods from the aligned SNP positions. However, here we show that, when using such methods to reconstruct phylogenies from sets of simulated sequences, both the exclusion of nonpolymorphic positions and the alignment to a single reference genome, introduce systematic biases and errors in phylogeny reconstruction. To address these problems, we developed a new method that combines alignments from mappings to multiple reference sequences and show that this successfully removes biases from the reconstructed phylogenies. We implemented this method as a web server named REALPHY (Reference sequence Alignment-based Phylogeny builder), which fully automates phylogenetic reconstruction from raw sequencing reads.
format Online
Article
Text
id pubmed-3995342
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39953422014-06-18 Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads Bertels, Frederic Silander, Olin K. Pachkov, Mikhail Rainey, Paul B. van Nimwegen, Erik Mol Biol Evol Fast Tracks Studies of microbial evolutionary dynamics are being transformed by the availability of affordable high-throughput sequencing technologies, which allow whole-genome sequencing of hundreds of related taxa in a single study. Reconstructing a phylogenetic tree of these taxa is generally a crucial step in any evolutionary analysis. Instead of constructing genome assemblies for all taxa, annotating these assemblies, and aligning orthologous genes, many recent studies 1) directly map raw sequencing reads to a single reference sequence, 2) extract single nucleotide polymorphisms (SNPs), and 3) infer the phylogenetic tree using maximum likelihood methods from the aligned SNP positions. However, here we show that, when using such methods to reconstruct phylogenies from sets of simulated sequences, both the exclusion of nonpolymorphic positions and the alignment to a single reference genome, introduce systematic biases and errors in phylogeny reconstruction. To address these problems, we developed a new method that combines alignments from mappings to multiple reference sequences and show that this successfully removes biases from the reconstructed phylogenies. We implemented this method as a web server named REALPHY (Reference sequence Alignment-based Phylogeny builder), which fully automates phylogenetic reconstruction from raw sequencing reads. Oxford University Press 2014-05 2014-03-05 /pmc/articles/PMC3995342/ /pubmed/24600054 http://dx.doi.org/10.1093/molbev/msu088 Text en © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Fast Tracks
Bertels, Frederic
Silander, Olin K.
Pachkov, Mikhail
Rainey, Paul B.
van Nimwegen, Erik
Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
title Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
title_full Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
title_fullStr Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
title_full_unstemmed Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
title_short Automated Reconstruction of Whole-Genome Phylogenies from Short-Sequence Reads
title_sort automated reconstruction of whole-genome phylogenies from short-sequence reads
topic Fast Tracks
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3995342/
https://www.ncbi.nlm.nih.gov/pubmed/24600054
http://dx.doi.org/10.1093/molbev/msu088
work_keys_str_mv AT bertelsfrederic automatedreconstructionofwholegenomephylogeniesfromshortsequencereads
AT silanderolink automatedreconstructionofwholegenomephylogeniesfromshortsequencereads
AT pachkovmikhail automatedreconstructionofwholegenomephylogeniesfromshortsequencereads
AT raineypaulb automatedreconstructionofwholegenomephylogeniesfromshortsequencereads
AT vannimwegenerik automatedreconstructionofwholegenomephylogeniesfromshortsequencereads