Cargando…

Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer

Advances in sequencing have generated a large number of complete genomes. Traditionally, phylogenetic analysis relies on alignments of orthologs, but defining orthologs and separating them from paralogs is a complex task that may not always be suited to the large datasets of the future. An alternati...

Descripción completa

Detalles Bibliográficos
Autores principales: Bromberg, Raquel, Grishin, Nick V., Otwinowski, Zbyszek
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4918981/
https://www.ncbi.nlm.nih.gov/pubmed/27336403
http://dx.doi.org/10.1371/journal.pcbi.1004985
_version_ 1782439190368092160
author Bromberg, Raquel
Grishin, Nick V.
Otwinowski, Zbyszek
author_facet Bromberg, Raquel
Grishin, Nick V.
Otwinowski, Zbyszek
author_sort Bromberg, Raquel
collection PubMed
description Advances in sequencing have generated a large number of complete genomes. Traditionally, phylogenetic analysis relies on alignments of orthologs, but defining orthologs and separating them from paralogs is a complex task that may not always be suited to the large datasets of the future. An alternative to traditional, alignment-based approaches are whole-genome, alignment-free methods. These methods are scalable and require minimal manual intervention. We developed SlopeTree, a new alignment-free method that estimates evolutionary distances by measuring the decay of exact substring matches as a function of match length. SlopeTree corrects for horizontal gene transfer, for composition variation and low complexity sequences, and for branch-length nonlinearity caused by multiple mutations at the same site. We tested SlopeTree on 495 bacteria, 73 archaea, and 72 strains of Escherichia coli and Shigella. We compared our trees to the NCBI taxonomy, to trees based on concatenated alignments, and to trees produced by other alignment-free methods. The results were consistent with current knowledge about prokaryotic evolution. We assessed differences in tree topology over different methods and settings and found that the majority of bacteria and archaea have a core set of proteins that evolves by descent. In trees built from complete genomes rather than sets of core genes, we observed some grouping by phenotype rather than phylogeny, for instance with a cluster of sulfur-reducing thermophilic bacteria coming together irrespective of their phyla. The source-code for SlopeTree is available at: http://prodata.swmed.edu/download/pub/slopetree_v1/slopetree.tar.gz.
format Online
Article
Text
id pubmed-4918981
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-49189812016-07-08 Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer Bromberg, Raquel Grishin, Nick V. Otwinowski, Zbyszek PLoS Comput Biol Research Article Advances in sequencing have generated a large number of complete genomes. Traditionally, phylogenetic analysis relies on alignments of orthologs, but defining orthologs and separating them from paralogs is a complex task that may not always be suited to the large datasets of the future. An alternative to traditional, alignment-based approaches are whole-genome, alignment-free methods. These methods are scalable and require minimal manual intervention. We developed SlopeTree, a new alignment-free method that estimates evolutionary distances by measuring the decay of exact substring matches as a function of match length. SlopeTree corrects for horizontal gene transfer, for composition variation and low complexity sequences, and for branch-length nonlinearity caused by multiple mutations at the same site. We tested SlopeTree on 495 bacteria, 73 archaea, and 72 strains of Escherichia coli and Shigella. We compared our trees to the NCBI taxonomy, to trees based on concatenated alignments, and to trees produced by other alignment-free methods. The results were consistent with current knowledge about prokaryotic evolution. We assessed differences in tree topology over different methods and settings and found that the majority of bacteria and archaea have a core set of proteins that evolves by descent. In trees built from complete genomes rather than sets of core genes, we observed some grouping by phenotype rather than phylogeny, for instance with a cluster of sulfur-reducing thermophilic bacteria coming together irrespective of their phyla. The source-code for SlopeTree is available at: http://prodata.swmed.edu/download/pub/slopetree_v1/slopetree.tar.gz. Public Library of Science 2016-06-23 /pmc/articles/PMC4918981/ /pubmed/27336403 http://dx.doi.org/10.1371/journal.pcbi.1004985 Text en © 2016 Bromberg et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bromberg, Raquel
Grishin, Nick V.
Otwinowski, Zbyszek
Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer
title Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer
title_full Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer
title_fullStr Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer
title_full_unstemmed Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer
title_short Phylogeny Reconstruction with Alignment-Free Method That Corrects for Horizontal Gene Transfer
title_sort phylogeny reconstruction with alignment-free method that corrects for horizontal gene transfer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4918981/
https://www.ncbi.nlm.nih.gov/pubmed/27336403
http://dx.doi.org/10.1371/journal.pcbi.1004985
work_keys_str_mv AT brombergraquel phylogenyreconstructionwithalignmentfreemethodthatcorrectsforhorizontalgenetransfer
AT grishinnickv phylogenyreconstructionwithalignmentfreemethodthatcorrectsforhorizontalgenetransfer
AT otwinowskizbyszek phylogenyreconstructionwithalignmentfreemethodthatcorrectsforhorizontalgenetransfer