Cargando…

SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees

BACKGROUND: Phylogenetic trees based on sequences from a set of taxa can be incongruent due to horizontal gene transfer (HGT). By identifying the HGT events, we can reconcile the gene trees and derive a taxon tree that adequately represents the species' evolutionary history. One HGT can be repr...

Descripción completa

Detalles Bibliográficos
Autores principales: Hill, Tobias, Nordström, Karl JV, Thollesson, Mikael, Säfström, Tommy M, Vernersson, Andreas KE, Fredriksson, Robert, Schiöth, Helgi B
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2829038/
https://www.ncbi.nlm.nih.gov/pubmed/20152048
http://dx.doi.org/10.1186/1471-2148-10-42
_version_ 1782178065711890432
author Hill, Tobias
Nordström, Karl JV
Thollesson, Mikael
Säfström, Tommy M
Vernersson, Andreas KE
Fredriksson, Robert
Schiöth, Helgi B
author_facet Hill, Tobias
Nordström, Karl JV
Thollesson, Mikael
Säfström, Tommy M
Vernersson, Andreas KE
Fredriksson, Robert
Schiöth, Helgi B
author_sort Hill, Tobias
collection PubMed
description BACKGROUND: Phylogenetic trees based on sequences from a set of taxa can be incongruent due to horizontal gene transfer (HGT). By identifying the HGT events, we can reconcile the gene trees and derive a taxon tree that adequately represents the species' evolutionary history. One HGT can be represented by a rooted Subtree Prune and Regraft (RSPR) operation and the number of RSPRs separating two trees corresponds to the minimum number of HGT events. Identifying the minimum number of RSPRs separating two trees is NP-hard, but the problem can be reduced to fixed parameter tractable. A number of heuristic and two exact approaches to identifying the minimum number of RSPRs have been proposed. This is the first implementation delivering an exact solution as well as the intermediate trees connecting the input trees. RESULTS: We present the SPR Identification Tool (SPRIT), a novel algorithm that solves the fixed parameter tractable minimum RSPR problem and its GPL licensed Java implementation. The algorithm can be used in two ways, exhaustive search that guarantees the minimum RSPR distance and a heuristic approach that guarantees finding a solution, but not necessarily the minimum one. We benchmarked SPRIT against other software in two different settings, small to medium sized trees i.e. five to one hundred taxa and large trees i.e. thousands of taxa. In the small to medium tree size setting with random artificial incongruence, SPRIT's heuristic mode outperforms the other software by always delivering a solution with a low overestimation of the RSPR distance. In the large tree setting SPRIT compares well to the alternatives when benchmarked on finding a minimum solution within a reasonable time. SPRIT presents both the minimum RSPR distance and the intermediate trees. CONCLUSIONS: When used in exhaustive search mode, SPRIT identifies the minimum number of RSPRs needed to reconcile two incongruent rooted trees. SPRIT also performs quick approximations of the minimum RSPR distance, which are comparable to, and often better than, purely heuristic solutions. Put together, SPRIT is an excellent tool for identification of HGT events and pinpointing which taxa have been involved in HGT.
format Text
id pubmed-2829038
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28290382010-02-26 SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees Hill, Tobias Nordström, Karl JV Thollesson, Mikael Säfström, Tommy M Vernersson, Andreas KE Fredriksson, Robert Schiöth, Helgi B BMC Evol Biol Software BACKGROUND: Phylogenetic trees based on sequences from a set of taxa can be incongruent due to horizontal gene transfer (HGT). By identifying the HGT events, we can reconcile the gene trees and derive a taxon tree that adequately represents the species' evolutionary history. One HGT can be represented by a rooted Subtree Prune and Regraft (RSPR) operation and the number of RSPRs separating two trees corresponds to the minimum number of HGT events. Identifying the minimum number of RSPRs separating two trees is NP-hard, but the problem can be reduced to fixed parameter tractable. A number of heuristic and two exact approaches to identifying the minimum number of RSPRs have been proposed. This is the first implementation delivering an exact solution as well as the intermediate trees connecting the input trees. RESULTS: We present the SPR Identification Tool (SPRIT), a novel algorithm that solves the fixed parameter tractable minimum RSPR problem and its GPL licensed Java implementation. The algorithm can be used in two ways, exhaustive search that guarantees the minimum RSPR distance and a heuristic approach that guarantees finding a solution, but not necessarily the minimum one. We benchmarked SPRIT against other software in two different settings, small to medium sized trees i.e. five to one hundred taxa and large trees i.e. thousands of taxa. In the small to medium tree size setting with random artificial incongruence, SPRIT's heuristic mode outperforms the other software by always delivering a solution with a low overestimation of the RSPR distance. In the large tree setting SPRIT compares well to the alternatives when benchmarked on finding a minimum solution within a reasonable time. SPRIT presents both the minimum RSPR distance and the intermediate trees. CONCLUSIONS: When used in exhaustive search mode, SPRIT identifies the minimum number of RSPRs needed to reconcile two incongruent rooted trees. SPRIT also performs quick approximations of the minimum RSPR distance, which are comparable to, and often better than, purely heuristic solutions. Put together, SPRIT is an excellent tool for identification of HGT events and pinpointing which taxa have been involved in HGT. BioMed Central 2010-02-13 /pmc/articles/PMC2829038/ /pubmed/20152048 http://dx.doi.org/10.1186/1471-2148-10-42 Text en Copyright ©2010 Hill et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Hill, Tobias
Nordström, Karl JV
Thollesson, Mikael
Säfström, Tommy M
Vernersson, Andreas KE
Fredriksson, Robert
Schiöth, Helgi B
SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees
title SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees
title_full SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees
title_fullStr SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees
title_full_unstemmed SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees
title_short SPRIT: Identifying horizontal gene transfer in rooted phylogenetic trees
title_sort sprit: identifying horizontal gene transfer in rooted phylogenetic trees
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2829038/
https://www.ncbi.nlm.nih.gov/pubmed/20152048
http://dx.doi.org/10.1186/1471-2148-10-42
work_keys_str_mv AT hilltobias spritidentifyinghorizontalgenetransferinrootedphylogenetictrees
AT nordstromkarljv spritidentifyinghorizontalgenetransferinrootedphylogenetictrees
AT thollessonmikael spritidentifyinghorizontalgenetransferinrootedphylogenetictrees
AT safstromtommym spritidentifyinghorizontalgenetransferinrootedphylogenetictrees
AT vernerssonandreaske spritidentifyinghorizontalgenetransferinrootedphylogenetictrees
AT fredrikssonrobert spritidentifyinghorizontalgenetransferinrootedphylogenetictrees
AT schiothhelgib spritidentifyinghorizontalgenetransferinrootedphylogenetictrees