Cargando…
PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees
BACKGROUND: Phylogenetic species trees are widely used in inferring evolutionary relationships. Existing software and algorithms mainly focus on phylogenetic inference. However, less attention has been paid to intermediate steps, such as processing extremely large sequences and preparing configure f...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889546/ https://www.ncbi.nlm.nih.gov/pubmed/31791235 http://dx.doi.org/10.1186/s12862-019-1541-x |
_version_ | 1783475440960143360 |
---|---|
author | Fang, Yang Liu, Chengcheng Lin, Jiangyi Li, Xufeng Alavian, Kambiz N. Yang, Yi Niu, Yulong |
author_facet | Fang, Yang Liu, Chengcheng Lin, Jiangyi Li, Xufeng Alavian, Kambiz N. Yang, Yi Niu, Yulong |
author_sort | Fang, Yang |
collection | PubMed |
description | BACKGROUND: Phylogenetic species trees are widely used in inferring evolutionary relationships. Existing software and algorithms mainly focus on phylogenetic inference. However, less attention has been paid to intermediate steps, such as processing extremely large sequences and preparing configure files to connect multiple software. When the species number is large, the intermediate steps become a bottleneck that may seriously affect the efficiency of tree building. RESULTS: Here, we present an easy-to-use pipeline named PhySpeTree to facilitate the reconstruction of species trees across bacterial, archaeal, and eukaryotic organisms. Users need only to input the abbreviations of species names; PhySpeTree prepares complex configure files for different software, then automatically downloads genomic data, cleans sequences, and builds trees. PhySpeTree allows users to perform critical steps such as sequence alignment and tree construction by adjusting advanced options. PhySpeTree provides two parallel pipelines based on concatenated highly conserved proteins and small subunit ribosomal RNA sequences, respectively. Accessory modules, such as those for inserting new species, generating visualization configurations, and combining trees, are distributed along with PhySpeTree. CONCLUSIONS: Together with accessory modules, PhySpeTree significantly simplifies tree reconstruction. PhySpeTree is implemented in Python running on modern operating systems (Linux, macOS, and Windows). The source code is freely available with detailed documentation (https://github.com/yangfangs/physpetools). |
format | Online Article Text |
id | pubmed-6889546 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68895462019-12-11 PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees Fang, Yang Liu, Chengcheng Lin, Jiangyi Li, Xufeng Alavian, Kambiz N. Yang, Yi Niu, Yulong BMC Evol Biol Software BACKGROUND: Phylogenetic species trees are widely used in inferring evolutionary relationships. Existing software and algorithms mainly focus on phylogenetic inference. However, less attention has been paid to intermediate steps, such as processing extremely large sequences and preparing configure files to connect multiple software. When the species number is large, the intermediate steps become a bottleneck that may seriously affect the efficiency of tree building. RESULTS: Here, we present an easy-to-use pipeline named PhySpeTree to facilitate the reconstruction of species trees across bacterial, archaeal, and eukaryotic organisms. Users need only to input the abbreviations of species names; PhySpeTree prepares complex configure files for different software, then automatically downloads genomic data, cleans sequences, and builds trees. PhySpeTree allows users to perform critical steps such as sequence alignment and tree construction by adjusting advanced options. PhySpeTree provides two parallel pipelines based on concatenated highly conserved proteins and small subunit ribosomal RNA sequences, respectively. Accessory modules, such as those for inserting new species, generating visualization configurations, and combining trees, are distributed along with PhySpeTree. CONCLUSIONS: Together with accessory modules, PhySpeTree significantly simplifies tree reconstruction. PhySpeTree is implemented in Python running on modern operating systems (Linux, macOS, and Windows). The source code is freely available with detailed documentation (https://github.com/yangfangs/physpetools). BioMed Central 2019-12-02 /pmc/articles/PMC6889546/ /pubmed/31791235 http://dx.doi.org/10.1186/s12862-019-1541-x Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Fang, Yang Liu, Chengcheng Lin, Jiangyi Li, Xufeng Alavian, Kambiz N. Yang, Yi Niu, Yulong PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees |
title | PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees |
title_full | PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees |
title_fullStr | PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees |
title_full_unstemmed | PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees |
title_short | PhySpeTree: an automated pipeline for reconstructing phylogenetic species trees |
title_sort | physpetree: an automated pipeline for reconstructing phylogenetic species trees |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6889546/ https://www.ncbi.nlm.nih.gov/pubmed/31791235 http://dx.doi.org/10.1186/s12862-019-1541-x |
work_keys_str_mv | AT fangyang physpetreeanautomatedpipelineforreconstructingphylogeneticspeciestrees AT liuchengcheng physpetreeanautomatedpipelineforreconstructingphylogeneticspeciestrees AT linjiangyi physpetreeanautomatedpipelineforreconstructingphylogeneticspeciestrees AT lixufeng physpetreeanautomatedpipelineforreconstructingphylogeneticspeciestrees AT alaviankambizn physpetreeanautomatedpipelineforreconstructingphylogeneticspeciestrees AT yangyi physpetreeanautomatedpipelineforreconstructingphylogeneticspeciestrees AT niuyulong physpetreeanautomatedpipelineforreconstructingphylogeneticspeciestrees |