PALM: A Paralleled and Integrated Framework for Phylogenetic Inference with Automatic Likelihood Model Selectors

Bibliographic Details
Main authors: Chen, Shu-Hwa; Su, Sheng-Yao; Lo, Chen-Zen; Chen, Kuei-Hsien; Huang, Teng-Jay; Kuo, Bo-Han; Lin, Chung-Yen
Format: Text
Language: English
Published: Public Library of Science 2009
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2785425/
https://www.ncbi.nlm.nih.gov/pubmed/19997614
http://dx.doi.org/10.1371/journal.pone.0008116
author Chen, Shu-Hwa
Su, Sheng-Yao
Lo, Chen-Zen
Chen, Kuei-Hsien
Huang, Teng-Jay
Kuo, Bo-Han
Lin, Chung-Yen
collection PubMed
description BACKGROUND: Selecting an appropriate substitution model and deriving a tree topology for a given sequence set are essential in phylogenetic analysis. However, these time-consuming, computationally intensive tasks rely on knowledge of substitution model theory and related expertise to run through all possible combinations of several separate programs. To ensure a thorough and efficient analysis and avoid tedious manipulation of various programs, this work presents an intuitive framework, the phylogenetic reconstruction with automatic likelihood model selectors (PALM), with convincing, updated algorithms and a best-fit model selection mechanism for seamless phylogenetic analysis.
METHODOLOGY: As an integrated framework of ClustalW, PhyML, MODELTEST, ProtTest, and several in-house programs, PALM evaluates the fit of 56 substitution models for nucleotide sequences and 112 substitution models for protein sequences, scoring them under various criteria. The input for PALM can be either sequences in FASTA format or a sequence alignment file in PHYLIP format. To accelerate maximum likelihood computation and bootstrapping, this work integrates MPICH2/PhyML, PalmMonitor, and the Palm job controller across several multiprocessor machines and adopts a task-parallel approach. Moreover, an intuitive, interactive web component, PalmTree, was developed for displaying and manipulating the output tree, with options for tree rooting, branch swapping, viewing branch lengths and bootstrap scores, and removing nodes to restart the analysis iteratively.
SIGNIFICANCE: The workflow of PALM is straightforward and coherent. Via a succinct, user-friendly interface, researchers unfamiliar with phylogenetic analysis can easily use this server to submit sequences, retrieve the output, and re-submit a job based on a previous result if some sequences are to be deleted or added for phylogenetic reconstruction. PALM infers phylogenetic relationships not only by overcoming the computational difficulty of ML methods but also by providing statistical methods for model selection and bootstrapping. The proposed approach can reduce calculation time, which is particularly relevant when analyzing a large data set. PALM can be accessed online at http://palm.iis.sinica.edu.tw.
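The best-fit model selection mentioned in the description can be illustrated with a minimal sketch. The abstract does not name the scoring criteria, so the use of AIC and BIC here is an assumption (both are standard in tools such as MODELTEST and ProtTest). The function names, log-likelihoods, and parameter counts below are hypothetical and are not taken from PALM.

```python
import math

def aic(log_likelihood, n_params):
    """Akaike information criterion: lower is better."""
    return 2 * n_params - 2 * log_likelihood

def bic(log_likelihood, n_params, n_sites):
    """Bayesian information criterion: lower is better."""
    return n_params * math.log(n_sites) - 2 * log_likelihood

def select_best_model(fits, n_sites, criterion="aic"):
    """Return (best_model_name, scores) for a dict of model fits.

    fits maps a model name to (log_likelihood, n_free_params).
    """
    if criterion not in ("aic", "bic"):
        raise ValueError("criterion must be 'aic' or 'bic'")
    scores = {}
    for name, (lnl, k) in fits.items():
        if criterion == "aic":
            scores[name] = aic(lnl, k)
        else:
            scores[name] = bic(lnl, k, n_sites)
    # The best-fit model is the one with the lowest score.
    best = min(scores, key=scores.get)
    return best, scores

# Hypothetical log-likelihoods and free-parameter counts, for illustration only.
fits = {
    "JC69": (-2450.0, 0),
    "HKY85": (-2400.0, 4),
    "GTR": (-2398.0, 8),
}

best, scores = select_best_model(fits, n_sites=1000, criterion="aic")
print(best, scores[best])
```

With these made-up numbers, the richer GTR model has a slightly higher likelihood than HKY85, but the penalty for its extra parameters outweighs the gain, which is the trade-off such criteria are designed to capture.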
format Text
id pubmed-2785425
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-2785425 2009-12-08 PLoS One Research Article Public Library of Science 2009-12-07 /pmc/articles/PMC2785425/ /pubmed/19997614 http://dx.doi.org/10.1371/journal.pone.0008116 Text en Chen et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title PALM: A Paralleled and Integrated Framework for Phylogenetic Inference with Automatic Likelihood Model Selectors
topic Research Article