Cargando…

MultiPhyl: a high-throughput phylogenomics webserver using distributed computing

With the number of fully sequenced genomes increasing steadily, there is greater interest in performing large-scale phylogenomic analyses from large numbers of individual gene families. Maximum likelihood (ML) has been shown repeatedly to be one of the most accurate methods for phylogenetic construc...

Descripción completa

Detalles Bibliográficos
Autores principales:	Keane, Thomas M., Naughton, Thomas J., McInerney, James O.
Formato:	Texto
Lenguaje:	English
Publicado:	Oxford University Press 2007
Materias:	Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933173/ https://www.ncbi.nlm.nih.gov/pubmed/17553837 http://dx.doi.org/10.1093/nar/gkm359

_version_	1782134302696275968
author	Keane, Thomas M. Naughton, Thomas J. McInerney, James O.
author_facet	Keane, Thomas M. Naughton, Thomas J. McInerney, James O.
author_sort	Keane, Thomas M.
collection	PubMed
description	With the number of fully sequenced genomes increasing steadily, there is greater interest in performing large-scale phylogenomic analyses from large numbers of individual gene families. Maximum likelihood (ML) has been shown repeatedly to be one of the most accurate methods for phylogenetic construction. Recently, there have been a number of algorithmic improvements in maximum-likelihood-based tree search methods. However, it can still take a long time to analyse the evolutionary history of many gene families using a single computer. Distributed computing refers to a method of combining the computing power of multiple computers in order to perform some larger overall calculation. In this article, we present the first high-throughput implementation of a distributed phylogenetics platform, MultiPhyl, capable of using the idle computational resources of many heterogeneous non-dedicated machines to form a phylogenetics supercomputer. MultiPhyl allows a user to upload hundreds or thousands of amino acid or nucleotide alignments simultaneously and perform computationally intensive tasks such as model selection, tree searching and bootstrapping of each of the alignments using many desktop machines. The program implements a set of 88 amino acid models and 56 nucleotide maximum likelihood models and a variety of statistical methods for choosing between alternative models. A MultiPhyl webserver is available for public use at: http://www.cs.nuim.ie/distributed/multiphyl.php.
format	Text
id	pubmed-1933173
institution	National Center for Biotechnology Information
language	English
publishDate	2007
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-19331732007-07-31 MultiPhyl: a high-throughput phylogenomics webserver using distributed computing Keane, Thomas M. Naughton, Thomas J. McInerney, James O. Nucleic Acids Res Articles With the number of fully sequenced genomes increasing steadily, there is greater interest in performing large-scale phylogenomic analyses from large numbers of individual gene families. Maximum likelihood (ML) has been shown repeatedly to be one of the most accurate methods for phylogenetic construction. Recently, there have been a number of algorithmic improvements in maximum-likelihood-based tree search methods. However, it can still take a long time to analyse the evolutionary history of many gene families using a single computer. Distributed computing refers to a method of combining the computing power of multiple computers in order to perform some larger overall calculation. In this article, we present the first high-throughput implementation of a distributed phylogenetics platform, MultiPhyl, capable of using the idle computational resources of many heterogeneous non-dedicated machines to form a phylogenetics supercomputer. MultiPhyl allows a user to upload hundreds or thousands of amino acid or nucleotide alignments simultaneously and perform computationally intensive tasks such as model selection, tree searching and bootstrapping of each of the alignments using many desktop machines. The program implements a set of 88 amino acid models and 56 nucleotide maximum likelihood models and a variety of statistical methods for choosing between alternative models. A MultiPhyl webserver is available for public use at: http://www.cs.nuim.ie/distributed/multiphyl.php. Oxford University Press 2007-07 2007-06-06 /pmc/articles/PMC1933173/ /pubmed/17553837 http://dx.doi.org/10.1093/nar/gkm359 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Articles Keane, Thomas M. Naughton, Thomas J. McInerney, James O. MultiPhyl: a high-throughput phylogenomics webserver using distributed computing
title	MultiPhyl: a high-throughput phylogenomics webserver using distributed computing
title_full	MultiPhyl: a high-throughput phylogenomics webserver using distributed computing
title_fullStr	MultiPhyl: a high-throughput phylogenomics webserver using distributed computing
title_full_unstemmed	MultiPhyl: a high-throughput phylogenomics webserver using distributed computing
title_short	MultiPhyl: a high-throughput phylogenomics webserver using distributed computing
title_sort	multiphyl: a high-throughput phylogenomics webserver using distributed computing
topic	Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933173/ https://www.ncbi.nlm.nih.gov/pubmed/17553837 http://dx.doi.org/10.1093/nar/gkm359
work_keys_str_mv	AT keanethomasm multiphylahighthroughputphylogenomicswebserverusingdistributedcomputing AT naughtonthomasj multiphylahighthroughputphylogenomicswebserverusingdistributedcomputing AT mcinerneyjameso multiphylahighthroughputphylogenomicswebserverusingdistributedcomputing

MultiPhyl: a high-throughput phylogenomics webserver using distributed computing

Ejemplares similares