Cargando…

Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood

We present an evolutionary placement algorithm (EPA) and a Web server for the rapid assignment of sequence fragments (short reads) to edges of a given phylogenetic tree under the maximum-likelihood model. The accuracy of the algorithm is evaluated on several real-world data sets and compared with pl...

Descripción completa

Detalles Bibliográficos
Autores principales: Berger, Simon A., Krompass, Denis, Stamatakis, Alexandros
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3078422/
https://www.ncbi.nlm.nih.gov/pubmed/21436105
http://dx.doi.org/10.1093/sysbio/syr010
_version_ 1782201949365469184
author Berger, Simon A.
Krompass, Denis
Stamatakis, Alexandros
author_facet Berger, Simon A.
Krompass, Denis
Stamatakis, Alexandros
author_sort Berger, Simon A.
collection PubMed
description We present an evolutionary placement algorithm (EPA) and a Web server for the rapid assignment of sequence fragments (short reads) to edges of a given phylogenetic tree under the maximum-likelihood model. The accuracy of the algorithm is evaluated on several real-world data sets and compared with placement by pair-wise sequence comparison, using edit distances and BLAST. We introduce a slow and accurate as well as a fast and less accurate placement algorithm. For the slow algorithm, we develop additional heuristic techniques that yield almost the same run times as the fast version with only a small loss of accuracy. When those additional heuristics are employed, the run time of the more accurate algorithm is comparable with that of a simple BLAST search for data sets with a high number of short query sequences. Moreover, the accuracy of the EPA is significantly higher, in particular when the sample of taxa in the reference topology is sparse or inadequate. Our algorithm, which has been integrated into RAxML, therefore provides an equally fast but more accurate alternative to BLAST for tree-based inference of the evolutionary origin and composition of short sequence reads. We are also actively developing a Web server that offers a freely available service for computing read placements on trees using the EPA.
format Text
id pubmed-3078422
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30784222011-04-18 Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood Berger, Simon A. Krompass, Denis Stamatakis, Alexandros Syst Biol Regular Articles We present an evolutionary placement algorithm (EPA) and a Web server for the rapid assignment of sequence fragments (short reads) to edges of a given phylogenetic tree under the maximum-likelihood model. The accuracy of the algorithm is evaluated on several real-world data sets and compared with placement by pair-wise sequence comparison, using edit distances and BLAST. We introduce a slow and accurate as well as a fast and less accurate placement algorithm. For the slow algorithm, we develop additional heuristic techniques that yield almost the same run times as the fast version with only a small loss of accuracy. When those additional heuristics are employed, the run time of the more accurate algorithm is comparable with that of a simple BLAST search for data sets with a high number of short query sequences. Moreover, the accuracy of the EPA is significantly higher, in particular when the sample of taxa in the reference topology is sparse or inadequate. Our algorithm, which has been integrated into RAxML, therefore provides an equally fast but more accurate alternative to BLAST for tree-based inference of the evolutionary origin and composition of short sequence reads. We are also actively developing a Web server that offers a freely available service for computing read placements on trees using the EPA. Oxford University Press 2011-05 2011-03-23 /pmc/articles/PMC3078422/ /pubmed/21436105 http://dx.doi.org/10.1093/sysbio/syr010 Text en © The Author(s) 2011. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Berger, Simon A.
Krompass, Denis
Stamatakis, Alexandros
Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood
title Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood
title_full Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood
title_fullStr Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood
title_full_unstemmed Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood
title_short Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood
title_sort performance, accuracy, and web server for evolutionary placement of short sequence reads under maximum likelihood
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3078422/
https://www.ncbi.nlm.nih.gov/pubmed/21436105
http://dx.doi.org/10.1093/sysbio/syr010
work_keys_str_mv AT bergersimona performanceaccuracyandwebserverforevolutionaryplacementofshortsequencereadsundermaximumlikelihood
AT krompassdenis performanceaccuracyandwebserverforevolutionaryplacementofshortsequencereadsundermaximumlikelihood
AT stamatakisalexandros performanceaccuracyandwebserverforevolutionaryplacementofshortsequencereadsundermaximumlikelihood