Cargando…

Probabilistic Phylogenetic Inference with Insertions and Deletions

A fundamental task in sequence analysis is to calculate the probability of a multiple alignment given a phylogenetic tree relating the sequences and an evolutionary model describing how sequences change over time. However, the most widely used phylogenetic models only account for residue substitutio...

Descripción completa

Detalles Bibliográficos
Autores principales: Rivas, Elena, Eddy, Sean R.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2527138/
https://www.ncbi.nlm.nih.gov/pubmed/18787703
http://dx.doi.org/10.1371/journal.pcbi.1000172
_version_ 1782158791342555136
author Rivas, Elena
Eddy, Sean R.
author_facet Rivas, Elena
Eddy, Sean R.
author_sort Rivas, Elena
collection PubMed
description A fundamental task in sequence analysis is to calculate the probability of a multiple alignment given a phylogenetic tree relating the sequences and an evolutionary model describing how sequences change over time. However, the most widely used phylogenetic models only account for residue substitution events. We describe a probabilistic model of a multiple sequence alignment that accounts for insertion and deletion events in addition to substitutions, given a phylogenetic tree, using a rate matrix augmented by the gap character. Starting from a continuous Markov process, we construct a non-reversible generative (birth–death) evolutionary model for insertions and deletions. The model assumes that insertion and deletion events occur one residue at a time. We apply this model to phylogenetic tree inference by extending the program dnaml in phylip. Using standard benchmarking methods on simulated data and a new “concordance test” benchmark on real ribosomal RNA alignments, we show that the extended program dnaml ε improves accuracy relative to the usual approach of ignoring gaps, while retaining the computational efficiency of the Felsenstein peeling algorithm.
format Text
id pubmed-2527138
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-25271382008-09-19 Probabilistic Phylogenetic Inference with Insertions and Deletions Rivas, Elena Eddy, Sean R. PLoS Comput Biol Research Article A fundamental task in sequence analysis is to calculate the probability of a multiple alignment given a phylogenetic tree relating the sequences and an evolutionary model describing how sequences change over time. However, the most widely used phylogenetic models only account for residue substitution events. We describe a probabilistic model of a multiple sequence alignment that accounts for insertion and deletion events in addition to substitutions, given a phylogenetic tree, using a rate matrix augmented by the gap character. Starting from a continuous Markov process, we construct a non-reversible generative (birth–death) evolutionary model for insertions and deletions. The model assumes that insertion and deletion events occur one residue at a time. We apply this model to phylogenetic tree inference by extending the program dnaml in phylip. Using standard benchmarking methods on simulated data and a new “concordance test” benchmark on real ribosomal RNA alignments, we show that the extended program dnaml ε improves accuracy relative to the usual approach of ignoring gaps, while retaining the computational efficiency of the Felsenstein peeling algorithm. Public Library of Science 2008-09-19 /pmc/articles/PMC2527138/ /pubmed/18787703 http://dx.doi.org/10.1371/journal.pcbi.1000172 Text en Rivas, Eddy. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Rivas, Elena
Eddy, Sean R.
Probabilistic Phylogenetic Inference with Insertions and Deletions
title Probabilistic Phylogenetic Inference with Insertions and Deletions
title_full Probabilistic Phylogenetic Inference with Insertions and Deletions
title_fullStr Probabilistic Phylogenetic Inference with Insertions and Deletions
title_full_unstemmed Probabilistic Phylogenetic Inference with Insertions and Deletions
title_short Probabilistic Phylogenetic Inference with Insertions and Deletions
title_sort probabilistic phylogenetic inference with insertions and deletions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2527138/
https://www.ncbi.nlm.nih.gov/pubmed/18787703
http://dx.doi.org/10.1371/journal.pcbi.1000172
work_keys_str_mv AT rivaselena probabilisticphylogeneticinferencewithinsertionsanddeletions
AT eddyseanr probabilisticphylogeneticinferencewithinsertionsanddeletions