Cargando…

Explaining evolution via constrained persistent perfect phylogeny

BACKGROUND: The perfect phylogeny is an often used model in phylogenetics since it provides an efficient basic procedure for representing the evolution of genomic binary characters in several frameworks, such as for example in haplotype inference. The model, which is conceptually the simplest, is ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Bonizzoni, Paola, Carrieri, Anna Paola, Della Vedova, Gianluca, Trucco, Gabriella
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240218/
https://www.ncbi.nlm.nih.gov/pubmed/25572381
http://dx.doi.org/10.1186/1471-2164-15-S6-S10
_version_ 1782345700965613568
author Bonizzoni, Paola
Carrieri, Anna Paola
Della Vedova, Gianluca
Trucco, Gabriella
author_facet Bonizzoni, Paola
Carrieri, Anna Paola
Della Vedova, Gianluca
Trucco, Gabriella
author_sort Bonizzoni, Paola
collection PubMed
description BACKGROUND: The perfect phylogeny is an often used model in phylogenetics since it provides an efficient basic procedure for representing the evolution of genomic binary characters in several frameworks, such as for example in haplotype inference. The model, which is conceptually the simplest, is based on the infinite sites assumption, that is no character can mutate more than once in the whole tree. A main open problem regarding the model is finding generalizations that retain the computational tractability of the original model but are more flexible in modeling biological data when the infinite site assumption is violated because of e.g. back mutations. A special case of back mutations that has been considered in the study of the evolution of protein domains (where a domain is acquired and then lost) is persistency, that is the fact that a character is allowed to return back to the ancestral state. In this model characters can be gained and lost at most once. In this paper we consider the computational problem of explaining binary data by the Persistent Perfect Phylogeny model (referred as PPP) and for this purpose we investigate the problem of reconstructing an evolution where some constraints are imposed on the paths of the tree. RESULTS: We define a natural generalization of the PPP problem obtained by requiring that for some pairs (character, species), neither the species nor any of its ancestors can have the character. In other words, some characters cannot be persistent for some species. This new problem is called Constrained PPP (CPPP). Based on a graph formulation of the CPPP problem, we are able to provide a polynomial time solution for the CPPP problem for matrices whose conflict graph has no edges. Using this result, we develop a parameterized algorithm for solving the CPPP problem where the parameter is the number of characters. CONCLUSIONS: A preliminary experimental analysis shows that the constrained persistent perfect phylogeny model allows to explain efficiently data that do not conform with the classical perfect phylogeny model.
format Online
Article
Text
id pubmed-4240218
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42402182014-11-25 Explaining evolution via constrained persistent perfect phylogeny Bonizzoni, Paola Carrieri, Anna Paola Della Vedova, Gianluca Trucco, Gabriella BMC Genomics Research BACKGROUND: The perfect phylogeny is an often used model in phylogenetics since it provides an efficient basic procedure for representing the evolution of genomic binary characters in several frameworks, such as for example in haplotype inference. The model, which is conceptually the simplest, is based on the infinite sites assumption, that is no character can mutate more than once in the whole tree. A main open problem regarding the model is finding generalizations that retain the computational tractability of the original model but are more flexible in modeling biological data when the infinite site assumption is violated because of e.g. back mutations. A special case of back mutations that has been considered in the study of the evolution of protein domains (where a domain is acquired and then lost) is persistency, that is the fact that a character is allowed to return back to the ancestral state. In this model characters can be gained and lost at most once. In this paper we consider the computational problem of explaining binary data by the Persistent Perfect Phylogeny model (referred as PPP) and for this purpose we investigate the problem of reconstructing an evolution where some constraints are imposed on the paths of the tree. RESULTS: We define a natural generalization of the PPP problem obtained by requiring that for some pairs (character, species), neither the species nor any of its ancestors can have the character. In other words, some characters cannot be persistent for some species. This new problem is called Constrained PPP (CPPP). Based on a graph formulation of the CPPP problem, we are able to provide a polynomial time solution for the CPPP problem for matrices whose conflict graph has no edges. Using this result, we develop a parameterized algorithm for solving the CPPP problem where the parameter is the number of characters. CONCLUSIONS: A preliminary experimental analysis shows that the constrained persistent perfect phylogeny model allows to explain efficiently data that do not conform with the classical perfect phylogeny model. BioMed Central 2014-10-17 /pmc/articles/PMC4240218/ /pubmed/25572381 http://dx.doi.org/10.1186/1471-2164-15-S6-S10 Text en Copyright © 2014 Bonizzoni et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Bonizzoni, Paola
Carrieri, Anna Paola
Della Vedova, Gianluca
Trucco, Gabriella
Explaining evolution via constrained persistent perfect phylogeny
title Explaining evolution via constrained persistent perfect phylogeny
title_full Explaining evolution via constrained persistent perfect phylogeny
title_fullStr Explaining evolution via constrained persistent perfect phylogeny
title_full_unstemmed Explaining evolution via constrained persistent perfect phylogeny
title_short Explaining evolution via constrained persistent perfect phylogeny
title_sort explaining evolution via constrained persistent perfect phylogeny
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240218/
https://www.ncbi.nlm.nih.gov/pubmed/25572381
http://dx.doi.org/10.1186/1471-2164-15-S6-S10
work_keys_str_mv AT bonizzonipaola explainingevolutionviaconstrainedpersistentperfectphylogeny
AT carrieriannapaola explainingevolutionviaconstrainedpersistentperfectphylogeny
AT dellavedovagianluca explainingevolutionviaconstrainedpersistentperfectphylogeny
AT truccogabriella explainingevolutionviaconstrainedpersistentperfectphylogeny