Cargando…
Explaining evolution via constrained persistent perfect phylogeny
BACKGROUND: The perfect phylogeny is an often used model in phylogenetics since it provides an efficient basic procedure for representing the evolution of genomic binary characters in several frameworks, such as for example in haplotype inference. The model, which is conceptually the simplest, is ba...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240218/ https://www.ncbi.nlm.nih.gov/pubmed/25572381 http://dx.doi.org/10.1186/1471-2164-15-S6-S10 |
_version_ | 1782345700965613568 |
---|---|
author | Bonizzoni, Paola Carrieri, Anna Paola Della Vedova, Gianluca Trucco, Gabriella |
author_facet | Bonizzoni, Paola Carrieri, Anna Paola Della Vedova, Gianluca Trucco, Gabriella |
author_sort | Bonizzoni, Paola |
collection | PubMed |
description | BACKGROUND: The perfect phylogeny is an often used model in phylogenetics since it provides an efficient basic procedure for representing the evolution of genomic binary characters in several frameworks, such as for example in haplotype inference. The model, which is conceptually the simplest, is based on the infinite sites assumption, that is no character can mutate more than once in the whole tree. A main open problem regarding the model is finding generalizations that retain the computational tractability of the original model but are more flexible in modeling biological data when the infinite site assumption is violated because of e.g. back mutations. A special case of back mutations that has been considered in the study of the evolution of protein domains (where a domain is acquired and then lost) is persistency, that is the fact that a character is allowed to return back to the ancestral state. In this model characters can be gained and lost at most once. In this paper we consider the computational problem of explaining binary data by the Persistent Perfect Phylogeny model (referred as PPP) and for this purpose we investigate the problem of reconstructing an evolution where some constraints are imposed on the paths of the tree. RESULTS: We define a natural generalization of the PPP problem obtained by requiring that for some pairs (character, species), neither the species nor any of its ancestors can have the character. In other words, some characters cannot be persistent for some species. This new problem is called Constrained PPP (CPPP). Based on a graph formulation of the CPPP problem, we are able to provide a polynomial time solution for the CPPP problem for matrices whose conflict graph has no edges. Using this result, we develop a parameterized algorithm for solving the CPPP problem where the parameter is the number of characters. CONCLUSIONS: A preliminary experimental analysis shows that the constrained persistent perfect phylogeny model allows to explain efficiently data that do not conform with the classical perfect phylogeny model. |
format | Online Article Text |
id | pubmed-4240218 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42402182014-11-25 Explaining evolution via constrained persistent perfect phylogeny Bonizzoni, Paola Carrieri, Anna Paola Della Vedova, Gianluca Trucco, Gabriella BMC Genomics Research BACKGROUND: The perfect phylogeny is an often used model in phylogenetics since it provides an efficient basic procedure for representing the evolution of genomic binary characters in several frameworks, such as for example in haplotype inference. The model, which is conceptually the simplest, is based on the infinite sites assumption, that is no character can mutate more than once in the whole tree. A main open problem regarding the model is finding generalizations that retain the computational tractability of the original model but are more flexible in modeling biological data when the infinite site assumption is violated because of e.g. back mutations. A special case of back mutations that has been considered in the study of the evolution of protein domains (where a domain is acquired and then lost) is persistency, that is the fact that a character is allowed to return back to the ancestral state. In this model characters can be gained and lost at most once. In this paper we consider the computational problem of explaining binary data by the Persistent Perfect Phylogeny model (referred as PPP) and for this purpose we investigate the problem of reconstructing an evolution where some constraints are imposed on the paths of the tree. RESULTS: We define a natural generalization of the PPP problem obtained by requiring that for some pairs (character, species), neither the species nor any of its ancestors can have the character. In other words, some characters cannot be persistent for some species. This new problem is called Constrained PPP (CPPP). Based on a graph formulation of the CPPP problem, we are able to provide a polynomial time solution for the CPPP problem for matrices whose conflict graph has no edges. Using this result, we develop a parameterized algorithm for solving the CPPP problem where the parameter is the number of characters. CONCLUSIONS: A preliminary experimental analysis shows that the constrained persistent perfect phylogeny model allows to explain efficiently data that do not conform with the classical perfect phylogeny model. BioMed Central 2014-10-17 /pmc/articles/PMC4240218/ /pubmed/25572381 http://dx.doi.org/10.1186/1471-2164-15-S6-S10 Text en Copyright © 2014 Bonizzoni et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Bonizzoni, Paola Carrieri, Anna Paola Della Vedova, Gianluca Trucco, Gabriella Explaining evolution via constrained persistent perfect phylogeny |
title | Explaining evolution via constrained persistent perfect phylogeny |
title_full | Explaining evolution via constrained persistent perfect phylogeny |
title_fullStr | Explaining evolution via constrained persistent perfect phylogeny |
title_full_unstemmed | Explaining evolution via constrained persistent perfect phylogeny |
title_short | Explaining evolution via constrained persistent perfect phylogeny |
title_sort | explaining evolution via constrained persistent perfect phylogeny |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240218/ https://www.ncbi.nlm.nih.gov/pubmed/25572381 http://dx.doi.org/10.1186/1471-2164-15-S6-S10 |
work_keys_str_mv | AT bonizzonipaola explainingevolutionviaconstrainedpersistentperfectphylogeny AT carrieriannapaola explainingevolutionviaconstrainedpersistentperfectphylogeny AT dellavedovagianluca explainingevolutionviaconstrainedpersistentperfectphylogeny AT truccogabriella explainingevolutionviaconstrainedpersistentperfectphylogeny |