Cargando…
Mapping ancestral genomes with massive gene loss: A matrix sandwich problem
Motivation: Ancestral genomes provide a better way to understand the structural evolution of genomes than the simple comparison of extant genomes. Most ancestral genome reconstruction methods rely on universal markers, that is, homologous families of DNA segments present in exactly one exemplar in e...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117370/ https://www.ncbi.nlm.nih.gov/pubmed/21685079 http://dx.doi.org/10.1093/bioinformatics/btr224 |
_version_ | 1782206324331773952 |
---|---|
author | Gavranović, Haris Chauve, Cedric Salse, Jérôme Tannier, Eric |
author_facet | Gavranović, Haris Chauve, Cedric Salse, Jérôme Tannier, Eric |
author_sort | Gavranović, Haris |
collection | PubMed |
description | Motivation: Ancestral genomes provide a better way to understand the structural evolution of genomes than the simple comparison of extant genomes. Most ancestral genome reconstruction methods rely on universal markers, that is, homologous families of DNA segments present in exactly one exemplar in every considered species. Complex histories of genes or other markers, undergoing duplications and losses, are rarely taken into account. It follows that some ancestors are inaccessible by these methods, such as the proto–monocotyledon whose evolution involved massive gene loss following a whole genome duplication. Results: We propose a mapping approach based on the combinatorial notion of ‘sandwich consecutive ones matrix’, which explicitly takes gene losses into account. We introduce combinatorial optimization problems related to this concept, and propose a heuristic solver and a lower bound on the optimal solution. We use these results to propose a configuration for the proto-chromosomes of the monocot ancestor, and study the accuracy of this configuration. We also use our method to reconstruct the ancestral boreoeutherian genomes, which illustrates that the framework we propose is not specific to plant paleogenomics but is adapted to reconstruct any ancestral genome from extant genomes with heterogeneous marker content. Availability: Upon request to the authors. Contact: haris.gavranovic@gmail.com; eric.tannier@inria.fr |
format | Online Article Text |
id | pubmed-3117370 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-31173702011-06-17 Mapping ancestral genomes with massive gene loss: A matrix sandwich problem Gavranović, Haris Chauve, Cedric Salse, Jérôme Tannier, Eric Bioinformatics Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Motivation: Ancestral genomes provide a better way to understand the structural evolution of genomes than the simple comparison of extant genomes. Most ancestral genome reconstruction methods rely on universal markers, that is, homologous families of DNA segments present in exactly one exemplar in every considered species. Complex histories of genes or other markers, undergoing duplications and losses, are rarely taken into account. It follows that some ancestors are inaccessible by these methods, such as the proto–monocotyledon whose evolution involved massive gene loss following a whole genome duplication. Results: We propose a mapping approach based on the combinatorial notion of ‘sandwich consecutive ones matrix’, which explicitly takes gene losses into account. We introduce combinatorial optimization problems related to this concept, and propose a heuristic solver and a lower bound on the optimal solution. We use these results to propose a configuration for the proto-chromosomes of the monocot ancestor, and study the accuracy of this configuration. We also use our method to reconstruct the ancestral boreoeutherian genomes, which illustrates that the framework we propose is not specific to plant paleogenomics but is adapted to reconstruct any ancestral genome from extant genomes with heterogeneous marker content. Availability: Upon request to the authors. Contact: haris.gavranovic@gmail.com; eric.tannier@inria.fr Oxford University Press 2011-07-01 2011-06-14 /pmc/articles/PMC3117370/ /pubmed/21685079 http://dx.doi.org/10.1093/bioinformatics/btr224 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Gavranović, Haris Chauve, Cedric Salse, Jérôme Tannier, Eric Mapping ancestral genomes with massive gene loss: A matrix sandwich problem |
title | Mapping ancestral genomes with massive gene loss: A matrix sandwich problem |
title_full | Mapping ancestral genomes with massive gene loss: A matrix sandwich problem |
title_fullStr | Mapping ancestral genomes with massive gene loss: A matrix sandwich problem |
title_full_unstemmed | Mapping ancestral genomes with massive gene loss: A matrix sandwich problem |
title_short | Mapping ancestral genomes with massive gene loss: A matrix sandwich problem |
title_sort | mapping ancestral genomes with massive gene loss: a matrix sandwich problem |
topic | Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117370/ https://www.ncbi.nlm.nih.gov/pubmed/21685079 http://dx.doi.org/10.1093/bioinformatics/btr224 |
work_keys_str_mv | AT gavranovicharis mappingancestralgenomeswithmassivegenelossamatrixsandwichproblem AT chauvecedric mappingancestralgenomeswithmassivegenelossamatrixsandwichproblem AT salsejerome mappingancestralgenomeswithmassivegenelossamatrixsandwichproblem AT tanniereric mappingancestralgenomeswithmassivegenelossamatrixsandwichproblem |