Cargando…
Compromise or optimize? The breakpoint anti-median
BACKGROUND: The median of k≥3 genomes was originally defined to find a compromise genome indicative of a common ancestor. However, in gene order comparisons, the usual definitions based on minimizing the sum of distances to the input genomes lead to degenerate medians reflecting only one of the inpu...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5249007/ https://www.ncbi.nlm.nih.gov/pubmed/28105920 http://dx.doi.org/10.1186/s12859-016-1340-y |
_version_ | 1782497370478477312 |
---|---|
author | Larlee, Caroline Anne Brandts, Alex Sankoff, David |
author_facet | Larlee, Caroline Anne Brandts, Alex Sankoff, David |
author_sort | Larlee, Caroline Anne |
collection | PubMed |
description | BACKGROUND: The median of k≥3 genomes was originally defined to find a compromise genome indicative of a common ancestor. However, in gene order comparisons, the usual definitions based on minimizing the sum of distances to the input genomes lead to degenerate medians reflecting only one of the input genomes. “Near-medians”, consisting of equal samples of gene adjacencies from all the input genomes, were designed to restore the idea of compromise to the median problem. RESULT: We explore adjacency sampling constructions in full generality in the case k=3, with given overlapping sets of adjacencies in the three genomes, where all adjacencies in two-way or three-way overlaps are included in the sample. We require the construction to be maximal, in the sense that no additional proportion of adjacencies from any of the genomes may be added without violating the local linearity of the genome. We discover that in incorporating as many adjacencies as possible, evenly from all the input genomes, we are actually maximizing, rather than minimizing, the sum of distances over all other maximal sampling schemes. CONCLUSIONS: We propose to explore compromise instead of parsimony as the organizing principle for the small phylogeny problem. |
format | Online Article Text |
id | pubmed-5249007 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52490072017-01-26 Compromise or optimize? The breakpoint anti-median Larlee, Caroline Anne Brandts, Alex Sankoff, David BMC Bioinformatics Research BACKGROUND: The median of k≥3 genomes was originally defined to find a compromise genome indicative of a common ancestor. However, in gene order comparisons, the usual definitions based on minimizing the sum of distances to the input genomes lead to degenerate medians reflecting only one of the input genomes. “Near-medians”, consisting of equal samples of gene adjacencies from all the input genomes, were designed to restore the idea of compromise to the median problem. RESULT: We explore adjacency sampling constructions in full generality in the case k=3, with given overlapping sets of adjacencies in the three genomes, where all adjacencies in two-way or three-way overlaps are included in the sample. We require the construction to be maximal, in the sense that no additional proportion of adjacencies from any of the genomes may be added without violating the local linearity of the genome. We discover that in incorporating as many adjacencies as possible, evenly from all the input genomes, we are actually maximizing, rather than minimizing, the sum of distances over all other maximal sampling schemes. CONCLUSIONS: We propose to explore compromise instead of parsimony as the organizing principle for the small phylogeny problem. BioMed Central 2016-12-15 /pmc/articles/PMC5249007/ /pubmed/28105920 http://dx.doi.org/10.1186/s12859-016-1340-y Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Larlee, Caroline Anne Brandts, Alex Sankoff, David Compromise or optimize? The breakpoint anti-median |
title | Compromise or optimize? The breakpoint anti-median |
title_full | Compromise or optimize? The breakpoint anti-median |
title_fullStr | Compromise or optimize? The breakpoint anti-median |
title_full_unstemmed | Compromise or optimize? The breakpoint anti-median |
title_short | Compromise or optimize? The breakpoint anti-median |
title_sort | compromise or optimize? the breakpoint anti-median |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5249007/ https://www.ncbi.nlm.nih.gov/pubmed/28105920 http://dx.doi.org/10.1186/s12859-016-1340-y |
work_keys_str_mv | AT larleecarolineanne compromiseoroptimizethebreakpointantimedian AT brandtsalex compromiseoroptimizethebreakpointantimedian AT sankoffdavid compromiseoroptimizethebreakpointantimedian |