Cargando…
Orthology and paralogy constraints: satisfiability and consistency
BACKGROUND: A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family [Formula: see text]. But is a given set [Formula: see text] of orthology/paralogy constrain...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240720/ https://www.ncbi.nlm.nih.gov/pubmed/25572629 http://dx.doi.org/10.1186/1471-2164-15-S6-S12 |
_version_ | 1782345763830890496 |
---|---|
author | Lafond, Manuel El-Mabrouk, Nadia |
author_facet | Lafond, Manuel El-Mabrouk, Nadia |
author_sort | Lafond, Manuel |
collection | PubMed |
description | BACKGROUND: A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family [Formula: see text]. But is a given set [Formula: see text] of orthology/paralogy constraints possible, i.e., can they simultaneously co-exist in an evolutionary history for [Formula: see text]? While previous studies have focused on full sets of constraints, here we consider the general case where [Formula: see text] does not necessarily involve a constraint for each pair of genes. The problem is subdivided in two parts: (1) Is [Formula: see text] satisfiable, i.e. can we find an event-labeled gene tree G inducing [Formula: see text]? (2) Is there such a G which is consistent, i.e., such that all displayed triplet phylogenies are included in a species tree? RESULTS: Previous results on the Graph sandwich problem can be used to answer to (1), and we provide polynomial-time algorithms for satisfiability and consistency with a given species tree. We also describe a new polynomial-time algorithm for the case of consistency with an unknown species tree and full knowledge of pairwise orthology/paralogy relationships, as well as a branch-and-bound algorithm in the case when unknown relations are present. We show that our algorithms can be used in combination with ProteinOrtho, a sequence similarity-based orthology detection tool, to extract a set of robust orthology/paralogy relationships. |
format | Online Article Text |
id | pubmed-4240720 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42407202014-11-25 Orthology and paralogy constraints: satisfiability and consistency Lafond, Manuel El-Mabrouk, Nadia BMC Genomics Research BACKGROUND: A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family [Formula: see text]. But is a given set [Formula: see text] of orthology/paralogy constraints possible, i.e., can they simultaneously co-exist in an evolutionary history for [Formula: see text]? While previous studies have focused on full sets of constraints, here we consider the general case where [Formula: see text] does not necessarily involve a constraint for each pair of genes. The problem is subdivided in two parts: (1) Is [Formula: see text] satisfiable, i.e. can we find an event-labeled gene tree G inducing [Formula: see text]? (2) Is there such a G which is consistent, i.e., such that all displayed triplet phylogenies are included in a species tree? RESULTS: Previous results on the Graph sandwich problem can be used to answer to (1), and we provide polynomial-time algorithms for satisfiability and consistency with a given species tree. We also describe a new polynomial-time algorithm for the case of consistency with an unknown species tree and full knowledge of pairwise orthology/paralogy relationships, as well as a branch-and-bound algorithm in the case when unknown relations are present. We show that our algorithms can be used in combination with ProteinOrtho, a sequence similarity-based orthology detection tool, to extract a set of robust orthology/paralogy relationships. BioMed Central 2014-10-17 /pmc/articles/PMC4240720/ /pubmed/25572629 http://dx.doi.org/10.1186/1471-2164-15-S6-S12 Text en Copyright © 2014 Lafond and El-Mabrouk; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lafond, Manuel El-Mabrouk, Nadia Orthology and paralogy constraints: satisfiability and consistency |
title | Orthology and paralogy constraints: satisfiability and consistency |
title_full | Orthology and paralogy constraints: satisfiability and consistency |
title_fullStr | Orthology and paralogy constraints: satisfiability and consistency |
title_full_unstemmed | Orthology and paralogy constraints: satisfiability and consistency |
title_short | Orthology and paralogy constraints: satisfiability and consistency |
title_sort | orthology and paralogy constraints: satisfiability and consistency |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240720/ https://www.ncbi.nlm.nih.gov/pubmed/25572629 http://dx.doi.org/10.1186/1471-2164-15-S6-S12 |
work_keys_str_mv | AT lafondmanuel orthologyandparalogyconstraintssatisfiabilityandconsistency AT elmabrouknadia orthologyandparalogyconstraintssatisfiabilityandconsistency |