Cargando…

Orthology and paralogy constraints: satisfiability and consistency

BACKGROUND: A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family [Formula: see text]. But is a given set [Formula: see text] of orthology/paralogy constrain...

Descripción completa

Detalles Bibliográficos
Autores principales: Lafond, Manuel, El-Mabrouk, Nadia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240720/
https://www.ncbi.nlm.nih.gov/pubmed/25572629
http://dx.doi.org/10.1186/1471-2164-15-S6-S12
_version_ 1782345763830890496
author Lafond, Manuel
El-Mabrouk, Nadia
author_facet Lafond, Manuel
El-Mabrouk, Nadia
author_sort Lafond, Manuel
collection PubMed
description BACKGROUND: A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family [Formula: see text]. But is a given set [Formula: see text] of orthology/paralogy constraints possible, i.e., can they simultaneously co-exist in an evolutionary history for [Formula: see text]? While previous studies have focused on full sets of constraints, here we consider the general case where [Formula: see text] does not necessarily involve a constraint for each pair of genes. The problem is subdivided in two parts: (1) Is [Formula: see text] satisfiable, i.e. can we find an event-labeled gene tree G inducing [Formula: see text]? (2) Is there such a G which is consistent, i.e., such that all displayed triplet phylogenies are included in a species tree? RESULTS: Previous results on the Graph sandwich problem can be used to answer to (1), and we provide polynomial-time algorithms for satisfiability and consistency with a given species tree. We also describe a new polynomial-time algorithm for the case of consistency with an unknown species tree and full knowledge of pairwise orthology/paralogy relationships, as well as a branch-and-bound algorithm in the case when unknown relations are present. We show that our algorithms can be used in combination with ProteinOrtho, a sequence similarity-based orthology detection tool, to extract a set of robust orthology/paralogy relationships.
format Online
Article
Text
id pubmed-4240720
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42407202014-11-25 Orthology and paralogy constraints: satisfiability and consistency Lafond, Manuel El-Mabrouk, Nadia BMC Genomics Research BACKGROUND: A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer orthology and paralogy relations between genes of a given gene family [Formula: see text]. But is a given set [Formula: see text] of orthology/paralogy constraints possible, i.e., can they simultaneously co-exist in an evolutionary history for [Formula: see text]? While previous studies have focused on full sets of constraints, here we consider the general case where [Formula: see text] does not necessarily involve a constraint for each pair of genes. The problem is subdivided in two parts: (1) Is [Formula: see text] satisfiable, i.e. can we find an event-labeled gene tree G inducing [Formula: see text]? (2) Is there such a G which is consistent, i.e., such that all displayed triplet phylogenies are included in a species tree? RESULTS: Previous results on the Graph sandwich problem can be used to answer to (1), and we provide polynomial-time algorithms for satisfiability and consistency with a given species tree. We also describe a new polynomial-time algorithm for the case of consistency with an unknown species tree and full knowledge of pairwise orthology/paralogy relationships, as well as a branch-and-bound algorithm in the case when unknown relations are present. We show that our algorithms can be used in combination with ProteinOrtho, a sequence similarity-based orthology detection tool, to extract a set of robust orthology/paralogy relationships. BioMed Central 2014-10-17 /pmc/articles/PMC4240720/ /pubmed/25572629 http://dx.doi.org/10.1186/1471-2164-15-S6-S12 Text en Copyright © 2014 Lafond and El-Mabrouk; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lafond, Manuel
El-Mabrouk, Nadia
Orthology and paralogy constraints: satisfiability and consistency
title Orthology and paralogy constraints: satisfiability and consistency
title_full Orthology and paralogy constraints: satisfiability and consistency
title_fullStr Orthology and paralogy constraints: satisfiability and consistency
title_full_unstemmed Orthology and paralogy constraints: satisfiability and consistency
title_short Orthology and paralogy constraints: satisfiability and consistency
title_sort orthology and paralogy constraints: satisfiability and consistency
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4240720/
https://www.ncbi.nlm.nih.gov/pubmed/25572629
http://dx.doi.org/10.1186/1471-2164-15-S6-S12
work_keys_str_mv AT lafondmanuel orthologyandparalogyconstraintssatisfiabilityandconsistency
AT elmabrouknadia orthologyandparalogyconstraintssatisfiabilityandconsistency