Cargando…

On the consistency of orthology relationships

BACKGROUND: Orthologs inference is the starting point of most comparative genomics studies, and a plethora of methods have been designed in the last decade to address this challenging task. In this paper we focus on the problems of deciding consistency with a species tree (known or not) of a partial...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Mark, Paul, Christophe, Scornavacca, Céline
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5123252/
https://www.ncbi.nlm.nih.gov/pubmed/28185580
http://dx.doi.org/10.1186/s12859-016-1267-3
Descripción
Sumario:BACKGROUND: Orthologs inference is the starting point of most comparative genomics studies, and a plethora of methods have been designed in the last decade to address this challenging task. In this paper we focus on the problems of deciding consistency with a species tree (known or not) of a partial set of orthology/paralogy relationships [Formula: see text] on a collection of n genes. RESULTS: We give the first polynomial algorithm – more precisely a O(n (3)) time algorithm – to decide whether [Formula: see text] is consistent, even when the species tree is unknown. We also investigate a biologically meaningful optimization version of these problems, in which we wish to minimize the number of duplication events; unfortunately, we show that all these optimization problems are NP-hard and are unlikely to have good polynomial time approximation algorithms. CONCLUSIONS: Our polynomial algorithm for checking consistency has been implemented in Python and is available at https://github.com/UdeM-LBIT/OrthoPara-ConstraintChecker.