Cargando…

Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss

Motivation: Gene family evolution is driven by evolutionary events such as speciation, gene duplication, horizontal gene transfer and gene loss, and inferring these events in the evolutionary history of a given gene family is a fundamental problem in comparative and evolutionary genomics with numero...

Descripción completa

Detalles Bibliográficos
Autores principales: Bansal, Mukul S., Alm, Eric J., Kellis, Manolis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371857/
https://www.ncbi.nlm.nih.gov/pubmed/22689773
http://dx.doi.org/10.1093/bioinformatics/bts225
_version_ 1782235271838826496
author Bansal, Mukul S.
Alm, Eric J.
Kellis, Manolis
author_facet Bansal, Mukul S.
Alm, Eric J.
Kellis, Manolis
author_sort Bansal, Mukul S.
collection PubMed
description Motivation: Gene family evolution is driven by evolutionary events such as speciation, gene duplication, horizontal gene transfer and gene loss, and inferring these events in the evolutionary history of a given gene family is a fundamental problem in comparative and evolutionary genomics with numerous important applications. Solving this problem requires the use of a reconciliation framework, where the input consists of a gene family phylogeny and the corresponding species phylogeny, and the goal is to reconcile the two by postulating speciation, gene duplication, horizontal gene transfer and gene loss events. This reconciliation problem is referred to as duplication-transfer-loss (DTL) reconciliation and has been extensively studied in the literature. Yet, even the fastest existing algorithms for DTL reconciliation are too slow for reconciling large gene families and for use in more sophisticated applications such as gene tree or species tree reconstruction. Results: We present two new algorithms for the DTL reconciliation problem that are dramatically faster than existing algorithms, both asymptotically and in practice. We also extend the standard DTL reconciliation model by considering distance-dependent transfer costs, which allow for more accurate reconciliation and give an efficient algorithm for DTL reconciliation under this extended model. We implemented our new algorithms and demonstrated up to 100 000-fold speed-up over existing methods, using both simulated and biological datasets. This dramatic improvement makes it possible to use DTL reconciliation for performing rigorous evolutionary analyses of large gene families and enables its use in advanced reconciliation-based gene and species tree reconstruction methods. Availability: Our programs can be freely downloaded from http://compbio.mit.edu/ranger-dtl/. Contact: mukul@csail.mit.edu; manoli@mit.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3371857
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33718572012-06-11 Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss Bansal, Mukul S. Alm, Eric J. Kellis, Manolis Bioinformatics Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa Motivation: Gene family evolution is driven by evolutionary events such as speciation, gene duplication, horizontal gene transfer and gene loss, and inferring these events in the evolutionary history of a given gene family is a fundamental problem in comparative and evolutionary genomics with numerous important applications. Solving this problem requires the use of a reconciliation framework, where the input consists of a gene family phylogeny and the corresponding species phylogeny, and the goal is to reconcile the two by postulating speciation, gene duplication, horizontal gene transfer and gene loss events. This reconciliation problem is referred to as duplication-transfer-loss (DTL) reconciliation and has been extensively studied in the literature. Yet, even the fastest existing algorithms for DTL reconciliation are too slow for reconciling large gene families and for use in more sophisticated applications such as gene tree or species tree reconstruction. Results: We present two new algorithms for the DTL reconciliation problem that are dramatically faster than existing algorithms, both asymptotically and in practice. We also extend the standard DTL reconciliation model by considering distance-dependent transfer costs, which allow for more accurate reconciliation and give an efficient algorithm for DTL reconciliation under this extended model. We implemented our new algorithms and demonstrated up to 100 000-fold speed-up over existing methods, using both simulated and biological datasets. This dramatic improvement makes it possible to use DTL reconciliation for performing rigorous evolutionary analyses of large gene families and enables its use in advanced reconciliation-based gene and species tree reconstruction methods. Availability: Our programs can be freely downloaded from http://compbio.mit.edu/ranger-dtl/. Contact: mukul@csail.mit.edu; manoli@mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-06-15 2012-06-09 /pmc/articles/PMC3371857/ /pubmed/22689773 http://dx.doi.org/10.1093/bioinformatics/bts225 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
Bansal, Mukul S.
Alm, Eric J.
Kellis, Manolis
Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
title Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
title_full Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
title_fullStr Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
title_full_unstemmed Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
title_short Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
title_sort efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
topic Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371857/
https://www.ncbi.nlm.nih.gov/pubmed/22689773
http://dx.doi.org/10.1093/bioinformatics/bts225
work_keys_str_mv AT bansalmukuls efficientalgorithmsforthereconciliationproblemwithgeneduplicationhorizontaltransferandloss
AT almericj efficientalgorithmsforthereconciliationproblemwithgeneduplicationhorizontaltransferandloss
AT kellismanolis efficientalgorithmsforthereconciliationproblemwithgeneduplicationhorizontaltransferandloss