Cargando…

A graph-theoretic approach for inparalog detection

Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this history; relationships between genes such...

Descripción completa

Detalles Bibliográficos
Autores principales: Tremblay-Savard, Olivier, Swenson, Krister M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526444/
https://www.ncbi.nlm.nih.gov/pubmed/23281701
http://dx.doi.org/10.1186/1471-2105-13-S19-S16
_version_ 1782253561805012992
author Tremblay-Savard, Olivier
Swenson, Krister M
author_facet Tremblay-Savard, Olivier
Swenson, Krister M
author_sort Tremblay-Savard, Olivier
collection PubMed
description Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this history; relationships between genes such as orthology (genes related through a speciation event) or paralogy (genes related through a duplication event) are usually correlated with these features. For example, recent work has shown that in human and mouse there is a strong connection between function and inparalogs, the paralogs that were created since the speciation event separating the human and mouse lineages. Methods exist for detecting inparalogs that either use information from only two species, or consider a set of species but rely on clustering methods. In this paper we present a graph-theoretic approach for finding lower bounds on the number of inparalogs for a given set of species; we pose an edge covering problem on the similarity graph and give an efficient 2/3-approximation as well as a faster heuristic. Since the physical position of inparalogs corresponding to recent speciations is not likely to have changed since the duplication, we also use our predictions to estimate the types of duplications that have occurred in some vertebrates and drosophila.
format Online
Article
Text
id pubmed-3526444
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35264442013-01-10 A graph-theoretic approach for inparalog detection Tremblay-Savard, Olivier Swenson, Krister M BMC Bioinformatics Proceedings Understanding the history of a gene family that evolves through duplication, speciation, and loss is a fundamental problem in comparative genomics. Features such as function, position, and structural similarity between genes are intimately connected to this history; relationships between genes such as orthology (genes related through a speciation event) or paralogy (genes related through a duplication event) are usually correlated with these features. For example, recent work has shown that in human and mouse there is a strong connection between function and inparalogs, the paralogs that were created since the speciation event separating the human and mouse lineages. Methods exist for detecting inparalogs that either use information from only two species, or consider a set of species but rely on clustering methods. In this paper we present a graph-theoretic approach for finding lower bounds on the number of inparalogs for a given set of species; we pose an edge covering problem on the similarity graph and give an efficient 2/3-approximation as well as a faster heuristic. Since the physical position of inparalogs corresponding to recent speciations is not likely to have changed since the duplication, we also use our predictions to estimate the types of duplications that have occurred in some vertebrates and drosophila. BioMed Central 2012-12-19 /pmc/articles/PMC3526444/ /pubmed/23281701 http://dx.doi.org/10.1186/1471-2105-13-S19-S16 Text en Copyright ©2012 Tremblay-Savard and Swenson; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Tremblay-Savard, Olivier
Swenson, Krister M
A graph-theoretic approach for inparalog detection
title A graph-theoretic approach for inparalog detection
title_full A graph-theoretic approach for inparalog detection
title_fullStr A graph-theoretic approach for inparalog detection
title_full_unstemmed A graph-theoretic approach for inparalog detection
title_short A graph-theoretic approach for inparalog detection
title_sort graph-theoretic approach for inparalog detection
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3526444/
https://www.ncbi.nlm.nih.gov/pubmed/23281701
http://dx.doi.org/10.1186/1471-2105-13-S19-S16
work_keys_str_mv AT tremblaysavardolivier agraphtheoreticapproachforinparalogdetection
AT swensonkristerm agraphtheoreticapproachforinparalogdetection
AT tremblaysavardolivier graphtheoreticapproachforinparalogdetection
AT swensonkristerm graphtheoreticapproachforinparalogdetection