Cargando…

Mapping proteins in the presence of paralogs using units of coevolution

BACKGROUND: We study the problem of mapping proteins between two protein families in the presence of paralogs. This problem occurs as a difficult subproblem in coevolution-based computational approaches for protein-protein interaction prediction. RESULTS: Similar to prior approaches, our method is b...

Descripción completa

Detalles Bibliográficos
Autores principales: El-Kebir, Mohammed, Marschall, Tobias, Wohlers, Inken, Patterson, Murray, Heringa, Jaap, Schönhuth, Alexander, Klau, Gunnar W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852051/
https://www.ncbi.nlm.nih.gov/pubmed/24564758
http://dx.doi.org/10.1186/1471-2105-14-S15-S18
_version_ 1782294409132376064
author El-Kebir, Mohammed
Marschall, Tobias
Wohlers, Inken
Patterson, Murray
Heringa, Jaap
Schönhuth, Alexander
Klau, Gunnar W
author_facet El-Kebir, Mohammed
Marschall, Tobias
Wohlers, Inken
Patterson, Murray
Heringa, Jaap
Schönhuth, Alexander
Klau, Gunnar W
author_sort El-Kebir, Mohammed
collection PubMed
description BACKGROUND: We study the problem of mapping proteins between two protein families in the presence of paralogs. This problem occurs as a difficult subproblem in coevolution-based computational approaches for protein-protein interaction prediction. RESULTS: Similar to prior approaches, our method is based on the idea that coevolution implies equal rates of sequence evolution among the interacting proteins, and we provide a first attempt to quantify this notion in a formal statistical manner. We call the units that are central to this quantification scheme the units of coevolution. A unit consists of two mapped protein pairs and its score quantifies the coevolution of the pairs. This quantification allows us to provide a maximum likelihood formulation of the paralog mapping problem and to cast it into a binary quadratic programming formulation. CONCLUSION: CUPID, our software tool based on a Lagrangian relaxation of this formulation, makes it, for the first time, possible to compute state-of-the-art quality pairings in a few minutes of runtime. In summary, we suggest a novel alternative to the earlier available approaches, which is statistically sound and computationally feasible.
format Online
Article
Text
id pubmed-3852051
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38520512013-12-20 Mapping proteins in the presence of paralogs using units of coevolution El-Kebir, Mohammed Marschall, Tobias Wohlers, Inken Patterson, Murray Heringa, Jaap Schönhuth, Alexander Klau, Gunnar W BMC Bioinformatics Proceedings BACKGROUND: We study the problem of mapping proteins between two protein families in the presence of paralogs. This problem occurs as a difficult subproblem in coevolution-based computational approaches for protein-protein interaction prediction. RESULTS: Similar to prior approaches, our method is based on the idea that coevolution implies equal rates of sequence evolution among the interacting proteins, and we provide a first attempt to quantify this notion in a formal statistical manner. We call the units that are central to this quantification scheme the units of coevolution. A unit consists of two mapped protein pairs and its score quantifies the coevolution of the pairs. This quantification allows us to provide a maximum likelihood formulation of the paralog mapping problem and to cast it into a binary quadratic programming formulation. CONCLUSION: CUPID, our software tool based on a Lagrangian relaxation of this formulation, makes it, for the first time, possible to compute state-of-the-art quality pairings in a few minutes of runtime. In summary, we suggest a novel alternative to the earlier available approaches, which is statistically sound and computationally feasible. BioMed Central 2013-10-15 /pmc/articles/PMC3852051/ /pubmed/24564758 http://dx.doi.org/10.1186/1471-2105-14-S15-S18 Text en Copyright © 2013 El-Kebir et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
El-Kebir, Mohammed
Marschall, Tobias
Wohlers, Inken
Patterson, Murray
Heringa, Jaap
Schönhuth, Alexander
Klau, Gunnar W
Mapping proteins in the presence of paralogs using units of coevolution
title Mapping proteins in the presence of paralogs using units of coevolution
title_full Mapping proteins in the presence of paralogs using units of coevolution
title_fullStr Mapping proteins in the presence of paralogs using units of coevolution
title_full_unstemmed Mapping proteins in the presence of paralogs using units of coevolution
title_short Mapping proteins in the presence of paralogs using units of coevolution
title_sort mapping proteins in the presence of paralogs using units of coevolution
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852051/
https://www.ncbi.nlm.nih.gov/pubmed/24564758
http://dx.doi.org/10.1186/1471-2105-14-S15-S18
work_keys_str_mv AT elkebirmohammed mappingproteinsinthepresenceofparalogsusingunitsofcoevolution
AT marschalltobias mappingproteinsinthepresenceofparalogsusingunitsofcoevolution
AT wohlersinken mappingproteinsinthepresenceofparalogsusingunitsofcoevolution
AT pattersonmurray mappingproteinsinthepresenceofparalogsusingunitsofcoevolution
AT heringajaap mappingproteinsinthepresenceofparalogsusingunitsofcoevolution
AT schonhuthalexander mappingproteinsinthepresenceofparalogsusingunitsofcoevolution
AT klaugunnarw mappingproteinsinthepresenceofparalogsusingunitsofcoevolution