Cargando…
Mapping proteins in the presence of paralogs using units of coevolution
BACKGROUND: We study the problem of mapping proteins between two protein families in the presence of paralogs. This problem occurs as a difficult subproblem in coevolution-based computational approaches for protein-protein interaction prediction. RESULTS: Similar to prior approaches, our method is b...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852051/ https://www.ncbi.nlm.nih.gov/pubmed/24564758 http://dx.doi.org/10.1186/1471-2105-14-S15-S18 |
_version_ | 1782294409132376064 |
---|---|
author | El-Kebir, Mohammed Marschall, Tobias Wohlers, Inken Patterson, Murray Heringa, Jaap Schönhuth, Alexander Klau, Gunnar W |
author_facet | El-Kebir, Mohammed Marschall, Tobias Wohlers, Inken Patterson, Murray Heringa, Jaap Schönhuth, Alexander Klau, Gunnar W |
author_sort | El-Kebir, Mohammed |
collection | PubMed |
description | BACKGROUND: We study the problem of mapping proteins between two protein families in the presence of paralogs. This problem occurs as a difficult subproblem in coevolution-based computational approaches for protein-protein interaction prediction. RESULTS: Similar to prior approaches, our method is based on the idea that coevolution implies equal rates of sequence evolution among the interacting proteins, and we provide a first attempt to quantify this notion in a formal statistical manner. We call the units that are central to this quantification scheme the units of coevolution. A unit consists of two mapped protein pairs and its score quantifies the coevolution of the pairs. This quantification allows us to provide a maximum likelihood formulation of the paralog mapping problem and to cast it into a binary quadratic programming formulation. CONCLUSION: CUPID, our software tool based on a Lagrangian relaxation of this formulation, makes it, for the first time, possible to compute state-of-the-art quality pairings in a few minutes of runtime. In summary, we suggest a novel alternative to the earlier available approaches, which is statistically sound and computationally feasible. |
format | Online Article Text |
id | pubmed-3852051 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38520512013-12-20 Mapping proteins in the presence of paralogs using units of coevolution El-Kebir, Mohammed Marschall, Tobias Wohlers, Inken Patterson, Murray Heringa, Jaap Schönhuth, Alexander Klau, Gunnar W BMC Bioinformatics Proceedings BACKGROUND: We study the problem of mapping proteins between two protein families in the presence of paralogs. This problem occurs as a difficult subproblem in coevolution-based computational approaches for protein-protein interaction prediction. RESULTS: Similar to prior approaches, our method is based on the idea that coevolution implies equal rates of sequence evolution among the interacting proteins, and we provide a first attempt to quantify this notion in a formal statistical manner. We call the units that are central to this quantification scheme the units of coevolution. A unit consists of two mapped protein pairs and its score quantifies the coevolution of the pairs. This quantification allows us to provide a maximum likelihood formulation of the paralog mapping problem and to cast it into a binary quadratic programming formulation. CONCLUSION: CUPID, our software tool based on a Lagrangian relaxation of this formulation, makes it, for the first time, possible to compute state-of-the-art quality pairings in a few minutes of runtime. In summary, we suggest a novel alternative to the earlier available approaches, which is statistically sound and computationally feasible. BioMed Central 2013-10-15 /pmc/articles/PMC3852051/ /pubmed/24564758 http://dx.doi.org/10.1186/1471-2105-14-S15-S18 Text en Copyright © 2013 El-Kebir et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings El-Kebir, Mohammed Marschall, Tobias Wohlers, Inken Patterson, Murray Heringa, Jaap Schönhuth, Alexander Klau, Gunnar W Mapping proteins in the presence of paralogs using units of coevolution |
title | Mapping proteins in the presence of paralogs using units of coevolution |
title_full | Mapping proteins in the presence of paralogs using units of coevolution |
title_fullStr | Mapping proteins in the presence of paralogs using units of coevolution |
title_full_unstemmed | Mapping proteins in the presence of paralogs using units of coevolution |
title_short | Mapping proteins in the presence of paralogs using units of coevolution |
title_sort | mapping proteins in the presence of paralogs using units of coevolution |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852051/ https://www.ncbi.nlm.nih.gov/pubmed/24564758 http://dx.doi.org/10.1186/1471-2105-14-S15-S18 |
work_keys_str_mv | AT elkebirmohammed mappingproteinsinthepresenceofparalogsusingunitsofcoevolution AT marschalltobias mappingproteinsinthepresenceofparalogsusingunitsofcoevolution AT wohlersinken mappingproteinsinthepresenceofparalogsusingunitsofcoevolution AT pattersonmurray mappingproteinsinthepresenceofparalogsusingunitsofcoevolution AT heringajaap mappingproteinsinthepresenceofparalogsusingunitsofcoevolution AT schonhuthalexander mappingproteinsinthepresenceofparalogsusingunitsofcoevolution AT klaugunnarw mappingproteinsinthepresenceofparalogsusingunitsofcoevolution |