Cargando…

CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion

BACKGROUND: In order to understand how biological systems function it is necessary to determine the interactions and associations between proteins. Gene fusion prediction is one approach to detection of such functional relationships. Its use is however known to be problematic in higher eukaryotic ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Reid, Adam J., Ranea, Juan A. G., Clegg, Andrew B., Orengo, Christine A.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879367/
https://www.ncbi.nlm.nih.gov/pubmed/20532224
http://dx.doi.org/10.1371/journal.pone.0010908
_version_ 1782181920049725440
author Reid, Adam J.
Ranea, Juan A. G.
Clegg, Andrew B.
Orengo, Christine A.
author_facet Reid, Adam J.
Ranea, Juan A. G.
Clegg, Andrew B.
Orengo, Christine A.
author_sort Reid, Adam J.
collection PubMed
description BACKGROUND: In order to understand how biological systems function it is necessary to determine the interactions and associations between proteins. Gene fusion prediction is one approach to detection of such functional relationships. Its use is however known to be problematic in higher eukaryotic genomes due to the presence of large homologous domain families. Here we introduce CODA (Co-Occurrence of Domains Analysis), a method to predict functional associations based on the gene fusion idiom. METHODOLOGY/PRINCIPAL FINDINGS: We apply a novel scoring scheme which takes account of the genome-specific size of homologous domain families involved in fusion to improve accuracy in predicting functional associations. We show that CODA is able to accurately predict functional similarities in human with comparison to state-of-the-art methods and show that different methods can be complementary. CODA is used to produce evidence that a currently uncharacterised human protein may be involved in pathways related to depression and that another is involved in DNA replication. CONCLUSIONS/SIGNIFICANCE: The relative performance of different gene fusion methodologies has not previously been explored. We find that they are largely complementary, with different methods being more or less appropriate in different genomes. Our method is the only one currently available for download and can be run on an arbitrary dataset by the user. The CODA software and datasets are freely available from ftp://ftp.biochem.ucl.ac.uk/pub/gene3d_data/v6.1.0/CODA/. Predictions are also available via web services from http://funcnet.eu/.
format Text
id pubmed-2879367
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28793672010-06-07 CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion Reid, Adam J. Ranea, Juan A. G. Clegg, Andrew B. Orengo, Christine A. PLoS One Research Article BACKGROUND: In order to understand how biological systems function it is necessary to determine the interactions and associations between proteins. Gene fusion prediction is one approach to detection of such functional relationships. Its use is however known to be problematic in higher eukaryotic genomes due to the presence of large homologous domain families. Here we introduce CODA (Co-Occurrence of Domains Analysis), a method to predict functional associations based on the gene fusion idiom. METHODOLOGY/PRINCIPAL FINDINGS: We apply a novel scoring scheme which takes account of the genome-specific size of homologous domain families involved in fusion to improve accuracy in predicting functional associations. We show that CODA is able to accurately predict functional similarities in human with comparison to state-of-the-art methods and show that different methods can be complementary. CODA is used to produce evidence that a currently uncharacterised human protein may be involved in pathways related to depression and that another is involved in DNA replication. CONCLUSIONS/SIGNIFICANCE: The relative performance of different gene fusion methodologies has not previously been explored. We find that they are largely complementary, with different methods being more or less appropriate in different genomes. Our method is the only one currently available for download and can be run on an arbitrary dataset by the user. The CODA software and datasets are freely available from ftp://ftp.biochem.ucl.ac.uk/pub/gene3d_data/v6.1.0/CODA/. Predictions are also available via web services from http://funcnet.eu/. Public Library of Science 2010-06-01 /pmc/articles/PMC2879367/ /pubmed/20532224 http://dx.doi.org/10.1371/journal.pone.0010908 Text en Reid et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Reid, Adam J.
Ranea, Juan A. G.
Clegg, Andrew B.
Orengo, Christine A.
CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion
title CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion
title_full CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion
title_fullStr CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion
title_full_unstemmed CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion
title_short CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion
title_sort coda: accurate detection of functional associations between proteins in eukaryotic genomes using domain fusion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879367/
https://www.ncbi.nlm.nih.gov/pubmed/20532224
http://dx.doi.org/10.1371/journal.pone.0010908
work_keys_str_mv AT reidadamj codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion
AT raneajuanag codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion
AT cleggandrewb codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion
AT orengochristinea codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion