Cargando…
CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion
BACKGROUND: In order to understand how biological systems function it is necessary to determine the interactions and associations between proteins. Gene fusion prediction is one approach to detection of such functional relationships. Its use is however known to be problematic in higher eukaryotic ge...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879367/ https://www.ncbi.nlm.nih.gov/pubmed/20532224 http://dx.doi.org/10.1371/journal.pone.0010908 |
_version_ | 1782181920049725440 |
---|---|
author | Reid, Adam J. Ranea, Juan A. G. Clegg, Andrew B. Orengo, Christine A. |
author_facet | Reid, Adam J. Ranea, Juan A. G. Clegg, Andrew B. Orengo, Christine A. |
author_sort | Reid, Adam J. |
collection | PubMed |
description | BACKGROUND: In order to understand how biological systems function it is necessary to determine the interactions and associations between proteins. Gene fusion prediction is one approach to detection of such functional relationships. Its use is however known to be problematic in higher eukaryotic genomes due to the presence of large homologous domain families. Here we introduce CODA (Co-Occurrence of Domains Analysis), a method to predict functional associations based on the gene fusion idiom. METHODOLOGY/PRINCIPAL FINDINGS: We apply a novel scoring scheme which takes account of the genome-specific size of homologous domain families involved in fusion to improve accuracy in predicting functional associations. We show that CODA is able to accurately predict functional similarities in human with comparison to state-of-the-art methods and show that different methods can be complementary. CODA is used to produce evidence that a currently uncharacterised human protein may be involved in pathways related to depression and that another is involved in DNA replication. CONCLUSIONS/SIGNIFICANCE: The relative performance of different gene fusion methodologies has not previously been explored. We find that they are largely complementary, with different methods being more or less appropriate in different genomes. Our method is the only one currently available for download and can be run on an arbitrary dataset by the user. The CODA software and datasets are freely available from ftp://ftp.biochem.ucl.ac.uk/pub/gene3d_data/v6.1.0/CODA/. Predictions are also available via web services from http://funcnet.eu/. |
format | Text |
id | pubmed-2879367 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-28793672010-06-07 CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion Reid, Adam J. Ranea, Juan A. G. Clegg, Andrew B. Orengo, Christine A. PLoS One Research Article BACKGROUND: In order to understand how biological systems function it is necessary to determine the interactions and associations between proteins. Gene fusion prediction is one approach to detection of such functional relationships. Its use is however known to be problematic in higher eukaryotic genomes due to the presence of large homologous domain families. Here we introduce CODA (Co-Occurrence of Domains Analysis), a method to predict functional associations based on the gene fusion idiom. METHODOLOGY/PRINCIPAL FINDINGS: We apply a novel scoring scheme which takes account of the genome-specific size of homologous domain families involved in fusion to improve accuracy in predicting functional associations. We show that CODA is able to accurately predict functional similarities in human with comparison to state-of-the-art methods and show that different methods can be complementary. CODA is used to produce evidence that a currently uncharacterised human protein may be involved in pathways related to depression and that another is involved in DNA replication. CONCLUSIONS/SIGNIFICANCE: The relative performance of different gene fusion methodologies has not previously been explored. We find that they are largely complementary, with different methods being more or less appropriate in different genomes. Our method is the only one currently available for download and can be run on an arbitrary dataset by the user. The CODA software and datasets are freely available from ftp://ftp.biochem.ucl.ac.uk/pub/gene3d_data/v6.1.0/CODA/. Predictions are also available via web services from http://funcnet.eu/. Public Library of Science 2010-06-01 /pmc/articles/PMC2879367/ /pubmed/20532224 http://dx.doi.org/10.1371/journal.pone.0010908 Text en Reid et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Reid, Adam J. Ranea, Juan A. G. Clegg, Andrew B. Orengo, Christine A. CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion |
title | CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion |
title_full | CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion |
title_fullStr | CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion |
title_full_unstemmed | CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion |
title_short | CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion |
title_sort | coda: accurate detection of functional associations between proteins in eukaryotic genomes using domain fusion |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879367/ https://www.ncbi.nlm.nih.gov/pubmed/20532224 http://dx.doi.org/10.1371/journal.pone.0010908 |
work_keys_str_mv | AT reidadamj codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion AT raneajuanag codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion AT cleggandrewb codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion AT orengochristinea codaaccuratedetectionoffunctionalassociationsbetweenproteinsineukaryoticgenomesusingdomainfusion |