Cargando…

Conserved co-expression for candidate disease gene prioritization

BACKGROUND: Genes that are co-expressed tend to be involved in the same biological process. However, co-expression is not a very reliable predictor of functional links between genes. The evolutionary conservation of co-expression between species can be used to predict protein function more reliably...

Descripción completa

Detalles Bibliográficos
Autores principales: Oti, Martin, van Reeuwijk, Jeroen, Huynen, Martijn A, Brunner, Han G
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2383918/
https://www.ncbi.nlm.nih.gov/pubmed/18433471
http://dx.doi.org/10.1186/1471-2105-9-208
_version_ 1782154834170871808
author Oti, Martin
van Reeuwijk, Jeroen
Huynen, Martijn A
Brunner, Han G
author_facet Oti, Martin
van Reeuwijk, Jeroen
Huynen, Martijn A
Brunner, Han G
author_sort Oti, Martin
collection PubMed
description BACKGROUND: Genes that are co-expressed tend to be involved in the same biological process. However, co-expression is not a very reliable predictor of functional links between genes. The evolutionary conservation of co-expression between species can be used to predict protein function more reliably than co-expression in a single species. Here we examine whether co-expression across multiple species is also a better prioritizer of disease genes than is co-expression between human genes alone. RESULTS: We use co-expression data from yeast (S. cerevisiae), nematode worm (C. elegans), fruit fly (D. melanogaster), mouse and human and find that the use of evolutionary conservation can indeed improve the predictive value of co-expression. The effect that genes causing the same disease have higher co-expression than do other genes from their associated disease loci, is significantly enhanced when co-expression data are combined across evolutionarily distant species. We also find that performance can vary significantly depending on the co-expression datasets used, and just using more data does not necessarily lead to better prioritization. Instead, we find that dataset quality is more important than quantity, and using a consistent microarray platform per species leads to better performance than using more inclusive datasets pooled from various platforms. CONCLUSION: We find that evolutionarily conserved gene co-expression prioritizes disease candidate genes better than human gene co-expression alone, and provide the integrated data as a new resource for disease gene prioritization tools.
format Text
id pubmed-2383918
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23839182008-05-14 Conserved co-expression for candidate disease gene prioritization Oti, Martin van Reeuwijk, Jeroen Huynen, Martijn A Brunner, Han G BMC Bioinformatics Methodology Article BACKGROUND: Genes that are co-expressed tend to be involved in the same biological process. However, co-expression is not a very reliable predictor of functional links between genes. The evolutionary conservation of co-expression between species can be used to predict protein function more reliably than co-expression in a single species. Here we examine whether co-expression across multiple species is also a better prioritizer of disease genes than is co-expression between human genes alone. RESULTS: We use co-expression data from yeast (S. cerevisiae), nematode worm (C. elegans), fruit fly (D. melanogaster), mouse and human and find that the use of evolutionary conservation can indeed improve the predictive value of co-expression. The effect that genes causing the same disease have higher co-expression than do other genes from their associated disease loci, is significantly enhanced when co-expression data are combined across evolutionarily distant species. We also find that performance can vary significantly depending on the co-expression datasets used, and just using more data does not necessarily lead to better prioritization. Instead, we find that dataset quality is more important than quantity, and using a consistent microarray platform per species leads to better performance than using more inclusive datasets pooled from various platforms. CONCLUSION: We find that evolutionarily conserved gene co-expression prioritizes disease candidate genes better than human gene co-expression alone, and provide the integrated data as a new resource for disease gene prioritization tools. BioMed Central 2008-04-23 /pmc/articles/PMC2383918/ /pubmed/18433471 http://dx.doi.org/10.1186/1471-2105-9-208 Text en Copyright © 2008 Oti et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Oti, Martin
van Reeuwijk, Jeroen
Huynen, Martijn A
Brunner, Han G
Conserved co-expression for candidate disease gene prioritization
title Conserved co-expression for candidate disease gene prioritization
title_full Conserved co-expression for candidate disease gene prioritization
title_fullStr Conserved co-expression for candidate disease gene prioritization
title_full_unstemmed Conserved co-expression for candidate disease gene prioritization
title_short Conserved co-expression for candidate disease gene prioritization
title_sort conserved co-expression for candidate disease gene prioritization
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2383918/
https://www.ncbi.nlm.nih.gov/pubmed/18433471
http://dx.doi.org/10.1186/1471-2105-9-208
work_keys_str_mv AT otimartin conservedcoexpressionforcandidatediseasegeneprioritization
AT vanreeuwijkjeroen conservedcoexpressionforcandidatediseasegeneprioritization
AT huynenmartijna conservedcoexpressionforcandidatediseasegeneprioritization
AT brunnerhang conservedcoexpressionforcandidatediseasegeneprioritization