Cargando…

Disease Gene Characterization through Large-Scale Co-Expression Analysis

BACKGROUND: In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET). RESULTS: Celsius, the largest co-normalized microarray dataset of Affymetrix based g...

Descripción completa

Detalles Bibliográficos
Autores principales: Day, Allen, Dong, Jun, Funari, Vincent A., Harry, Bret, Strom, Samuel P., Cohn, Dan H., Nelson, Stanley F.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2797297/
https://www.ncbi.nlm.nih.gov/pubmed/20046828
http://dx.doi.org/10.1371/journal.pone.0008491
_version_ 1782175600491888640
author Day, Allen
Dong, Jun
Funari, Vincent A.
Harry, Bret
Strom, Samuel P.
Cohn, Dan H.
Nelson, Stanley F.
author_facet Day, Allen
Dong, Jun
Funari, Vincent A.
Harry, Bret
Strom, Samuel P.
Cohn, Dan H.
Nelson, Stanley F.
author_sort Day, Allen
collection PubMed
description BACKGROUND: In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET). RESULTS: Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2) and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders. DISCUSSION: UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis.
format Text
id pubmed-2797297
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27972972009-12-31 Disease Gene Characterization through Large-Scale Co-Expression Analysis Day, Allen Dong, Jun Funari, Vincent A. Harry, Bret Strom, Samuel P. Cohn, Dan H. Nelson, Stanley F. PLoS One Research Article BACKGROUND: In the post genome era, a major goal of biology is the identification of specific roles for individual genes. We report a new genomic tool for gene characterization, the UCLA Gene Expression Tool (UGET). RESULTS: Celsius, the largest co-normalized microarray dataset of Affymetrix based gene expression, was used to calculate the correlation between all possible gene pairs on all platforms, and generate stored indexes in a web searchable format. The size of Celsius makes UGET a powerful gene characterization tool. Using a small seed list of known cartilage-selective genes, UGET extended the list of known genes by identifying 32 new highly cartilage-selective genes. Of these, 7 of 10 tested were validated by qPCR including the novel cartilage-specific genes SDK2 and FLJ41170. In addition, we retrospectively tested UGET and other gene expression based prioritization tools to identify disease-causing genes within known linkage intervals. We first demonstrated this utility with UGET using genetically heterogeneous disorders such as Joubert syndrome, microcephaly, neuropsychiatric disorders and type 2 limb girdle muscular dystrophy (LGMD2) and then compared UGET to other gene expression based prioritization programs which use small but discrete and well annotated datasets. Finally, we observed a significantly higher gene correlation shared between genes in disease networks associated with similar complex or Mendelian disorders. DISCUSSION: UGET is an invaluable resource for a geneticist that permits the rapid inclusion of expression criteria from one to hundreds of genes in genomic intervals linked to disease. By using thousands of arrays UGET annotates and prioritizes genes better than other tools especially with rare tissue disorders or complex multi-tissue biological processes. This information can be critical in prioritization of candidate genes for sequence analysis. Public Library of Science 2009-12-31 /pmc/articles/PMC2797297/ /pubmed/20046828 http://dx.doi.org/10.1371/journal.pone.0008491 Text en Day et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Day, Allen
Dong, Jun
Funari, Vincent A.
Harry, Bret
Strom, Samuel P.
Cohn, Dan H.
Nelson, Stanley F.
Disease Gene Characterization through Large-Scale Co-Expression Analysis
title Disease Gene Characterization through Large-Scale Co-Expression Analysis
title_full Disease Gene Characterization through Large-Scale Co-Expression Analysis
title_fullStr Disease Gene Characterization through Large-Scale Co-Expression Analysis
title_full_unstemmed Disease Gene Characterization through Large-Scale Co-Expression Analysis
title_short Disease Gene Characterization through Large-Scale Co-Expression Analysis
title_sort disease gene characterization through large-scale co-expression analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2797297/
https://www.ncbi.nlm.nih.gov/pubmed/20046828
http://dx.doi.org/10.1371/journal.pone.0008491
work_keys_str_mv AT dayallen diseasegenecharacterizationthroughlargescalecoexpressionanalysis
AT dongjun diseasegenecharacterizationthroughlargescalecoexpressionanalysis
AT funarivincenta diseasegenecharacterizationthroughlargescalecoexpressionanalysis
AT harrybret diseasegenecharacterizationthroughlargescalecoexpressionanalysis
AT stromsamuelp diseasegenecharacterizationthroughlargescalecoexpressionanalysis
AT cohndanh diseasegenecharacterizationthroughlargescalecoexpressionanalysis
AT nelsonstanleyf diseasegenecharacterizationthroughlargescalecoexpressionanalysis