Genome-wide matching of genes to cellular roles using guilt-by-association models derived from single sample analysis

BACKGROUND: High-throughput methods that ascribe a cellular or physiological function for each gene product are useful to understand the roles of genes that have not been extensively characterized by molecular or genetic approaches. One method to infer gene function is "guilt-by-association"...

Bibliographic Details
Main authors: Klomp, Jeff A, Furge, Kyle A
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599284/
https://www.ncbi.nlm.nih.gov/pubmed/22824328
http://dx.doi.org/10.1186/1756-0500-5-370
_version_ 1782262928648437760
author Klomp, Jeff A
Furge, Kyle A
author_facet Klomp, Jeff A
Furge, Kyle A
author_sort Klomp, Jeff A
collection PubMed
description BACKGROUND: High-throughput methods that ascribe a cellular or physiological function for each gene product are useful for understanding the roles of genes that have not been extensively characterized by molecular or genetic approaches. One method to infer gene function is "guilt-by-association", in which the expression pattern of a poorly characterized gene is shown to co-vary with the expression of better-characterized genes. The function of the poorly characterized gene is inferred from the known function(s) of the well-described genes. For example, genes co-expressed with transcripts that vary during the cell cycle, development, and environmental stress, and with oncogenesis, have been implicated in those processes.

FINDINGS: While examining the expression characteristics of several poorly characterized genes, we noted that we could associate each of the genes with a cellular phenotype by correlating individual gene expression changes with gene set enrichment scores from individual samples. We evaluated the effectiveness of this approach using a modest-sized gene expression data set (expO) and a compendium of gene expression phenotypes (MSigDBv3.0). We found that the transcripts that correlated best with enrichment in mitochondrial and lysosomal gene sets were mostly related to those processes (89/100 and 44/50, respectively). The reciprocal evaluation, ranking gene sets according to correlation of enrichment with an individual gene’s expression, also reflected known associations for prominent genes in the biomedical literature (16/19). In evaluating the model, we also found that 4% of the genome encodes proteins that are associated with small molecule and small peptide signal transduction gene sets, implicating a large number of genes in both internal and external environmental sensing.

CONCLUSIONS: Our results show that this approach is useful for inferring functions of disparate sets of genes. This method mirrors the biological experimental approaches used by others to associate individual genes with defined gene expression changes. Moreover, the approach can be used beyond discovering genes related to a cellular process to discover meaningful expression phenotypes from a compendium that are associated with a given gene. The effectiveness, versatility, and breadth of this approach make possible its application in a variety of contexts and with a variety of downstream analyses.
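The correlation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the single-sample enrichment scores were computed, so the mean z-score of a set's genes is used here as a simple stand-in. The function names, the synthetic data, and the gene set names are all hypothetical.

```python
import numpy as np

def sample_set_scores(expr, set_idx):
    """Per-sample enrichment proxy (assumption): mean z-score of the set's genes.

    expr is a genes x samples matrix; set_idx lists the row indices of the set.
    """
    z = (expr - expr.mean(axis=1, keepdims=True)) / expr.std(axis=1, keepdims=True)
    return z[set_idx].mean(axis=0)

def rank_genes_for_set(expr, set_idx):
    """Rank all genes by Pearson correlation with one set's per-sample scores."""
    scores = sample_set_scores(expr, set_idx)
    xc = expr - expr.mean(axis=1, keepdims=True)
    sc = scores - scores.mean()
    r = (xc @ sc) / (np.linalg.norm(xc, axis=1) * np.linalg.norm(sc))
    return np.argsort(-r), r  # gene indices (best first) and all correlations

def rank_sets_for_gene(expr, gene, gene_sets):
    """Reciprocal view: rank gene sets by correlation with one gene's expression."""
    names = list(gene_sets)
    r = np.array([
        np.corrcoef(expr[gene], sample_set_scores(expr, gene_sets[n]))[0, 1]
        for n in names
    ])
    return [(names[i], float(r[i])) for i in np.argsort(-r)]

# Synthetic example: genes 0-4 form a set driven by a hidden per-sample factor;
# gene 5 follows the same factor but is not annotated as a set member.
rng = np.random.default_rng(0)
factor = rng.normal(size=60)
expr = rng.normal(size=(30, 60))
expr[:6] += 2.0 * factor

gene_sets = {"factor_set": [0, 1, 2, 3, 4], "noise_set": [10, 11, 12, 13, 14]}
order, r = rank_genes_for_set(expr, gene_sets["factor_set"])
set_ranking = rank_sets_for_gene(expr, 5, gene_sets)
```

In this sketch, members of a set contribute to that set's score, so their own correlations with it are inflated. When the gene of interest is itself a set member, a fairer comparison would recompute the score with that gene left out.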
format Online
Article
Text
id pubmed-3599284
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3599284 2013-03-17 Genome-wide matching of genes to cellular roles using guilt-by-association models derived from single sample analysis Klomp, Jeff A Furge, Kyle A BMC Res Notes Technical Note BioMed Central 2012-07-23 /pmc/articles/PMC3599284/ /pubmed/22824328 http://dx.doi.org/10.1186/1756-0500-5-370 Text en Copyright ©2012 Klomp and Furge; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Klomp, Jeff A
Furge, Kyle A
Genome-wide matching of genes to cellular roles using guilt-by-association models derived from single sample analysis
title Genome-wide matching of genes to cellular roles using guilt-by-association models derived from single sample analysis
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3599284/
https://www.ncbi.nlm.nih.gov/pubmed/22824328
http://dx.doi.org/10.1186/1756-0500-5-370