Cargando…

Graph ranking for exploratory gene data analysis

BACKGROUND: Microarray technology has made it possible to simultaneously monitor the expression levels of thousands of genes in a single experiment. However, the large number of genes greatly increases the challenges of analyzing, comprehending and interpreting the resulting mass of data. Selecting...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Cuilan, Dang, Xin, Chen, Yixin, Wilkins, Dawn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226190/
https://www.ncbi.nlm.nih.gov/pubmed/19811684
http://dx.doi.org/10.1186/1471-2105-10-S11-S19
_version_ 1782217578950688768
author Gao, Cuilan
Dang, Xin
Chen, Yixin
Wilkins, Dawn
author_facet Gao, Cuilan
Dang, Xin
Chen, Yixin
Wilkins, Dawn
author_sort Gao, Cuilan
collection PubMed
description BACKGROUND: Microarray technology has made it possible to simultaneously monitor the expression levels of thousands of genes in a single experiment. However, the large number of genes greatly increases the challenges of analyzing, comprehending and interpreting the resulting mass of data. Selecting a subset of important genes is inevitable to address the challenge. Gene selection has been investigated extensively over the last decade. Most selection procedures, however, are not sufficient for accurate inference of underlying biology, because biological significance does not necessarily have to be statistically significant. Additional biological knowledge needs to be integrated into the gene selection procedure. RESULTS: We propose a general framework for gene ranking. We construct a bipartite graph from the Gene Ontology (GO) and gene expression data. The graph describes the relationship between genes and their associated molecular functions. Under a species condition, edge weights of the graph are assigned to be gene expression level. Such a graph provides a mathematical means to represent both species-independent and species-dependent biological information. We also develop a new ranking algorithm to analyze the weighted graph via a kernelized spatial depth (KSD) approach. Consequently, the importance of gene and molecular function can be simultaneously ranked by a real-valued measure, KSD, which incorporates the global and local structure of the graph. Over-expressed and under-regulated genes also can be separately ranked. CONCLUSION: The gene-function bigraph integrates molecular function annotations into gene expression data. The relevance of genes is described in the graph (through a common function). The proposed method provides an exploratory framework for gene data analysis.
format Online
Article
Text
id pubmed-3226190
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32261902011-11-30 Graph ranking for exploratory gene data analysis Gao, Cuilan Dang, Xin Chen, Yixin Wilkins, Dawn BMC Bioinformatics Proceedings BACKGROUND: Microarray technology has made it possible to simultaneously monitor the expression levels of thousands of genes in a single experiment. However, the large number of genes greatly increases the challenges of analyzing, comprehending and interpreting the resulting mass of data. Selecting a subset of important genes is inevitable to address the challenge. Gene selection has been investigated extensively over the last decade. Most selection procedures, however, are not sufficient for accurate inference of underlying biology, because biological significance does not necessarily have to be statistically significant. Additional biological knowledge needs to be integrated into the gene selection procedure. RESULTS: We propose a general framework for gene ranking. We construct a bipartite graph from the Gene Ontology (GO) and gene expression data. The graph describes the relationship between genes and their associated molecular functions. Under a species condition, edge weights of the graph are assigned to be gene expression level. Such a graph provides a mathematical means to represent both species-independent and species-dependent biological information. We also develop a new ranking algorithm to analyze the weighted graph via a kernelized spatial depth (KSD) approach. Consequently, the importance of gene and molecular function can be simultaneously ranked by a real-valued measure, KSD, which incorporates the global and local structure of the graph. Over-expressed and under-regulated genes also can be separately ranked. CONCLUSION: The gene-function bigraph integrates molecular function annotations into gene expression data. The relevance of genes is described in the graph (through a common function). The proposed method provides an exploratory framework for gene data analysis. BioMed Central 2009-10-08 /pmc/articles/PMC3226190/ /pubmed/19811684 http://dx.doi.org/10.1186/1471-2105-10-S11-S19 Text en Copyright ©2009 Gao et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Gao, Cuilan
Dang, Xin
Chen, Yixin
Wilkins, Dawn
Graph ranking for exploratory gene data analysis
title Graph ranking for exploratory gene data analysis
title_full Graph ranking for exploratory gene data analysis
title_fullStr Graph ranking for exploratory gene data analysis
title_full_unstemmed Graph ranking for exploratory gene data analysis
title_short Graph ranking for exploratory gene data analysis
title_sort graph ranking for exploratory gene data analysis
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226190/
https://www.ncbi.nlm.nih.gov/pubmed/19811684
http://dx.doi.org/10.1186/1471-2105-10-S11-S19
work_keys_str_mv AT gaocuilan graphrankingforexploratorygenedataanalysis
AT dangxin graphrankingforexploratorygenedataanalysis
AT chenyixin graphrankingforexploratorygenedataanalysis
AT wilkinsdawn graphrankingforexploratorygenedataanalysis