Cargando…

Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence

Gene set testing problem has become the focus of microarray data analysis. A gene set is a group of genes that are defined by a priori biological knowledge. Several statistical methods have been proposed to determine whether functional gene sets express differentially (enrichment and/or deletion) in...

Descripción completa

Detalles Bibliográficos
Autores principales: Tsai, Chen-An, Liu, Li-Yu Daisy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3597597/
https://www.ncbi.nlm.nih.gov/pubmed/23516564
http://dx.doi.org/10.1371/journal.pone.0058851
_version_ 1782262656130875392
author Tsai, Chen-An
Liu, Li-Yu Daisy
author_facet Tsai, Chen-An
Liu, Li-Yu Daisy
author_sort Tsai, Chen-An
collection PubMed
description Gene set testing problem has become the focus of microarray data analysis. A gene set is a group of genes that are defined by a priori biological knowledge. Several statistical methods have been proposed to determine whether functional gene sets express differentially (enrichment and/or deletion) in variations of phenotypes. However, little attention has been given to analyzing the dependence structure among gene sets. In this study, we have proposed a novel statistical method of gene set association analysis to identify significantly associated gene sets using the coefficient of intrinsic dependence. The simulation studies show that the proposed method outperforms the conventional methods to detect general forms of association in terms of control of type I error and power. The correlation of intrinsic dependence has been applied to a breast cancer microarray dataset to quantify the un-supervised relationship between two sets of genes in the tumor and non-tumor samples. It was observed that the existence of gene-set association differed across various clinical cohorts. In addition, a supervised learning was employed to illustrate how gene sets, in signaling transduction pathways or subnetworks regulated by a set of transcription factors, can be discovered using microarray data. In conclusion, the coefficient of intrinsic dependence provides a powerful tool for detecting general types of association. Hence, it can be useful to associate gene sets using microarray expression data. Through connecting relevant gene sets, our approach has the potential to reveal underlying associations by drawing a statistically relevant network in a given population, and it can also be used to complement the conventional gene set analysis.
format Online
Article
Text
id pubmed-3597597
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35975972013-03-20 Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence Tsai, Chen-An Liu, Li-Yu Daisy PLoS One Research Article Gene set testing problem has become the focus of microarray data analysis. A gene set is a group of genes that are defined by a priori biological knowledge. Several statistical methods have been proposed to determine whether functional gene sets express differentially (enrichment and/or deletion) in variations of phenotypes. However, little attention has been given to analyzing the dependence structure among gene sets. In this study, we have proposed a novel statistical method of gene set association analysis to identify significantly associated gene sets using the coefficient of intrinsic dependence. The simulation studies show that the proposed method outperforms the conventional methods to detect general forms of association in terms of control of type I error and power. The correlation of intrinsic dependence has been applied to a breast cancer microarray dataset to quantify the un-supervised relationship between two sets of genes in the tumor and non-tumor samples. It was observed that the existence of gene-set association differed across various clinical cohorts. In addition, a supervised learning was employed to illustrate how gene sets, in signaling transduction pathways or subnetworks regulated by a set of transcription factors, can be discovered using microarray data. In conclusion, the coefficient of intrinsic dependence provides a powerful tool for detecting general types of association. Hence, it can be useful to associate gene sets using microarray expression data. Through connecting relevant gene sets, our approach has the potential to reveal underlying associations by drawing a statistically relevant network in a given population, and it can also be used to complement the conventional gene set analysis. Public Library of Science 2013-03-14 /pmc/articles/PMC3597597/ /pubmed/23516564 http://dx.doi.org/10.1371/journal.pone.0058851 Text en © 2013 Tsai Liu http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Tsai, Chen-An
Liu, Li-Yu Daisy
Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence
title Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence
title_full Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence
title_fullStr Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence
title_full_unstemmed Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence
title_short Identifying Gene Set Association Enrichment Using the Coefficient of Intrinsic Dependence
title_sort identifying gene set association enrichment using the coefficient of intrinsic dependence
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3597597/
https://www.ncbi.nlm.nih.gov/pubmed/23516564
http://dx.doi.org/10.1371/journal.pone.0058851
work_keys_str_mv AT tsaichenan identifyinggenesetassociationenrichmentusingthecoefficientofintrinsicdependence
AT liuliyudaisy identifyinggenesetassociationenrichmentusingthecoefficientofintrinsicdependence