Cargando…

Discovering Pair-wise Synergies in Microarray Data

Informative gene selection can have important implications for the improvement of cancer diagnosis and the identification of new drug targets. Individual-gene-ranking methods ignore interactions between genes. Furthermore, popular pair-wise gene evaluation methods, e.g. TSP and TSG, are helpless for...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yuan, Cao, Dan, Gao, Jun, Yuan, Zheming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965793/
https://www.ncbi.nlm.nih.gov/pubmed/27470995
http://dx.doi.org/10.1038/srep30672
_version_ 1782445317396889600
author Chen, Yuan
Cao, Dan
Gao, Jun
Yuan, Zheming
author_facet Chen, Yuan
Cao, Dan
Gao, Jun
Yuan, Zheming
author_sort Chen, Yuan
collection PubMed
description Informative gene selection can have important implications for the improvement of cancer diagnosis and the identification of new drug targets. Individual-gene-ranking methods ignore interactions between genes. Furthermore, popular pair-wise gene evaluation methods, e.g. TSP and TSG, are helpless for discovering pair-wise interactions. Several efforts to discover pair-wise synergy have been made based on the information approach, such as EMBP and FeatKNN. However, the methods which are employed to estimate mutual information, e.g. binarization, histogram-based and KNN estimators, depend on known data or domain characteristics. Recently, Reshef et al. proposed a novel maximal information coefficient (MIC) measure to capture a wide range of associations between two variables that has the property of generality. An extension from MIC(X; Y) to MIC(X(1); X(2); Y) is therefore desired. We developed an approximation algorithm for estimating MIC(X(1); X(2); Y) where Y is a discrete variable. MIC(X(1); X(2); Y) is employed to detect pair-wise synergy in simulation and cancer microarray data. The results indicate that MIC(X(1); X(2); Y) also has the property of generality. It can discover synergic genes that are undetectable by reference feature selection methods such as MIC(X; Y) and TSG. Synergic genes can distinguish different phenotypes. Finally, the biological relevance of these synergic genes is validated with GO annotation and OUgene database.
format Online
Article
Text
id pubmed-4965793
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-49657932016-08-08 Discovering Pair-wise Synergies in Microarray Data Chen, Yuan Cao, Dan Gao, Jun Yuan, Zheming Sci Rep Article Informative gene selection can have important implications for the improvement of cancer diagnosis and the identification of new drug targets. Individual-gene-ranking methods ignore interactions between genes. Furthermore, popular pair-wise gene evaluation methods, e.g. TSP and TSG, are helpless for discovering pair-wise interactions. Several efforts to discover pair-wise synergy have been made based on the information approach, such as EMBP and FeatKNN. However, the methods which are employed to estimate mutual information, e.g. binarization, histogram-based and KNN estimators, depend on known data or domain characteristics. Recently, Reshef et al. proposed a novel maximal information coefficient (MIC) measure to capture a wide range of associations between two variables that has the property of generality. An extension from MIC(X; Y) to MIC(X(1); X(2); Y) is therefore desired. We developed an approximation algorithm for estimating MIC(X(1); X(2); Y) where Y is a discrete variable. MIC(X(1); X(2); Y) is employed to detect pair-wise synergy in simulation and cancer microarray data. The results indicate that MIC(X(1); X(2); Y) also has the property of generality. It can discover synergic genes that are undetectable by reference feature selection methods such as MIC(X; Y) and TSG. Synergic genes can distinguish different phenotypes. Finally, the biological relevance of these synergic genes is validated with GO annotation and OUgene database. Nature Publishing Group 2016-07-29 /pmc/articles/PMC4965793/ /pubmed/27470995 http://dx.doi.org/10.1038/srep30672 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Chen, Yuan
Cao, Dan
Gao, Jun
Yuan, Zheming
Discovering Pair-wise Synergies in Microarray Data
title Discovering Pair-wise Synergies in Microarray Data
title_full Discovering Pair-wise Synergies in Microarray Data
title_fullStr Discovering Pair-wise Synergies in Microarray Data
title_full_unstemmed Discovering Pair-wise Synergies in Microarray Data
title_short Discovering Pair-wise Synergies in Microarray Data
title_sort discovering pair-wise synergies in microarray data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4965793/
https://www.ncbi.nlm.nih.gov/pubmed/27470995
http://dx.doi.org/10.1038/srep30672
work_keys_str_mv AT chenyuan discoveringpairwisesynergiesinmicroarraydata
AT caodan discoveringpairwisesynergiesinmicroarraydata
AT gaojun discoveringpairwisesynergiesinmicroarraydata
AT yuanzheming discoveringpairwisesynergiesinmicroarraydata