Cargando…

Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network

A tissue-specific gene expression shapes the formation of tissues, while gene expression changes reflect the immune response of the human body to environmental stimulations or pressure, particularly in disease conditions, such as cancers. A few genes are commonly expressed across tissues or various...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Lei, Pan, XiaoYong, Zhang, Yu-Hang, Liu, Min, Huang, Tao, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6307323/
https://www.ncbi.nlm.nih.gov/pubmed/30595815
http://dx.doi.org/10.1016/j.csbj.2018.12.002
_version_ 1783382982799654912
author Chen, Lei
Pan, XiaoYong
Zhang, Yu-Hang
Liu, Min
Huang, Tao
Cai, Yu-Dong
author_facet Chen, Lei
Pan, XiaoYong
Zhang, Yu-Hang
Liu, Min
Huang, Tao
Cai, Yu-Dong
author_sort Chen, Lei
collection PubMed
description A tissue-specific gene expression shapes the formation of tissues, while gene expression changes reflect the immune response of the human body to environmental stimulations or pressure, particularly in disease conditions, such as cancers. A few genes are commonly expressed across tissues or various cancers, while others are not. To investigate the functional differences between widely and rarely expressed genes, we defined the genes that were expressed in 32 normal tissues/cancers (i.e., called widely expressed genes; FPKM >1 in all samples) and those that were not detected (i.e., called rarely expressed genes; FPKM <1 in all samples) based on the large gene expression data set provided by Uhlen et al. Each gene was encoded using the gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment scores. Minimum redundancy maximum relevance (mRMR) was used to measure and rank these features on the mRMR feature list. Thereafter, we applied the incremental feature selection method with a supervised classifier recurrent neural network (RNN) to select the discriminate features for classifying widely expressed genes from rarely expressed genes and construct an optimum RNN classifier. The Youden's indexes generated by the optimum RNN classifier and evaluated using a 10-fold cross validation were 0.739 for normal tissues and 0.639 for cancers. Furthermore, the underlying mechanisms of the key discriminate GO and KEGG features were analyzed. Results can facilitate the identification of the expression landscape of genes and elucidation of how gene expression shapes tissues and the microenvironment of cancers.
format Online
Article
Text
id pubmed-6307323
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-63073232018-12-28 Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network Chen, Lei Pan, XiaoYong Zhang, Yu-Hang Liu, Min Huang, Tao Cai, Yu-Dong Comput Struct Biotechnol J Research Article A tissue-specific gene expression shapes the formation of tissues, while gene expression changes reflect the immune response of the human body to environmental stimulations or pressure, particularly in disease conditions, such as cancers. A few genes are commonly expressed across tissues or various cancers, while others are not. To investigate the functional differences between widely and rarely expressed genes, we defined the genes that were expressed in 32 normal tissues/cancers (i.e., called widely expressed genes; FPKM >1 in all samples) and those that were not detected (i.e., called rarely expressed genes; FPKM <1 in all samples) based on the large gene expression data set provided by Uhlen et al. Each gene was encoded using the gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment scores. Minimum redundancy maximum relevance (mRMR) was used to measure and rank these features on the mRMR feature list. Thereafter, we applied the incremental feature selection method with a supervised classifier recurrent neural network (RNN) to select the discriminate features for classifying widely expressed genes from rarely expressed genes and construct an optimum RNN classifier. The Youden's indexes generated by the optimum RNN classifier and evaluated using a 10-fold cross validation were 0.739 for normal tissues and 0.639 for cancers. Furthermore, the underlying mechanisms of the key discriminate GO and KEGG features were analyzed. Results can facilitate the identification of the expression landscape of genes and elucidation of how gene expression shapes tissues and the microenvironment of cancers. Research Network of Computational and Structural Biotechnology 2018-12-14 /pmc/articles/PMC6307323/ /pubmed/30595815 http://dx.doi.org/10.1016/j.csbj.2018.12.002 Text en © 2018 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Chen, Lei
Pan, XiaoYong
Zhang, Yu-Hang
Liu, Min
Huang, Tao
Cai, Yu-Dong
Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network
title Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network
title_full Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network
title_fullStr Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network
title_full_unstemmed Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network
title_short Classification of Widely and Rarely Expressed Genes with Recurrent Neural Network
title_sort classification of widely and rarely expressed genes with recurrent neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6307323/
https://www.ncbi.nlm.nih.gov/pubmed/30595815
http://dx.doi.org/10.1016/j.csbj.2018.12.002
work_keys_str_mv AT chenlei classificationofwidelyandrarelyexpressedgeneswithrecurrentneuralnetwork
AT panxiaoyong classificationofwidelyandrarelyexpressedgeneswithrecurrentneuralnetwork
AT zhangyuhang classificationofwidelyandrarelyexpressedgeneswithrecurrentneuralnetwork
AT liumin classificationofwidelyandrarelyexpressedgeneswithrecurrentneuralnetwork
AT huangtao classificationofwidelyandrarelyexpressedgeneswithrecurrentneuralnetwork
AT caiyudong classificationofwidelyandrarelyexpressedgeneswithrecurrentneuralnetwork