Cargando…

A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes

For precision medicine, there is a need to identify genes that accurately distinguish the physiological state or response to a particular therapy, but this can be challenging. Many methods of analyzing differential expression have been established and applied to this problem, such as t-test, edgeR,...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Qifei, Zhang, Haojian, Liang, Yuqing, Jiang, Heling, Tan, Siqiao, Luo, Feng, Yuan, Zheming, Chen, Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006292/
https://www.ncbi.nlm.nih.gov/pubmed/32082366
http://dx.doi.org/10.3389/fgene.2019.01410
_version_ 1783495113671966720
author Wang, Qifei
Zhang, Haojian
Liang, Yuqing
Jiang, Heling
Tan, Siqiao
Luo, Feng
Yuan, Zheming
Chen, Yuan
author_facet Wang, Qifei
Zhang, Haojian
Liang, Yuqing
Jiang, Heling
Tan, Siqiao
Luo, Feng
Yuan, Zheming
Chen, Yuan
author_sort Wang, Qifei
collection PubMed
description For precision medicine, there is a need to identify genes that accurately distinguish the physiological state or response to a particular therapy, but this can be challenging. Many methods of analyzing differential expression have been established and applied to this problem, such as t-test, edgeR, and DEseq2. A common feature of these methods is their focus on a linear relationship (differential expression) between gene expression and phenotype. However, they may overlook nonlinear relationships due to various factors, such as the degree of disease progression, sex, age, ethnicity, and environmental factors. Maximal information coefficient (MIC) was proposed to capture a wide range of associations of two variables in both linear and nonlinear relationships. However, with MIC it is difficult to highlight genes with nonlinear expression patterns as the genes giving the most strongly supported hits are linearly expressed, especially for noisy data. It is thus important to also efficiently identify nonlinearly expressed genes in order to unravel the molecular basis of disease and to reveal new therapeutic targets. We propose a novel nonlinearity measure called normalized differential correlation (NDC) to efficiently highlight nonlinearly expressed genes in transcriptome datasets. Validation using six real-world cancer datasets revealed that the NDC method could highlight nonlinearly expressed genes that could not be highlighted by t-test, MIC, edgeR, and DEseq2, although MIC could capture nonlinear correlations. The classification accuracy indicated that analysis of these genes could adequately distinguish cancer and paracarcinoma tissue samples. Furthermore, the results of biological interpretation of the identified genes suggested that some of them were involved in key functional pathways associated with cancer progression and metastasis. All of this evidence suggests that these nonlinearly expressed genes may play a central role in regulating cancer progression.
format Online
Article
Text
id pubmed-7006292
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-70062922020-02-20 A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes Wang, Qifei Zhang, Haojian Liang, Yuqing Jiang, Heling Tan, Siqiao Luo, Feng Yuan, Zheming Chen, Yuan Front Genet Genetics For precision medicine, there is a need to identify genes that accurately distinguish the physiological state or response to a particular therapy, but this can be challenging. Many methods of analyzing differential expression have been established and applied to this problem, such as t-test, edgeR, and DEseq2. A common feature of these methods is their focus on a linear relationship (differential expression) between gene expression and phenotype. However, they may overlook nonlinear relationships due to various factors, such as the degree of disease progression, sex, age, ethnicity, and environmental factors. Maximal information coefficient (MIC) was proposed to capture a wide range of associations of two variables in both linear and nonlinear relationships. However, with MIC it is difficult to highlight genes with nonlinear expression patterns as the genes giving the most strongly supported hits are linearly expressed, especially for noisy data. It is thus important to also efficiently identify nonlinearly expressed genes in order to unravel the molecular basis of disease and to reveal new therapeutic targets. We propose a novel nonlinearity measure called normalized differential correlation (NDC) to efficiently highlight nonlinearly expressed genes in transcriptome datasets. Validation using six real-world cancer datasets revealed that the NDC method could highlight nonlinearly expressed genes that could not be highlighted by t-test, MIC, edgeR, and DEseq2, although MIC could capture nonlinear correlations. The classification accuracy indicated that analysis of these genes could adequately distinguish cancer and paracarcinoma tissue samples. Furthermore, the results of biological interpretation of the identified genes suggested that some of them were involved in key functional pathways associated with cancer progression and metastasis. All of this evidence suggests that these nonlinearly expressed genes may play a central role in regulating cancer progression. Frontiers Media S.A. 2020-01-31 /pmc/articles/PMC7006292/ /pubmed/32082366 http://dx.doi.org/10.3389/fgene.2019.01410 Text en Copyright © 2020 Wang, Zhang, Liang, Jiang, Tan, Luo, Yuan and Chen http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Qifei
Zhang, Haojian
Liang, Yuqing
Jiang, Heling
Tan, Siqiao
Luo, Feng
Yuan, Zheming
Chen, Yuan
A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes
title A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes
title_full A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes
title_fullStr A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes
title_full_unstemmed A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes
title_short A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes
title_sort novel method to efficiently highlight nonlinearly expressed genes
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006292/
https://www.ncbi.nlm.nih.gov/pubmed/32082366
http://dx.doi.org/10.3389/fgene.2019.01410
work_keys_str_mv AT wangqifei anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT zhanghaojian anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT liangyuqing anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT jiangheling anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT tansiqiao anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT luofeng anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT yuanzheming anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT chenyuan anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT wangqifei novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT zhanghaojian novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT liangyuqing novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT jiangheling novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT tansiqiao novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT luofeng novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT yuanzheming novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes
AT chenyuan novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes