Cargando…
A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes
For precision medicine, there is a need to identify genes that accurately distinguish the physiological state or response to a particular therapy, but this can be challenging. Many methods of analyzing differential expression have been established and applied to this problem, such as t-test, edgeR,...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006292/ https://www.ncbi.nlm.nih.gov/pubmed/32082366 http://dx.doi.org/10.3389/fgene.2019.01410 |
_version_ | 1783495113671966720 |
---|---|
author | Wang, Qifei Zhang, Haojian Liang, Yuqing Jiang, Heling Tan, Siqiao Luo, Feng Yuan, Zheming Chen, Yuan |
author_facet | Wang, Qifei Zhang, Haojian Liang, Yuqing Jiang, Heling Tan, Siqiao Luo, Feng Yuan, Zheming Chen, Yuan |
author_sort | Wang, Qifei |
collection | PubMed |
description | For precision medicine, there is a need to identify genes that accurately distinguish the physiological state or response to a particular therapy, but this can be challenging. Many methods of analyzing differential expression have been established and applied to this problem, such as t-test, edgeR, and DEseq2. A common feature of these methods is their focus on a linear relationship (differential expression) between gene expression and phenotype. However, they may overlook nonlinear relationships due to various factors, such as the degree of disease progression, sex, age, ethnicity, and environmental factors. Maximal information coefficient (MIC) was proposed to capture a wide range of associations of two variables in both linear and nonlinear relationships. However, with MIC it is difficult to highlight genes with nonlinear expression patterns as the genes giving the most strongly supported hits are linearly expressed, especially for noisy data. It is thus important to also efficiently identify nonlinearly expressed genes in order to unravel the molecular basis of disease and to reveal new therapeutic targets. We propose a novel nonlinearity measure called normalized differential correlation (NDC) to efficiently highlight nonlinearly expressed genes in transcriptome datasets. Validation using six real-world cancer datasets revealed that the NDC method could highlight nonlinearly expressed genes that could not be highlighted by t-test, MIC, edgeR, and DEseq2, although MIC could capture nonlinear correlations. The classification accuracy indicated that analysis of these genes could adequately distinguish cancer and paracarcinoma tissue samples. Furthermore, the results of biological interpretation of the identified genes suggested that some of them were involved in key functional pathways associated with cancer progression and metastasis. All of this evidence suggests that these nonlinearly expressed genes may play a central role in regulating cancer progression. |
format | Online Article Text |
id | pubmed-7006292 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-70062922020-02-20 A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes Wang, Qifei Zhang, Haojian Liang, Yuqing Jiang, Heling Tan, Siqiao Luo, Feng Yuan, Zheming Chen, Yuan Front Genet Genetics For precision medicine, there is a need to identify genes that accurately distinguish the physiological state or response to a particular therapy, but this can be challenging. Many methods of analyzing differential expression have been established and applied to this problem, such as t-test, edgeR, and DEseq2. A common feature of these methods is their focus on a linear relationship (differential expression) between gene expression and phenotype. However, they may overlook nonlinear relationships due to various factors, such as the degree of disease progression, sex, age, ethnicity, and environmental factors. Maximal information coefficient (MIC) was proposed to capture a wide range of associations of two variables in both linear and nonlinear relationships. However, with MIC it is difficult to highlight genes with nonlinear expression patterns as the genes giving the most strongly supported hits are linearly expressed, especially for noisy data. It is thus important to also efficiently identify nonlinearly expressed genes in order to unravel the molecular basis of disease and to reveal new therapeutic targets. We propose a novel nonlinearity measure called normalized differential correlation (NDC) to efficiently highlight nonlinearly expressed genes in transcriptome datasets. Validation using six real-world cancer datasets revealed that the NDC method could highlight nonlinearly expressed genes that could not be highlighted by t-test, MIC, edgeR, and DEseq2, although MIC could capture nonlinear correlations. The classification accuracy indicated that analysis of these genes could adequately distinguish cancer and paracarcinoma tissue samples. Furthermore, the results of biological interpretation of the identified genes suggested that some of them were involved in key functional pathways associated with cancer progression and metastasis. All of this evidence suggests that these nonlinearly expressed genes may play a central role in regulating cancer progression. Frontiers Media S.A. 2020-01-31 /pmc/articles/PMC7006292/ /pubmed/32082366 http://dx.doi.org/10.3389/fgene.2019.01410 Text en Copyright © 2020 Wang, Zhang, Liang, Jiang, Tan, Luo, Yuan and Chen http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Wang, Qifei Zhang, Haojian Liang, Yuqing Jiang, Heling Tan, Siqiao Luo, Feng Yuan, Zheming Chen, Yuan A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes |
title | A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes |
title_full | A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes |
title_fullStr | A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes |
title_full_unstemmed | A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes |
title_short | A Novel Method to Efficiently Highlight Nonlinearly Expressed Genes |
title_sort | novel method to efficiently highlight nonlinearly expressed genes |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7006292/ https://www.ncbi.nlm.nih.gov/pubmed/32082366 http://dx.doi.org/10.3389/fgene.2019.01410 |
work_keys_str_mv | AT wangqifei anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT zhanghaojian anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT liangyuqing anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT jiangheling anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT tansiqiao anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT luofeng anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT yuanzheming anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT chenyuan anovelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT wangqifei novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT zhanghaojian novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT liangyuqing novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT jiangheling novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT tansiqiao novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT luofeng novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT yuanzheming novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes AT chenyuan novelmethodtoefficientlyhighlightnonlinearlyexpressedgenes |