Cargando…
Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis
Rheumatoid arthritis (RA) is an incurable disease that afflicts 0.5–1.0% of the global population though it is less threatening at its early stage. Therefore, improved diagnostic efficiency and prognostic outcome are critical for confronting RA. Although machine learning is considered a promising te...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7905311/ https://www.ncbi.nlm.nih.gov/pubmed/33643380 http://dx.doi.org/10.3389/fgene.2021.604714 |
_version_ | 1783655086128365568 |
---|---|
author | Xiao, Jianwei Wang, Rongsheng Cai, Xu Ye, Zhizhong |
author_facet | Xiao, Jianwei Wang, Rongsheng Cai, Xu Ye, Zhizhong |
author_sort | Xiao, Jianwei |
collection | PubMed |
description | Rheumatoid arthritis (RA) is an incurable disease that afflicts 0.5–1.0% of the global population though it is less threatening at its early stage. Therefore, improved diagnostic efficiency and prognostic outcome are critical for confronting RA. Although machine learning is considered a promising technique in clinical research, its potential in verifying the biological significance of gene was not fully exploited. The performance of a machine learning model depends greatly on the features used for model training; therefore, the effectiveness of prediction might reflect the quality of input features. In the present study, we used weighted gene co-expression network analysis (WGCNA) in conjunction with differentially expressed gene (DEG) analysis to select the key genes that were highly associated with RA phenotypes based on multiple microarray datasets of RA blood samples, after which they were used as features in machine learning model validation. A total of six machine learning models were used to validate the biological significance of the key genes based on gene expression, among which five models achieved good performances [area under curve (AUC) >0.85], suggesting that our currently identified key genes are biologically significant and highly representative of genes involved in RA. Combined with other biological interpretations including Gene Ontology (GO) analysis, protein–protein interaction (PPI) network analysis, as well as inference of immune cell composition, our current study might shed a light on the in-depth study of RA diagnosis and prognosis. |
format | Online Article Text |
id | pubmed-7905311 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79053112021-02-26 Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis Xiao, Jianwei Wang, Rongsheng Cai, Xu Ye, Zhizhong Front Genet Genetics Rheumatoid arthritis (RA) is an incurable disease that afflicts 0.5–1.0% of the global population though it is less threatening at its early stage. Therefore, improved diagnostic efficiency and prognostic outcome are critical for confronting RA. Although machine learning is considered a promising technique in clinical research, its potential in verifying the biological significance of gene was not fully exploited. The performance of a machine learning model depends greatly on the features used for model training; therefore, the effectiveness of prediction might reflect the quality of input features. In the present study, we used weighted gene co-expression network analysis (WGCNA) in conjunction with differentially expressed gene (DEG) analysis to select the key genes that were highly associated with RA phenotypes based on multiple microarray datasets of RA blood samples, after which they were used as features in machine learning model validation. A total of six machine learning models were used to validate the biological significance of the key genes based on gene expression, among which five models achieved good performances [area under curve (AUC) >0.85], suggesting that our currently identified key genes are biologically significant and highly representative of genes involved in RA. Combined with other biological interpretations including Gene Ontology (GO) analysis, protein–protein interaction (PPI) network analysis, as well as inference of immune cell composition, our current study might shed a light on the in-depth study of RA diagnosis and prognosis. Frontiers Media S.A. 2021-02-11 /pmc/articles/PMC7905311/ /pubmed/33643380 http://dx.doi.org/10.3389/fgene.2021.604714 Text en Copyright © 2021 Xiao, Wang, Cai and Ye. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Xiao, Jianwei Wang, Rongsheng Cai, Xu Ye, Zhizhong Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis |
title | Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis |
title_full | Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis |
title_fullStr | Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis |
title_full_unstemmed | Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis |
title_short | Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis |
title_sort | coupling of co-expression network analysis and machine learning validation unearthed potential key genes involved in rheumatoid arthritis |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7905311/ https://www.ncbi.nlm.nih.gov/pubmed/33643380 http://dx.doi.org/10.3389/fgene.2021.604714 |
work_keys_str_mv | AT xiaojianwei couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis AT wangrongsheng couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis AT caixu couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis AT yezhizhong couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis |