Cargando…

Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis

Rheumatoid arthritis (RA) is an incurable disease that afflicts 0.5–1.0% of the global population though it is less threatening at its early stage. Therefore, improved diagnostic efficiency and prognostic outcome are critical for confronting RA. Although machine learning is considered a promising te...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Jianwei, Wang, Rongsheng, Cai, Xu, Ye, Zhizhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7905311/
https://www.ncbi.nlm.nih.gov/pubmed/33643380
http://dx.doi.org/10.3389/fgene.2021.604714
_version_ 1783655086128365568
author Xiao, Jianwei
Wang, Rongsheng
Cai, Xu
Ye, Zhizhong
author_facet Xiao, Jianwei
Wang, Rongsheng
Cai, Xu
Ye, Zhizhong
author_sort Xiao, Jianwei
collection PubMed
description Rheumatoid arthritis (RA) is an incurable disease that afflicts 0.5–1.0% of the global population though it is less threatening at its early stage. Therefore, improved diagnostic efficiency and prognostic outcome are critical for confronting RA. Although machine learning is considered a promising technique in clinical research, its potential in verifying the biological significance of gene was not fully exploited. The performance of a machine learning model depends greatly on the features used for model training; therefore, the effectiveness of prediction might reflect the quality of input features. In the present study, we used weighted gene co-expression network analysis (WGCNA) in conjunction with differentially expressed gene (DEG) analysis to select the key genes that were highly associated with RA phenotypes based on multiple microarray datasets of RA blood samples, after which they were used as features in machine learning model validation. A total of six machine learning models were used to validate the biological significance of the key genes based on gene expression, among which five models achieved good performances [area under curve (AUC) >0.85], suggesting that our currently identified key genes are biologically significant and highly representative of genes involved in RA. Combined with other biological interpretations including Gene Ontology (GO) analysis, protein–protein interaction (PPI) network analysis, as well as inference of immune cell composition, our current study might shed a light on the in-depth study of RA diagnosis and prognosis.
format Online
Article
Text
id pubmed-7905311
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79053112021-02-26 Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis Xiao, Jianwei Wang, Rongsheng Cai, Xu Ye, Zhizhong Front Genet Genetics Rheumatoid arthritis (RA) is an incurable disease that afflicts 0.5–1.0% of the global population though it is less threatening at its early stage. Therefore, improved diagnostic efficiency and prognostic outcome are critical for confronting RA. Although machine learning is considered a promising technique in clinical research, its potential in verifying the biological significance of gene was not fully exploited. The performance of a machine learning model depends greatly on the features used for model training; therefore, the effectiveness of prediction might reflect the quality of input features. In the present study, we used weighted gene co-expression network analysis (WGCNA) in conjunction with differentially expressed gene (DEG) analysis to select the key genes that were highly associated with RA phenotypes based on multiple microarray datasets of RA blood samples, after which they were used as features in machine learning model validation. A total of six machine learning models were used to validate the biological significance of the key genes based on gene expression, among which five models achieved good performances [area under curve (AUC) >0.85], suggesting that our currently identified key genes are biologically significant and highly representative of genes involved in RA. Combined with other biological interpretations including Gene Ontology (GO) analysis, protein–protein interaction (PPI) network analysis, as well as inference of immune cell composition, our current study might shed a light on the in-depth study of RA diagnosis and prognosis. Frontiers Media S.A. 2021-02-11 /pmc/articles/PMC7905311/ /pubmed/33643380 http://dx.doi.org/10.3389/fgene.2021.604714 Text en Copyright © 2021 Xiao, Wang, Cai and Ye. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xiao, Jianwei
Wang, Rongsheng
Cai, Xu
Ye, Zhizhong
Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis
title Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis
title_full Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis
title_fullStr Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis
title_full_unstemmed Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis
title_short Coupling of Co-expression Network Analysis and Machine Learning Validation Unearthed Potential Key Genes Involved in Rheumatoid Arthritis
title_sort coupling of co-expression network analysis and machine learning validation unearthed potential key genes involved in rheumatoid arthritis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7905311/
https://www.ncbi.nlm.nih.gov/pubmed/33643380
http://dx.doi.org/10.3389/fgene.2021.604714
work_keys_str_mv AT xiaojianwei couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis
AT wangrongsheng couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis
AT caixu couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis
AT yezhizhong couplingofcoexpressionnetworkanalysisandmachinelearningvalidationunearthedpotentialkeygenesinvolvedinrheumatoidarthritis