Cargando…
Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm
BACKGROUND: Type 2 diabetes is a major health concern worldwide. The present study is aimed at discovering effective biomarkers for an efficient diagnosis of type 2 diabetes. METHODS: Differentially expressed genes (DEGs) between type 2 diabetes patients and normal controls were identified by analys...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8916865/ https://www.ncbi.nlm.nih.gov/pubmed/35281591 http://dx.doi.org/10.1155/2022/1230761 |
_version_ | 1784668410113687552 |
---|---|
author | Li, Jiabin Ding, Jieying Zhi, D. U. Gu, Kaiyun Wang, Hui |
author_facet | Li, Jiabin Ding, Jieying Zhi, D. U. Gu, Kaiyun Wang, Hui |
author_sort | Li, Jiabin |
collection | PubMed |
description | BACKGROUND: Type 2 diabetes is a major health concern worldwide. The present study is aimed at discovering effective biomarkers for an efficient diagnosis of type 2 diabetes. METHODS: Differentially expressed genes (DEGs) between type 2 diabetes patients and normal controls were identified by analyses of integrated microarray data obtained from the Gene Expression Omnibus database using the Limma package. Functional analysis of genes was performed using the R software package clusterProfiler. Analyses of protein-protein interaction (PPI) performed using Cytoscape with the CytoHubba plugin were used to determine the most sensitive diagnostic gene biomarkers for type 2 diabetes in our study. The support vector machine (SVM) classification model was used to validate the gene biomarkers used for the diagnosis of type 2 diabetes. RESULTS: GSE164416 dataset analysis revealed 499 genes that were differentially expressed between type 2 diabetes patients and normal controls, and these DEGs were found to be enriched in the regulation of the immune effector pathway, type 1 diabetes mellitus, and fatty acid degradation. PPI analysis data showed that five MCODE clusters could be considered as clinically significant modules and that 10 genes (IL1B, ITGB2, ITGAX, COL1A1, CSF1, CXCL12, SPP1, FN1, C3, and MMP2) were identified as “real” hub genes in the PPI network using algorithms such as Degree, MNC, and Closeness. The sensitivity and specificity of the SVM model for identifying patients with type 2 diabetes were 100%, with an area under the curve of 1 in the training as well as the validation dataset. CONCLUSION: Our results indicate that the SVM-based model developed by us can facilitate accurate diagnosis of type 2 diabetes. |
format | Online Article Text |
id | pubmed-8916865 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-89168652022-03-12 Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm Li, Jiabin Ding, Jieying Zhi, D. U. Gu, Kaiyun Wang, Hui Biomed Res Int Research Article BACKGROUND: Type 2 diabetes is a major health concern worldwide. The present study is aimed at discovering effective biomarkers for an efficient diagnosis of type 2 diabetes. METHODS: Differentially expressed genes (DEGs) between type 2 diabetes patients and normal controls were identified by analyses of integrated microarray data obtained from the Gene Expression Omnibus database using the Limma package. Functional analysis of genes was performed using the R software package clusterProfiler. Analyses of protein-protein interaction (PPI) performed using Cytoscape with the CytoHubba plugin were used to determine the most sensitive diagnostic gene biomarkers for type 2 diabetes in our study. The support vector machine (SVM) classification model was used to validate the gene biomarkers used for the diagnosis of type 2 diabetes. RESULTS: GSE164416 dataset analysis revealed 499 genes that were differentially expressed between type 2 diabetes patients and normal controls, and these DEGs were found to be enriched in the regulation of the immune effector pathway, type 1 diabetes mellitus, and fatty acid degradation. PPI analysis data showed that five MCODE clusters could be considered as clinically significant modules and that 10 genes (IL1B, ITGB2, ITGAX, COL1A1, CSF1, CXCL12, SPP1, FN1, C3, and MMP2) were identified as “real” hub genes in the PPI network using algorithms such as Degree, MNC, and Closeness. The sensitivity and specificity of the SVM model for identifying patients with type 2 diabetes were 100%, with an area under the curve of 1 in the training as well as the validation dataset. CONCLUSION: Our results indicate that the SVM-based model developed by us can facilitate accurate diagnosis of type 2 diabetes. Hindawi 2022-03-04 /pmc/articles/PMC8916865/ /pubmed/35281591 http://dx.doi.org/10.1155/2022/1230761 Text en Copyright © 2022 Jiabin Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Li, Jiabin Ding, Jieying Zhi, D. U. Gu, Kaiyun Wang, Hui Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm |
title | Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm |
title_full | Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm |
title_fullStr | Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm |
title_full_unstemmed | Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm |
title_short | Identification of Type 2 Diabetes Based on a Ten-Gene Biomarker Prediction Model Constructed Using a Support Vector Machine Algorithm |
title_sort | identification of type 2 diabetes based on a ten-gene biomarker prediction model constructed using a support vector machine algorithm |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8916865/ https://www.ncbi.nlm.nih.gov/pubmed/35281591 http://dx.doi.org/10.1155/2022/1230761 |
work_keys_str_mv | AT lijiabin identificationoftype2diabetesbasedonatengenebiomarkerpredictionmodelconstructedusingasupportvectormachinealgorithm AT dingjieying identificationoftype2diabetesbasedonatengenebiomarkerpredictionmodelconstructedusingasupportvectormachinealgorithm AT zhidu identificationoftype2diabetesbasedonatengenebiomarkerpredictionmodelconstructedusingasupportvectormachinealgorithm AT gukaiyun identificationoftype2diabetesbasedonatengenebiomarkerpredictionmodelconstructedusingasupportvectormachinealgorithm AT wanghui identificationoftype2diabetesbasedonatengenebiomarkerpredictionmodelconstructedusingasupportvectormachinealgorithm |