Cargando…

Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data

BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 datas...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yalan, Cai, Linkun, Liu, Xiu, Wu, Yongjun, Xiang, Qin, Yu, Rong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577746/
https://www.ncbi.nlm.nih.gov/pubmed/36267740
http://dx.doi.org/10.21037/atm-22-4303
Descripción
Sumario:BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 dataset including RNA-seq data for healthy and T2D patients, was obtained from the Gene Expression Omnibus (GEO) database. The potential functions of the differentially expressed genes (DEGs) were revealed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Moreover, biomarkers were screened out by the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm and receiver operating characteristic (ROC) analysis. Furthermore, single-cell RNA (sc-RNA)-seq data in the “E-MTAB-5061” dataset was downloaded from the ArrayExpress (European Bioinformatics Institute, EBI) database. Principal components analysis (PCA) and t-distributed stochastic neighbor embedding (tSNE) were used for dimensionality reduction analysis and cell clustering. The FindAllMarkers function was used annotate different cell clusters, and key cell clusters were screened by the expression levels of the biomarkers. Finally, the transcription factors (TFs) of the biomarkers were recognized. RESULTS: A total of 111 DEGs were screened in the GSE41762 dataset, which were mainly related to hormone secretion, specialized postsynaptic membrane, pancreatic secretion, JAK-STAT signaling pathway, and Ras signaling pathway. In addition, SLC2A2, SERPINF1, RASGRP1, and CHL1 were screened out as biomarkers of T2D, which possessed potential diagnostic value as AUC value greater than 0.8. A total of 1,515 T2D group cells and 1,817 healthy cohort cells were screened as core cells in the “E-MTAB-5061” dataset. Following tSNE dimensionality reduction cluster analysis, the core cells were divided into 13 cell clusters. According to the marker genes, the 13 cell clusters were annotated into six types of cells. Notably, SERPINF1 was highly expressed in fibroblasts and might be regulated by NR2F2 (nuclear receptor subfamily2, group F, and member 2). CONCLUSIONS: This study identified four biomarkers (SLC2A2, SERPINF1, RASGRP1, and CHL1) for T2D, which provided new markers for the clinical diagnosis of T2D. Among them, SERPINF1 might be regulated by NR2F2, which provides valuable insight into the pathogensis of T2D.