Cargando…
Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data
BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 datas...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577746/ https://www.ncbi.nlm.nih.gov/pubmed/36267740 http://dx.doi.org/10.21037/atm-22-4303 |
_version_ | 1784811827333431296 |
---|---|
author | Huang, Yalan Cai, Linkun Liu, Xiu Wu, Yongjun Xiang, Qin Yu, Rong |
author_facet | Huang, Yalan Cai, Linkun Liu, Xiu Wu, Yongjun Xiang, Qin Yu, Rong |
author_sort | Huang, Yalan |
collection | PubMed |
description | BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 dataset including RNA-seq data for healthy and T2D patients, was obtained from the Gene Expression Omnibus (GEO) database. The potential functions of the differentially expressed genes (DEGs) were revealed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Moreover, biomarkers were screened out by the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm and receiver operating characteristic (ROC) analysis. Furthermore, single-cell RNA (sc-RNA)-seq data in the “E-MTAB-5061” dataset was downloaded from the ArrayExpress (European Bioinformatics Institute, EBI) database. Principal components analysis (PCA) and t-distributed stochastic neighbor embedding (tSNE) were used for dimensionality reduction analysis and cell clustering. The FindAllMarkers function was used annotate different cell clusters, and key cell clusters were screened by the expression levels of the biomarkers. Finally, the transcription factors (TFs) of the biomarkers were recognized. RESULTS: A total of 111 DEGs were screened in the GSE41762 dataset, which were mainly related to hormone secretion, specialized postsynaptic membrane, pancreatic secretion, JAK-STAT signaling pathway, and Ras signaling pathway. In addition, SLC2A2, SERPINF1, RASGRP1, and CHL1 were screened out as biomarkers of T2D, which possessed potential diagnostic value as AUC value greater than 0.8. A total of 1,515 T2D group cells and 1,817 healthy cohort cells were screened as core cells in the “E-MTAB-5061” dataset. Following tSNE dimensionality reduction cluster analysis, the core cells were divided into 13 cell clusters. According to the marker genes, the 13 cell clusters were annotated into six types of cells. Notably, SERPINF1 was highly expressed in fibroblasts and might be regulated by NR2F2 (nuclear receptor subfamily2, group F, and member 2). CONCLUSIONS: This study identified four biomarkers (SLC2A2, SERPINF1, RASGRP1, and CHL1) for T2D, which provided new markers for the clinical diagnosis of T2D. Among them, SERPINF1 might be regulated by NR2F2, which provides valuable insight into the pathogensis of T2D. |
format | Online Article Text |
id | pubmed-9577746 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-95777462022-10-19 Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data Huang, Yalan Cai, Linkun Liu, Xiu Wu, Yongjun Xiang, Qin Yu, Rong Ann Transl Med Original Article BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 dataset including RNA-seq data for healthy and T2D patients, was obtained from the Gene Expression Omnibus (GEO) database. The potential functions of the differentially expressed genes (DEGs) were revealed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Moreover, biomarkers were screened out by the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm and receiver operating characteristic (ROC) analysis. Furthermore, single-cell RNA (sc-RNA)-seq data in the “E-MTAB-5061” dataset was downloaded from the ArrayExpress (European Bioinformatics Institute, EBI) database. Principal components analysis (PCA) and t-distributed stochastic neighbor embedding (tSNE) were used for dimensionality reduction analysis and cell clustering. The FindAllMarkers function was used annotate different cell clusters, and key cell clusters were screened by the expression levels of the biomarkers. Finally, the transcription factors (TFs) of the biomarkers were recognized. RESULTS: A total of 111 DEGs were screened in the GSE41762 dataset, which were mainly related to hormone secretion, specialized postsynaptic membrane, pancreatic secretion, JAK-STAT signaling pathway, and Ras signaling pathway. In addition, SLC2A2, SERPINF1, RASGRP1, and CHL1 were screened out as biomarkers of T2D, which possessed potential diagnostic value as AUC value greater than 0.8. A total of 1,515 T2D group cells and 1,817 healthy cohort cells were screened as core cells in the “E-MTAB-5061” dataset. Following tSNE dimensionality reduction cluster analysis, the core cells were divided into 13 cell clusters. According to the marker genes, the 13 cell clusters were annotated into six types of cells. Notably, SERPINF1 was highly expressed in fibroblasts and might be regulated by NR2F2 (nuclear receptor subfamily2, group F, and member 2). CONCLUSIONS: This study identified four biomarkers (SLC2A2, SERPINF1, RASGRP1, and CHL1) for T2D, which provided new markers for the clinical diagnosis of T2D. Among them, SERPINF1 might be regulated by NR2F2, which provides valuable insight into the pathogensis of T2D. AME Publishing Company 2022-09 /pmc/articles/PMC9577746/ /pubmed/36267740 http://dx.doi.org/10.21037/atm-22-4303 Text en 2022 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Original Article Huang, Yalan Cai, Linkun Liu, Xiu Wu, Yongjun Xiang, Qin Yu, Rong Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data |
title | Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data |
title_full | Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data |
title_fullStr | Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data |
title_full_unstemmed | Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data |
title_short | Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data |
title_sort | exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on rna-seq and scrna-seq data |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577746/ https://www.ncbi.nlm.nih.gov/pubmed/36267740 http://dx.doi.org/10.21037/atm-22-4303 |
work_keys_str_mv | AT huangyalan exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata AT cailinkun exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata AT liuxiu exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata AT wuyongjun exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata AT xiangqin exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata AT yurong exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata |