Cargando…

Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data

BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 datas...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yalan, Cai, Linkun, Liu, Xiu, Wu, Yongjun, Xiang, Qin, Yu, Rong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AME Publishing Company 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577746/
https://www.ncbi.nlm.nih.gov/pubmed/36267740
http://dx.doi.org/10.21037/atm-22-4303
_version_ 1784811827333431296
author Huang, Yalan
Cai, Linkun
Liu, Xiu
Wu, Yongjun
Xiang, Qin
Yu, Rong
author_facet Huang, Yalan
Cai, Linkun
Liu, Xiu
Wu, Yongjun
Xiang, Qin
Yu, Rong
author_sort Huang, Yalan
collection PubMed
description BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 dataset including RNA-seq data for healthy and T2D patients, was obtained from the Gene Expression Omnibus (GEO) database. The potential functions of the differentially expressed genes (DEGs) were revealed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Moreover, biomarkers were screened out by the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm and receiver operating characteristic (ROC) analysis. Furthermore, single-cell RNA (sc-RNA)-seq data in the “E-MTAB-5061” dataset was downloaded from the ArrayExpress (European Bioinformatics Institute, EBI) database. Principal components analysis (PCA) and t-distributed stochastic neighbor embedding (tSNE) were used for dimensionality reduction analysis and cell clustering. The FindAllMarkers function was used annotate different cell clusters, and key cell clusters were screened by the expression levels of the biomarkers. Finally, the transcription factors (TFs) of the biomarkers were recognized. RESULTS: A total of 111 DEGs were screened in the GSE41762 dataset, which were mainly related to hormone secretion, specialized postsynaptic membrane, pancreatic secretion, JAK-STAT signaling pathway, and Ras signaling pathway. In addition, SLC2A2, SERPINF1, RASGRP1, and CHL1 were screened out as biomarkers of T2D, which possessed potential diagnostic value as AUC value greater than 0.8. A total of 1,515 T2D group cells and 1,817 healthy cohort cells were screened as core cells in the “E-MTAB-5061” dataset. Following tSNE dimensionality reduction cluster analysis, the core cells were divided into 13 cell clusters. According to the marker genes, the 13 cell clusters were annotated into six types of cells. Notably, SERPINF1 was highly expressed in fibroblasts and might be regulated by NR2F2 (nuclear receptor subfamily2, group F, and member 2). CONCLUSIONS: This study identified four biomarkers (SLC2A2, SERPINF1, RASGRP1, and CHL1) for T2D, which provided new markers for the clinical diagnosis of T2D. Among them, SERPINF1 might be regulated by NR2F2, which provides valuable insight into the pathogensis of T2D.
format Online
Article
Text
id pubmed-9577746
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-95777462022-10-19 Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data Huang, Yalan Cai, Linkun Liu, Xiu Wu, Yongjun Xiang, Qin Yu, Rong Ann Transl Med Original Article BACKGROUND: Type 2 diabetes (T2D) is a prevalent chronic disease with elusive. Combining transcriptome and single-cell sequencing data to explore biomarkers of T2D could provide new insights into the in-depth understanding of the molecular mechanisms and diagnosis of T2D. METHODS: The GSE41762 dataset including RNA-seq data for healthy and T2D patients, was obtained from the Gene Expression Omnibus (GEO) database. The potential functions of the differentially expressed genes (DEGs) were revealed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Moreover, biomarkers were screened out by the Least Absolute Shrinkage and Selection Operator (LASSO) algorithm and receiver operating characteristic (ROC) analysis. Furthermore, single-cell RNA (sc-RNA)-seq data in the “E-MTAB-5061” dataset was downloaded from the ArrayExpress (European Bioinformatics Institute, EBI) database. Principal components analysis (PCA) and t-distributed stochastic neighbor embedding (tSNE) were used for dimensionality reduction analysis and cell clustering. The FindAllMarkers function was used annotate different cell clusters, and key cell clusters were screened by the expression levels of the biomarkers. Finally, the transcription factors (TFs) of the biomarkers were recognized. RESULTS: A total of 111 DEGs were screened in the GSE41762 dataset, which were mainly related to hormone secretion, specialized postsynaptic membrane, pancreatic secretion, JAK-STAT signaling pathway, and Ras signaling pathway. In addition, SLC2A2, SERPINF1, RASGRP1, and CHL1 were screened out as biomarkers of T2D, which possessed potential diagnostic value as AUC value greater than 0.8. A total of 1,515 T2D group cells and 1,817 healthy cohort cells were screened as core cells in the “E-MTAB-5061” dataset. Following tSNE dimensionality reduction cluster analysis, the core cells were divided into 13 cell clusters. According to the marker genes, the 13 cell clusters were annotated into six types of cells. Notably, SERPINF1 was highly expressed in fibroblasts and might be regulated by NR2F2 (nuclear receptor subfamily2, group F, and member 2). CONCLUSIONS: This study identified four biomarkers (SLC2A2, SERPINF1, RASGRP1, and CHL1) for T2D, which provided new markers for the clinical diagnosis of T2D. Among them, SERPINF1 might be regulated by NR2F2, which provides valuable insight into the pathogensis of T2D. AME Publishing Company 2022-09 /pmc/articles/PMC9577746/ /pubmed/36267740 http://dx.doi.org/10.21037/atm-22-4303 Text en 2022 Annals of Translational Medicine. All rights reserved. https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0 (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Original Article
Huang, Yalan
Cai, Linkun
Liu, Xiu
Wu, Yongjun
Xiang, Qin
Yu, Rong
Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data
title Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data
title_full Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data
title_fullStr Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data
title_full_unstemmed Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data
title_short Exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on RNA-Seq and scRNA-Seq data
title_sort exploring biomarkers and transcriptional factors in type 2 diabetes by comprehensive bioinformatics analysis on rna-seq and scrna-seq data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9577746/
https://www.ncbi.nlm.nih.gov/pubmed/36267740
http://dx.doi.org/10.21037/atm-22-4303
work_keys_str_mv AT huangyalan exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata
AT cailinkun exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata
AT liuxiu exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata
AT wuyongjun exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata
AT xiangqin exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata
AT yurong exploringbiomarkersandtranscriptionalfactorsintype2diabetesbycomprehensivebioinformaticsanalysisonrnaseqandscrnaseqdata