Cargando…

Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer

Background: The invention and development of single-cell technologies have contributed a lot to the understanding of tumor heterogeneity. The objective of this research was to investigate the differentially expressed genes (DEGs) between normal and tumor cells at the single-cell level and explore th...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Hanghang, Tian, Tian, Luo, Haihua, Jiang, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523223/
https://www.ncbi.nlm.nih.gov/pubmed/36186437
http://dx.doi.org/10.3389/fgene.2022.979829
_version_ 1784800226565947392
author Chen, Hanghang
Tian, Tian
Luo, Haihua
Jiang, Yong
author_facet Chen, Hanghang
Tian, Tian
Luo, Haihua
Jiang, Yong
author_sort Chen, Hanghang
collection PubMed
description Background: The invention and development of single-cell technologies have contributed a lot to the understanding of tumor heterogeneity. The objective of this research was to investigate the differentially expressed genes (DEGs) between normal and tumor cells at the single-cell level and explore the clinical application of these genes with bulk RNA-sequencing data in breast cancer. Methods: We collected single-cell, bulk RNA sequencing (RNA-seq) and microarray data from two public databases. Through single-cell analysis of 23,909 mammary gland cells from seven healthy donors and 33,138 tumor cells from seven breast cancer patients, cell type-specific DEGs between normal and tumor cells were identified. With these genes and the bulk RNA-seq data, we developed a prognostic signature and validated the efficacy in two independent cohorts. We also explored the differences of immune infiltration and tumor mutational burden (TMB) between the different risk groups. Results: A total of 6,175 cell-type-specific DEGs were obtained through the single-cell analysis between normal and tumor cells in breast cancer, of which 1,768 genes intersected with the bulk RNA-seq data. An 18-gene signature was constructed to assess the outcomes in breast cancer patients. The efficacy of the signature was notably prominent in two independent cohorts. The low-risk group showed higher immune infiltration and lower TMB. Among the 18 genes in the signature, 16 were also differentially expressed in the bulk RNA-seq dataset. Conclusion: Cell-type-specific DEGs between normal and tumor cells were identified through single-cell transcriptome data. The signature constructed with these DEGs could stratify patients efficiently. The signature was also closely correlated with immune infiltration and TMB. Nearly all the genes in the signature were also differentially expressed at the bulk RNA-seq level.
format Online
Article
Text
id pubmed-9523223
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95232232022-10-01 Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer Chen, Hanghang Tian, Tian Luo, Haihua Jiang, Yong Front Genet Genetics Background: The invention and development of single-cell technologies have contributed a lot to the understanding of tumor heterogeneity. The objective of this research was to investigate the differentially expressed genes (DEGs) between normal and tumor cells at the single-cell level and explore the clinical application of these genes with bulk RNA-sequencing data in breast cancer. Methods: We collected single-cell, bulk RNA sequencing (RNA-seq) and microarray data from two public databases. Through single-cell analysis of 23,909 mammary gland cells from seven healthy donors and 33,138 tumor cells from seven breast cancer patients, cell type-specific DEGs between normal and tumor cells were identified. With these genes and the bulk RNA-seq data, we developed a prognostic signature and validated the efficacy in two independent cohorts. We also explored the differences of immune infiltration and tumor mutational burden (TMB) between the different risk groups. Results: A total of 6,175 cell-type-specific DEGs were obtained through the single-cell analysis between normal and tumor cells in breast cancer, of which 1,768 genes intersected with the bulk RNA-seq data. An 18-gene signature was constructed to assess the outcomes in breast cancer patients. The efficacy of the signature was notably prominent in two independent cohorts. The low-risk group showed higher immune infiltration and lower TMB. Among the 18 genes in the signature, 16 were also differentially expressed in the bulk RNA-seq dataset. Conclusion: Cell-type-specific DEGs between normal and tumor cells were identified through single-cell transcriptome data. The signature constructed with these DEGs could stratify patients efficiently. The signature was also closely correlated with immune infiltration and TMB. Nearly all the genes in the signature were also differentially expressed at the bulk RNA-seq level. Frontiers Media S.A. 2022-09-16 /pmc/articles/PMC9523223/ /pubmed/36186437 http://dx.doi.org/10.3389/fgene.2022.979829 Text en Copyright © 2022 Chen, Tian, Luo and Jiang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Chen, Hanghang
Tian, Tian
Luo, Haihua
Jiang, Yong
Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer
title Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer
title_full Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer
title_fullStr Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer
title_full_unstemmed Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer
title_short Identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk RNA sequencing data in breast cancer
title_sort identification of differentially expressed genes at the single-cell level and prognosis prediction through bulk rna sequencing data in breast cancer
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523223/
https://www.ncbi.nlm.nih.gov/pubmed/36186437
http://dx.doi.org/10.3389/fgene.2022.979829
work_keys_str_mv AT chenhanghang identificationofdifferentiallyexpressedgenesatthesinglecelllevelandprognosispredictionthroughbulkrnasequencingdatainbreastcancer
AT tiantian identificationofdifferentiallyexpressedgenesatthesinglecelllevelandprognosispredictionthroughbulkrnasequencingdatainbreastcancer
AT luohaihua identificationofdifferentiallyexpressedgenesatthesinglecelllevelandprognosispredictionthroughbulkrnasequencingdatainbreastcancer
AT jiangyong identificationofdifferentiallyexpressedgenesatthesinglecelllevelandprognosispredictionthroughbulkrnasequencingdatainbreastcancer