Cargando…

Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types

Despite the remarkable progress in probing tumor transcriptomic heterogeneity by single-cell RNA sequencing (sc-RNAseq) data, several gaps exist in prior studies. Tumor heterogeneity is frequently mentioned but not quantified. Clustering analyses typically target cells rather than genes, and differe...

Descripción completa

Detalles Bibliográficos
Autores principales: Tiong, Khong-Loon, Lin, Yu-Wei, Yeang, Chen-Hsiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Company of Biologists Ltd 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235070/
https://www.ncbi.nlm.nih.gov/pubmed/35665803
http://dx.doi.org/10.1242/bio.059256
_version_ 1784736227609542656
author Tiong, Khong-Loon
Lin, Yu-Wei
Yeang, Chen-Hsiang
author_facet Tiong, Khong-Loon
Lin, Yu-Wei
Yeang, Chen-Hsiang
author_sort Tiong, Khong-Loon
collection PubMed
description Despite the remarkable progress in probing tumor transcriptomic heterogeneity by single-cell RNA sequencing (sc-RNAseq) data, several gaps exist in prior studies. Tumor heterogeneity is frequently mentioned but not quantified. Clustering analyses typically target cells rather than genes, and differential levels of transcriptomic heterogeneity of gene clusters are not characterized. Relations between gene clusters inferred from multiple datasets remain less explored. We provided a series of quantitative methods to analyze cancer sc-RNAseq data. First, we proposed two quantitative measures to assess intra-tumoral heterogeneity/homogeneity. Second, we established a hierarchy of gene clusters from sc-RNAseq data, devised an algorithm to reduce the gene cluster hierarchy to a compact structure, and characterized the gene clusters with functional enrichment and heterogeneity. Third, we developed an algorithm to align the gene cluster hierarchies from multiple datasets to a small number of meta gene clusters. By applying these methods to nine cancer sc-RNAseq datasets, we discovered that cancer cell transcriptomes were more homogeneous within tumors than the accompanying normal cells. Furthermore, many gene clusters from the nine datasets were aligned to two large meta gene clusters, which had high and low heterogeneity and were enriched with distinct functions. Finally, we found the homogeneous meta gene cluster retained stronger expression coherence and associations with survival times in bulk level RNAseq data than the heterogeneous meta gene cluster, yet the combinatorial expression patterns of breast cancer subtypes in bulk level data were not preserved in single-cell data. The inference outcomes derived from nine cancer sc-RNAseq datasets provide insights about the contributing factors for transcriptomic heterogeneity of cancer cells and complex relations between bulk level and single-cell RNAseq data. They demonstrate the utility of our methods to enable a comprehensive characterization of co-expressed gene clusters in a wide range of sc-RNAseq data in cancers and beyond.
format Online
Article
Text
id pubmed-9235070
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Company of Biologists Ltd
record_format MEDLINE/PubMed
spelling pubmed-92350702022-06-27 Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types Tiong, Khong-Loon Lin, Yu-Wei Yeang, Chen-Hsiang Biol Open Research Article Despite the remarkable progress in probing tumor transcriptomic heterogeneity by single-cell RNA sequencing (sc-RNAseq) data, several gaps exist in prior studies. Tumor heterogeneity is frequently mentioned but not quantified. Clustering analyses typically target cells rather than genes, and differential levels of transcriptomic heterogeneity of gene clusters are not characterized. Relations between gene clusters inferred from multiple datasets remain less explored. We provided a series of quantitative methods to analyze cancer sc-RNAseq data. First, we proposed two quantitative measures to assess intra-tumoral heterogeneity/homogeneity. Second, we established a hierarchy of gene clusters from sc-RNAseq data, devised an algorithm to reduce the gene cluster hierarchy to a compact structure, and characterized the gene clusters with functional enrichment and heterogeneity. Third, we developed an algorithm to align the gene cluster hierarchies from multiple datasets to a small number of meta gene clusters. By applying these methods to nine cancer sc-RNAseq datasets, we discovered that cancer cell transcriptomes were more homogeneous within tumors than the accompanying normal cells. Furthermore, many gene clusters from the nine datasets were aligned to two large meta gene clusters, which had high and low heterogeneity and were enriched with distinct functions. Finally, we found the homogeneous meta gene cluster retained stronger expression coherence and associations with survival times in bulk level RNAseq data than the heterogeneous meta gene cluster, yet the combinatorial expression patterns of breast cancer subtypes in bulk level data were not preserved in single-cell data. The inference outcomes derived from nine cancer sc-RNAseq datasets provide insights about the contributing factors for transcriptomic heterogeneity of cancer cells and complex relations between bulk level and single-cell RNAseq data. They demonstrate the utility of our methods to enable a comprehensive characterization of co-expressed gene clusters in a wide range of sc-RNAseq data in cancers and beyond. The Company of Biologists Ltd 2022-06-23 /pmc/articles/PMC9235070/ /pubmed/35665803 http://dx.doi.org/10.1242/bio.059256 Text en © 2022. Published by The Company of Biologists Ltd https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed.
spellingShingle Research Article
Tiong, Khong-Loon
Lin, Yu-Wei
Yeang, Chen-Hsiang
Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
title Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
title_full Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
title_fullStr Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
title_full_unstemmed Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
title_short Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
title_sort characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235070/
https://www.ncbi.nlm.nih.gov/pubmed/35665803
http://dx.doi.org/10.1242/bio.059256
work_keys_str_mv AT tiongkhongloon characterizationofgeneclusterheterogeneityinsinglecelltranscriptomicdatawithinandacrosscancertypes
AT linyuwei characterizationofgeneclusterheterogeneityinsinglecelltranscriptomicdatawithinandacrosscancertypes
AT yeangchenhsiang characterizationofgeneclusterheterogeneityinsinglecelltranscriptomicdatawithinandacrosscancertypes