Cargando…
Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types
Despite the remarkable progress in probing tumor transcriptomic heterogeneity by single-cell RNA sequencing (sc-RNAseq) data, several gaps exist in prior studies. Tumor heterogeneity is frequently mentioned but not quantified. Clustering analyses typically target cells rather than genes, and differe...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Company of Biologists Ltd
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235070/ https://www.ncbi.nlm.nih.gov/pubmed/35665803 http://dx.doi.org/10.1242/bio.059256 |
_version_ | 1784736227609542656 |
---|---|
author | Tiong, Khong-Loon Lin, Yu-Wei Yeang, Chen-Hsiang |
author_facet | Tiong, Khong-Loon Lin, Yu-Wei Yeang, Chen-Hsiang |
author_sort | Tiong, Khong-Loon |
collection | PubMed |
description | Despite the remarkable progress in probing tumor transcriptomic heterogeneity by single-cell RNA sequencing (sc-RNAseq) data, several gaps exist in prior studies. Tumor heterogeneity is frequently mentioned but not quantified. Clustering analyses typically target cells rather than genes, and differential levels of transcriptomic heterogeneity of gene clusters are not characterized. Relations between gene clusters inferred from multiple datasets remain less explored. We provided a series of quantitative methods to analyze cancer sc-RNAseq data. First, we proposed two quantitative measures to assess intra-tumoral heterogeneity/homogeneity. Second, we established a hierarchy of gene clusters from sc-RNAseq data, devised an algorithm to reduce the gene cluster hierarchy to a compact structure, and characterized the gene clusters with functional enrichment and heterogeneity. Third, we developed an algorithm to align the gene cluster hierarchies from multiple datasets to a small number of meta gene clusters. By applying these methods to nine cancer sc-RNAseq datasets, we discovered that cancer cell transcriptomes were more homogeneous within tumors than the accompanying normal cells. Furthermore, many gene clusters from the nine datasets were aligned to two large meta gene clusters, which had high and low heterogeneity and were enriched with distinct functions. Finally, we found the homogeneous meta gene cluster retained stronger expression coherence and associations with survival times in bulk level RNAseq data than the heterogeneous meta gene cluster, yet the combinatorial expression patterns of breast cancer subtypes in bulk level data were not preserved in single-cell data. The inference outcomes derived from nine cancer sc-RNAseq datasets provide insights about the contributing factors for transcriptomic heterogeneity of cancer cells and complex relations between bulk level and single-cell RNAseq data. They demonstrate the utility of our methods to enable a comprehensive characterization of co-expressed gene clusters in a wide range of sc-RNAseq data in cancers and beyond. |
format | Online Article Text |
id | pubmed-9235070 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Company of Biologists Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-92350702022-06-27 Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types Tiong, Khong-Loon Lin, Yu-Wei Yeang, Chen-Hsiang Biol Open Research Article Despite the remarkable progress in probing tumor transcriptomic heterogeneity by single-cell RNA sequencing (sc-RNAseq) data, several gaps exist in prior studies. Tumor heterogeneity is frequently mentioned but not quantified. Clustering analyses typically target cells rather than genes, and differential levels of transcriptomic heterogeneity of gene clusters are not characterized. Relations between gene clusters inferred from multiple datasets remain less explored. We provided a series of quantitative methods to analyze cancer sc-RNAseq data. First, we proposed two quantitative measures to assess intra-tumoral heterogeneity/homogeneity. Second, we established a hierarchy of gene clusters from sc-RNAseq data, devised an algorithm to reduce the gene cluster hierarchy to a compact structure, and characterized the gene clusters with functional enrichment and heterogeneity. Third, we developed an algorithm to align the gene cluster hierarchies from multiple datasets to a small number of meta gene clusters. By applying these methods to nine cancer sc-RNAseq datasets, we discovered that cancer cell transcriptomes were more homogeneous within tumors than the accompanying normal cells. Furthermore, many gene clusters from the nine datasets were aligned to two large meta gene clusters, which had high and low heterogeneity and were enriched with distinct functions. Finally, we found the homogeneous meta gene cluster retained stronger expression coherence and associations with survival times in bulk level RNAseq data than the heterogeneous meta gene cluster, yet the combinatorial expression patterns of breast cancer subtypes in bulk level data were not preserved in single-cell data. The inference outcomes derived from nine cancer sc-RNAseq datasets provide insights about the contributing factors for transcriptomic heterogeneity of cancer cells and complex relations between bulk level and single-cell RNAseq data. They demonstrate the utility of our methods to enable a comprehensive characterization of co-expressed gene clusters in a wide range of sc-RNAseq data in cancers and beyond. The Company of Biologists Ltd 2022-06-23 /pmc/articles/PMC9235070/ /pubmed/35665803 http://dx.doi.org/10.1242/bio.059256 Text en © 2022. Published by The Company of Biologists Ltd https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution and reproduction in any medium provided that the original work is properly attributed. |
spellingShingle | Research Article Tiong, Khong-Loon Lin, Yu-Wei Yeang, Chen-Hsiang Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types |
title | Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types |
title_full | Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types |
title_fullStr | Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types |
title_full_unstemmed | Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types |
title_short | Characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types |
title_sort | characterization of gene cluster heterogeneity in single-cell transcriptomic data within and across cancer types |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9235070/ https://www.ncbi.nlm.nih.gov/pubmed/35665803 http://dx.doi.org/10.1242/bio.059256 |
work_keys_str_mv | AT tiongkhongloon characterizationofgeneclusterheterogeneityinsinglecelltranscriptomicdatawithinandacrosscancertypes AT linyuwei characterizationofgeneclusterheterogeneityinsinglecelltranscriptomicdatawithinandacrosscancertypes AT yeangchenhsiang characterizationofgeneclusterheterogeneityinsinglecelltranscriptomicdatawithinandacrosscancertypes |