Cargando…

Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) has revolutionized the transcriptomics field by advancing analyses from tissue-level to cell-level resolution. Despite the great advances in the development of computational methods for various steps of scRNA-seq analyses, one major bottleneck of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Chonghui, Zhang, Yan, Gao, Xin, Wang, Guohua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354926/
https://www.ncbi.nlm.nih.gov/pubmed/37468850
http://dx.doi.org/10.1186/s12915-023-01658-3
_version_ 1785075027785285632
author Liu, Chonghui
Zhang, Yan
Gao, Xin
Wang, Guohua
author_facet Liu, Chonghui
Zhang, Yan
Gao, Xin
Wang, Guohua
author_sort Liu, Chonghui
collection PubMed
description BACKGROUND: Single-cell RNA sequencing (scRNA-seq) has revolutionized the transcriptomics field by advancing analyses from tissue-level to cell-level resolution. Despite the great advances in the development of computational methods for various steps of scRNA-seq analyses, one major bottleneck of the existing technologies remains in identifying the molecular relationship between disease phenotype and cell subpopulations, where “disease phenotype” refers to the clinical characteristics of each patient sample, and subpopulation refer to groups of single cells, which often do not correspond to clusters identified by standard single-cell clustering analysis. Here, we present PACSI, a method aimed at distinguishing cell subpopulations associated with disease phenotypes at the single-cell level. RESULTS: PACSI takes advantage of the topological properties of biological networks to introduce a proximity-based measure that quantifies the correlation between each cell and the disease phenotype of interest. Applied to simulated data and four case studies, PACSI accurately identified cells associated with disease phenotypes such as diagnosis, prognosis, and response to immunotherapy. In addition, we demonstrated that PACSI can also be applied to spatial transcriptomics data and successfully label spots that are associated with poor survival of breast carcinoma. CONCLUSIONS: PACSI is an efficient method to identify cell subpopulations associated with disease phenotypes. Our research shows that it has a broad range of applications in revealing mechanistic and clinical insights of diseases. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01658-3.
format Online
Article
Text
id pubmed-10354926
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103549262023-07-20 Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI Liu, Chonghui Zhang, Yan Gao, Xin Wang, Guohua BMC Biol Methodology Article BACKGROUND: Single-cell RNA sequencing (scRNA-seq) has revolutionized the transcriptomics field by advancing analyses from tissue-level to cell-level resolution. Despite the great advances in the development of computational methods for various steps of scRNA-seq analyses, one major bottleneck of the existing technologies remains in identifying the molecular relationship between disease phenotype and cell subpopulations, where “disease phenotype” refers to the clinical characteristics of each patient sample, and subpopulation refer to groups of single cells, which often do not correspond to clusters identified by standard single-cell clustering analysis. Here, we present PACSI, a method aimed at distinguishing cell subpopulations associated with disease phenotypes at the single-cell level. RESULTS: PACSI takes advantage of the topological properties of biological networks to introduce a proximity-based measure that quantifies the correlation between each cell and the disease phenotype of interest. Applied to simulated data and four case studies, PACSI accurately identified cells associated with disease phenotypes such as diagnosis, prognosis, and response to immunotherapy. In addition, we demonstrated that PACSI can also be applied to spatial transcriptomics data and successfully label spots that are associated with poor survival of breast carcinoma. CONCLUSIONS: PACSI is an efficient method to identify cell subpopulations associated with disease phenotypes. Our research shows that it has a broad range of applications in revealing mechanistic and clinical insights of diseases. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-023-01658-3. BioMed Central 2023-07-19 /pmc/articles/PMC10354926/ /pubmed/37468850 http://dx.doi.org/10.1186/s12915-023-01658-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Liu, Chonghui
Zhang, Yan
Gao, Xin
Wang, Guohua
Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI
title Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI
title_full Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI
title_fullStr Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI
title_full_unstemmed Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI
title_short Identification of cell subpopulations associated with disease phenotypes from scRNA-seq data using PACSI
title_sort identification of cell subpopulations associated with disease phenotypes from scrna-seq data using pacsi
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354926/
https://www.ncbi.nlm.nih.gov/pubmed/37468850
http://dx.doi.org/10.1186/s12915-023-01658-3
work_keys_str_mv AT liuchonghui identificationofcellsubpopulationsassociatedwithdiseasephenotypesfromscrnaseqdatausingpacsi
AT zhangyan identificationofcellsubpopulationsassociatedwithdiseasephenotypesfromscrnaseqdatausingpacsi
AT gaoxin identificationofcellsubpopulationsassociatedwithdiseasephenotypesfromscrnaseqdatausingpacsi
AT wangguohua identificationofcellsubpopulationsassociatedwithdiseasephenotypesfromscrnaseqdatausingpacsi