Cargando…

More accurate estimation of cell composition in bulk expression through robust integration of single-cell information

MOTIVATION: The rapid single-cell transcriptomic technology developments have led to an increasing interest in cellular heterogeneity within cell populations. Although cell-type proportions can be obtained directly from single-cell RNA sequencing (scRNA-seq), it is costly and not feasible in every s...

Descripción completa

Detalles Bibliográficos
Autor principal: Karimnezhad, Ali
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710693/
https://www.ncbi.nlm.nih.gov/pubmed/36699374
http://dx.doi.org/10.1093/bioadv/vbac049
_version_ 1784841420081725440
author Karimnezhad, Ali
author_facet Karimnezhad, Ali
author_sort Karimnezhad, Ali
collection PubMed
description MOTIVATION: The rapid single-cell transcriptomic technology developments have led to an increasing interest in cellular heterogeneity within cell populations. Although cell-type proportions can be obtained directly from single-cell RNA sequencing (scRNA-seq), it is costly and not feasible in every study. Alternatively, with fewer experimental complications, cell-type compositions are characterized from bulk RNA-seq data. Many computational tools have been developed and reported in the literature. However, they fail to appropriately incorporate the covariance structures in both scRNA-seq and bulk RNA-seq datasets in use. RESULTS: We present a covariance-based single-cell decomposition (CSCD) method that estimates cell-type proportions in bulk data through building a reference expression profile based on a single-cell data, and learning gene-specific bulk expression transformations using a constrained linear inverse model. The approach is similar to Bisque, a cell-type decomposition method that was recently developed. Bisque is limited to a univariate model, thus unable to incorporate gene-gene correlations into the analysis. We introduce a more advanced model that successfully incorporates the covariance structures in both scRNA-seq and bulk RNA-seq datasets into the analysis, and fixes the collinearity issue by utilizing a linear shrinkage estimation of the corresponding covariance matrices. We applied CSCD to several publicly available datasets and measured the performance of CSCD, Bisque and six other common methods in the literature. Our results indicate that CSCD is more accurate and comprehensive than most of the existing methods. AVAILABILITY AND IMPLEMENTATION: The R package is available on https://github.com/empiricalbayes/CSCDRNA.
format Online
Article
Text
id pubmed-9710693
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97106932023-01-24 More accurate estimation of cell composition in bulk expression through robust integration of single-cell information Karimnezhad, Ali Bioinform Adv Original Paper MOTIVATION: The rapid single-cell transcriptomic technology developments have led to an increasing interest in cellular heterogeneity within cell populations. Although cell-type proportions can be obtained directly from single-cell RNA sequencing (scRNA-seq), it is costly and not feasible in every study. Alternatively, with fewer experimental complications, cell-type compositions are characterized from bulk RNA-seq data. Many computational tools have been developed and reported in the literature. However, they fail to appropriately incorporate the covariance structures in both scRNA-seq and bulk RNA-seq datasets in use. RESULTS: We present a covariance-based single-cell decomposition (CSCD) method that estimates cell-type proportions in bulk data through building a reference expression profile based on a single-cell data, and learning gene-specific bulk expression transformations using a constrained linear inverse model. The approach is similar to Bisque, a cell-type decomposition method that was recently developed. Bisque is limited to a univariate model, thus unable to incorporate gene-gene correlations into the analysis. We introduce a more advanced model that successfully incorporates the covariance structures in both scRNA-seq and bulk RNA-seq datasets into the analysis, and fixes the collinearity issue by utilizing a linear shrinkage estimation of the corresponding covariance matrices. We applied CSCD to several publicly available datasets and measured the performance of CSCD, Bisque and six other common methods in the literature. Our results indicate that CSCD is more accurate and comprehensive than most of the existing methods. AVAILABILITY AND IMPLEMENTATION: The R package is available on https://github.com/empiricalbayes/CSCDRNA. Oxford University Press 2022-07-27 /pmc/articles/PMC9710693/ /pubmed/36699374 http://dx.doi.org/10.1093/bioadv/vbac049 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Karimnezhad, Ali
More accurate estimation of cell composition in bulk expression through robust integration of single-cell information
title More accurate estimation of cell composition in bulk expression through robust integration of single-cell information
title_full More accurate estimation of cell composition in bulk expression through robust integration of single-cell information
title_fullStr More accurate estimation of cell composition in bulk expression through robust integration of single-cell information
title_full_unstemmed More accurate estimation of cell composition in bulk expression through robust integration of single-cell information
title_short More accurate estimation of cell composition in bulk expression through robust integration of single-cell information
title_sort more accurate estimation of cell composition in bulk expression through robust integration of single-cell information
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710693/
https://www.ncbi.nlm.nih.gov/pubmed/36699374
http://dx.doi.org/10.1093/bioadv/vbac049
work_keys_str_mv AT karimnezhadali moreaccurateestimationofcellcompositioninbulkexpressionthroughrobustintegrationofsinglecellinformation