Cargando…
More accurate estimation of cell composition in bulk expression through robust integration of single-cell information
MOTIVATION: The rapid single-cell transcriptomic technology developments have led to an increasing interest in cellular heterogeneity within cell populations. Although cell-type proportions can be obtained directly from single-cell RNA sequencing (scRNA-seq), it is costly and not feasible in every s...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710693/ https://www.ncbi.nlm.nih.gov/pubmed/36699374 http://dx.doi.org/10.1093/bioadv/vbac049 |
_version_ | 1784841420081725440 |
---|---|
author | Karimnezhad, Ali |
author_facet | Karimnezhad, Ali |
author_sort | Karimnezhad, Ali |
collection | PubMed |
description | MOTIVATION: The rapid single-cell transcriptomic technology developments have led to an increasing interest in cellular heterogeneity within cell populations. Although cell-type proportions can be obtained directly from single-cell RNA sequencing (scRNA-seq), it is costly and not feasible in every study. Alternatively, with fewer experimental complications, cell-type compositions are characterized from bulk RNA-seq data. Many computational tools have been developed and reported in the literature. However, they fail to appropriately incorporate the covariance structures in both scRNA-seq and bulk RNA-seq datasets in use. RESULTS: We present a covariance-based single-cell decomposition (CSCD) method that estimates cell-type proportions in bulk data through building a reference expression profile based on a single-cell data, and learning gene-specific bulk expression transformations using a constrained linear inverse model. The approach is similar to Bisque, a cell-type decomposition method that was recently developed. Bisque is limited to a univariate model, thus unable to incorporate gene-gene correlations into the analysis. We introduce a more advanced model that successfully incorporates the covariance structures in both scRNA-seq and bulk RNA-seq datasets into the analysis, and fixes the collinearity issue by utilizing a linear shrinkage estimation of the corresponding covariance matrices. We applied CSCD to several publicly available datasets and measured the performance of CSCD, Bisque and six other common methods in the literature. Our results indicate that CSCD is more accurate and comprehensive than most of the existing methods. AVAILABILITY AND IMPLEMENTATION: The R package is available on https://github.com/empiricalbayes/CSCDRNA. |
format | Online Article Text |
id | pubmed-9710693 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-97106932023-01-24 More accurate estimation of cell composition in bulk expression through robust integration of single-cell information Karimnezhad, Ali Bioinform Adv Original Paper MOTIVATION: The rapid single-cell transcriptomic technology developments have led to an increasing interest in cellular heterogeneity within cell populations. Although cell-type proportions can be obtained directly from single-cell RNA sequencing (scRNA-seq), it is costly and not feasible in every study. Alternatively, with fewer experimental complications, cell-type compositions are characterized from bulk RNA-seq data. Many computational tools have been developed and reported in the literature. However, they fail to appropriately incorporate the covariance structures in both scRNA-seq and bulk RNA-seq datasets in use. RESULTS: We present a covariance-based single-cell decomposition (CSCD) method that estimates cell-type proportions in bulk data through building a reference expression profile based on a single-cell data, and learning gene-specific bulk expression transformations using a constrained linear inverse model. The approach is similar to Bisque, a cell-type decomposition method that was recently developed. Bisque is limited to a univariate model, thus unable to incorporate gene-gene correlations into the analysis. We introduce a more advanced model that successfully incorporates the covariance structures in both scRNA-seq and bulk RNA-seq datasets into the analysis, and fixes the collinearity issue by utilizing a linear shrinkage estimation of the corresponding covariance matrices. We applied CSCD to several publicly available datasets and measured the performance of CSCD, Bisque and six other common methods in the literature. Our results indicate that CSCD is more accurate and comprehensive than most of the existing methods. AVAILABILITY AND IMPLEMENTATION: The R package is available on https://github.com/empiricalbayes/CSCDRNA. Oxford University Press 2022-07-27 /pmc/articles/PMC9710693/ /pubmed/36699374 http://dx.doi.org/10.1093/bioadv/vbac049 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Karimnezhad, Ali More accurate estimation of cell composition in bulk expression through robust integration of single-cell information |
title | More accurate estimation of cell composition in bulk expression through robust integration of single-cell information |
title_full | More accurate estimation of cell composition in bulk expression through robust integration of single-cell information |
title_fullStr | More accurate estimation of cell composition in bulk expression through robust integration of single-cell information |
title_full_unstemmed | More accurate estimation of cell composition in bulk expression through robust integration of single-cell information |
title_short | More accurate estimation of cell composition in bulk expression through robust integration of single-cell information |
title_sort | more accurate estimation of cell composition in bulk expression through robust integration of single-cell information |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710693/ https://www.ncbi.nlm.nih.gov/pubmed/36699374 http://dx.doi.org/10.1093/bioadv/vbac049 |
work_keys_str_mv | AT karimnezhadali moreaccurateestimationofcellcompositioninbulkexpressionthroughrobustintegrationofsinglecellinformation |