Cargando…

Likelihood-based deconvolution of bulk gene expression data using single-cell references

Direct comparison of bulk gene expression profiles is complicated by distinct cell type mixtures in each sample that obscure whether observed differences are actually caused by changes in the expression levels themselves or are simply a result of differing cell type compositions. Single-cell technol...

Descripción completa

Detalles Bibliográficos
Autores principales: Erdmann-Pham, Dan D., Fischer, Jonathan, Hong, Justin, Song, Yun S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8494215/
https://www.ncbi.nlm.nih.gov/pubmed/34301624
http://dx.doi.org/10.1101/gr.272344.120
_version_ 1784579263607865344
author Erdmann-Pham, Dan D.
Fischer, Jonathan
Hong, Justin
Song, Yun S.
author_facet Erdmann-Pham, Dan D.
Fischer, Jonathan
Hong, Justin
Song, Yun S.
author_sort Erdmann-Pham, Dan D.
collection PubMed
description Direct comparison of bulk gene expression profiles is complicated by distinct cell type mixtures in each sample that obscure whether observed differences are actually caused by changes in the expression levels themselves or are simply a result of differing cell type compositions. Single-cell technology has made it possible to measure gene expression in individual cells, achieving higher resolution at the expense of increased noise. If carefully incorporated, such single-cell data can be used to deconvolve bulk samples to yield accurate estimates of the true cell type proportions, thus enabling one to disentangle the effects of differential expression and cell type mixtures. Here, we propose a generative model and a likelihood-based inference method that uses asymptotic statistical theory and a novel optimization procedure to perform deconvolution of bulk RNA-seq data to produce accurate cell type proportion estimates. We show the effectiveness of our method, called RNA-Sieve, across a diverse array of scenarios involving real data and discuss extensions made uniquely possible by our probabilistic framework, including a demonstration of well-calibrated confidence intervals.
format Online
Article
Text
id pubmed-8494215
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-84942152022-04-01 Likelihood-based deconvolution of bulk gene expression data using single-cell references Erdmann-Pham, Dan D. Fischer, Jonathan Hong, Justin Song, Yun S. Genome Res Method Direct comparison of bulk gene expression profiles is complicated by distinct cell type mixtures in each sample that obscure whether observed differences are actually caused by changes in the expression levels themselves or are simply a result of differing cell type compositions. Single-cell technology has made it possible to measure gene expression in individual cells, achieving higher resolution at the expense of increased noise. If carefully incorporated, such single-cell data can be used to deconvolve bulk samples to yield accurate estimates of the true cell type proportions, thus enabling one to disentangle the effects of differential expression and cell type mixtures. Here, we propose a generative model and a likelihood-based inference method that uses asymptotic statistical theory and a novel optimization procedure to perform deconvolution of bulk RNA-seq data to produce accurate cell type proportion estimates. We show the effectiveness of our method, called RNA-Sieve, across a diverse array of scenarios involving real data and discuss extensions made uniquely possible by our probabilistic framework, including a demonstration of well-calibrated confidence intervals. Cold Spring Harbor Laboratory Press 2021-10 /pmc/articles/PMC8494215/ /pubmed/34301624 http://dx.doi.org/10.1101/gr.272344.120 Text en © 2021 Erdmann-Pham et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Method
Erdmann-Pham, Dan D.
Fischer, Jonathan
Hong, Justin
Song, Yun S.
Likelihood-based deconvolution of bulk gene expression data using single-cell references
title Likelihood-based deconvolution of bulk gene expression data using single-cell references
title_full Likelihood-based deconvolution of bulk gene expression data using single-cell references
title_fullStr Likelihood-based deconvolution of bulk gene expression data using single-cell references
title_full_unstemmed Likelihood-based deconvolution of bulk gene expression data using single-cell references
title_short Likelihood-based deconvolution of bulk gene expression data using single-cell references
title_sort likelihood-based deconvolution of bulk gene expression data using single-cell references
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8494215/
https://www.ncbi.nlm.nih.gov/pubmed/34301624
http://dx.doi.org/10.1101/gr.272344.120
work_keys_str_mv AT erdmannphamdand likelihoodbaseddeconvolutionofbulkgeneexpressiondatausingsinglecellreferences
AT fischerjonathan likelihoodbaseddeconvolutionofbulkgeneexpressiondatausingsinglecellreferences
AT hongjustin likelihoodbaseddeconvolutionofbulkgeneexpressiondatausingsinglecellreferences
AT songyuns likelihoodbaseddeconvolutionofbulkgeneexpressiondatausingsinglecellreferences