Cargando…

Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data

When assessed over a large number of samples, bulk RNA sequencing provides reliable data for gene expression at the tissue level. Single-cell RNA sequencing (scRNA-seq) deepens those analyses by evaluating gene expression at the cellular level. Both data types lend insights into disease etiology. Wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jiebiao, Roeder, Kathryn, Devlin, Bernie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8494232/
https://www.ncbi.nlm.nih.gov/pubmed/33837133
http://dx.doi.org/10.1101/gr.268722.120
_version_ 1784579267623911424
author Wang, Jiebiao
Roeder, Kathryn
Devlin, Bernie
author_facet Wang, Jiebiao
Roeder, Kathryn
Devlin, Bernie
author_sort Wang, Jiebiao
collection PubMed
description When assessed over a large number of samples, bulk RNA sequencing provides reliable data for gene expression at the tissue level. Single-cell RNA sequencing (scRNA-seq) deepens those analyses by evaluating gene expression at the cellular level. Both data types lend insights into disease etiology. With current technologies, scRNA-seq data are known to be noisy. Constrained by costs, scRNA-seq data are typically generated from a relatively small number of subjects, which limits their utility for some analyses, such as identification of gene expression quantitative trait loci (eQTLs). To address these issues while maintaining the unique advantages of each data type, we develop a Bayesian method (bMIND) to integrate bulk and scRNA-seq data. With a prior derived from scRNA-seq data, we propose to estimate sample-level cell type–specific (CTS) expression from bulk expression data. The CTS expression enables large-scale sample-level downstream analyses, such as detection of CTS differentially expressed genes (DEGs) and eQTLs. Through simulations, we show that bMIND improves the accuracy of sample-level CTS expression estimates and increases the power to discover CTS DEGs when compared to existing methods. To further our understanding of two complex phenotypes, autism spectrum disorder and Alzheimer's disease, we apply bMIND to gene expression data of relevant brain tissue to identify CTS DEGs. Our results complement findings for CTS DEGs obtained from snRNA-seq studies, replicating certain DEGs in specific cell types while nominating other novel genes for those cell types. Finally, we calculate CTS eQTLs for 11 brain regions by analyzing Genotype-Tissue Expression Project data, creating a new resource for biological insights.
format Online
Article
Text
id pubmed-8494232
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-84942322021-10-07 Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data Wang, Jiebiao Roeder, Kathryn Devlin, Bernie Genome Res Method When assessed over a large number of samples, bulk RNA sequencing provides reliable data for gene expression at the tissue level. Single-cell RNA sequencing (scRNA-seq) deepens those analyses by evaluating gene expression at the cellular level. Both data types lend insights into disease etiology. With current technologies, scRNA-seq data are known to be noisy. Constrained by costs, scRNA-seq data are typically generated from a relatively small number of subjects, which limits their utility for some analyses, such as identification of gene expression quantitative trait loci (eQTLs). To address these issues while maintaining the unique advantages of each data type, we develop a Bayesian method (bMIND) to integrate bulk and scRNA-seq data. With a prior derived from scRNA-seq data, we propose to estimate sample-level cell type–specific (CTS) expression from bulk expression data. The CTS expression enables large-scale sample-level downstream analyses, such as detection of CTS differentially expressed genes (DEGs) and eQTLs. Through simulations, we show that bMIND improves the accuracy of sample-level CTS expression estimates and increases the power to discover CTS DEGs when compared to existing methods. To further our understanding of two complex phenotypes, autism spectrum disorder and Alzheimer's disease, we apply bMIND to gene expression data of relevant brain tissue to identify CTS DEGs. Our results complement findings for CTS DEGs obtained from snRNA-seq studies, replicating certain DEGs in specific cell types while nominating other novel genes for those cell types. Finally, we calculate CTS eQTLs for 11 brain regions by analyzing Genotype-Tissue Expression Project data, creating a new resource for biological insights. Cold Spring Harbor Laboratory Press 2021-10 /pmc/articles/PMC8494232/ /pubmed/33837133 http://dx.doi.org/10.1101/gr.268722.120 Text en © 2021 Wang et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by/4.0/This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Method
Wang, Jiebiao
Roeder, Kathryn
Devlin, Bernie
Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data
title Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data
title_full Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data
title_fullStr Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data
title_full_unstemmed Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data
title_short Bayesian estimation of cell type–specific gene expression with prior derived from single-cell data
title_sort bayesian estimation of cell type–specific gene expression with prior derived from single-cell data
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8494232/
https://www.ncbi.nlm.nih.gov/pubmed/33837133
http://dx.doi.org/10.1101/gr.268722.120
work_keys_str_mv AT wangjiebiao bayesianestimationofcelltypespecificgeneexpressionwithpriorderivedfromsinglecelldata
AT roederkathryn bayesianestimationofcelltypespecificgeneexpressionwithpriorderivedfromsinglecelldata
AT devlinbernie bayesianestimationofcelltypespecificgeneexpressionwithpriorderivedfromsinglecelldata