Bayesian log-normal deconvolution for enhanced in silico microdissection of bulk gene expression data

Bibliographic Details
Main Authors: Andrade Barbosa, Bárbara; van Asten, Saskia D.; Oh, Ji Won; Farina-Sarasqueta, Arantza; Verheij, Joanne; Dijk, Frederike; van Laarhoven, Hanneke W. M.; Ylstra, Bauke; Garcia Vallejo, Juan J.; van de Wiel, Mark A.; Kim, Yongsoo
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2021
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8528834/
https://www.ncbi.nlm.nih.gov/pubmed/34671028
http://dx.doi.org/10.1038/s41467-021-26328-2
Description
Summary: Deconvolving bulk gene expression profiles into their cellular components is pivotal to portraying a tissue’s complex cellular make-up, as in the tumor microenvironment. However, the inherently variable nature of gene expression requires a comprehensive statistical model and reliable prior knowledge of individual cell types, which can be obtained from single-cell RNA sequencing. We introduce BLADE (Bayesian Log-normAl Deconvolution), a unified Bayesian framework that estimates both the cellular composition and the gene expression profile of each cell type. Unlike previous comprehensive statistical approaches, BLADE can handle more than 20 cell types thanks to efficient variational inference. In an intensive evaluation on more than 700 simulated and real datasets, BLADE demonstrated greater robustness to gene expression variability and more complete results than conventional methods, particularly in reconstructing the gene expression profile of each cell type. In summary, BLADE is a powerful tool for unraveling heterogeneous cellular activity in complex biological systems from standard bulk gene expression data.
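
The deconvolution problem described in the summary can be illustrated with a toy sketch. This is not BLADE's algorithm: BLADE uses variational inference, handles many cell types, and also reconstructs per-cell-type profiles. The sketch only assumes that each cell type's expression is log-normally distributed around a known reference profile and that the bulk signal is a fraction-weighted sum on the linear scale. It then recovers the mixing fraction of two cell types by ordinary least squares. All function names, the profile ranges, and the noise level are hypothetical choices made for illustration.

```python
import math
import random


def make_profiles(n_genes, rng):
    """Hypothetical log-scale reference profiles for two cell types."""
    profile_a = [rng.uniform(0.0, 3.0) for _ in range(n_genes)]
    profile_b = [rng.uniform(0.0, 3.0) for _ in range(n_genes)]
    return profile_a, profile_b


def simulate_bulk(fraction_a, profile_a, profile_b, sigma, rng):
    """Mix two cell types on the linear scale with log-normal variability."""
    bulk = []
    for mu_a, mu_b in zip(profile_a, profile_b):
        # Each cell type's expression is log-normal: exp(Normal(mu, sigma)).
        x_a = rng.lognormvariate(mu_a, sigma)
        x_b = rng.lognormvariate(mu_b, sigma)
        bulk.append(fraction_a * x_a + (1.0 - fraction_a) * x_b)
    return bulk


def estimate_fraction(bulk, profile_a, profile_b, sigma):
    """Least-squares estimate of the type-A fraction, clipped to [0, 1]."""
    # The mean of a log-normal variable is exp(mu + sigma^2 / 2).
    shift = 0.5 * sigma * sigma
    sig_a = [math.exp(m + shift) for m in profile_a]
    sig_b = [math.exp(m + shift) for m in profile_b]
    # Model: bulk = sig_b + f * (sig_a - sig_b); solve for f in closed form.
    num = sum((y - b) * (a - b) for y, a, b in zip(bulk, sig_a, sig_b))
    den = sum((a - b) ** 2 for a, b in zip(sig_a, sig_b))
    return min(1.0, max(0.0, num / den))


if __name__ == "__main__":
    rng = random.Random(0)
    prof_a, prof_b = make_profiles(500, rng)
    bulk = simulate_bulk(0.3, prof_a, prof_b, sigma=0.05, rng=rng)
    print(estimate_fraction(bulk, prof_a, prof_b, sigma=0.05))
```

With low variability the least-squares estimate stays close to the true fraction. As `sigma` grows, this simple point estimate degrades, which is the kind of gene expression variability the summary says a fuller statistical model is meant to address.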