Cargando…

An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data

Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omic...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Xifang, Sun, Shiquan, Yang, Sheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6830085/
https://www.ncbi.nlm.nih.gov/pubmed/31569701
http://dx.doi.org/10.3390/cells8101161
_version_ 1783465705833758720
author Sun, Xifang
Sun, Shiquan
Yang, Sheng
author_facet Sun, Xifang
Sun, Shiquan
Yang, Sheng
author_sort Sun, Xifang
collection PubMed
description Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omics Matrix Factorization (MOMF), to estimate the cell-type compositions of bulk RNA sequencing (RNA-seq) data by leveraging cell type-specific gene expression levels from single-cell RNA sequencing (scRNA-seq) data. MOMF not only directly models the count nature of gene expression data, but also effectively accounts for the uncertainty of cell type-specific mean gene expression levels. We demonstrate the benefits of MOMF through three real data applications, i.e., Glioblastomas (GBM), colorectal cancer (CRC) and type II diabetes (T2D) studies. MOMF is able to accurately estimate disease-related cell type proportions, i.e., oligodendrocyte progenitor cells and macrophage cells, which are strongly associated with the survival of GBM and CRC, respectively.
format Online
Article
Text
id pubmed-6830085
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-68300852019-11-18 An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data Sun, Xifang Sun, Shiquan Yang, Sheng Cells Article Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omics Matrix Factorization (MOMF), to estimate the cell-type compositions of bulk RNA sequencing (RNA-seq) data by leveraging cell type-specific gene expression levels from single-cell RNA sequencing (scRNA-seq) data. MOMF not only directly models the count nature of gene expression data, but also effectively accounts for the uncertainty of cell type-specific mean gene expression levels. We demonstrate the benefits of MOMF through three real data applications, i.e., Glioblastomas (GBM), colorectal cancer (CRC) and type II diabetes (T2D) studies. MOMF is able to accurately estimate disease-related cell type proportions, i.e., oligodendrocyte progenitor cells and macrophage cells, which are strongly associated with the survival of GBM and CRC, respectively. MDPI 2019-09-27 /pmc/articles/PMC6830085/ /pubmed/31569701 http://dx.doi.org/10.3390/cells8101161 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Sun, Xifang
Sun, Shiquan
Yang, Sheng
An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
title An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
title_full An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
title_fullStr An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
title_full_unstemmed An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
title_short An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
title_sort efficient and flexible method for deconvoluting bulk rna-seq data with single-cell rna-seq data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6830085/
https://www.ncbi.nlm.nih.gov/pubmed/31569701
http://dx.doi.org/10.3390/cells8101161
work_keys_str_mv AT sunxifang anefficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata
AT sunshiquan anefficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata
AT yangsheng anefficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata
AT sunxifang efficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata
AT sunshiquan efficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata
AT yangsheng efficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata