Cargando…
An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omic...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6830085/ https://www.ncbi.nlm.nih.gov/pubmed/31569701 http://dx.doi.org/10.3390/cells8101161 |
_version_ | 1783465705833758720 |
---|---|
author | Sun, Xifang Sun, Shiquan Yang, Sheng |
author_facet | Sun, Xifang Sun, Shiquan Yang, Sheng |
author_sort | Sun, Xifang |
collection | PubMed |
description | Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omics Matrix Factorization (MOMF), to estimate the cell-type compositions of bulk RNA sequencing (RNA-seq) data by leveraging cell type-specific gene expression levels from single-cell RNA sequencing (scRNA-seq) data. MOMF not only directly models the count nature of gene expression data, but also effectively accounts for the uncertainty of cell type-specific mean gene expression levels. We demonstrate the benefits of MOMF through three real data applications, i.e., Glioblastomas (GBM), colorectal cancer (CRC) and type II diabetes (T2D) studies. MOMF is able to accurately estimate disease-related cell type proportions, i.e., oligodendrocyte progenitor cells and macrophage cells, which are strongly associated with the survival of GBM and CRC, respectively. |
format | Online Article Text |
id | pubmed-6830085 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-68300852019-11-18 An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data Sun, Xifang Sun, Shiquan Yang, Sheng Cells Article Estimating cell type compositions for complex diseases is an important step to investigate the cellular heterogeneity for understanding disease etiology and potentially facilitate early disease diagnosis and prevention. Here, we developed a computationally statistical method, referring to Multi-Omics Matrix Factorization (MOMF), to estimate the cell-type compositions of bulk RNA sequencing (RNA-seq) data by leveraging cell type-specific gene expression levels from single-cell RNA sequencing (scRNA-seq) data. MOMF not only directly models the count nature of gene expression data, but also effectively accounts for the uncertainty of cell type-specific mean gene expression levels. We demonstrate the benefits of MOMF through three real data applications, i.e., Glioblastomas (GBM), colorectal cancer (CRC) and type II diabetes (T2D) studies. MOMF is able to accurately estimate disease-related cell type proportions, i.e., oligodendrocyte progenitor cells and macrophage cells, which are strongly associated with the survival of GBM and CRC, respectively. MDPI 2019-09-27 /pmc/articles/PMC6830085/ /pubmed/31569701 http://dx.doi.org/10.3390/cells8101161 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Sun, Xifang Sun, Shiquan Yang, Sheng An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data |
title | An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data |
title_full | An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data |
title_fullStr | An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data |
title_full_unstemmed | An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data |
title_short | An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data |
title_sort | efficient and flexible method for deconvoluting bulk rna-seq data with single-cell rna-seq data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6830085/ https://www.ncbi.nlm.nih.gov/pubmed/31569701 http://dx.doi.org/10.3390/cells8101161 |
work_keys_str_mv | AT sunxifang anefficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata AT sunshiquan anefficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata AT yangsheng anefficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata AT sunxifang efficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata AT sunshiquan efficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata AT yangsheng efficientandflexiblemethodfordeconvolutingbulkrnaseqdatawithsinglecellrnaseqdata |