Cargando…

CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data

Quantifying cell-type proportions and their corresponding gene expression profiles in tissue samples would enhance understanding of the contributions of individual cell types to the physiological states of the tissue. Current approaches that address tissue heterogeneity have drawbacks. Experimental...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Kai, Meng, Qian, Shats, Igor, Umbach, David M., Li, Melissa, Li, Yuanyuan, Li, Xiaoling, Li, Leping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6907860/
https://www.ncbi.nlm.nih.gov/pubmed/31790389
http://dx.doi.org/10.1371/journal.pcbi.1007510
_version_ 1783478613987819520
author Kang, Kai
Meng, Qian
Shats, Igor
Umbach, David M.
Li, Melissa
Li, Yuanyuan
Li, Xiaoling
Li, Leping
author_facet Kang, Kai
Meng, Qian
Shats, Igor
Umbach, David M.
Li, Melissa
Li, Yuanyuan
Li, Xiaoling
Li, Leping
author_sort Kang, Kai
collection PubMed
description Quantifying cell-type proportions and their corresponding gene expression profiles in tissue samples would enhance understanding of the contributions of individual cell types to the physiological states of the tissue. Current approaches that address tissue heterogeneity have drawbacks. Experimental techniques, such as fluorescence-activated cell sorting, and single cell RNA sequencing are expensive. Computational approaches that use expression data from heterogeneous samples are promising, but most of the current methods estimate either cell-type proportions or cell-type-specific expression profiles by requiring the other as input. Although such partial deconvolution methods have been successfully applied to tumor samples, the additional input required may be unavailable. We introduce a novel complete deconvolution method, CDSeq, that uses only RNA-Seq data from bulk tissue samples to simultaneously estimate both cell-type proportions and cell-type-specific expression profiles. Using several synthetic and real experimental datasets with known cell-type composition and cell-type-specific expression profiles, we compared CDSeq’s complete deconvolution performance with seven other established deconvolution methods. Complete deconvolution using CDSeq represents a substantial technical advance over partial deconvolution approaches and will be useful for studying cell mixtures in tissue samples. CDSeq is available at GitHub repository (MATLAB and Octave code): https://github.com/kkang7/CDSeq.
format Online
Article
Text
id pubmed-6907860
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-69078602019-12-27 CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data Kang, Kai Meng, Qian Shats, Igor Umbach, David M. Li, Melissa Li, Yuanyuan Li, Xiaoling Li, Leping PLoS Comput Biol Research Article Quantifying cell-type proportions and their corresponding gene expression profiles in tissue samples would enhance understanding of the contributions of individual cell types to the physiological states of the tissue. Current approaches that address tissue heterogeneity have drawbacks. Experimental techniques, such as fluorescence-activated cell sorting, and single cell RNA sequencing are expensive. Computational approaches that use expression data from heterogeneous samples are promising, but most of the current methods estimate either cell-type proportions or cell-type-specific expression profiles by requiring the other as input. Although such partial deconvolution methods have been successfully applied to tumor samples, the additional input required may be unavailable. We introduce a novel complete deconvolution method, CDSeq, that uses only RNA-Seq data from bulk tissue samples to simultaneously estimate both cell-type proportions and cell-type-specific expression profiles. Using several synthetic and real experimental datasets with known cell-type composition and cell-type-specific expression profiles, we compared CDSeq’s complete deconvolution performance with seven other established deconvolution methods. Complete deconvolution using CDSeq represents a substantial technical advance over partial deconvolution approaches and will be useful for studying cell mixtures in tissue samples. CDSeq is available at GitHub repository (MATLAB and Octave code): https://github.com/kkang7/CDSeq. Public Library of Science 2019-12-02 /pmc/articles/PMC6907860/ /pubmed/31790389 http://dx.doi.org/10.1371/journal.pcbi.1007510 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Kang, Kai
Meng, Qian
Shats, Igor
Umbach, David M.
Li, Melissa
Li, Yuanyuan
Li, Xiaoling
Li, Leping
CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data
title CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data
title_full CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data
title_fullStr CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data
title_full_unstemmed CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data
title_short CDSeq: A novel complete deconvolution method for dissecting heterogeneous samples using gene expression data
title_sort cdseq: a novel complete deconvolution method for dissecting heterogeneous samples using gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6907860/
https://www.ncbi.nlm.nih.gov/pubmed/31790389
http://dx.doi.org/10.1371/journal.pcbi.1007510
work_keys_str_mv AT kangkai cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata
AT mengqian cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata
AT shatsigor cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata
AT umbachdavidm cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata
AT limelissa cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata
AT liyuanyuan cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata
AT lixiaoling cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata
AT lileping cdseqanovelcompletedeconvolutionmethodfordissectingheterogeneoussamplesusinggeneexpressiondata