Cargando…

Deconvolution of heterogeneous tumor samples using partial reference signals

Deconvolution of heterogeneous bulk tumor samples into distinct cellular populations is an important yet challenging problem, particularly when only partial references are available. A common approach to dealing with this problem is to deconvolve the mixed signals using available references and leve...

Descripción completa

Detalles Bibliográficos
Autores principales: Qin, Yufang, Zhang, Weiwei, Sun, Xiaoqiang, Nan, Siwei, Wei, Nana, Wu, Hua-Jun, Zheng, Xiaoqi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728196/
https://www.ncbi.nlm.nih.gov/pubmed/33253170
http://dx.doi.org/10.1371/journal.pcbi.1008452
_version_ 1783621221960646656
author Qin, Yufang
Zhang, Weiwei
Sun, Xiaoqiang
Nan, Siwei
Wei, Nana
Wu, Hua-Jun
Zheng, Xiaoqi
author_facet Qin, Yufang
Zhang, Weiwei
Sun, Xiaoqiang
Nan, Siwei
Wei, Nana
Wu, Hua-Jun
Zheng, Xiaoqi
author_sort Qin, Yufang
collection PubMed
description Deconvolution of heterogeneous bulk tumor samples into distinct cellular populations is an important yet challenging problem, particularly when only partial references are available. A common approach to dealing with this problem is to deconvolve the mixed signals using available references and leverage the remaining signal as a new cell component. However, as indicated in our simulation, such an approach tends to over-estimate the proportions of known cell types and fails to detect novel cell types. Here, we propose PREDE, a partial reference-based deconvolution method using an iterative non-negative matrix factorization algorithm. Our method is verified to be effective in estimating cell proportions and expression profiles of unknown cell types based on simulated datasets at a variety of parameter settings. Applying our method to TCGA tumor samples, we found that proportions of pure cancer cells better indicate different subtypes of tumor samples. We also detected several cell types for each cancer type whose proportions successfully predicted patient survival. Our method makes a significant contribution to deconvolution of heterogeneous tumor samples and could be widely applied to varieties of high throughput bulk data. PREDE is implemented in R and is freely available from GitHub (https://xiaoqizheng.github.io/PREDE).
format Online
Article
Text
id pubmed-7728196
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77281962020-12-16 Deconvolution of heterogeneous tumor samples using partial reference signals Qin, Yufang Zhang, Weiwei Sun, Xiaoqiang Nan, Siwei Wei, Nana Wu, Hua-Jun Zheng, Xiaoqi PLoS Comput Biol Research Article Deconvolution of heterogeneous bulk tumor samples into distinct cellular populations is an important yet challenging problem, particularly when only partial references are available. A common approach to dealing with this problem is to deconvolve the mixed signals using available references and leverage the remaining signal as a new cell component. However, as indicated in our simulation, such an approach tends to over-estimate the proportions of known cell types and fails to detect novel cell types. Here, we propose PREDE, a partial reference-based deconvolution method using an iterative non-negative matrix factorization algorithm. Our method is verified to be effective in estimating cell proportions and expression profiles of unknown cell types based on simulated datasets at a variety of parameter settings. Applying our method to TCGA tumor samples, we found that proportions of pure cancer cells better indicate different subtypes of tumor samples. We also detected several cell types for each cancer type whose proportions successfully predicted patient survival. Our method makes a significant contribution to deconvolution of heterogeneous tumor samples and could be widely applied to varieties of high throughput bulk data. PREDE is implemented in R and is freely available from GitHub (https://xiaoqizheng.github.io/PREDE). Public Library of Science 2020-11-30 /pmc/articles/PMC7728196/ /pubmed/33253170 http://dx.doi.org/10.1371/journal.pcbi.1008452 Text en © 2020 Qin et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Qin, Yufang
Zhang, Weiwei
Sun, Xiaoqiang
Nan, Siwei
Wei, Nana
Wu, Hua-Jun
Zheng, Xiaoqi
Deconvolution of heterogeneous tumor samples using partial reference signals
title Deconvolution of heterogeneous tumor samples using partial reference signals
title_full Deconvolution of heterogeneous tumor samples using partial reference signals
title_fullStr Deconvolution of heterogeneous tumor samples using partial reference signals
title_full_unstemmed Deconvolution of heterogeneous tumor samples using partial reference signals
title_short Deconvolution of heterogeneous tumor samples using partial reference signals
title_sort deconvolution of heterogeneous tumor samples using partial reference signals
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7728196/
https://www.ncbi.nlm.nih.gov/pubmed/33253170
http://dx.doi.org/10.1371/journal.pcbi.1008452
work_keys_str_mv AT qinyufang deconvolutionofheterogeneoustumorsamplesusingpartialreferencesignals
AT zhangweiwei deconvolutionofheterogeneoustumorsamplesusingpartialreferencesignals
AT sunxiaoqiang deconvolutionofheterogeneoustumorsamplesusingpartialreferencesignals
AT nansiwei deconvolutionofheterogeneoustumorsamplesusingpartialreferencesignals
AT weinana deconvolutionofheterogeneoustumorsamplesusingpartialreferencesignals
AT wuhuajun deconvolutionofheterogeneoustumorsamplesusingpartialreferencesignals
AT zhengxiaoqi deconvolutionofheterogeneoustumorsamplesusingpartialreferencesignals