Cargando…

Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes

BACKGROUND: RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq, scnRNA-seq for short), can help characterize the composition of tissues and reveal cells that influence key functions in both healthy and disease tissue...

Descripción completa

Detalles Bibliográficos
Autores principales: Cobos, Francisco Avila, Panah, Mohammad Javad Najaf, Epps, Jessica, Long, Xiaochen, Man, Tsz-Kwong, Chiu, Hua-Sheng, Chomsky, Elad, Kiner, Evgeny, Krueger, Michael J., di Bernardo, Diego, Voloch, Luis, Molenaar, Jan, van Hooff, Sander R., Westermann, Frank, Jansky, Selina, Redell, Michele L., Mestdagh, Pieter, Sumazin, Pavel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394903/
https://www.ncbi.nlm.nih.gov/pubmed/37528411
http://dx.doi.org/10.1186/s13059-023-03016-6
_version_ 1785083474749685760
author Cobos, Francisco Avila
Panah, Mohammad Javad Najaf
Epps, Jessica
Long, Xiaochen
Man, Tsz-Kwong
Chiu, Hua-Sheng
Chomsky, Elad
Kiner, Evgeny
Krueger, Michael J.
di Bernardo, Diego
Voloch, Luis
Molenaar, Jan
van Hooff, Sander R.
Westermann, Frank
Jansky, Selina
Redell, Michele L.
Mestdagh, Pieter
Sumazin, Pavel
author_facet Cobos, Francisco Avila
Panah, Mohammad Javad Najaf
Epps, Jessica
Long, Xiaochen
Man, Tsz-Kwong
Chiu, Hua-Sheng
Chomsky, Elad
Kiner, Evgeny
Krueger, Michael J.
di Bernardo, Diego
Voloch, Luis
Molenaar, Jan
van Hooff, Sander R.
Westermann, Frank
Jansky, Selina
Redell, Michele L.
Mestdagh, Pieter
Sumazin, Pavel
author_sort Cobos, Francisco Avila
collection PubMed
description BACKGROUND: RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq, scnRNA-seq for short), can help characterize the composition of tissues and reveal cells that influence key functions in both healthy and disease tissues. However, the use of these technologies is operationally challenging because of high costs and stringent sample-collection requirements. Computational deconvolution methods that infer the composition of bulk-profiled samples using scnRNA-seq-characterized cell types can broaden scnRNA-seq applications, but their effectiveness remains controversial. RESULTS: We produced the first systematic evaluation of deconvolution methods on datasets with either known or scnRNA-seq-estimated compositions. Our analyses revealed biases that are common to scnRNA-seq 10X Genomics assays and illustrated the importance of accurate and properly controlled data preprocessing and method selection and optimization. Moreover, our results suggested that concurrent RNA-seq and scnRNA-seq profiles can help improve the accuracy of both scnRNA-seq preprocessing and the deconvolution methods that employ them. Indeed, our proposed method, Single-cell RNA Quantity Informed Deconvolution (SQUID), which combines RNA-seq transformation and dampened weighted least-squares deconvolution approaches, consistently outperformed other methods in predicting the composition of cell mixtures and tissue samples. CONCLUSIONS: We showed that analysis of concurrent RNA-seq and scnRNA-seq profiles with SQUID can produce accurate cell-type abundance estimates and that this accuracy improvement was necessary for identifying outcomes-predictive cancer cell subclones in pediatric acute myeloid leukemia and neuroblastoma datasets. These results suggest that deconvolution accuracy improvements are vital to enabling its applications in the life sciences. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03016-6.
format Online
Article
Text
id pubmed-10394903
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103949032023-08-03 Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes Cobos, Francisco Avila Panah, Mohammad Javad Najaf Epps, Jessica Long, Xiaochen Man, Tsz-Kwong Chiu, Hua-Sheng Chomsky, Elad Kiner, Evgeny Krueger, Michael J. di Bernardo, Diego Voloch, Luis Molenaar, Jan van Hooff, Sander R. Westermann, Frank Jansky, Selina Redell, Michele L. Mestdagh, Pieter Sumazin, Pavel Genome Biol Research BACKGROUND: RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq, scnRNA-seq for short), can help characterize the composition of tissues and reveal cells that influence key functions in both healthy and disease tissues. However, the use of these technologies is operationally challenging because of high costs and stringent sample-collection requirements. Computational deconvolution methods that infer the composition of bulk-profiled samples using scnRNA-seq-characterized cell types can broaden scnRNA-seq applications, but their effectiveness remains controversial. RESULTS: We produced the first systematic evaluation of deconvolution methods on datasets with either known or scnRNA-seq-estimated compositions. Our analyses revealed biases that are common to scnRNA-seq 10X Genomics assays and illustrated the importance of accurate and properly controlled data preprocessing and method selection and optimization. Moreover, our results suggested that concurrent RNA-seq and scnRNA-seq profiles can help improve the accuracy of both scnRNA-seq preprocessing and the deconvolution methods that employ them. Indeed, our proposed method, Single-cell RNA Quantity Informed Deconvolution (SQUID), which combines RNA-seq transformation and dampened weighted least-squares deconvolution approaches, consistently outperformed other methods in predicting the composition of cell mixtures and tissue samples. CONCLUSIONS: We showed that analysis of concurrent RNA-seq and scnRNA-seq profiles with SQUID can produce accurate cell-type abundance estimates and that this accuracy improvement was necessary for identifying outcomes-predictive cancer cell subclones in pediatric acute myeloid leukemia and neuroblastoma datasets. These results suggest that deconvolution accuracy improvements are vital to enabling its applications in the life sciences. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03016-6. BioMed Central 2023-08-01 /pmc/articles/PMC10394903/ /pubmed/37528411 http://dx.doi.org/10.1186/s13059-023-03016-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Cobos, Francisco Avila
Panah, Mohammad Javad Najaf
Epps, Jessica
Long, Xiaochen
Man, Tsz-Kwong
Chiu, Hua-Sheng
Chomsky, Elad
Kiner, Evgeny
Krueger, Michael J.
di Bernardo, Diego
Voloch, Luis
Molenaar, Jan
van Hooff, Sander R.
Westermann, Frank
Jansky, Selina
Redell, Michele L.
Mestdagh, Pieter
Sumazin, Pavel
Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
title Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
title_full Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
title_fullStr Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
title_full_unstemmed Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
title_short Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
title_sort effective methods for bulk rna-seq deconvolution using scnrna-seq transcriptomes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394903/
https://www.ncbi.nlm.nih.gov/pubmed/37528411
http://dx.doi.org/10.1186/s13059-023-03016-6
work_keys_str_mv AT cobosfranciscoavila effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT panahmohammadjavadnajaf effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT eppsjessica effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT longxiaochen effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT mantszkwong effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT chiuhuasheng effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT chomskyelad effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT kinerevgeny effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT kruegermichaelj effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT dibernardodiego effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT volochluis effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT molenaarjan effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT vanhooffsanderr effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT westermannfrank effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT janskyselina effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT redellmichelel effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT mestdaghpieter effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes
AT sumazinpavel effectivemethodsforbulkrnaseqdeconvolutionusingscnrnaseqtranscriptomes