Cargando…

cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling

MOTIVATION: The precise characterization of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challe...

Descripción completa

Detalles Bibliográficos
Autores principales: Al Kamran Khan, Md Abdullah, Wu, Jian, Sun, Yuhan, Barrow, Alexander D, Papenfuss, Anthony T, Mangiola, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692870/
https://www.ncbi.nlm.nih.gov/pubmed/37952182
http://dx.doi.org/10.1093/bioinformatics/btad685
_version_ 1785153037151502336
author Al Kamran Khan, Md Abdullah
Wu, Jian
Sun, Yuhan
Barrow, Alexander D
Papenfuss, Anthony T
Mangiola, Stefano
author_facet Al Kamran Khan, Md Abdullah
Wu, Jian
Sun, Yuhan
Barrow, Alexander D
Papenfuss, Anthony T
Mangiola, Stefano
author_sort Al Kamran Khan, Md Abdullah
collection PubMed
description MOTIVATION: The precise characterization of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challenges persist due to data heterogeneity and discrepancies across different studies. One limitation of prevailing tools such as CIBERSORTx is their inability to address hierarchical data structures and handle nonoverlapping gene sets across samples, relying on filtering or imputation. RESULTS: Here, we present cellsig, a Bayesian sparse multilevel model designed to improve signature estimation by adjusting data for multilevel effects and modelling for gene-set sparsity. Our model is tailored to large-scale, heterogeneous pseudobulk and bulk RNA sequencing data collections with nonoverlapping gene sets. We tested the performances of cellsig on a novel curated Human Bulk Cell-type Catalogue, which harmonizes 1435 samples across 58 datasets. We show that cellsig significantly enhances cell-type marker gene ranking performance. This approach is valuable for cell-type signature selection, with implications for marker gene validation, single-cell annotation, and deconvolution benchmarks. AVAILABILITY AND IMPLEMENTATION: Codes and the interactive app are available at https://github.com/stemangiola/cellsig; and the database is available at https://doi.org/10.5281/zenodo.7582421.
format Online
Article
Text
id pubmed-10692870
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106928702023-12-03 cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling Al Kamran Khan, Md Abdullah Wu, Jian Sun, Yuhan Barrow, Alexander D Papenfuss, Anthony T Mangiola, Stefano Bioinformatics Original Paper MOTIVATION: The precise characterization of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challenges persist due to data heterogeneity and discrepancies across different studies. One limitation of prevailing tools such as CIBERSORTx is their inability to address hierarchical data structures and handle nonoverlapping gene sets across samples, relying on filtering or imputation. RESULTS: Here, we present cellsig, a Bayesian sparse multilevel model designed to improve signature estimation by adjusting data for multilevel effects and modelling for gene-set sparsity. Our model is tailored to large-scale, heterogeneous pseudobulk and bulk RNA sequencing data collections with nonoverlapping gene sets. We tested the performances of cellsig on a novel curated Human Bulk Cell-type Catalogue, which harmonizes 1435 samples across 58 datasets. We show that cellsig significantly enhances cell-type marker gene ranking performance. This approach is valuable for cell-type signature selection, with implications for marker gene validation, single-cell annotation, and deconvolution benchmarks. AVAILABILITY AND IMPLEMENTATION: Codes and the interactive app are available at https://github.com/stemangiola/cellsig; and the database is available at https://doi.org/10.5281/zenodo.7582421. Oxford University Press 2023-11-11 /pmc/articles/PMC10692870/ /pubmed/37952182 http://dx.doi.org/10.1093/bioinformatics/btad685 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Al Kamran Khan, Md Abdullah
Wu, Jian
Sun, Yuhan
Barrow, Alexander D
Papenfuss, Anthony T
Mangiola, Stefano
cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling
title cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling
title_full cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling
title_fullStr cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling
title_full_unstemmed cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling
title_short cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling
title_sort cellsig plug-in enhances cibersortx signature selection for multidataset transcriptomes with sparse multilevel modelling
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692870/
https://www.ncbi.nlm.nih.gov/pubmed/37952182
http://dx.doi.org/10.1093/bioinformatics/btad685
work_keys_str_mv AT alkamrankhanmdabdullah cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling
AT wujian cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling
AT sunyuhan cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling
AT barrowalexanderd cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling
AT papenfussanthonyt cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling
AT mangiolastefano cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling