Cargando…
cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling
MOTIVATION: The precise characterization of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challe...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692870/ https://www.ncbi.nlm.nih.gov/pubmed/37952182 http://dx.doi.org/10.1093/bioinformatics/btad685 |
_version_ | 1785153037151502336 |
---|---|
author | Al Kamran Khan, Md Abdullah Wu, Jian Sun, Yuhan Barrow, Alexander D Papenfuss, Anthony T Mangiola, Stefano |
author_facet | Al Kamran Khan, Md Abdullah Wu, Jian Sun, Yuhan Barrow, Alexander D Papenfuss, Anthony T Mangiola, Stefano |
author_sort | Al Kamran Khan, Md Abdullah |
collection | PubMed |
description | MOTIVATION: The precise characterization of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challenges persist due to data heterogeneity and discrepancies across different studies. One limitation of prevailing tools such as CIBERSORTx is their inability to address hierarchical data structures and handle nonoverlapping gene sets across samples, relying on filtering or imputation. RESULTS: Here, we present cellsig, a Bayesian sparse multilevel model designed to improve signature estimation by adjusting data for multilevel effects and modelling for gene-set sparsity. Our model is tailored to large-scale, heterogeneous pseudobulk and bulk RNA sequencing data collections with nonoverlapping gene sets. We tested the performances of cellsig on a novel curated Human Bulk Cell-type Catalogue, which harmonizes 1435 samples across 58 datasets. We show that cellsig significantly enhances cell-type marker gene ranking performance. This approach is valuable for cell-type signature selection, with implications for marker gene validation, single-cell annotation, and deconvolution benchmarks. AVAILABILITY AND IMPLEMENTATION: Codes and the interactive app are available at https://github.com/stemangiola/cellsig; and the database is available at https://doi.org/10.5281/zenodo.7582421. |
format | Online Article Text |
id | pubmed-10692870 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-106928702023-12-03 cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling Al Kamran Khan, Md Abdullah Wu, Jian Sun, Yuhan Barrow, Alexander D Papenfuss, Anthony T Mangiola, Stefano Bioinformatics Original Paper MOTIVATION: The precise characterization of cell-type transcriptomes is pivotal to understanding cellular lineages, deconvolution of bulk transcriptomes, and clinical applications. Single-cell RNA sequencing resources like the Human Cell Atlas have revolutionised cell-type profiling. However, challenges persist due to data heterogeneity and discrepancies across different studies. One limitation of prevailing tools such as CIBERSORTx is their inability to address hierarchical data structures and handle nonoverlapping gene sets across samples, relying on filtering or imputation. RESULTS: Here, we present cellsig, a Bayesian sparse multilevel model designed to improve signature estimation by adjusting data for multilevel effects and modelling for gene-set sparsity. Our model is tailored to large-scale, heterogeneous pseudobulk and bulk RNA sequencing data collections with nonoverlapping gene sets. We tested the performances of cellsig on a novel curated Human Bulk Cell-type Catalogue, which harmonizes 1435 samples across 58 datasets. We show that cellsig significantly enhances cell-type marker gene ranking performance. This approach is valuable for cell-type signature selection, with implications for marker gene validation, single-cell annotation, and deconvolution benchmarks. AVAILABILITY AND IMPLEMENTATION: Codes and the interactive app are available at https://github.com/stemangiola/cellsig; and the database is available at https://doi.org/10.5281/zenodo.7582421. Oxford University Press 2023-11-11 /pmc/articles/PMC10692870/ /pubmed/37952182 http://dx.doi.org/10.1093/bioinformatics/btad685 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Al Kamran Khan, Md Abdullah Wu, Jian Sun, Yuhan Barrow, Alexander D Papenfuss, Anthony T Mangiola, Stefano cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling |
title | cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling |
title_full | cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling |
title_fullStr | cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling |
title_full_unstemmed | cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling |
title_short | cellsig plug-in enhances CIBERSORTx signature selection for multidataset transcriptomes with sparse multilevel modelling |
title_sort | cellsig plug-in enhances cibersortx signature selection for multidataset transcriptomes with sparse multilevel modelling |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692870/ https://www.ncbi.nlm.nih.gov/pubmed/37952182 http://dx.doi.org/10.1093/bioinformatics/btad685 |
work_keys_str_mv | AT alkamrankhanmdabdullah cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling AT wujian cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling AT sunyuhan cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling AT barrowalexanderd cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling AT papenfussanthonyt cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling AT mangiolastefano cellsigpluginenhancescibersortxsignatureselectionformultidatasettranscriptomeswithsparsemultilevelmodelling |