Cargando…

Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets

Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical n...

Descripción completa

Detalles Bibliográficos
Autores principales: Yi, Haidong, Plotkin, Alec, Stanley, Natalie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002703/
https://www.ncbi.nlm.nih.gov/pubmed/36909641
http://dx.doi.org/10.1101/2023.02.24.529894
_version_ 1784904445054681088
author Yi, Haidong
Plotkin, Alec
Stanley, Natalie
author_facet Yi, Haidong
Plotkin, Alec
Stanley, Natalie
author_sort Yi, Haidong
collection PubMed
description Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical noise and cellular heterogeneity, which lead to false-positive results. To constrain the analysis to a compact and phenotype-related cell population, differential abundance (DA) testing methods were employed to identify subgroups of cells whose abundance changed significantly in response to disease progression, or experimental perturbation. Despite the effectiveness of DA testing algorithms of identifying critical cell-states, there are no systematic benchmarking or comparative studies to compare their usages in practice. Herein, we performed the first comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art DA testing methods. We benchmarked six DA testing methods on several practical tasks, using both synthetic and real single-cell datasets. The task evaluated include, recognizing true DA subpopulations, appropriate handing of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the usage of DA testing methods.
format Online
Article
Text
id pubmed-10002703
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100027032023-03-11 Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets Yi, Haidong Plotkin, Alec Stanley, Natalie bioRxiv Article Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical noise and cellular heterogeneity, which lead to false-positive results. To constrain the analysis to a compact and phenotype-related cell population, differential abundance (DA) testing methods were employed to identify subgroups of cells whose abundance changed significantly in response to disease progression, or experimental perturbation. Despite the effectiveness of DA testing algorithms of identifying critical cell-states, there are no systematic benchmarking or comparative studies to compare their usages in practice. Herein, we performed the first comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art DA testing methods. We benchmarked six DA testing methods on several practical tasks, using both synthetic and real single-cell datasets. The task evaluated include, recognizing true DA subpopulations, appropriate handing of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the usage of DA testing methods. Cold Spring Harbor Laboratory 2023-02-27 /pmc/articles/PMC10002703/ /pubmed/36909641 http://dx.doi.org/10.1101/2023.02.24.529894 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Yi, Haidong
Plotkin, Alec
Stanley, Natalie
Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_full Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_fullStr Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_full_unstemmed Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_short Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_sort benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002703/
https://www.ncbi.nlm.nih.gov/pubmed/36909641
http://dx.doi.org/10.1101/2023.02.24.529894
work_keys_str_mv AT yihaidong benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets
AT plotkinalec benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets
AT stanleynatalie benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets