Cargando…

Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets

Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical n...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yi, Haidong, Plotkin, Alec, Stanley, Natalie
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cold Spring Harbor Laboratory 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002703/ https://www.ncbi.nlm.nih.gov/pubmed/36909641 http://dx.doi.org/10.1101/2023.02.24.529894

_version_	1784904445054681088
author	Yi, Haidong Plotkin, Alec Stanley, Natalie
author_facet	Yi, Haidong Plotkin, Alec Stanley, Natalie
author_sort	Yi, Haidong
collection	PubMed
description	Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical noise and cellular heterogeneity, which lead to false-positive results. To constrain the analysis to a compact and phenotype-related cell population, differential abundance (DA) testing methods were employed to identify subgroups of cells whose abundance changed significantly in response to disease progression, or experimental perturbation. Despite the effectiveness of DA testing algorithms of identifying critical cell-states, there are no systematic benchmarking or comparative studies to compare their usages in practice. Herein, we performed the first comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art DA testing methods. We benchmarked six DA testing methods on several practical tasks, using both synthetic and real single-cell datasets. The task evaluated include, recognizing true DA subpopulations, appropriate handing of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the usage of DA testing methods.
format	Online Article Text
id	pubmed-10002703
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Cold Spring Harbor Laboratory
record_format	MEDLINE/PubMed
spelling	pubmed-100027032023-03-11 Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets Yi, Haidong Plotkin, Alec Stanley, Natalie bioRxiv Article Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical noise and cellular heterogeneity, which lead to false-positive results. To constrain the analysis to a compact and phenotype-related cell population, differential abundance (DA) testing methods were employed to identify subgroups of cells whose abundance changed significantly in response to disease progression, or experimental perturbation. Despite the effectiveness of DA testing algorithms of identifying critical cell-states, there are no systematic benchmarking or comparative studies to compare their usages in practice. Herein, we performed the first comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art DA testing methods. We benchmarked six DA testing methods on several practical tasks, using both synthetic and real single-cell datasets. The task evaluated include, recognizing true DA subpopulations, appropriate handing of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the usage of DA testing methods. Cold Spring Harbor Laboratory 2023-02-27 /pmc/articles/PMC10002703/ /pubmed/36909641 http://dx.doi.org/10.1101/2023.02.24.529894 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle	Article Yi, Haidong Plotkin, Alec Stanley, Natalie Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title	Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_full	Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_fullStr	Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_full_unstemmed	Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_short	Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
title_sort	benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002703/ https://www.ncbi.nlm.nih.gov/pubmed/36909641 http://dx.doi.org/10.1101/2023.02.24.529894
work_keys_str_mv	AT yihaidong benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets AT plotkinalec benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets AT stanleynatalie benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets

Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets

Ejemplares similares