Cargando…
Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets
Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical n...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002703/ https://www.ncbi.nlm.nih.gov/pubmed/36909641 http://dx.doi.org/10.1101/2023.02.24.529894 |
_version_ | 1784904445054681088 |
---|---|
author | Yi, Haidong Plotkin, Alec Stanley, Natalie |
author_facet | Yi, Haidong Plotkin, Alec Stanley, Natalie |
author_sort | Yi, Haidong |
collection | PubMed |
description | Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical noise and cellular heterogeneity, which lead to false-positive results. To constrain the analysis to a compact and phenotype-related cell population, differential abundance (DA) testing methods were employed to identify subgroups of cells whose abundance changed significantly in response to disease progression, or experimental perturbation. Despite the effectiveness of DA testing algorithms of identifying critical cell-states, there are no systematic benchmarking or comparative studies to compare their usages in practice. Herein, we performed the first comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art DA testing methods. We benchmarked six DA testing methods on several practical tasks, using both synthetic and real single-cell datasets. The task evaluated include, recognizing true DA subpopulations, appropriate handing of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the usage of DA testing methods. |
format | Online Article Text |
id | pubmed-10002703 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-100027032023-03-11 Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets Yi, Haidong Plotkin, Alec Stanley, Natalie bioRxiv Article Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical noise and cellular heterogeneity, which lead to false-positive results. To constrain the analysis to a compact and phenotype-related cell population, differential abundance (DA) testing methods were employed to identify subgroups of cells whose abundance changed significantly in response to disease progression, or experimental perturbation. Despite the effectiveness of DA testing algorithms of identifying critical cell-states, there are no systematic benchmarking or comparative studies to compare their usages in practice. Herein, we performed the first comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art DA testing methods. We benchmarked six DA testing methods on several practical tasks, using both synthetic and real single-cell datasets. The task evaluated include, recognizing true DA subpopulations, appropriate handing of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the usage of DA testing methods. Cold Spring Harbor Laboratory 2023-02-27 /pmc/articles/PMC10002703/ /pubmed/36909641 http://dx.doi.org/10.1101/2023.02.24.529894 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Yi, Haidong Plotkin, Alec Stanley, Natalie Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets |
title | Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets |
title_full | Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets |
title_fullStr | Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets |
title_full_unstemmed | Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets |
title_short | Benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets |
title_sort | benchmarking differential abundance methods for finding condition-specific prototypical cells in multi-sample single-cell datasets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002703/ https://www.ncbi.nlm.nih.gov/pubmed/36909641 http://dx.doi.org/10.1101/2023.02.24.529894 |
work_keys_str_mv | AT yihaidong benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets AT plotkinalec benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets AT stanleynatalie benchmarkingdifferentialabundancemethodsforfindingconditionspecificprototypicalcellsinmultisamplesinglecelldatasets |