Cargando…
A comprehensive assessment of cell type-specific differential expression methods in bulk data
Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. A...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851321/ https://www.ncbi.nlm.nih.gov/pubmed/36472568 http://dx.doi.org/10.1093/bib/bbac516 |
_version_ | 1784872370759008256 |
---|---|
author | Meng, Guanqun Tang, Wen Huang, Emina Li, Ziyi Feng, Hao |
author_facet | Meng, Guanqun Tang, Wen Huang, Emina Li, Ziyi Feng, Hao |
author_sort | Meng, Guanqun |
collection | PubMed |
description | Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. Although several computational methods have been developed to identify cell type-specific differentially expressed genes (csDEG) from RNA-seq data, a systematic evaluation is yet to be performed. Here, we thoroughly benchmark six recently published methods: CellDMC, CARseq, TOAST, LRCDE, CeDAR and TCA, together with two classical methods, csSAM and DESeq2, for a comprehensive comparison. We aim to systematically evaluate the performance of popular csDEG detection methods and provide guidance to researchers. In simulation studies, we benchmark available methods under various scenarios of baseline expression levels, sample sizes, cell type compositions, expression level alterations, technical noises and biological dispersions. Real data analyses of three large datasets on inflammatory bowel disease, lung cancer and autism provide evaluation in both the gene level and the pathway level. We find that csDEG calling is strongly affected by effect size, baseline expression level and cell type compositions. Results imply that csDEG discovery is a challenging task itself, with room to improvements on handling low signal-to-noise ratio and low expression genes. |
format | Online Article Text |
id | pubmed-9851321 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98513212023-01-20 A comprehensive assessment of cell type-specific differential expression methods in bulk data Meng, Guanqun Tang, Wen Huang, Emina Li, Ziyi Feng, Hao Brief Bioinform Review Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. Although several computational methods have been developed to identify cell type-specific differentially expressed genes (csDEG) from RNA-seq data, a systematic evaluation is yet to be performed. Here, we thoroughly benchmark six recently published methods: CellDMC, CARseq, TOAST, LRCDE, CeDAR and TCA, together with two classical methods, csSAM and DESeq2, for a comprehensive comparison. We aim to systematically evaluate the performance of popular csDEG detection methods and provide guidance to researchers. In simulation studies, we benchmark available methods under various scenarios of baseline expression levels, sample sizes, cell type compositions, expression level alterations, technical noises and biological dispersions. Real data analyses of three large datasets on inflammatory bowel disease, lung cancer and autism provide evaluation in both the gene level and the pathway level. We find that csDEG calling is strongly affected by effect size, baseline expression level and cell type compositions. Results imply that csDEG discovery is a challenging task itself, with room to improvements on handling low signal-to-noise ratio and low expression genes. Oxford University Press 2022-12-06 /pmc/articles/PMC9851321/ /pubmed/36472568 http://dx.doi.org/10.1093/bib/bbac516 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Review Meng, Guanqun Tang, Wen Huang, Emina Li, Ziyi Feng, Hao A comprehensive assessment of cell type-specific differential expression methods in bulk data |
title | A comprehensive assessment of cell type-specific differential expression methods in bulk data |
title_full | A comprehensive assessment of cell type-specific differential expression methods in bulk data |
title_fullStr | A comprehensive assessment of cell type-specific differential expression methods in bulk data |
title_full_unstemmed | A comprehensive assessment of cell type-specific differential expression methods in bulk data |
title_short | A comprehensive assessment of cell type-specific differential expression methods in bulk data |
title_sort | comprehensive assessment of cell type-specific differential expression methods in bulk data |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851321/ https://www.ncbi.nlm.nih.gov/pubmed/36472568 http://dx.doi.org/10.1093/bib/bbac516 |
work_keys_str_mv | AT mengguanqun acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT tangwen acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT huangemina acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT liziyi acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT fenghao acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT mengguanqun comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT tangwen comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT huangemina comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT liziyi comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata AT fenghao comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata |