Cargando…

A comprehensive assessment of cell type-specific differential expression methods in bulk data

Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. A...

Descripción completa

Detalles Bibliográficos
Autores principales: Meng, Guanqun, Tang, Wen, Huang, Emina, Li, Ziyi, Feng, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851321/
https://www.ncbi.nlm.nih.gov/pubmed/36472568
http://dx.doi.org/10.1093/bib/bbac516
_version_ 1784872370759008256
author Meng, Guanqun
Tang, Wen
Huang, Emina
Li, Ziyi
Feng, Hao
author_facet Meng, Guanqun
Tang, Wen
Huang, Emina
Li, Ziyi
Feng, Hao
author_sort Meng, Guanqun
collection PubMed
description Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. Although several computational methods have been developed to identify cell type-specific differentially expressed genes (csDEG) from RNA-seq data, a systematic evaluation is yet to be performed. Here, we thoroughly benchmark six recently published methods: CellDMC, CARseq, TOAST, LRCDE, CeDAR and TCA, together with two classical methods, csSAM and DESeq2, for a comprehensive comparison. We aim to systematically evaluate the performance of popular csDEG detection methods and provide guidance to researchers. In simulation studies, we benchmark available methods under various scenarios of baseline expression levels, sample sizes, cell type compositions, expression level alterations, technical noises and biological dispersions. Real data analyses of three large datasets on inflammatory bowel disease, lung cancer and autism provide evaluation in both the gene level and the pathway level. We find that csDEG calling is strongly affected by effect size, baseline expression level and cell type compositions. Results imply that csDEG discovery is a challenging task itself, with room to improvements on handling low signal-to-noise ratio and low expression genes.
format Online
Article
Text
id pubmed-9851321
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98513212023-01-20 A comprehensive assessment of cell type-specific differential expression methods in bulk data Meng, Guanqun Tang, Wen Huang, Emina Li, Ziyi Feng, Hao Brief Bioinform Review Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. Although several computational methods have been developed to identify cell type-specific differentially expressed genes (csDEG) from RNA-seq data, a systematic evaluation is yet to be performed. Here, we thoroughly benchmark six recently published methods: CellDMC, CARseq, TOAST, LRCDE, CeDAR and TCA, together with two classical methods, csSAM and DESeq2, for a comprehensive comparison. We aim to systematically evaluate the performance of popular csDEG detection methods and provide guidance to researchers. In simulation studies, we benchmark available methods under various scenarios of baseline expression levels, sample sizes, cell type compositions, expression level alterations, technical noises and biological dispersions. Real data analyses of three large datasets on inflammatory bowel disease, lung cancer and autism provide evaluation in both the gene level and the pathway level. We find that csDEG calling is strongly affected by effect size, baseline expression level and cell type compositions. Results imply that csDEG discovery is a challenging task itself, with room to improvements on handling low signal-to-noise ratio and low expression genes. Oxford University Press 2022-12-06 /pmc/articles/PMC9851321/ /pubmed/36472568 http://dx.doi.org/10.1093/bib/bbac516 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Review
Meng, Guanqun
Tang, Wen
Huang, Emina
Li, Ziyi
Feng, Hao
A comprehensive assessment of cell type-specific differential expression methods in bulk data
title A comprehensive assessment of cell type-specific differential expression methods in bulk data
title_full A comprehensive assessment of cell type-specific differential expression methods in bulk data
title_fullStr A comprehensive assessment of cell type-specific differential expression methods in bulk data
title_full_unstemmed A comprehensive assessment of cell type-specific differential expression methods in bulk data
title_short A comprehensive assessment of cell type-specific differential expression methods in bulk data
title_sort comprehensive assessment of cell type-specific differential expression methods in bulk data
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9851321/
https://www.ncbi.nlm.nih.gov/pubmed/36472568
http://dx.doi.org/10.1093/bib/bbac516
work_keys_str_mv AT mengguanqun acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT tangwen acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT huangemina acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT liziyi acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT fenghao acomprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT mengguanqun comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT tangwen comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT huangemina comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT liziyi comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata
AT fenghao comprehensiveassessmentofcelltypespecificdifferentialexpressionmethodsinbulkdata