Cargando…

Cell-type-specific co-expression inference from single cell RNA-sequencing data

The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by v...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Chang, Xu, Zichun, Shan, Xinning, Cai, Biao, Zhao, Hongyu, Zhang, Jingfei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9774209/
https://www.ncbi.nlm.nih.gov/pubmed/36561173
http://dx.doi.org/10.1101/2022.12.13.520181
_version_ 1784855353270206464
author Su, Chang
Xu, Zichun
Shan, Xinning
Cai, Biao
Zhao, Hongyu
Zhang, Jingfei
author_facet Su, Chang
Xu, Zichun
Shan, Xinning
Cai, Biao
Zhao, Hongyu
Zhang, Jingfei
author_sort Su, Chang
collection PubMed
description The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by varying cell type compositions across samples and only offer an aggregated view of gene regulations that may be distinct across different cell types. The advancement of single cell RNA-sequencing (scRNA-seq) technology has enabled the direct inference of co-expressions in specific cell types, facilitating our understanding of cell-type-specific biological functions. However, the high sequencing depth variations and measurement errors in scRNA-seq data present significant challenges in inferring cell-type-specific gene co-expressions, and these issues have not been adequately addressed in the existing methods. We propose a statistical approach, CS-CORE, for estimating and testing cell-type-specific co-expressions, built on a general expression-measurement model that explicitly accounts for sequencing depth variations and measurement errors in the observed single cell data. Systematic evaluations show that most existing methods suffer from inflated false positives and biased co-expression estimates and clustering analysis, whereas CS-CORE has appropriate false positive control, unbiased co-expression estimates, good statistical power and satisfactory performance in downstream co-expression analysis. When applied to analyze scRNA-seq data from postmortem brain samples from Alzheimer’s disease patients and controls and blood samples from COVID-19 patients and controls, CS-CORE identified cell-type-specific co-expressions and differential co-expressions that were more reproducible and/or more enriched for relevant biological pathways than those inferred from other methods.
format Online
Article
Text
id pubmed-9774209
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-97742092022-12-23 Cell-type-specific co-expression inference from single cell RNA-sequencing data Su, Chang Xu, Zichun Shan, Xinning Cai, Biao Zhao, Hongyu Zhang, Jingfei bioRxiv Article The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by varying cell type compositions across samples and only offer an aggregated view of gene regulations that may be distinct across different cell types. The advancement of single cell RNA-sequencing (scRNA-seq) technology has enabled the direct inference of co-expressions in specific cell types, facilitating our understanding of cell-type-specific biological functions. However, the high sequencing depth variations and measurement errors in scRNA-seq data present significant challenges in inferring cell-type-specific gene co-expressions, and these issues have not been adequately addressed in the existing methods. We propose a statistical approach, CS-CORE, for estimating and testing cell-type-specific co-expressions, built on a general expression-measurement model that explicitly accounts for sequencing depth variations and measurement errors in the observed single cell data. Systematic evaluations show that most existing methods suffer from inflated false positives and biased co-expression estimates and clustering analysis, whereas CS-CORE has appropriate false positive control, unbiased co-expression estimates, good statistical power and satisfactory performance in downstream co-expression analysis. When applied to analyze scRNA-seq data from postmortem brain samples from Alzheimer’s disease patients and controls and blood samples from COVID-19 patients and controls, CS-CORE identified cell-type-specific co-expressions and differential co-expressions that were more reproducible and/or more enriched for relevant biological pathways than those inferred from other methods. Cold Spring Harbor Laboratory 2022-12-15 /pmc/articles/PMC9774209/ /pubmed/36561173 http://dx.doi.org/10.1101/2022.12.13.520181 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Su, Chang
Xu, Zichun
Shan, Xinning
Cai, Biao
Zhao, Hongyu
Zhang, Jingfei
Cell-type-specific co-expression inference from single cell RNA-sequencing data
title Cell-type-specific co-expression inference from single cell RNA-sequencing data
title_full Cell-type-specific co-expression inference from single cell RNA-sequencing data
title_fullStr Cell-type-specific co-expression inference from single cell RNA-sequencing data
title_full_unstemmed Cell-type-specific co-expression inference from single cell RNA-sequencing data
title_short Cell-type-specific co-expression inference from single cell RNA-sequencing data
title_sort cell-type-specific co-expression inference from single cell rna-sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9774209/
https://www.ncbi.nlm.nih.gov/pubmed/36561173
http://dx.doi.org/10.1101/2022.12.13.520181
work_keys_str_mv AT suchang celltypespecificcoexpressioninferencefromsinglecellrnasequencingdata
AT xuzichun celltypespecificcoexpressioninferencefromsinglecellrnasequencingdata
AT shanxinning celltypespecificcoexpressioninferencefromsinglecellrnasequencingdata
AT caibiao celltypespecificcoexpressioninferencefromsinglecellrnasequencingdata
AT zhaohongyu celltypespecificcoexpressioninferencefromsinglecellrnasequencingdata
AT zhangjingfei celltypespecificcoexpressioninferencefromsinglecellrnasequencingdata