Cargando…

Debiased personalized gene coexpression networks for population-scale scRNA-seq data

Population-scale single-cell RNA-seq (scRNA-seq) data sets create unique opportunities for quantifying expression variation across individuals at the gene coexpression network level. Estimation of coexpression networks is well established for bulk RNA-seq; however, single-cell measurements pose nove...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Shan, Keleş, Sündüz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519377/
https://www.ncbi.nlm.nih.gov/pubmed/37295843
http://dx.doi.org/10.1101/gr.277363.122
_version_ 1785145953594900480
author Lu, Shan
Keleş, Sündüz
author_facet Lu, Shan
Keleş, Sündüz
author_sort Lu, Shan
collection PubMed
description Population-scale single-cell RNA-seq (scRNA-seq) data sets create unique opportunities for quantifying expression variation across individuals at the gene coexpression network level. Estimation of coexpression networks is well established for bulk RNA-seq; however, single-cell measurements pose novel challenges owing to technical limitations and noise levels of this technology. Gene–gene correlation estimates from scRNA-seq tend to be severely biased toward zero for genes with low and sparse expression. Here, we present Dozer to debias gene–gene correlation estimates from scRNA-seq data sets and accurately quantify network-level variation across individuals. Dozer corrects correlation estimates in the general Poisson measurement model and provides a metric to quantify genes measured with high noise. Computational experiments establish that Dozer estimates are robust to mean expression levels of the genes and the sequencing depths of the data sets. Compared with alternatives, Dozer results in fewer false-positive edges in the coexpression networks, yields more accurate estimates of network centrality measures and modules, and improves the faithfulness of networks estimated from separate batches of the data sets. We showcase unique analyses enabled by Dozer in two population-scale scRNA-seq applications. Coexpression network–based centrality analysis of multiple differentiating human induced pluripotent stem cell (iPSC) lines yields biologically coherent gene groups that are associated with iPSC differentiation efficiency. Application with population-scale scRNA-seq of oligodendrocytes from postmortem human tissues of Alzheimer's disease and controls uniquely reveals coexpression modules of innate immune response with distinct coexpression levels between the diagnoses. Dozer represents an important advance in estimating personalized coexpression networks from scRNA-seq data.
format Online
Article
Text
id pubmed-10519377
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-105193772023-12-01 Debiased personalized gene coexpression networks for population-scale scRNA-seq data Lu, Shan Keleş, Sündüz Genome Res Methods Population-scale single-cell RNA-seq (scRNA-seq) data sets create unique opportunities for quantifying expression variation across individuals at the gene coexpression network level. Estimation of coexpression networks is well established for bulk RNA-seq; however, single-cell measurements pose novel challenges owing to technical limitations and noise levels of this technology. Gene–gene correlation estimates from scRNA-seq tend to be severely biased toward zero for genes with low and sparse expression. Here, we present Dozer to debias gene–gene correlation estimates from scRNA-seq data sets and accurately quantify network-level variation across individuals. Dozer corrects correlation estimates in the general Poisson measurement model and provides a metric to quantify genes measured with high noise. Computational experiments establish that Dozer estimates are robust to mean expression levels of the genes and the sequencing depths of the data sets. Compared with alternatives, Dozer results in fewer false-positive edges in the coexpression networks, yields more accurate estimates of network centrality measures and modules, and improves the faithfulness of networks estimated from separate batches of the data sets. We showcase unique analyses enabled by Dozer in two population-scale scRNA-seq applications. Coexpression network–based centrality analysis of multiple differentiating human induced pluripotent stem cell (iPSC) lines yields biologically coherent gene groups that are associated with iPSC differentiation efficiency. Application with population-scale scRNA-seq of oligodendrocytes from postmortem human tissues of Alzheimer's disease and controls uniquely reveals coexpression modules of innate immune response with distinct coexpression levels between the diagnoses. Dozer represents an important advance in estimating personalized coexpression networks from scRNA-seq data. Cold Spring Harbor Laboratory Press 2023-06 /pmc/articles/PMC10519377/ /pubmed/37295843 http://dx.doi.org/10.1101/gr.277363.122 Text en © 2023 Lu and Keleş; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Methods
Lu, Shan
Keleş, Sündüz
Debiased personalized gene coexpression networks for population-scale scRNA-seq data
title Debiased personalized gene coexpression networks for population-scale scRNA-seq data
title_full Debiased personalized gene coexpression networks for population-scale scRNA-seq data
title_fullStr Debiased personalized gene coexpression networks for population-scale scRNA-seq data
title_full_unstemmed Debiased personalized gene coexpression networks for population-scale scRNA-seq data
title_short Debiased personalized gene coexpression networks for population-scale scRNA-seq data
title_sort debiased personalized gene coexpression networks for population-scale scrna-seq data
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519377/
https://www.ncbi.nlm.nih.gov/pubmed/37295843
http://dx.doi.org/10.1101/gr.277363.122
work_keys_str_mv AT lushan debiasedpersonalizedgenecoexpressionnetworksforpopulationscalescrnaseqdata
AT kelessunduz debiasedpersonalizedgenecoexpressionnetworksforpopulationscalescrnaseqdata