Cargando…

Addressing the mean-correlation relationship in co-expression analysis

Estimates of correlation between pairs of genes in co-expression analysis are commonly used to construct networks among genes using gene expression data. As previously noted, the distribution of such correlations depends on the observed expression level of the involved genes, which we refer to this...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yi, Hicks, Stephanie C., Hansen, Kasper D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9009771/
https://www.ncbi.nlm.nih.gov/pubmed/35353807
http://dx.doi.org/10.1371/journal.pcbi.1009954
_version_ 1784687335742373888
author Wang, Yi
Hicks, Stephanie C.
Hansen, Kasper D.
author_facet Wang, Yi
Hicks, Stephanie C.
Hansen, Kasper D.
author_sort Wang, Yi
collection PubMed
description Estimates of correlation between pairs of genes in co-expression analysis are commonly used to construct networks among genes using gene expression data. As previously noted, the distribution of such correlations depends on the observed expression level of the involved genes, which we refer to this as a mean-correlation relationship in RNA-seq data, both bulk and single-cell. This dependence introduces an unwanted technical bias in co-expression analysis whereby highly expressed genes are more likely to be highly correlated. Such a relationship is not observed in protein-protein interaction data, suggesting that it is not reflecting biology. Ignoring this bias can lead to missing potentially biologically relevant pairs of genes that are lowly expressed, such as transcription factors. To address this problem, we introduce spatial quantile normalization (SpQN), a method for normalizing local distributions in a correlation matrix. We show that spatial quantile normalization removes the mean-correlation relationship and corrects the expression bias in network reconstruction.
format Online
Article
Text
id pubmed-9009771
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-90097712022-04-15 Addressing the mean-correlation relationship in co-expression analysis Wang, Yi Hicks, Stephanie C. Hansen, Kasper D. PLoS Comput Biol Research Article Estimates of correlation between pairs of genes in co-expression analysis are commonly used to construct networks among genes using gene expression data. As previously noted, the distribution of such correlations depends on the observed expression level of the involved genes, which we refer to this as a mean-correlation relationship in RNA-seq data, both bulk and single-cell. This dependence introduces an unwanted technical bias in co-expression analysis whereby highly expressed genes are more likely to be highly correlated. Such a relationship is not observed in protein-protein interaction data, suggesting that it is not reflecting biology. Ignoring this bias can lead to missing potentially biologically relevant pairs of genes that are lowly expressed, such as transcription factors. To address this problem, we introduce spatial quantile normalization (SpQN), a method for normalizing local distributions in a correlation matrix. We show that spatial quantile normalization removes the mean-correlation relationship and corrects the expression bias in network reconstruction. Public Library of Science 2022-03-30 /pmc/articles/PMC9009771/ /pubmed/35353807 http://dx.doi.org/10.1371/journal.pcbi.1009954 Text en © 2022 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Yi
Hicks, Stephanie C.
Hansen, Kasper D.
Addressing the mean-correlation relationship in co-expression analysis
title Addressing the mean-correlation relationship in co-expression analysis
title_full Addressing the mean-correlation relationship in co-expression analysis
title_fullStr Addressing the mean-correlation relationship in co-expression analysis
title_full_unstemmed Addressing the mean-correlation relationship in co-expression analysis
title_short Addressing the mean-correlation relationship in co-expression analysis
title_sort addressing the mean-correlation relationship in co-expression analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9009771/
https://www.ncbi.nlm.nih.gov/pubmed/35353807
http://dx.doi.org/10.1371/journal.pcbi.1009954
work_keys_str_mv AT wangyi addressingthemeancorrelationrelationshipincoexpressionanalysis
AT hicksstephaniec addressingthemeancorrelationrelationshipincoexpressionanalysis
AT hansenkasperd addressingthemeancorrelationrelationshipincoexpressionanalysis