Cargando…

Distance correlation application to gene co-expression network analysis

BACKGROUND: To construct gene co-expression networks, it is necessary to evaluate the correlation between different gene expression profiles. However, commonly used correlation metrics, including both linear (such as Pearson’s correlation) and monotonic (such as Spearman’s correlation) dependence me...

Descripción completa

Detalles Bibliográficos
Autores principales: Hou, Jie, Ye, Xiufen, Feng, Weixing, Zhang, Qiaosheng, Han, Yatong, Liu, Yusong, Li, Yu, Wei, Yufen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8862277/
https://www.ncbi.nlm.nih.gov/pubmed/35193539
http://dx.doi.org/10.1186/s12859-022-04609-x
_version_ 1784655024630005760
author Hou, Jie
Ye, Xiufen
Feng, Weixing
Zhang, Qiaosheng
Han, Yatong
Liu, Yusong
Li, Yu
Wei, Yufen
author_facet Hou, Jie
Ye, Xiufen
Feng, Weixing
Zhang, Qiaosheng
Han, Yatong
Liu, Yusong
Li, Yu
Wei, Yufen
author_sort Hou, Jie
collection PubMed
description BACKGROUND: To construct gene co-expression networks, it is necessary to evaluate the correlation between different gene expression profiles. However, commonly used correlation metrics, including both linear (such as Pearson’s correlation) and monotonic (such as Spearman’s correlation) dependence metrics, are not enough to observe the nature of real biological systems. Hence, introducing a more informative correlation metric when constructing gene co-expression networks is still an interesting topic. RESULTS: In this paper, we test distance correlation, a correlation metric integrating both linear and non-linear dependence, with other three typical metrics (Pearson’s correlation, Spearman’s correlation, and maximal information coefficient) on four different arrays (macrophage and liver) and RNA-seq (cervical cancer and pancreatic cancer) datasets. Among all the metrics, distance correlation is distribution free and can provide better performance on complex relationships and anti-outlier. Furthermore, distance correlation is applied to Weighted Gene Co-expression Network Analysis (WGCNA) for constructing a gene co-expression network analysis method which we named Distance Correlation-based Weighted Gene Co-expression Network Analysis (DC-WGCNA). Compared with traditional WGCNA, DC-WGCNA can enhance the result of enrichment analysis and improve the module stability. CONCLUSIONS: Distance correlation is better at revealing complex biological relationships between gene profiles compared with other correlation metrics, which contribute to more meaningful modules when analyzing gene co-expression networks. However, due to the high time complexity of distance correlation, the implementation requires more computer memory. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04609-x.
format Online
Article
Text
id pubmed-8862277
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88622772022-02-23 Distance correlation application to gene co-expression network analysis Hou, Jie Ye, Xiufen Feng, Weixing Zhang, Qiaosheng Han, Yatong Liu, Yusong Li, Yu Wei, Yufen BMC Bioinformatics Research Article BACKGROUND: To construct gene co-expression networks, it is necessary to evaluate the correlation between different gene expression profiles. However, commonly used correlation metrics, including both linear (such as Pearson’s correlation) and monotonic (such as Spearman’s correlation) dependence metrics, are not enough to observe the nature of real biological systems. Hence, introducing a more informative correlation metric when constructing gene co-expression networks is still an interesting topic. RESULTS: In this paper, we test distance correlation, a correlation metric integrating both linear and non-linear dependence, with other three typical metrics (Pearson’s correlation, Spearman’s correlation, and maximal information coefficient) on four different arrays (macrophage and liver) and RNA-seq (cervical cancer and pancreatic cancer) datasets. Among all the metrics, distance correlation is distribution free and can provide better performance on complex relationships and anti-outlier. Furthermore, distance correlation is applied to Weighted Gene Co-expression Network Analysis (WGCNA) for constructing a gene co-expression network analysis method which we named Distance Correlation-based Weighted Gene Co-expression Network Analysis (DC-WGCNA). Compared with traditional WGCNA, DC-WGCNA can enhance the result of enrichment analysis and improve the module stability. CONCLUSIONS: Distance correlation is better at revealing complex biological relationships between gene profiles compared with other correlation metrics, which contribute to more meaningful modules when analyzing gene co-expression networks. However, due to the high time complexity of distance correlation, the implementation requires more computer memory. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04609-x. BioMed Central 2022-02-21 /pmc/articles/PMC8862277/ /pubmed/35193539 http://dx.doi.org/10.1186/s12859-022-04609-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Hou, Jie
Ye, Xiufen
Feng, Weixing
Zhang, Qiaosheng
Han, Yatong
Liu, Yusong
Li, Yu
Wei, Yufen
Distance correlation application to gene co-expression network analysis
title Distance correlation application to gene co-expression network analysis
title_full Distance correlation application to gene co-expression network analysis
title_fullStr Distance correlation application to gene co-expression network analysis
title_full_unstemmed Distance correlation application to gene co-expression network analysis
title_short Distance correlation application to gene co-expression network analysis
title_sort distance correlation application to gene co-expression network analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8862277/
https://www.ncbi.nlm.nih.gov/pubmed/35193539
http://dx.doi.org/10.1186/s12859-022-04609-x
work_keys_str_mv AT houjie distancecorrelationapplicationtogenecoexpressionnetworkanalysis
AT yexiufen distancecorrelationapplicationtogenecoexpressionnetworkanalysis
AT fengweixing distancecorrelationapplicationtogenecoexpressionnetworkanalysis
AT zhangqiaosheng distancecorrelationapplicationtogenecoexpressionnetworkanalysis
AT hanyatong distancecorrelationapplicationtogenecoexpressionnetworkanalysis
AT liuyusong distancecorrelationapplicationtogenecoexpressionnetworkanalysis
AT liyu distancecorrelationapplicationtogenecoexpressionnetworkanalysis
AT weiyufen distancecorrelationapplicationtogenecoexpressionnetworkanalysis