Cargando…

Robust gene coexpression networks using signed distance correlation

MOTIVATION: Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lac...

Descripción completa

Detalles Bibliográficos
Autores principales: Pardo-Diaz, Javier, Bozhilova, Lyuba V, Beguerisse-Díaz, Mariano, Poole, Philip S, Deane, Charlotte M, Reinert, Gesine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8557847/
https://www.ncbi.nlm.nih.gov/pubmed/33523234
http://dx.doi.org/10.1093/bioinformatics/btab041
_version_ 1784592437056897024
author Pardo-Diaz, Javier
Bozhilova, Lyuba V
Beguerisse-Díaz, Mariano
Poole, Philip S
Deane, Charlotte M
Reinert, Gesine
author_facet Pardo-Diaz, Javier
Bozhilova, Lyuba V
Beguerisse-Díaz, Mariano
Poole, Philip S
Deane, Charlotte M
Reinert, Gesine
author_sort Pardo-Diaz, Javier
collection PubMed
description MOTIVATION: Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information. RESULTS: We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods, such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information. AVAILABILITY AND IMPLEMENTATION: Code is available online (https://github.com/javier-pardodiaz/sdcorGCN). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8557847
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85578472021-11-01 Robust gene coexpression networks using signed distance correlation Pardo-Diaz, Javier Bozhilova, Lyuba V Beguerisse-Díaz, Mariano Poole, Philip S Deane, Charlotte M Reinert, Gesine Bioinformatics Original Papers MOTIVATION: Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information. RESULTS: We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods, such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information. AVAILABILITY AND IMPLEMENTATION: Code is available online (https://github.com/javier-pardodiaz/sdcorGCN). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-02-01 /pmc/articles/PMC8557847/ /pubmed/33523234 http://dx.doi.org/10.1093/bioinformatics/btab041 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Pardo-Diaz, Javier
Bozhilova, Lyuba V
Beguerisse-Díaz, Mariano
Poole, Philip S
Deane, Charlotte M
Reinert, Gesine
Robust gene coexpression networks using signed distance correlation
title Robust gene coexpression networks using signed distance correlation
title_full Robust gene coexpression networks using signed distance correlation
title_fullStr Robust gene coexpression networks using signed distance correlation
title_full_unstemmed Robust gene coexpression networks using signed distance correlation
title_short Robust gene coexpression networks using signed distance correlation
title_sort robust gene coexpression networks using signed distance correlation
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8557847/
https://www.ncbi.nlm.nih.gov/pubmed/33523234
http://dx.doi.org/10.1093/bioinformatics/btab041
work_keys_str_mv AT pardodiazjavier robustgenecoexpressionnetworksusingsigneddistancecorrelation
AT bozhilovalyubav robustgenecoexpressionnetworksusingsigneddistancecorrelation
AT beguerissediazmariano robustgenecoexpressionnetworksusingsigneddistancecorrelation
AT poolephilips robustgenecoexpressionnetworksusingsigneddistancecorrelation
AT deanecharlottem robustgenecoexpressionnetworksusingsigneddistancecorrelation
AT reinertgesine robustgenecoexpressionnetworksusingsigneddistancecorrelation