Cargando…
Robust gene coexpression networks using signed distance correlation
MOTIVATION: Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lac...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8557847/ https://www.ncbi.nlm.nih.gov/pubmed/33523234 http://dx.doi.org/10.1093/bioinformatics/btab041 |
_version_ | 1784592437056897024 |
---|---|
author | Pardo-Diaz, Javier Bozhilova, Lyuba V Beguerisse-Díaz, Mariano Poole, Philip S Deane, Charlotte M Reinert, Gesine |
author_facet | Pardo-Diaz, Javier Bozhilova, Lyuba V Beguerisse-Díaz, Mariano Poole, Philip S Deane, Charlotte M Reinert, Gesine |
author_sort | Pardo-Diaz, Javier |
collection | PubMed |
description | MOTIVATION: Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information. RESULTS: We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods, such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information. AVAILABILITY AND IMPLEMENTATION: Code is available online (https://github.com/javier-pardodiaz/sdcorGCN). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8557847 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-85578472021-11-01 Robust gene coexpression networks using signed distance correlation Pardo-Diaz, Javier Bozhilova, Lyuba V Beguerisse-Díaz, Mariano Poole, Philip S Deane, Charlotte M Reinert, Gesine Bioinformatics Original Papers MOTIVATION: Even within well-studied organisms, many genes lack useful functional annotations. One way to generate such functional information is to infer biological relationships between genes/proteins, using a network of gene coexpression data that includes functional annotations. However, the lack of trustworthy functional annotations can impede the validation of such networks. Hence, there is a need for a principled method to construct gene coexpression networks that capture biological information and are structurally stable even in the absence of functional information. RESULTS: We introduce the concept of signed distance correlation as a measure of dependency between two variables, and apply it to generate gene coexpression networks. Distance correlation offers a more intuitive approach to network construction than commonly used methods, such as Pearson correlation and mutual information. We propose a framework to generate self-consistent networks using signed distance correlation purely from gene expression data, with no additional information. We analyse data from three different organisms to illustrate how networks generated with our method are more stable and capture more biological information compared to networks obtained from Pearson correlation or mutual information. AVAILABILITY AND IMPLEMENTATION: Code is available online (https://github.com/javier-pardodiaz/sdcorGCN). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-02-01 /pmc/articles/PMC8557847/ /pubmed/33523234 http://dx.doi.org/10.1093/bioinformatics/btab041 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Pardo-Diaz, Javier Bozhilova, Lyuba V Beguerisse-Díaz, Mariano Poole, Philip S Deane, Charlotte M Reinert, Gesine Robust gene coexpression networks using signed distance correlation |
title | Robust gene coexpression networks using signed distance correlation |
title_full | Robust gene coexpression networks using signed distance correlation |
title_fullStr | Robust gene coexpression networks using signed distance correlation |
title_full_unstemmed | Robust gene coexpression networks using signed distance correlation |
title_short | Robust gene coexpression networks using signed distance correlation |
title_sort | robust gene coexpression networks using signed distance correlation |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8557847/ https://www.ncbi.nlm.nih.gov/pubmed/33523234 http://dx.doi.org/10.1093/bioinformatics/btab041 |
work_keys_str_mv | AT pardodiazjavier robustgenecoexpressionnetworksusingsigneddistancecorrelation AT bozhilovalyubav robustgenecoexpressionnetworksusingsigneddistancecorrelation AT beguerissediazmariano robustgenecoexpressionnetworksusingsigneddistancecorrelation AT poolephilips robustgenecoexpressionnetworksusingsigneddistancecorrelation AT deanecharlottem robustgenecoexpressionnetworksusingsigneddistancecorrelation AT reinertgesine robustgenecoexpressionnetworksusingsigneddistancecorrelation |