Cargando…
A Novel Calibration Step in Gene Co-Expression Network Construction
High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581019/ https://www.ncbi.nlm.nih.gov/pubmed/36303738 http://dx.doi.org/10.3389/fbinf.2021.704817 |
_version_ | 1784812523612012544 |
---|---|
author | Aghaieabiane, Niloofar Koutis, Ioannis |
author_facet | Aghaieabiane, Niloofar Koutis, Ioannis |
author_sort | Aghaieabiane, Niloofar |
collection | PubMed |
description | High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a GCN, nodes correspond to genes, and the weight of the connection between two nodes is a measure of similarity in the expression behavior of the two genes. In general, GCN construction and analysis includes three steps; 1) calculating a similarity value for each pair of genes 2) using these similarity values to construct a fully connected weighted network 3) finding clusters of genes in the network, commonly called modules. The specific implementation of these three steps can significantly impact the final output and the downstream biological analysis. GCN construction is a well-studied topic. Existing algorithms rely on relatively simple statistical and mathematical tools to implement these steps. Currently, software package WGCNA appears to be the most widely accepted standard. We hypothesize that the raw features provided by sequencing data can be leveraged to extract modules of higher quality. A novel preprocessing step of the gene expression data set is introduced that in effect calibrates the expression levels of individual genes, before computing pairwise similarities. Further, the similarity is computed as an inner-product of positive vectors. In experiments, this provides a significant improvement over WGCNA, as measured by aggregate p-values of the gene ontology term enrichment of the computed modules. |
format | Online Article Text |
id | pubmed-9581019 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95810192022-10-26 A Novel Calibration Step in Gene Co-Expression Network Construction Aghaieabiane, Niloofar Koutis, Ioannis Front Bioinform Bioinformatics High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a GCN, nodes correspond to genes, and the weight of the connection between two nodes is a measure of similarity in the expression behavior of the two genes. In general, GCN construction and analysis includes three steps; 1) calculating a similarity value for each pair of genes 2) using these similarity values to construct a fully connected weighted network 3) finding clusters of genes in the network, commonly called modules. The specific implementation of these three steps can significantly impact the final output and the downstream biological analysis. GCN construction is a well-studied topic. Existing algorithms rely on relatively simple statistical and mathematical tools to implement these steps. Currently, software package WGCNA appears to be the most widely accepted standard. We hypothesize that the raw features provided by sequencing data can be leveraged to extract modules of higher quality. A novel preprocessing step of the gene expression data set is introduced that in effect calibrates the expression levels of individual genes, before computing pairwise similarities. Further, the similarity is computed as an inner-product of positive vectors. In experiments, this provides a significant improvement over WGCNA, as measured by aggregate p-values of the gene ontology term enrichment of the computed modules. Frontiers Media S.A. 2021-11-23 /pmc/articles/PMC9581019/ /pubmed/36303738 http://dx.doi.org/10.3389/fbinf.2021.704817 Text en Copyright © 2021 Aghaieabiane and Koutis. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Aghaieabiane, Niloofar Koutis, Ioannis A Novel Calibration Step in Gene Co-Expression Network Construction |
title | A Novel Calibration Step in Gene Co-Expression Network Construction |
title_full | A Novel Calibration Step in Gene Co-Expression Network Construction |
title_fullStr | A Novel Calibration Step in Gene Co-Expression Network Construction |
title_full_unstemmed | A Novel Calibration Step in Gene Co-Expression Network Construction |
title_short | A Novel Calibration Step in Gene Co-Expression Network Construction |
title_sort | novel calibration step in gene co-expression network construction |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581019/ https://www.ncbi.nlm.nih.gov/pubmed/36303738 http://dx.doi.org/10.3389/fbinf.2021.704817 |
work_keys_str_mv | AT aghaieabianeniloofar anovelcalibrationstepingenecoexpressionnetworkconstruction AT koutisioannis anovelcalibrationstepingenecoexpressionnetworkconstruction AT aghaieabianeniloofar novelcalibrationstepingenecoexpressionnetworkconstruction AT koutisioannis novelcalibrationstepingenecoexpressionnetworkconstruction |