Cargando…

A Novel Calibration Step in Gene Co-Expression Network Construction

High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a...

Descripción completa

Detalles Bibliográficos
Autores principales: Aghaieabiane, Niloofar, Koutis, Ioannis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581019/
https://www.ncbi.nlm.nih.gov/pubmed/36303738
http://dx.doi.org/10.3389/fbinf.2021.704817
_version_ 1784812523612012544
author Aghaieabiane, Niloofar
Koutis, Ioannis
author_facet Aghaieabiane, Niloofar
Koutis, Ioannis
author_sort Aghaieabiane, Niloofar
collection PubMed
description High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a GCN, nodes correspond to genes, and the weight of the connection between two nodes is a measure of similarity in the expression behavior of the two genes. In general, GCN construction and analysis includes three steps; 1) calculating a similarity value for each pair of genes 2) using these similarity values to construct a fully connected weighted network 3) finding clusters of genes in the network, commonly called modules. The specific implementation of these three steps can significantly impact the final output and the downstream biological analysis. GCN construction is a well-studied topic. Existing algorithms rely on relatively simple statistical and mathematical tools to implement these steps. Currently, software package WGCNA appears to be the most widely accepted standard. We hypothesize that the raw features provided by sequencing data can be leveraged to extract modules of higher quality. A novel preprocessing step of the gene expression data set is introduced that in effect calibrates the expression levels of individual genes, before computing pairwise similarities. Further, the similarity is computed as an inner-product of positive vectors. In experiments, this provides a significant improvement over WGCNA, as measured by aggregate p-values of the gene ontology term enrichment of the computed modules.
format Online
Article
Text
id pubmed-9581019
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95810192022-10-26 A Novel Calibration Step in Gene Co-Expression Network Construction Aghaieabiane, Niloofar Koutis, Ioannis Front Bioinform Bioinformatics High-throughput technologies such as DNA microarrays and RNA-sequencing are used to measure the expression levels of large numbers of genes simultaneously. To support the extraction of biological knowledge, individual gene expression levels are transformed to Gene Co-expression Networks (GCNs). In a GCN, nodes correspond to genes, and the weight of the connection between two nodes is a measure of similarity in the expression behavior of the two genes. In general, GCN construction and analysis includes three steps; 1) calculating a similarity value for each pair of genes 2) using these similarity values to construct a fully connected weighted network 3) finding clusters of genes in the network, commonly called modules. The specific implementation of these three steps can significantly impact the final output and the downstream biological analysis. GCN construction is a well-studied topic. Existing algorithms rely on relatively simple statistical and mathematical tools to implement these steps. Currently, software package WGCNA appears to be the most widely accepted standard. We hypothesize that the raw features provided by sequencing data can be leveraged to extract modules of higher quality. A novel preprocessing step of the gene expression data set is introduced that in effect calibrates the expression levels of individual genes, before computing pairwise similarities. Further, the similarity is computed as an inner-product of positive vectors. In experiments, this provides a significant improvement over WGCNA, as measured by aggregate p-values of the gene ontology term enrichment of the computed modules. Frontiers Media S.A. 2021-11-23 /pmc/articles/PMC9581019/ /pubmed/36303738 http://dx.doi.org/10.3389/fbinf.2021.704817 Text en Copyright © 2021 Aghaieabiane and Koutis. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Aghaieabiane, Niloofar
Koutis, Ioannis
A Novel Calibration Step in Gene Co-Expression Network Construction
title A Novel Calibration Step in Gene Co-Expression Network Construction
title_full A Novel Calibration Step in Gene Co-Expression Network Construction
title_fullStr A Novel Calibration Step in Gene Co-Expression Network Construction
title_full_unstemmed A Novel Calibration Step in Gene Co-Expression Network Construction
title_short A Novel Calibration Step in Gene Co-Expression Network Construction
title_sort novel calibration step in gene co-expression network construction
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581019/
https://www.ncbi.nlm.nih.gov/pubmed/36303738
http://dx.doi.org/10.3389/fbinf.2021.704817
work_keys_str_mv AT aghaieabianeniloofar anovelcalibrationstepingenecoexpressionnetworkconstruction
AT koutisioannis anovelcalibrationstepingenecoexpressionnetworkconstruction
AT aghaieabianeniloofar novelcalibrationstepingenecoexpressionnetworkconstruction
AT koutisioannis novelcalibrationstepingenecoexpressionnetworkconstruction