Cargando…

A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values

BACKGROUND: Gene co-expression networks (GCNs) can be used to determine gene regulation and attribute gene function to biological processes. Different high throughput technologies, including one and two-channel microarrays and RNA-sequencing, allow evaluating thousands of gene expression data simult...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuang, Junyao, Buchon, Nicolas, Michel, Kristin, Scoglio, Caterina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9082846/
https://www.ncbi.nlm.nih.gov/pubmed/35534830
http://dx.doi.org/10.1186/s12859-022-04697-9
_version_ 1784703292264153088
author Kuang, Junyao
Buchon, Nicolas
Michel, Kristin
Scoglio, Caterina
author_facet Kuang, Junyao
Buchon, Nicolas
Michel, Kristin
Scoglio, Caterina
author_sort Kuang, Junyao
collection PubMed
description BACKGROUND: Gene co-expression networks (GCNs) can be used to determine gene regulation and attribute gene function to biological processes. Different high throughput technologies, including one and two-channel microarrays and RNA-sequencing, allow evaluating thousands of gene expression data simultaneously, but these methodologies provide results that cannot be directly compared. Thus, it is complex to analyze co-expression relations between genes, especially when there are missing values arising for experimental reasons. Networks are a helpful tool for studying gene co-expression, where nodes represent genes and edges represent co-expression of pairs of genes. RESULTS: In this paper, we establish a method for constructing a gene co-expression network for the Anopheles gambiae transcriptome from 257 unique studies obtained with different methodologies and experimental designs. We introduce the sliding threshold approach to select node pairs with high Pearson correlation coefficients. The resulting network, which we name AgGCN1.0, is robust to random removal of conditions and has similar characteristics to small-world and scale-free networks. Analysis of network sub-graphs revealed that the core is largely comprised of genes that encode components of the mitochondrial respiratory chain and the ribosome, while different communities are enriched for genes involved in distinct biological processes. CONCLUSION: Analysis of the network reveals that both the architecture of the core sub-network and the network communities are based on gene function, supporting the power of the proposed method for GCN construction. Application of network science methodology reveals that the overall network structure is driven to maximize the integration of essential cellular functions, possibly allowing the flexibility to add novel functions.
format Online
Article
Text
id pubmed-9082846
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-90828462022-05-10 A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values Kuang, Junyao Buchon, Nicolas Michel, Kristin Scoglio, Caterina BMC Bioinformatics Research BACKGROUND: Gene co-expression networks (GCNs) can be used to determine gene regulation and attribute gene function to biological processes. Different high throughput technologies, including one and two-channel microarrays and RNA-sequencing, allow evaluating thousands of gene expression data simultaneously, but these methodologies provide results that cannot be directly compared. Thus, it is complex to analyze co-expression relations between genes, especially when there are missing values arising for experimental reasons. Networks are a helpful tool for studying gene co-expression, where nodes represent genes and edges represent co-expression of pairs of genes. RESULTS: In this paper, we establish a method for constructing a gene co-expression network for the Anopheles gambiae transcriptome from 257 unique studies obtained with different methodologies and experimental designs. We introduce the sliding threshold approach to select node pairs with high Pearson correlation coefficients. The resulting network, which we name AgGCN1.0, is robust to random removal of conditions and has similar characteristics to small-world and scale-free networks. Analysis of network sub-graphs revealed that the core is largely comprised of genes that encode components of the mitochondrial respiratory chain and the ribosome, while different communities are enriched for genes involved in distinct biological processes. CONCLUSION: Analysis of the network reveals that both the architecture of the core sub-network and the network communities are based on gene function, supporting the power of the proposed method for GCN construction. Application of network science methodology reveals that the overall network structure is driven to maximize the integration of essential cellular functions, possibly allowing the flexibility to add novel functions. BioMed Central 2022-05-09 /pmc/articles/PMC9082846/ /pubmed/35534830 http://dx.doi.org/10.1186/s12859-022-04697-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Kuang, Junyao
Buchon, Nicolas
Michel, Kristin
Scoglio, Caterina
A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values
title A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values
title_full A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values
title_fullStr A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values
title_full_unstemmed A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values
title_short A global [Formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values
title_sort global [formula: see text] gene co-expression network constructed from hundreds of experimental conditions with missing values
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9082846/
https://www.ncbi.nlm.nih.gov/pubmed/35534830
http://dx.doi.org/10.1186/s12859-022-04697-9
work_keys_str_mv AT kuangjunyao aglobalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues
AT buchonnicolas aglobalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues
AT michelkristin aglobalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues
AT scogliocaterina aglobalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues
AT kuangjunyao globalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues
AT buchonnicolas globalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues
AT michelkristin globalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues
AT scogliocaterina globalformulaseetextgenecoexpressionnetworkconstructedfromhundredsofexperimentalconditionswithmissingvalues