Cargando…

A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology

Since genes do not function individually, the gene module is considered an important tool for interpreting gene expression profiles. In order to consider both functional similarity and expression similarity in module identification, GMIGAGO, a functional Gene Module Identification algorithm based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yan, Shi, Weiyu, Sun, Yeqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9936134/
https://www.ncbi.nlm.nih.gov/pubmed/36797662
http://dx.doi.org/10.1186/s12864-023-09157-z
_version_ 1784890172507160576
author Zhang, Yan
Shi, Weiyu
Sun, Yeqing
author_facet Zhang, Yan
Shi, Weiyu
Sun, Yeqing
author_sort Zhang, Yan
collection PubMed
description Since genes do not function individually, the gene module is considered an important tool for interpreting gene expression profiles. In order to consider both functional similarity and expression similarity in module identification, GMIGAGO, a functional Gene Module Identification algorithm based on Genetic Algorithm and Gene Ontology, was proposed in this work. GMIGAGO is an overlapping gene module identification algorithm, which mainly includes two stages: In the first stage (initial identification of gene modules), Improved Partitioning Around Medoids Based on Genetic Algorithm (PAM-GA) is used for the initial clustering on gene expression profiling, and traditional gene co-expression modules can be obtained. Only similarity of expression levels is considered at this stage. In the second stage (optimization of functional similarity within gene modules), Genetic Algorithm for Functional Similarity Optimization (FSO-GA) is used to optimize gene modules based on gene ontology, and functional similarity within gene modules can be improved. Without loss of generality, we compared GMIGAGO with state-of-the-art gene module identification methods on six gene expression datasets, and GMIGAGO identified the gene modules with the highest functional similarity (much higher than state-of-the-art algorithms). GMIGAGO was applied in BRCA, THCA, HNSC, COVID-19, Stem, and Radiation datasets, and it identified some interesting modules which performed important biological functions. The hub genes in these modules could be used as potential targets for diseases or radiation protection. In summary, GMIGAGO has excellent performance in mining molecular mechanisms, and it can also identify potential biomarkers for individual precision therapy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09157-z.
format Online
Article
Text
id pubmed-9936134
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-99361342023-02-17 A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology Zhang, Yan Shi, Weiyu Sun, Yeqing BMC Genomics Research Since genes do not function individually, the gene module is considered an important tool for interpreting gene expression profiles. In order to consider both functional similarity and expression similarity in module identification, GMIGAGO, a functional Gene Module Identification algorithm based on Genetic Algorithm and Gene Ontology, was proposed in this work. GMIGAGO is an overlapping gene module identification algorithm, which mainly includes two stages: In the first stage (initial identification of gene modules), Improved Partitioning Around Medoids Based on Genetic Algorithm (PAM-GA) is used for the initial clustering on gene expression profiling, and traditional gene co-expression modules can be obtained. Only similarity of expression levels is considered at this stage. In the second stage (optimization of functional similarity within gene modules), Genetic Algorithm for Functional Similarity Optimization (FSO-GA) is used to optimize gene modules based on gene ontology, and functional similarity within gene modules can be improved. Without loss of generality, we compared GMIGAGO with state-of-the-art gene module identification methods on six gene expression datasets, and GMIGAGO identified the gene modules with the highest functional similarity (much higher than state-of-the-art algorithms). GMIGAGO was applied in BRCA, THCA, HNSC, COVID-19, Stem, and Radiation datasets, and it identified some interesting modules which performed important biological functions. The hub genes in these modules could be used as potential targets for diseases or radiation protection. In summary, GMIGAGO has excellent performance in mining molecular mechanisms, and it can also identify potential biomarkers for individual precision therapy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09157-z. BioMed Central 2023-02-17 /pmc/articles/PMC9936134/ /pubmed/36797662 http://dx.doi.org/10.1186/s12864-023-09157-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhang, Yan
Shi, Weiyu
Sun, Yeqing
A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
title A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
title_full A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
title_fullStr A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
title_full_unstemmed A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
title_short A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
title_sort functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9936134/
https://www.ncbi.nlm.nih.gov/pubmed/36797662
http://dx.doi.org/10.1186/s12864-023-09157-z
work_keys_str_mv AT zhangyan afunctionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology
AT shiweiyu afunctionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology
AT sunyeqing afunctionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology
AT zhangyan functionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology
AT shiweiyu functionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology
AT sunyeqing functionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology