Cargando…
A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology
Since genes do not function individually, the gene module is considered an important tool for interpreting gene expression profiles. In order to consider both functional similarity and expression similarity in module identification, GMIGAGO, a functional Gene Module Identification algorithm based on...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9936134/ https://www.ncbi.nlm.nih.gov/pubmed/36797662 http://dx.doi.org/10.1186/s12864-023-09157-z |
_version_ | 1784890172507160576 |
---|---|
author | Zhang, Yan Shi, Weiyu Sun, Yeqing |
author_facet | Zhang, Yan Shi, Weiyu Sun, Yeqing |
author_sort | Zhang, Yan |
collection | PubMed |
description | Since genes do not function individually, the gene module is considered an important tool for interpreting gene expression profiles. In order to consider both functional similarity and expression similarity in module identification, GMIGAGO, a functional Gene Module Identification algorithm based on Genetic Algorithm and Gene Ontology, was proposed in this work. GMIGAGO is an overlapping gene module identification algorithm, which mainly includes two stages: In the first stage (initial identification of gene modules), Improved Partitioning Around Medoids Based on Genetic Algorithm (PAM-GA) is used for the initial clustering on gene expression profiling, and traditional gene co-expression modules can be obtained. Only similarity of expression levels is considered at this stage. In the second stage (optimization of functional similarity within gene modules), Genetic Algorithm for Functional Similarity Optimization (FSO-GA) is used to optimize gene modules based on gene ontology, and functional similarity within gene modules can be improved. Without loss of generality, we compared GMIGAGO with state-of-the-art gene module identification methods on six gene expression datasets, and GMIGAGO identified the gene modules with the highest functional similarity (much higher than state-of-the-art algorithms). GMIGAGO was applied in BRCA, THCA, HNSC, COVID-19, Stem, and Radiation datasets, and it identified some interesting modules which performed important biological functions. The hub genes in these modules could be used as potential targets for diseases or radiation protection. In summary, GMIGAGO has excellent performance in mining molecular mechanisms, and it can also identify potential biomarkers for individual precision therapy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09157-z. |
format | Online Article Text |
id | pubmed-9936134 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-99361342023-02-17 A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology Zhang, Yan Shi, Weiyu Sun, Yeqing BMC Genomics Research Since genes do not function individually, the gene module is considered an important tool for interpreting gene expression profiles. In order to consider both functional similarity and expression similarity in module identification, GMIGAGO, a functional Gene Module Identification algorithm based on Genetic Algorithm and Gene Ontology, was proposed in this work. GMIGAGO is an overlapping gene module identification algorithm, which mainly includes two stages: In the first stage (initial identification of gene modules), Improved Partitioning Around Medoids Based on Genetic Algorithm (PAM-GA) is used for the initial clustering on gene expression profiling, and traditional gene co-expression modules can be obtained. Only similarity of expression levels is considered at this stage. In the second stage (optimization of functional similarity within gene modules), Genetic Algorithm for Functional Similarity Optimization (FSO-GA) is used to optimize gene modules based on gene ontology, and functional similarity within gene modules can be improved. Without loss of generality, we compared GMIGAGO with state-of-the-art gene module identification methods on six gene expression datasets, and GMIGAGO identified the gene modules with the highest functional similarity (much higher than state-of-the-art algorithms). GMIGAGO was applied in BRCA, THCA, HNSC, COVID-19, Stem, and Radiation datasets, and it identified some interesting modules which performed important biological functions. The hub genes in these modules could be used as potential targets for diseases or radiation protection. In summary, GMIGAGO has excellent performance in mining molecular mechanisms, and it can also identify potential biomarkers for individual precision therapy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09157-z. BioMed Central 2023-02-17 /pmc/articles/PMC9936134/ /pubmed/36797662 http://dx.doi.org/10.1186/s12864-023-09157-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhang, Yan Shi, Weiyu Sun, Yeqing A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology |
title | A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology |
title_full | A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology |
title_fullStr | A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology |
title_full_unstemmed | A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology |
title_short | A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology |
title_sort | functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9936134/ https://www.ncbi.nlm.nih.gov/pubmed/36797662 http://dx.doi.org/10.1186/s12864-023-09157-z |
work_keys_str_mv | AT zhangyan afunctionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology AT shiweiyu afunctionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology AT sunyeqing afunctionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology AT zhangyan functionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology AT shiweiyu functionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology AT sunyeqing functionalgenemoduleidentificationalgorithmingeneexpressiondatabasedongeneticalgorithmandgeneontology |