Cargando…

A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study

SIMPLE SUMMARY: Due to lacking exploitation capability, traditional genetic algorithm cannot accurately identify the minimal best gene subset. Thus, the improved splicing method is introduced into a genetic algorithm to enhance exploitation capability for achieving balance between exploitation and e...

Descripción completa

Detalles Bibliográficos
Autores principales: Miao, Maoxuan, Wu, Jinran, Cai, Fengjing, Wang, You-Gan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772977/
https://www.ncbi.nlm.nih.gov/pubmed/35049823
http://dx.doi.org/10.3390/ani12020201
_version_ 1784635970857992192
author Miao, Maoxuan
Wu, Jinran
Cai, Fengjing
Wang, You-Gan
author_facet Miao, Maoxuan
Wu, Jinran
Cai, Fengjing
Wang, You-Gan
author_sort Miao, Maoxuan
collection PubMed
description SIMPLE SUMMARY: Due to lacking exploitation capability, traditional genetic algorithm cannot accurately identify the minimal best gene subset. Thus, the improved splicing method is introduced into a genetic algorithm to enhance exploitation capability for achieving balance between exploitation and exploration of GA. It can effectively identify true gene subsets with high probability. Furthermore, a dataset of the body weight of Hu sheep has been used to show that the proposed method can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms including genetic algorithm and adaptive best-subset selection algorithm. ABSTRACT: Selecting the minimal best subset out of a huge number of factors for influencing the response is a fundamental and very challenging NP-hard problem because the presence of many redundant genes results in over-fitting easily while missing an important gene can more detrimental impact on predictions, and computation is prohibitive for exhaust search. We propose a modified memetic algorithm (MA) based on an improved splicing method to overcome the problems in the traditional genetic algorithm exploitation capability and dimension reduction in the predictor variables. The new algorithm accelerates the search in identifying the minimal best subset of genes by incorporating it into the new local search operator and hence improving the splicing method. The improvement is also due to another two novel aspects: (a) updating subsets of genes iteratively until the no more reduction in the loss function by splicing and increasing the probability of selecting the true subsets of genes; and (b) introducing add and del operators based on backward sacrifice into the splicing method to limit the size of gene subsets. Additionally, according to the experimental results, our proposed optimizer can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms. Moreover, the mutation operator is replaced by it to enhance exploitation capability and initial individuals are improved by it to enhance efficiency of search. A dataset of the body weight of Hu sheep was used to evaluate the superiority of the modified MA against the genetic algorithm. According to our experimental results, our proposed optimizer can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms including the most advanced adaptive best-subset selection algorithm.
format Online
Article
Text
id pubmed-8772977
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87729772022-01-21 A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study Miao, Maoxuan Wu, Jinran Cai, Fengjing Wang, You-Gan Animals (Basel) Article SIMPLE SUMMARY: Due to lacking exploitation capability, traditional genetic algorithm cannot accurately identify the minimal best gene subset. Thus, the improved splicing method is introduced into a genetic algorithm to enhance exploitation capability for achieving balance between exploitation and exploration of GA. It can effectively identify true gene subsets with high probability. Furthermore, a dataset of the body weight of Hu sheep has been used to show that the proposed method can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms including genetic algorithm and adaptive best-subset selection algorithm. ABSTRACT: Selecting the minimal best subset out of a huge number of factors for influencing the response is a fundamental and very challenging NP-hard problem because the presence of many redundant genes results in over-fitting easily while missing an important gene can more detrimental impact on predictions, and computation is prohibitive for exhaust search. We propose a modified memetic algorithm (MA) based on an improved splicing method to overcome the problems in the traditional genetic algorithm exploitation capability and dimension reduction in the predictor variables. The new algorithm accelerates the search in identifying the minimal best subset of genes by incorporating it into the new local search operator and hence improving the splicing method. The improvement is also due to another two novel aspects: (a) updating subsets of genes iteratively until the no more reduction in the loss function by splicing and increasing the probability of selecting the true subsets of genes; and (b) introducing add and del operators based on backward sacrifice into the splicing method to limit the size of gene subsets. Additionally, according to the experimental results, our proposed optimizer can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms. Moreover, the mutation operator is replaced by it to enhance exploitation capability and initial individuals are improved by it to enhance efficiency of search. A dataset of the body weight of Hu sheep was used to evaluate the superiority of the modified MA against the genetic algorithm. According to our experimental results, our proposed optimizer can obtain a better minimal subset of genes with a few iterations, compared with all considered algorithms including the most advanced adaptive best-subset selection algorithm. MDPI 2022-01-15 /pmc/articles/PMC8772977/ /pubmed/35049823 http://dx.doi.org/10.3390/ani12020201 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Miao, Maoxuan
Wu, Jinran
Cai, Fengjing
Wang, You-Gan
A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study
title A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study
title_full A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study
title_fullStr A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study
title_full_unstemmed A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study
title_short A Modified Memetic Algorithm with an Application to Gene Selection in a Sheep Body Weight Study
title_sort modified memetic algorithm with an application to gene selection in a sheep body weight study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772977/
https://www.ncbi.nlm.nih.gov/pubmed/35049823
http://dx.doi.org/10.3390/ani12020201
work_keys_str_mv AT miaomaoxuan amodifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy
AT wujinran amodifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy
AT caifengjing amodifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy
AT wangyougan amodifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy
AT miaomaoxuan modifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy
AT wujinran modifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy
AT caifengjing modifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy
AT wangyougan modifiedmemeticalgorithmwithanapplicationtogeneselectioninasheepbodyweightstudy