Cargando…

Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster

Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experime...

Descripción completa

Detalles Bibliográficos
Autores principales: Aromolaran, Olufemi Tony, Isewon, Itunu, Adedeji, Eunice, Oswald, Marcus, Adebiyi, Ezekiel, Koenig, Rainer, Oyelade, Jelili
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411809/
https://www.ncbi.nlm.nih.gov/pubmed/37556452
http://dx.doi.org/10.1371/journal.pone.0288023
_version_ 1785086752579387392
author Aromolaran, Olufemi Tony
Isewon, Itunu
Adedeji, Eunice
Oswald, Marcus
Adebiyi, Ezekiel
Koenig, Rainer
Oyelade, Jelili
author_facet Aromolaran, Olufemi Tony
Isewon, Itunu
Adedeji, Eunice
Oswald, Marcus
Adebiyi, Ezekiel
Koenig, Rainer
Oyelade, Jelili
author_sort Aromolaran, Olufemi Tony
collection PubMed
description Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experimental methods are laborious, time and resource consuming, hence computational techniques have been used to complement the experimental methods. Computational techniques such as supervised machine learning, or flux balance analysis are grossly limited due to the unavailability of required data for training the model or simulating the conditions for gene essentiality. This study developed a heuristic-enabled active machine learning method based on a light gradient boosting model to predict essential immune response and embryonic developmental genes in Drosophila melanogaster. We proposed a new sampling selection technique and introduced a heuristic function which replaces the human component in traditional active learning models. The heuristic function dynamically selects the unlabelled samples to improve the performance of the classifier in the next iteration. Testing the proposed model with four benchmark datasets, the proposed model showed superior performance when compared to traditional active learning models (random sampling and uncertainty sampling). Applying the model to identify conditionally essential genes, four novel essential immune response genes and a list of 48 novel genes that are essential in embryonic developmental condition were identified. We performed functional enrichment analysis of the predicted genes to elucidate their biological processes and the result evidence our predictions. Immune response and embryonic development related processes were significantly enriched in the essential immune response and embryonic developmental genes, respectively. Finally, we propose the predicted essential genes for future experimental studies and use of the developed tool accessible at http://heal.covenantuniversity.edu.ng for conditional essentiality predictions.
format Online
Article
Text
id pubmed-10411809
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104118092023-08-10 Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster Aromolaran, Olufemi Tony Isewon, Itunu Adedeji, Eunice Oswald, Marcus Adebiyi, Ezekiel Koenig, Rainer Oyelade, Jelili PLoS One Research Article Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experimental methods are laborious, time and resource consuming, hence computational techniques have been used to complement the experimental methods. Computational techniques such as supervised machine learning, or flux balance analysis are grossly limited due to the unavailability of required data for training the model or simulating the conditions for gene essentiality. This study developed a heuristic-enabled active machine learning method based on a light gradient boosting model to predict essential immune response and embryonic developmental genes in Drosophila melanogaster. We proposed a new sampling selection technique and introduced a heuristic function which replaces the human component in traditional active learning models. The heuristic function dynamically selects the unlabelled samples to improve the performance of the classifier in the next iteration. Testing the proposed model with four benchmark datasets, the proposed model showed superior performance when compared to traditional active learning models (random sampling and uncertainty sampling). Applying the model to identify conditionally essential genes, four novel essential immune response genes and a list of 48 novel genes that are essential in embryonic developmental condition were identified. We performed functional enrichment analysis of the predicted genes to elucidate their biological processes and the result evidence our predictions. Immune response and embryonic development related processes were significantly enriched in the essential immune response and embryonic developmental genes, respectively. Finally, we propose the predicted essential genes for future experimental studies and use of the developed tool accessible at http://heal.covenantuniversity.edu.ng for conditional essentiality predictions. Public Library of Science 2023-08-09 /pmc/articles/PMC10411809/ /pubmed/37556452 http://dx.doi.org/10.1371/journal.pone.0288023 Text en © 2023 Aromolaran et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Aromolaran, Olufemi Tony
Isewon, Itunu
Adedeji, Eunice
Oswald, Marcus
Adebiyi, Ezekiel
Koenig, Rainer
Oyelade, Jelili
Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster
title Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster
title_full Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster
title_fullStr Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster
title_full_unstemmed Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster
title_short Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster
title_sort heuristic-enabled active machine learning: a case study of predicting essential developmental stage and immune response genes in drosophila melanogaster
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411809/
https://www.ncbi.nlm.nih.gov/pubmed/37556452
http://dx.doi.org/10.1371/journal.pone.0288023
work_keys_str_mv AT aromolaranolufemitony heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster
AT isewonitunu heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster
AT adedejieunice heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster
AT oswaldmarcus heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster
AT adebiyiezekiel heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster
AT koenigrainer heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster
AT oyeladejelili heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster