Cargando…
Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster
Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experime...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411809/ https://www.ncbi.nlm.nih.gov/pubmed/37556452 http://dx.doi.org/10.1371/journal.pone.0288023 |
_version_ | 1785086752579387392 |
---|---|
author | Aromolaran, Olufemi Tony Isewon, Itunu Adedeji, Eunice Oswald, Marcus Adebiyi, Ezekiel Koenig, Rainer Oyelade, Jelili |
author_facet | Aromolaran, Olufemi Tony Isewon, Itunu Adedeji, Eunice Oswald, Marcus Adebiyi, Ezekiel Koenig, Rainer Oyelade, Jelili |
author_sort | Aromolaran, Olufemi Tony |
collection | PubMed |
description | Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experimental methods are laborious, time and resource consuming, hence computational techniques have been used to complement the experimental methods. Computational techniques such as supervised machine learning, or flux balance analysis are grossly limited due to the unavailability of required data for training the model or simulating the conditions for gene essentiality. This study developed a heuristic-enabled active machine learning method based on a light gradient boosting model to predict essential immune response and embryonic developmental genes in Drosophila melanogaster. We proposed a new sampling selection technique and introduced a heuristic function which replaces the human component in traditional active learning models. The heuristic function dynamically selects the unlabelled samples to improve the performance of the classifier in the next iteration. Testing the proposed model with four benchmark datasets, the proposed model showed superior performance when compared to traditional active learning models (random sampling and uncertainty sampling). Applying the model to identify conditionally essential genes, four novel essential immune response genes and a list of 48 novel genes that are essential in embryonic developmental condition were identified. We performed functional enrichment analysis of the predicted genes to elucidate their biological processes and the result evidence our predictions. Immune response and embryonic development related processes were significantly enriched in the essential immune response and embryonic developmental genes, respectively. Finally, we propose the predicted essential genes for future experimental studies and use of the developed tool accessible at http://heal.covenantuniversity.edu.ng for conditional essentiality predictions. |
format | Online Article Text |
id | pubmed-10411809 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-104118092023-08-10 Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster Aromolaran, Olufemi Tony Isewon, Itunu Adedeji, Eunice Oswald, Marcus Adebiyi, Ezekiel Koenig, Rainer Oyelade, Jelili PLoS One Research Article Computational prediction of absolute essential genes using machine learning has gained wide attention in recent years. However, essential genes are mostly conditional and not absolute. Experimental techniques provide a reliable approach of identifying conditionally essential genes; however, experimental methods are laborious, time and resource consuming, hence computational techniques have been used to complement the experimental methods. Computational techniques such as supervised machine learning, or flux balance analysis are grossly limited due to the unavailability of required data for training the model or simulating the conditions for gene essentiality. This study developed a heuristic-enabled active machine learning method based on a light gradient boosting model to predict essential immune response and embryonic developmental genes in Drosophila melanogaster. We proposed a new sampling selection technique and introduced a heuristic function which replaces the human component in traditional active learning models. The heuristic function dynamically selects the unlabelled samples to improve the performance of the classifier in the next iteration. Testing the proposed model with four benchmark datasets, the proposed model showed superior performance when compared to traditional active learning models (random sampling and uncertainty sampling). Applying the model to identify conditionally essential genes, four novel essential immune response genes and a list of 48 novel genes that are essential in embryonic developmental condition were identified. We performed functional enrichment analysis of the predicted genes to elucidate their biological processes and the result evidence our predictions. Immune response and embryonic development related processes were significantly enriched in the essential immune response and embryonic developmental genes, respectively. Finally, we propose the predicted essential genes for future experimental studies and use of the developed tool accessible at http://heal.covenantuniversity.edu.ng for conditional essentiality predictions. Public Library of Science 2023-08-09 /pmc/articles/PMC10411809/ /pubmed/37556452 http://dx.doi.org/10.1371/journal.pone.0288023 Text en © 2023 Aromolaran et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Aromolaran, Olufemi Tony Isewon, Itunu Adedeji, Eunice Oswald, Marcus Adebiyi, Ezekiel Koenig, Rainer Oyelade, Jelili Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster |
title | Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster |
title_full | Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster |
title_fullStr | Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster |
title_full_unstemmed | Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster |
title_short | Heuristic-enabled active machine learning: A case study of predicting essential developmental stage and immune response genes in Drosophila melanogaster |
title_sort | heuristic-enabled active machine learning: a case study of predicting essential developmental stage and immune response genes in drosophila melanogaster |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10411809/ https://www.ncbi.nlm.nih.gov/pubmed/37556452 http://dx.doi.org/10.1371/journal.pone.0288023 |
work_keys_str_mv | AT aromolaranolufemitony heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster AT isewonitunu heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster AT adedejieunice heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster AT oswaldmarcus heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster AT adebiyiezekiel heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster AT koenigrainer heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster AT oyeladejelili heuristicenabledactivemachinelearningacasestudyofpredictingessentialdevelopmentalstageandimmuneresponsegenesindrosophilamelanogaster |