Cargando…

AGRN: accurate gene regulatory network inference using ensemble machine learning methods

MOTIVATION: Biological processes are regulated by underlying genes and their interactions that form gene regulatory networks (GRNs). Dysregulation of these GRNs can cause complex diseases such as cancer, Alzheimer’s and diabetes. Hence, accurate GRN inference is critical for elucidating gene functio...

Descripción completa

Detalles Bibliográficos
Autores principales: Alawad, Duaa Mohammad, Katebi, Ataur, Kabir, Md Wasi Ul, Hoque, Md Tamjidul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082608/
https://www.ncbi.nlm.nih.gov/pubmed/37038446
http://dx.doi.org/10.1093/bioadv/vbad032
_version_ 1785021347935551488
author Alawad, Duaa Mohammad
Katebi, Ataur
Kabir, Md Wasi Ul
Hoque, Md Tamjidul
author_facet Alawad, Duaa Mohammad
Katebi, Ataur
Kabir, Md Wasi Ul
Hoque, Md Tamjidul
author_sort Alawad, Duaa Mohammad
collection PubMed
description MOTIVATION: Biological processes are regulated by underlying genes and their interactions that form gene regulatory networks (GRNs). Dysregulation of these GRNs can cause complex diseases such as cancer, Alzheimer’s and diabetes. Hence, accurate GRN inference is critical for elucidating gene function, allowing for the faster identification and prioritization of candidate genes for functional investigation. Several statistical and machine learning-based methods have been developed to infer GRNs based on biological and synthetic datasets. Here, we developed a method named AGRN that infers GRNs by employing an ensemble of machine learning algorithms. RESULTS: From the idea that a single method may not perform well on all datasets, we calculate the gene importance scores using three machine learning methods—random forest, extra tree and support vector regressors. We calculate the importance scores from Shapley Additive Explanations, a recently published method to explain machine learning models. We have found that the importance scores from Shapley values perform better than the traditional importance scoring methods based on almost all the benchmark datasets. We have analyzed the performance of AGRN using the datasets from the DREAM4 and DREAM5 challenges for GRN inference. The proposed method, AGRN—an ensemble machine learning method with Shapley values, outperforms the existing methods both in the DREAM4 and DREAM5 datasets. With improved accuracy, we believe that AGRN inferred GRNs would enhance our mechanistic understanding of biological processes in health and disease. AVAILABILITYAND IMPLEMENTATION: https://github.com/DuaaAlawad/AGRN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-10082608
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100826082023-04-09 AGRN: accurate gene regulatory network inference using ensemble machine learning methods Alawad, Duaa Mohammad Katebi, Ataur Kabir, Md Wasi Ul Hoque, Md Tamjidul Bioinform Adv Original Paper MOTIVATION: Biological processes are regulated by underlying genes and their interactions that form gene regulatory networks (GRNs). Dysregulation of these GRNs can cause complex diseases such as cancer, Alzheimer’s and diabetes. Hence, accurate GRN inference is critical for elucidating gene function, allowing for the faster identification and prioritization of candidate genes for functional investigation. Several statistical and machine learning-based methods have been developed to infer GRNs based on biological and synthetic datasets. Here, we developed a method named AGRN that infers GRNs by employing an ensemble of machine learning algorithms. RESULTS: From the idea that a single method may not perform well on all datasets, we calculate the gene importance scores using three machine learning methods—random forest, extra tree and support vector regressors. We calculate the importance scores from Shapley Additive Explanations, a recently published method to explain machine learning models. We have found that the importance scores from Shapley values perform better than the traditional importance scoring methods based on almost all the benchmark datasets. We have analyzed the performance of AGRN using the datasets from the DREAM4 and DREAM5 challenges for GRN inference. The proposed method, AGRN—an ensemble machine learning method with Shapley values, outperforms the existing methods both in the DREAM4 and DREAM5 datasets. With improved accuracy, we believe that AGRN inferred GRNs would enhance our mechanistic understanding of biological processes in health and disease. AVAILABILITYAND IMPLEMENTATION: https://github.com/DuaaAlawad/AGRN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-04-05 /pmc/articles/PMC10082608/ /pubmed/37038446 http://dx.doi.org/10.1093/bioadv/vbad032 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Alawad, Duaa Mohammad
Katebi, Ataur
Kabir, Md Wasi Ul
Hoque, Md Tamjidul
AGRN: accurate gene regulatory network inference using ensemble machine learning methods
title AGRN: accurate gene regulatory network inference using ensemble machine learning methods
title_full AGRN: accurate gene regulatory network inference using ensemble machine learning methods
title_fullStr AGRN: accurate gene regulatory network inference using ensemble machine learning methods
title_full_unstemmed AGRN: accurate gene regulatory network inference using ensemble machine learning methods
title_short AGRN: accurate gene regulatory network inference using ensemble machine learning methods
title_sort agrn: accurate gene regulatory network inference using ensemble machine learning methods
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082608/
https://www.ncbi.nlm.nih.gov/pubmed/37038446
http://dx.doi.org/10.1093/bioadv/vbad032
work_keys_str_mv AT alawadduaamohammad agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods
AT katebiataur agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods
AT kabirmdwasiul agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods
AT hoquemdtamjidul agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods