Cargando…
AGRN: accurate gene regulatory network inference using ensemble machine learning methods
MOTIVATION: Biological processes are regulated by underlying genes and their interactions that form gene regulatory networks (GRNs). Dysregulation of these GRNs can cause complex diseases such as cancer, Alzheimer’s and diabetes. Hence, accurate GRN inference is critical for elucidating gene functio...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082608/ https://www.ncbi.nlm.nih.gov/pubmed/37038446 http://dx.doi.org/10.1093/bioadv/vbad032 |
_version_ | 1785021347935551488 |
---|---|
author | Alawad, Duaa Mohammad Katebi, Ataur Kabir, Md Wasi Ul Hoque, Md Tamjidul |
author_facet | Alawad, Duaa Mohammad Katebi, Ataur Kabir, Md Wasi Ul Hoque, Md Tamjidul |
author_sort | Alawad, Duaa Mohammad |
collection | PubMed |
description | MOTIVATION: Biological processes are regulated by underlying genes and their interactions that form gene regulatory networks (GRNs). Dysregulation of these GRNs can cause complex diseases such as cancer, Alzheimer’s and diabetes. Hence, accurate GRN inference is critical for elucidating gene function, allowing for the faster identification and prioritization of candidate genes for functional investigation. Several statistical and machine learning-based methods have been developed to infer GRNs based on biological and synthetic datasets. Here, we developed a method named AGRN that infers GRNs by employing an ensemble of machine learning algorithms. RESULTS: From the idea that a single method may not perform well on all datasets, we calculate the gene importance scores using three machine learning methods—random forest, extra tree and support vector regressors. We calculate the importance scores from Shapley Additive Explanations, a recently published method to explain machine learning models. We have found that the importance scores from Shapley values perform better than the traditional importance scoring methods based on almost all the benchmark datasets. We have analyzed the performance of AGRN using the datasets from the DREAM4 and DREAM5 challenges for GRN inference. The proposed method, AGRN—an ensemble machine learning method with Shapley values, outperforms the existing methods both in the DREAM4 and DREAM5 datasets. With improved accuracy, we believe that AGRN inferred GRNs would enhance our mechanistic understanding of biological processes in health and disease. AVAILABILITYAND IMPLEMENTATION: https://github.com/DuaaAlawad/AGRN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-10082608 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-100826082023-04-09 AGRN: accurate gene regulatory network inference using ensemble machine learning methods Alawad, Duaa Mohammad Katebi, Ataur Kabir, Md Wasi Ul Hoque, Md Tamjidul Bioinform Adv Original Paper MOTIVATION: Biological processes are regulated by underlying genes and their interactions that form gene regulatory networks (GRNs). Dysregulation of these GRNs can cause complex diseases such as cancer, Alzheimer’s and diabetes. Hence, accurate GRN inference is critical for elucidating gene function, allowing for the faster identification and prioritization of candidate genes for functional investigation. Several statistical and machine learning-based methods have been developed to infer GRNs based on biological and synthetic datasets. Here, we developed a method named AGRN that infers GRNs by employing an ensemble of machine learning algorithms. RESULTS: From the idea that a single method may not perform well on all datasets, we calculate the gene importance scores using three machine learning methods—random forest, extra tree and support vector regressors. We calculate the importance scores from Shapley Additive Explanations, a recently published method to explain machine learning models. We have found that the importance scores from Shapley values perform better than the traditional importance scoring methods based on almost all the benchmark datasets. We have analyzed the performance of AGRN using the datasets from the DREAM4 and DREAM5 challenges for GRN inference. The proposed method, AGRN—an ensemble machine learning method with Shapley values, outperforms the existing methods both in the DREAM4 and DREAM5 datasets. With improved accuracy, we believe that AGRN inferred GRNs would enhance our mechanistic understanding of biological processes in health and disease. AVAILABILITYAND IMPLEMENTATION: https://github.com/DuaaAlawad/AGRN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-04-05 /pmc/articles/PMC10082608/ /pubmed/37038446 http://dx.doi.org/10.1093/bioadv/vbad032 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Alawad, Duaa Mohammad Katebi, Ataur Kabir, Md Wasi Ul Hoque, Md Tamjidul AGRN: accurate gene regulatory network inference using ensemble machine learning methods |
title | AGRN: accurate gene regulatory network inference using ensemble machine learning methods |
title_full | AGRN: accurate gene regulatory network inference using ensemble machine learning methods |
title_fullStr | AGRN: accurate gene regulatory network inference using ensemble machine learning methods |
title_full_unstemmed | AGRN: accurate gene regulatory network inference using ensemble machine learning methods |
title_short | AGRN: accurate gene regulatory network inference using ensemble machine learning methods |
title_sort | agrn: accurate gene regulatory network inference using ensemble machine learning methods |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082608/ https://www.ncbi.nlm.nih.gov/pubmed/37038446 http://dx.doi.org/10.1093/bioadv/vbad032 |
work_keys_str_mv | AT alawadduaamohammad agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods AT katebiataur agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods AT kabirmdwasiul agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods AT hoquemdtamjidul agrnaccurategeneregulatorynetworkinferenceusingensemblemachinelearningmethods |