Cargando…
GediNET for discovering gene associations across diseases using knowledge based machine learning approach
The most common approaches to discovering genes associated with specific diseases are based on machine learning and use a variety of feature selection techniques to identify significant genes that can serve as biomarkers for a given disease. More recently, the integration in this process of prior kn...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675776/ https://www.ncbi.nlm.nih.gov/pubmed/36402891 http://dx.doi.org/10.1038/s41598-022-24421-0 |
_version_ | 1784833444196384768 |
---|---|
author | Qumsiyeh, Emma Showe, Louise Yousef, Malik |
author_facet | Qumsiyeh, Emma Showe, Louise Yousef, Malik |
author_sort | Qumsiyeh, Emma |
collection | PubMed |
description | The most common approaches to discovering genes associated with specific diseases are based on machine learning and use a variety of feature selection techniques to identify significant genes that can serve as biomarkers for a given disease. More recently, the integration in this process of prior knowledge-based approaches has shown significant promise in the discovery of new biomarkers with potential translational applications. In this study, we developed a novel approach, GediNET, that integrates prior biological knowledge to gene Groups that are shown to be associated with a specific disease such as a cancer. The novelty of GediNET is that it then also allows the discovery of significant associations between that specific disease and other diseases. The initial step in this process involves the identification of gene Groups. The Groups are then subjected to a Scoring component to identify the top performing classification Groups. The top-ranked gene Groups are then used to train a Machine Learning Model. The process of Grouping, Scoring and Modelling (G-S-M) is used by GediNET to identify other diseases that are similarly associated with this signature. GediNET identifies these relationships through Disease–Disease Association (DDA) based machine learning. DDA explores novel associations between diseases and identifies relationships which could be used to further improve approaches to diagnosis, prognosis, and treatment. The GediNET KNIME workflow can be downloaded from: https://github.com/malikyousef/GediNET.git or https://kni.me/w/3kH1SQV_mMUsMTS. |
format | Online Article Text |
id | pubmed-9675776 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96757762022-11-21 GediNET for discovering gene associations across diseases using knowledge based machine learning approach Qumsiyeh, Emma Showe, Louise Yousef, Malik Sci Rep Article The most common approaches to discovering genes associated with specific diseases are based on machine learning and use a variety of feature selection techniques to identify significant genes that can serve as biomarkers for a given disease. More recently, the integration in this process of prior knowledge-based approaches has shown significant promise in the discovery of new biomarkers with potential translational applications. In this study, we developed a novel approach, GediNET, that integrates prior biological knowledge to gene Groups that are shown to be associated with a specific disease such as a cancer. The novelty of GediNET is that it then also allows the discovery of significant associations between that specific disease and other diseases. The initial step in this process involves the identification of gene Groups. The Groups are then subjected to a Scoring component to identify the top performing classification Groups. The top-ranked gene Groups are then used to train a Machine Learning Model. The process of Grouping, Scoring and Modelling (G-S-M) is used by GediNET to identify other diseases that are similarly associated with this signature. GediNET identifies these relationships through Disease–Disease Association (DDA) based machine learning. DDA explores novel associations between diseases and identifies relationships which could be used to further improve approaches to diagnosis, prognosis, and treatment. The GediNET KNIME workflow can be downloaded from: https://github.com/malikyousef/GediNET.git or https://kni.me/w/3kH1SQV_mMUsMTS. Nature Publishing Group UK 2022-11-19 /pmc/articles/PMC9675776/ /pubmed/36402891 http://dx.doi.org/10.1038/s41598-022-24421-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Qumsiyeh, Emma Showe, Louise Yousef, Malik GediNET for discovering gene associations across diseases using knowledge based machine learning approach |
title | GediNET for discovering gene associations across diseases using knowledge based machine learning approach |
title_full | GediNET for discovering gene associations across diseases using knowledge based machine learning approach |
title_fullStr | GediNET for discovering gene associations across diseases using knowledge based machine learning approach |
title_full_unstemmed | GediNET for discovering gene associations across diseases using knowledge based machine learning approach |
title_short | GediNET for discovering gene associations across diseases using knowledge based machine learning approach |
title_sort | gedinet for discovering gene associations across diseases using knowledge based machine learning approach |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675776/ https://www.ncbi.nlm.nih.gov/pubmed/36402891 http://dx.doi.org/10.1038/s41598-022-24421-0 |
work_keys_str_mv | AT qumsiyehemma gedinetfordiscoveringgeneassociationsacrossdiseasesusingknowledgebasedmachinelearningapproach AT showelouise gedinetfordiscoveringgeneassociationsacrossdiseasesusingknowledgebasedmachinelearningapproach AT yousefmalik gedinetfordiscoveringgeneassociationsacrossdiseasesusingknowledgebasedmachinelearningapproach |