Cargando…

GediNET for discovering gene associations across diseases using knowledge based machine learning approach

The most common approaches to discovering genes associated with specific diseases are based on machine learning and use a variety of feature selection techniques to identify significant genes that can serve as biomarkers for a given disease. More recently, the integration in this process of prior kn...

Descripción completa

Detalles Bibliográficos
Autores principales: Qumsiyeh, Emma, Showe, Louise, Yousef, Malik
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675776/
https://www.ncbi.nlm.nih.gov/pubmed/36402891
http://dx.doi.org/10.1038/s41598-022-24421-0
_version_ 1784833444196384768
author Qumsiyeh, Emma
Showe, Louise
Yousef, Malik
author_facet Qumsiyeh, Emma
Showe, Louise
Yousef, Malik
author_sort Qumsiyeh, Emma
collection PubMed
description The most common approaches to discovering genes associated with specific diseases are based on machine learning and use a variety of feature selection techniques to identify significant genes that can serve as biomarkers for a given disease. More recently, the integration in this process of prior knowledge-based approaches has shown significant promise in the discovery of new biomarkers with potential translational applications. In this study, we developed a novel approach, GediNET, that integrates prior biological knowledge to gene Groups that are shown to be associated with a specific disease such as a cancer. The novelty of GediNET is that it then also allows the discovery of significant associations between that specific disease and other diseases. The initial step in this process involves the identification of gene Groups. The Groups are then subjected to a Scoring component to identify the top performing classification Groups. The top-ranked gene Groups are then used to train a Machine Learning Model. The process of Grouping, Scoring and Modelling (G-S-M) is used by GediNET to identify other diseases that are similarly associated with this signature. GediNET identifies these relationships through Disease–Disease Association (DDA) based machine learning. DDA explores novel associations between diseases and identifies relationships which could be used to further improve approaches to diagnosis, prognosis, and treatment. The GediNET KNIME workflow can be downloaded from: https://github.com/malikyousef/GediNET.git or https://kni.me/w/3kH1SQV_mMUsMTS.
format Online
Article
Text
id pubmed-9675776
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96757762022-11-21 GediNET for discovering gene associations across diseases using knowledge based machine learning approach Qumsiyeh, Emma Showe, Louise Yousef, Malik Sci Rep Article The most common approaches to discovering genes associated with specific diseases are based on machine learning and use a variety of feature selection techniques to identify significant genes that can serve as biomarkers for a given disease. More recently, the integration in this process of prior knowledge-based approaches has shown significant promise in the discovery of new biomarkers with potential translational applications. In this study, we developed a novel approach, GediNET, that integrates prior biological knowledge to gene Groups that are shown to be associated with a specific disease such as a cancer. The novelty of GediNET is that it then also allows the discovery of significant associations between that specific disease and other diseases. The initial step in this process involves the identification of gene Groups. The Groups are then subjected to a Scoring component to identify the top performing classification Groups. The top-ranked gene Groups are then used to train a Machine Learning Model. The process of Grouping, Scoring and Modelling (G-S-M) is used by GediNET to identify other diseases that are similarly associated with this signature. GediNET identifies these relationships through Disease–Disease Association (DDA) based machine learning. DDA explores novel associations between diseases and identifies relationships which could be used to further improve approaches to diagnosis, prognosis, and treatment. The GediNET KNIME workflow can be downloaded from: https://github.com/malikyousef/GediNET.git or https://kni.me/w/3kH1SQV_mMUsMTS. Nature Publishing Group UK 2022-11-19 /pmc/articles/PMC9675776/ /pubmed/36402891 http://dx.doi.org/10.1038/s41598-022-24421-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Qumsiyeh, Emma
Showe, Louise
Yousef, Malik
GediNET for discovering gene associations across diseases using knowledge based machine learning approach
title GediNET for discovering gene associations across diseases using knowledge based machine learning approach
title_full GediNET for discovering gene associations across diseases using knowledge based machine learning approach
title_fullStr GediNET for discovering gene associations across diseases using knowledge based machine learning approach
title_full_unstemmed GediNET for discovering gene associations across diseases using knowledge based machine learning approach
title_short GediNET for discovering gene associations across diseases using knowledge based machine learning approach
title_sort gedinet for discovering gene associations across diseases using knowledge based machine learning approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675776/
https://www.ncbi.nlm.nih.gov/pubmed/36402891
http://dx.doi.org/10.1038/s41598-022-24421-0
work_keys_str_mv AT qumsiyehemma gedinetfordiscoveringgeneassociationsacrossdiseasesusingknowledgebasedmachinelearningapproach
AT showelouise gedinetfordiscoveringgeneassociationsacrossdiseasesusingknowledgebasedmachinelearningapproach
AT yousefmalik gedinetfordiscoveringgeneassociationsacrossdiseasesusingknowledgebasedmachinelearningapproach