Cargando…
Using deep learning to associate human genes with age-related diseases
MOTIVATION: One way to identify genes possibly associated with ageing is to build a classification model (from the machine learning field) capable of classifying genes as associated with multiple age-related diseases. To build this model, we use a pre-compiled list of human genes associated with age...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7141856/ https://www.ncbi.nlm.nih.gov/pubmed/31845988 http://dx.doi.org/10.1093/bioinformatics/btz887 |
_version_ | 1783519271936065536 |
---|---|
author | Fabris, Fabio Palmer, Daniel Salama, Khalid M de Magalhães, João Pedro Freitas, Alex A |
author_facet | Fabris, Fabio Palmer, Daniel Salama, Khalid M de Magalhães, João Pedro Freitas, Alex A |
author_sort | Fabris, Fabio |
collection | PubMed |
description | MOTIVATION: One way to identify genes possibly associated with ageing is to build a classification model (from the machine learning field) capable of classifying genes as associated with multiple age-related diseases. To build this model, we use a pre-compiled list of human genes associated with age-related diseases and apply a novel Deep Neural Network (DNN) method to find associations between gene descriptors (e.g. Gene Ontology terms, protein–protein interaction data and biological pathway information) and age-related diseases. RESULTS: The novelty of our new DNN method is its modular architecture, which has the capability of combining several sources of biological data to predict which ageing-related diseases a gene is associated with (if any). Our DNN method achieves better predictive performance than standard DNN approaches, a Gradient Boosted Tree classifier (a strong baseline method) and a Logistic Regression classifier. Given the DNN model produced by our method, we use two approaches to identify human genes that are not known to be associated with age-related diseases according to our dataset. First, we investigate genes that are close to other disease-associated genes in a complex multi-dimensional feature space learned by the DNN algorithm. Second, using the class label probabilities output by our DNN approach, we identify genes with a high probability of being associated with age-related diseases according to the model. We provide evidence of these putative associations retrieved from the DNN model with literature support. AVAILABILITY AND IMPLEMENTATION: The source code and datasets can be found at: https://github.com/fabiofabris/Bioinfo2019. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7141856 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-71418562020-04-13 Using deep learning to associate human genes with age-related diseases Fabris, Fabio Palmer, Daniel Salama, Khalid M de Magalhães, João Pedro Freitas, Alex A Bioinformatics Original Papers MOTIVATION: One way to identify genes possibly associated with ageing is to build a classification model (from the machine learning field) capable of classifying genes as associated with multiple age-related diseases. To build this model, we use a pre-compiled list of human genes associated with age-related diseases and apply a novel Deep Neural Network (DNN) method to find associations between gene descriptors (e.g. Gene Ontology terms, protein–protein interaction data and biological pathway information) and age-related diseases. RESULTS: The novelty of our new DNN method is its modular architecture, which has the capability of combining several sources of biological data to predict which ageing-related diseases a gene is associated with (if any). Our DNN method achieves better predictive performance than standard DNN approaches, a Gradient Boosted Tree classifier (a strong baseline method) and a Logistic Regression classifier. Given the DNN model produced by our method, we use two approaches to identify human genes that are not known to be associated with age-related diseases according to our dataset. First, we investigate genes that are close to other disease-associated genes in a complex multi-dimensional feature space learned by the DNN algorithm. Second, using the class label probabilities output by our DNN approach, we identify genes with a high probability of being associated with age-related diseases according to the model. We provide evidence of these putative associations retrieved from the DNN model with literature support. AVAILABILITY AND IMPLEMENTATION: The source code and datasets can be found at: https://github.com/fabiofabris/Bioinfo2019. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-04-01 2019-12-17 /pmc/articles/PMC7141856/ /pubmed/31845988 http://dx.doi.org/10.1093/bioinformatics/btz887 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Fabris, Fabio Palmer, Daniel Salama, Khalid M de Magalhães, João Pedro Freitas, Alex A Using deep learning to associate human genes with age-related diseases |
title | Using deep learning to associate human genes with age-related diseases |
title_full | Using deep learning to associate human genes with age-related diseases |
title_fullStr | Using deep learning to associate human genes with age-related diseases |
title_full_unstemmed | Using deep learning to associate human genes with age-related diseases |
title_short | Using deep learning to associate human genes with age-related diseases |
title_sort | using deep learning to associate human genes with age-related diseases |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7141856/ https://www.ncbi.nlm.nih.gov/pubmed/31845988 http://dx.doi.org/10.1093/bioinformatics/btz887 |
work_keys_str_mv | AT fabrisfabio usingdeeplearningtoassociatehumangeneswithagerelateddiseases AT palmerdaniel usingdeeplearningtoassociatehumangeneswithagerelateddiseases AT salamakhalidm usingdeeplearningtoassociatehumangeneswithagerelateddiseases AT demagalhaesjoaopedro usingdeeplearningtoassociatehumangeneswithagerelateddiseases AT freitasalexa usingdeeplearningtoassociatehumangeneswithagerelateddiseases |