Cargando…
A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis
Amyotrophic lateral sclerosis is a neurodegenerative disease of the upper and lower motor neurons resulting in death from neuromuscular respiratory failure, typically within two to five years of first symptoms. Several rare disruptive gene variants have been associated with ALS and are responsible f...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349022/ https://www.ncbi.nlm.nih.gov/pubmed/32575372 http://dx.doi.org/10.3390/genes11060668 |
_version_ | 1783556967787134976 |
---|---|
author | Bean, Daniel M. Al-Chalabi, Ammar Dobson, Richard J. B. Iacoangeli, Alfredo |
author_facet | Bean, Daniel M. Al-Chalabi, Ammar Dobson, Richard J. B. Iacoangeli, Alfredo |
author_sort | Bean, Daniel M. |
collection | PubMed |
description | Amyotrophic lateral sclerosis is a neurodegenerative disease of the upper and lower motor neurons resulting in death from neuromuscular respiratory failure, typically within two to five years of first symptoms. Several rare disruptive gene variants have been associated with ALS and are responsible for about 15% of all cases. Although our knowledge of the genetic landscape of this disease is improving, it remains limited. Machine learning models trained on the available protein–protein interaction and phenotype-genotype association data can use our current knowledge of the disease genetics for the prediction of novel candidate genes. Here, we describe a knowledge-based machine learning method for this purpose. We trained our model on protein–protein interaction data from IntAct, gene function annotation from Gene Ontology, and known disease-gene associations from DisGeNet. Using several sets of known ALS genes from public databases and a manual review as input, we generated a list of new candidate genes for each input set. We investigated the relevance of the predicted genes in ALS by using the available summary statistics from the largest ALS genome-wide association study and by performing functional and phenotype enrichment analysis. The predicted sets were enriched for genes associated with other neurodegenerative diseases known to overlap with ALS genetically and phenotypically, as well as for biological processes associated with the disease. Moreover, using ALS genes from ClinVar and our manual review as input, the predicted sets were enriched for ALS-associated genes (ClinVar p = 0.038 and manual review p = 0.060) when used for gene prioritisation in a genome-wide association study. |
format | Online Article Text |
id | pubmed-7349022 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-73490222020-07-22 A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis Bean, Daniel M. Al-Chalabi, Ammar Dobson, Richard J. B. Iacoangeli, Alfredo Genes (Basel) Article Amyotrophic lateral sclerosis is a neurodegenerative disease of the upper and lower motor neurons resulting in death from neuromuscular respiratory failure, typically within two to five years of first symptoms. Several rare disruptive gene variants have been associated with ALS and are responsible for about 15% of all cases. Although our knowledge of the genetic landscape of this disease is improving, it remains limited. Machine learning models trained on the available protein–protein interaction and phenotype-genotype association data can use our current knowledge of the disease genetics for the prediction of novel candidate genes. Here, we describe a knowledge-based machine learning method for this purpose. We trained our model on protein–protein interaction data from IntAct, gene function annotation from Gene Ontology, and known disease-gene associations from DisGeNet. Using several sets of known ALS genes from public databases and a manual review as input, we generated a list of new candidate genes for each input set. We investigated the relevance of the predicted genes in ALS by using the available summary statistics from the largest ALS genome-wide association study and by performing functional and phenotype enrichment analysis. The predicted sets were enriched for genes associated with other neurodegenerative diseases known to overlap with ALS genetically and phenotypically, as well as for biological processes associated with the disease. Moreover, using ALS genes from ClinVar and our manual review as input, the predicted sets were enriched for ALS-associated genes (ClinVar p = 0.038 and manual review p = 0.060) when used for gene prioritisation in a genome-wide association study. MDPI 2020-06-19 /pmc/articles/PMC7349022/ /pubmed/32575372 http://dx.doi.org/10.3390/genes11060668 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Bean, Daniel M. Al-Chalabi, Ammar Dobson, Richard J. B. Iacoangeli, Alfredo A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis |
title | A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis |
title_full | A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis |
title_fullStr | A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis |
title_full_unstemmed | A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis |
title_short | A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis |
title_sort | knowledge-based machine learning approach to gene prioritisation in amyotrophic lateral sclerosis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349022/ https://www.ncbi.nlm.nih.gov/pubmed/32575372 http://dx.doi.org/10.3390/genes11060668 |
work_keys_str_mv | AT beandanielm aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis AT alchalabiammar aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis AT dobsonrichardjb aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis AT iacoangelialfredo aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis AT beandanielm knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis AT alchalabiammar knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis AT dobsonrichardjb knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis AT iacoangelialfredo knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis |