Cargando…

A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis

Amyotrophic lateral sclerosis is a neurodegenerative disease of the upper and lower motor neurons resulting in death from neuromuscular respiratory failure, typically within two to five years of first symptoms. Several rare disruptive gene variants have been associated with ALS and are responsible f...

Descripción completa

Detalles Bibliográficos
Autores principales: Bean, Daniel M., Al-Chalabi, Ammar, Dobson, Richard J. B., Iacoangeli, Alfredo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349022/
https://www.ncbi.nlm.nih.gov/pubmed/32575372
http://dx.doi.org/10.3390/genes11060668
_version_ 1783556967787134976
author Bean, Daniel M.
Al-Chalabi, Ammar
Dobson, Richard J. B.
Iacoangeli, Alfredo
author_facet Bean, Daniel M.
Al-Chalabi, Ammar
Dobson, Richard J. B.
Iacoangeli, Alfredo
author_sort Bean, Daniel M.
collection PubMed
description Amyotrophic lateral sclerosis is a neurodegenerative disease of the upper and lower motor neurons resulting in death from neuromuscular respiratory failure, typically within two to five years of first symptoms. Several rare disruptive gene variants have been associated with ALS and are responsible for about 15% of all cases. Although our knowledge of the genetic landscape of this disease is improving, it remains limited. Machine learning models trained on the available protein–protein interaction and phenotype-genotype association data can use our current knowledge of the disease genetics for the prediction of novel candidate genes. Here, we describe a knowledge-based machine learning method for this purpose. We trained our model on protein–protein interaction data from IntAct, gene function annotation from Gene Ontology, and known disease-gene associations from DisGeNet. Using several sets of known ALS genes from public databases and a manual review as input, we generated a list of new candidate genes for each input set. We investigated the relevance of the predicted genes in ALS by using the available summary statistics from the largest ALS genome-wide association study and by performing functional and phenotype enrichment analysis. The predicted sets were enriched for genes associated with other neurodegenerative diseases known to overlap with ALS genetically and phenotypically, as well as for biological processes associated with the disease. Moreover, using ALS genes from ClinVar and our manual review as input, the predicted sets were enriched for ALS-associated genes (ClinVar p = 0.038 and manual review p = 0.060) when used for gene prioritisation in a genome-wide association study.
format Online
Article
Text
id pubmed-7349022
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73490222020-07-22 A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis Bean, Daniel M. Al-Chalabi, Ammar Dobson, Richard J. B. Iacoangeli, Alfredo Genes (Basel) Article Amyotrophic lateral sclerosis is a neurodegenerative disease of the upper and lower motor neurons resulting in death from neuromuscular respiratory failure, typically within two to five years of first symptoms. Several rare disruptive gene variants have been associated with ALS and are responsible for about 15% of all cases. Although our knowledge of the genetic landscape of this disease is improving, it remains limited. Machine learning models trained on the available protein–protein interaction and phenotype-genotype association data can use our current knowledge of the disease genetics for the prediction of novel candidate genes. Here, we describe a knowledge-based machine learning method for this purpose. We trained our model on protein–protein interaction data from IntAct, gene function annotation from Gene Ontology, and known disease-gene associations from DisGeNet. Using several sets of known ALS genes from public databases and a manual review as input, we generated a list of new candidate genes for each input set. We investigated the relevance of the predicted genes in ALS by using the available summary statistics from the largest ALS genome-wide association study and by performing functional and phenotype enrichment analysis. The predicted sets were enriched for genes associated with other neurodegenerative diseases known to overlap with ALS genetically and phenotypically, as well as for biological processes associated with the disease. Moreover, using ALS genes from ClinVar and our manual review as input, the predicted sets were enriched for ALS-associated genes (ClinVar p = 0.038 and manual review p = 0.060) when used for gene prioritisation in a genome-wide association study. MDPI 2020-06-19 /pmc/articles/PMC7349022/ /pubmed/32575372 http://dx.doi.org/10.3390/genes11060668 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bean, Daniel M.
Al-Chalabi, Ammar
Dobson, Richard J. B.
Iacoangeli, Alfredo
A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis
title A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis
title_full A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis
title_fullStr A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis
title_full_unstemmed A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis
title_short A Knowledge-Based Machine Learning Approach to Gene Prioritisation in Amyotrophic Lateral Sclerosis
title_sort knowledge-based machine learning approach to gene prioritisation in amyotrophic lateral sclerosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349022/
https://www.ncbi.nlm.nih.gov/pubmed/32575372
http://dx.doi.org/10.3390/genes11060668
work_keys_str_mv AT beandanielm aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis
AT alchalabiammar aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis
AT dobsonrichardjb aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis
AT iacoangelialfredo aknowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis
AT beandanielm knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis
AT alchalabiammar knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis
AT dobsonrichardjb knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis
AT iacoangelialfredo knowledgebasedmachinelearningapproachtogeneprioritisationinamyotrophiclateralsclerosis