Cargando…

Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk

(1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene(n), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering hi...

Descripción completa

Detalles Bibliográficos
Autores principales: Briggs, Farren B. S., Sept, Corriene
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967327/
https://www.ncbi.nlm.nih.gov/pubmed/33802599
http://dx.doi.org/10.3390/ijerph18052518
_version_ 1783665850061946880
author Briggs, Farren B. S.
Sept, Corriene
author_facet Briggs, Farren B. S.
Sept, Corriene
author_sort Briggs, Farren B. S.
collection PubMed
description (1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene(n), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering higher order genetic relationships, but their use in MS have been limited. (2) Methods: Association rule mining (ARM), a combinatorial rule-based machine learning algorithm, was applied to genetic data for non-Latinx MS cases (n = 207) and controls (n = 179). The objective was to identify patterns (rules) amongst the known MS risk variants, including HLA-DRB1*15:01 presence, HLA-A*02:01 absence, and 194 of the 200 common autosomal variants. Probabilistic measures (confidence and support) were used to mine rules. (3) Results: 114 rules met minimum requirements of 80% confidence and 5% support. The top ranking rule by confidence consisted of HLA-DRB1*15:01, SLC30A7-rs56678847 and AC093277.1-rs6880809; carriers of these variants had a significantly greater risk for MS (odds ratio = 20.2, 95% CI: 8.5, 37.5; p = 4 × 10(−9)). Several variants were shared across rules, the most common was INTS8-rs78727559, which was in 32.5% of rules. (4) Conclusions: In summary, we demonstrate evidence that specific combinations of MS risk variants disproportionately confer elevated risk by applying a robust analytical framework to a modestly sized study population.
format Online
Article
Text
id pubmed-7967327
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79673272021-03-18 Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk Briggs, Farren B. S. Sept, Corriene Int J Environ Res Public Health Article (1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene(n), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering higher order genetic relationships, but their use in MS have been limited. (2) Methods: Association rule mining (ARM), a combinatorial rule-based machine learning algorithm, was applied to genetic data for non-Latinx MS cases (n = 207) and controls (n = 179). The objective was to identify patterns (rules) amongst the known MS risk variants, including HLA-DRB1*15:01 presence, HLA-A*02:01 absence, and 194 of the 200 common autosomal variants. Probabilistic measures (confidence and support) were used to mine rules. (3) Results: 114 rules met minimum requirements of 80% confidence and 5% support. The top ranking rule by confidence consisted of HLA-DRB1*15:01, SLC30A7-rs56678847 and AC093277.1-rs6880809; carriers of these variants had a significantly greater risk for MS (odds ratio = 20.2, 95% CI: 8.5, 37.5; p = 4 × 10(−9)). Several variants were shared across rules, the most common was INTS8-rs78727559, which was in 32.5% of rules. (4) Conclusions: In summary, we demonstrate evidence that specific combinations of MS risk variants disproportionately confer elevated risk by applying a robust analytical framework to a modestly sized study population. MDPI 2021-03-03 /pmc/articles/PMC7967327/ /pubmed/33802599 http://dx.doi.org/10.3390/ijerph18052518 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Briggs, Farren B. S.
Sept, Corriene
Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk
title Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk
title_full Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk
title_fullStr Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk
title_full_unstemmed Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk
title_short Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk
title_sort mining complex genetic patterns conferring multiple sclerosis risk
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967327/
https://www.ncbi.nlm.nih.gov/pubmed/33802599
http://dx.doi.org/10.3390/ijerph18052518
work_keys_str_mv AT briggsfarrenbs miningcomplexgeneticpatternsconferringmultiplesclerosisrisk
AT septcorriene miningcomplexgeneticpatternsconferringmultiplesclerosisrisk