Cargando…
Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk
(1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene(n), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering hi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967327/ https://www.ncbi.nlm.nih.gov/pubmed/33802599 http://dx.doi.org/10.3390/ijerph18052518 |
_version_ | 1783665850061946880 |
---|---|
author | Briggs, Farren B. S. Sept, Corriene |
author_facet | Briggs, Farren B. S. Sept, Corriene |
author_sort | Briggs, Farren B. S. |
collection | PubMed |
description | (1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene(n), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering higher order genetic relationships, but their use in MS have been limited. (2) Methods: Association rule mining (ARM), a combinatorial rule-based machine learning algorithm, was applied to genetic data for non-Latinx MS cases (n = 207) and controls (n = 179). The objective was to identify patterns (rules) amongst the known MS risk variants, including HLA-DRB1*15:01 presence, HLA-A*02:01 absence, and 194 of the 200 common autosomal variants. Probabilistic measures (confidence and support) were used to mine rules. (3) Results: 114 rules met minimum requirements of 80% confidence and 5% support. The top ranking rule by confidence consisted of HLA-DRB1*15:01, SLC30A7-rs56678847 and AC093277.1-rs6880809; carriers of these variants had a significantly greater risk for MS (odds ratio = 20.2, 95% CI: 8.5, 37.5; p = 4 × 10(−9)). Several variants were shared across rules, the most common was INTS8-rs78727559, which was in 32.5% of rules. (4) Conclusions: In summary, we demonstrate evidence that specific combinations of MS risk variants disproportionately confer elevated risk by applying a robust analytical framework to a modestly sized study population. |
format | Online Article Text |
id | pubmed-7967327 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-79673272021-03-18 Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk Briggs, Farren B. S. Sept, Corriene Int J Environ Res Public Health Article (1) Background: Complex genetic relationships, including gene-gene (G × G; epistasis), gene(n), and gene-environment (G × E) interactions, explain a substantial portion of the heritability in multiple sclerosis (MS). Machine learning and data mining methods are promising approaches for uncovering higher order genetic relationships, but their use in MS have been limited. (2) Methods: Association rule mining (ARM), a combinatorial rule-based machine learning algorithm, was applied to genetic data for non-Latinx MS cases (n = 207) and controls (n = 179). The objective was to identify patterns (rules) amongst the known MS risk variants, including HLA-DRB1*15:01 presence, HLA-A*02:01 absence, and 194 of the 200 common autosomal variants. Probabilistic measures (confidence and support) were used to mine rules. (3) Results: 114 rules met minimum requirements of 80% confidence and 5% support. The top ranking rule by confidence consisted of HLA-DRB1*15:01, SLC30A7-rs56678847 and AC093277.1-rs6880809; carriers of these variants had a significantly greater risk for MS (odds ratio = 20.2, 95% CI: 8.5, 37.5; p = 4 × 10(−9)). Several variants were shared across rules, the most common was INTS8-rs78727559, which was in 32.5% of rules. (4) Conclusions: In summary, we demonstrate evidence that specific combinations of MS risk variants disproportionately confer elevated risk by applying a robust analytical framework to a modestly sized study population. MDPI 2021-03-03 /pmc/articles/PMC7967327/ /pubmed/33802599 http://dx.doi.org/10.3390/ijerph18052518 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Briggs, Farren B. S. Sept, Corriene Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk |
title | Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk |
title_full | Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk |
title_fullStr | Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk |
title_full_unstemmed | Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk |
title_short | Mining Complex Genetic Patterns Conferring Multiple Sclerosis Risk |
title_sort | mining complex genetic patterns conferring multiple sclerosis risk |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7967327/ https://www.ncbi.nlm.nih.gov/pubmed/33802599 http://dx.doi.org/10.3390/ijerph18052518 |
work_keys_str_mv | AT briggsfarrenbs miningcomplexgeneticpatternsconferringmultiplesclerosisrisk AT septcorriene miningcomplexgeneticpatternsconferringmultiplesclerosisrisk |