Cargando…

Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models

Conventional genome-wide association studies (GWASs) of complex traits, such as Multiple Sclerosis (MS), are reliant on per-SNP p-values and are therefore heavily burdened by multiple testing correction. Thus, in order to detect more subtle alterations, ever increasing sample sizes are required, whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Burnard, Sean M., Lea, Rodney A., Benton, Miles, Eccles, David, Kennedy, Daniel W., Lechner-Scott, Jeannette, Scott, Rodney J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8774935/
https://www.ncbi.nlm.nih.gov/pubmed/35052430
http://dx.doi.org/10.3390/genes13010087
_version_ 1784636463206367232
author Burnard, Sean M.
Lea, Rodney A.
Benton, Miles
Eccles, David
Kennedy, Daniel W.
Lechner-Scott, Jeannette
Scott, Rodney J.
author_facet Burnard, Sean M.
Lea, Rodney A.
Benton, Miles
Eccles, David
Kennedy, Daniel W.
Lechner-Scott, Jeannette
Scott, Rodney J.
author_sort Burnard, Sean M.
collection PubMed
description Conventional genome-wide association studies (GWASs) of complex traits, such as Multiple Sclerosis (MS), are reliant on per-SNP p-values and are therefore heavily burdened by multiple testing correction. Thus, in order to detect more subtle alterations, ever increasing sample sizes are required, while ignoring potentially valuable information that is readily available in existing datasets. To overcome this, we used penalised regression incorporating elastic net with a stability selection method by iterative subsampling to detect the potential interaction of loci with MS risk. Through re-analysis of the ANZgene dataset (1617 cases and 1988 controls) and an IMSGC dataset as a replication cohort (1313 cases and 1458 controls), we identified new association signals for MS predisposition, including SNPs above and below conventional significance thresholds while targeting two natural killer receptor loci and the well-established HLA loci. For example, rs2844482 (98.1% iterations), otherwise ignored by conventional statistics (p = 0.673) in the same dataset, was independently strongly associated with MS in another GWAS that required more than 40 times the number of cases (~45 K). Further comparison of our hits to those present in a large-scale meta-analysis, confirmed that the majority of SNPs identified by the elastic net model reached conventional statistical GWAS thresholds (p < 5 × 10(−8)) in this much larger dataset. Moreover, we found that gene variants involved in oxidative stress, in addition to innate immunity, were associated with MS. Overall, this study highlights the benefit of using more advanced statistical methods to (re-)analyse subtle genetic variation among loci that have a biological basis for their contribution to disease risk.
format Online
Article
Text
id pubmed-8774935
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87749352022-01-21 Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models Burnard, Sean M. Lea, Rodney A. Benton, Miles Eccles, David Kennedy, Daniel W. Lechner-Scott, Jeannette Scott, Rodney J. Genes (Basel) Article Conventional genome-wide association studies (GWASs) of complex traits, such as Multiple Sclerosis (MS), are reliant on per-SNP p-values and are therefore heavily burdened by multiple testing correction. Thus, in order to detect more subtle alterations, ever increasing sample sizes are required, while ignoring potentially valuable information that is readily available in existing datasets. To overcome this, we used penalised regression incorporating elastic net with a stability selection method by iterative subsampling to detect the potential interaction of loci with MS risk. Through re-analysis of the ANZgene dataset (1617 cases and 1988 controls) and an IMSGC dataset as a replication cohort (1313 cases and 1458 controls), we identified new association signals for MS predisposition, including SNPs above and below conventional significance thresholds while targeting two natural killer receptor loci and the well-established HLA loci. For example, rs2844482 (98.1% iterations), otherwise ignored by conventional statistics (p = 0.673) in the same dataset, was independently strongly associated with MS in another GWAS that required more than 40 times the number of cases (~45 K). Further comparison of our hits to those present in a large-scale meta-analysis, confirmed that the majority of SNPs identified by the elastic net model reached conventional statistical GWAS thresholds (p < 5 × 10(−8)) in this much larger dataset. Moreover, we found that gene variants involved in oxidative stress, in addition to innate immunity, were associated with MS. Overall, this study highlights the benefit of using more advanced statistical methods to (re-)analyse subtle genetic variation among loci that have a biological basis for their contribution to disease risk. MDPI 2021-12-29 /pmc/articles/PMC8774935/ /pubmed/35052430 http://dx.doi.org/10.3390/genes13010087 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Burnard, Sean M.
Lea, Rodney A.
Benton, Miles
Eccles, David
Kennedy, Daniel W.
Lechner-Scott, Jeannette
Scott, Rodney J.
Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models
title Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models
title_full Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models
title_fullStr Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models
title_full_unstemmed Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models
title_short Capturing SNP Association across the NK Receptor and HLA Gene Regions in Multiple Sclerosis by Targeted Penalised Regression Models
title_sort capturing snp association across the nk receptor and hla gene regions in multiple sclerosis by targeted penalised regression models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8774935/
https://www.ncbi.nlm.nih.gov/pubmed/35052430
http://dx.doi.org/10.3390/genes13010087
work_keys_str_mv AT burnardseanm capturingsnpassociationacrossthenkreceptorandhlageneregionsinmultiplesclerosisbytargetedpenalisedregressionmodels
AT learodneya capturingsnpassociationacrossthenkreceptorandhlageneregionsinmultiplesclerosisbytargetedpenalisedregressionmodels
AT bentonmiles capturingsnpassociationacrossthenkreceptorandhlageneregionsinmultiplesclerosisbytargetedpenalisedregressionmodels
AT ecclesdavid capturingsnpassociationacrossthenkreceptorandhlageneregionsinmultiplesclerosisbytargetedpenalisedregressionmodels
AT kennedydanielw capturingsnpassociationacrossthenkreceptorandhlageneregionsinmultiplesclerosisbytargetedpenalisedregressionmodels
AT lechnerscottjeannette capturingsnpassociationacrossthenkreceptorandhlageneregionsinmultiplesclerosisbytargetedpenalisedregressionmodels
AT scottrodneyj capturingsnpassociationacrossthenkreceptorandhlageneregionsinmultiplesclerosisbytargetedpenalisedregressionmodels