Cargando…

Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies

MOTIVATION: Allowance for increasingly large samples is a key to identify the association of genetic variants with Alzheimer’s disease (AD) in genome-wide association studies (GWAS). Accordingly, we aimed to develop a method that incorporates patients with mild cognitive impairment and unknown cogni...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Jun Young, Lee, Jang Jae, Lee, Younghwa, Lee, Dongsoo, Gim, Jungsoo, Farrer, Lindsay, Lee, Kun Ho, Won, Sungho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10539075/
https://www.ncbi.nlm.nih.gov/pubmed/37665736
http://dx.doi.org/10.1093/bioinformatics/btad534
_version_ 1785113424044228608
author Park, Jun Young
Lee, Jang Jae
Lee, Younghwa
Lee, Dongsoo
Gim, Jungsoo
Farrer, Lindsay
Lee, Kun Ho
Won, Sungho
author_facet Park, Jun Young
Lee, Jang Jae
Lee, Younghwa
Lee, Dongsoo
Gim, Jungsoo
Farrer, Lindsay
Lee, Kun Ho
Won, Sungho
author_sort Park, Jun Young
collection PubMed
description MOTIVATION: Allowance for increasingly large samples is a key to identify the association of genetic variants with Alzheimer’s disease (AD) in genome-wide association studies (GWAS). Accordingly, we aimed to develop a method that incorporates patients with mild cognitive impairment and unknown cognitive status in GWAS using a machine learning-based AD prediction model. RESULTS: Simulation analyses showed that weighting imputed phenotypes method increased the statistical power compared to ordinary logistic regression using only AD cases and controls. Applied to real-world data, the penalized logistic method had the highest AUC (0.96) for AD prediction and weighting imputed phenotypes method performed well in terms of power. We identified an association (P< [Formula: see text]) of AD with several variants in the APOE region and rs143625563 in LMX1A. Our method, which allows the inclusion of individuals with mild cognitive impairment, improves the statistical power of GWAS for AD. We discovered a novel association with LMX1A. AVAILABILITY AND IMPLEMENTATION: Simulation codes can be accessed at https://github.com/Junkkkk/wGEE_GWAS.
format Online
Article
Text
id pubmed-10539075
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105390752023-09-29 Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies Park, Jun Young Lee, Jang Jae Lee, Younghwa Lee, Dongsoo Gim, Jungsoo Farrer, Lindsay Lee, Kun Ho Won, Sungho Bioinformatics Original Paper MOTIVATION: Allowance for increasingly large samples is a key to identify the association of genetic variants with Alzheimer’s disease (AD) in genome-wide association studies (GWAS). Accordingly, we aimed to develop a method that incorporates patients with mild cognitive impairment and unknown cognitive status in GWAS using a machine learning-based AD prediction model. RESULTS: Simulation analyses showed that weighting imputed phenotypes method increased the statistical power compared to ordinary logistic regression using only AD cases and controls. Applied to real-world data, the penalized logistic method had the highest AUC (0.96) for AD prediction and weighting imputed phenotypes method performed well in terms of power. We identified an association (P< [Formula: see text]) of AD with several variants in the APOE region and rs143625563 in LMX1A. Our method, which allows the inclusion of individuals with mild cognitive impairment, improves the statistical power of GWAS for AD. We discovered a novel association with LMX1A. AVAILABILITY AND IMPLEMENTATION: Simulation codes can be accessed at https://github.com/Junkkkk/wGEE_GWAS. Oxford University Press 2023-09-04 /pmc/articles/PMC10539075/ /pubmed/37665736 http://dx.doi.org/10.1093/bioinformatics/btad534 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Park, Jun Young
Lee, Jang Jae
Lee, Younghwa
Lee, Dongsoo
Gim, Jungsoo
Farrer, Lindsay
Lee, Kun Ho
Won, Sungho
Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies
title Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies
title_full Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies
title_fullStr Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies
title_full_unstemmed Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies
title_short Machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies
title_sort machine learning-based quantification for disease uncertainty increases the statistical power of genetic association studies
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10539075/
https://www.ncbi.nlm.nih.gov/pubmed/37665736
http://dx.doi.org/10.1093/bioinformatics/btad534
work_keys_str_mv AT parkjunyoung machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies
AT leejangjae machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies
AT leeyounghwa machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies
AT leedongsoo machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies
AT gimjungsoo machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies
AT farrerlindsay machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies
AT leekunho machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies
AT wonsungho machinelearningbasedquantificationfordiseaseuncertaintyincreasesthestatisticalpowerofgeneticassociationstudies