Cargando…

Genetic association studies using disease liabilities from deep neural networks

The case-control study is a widely used method for investigating the genetic landscape of binary traits. However, the health-related outcome or disease status of participants in long-term, prospective cohort studies such as the UK Biobank are subject to change. Here, we develop an approach for the g...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Lu, Sadler, Marie C., Altman, Russ B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882423/
https://www.ncbi.nlm.nih.gov/pubmed/36712099
http://dx.doi.org/10.1101/2023.01.18.23284383
_version_ 1784879290790182912
author Yang, Lu
Sadler, Marie C.
Altman, Russ B.
author_facet Yang, Lu
Sadler, Marie C.
Altman, Russ B.
author_sort Yang, Lu
collection PubMed
description The case-control study is a widely used method for investigating the genetic landscape of binary traits. However, the health-related outcome or disease status of participants in long-term, prospective cohort studies such as the UK Biobank are subject to change. Here, we develop an approach for the genetic association study leveraging disease liabilities computed from a deep patient phenotyping framework (AI-based liability). Analyzing 44 common traits in 261,807 participants from the UK Biobank, we identified novel loci compared to the conventional case-control (CC) association studies. Our results showed that combining liability scores with CC status was more powerful than the CC-GWAS in detecting independent genetic loci across different diseases. This boost in statistical power was further reflected in increased SNP-based heritability estimates. Moreover, polygenic risk scores calculated from AI-based liabilities better identified newly diagnosed cases in the 2022 release of the UK Biobank that served as controls in the 2019 version (6.2% percentile rank increase on average). These findings demonstrate the utility of deep neural networks that are able to model disease liabilities from high-dimensional phenotypic data in large-scale population cohorts. Our pipeline of genome-wide association studies with disease liabilities can be applied to other biobanks with rich phenotype and genotype data.
format Online
Article
Text
id pubmed-9882423
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98824232023-01-28 Genetic association studies using disease liabilities from deep neural networks Yang, Lu Sadler, Marie C. Altman, Russ B. medRxiv Article The case-control study is a widely used method for investigating the genetic landscape of binary traits. However, the health-related outcome or disease status of participants in long-term, prospective cohort studies such as the UK Biobank are subject to change. Here, we develop an approach for the genetic association study leveraging disease liabilities computed from a deep patient phenotyping framework (AI-based liability). Analyzing 44 common traits in 261,807 participants from the UK Biobank, we identified novel loci compared to the conventional case-control (CC) association studies. Our results showed that combining liability scores with CC status was more powerful than the CC-GWAS in detecting independent genetic loci across different diseases. This boost in statistical power was further reflected in increased SNP-based heritability estimates. Moreover, polygenic risk scores calculated from AI-based liabilities better identified newly diagnosed cases in the 2022 release of the UK Biobank that served as controls in the 2019 version (6.2% percentile rank increase on average). These findings demonstrate the utility of deep neural networks that are able to model disease liabilities from high-dimensional phenotypic data in large-scale population cohorts. Our pipeline of genome-wide association studies with disease liabilities can be applied to other biobanks with rich phenotype and genotype data. Cold Spring Harbor Laboratory 2023-01-19 /pmc/articles/PMC9882423/ /pubmed/36712099 http://dx.doi.org/10.1101/2023.01.18.23284383 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Yang, Lu
Sadler, Marie C.
Altman, Russ B.
Genetic association studies using disease liabilities from deep neural networks
title Genetic association studies using disease liabilities from deep neural networks
title_full Genetic association studies using disease liabilities from deep neural networks
title_fullStr Genetic association studies using disease liabilities from deep neural networks
title_full_unstemmed Genetic association studies using disease liabilities from deep neural networks
title_short Genetic association studies using disease liabilities from deep neural networks
title_sort genetic association studies using disease liabilities from deep neural networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882423/
https://www.ncbi.nlm.nih.gov/pubmed/36712099
http://dx.doi.org/10.1101/2023.01.18.23284383
work_keys_str_mv AT yanglu geneticassociationstudiesusingdiseaseliabilitiesfromdeepneuralnetworks
AT sadlermariec geneticassociationstudiesusingdiseaseliabilitiesfromdeepneuralnetworks
AT altmanrussb geneticassociationstudiesusingdiseaseliabilitiesfromdeepneuralnetworks