Cargando…

Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis

Many complex diseases share common genetic determinants and are comorbid in a population. We hypothesized that the co-occurrences of diseases and their overlapping genetic etiology can be exploited to simultaneously improve multiple diseases’ polygenic risk scores (PRS). This hypothesis was tested u...

Descripción completa

Detalles Bibliográficos
Autores principales: Badré, Adrien, Pan, Chongle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10328362/
https://www.ncbi.nlm.nih.gov/pubmed/37418352
http://dx.doi.org/10.1371/journal.pcbi.1011211
_version_ 1785069781964029952
author Badré, Adrien
Pan, Chongle
author_facet Badré, Adrien
Pan, Chongle
author_sort Badré, Adrien
collection PubMed
description Many complex diseases share common genetic determinants and are comorbid in a population. We hypothesized that the co-occurrences of diseases and their overlapping genetic etiology can be exploited to simultaneously improve multiple diseases’ polygenic risk scores (PRS). This hypothesis was tested using a multi-task learning (MTL) approach based on an explainable neural network architecture. We found that parallel estimations of the PRS for 17 prevalent cancers in a pan-cancer MTL model were generally more accurate than independent estimations for individual cancers in comparable single-task learning (STL) models. Such performance improvement conferred by positive transfer learning was also observed consistently for 60 prevalent non-cancer diseases in a pan-disease MTL model. Interpretation of the MTL models revealed significant genetic correlations between the important sets of single nucleotide polymorphisms used by the neural network for PRS estimation. This suggested a well-connected network of diseases with shared genetic basis.
format Online
Article
Text
id pubmed-10328362
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-103283622023-07-08 Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis Badré, Adrien Pan, Chongle PLoS Comput Biol Research Article Many complex diseases share common genetic determinants and are comorbid in a population. We hypothesized that the co-occurrences of diseases and their overlapping genetic etiology can be exploited to simultaneously improve multiple diseases’ polygenic risk scores (PRS). This hypothesis was tested using a multi-task learning (MTL) approach based on an explainable neural network architecture. We found that parallel estimations of the PRS for 17 prevalent cancers in a pan-cancer MTL model were generally more accurate than independent estimations for individual cancers in comparable single-task learning (STL) models. Such performance improvement conferred by positive transfer learning was also observed consistently for 60 prevalent non-cancer diseases in a pan-disease MTL model. Interpretation of the MTL models revealed significant genetic correlations between the important sets of single nucleotide polymorphisms used by the neural network for PRS estimation. This suggested a well-connected network of diseases with shared genetic basis. Public Library of Science 2023-07-07 /pmc/articles/PMC10328362/ /pubmed/37418352 http://dx.doi.org/10.1371/journal.pcbi.1011211 Text en © 2023 Badré, Pan https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Badré, Adrien
Pan, Chongle
Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis
title Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis
title_full Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis
title_fullStr Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis
title_full_unstemmed Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis
title_short Explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis
title_sort explainable multi-task learning improves the parallel estimation of polygenic risk scores for many diseases through shared genetic basis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10328362/
https://www.ncbi.nlm.nih.gov/pubmed/37418352
http://dx.doi.org/10.1371/journal.pcbi.1011211
work_keys_str_mv AT badreadrien explainablemultitasklearningimprovestheparallelestimationofpolygenicriskscoresformanydiseasesthroughsharedgeneticbasis
AT panchongle explainablemultitasklearningimprovestheparallelestimationofpolygenicriskscoresformanydiseasesthroughsharedgeneticbasis