Cargando…

Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk

The analysis of longitudinal data from electronic health records (EHRs) has the potential to improve clinical diagnoses and enable personalized medicine, motivating efforts to identify disease subtypes from patient comorbidity information. Here we introduce an age-dependent topic modeling (ATM) meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Xilin, Zhang, Martin Jinye, Zhang, Yidong, Durvasula, Arun, Inouye, Michael, Holmes, Chris, Price, Alkes L., McVean, Gil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10632146/
https://www.ncbi.nlm.nih.gov/pubmed/37814053
http://dx.doi.org/10.1038/s41588-023-01522-8
_version_ 1785132516030545920
author Jiang, Xilin
Zhang, Martin Jinye
Zhang, Yidong
Durvasula, Arun
Inouye, Michael
Holmes, Chris
Price, Alkes L.
McVean, Gil
author_facet Jiang, Xilin
Zhang, Martin Jinye
Zhang, Yidong
Durvasula, Arun
Inouye, Michael
Holmes, Chris
Price, Alkes L.
McVean, Gil
author_sort Jiang, Xilin
collection PubMed
description The analysis of longitudinal data from electronic health records (EHRs) has the potential to improve clinical diagnoses and enable personalized medicine, motivating efforts to identify disease subtypes from patient comorbidity information. Here we introduce an age-dependent topic modeling (ATM) method that provides a low-rank representation of longitudinal records of hundreds of distinct diseases in large EHR datasets. We applied ATM to 282,957 UK Biobank samples, identifying 52 diseases with heterogeneous comorbidity profiles; analyses of 211,908 All of Us samples produced concordant results. We defined subtypes of the 52 heterogeneous diseases based on their comorbidity profiles and compared genetic risk across disease subtypes using polygenic risk scores (PRSs), identifying 18 disease subtypes whose PRS differed significantly from other subtypes of the same disease. We further identified specific genetic variants with subtype-dependent effects on disease risk. In conclusion, ATM identifies disease subtypes with differential genome-wide and locus-specific genetic risk profiles.
format Online
Article
Text
id pubmed-10632146
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-106321462023-11-10 Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk Jiang, Xilin Zhang, Martin Jinye Zhang, Yidong Durvasula, Arun Inouye, Michael Holmes, Chris Price, Alkes L. McVean, Gil Nat Genet Article The analysis of longitudinal data from electronic health records (EHRs) has the potential to improve clinical diagnoses and enable personalized medicine, motivating efforts to identify disease subtypes from patient comorbidity information. Here we introduce an age-dependent topic modeling (ATM) method that provides a low-rank representation of longitudinal records of hundreds of distinct diseases in large EHR datasets. We applied ATM to 282,957 UK Biobank samples, identifying 52 diseases with heterogeneous comorbidity profiles; analyses of 211,908 All of Us samples produced concordant results. We defined subtypes of the 52 heterogeneous diseases based on their comorbidity profiles and compared genetic risk across disease subtypes using polygenic risk scores (PRSs), identifying 18 disease subtypes whose PRS differed significantly from other subtypes of the same disease. We further identified specific genetic variants with subtype-dependent effects on disease risk. In conclusion, ATM identifies disease subtypes with differential genome-wide and locus-specific genetic risk profiles. Nature Publishing Group US 2023-10-09 2023 /pmc/articles/PMC10632146/ /pubmed/37814053 http://dx.doi.org/10.1038/s41588-023-01522-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Jiang, Xilin
Zhang, Martin Jinye
Zhang, Yidong
Durvasula, Arun
Inouye, Michael
Holmes, Chris
Price, Alkes L.
McVean, Gil
Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk
title Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk
title_full Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk
title_fullStr Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk
title_full_unstemmed Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk
title_short Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk
title_sort age-dependent topic modeling of comorbidities in uk biobank identifies disease subtypes with differential genetic risk
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10632146/
https://www.ncbi.nlm.nih.gov/pubmed/37814053
http://dx.doi.org/10.1038/s41588-023-01522-8
work_keys_str_mv AT jiangxilin agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk
AT zhangmartinjinye agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk
AT zhangyidong agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk
AT durvasulaarun agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk
AT inouyemichael agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk
AT holmeschris agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk
AT pricealkesl agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk
AT mcveangil agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk