Cargando…
Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk
The analysis of longitudinal data from electronic health records (EHRs) has the potential to improve clinical diagnoses and enable personalized medicine, motivating efforts to identify disease subtypes from patient comorbidity information. Here we introduce an age-dependent topic modeling (ATM) meth...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10632146/ https://www.ncbi.nlm.nih.gov/pubmed/37814053 http://dx.doi.org/10.1038/s41588-023-01522-8 |
_version_ | 1785132516030545920 |
---|---|
author | Jiang, Xilin Zhang, Martin Jinye Zhang, Yidong Durvasula, Arun Inouye, Michael Holmes, Chris Price, Alkes L. McVean, Gil |
author_facet | Jiang, Xilin Zhang, Martin Jinye Zhang, Yidong Durvasula, Arun Inouye, Michael Holmes, Chris Price, Alkes L. McVean, Gil |
author_sort | Jiang, Xilin |
collection | PubMed |
description | The analysis of longitudinal data from electronic health records (EHRs) has the potential to improve clinical diagnoses and enable personalized medicine, motivating efforts to identify disease subtypes from patient comorbidity information. Here we introduce an age-dependent topic modeling (ATM) method that provides a low-rank representation of longitudinal records of hundreds of distinct diseases in large EHR datasets. We applied ATM to 282,957 UK Biobank samples, identifying 52 diseases with heterogeneous comorbidity profiles; analyses of 211,908 All of Us samples produced concordant results. We defined subtypes of the 52 heterogeneous diseases based on their comorbidity profiles and compared genetic risk across disease subtypes using polygenic risk scores (PRSs), identifying 18 disease subtypes whose PRS differed significantly from other subtypes of the same disease. We further identified specific genetic variants with subtype-dependent effects on disease risk. In conclusion, ATM identifies disease subtypes with differential genome-wide and locus-specific genetic risk profiles. |
format | Online Article Text |
id | pubmed-10632146 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-106321462023-11-10 Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk Jiang, Xilin Zhang, Martin Jinye Zhang, Yidong Durvasula, Arun Inouye, Michael Holmes, Chris Price, Alkes L. McVean, Gil Nat Genet Article The analysis of longitudinal data from electronic health records (EHRs) has the potential to improve clinical diagnoses and enable personalized medicine, motivating efforts to identify disease subtypes from patient comorbidity information. Here we introduce an age-dependent topic modeling (ATM) method that provides a low-rank representation of longitudinal records of hundreds of distinct diseases in large EHR datasets. We applied ATM to 282,957 UK Biobank samples, identifying 52 diseases with heterogeneous comorbidity profiles; analyses of 211,908 All of Us samples produced concordant results. We defined subtypes of the 52 heterogeneous diseases based on their comorbidity profiles and compared genetic risk across disease subtypes using polygenic risk scores (PRSs), identifying 18 disease subtypes whose PRS differed significantly from other subtypes of the same disease. We further identified specific genetic variants with subtype-dependent effects on disease risk. In conclusion, ATM identifies disease subtypes with differential genome-wide and locus-specific genetic risk profiles. Nature Publishing Group US 2023-10-09 2023 /pmc/articles/PMC10632146/ /pubmed/37814053 http://dx.doi.org/10.1038/s41588-023-01522-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Jiang, Xilin Zhang, Martin Jinye Zhang, Yidong Durvasula, Arun Inouye, Michael Holmes, Chris Price, Alkes L. McVean, Gil Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk |
title | Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk |
title_full | Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk |
title_fullStr | Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk |
title_full_unstemmed | Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk |
title_short | Age-dependent topic modeling of comorbidities in UK Biobank identifies disease subtypes with differential genetic risk |
title_sort | age-dependent topic modeling of comorbidities in uk biobank identifies disease subtypes with differential genetic risk |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10632146/ https://www.ncbi.nlm.nih.gov/pubmed/37814053 http://dx.doi.org/10.1038/s41588-023-01522-8 |
work_keys_str_mv | AT jiangxilin agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk AT zhangmartinjinye agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk AT zhangyidong agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk AT durvasulaarun agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk AT inouyemichael agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk AT holmeschris agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk AT pricealkesl agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk AT mcveangil agedependenttopicmodelingofcomorbiditiesinukbiobankidentifiesdiseasesubtypeswithdifferentialgeneticrisk |