Cargando…
Methylation risk scores are associated with a collection of phenotypes within electronic health record systems
Inference of clinical phenotypes is a fundamental task in precision medicine, and has therefore been heavily investigated in recent years in the context of electronic health records (EHR) using a large arsenal of machine learning techniques, as well as in the context of genetics using polygenic risk...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9411568/ https://www.ncbi.nlm.nih.gov/pubmed/36008412 http://dx.doi.org/10.1038/s41525-022-00320-1 |
_version_ | 1784775298137456640 |
---|---|
author | Thompson, Mike Hill, Brian L. Rakocz, Nadav Chiang, Jeffrey N. Geschwind, Daniel Sankararaman, Sriram Hofer, Ira Cannesson, Maxime Zaitlen, Noah Halperin, Eran |
author_facet | Thompson, Mike Hill, Brian L. Rakocz, Nadav Chiang, Jeffrey N. Geschwind, Daniel Sankararaman, Sriram Hofer, Ira Cannesson, Maxime Zaitlen, Noah Halperin, Eran |
author_sort | Thompson, Mike |
collection | PubMed |
description | Inference of clinical phenotypes is a fundamental task in precision medicine, and has therefore been heavily investigated in recent years in the context of electronic health records (EHR) using a large arsenal of machine learning techniques, as well as in the context of genetics using polygenic risk scores (PRS). In this work, we considered the epigenetic analog of PRS, methylation risk scores (MRS), a linear combination of methylation states. We measured methylation across a large cohort (n = 831) of diverse samples in the UCLA Health biobank, for which both genetic and complete EHR data are available. We constructed MRS for 607 phenotypes spanning diagnoses, clinical lab tests, and medication prescriptions. When added to a baseline set of predictive features, MRS significantly improved the imputation of 139 outcomes, whereas the PRS improved only 22 (median improvement for methylation 10.74%, 141.52%, and 15.46% in medications, labs, and diagnosis codes, respectively, whereas genotypes only improved the labs at a median increase of 18.42%). We added significant MRS to state-of-the-art EHR imputation methods that leverage the entire set of medical records, and found that including MRS as a medical feature in the algorithm significantly improves EHR imputation in 37% of lab tests examined (median R(2) increase 47.6%). Finally, we replicated several MRS in multiple external studies of methylation (minimum p-value of 2.72 × 10(−7)) and replicated 22 of 30 tested MRS internally in two separate cohorts of different ethnicity. Our publicly available results and weights show promise for methylation risk scores as clinical and scientific tools. |
format | Online Article Text |
id | pubmed-9411568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-94115682022-08-27 Methylation risk scores are associated with a collection of phenotypes within electronic health record systems Thompson, Mike Hill, Brian L. Rakocz, Nadav Chiang, Jeffrey N. Geschwind, Daniel Sankararaman, Sriram Hofer, Ira Cannesson, Maxime Zaitlen, Noah Halperin, Eran NPJ Genom Med Article Inference of clinical phenotypes is a fundamental task in precision medicine, and has therefore been heavily investigated in recent years in the context of electronic health records (EHR) using a large arsenal of machine learning techniques, as well as in the context of genetics using polygenic risk scores (PRS). In this work, we considered the epigenetic analog of PRS, methylation risk scores (MRS), a linear combination of methylation states. We measured methylation across a large cohort (n = 831) of diverse samples in the UCLA Health biobank, for which both genetic and complete EHR data are available. We constructed MRS for 607 phenotypes spanning diagnoses, clinical lab tests, and medication prescriptions. When added to a baseline set of predictive features, MRS significantly improved the imputation of 139 outcomes, whereas the PRS improved only 22 (median improvement for methylation 10.74%, 141.52%, and 15.46% in medications, labs, and diagnosis codes, respectively, whereas genotypes only improved the labs at a median increase of 18.42%). We added significant MRS to state-of-the-art EHR imputation methods that leverage the entire set of medical records, and found that including MRS as a medical feature in the algorithm significantly improves EHR imputation in 37% of lab tests examined (median R(2) increase 47.6%). Finally, we replicated several MRS in multiple external studies of methylation (minimum p-value of 2.72 × 10(−7)) and replicated 22 of 30 tested MRS internally in two separate cohorts of different ethnicity. Our publicly available results and weights show promise for methylation risk scores as clinical and scientific tools. Nature Publishing Group UK 2022-08-25 /pmc/articles/PMC9411568/ /pubmed/36008412 http://dx.doi.org/10.1038/s41525-022-00320-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Thompson, Mike Hill, Brian L. Rakocz, Nadav Chiang, Jeffrey N. Geschwind, Daniel Sankararaman, Sriram Hofer, Ira Cannesson, Maxime Zaitlen, Noah Halperin, Eran Methylation risk scores are associated with a collection of phenotypes within electronic health record systems |
title | Methylation risk scores are associated with a collection of phenotypes within electronic health record systems |
title_full | Methylation risk scores are associated with a collection of phenotypes within electronic health record systems |
title_fullStr | Methylation risk scores are associated with a collection of phenotypes within electronic health record systems |
title_full_unstemmed | Methylation risk scores are associated with a collection of phenotypes within electronic health record systems |
title_short | Methylation risk scores are associated with a collection of phenotypes within electronic health record systems |
title_sort | methylation risk scores are associated with a collection of phenotypes within electronic health record systems |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9411568/ https://www.ncbi.nlm.nih.gov/pubmed/36008412 http://dx.doi.org/10.1038/s41525-022-00320-1 |
work_keys_str_mv | AT thompsonmike methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT hillbrianl methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT rakocznadav methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT chiangjeffreyn methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT geschwinddaniel methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT sankararamansriram methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT hoferira methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT cannessonmaxime methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT zaitlennoah methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems AT halperineran methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems |