Cargando…

Methylation risk scores are associated with a collection of phenotypes within electronic health record systems

Inference of clinical phenotypes is a fundamental task in precision medicine, and has therefore been heavily investigated in recent years in the context of electronic health records (EHR) using a large arsenal of machine learning techniques, as well as in the context of genetics using polygenic risk...

Descripción completa

Detalles Bibliográficos
Autores principales: Thompson, Mike, Hill, Brian L., Rakocz, Nadav, Chiang, Jeffrey N., Geschwind, Daniel, Sankararaman, Sriram, Hofer, Ira, Cannesson, Maxime, Zaitlen, Noah, Halperin, Eran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9411568/
https://www.ncbi.nlm.nih.gov/pubmed/36008412
http://dx.doi.org/10.1038/s41525-022-00320-1
_version_ 1784775298137456640
author Thompson, Mike
Hill, Brian L.
Rakocz, Nadav
Chiang, Jeffrey N.
Geschwind, Daniel
Sankararaman, Sriram
Hofer, Ira
Cannesson, Maxime
Zaitlen, Noah
Halperin, Eran
author_facet Thompson, Mike
Hill, Brian L.
Rakocz, Nadav
Chiang, Jeffrey N.
Geschwind, Daniel
Sankararaman, Sriram
Hofer, Ira
Cannesson, Maxime
Zaitlen, Noah
Halperin, Eran
author_sort Thompson, Mike
collection PubMed
description Inference of clinical phenotypes is a fundamental task in precision medicine, and has therefore been heavily investigated in recent years in the context of electronic health records (EHR) using a large arsenal of machine learning techniques, as well as in the context of genetics using polygenic risk scores (PRS). In this work, we considered the epigenetic analog of PRS, methylation risk scores (MRS), a linear combination of methylation states. We measured methylation across a large cohort (n = 831) of diverse samples in the UCLA Health biobank, for which both genetic and complete EHR data are available. We constructed MRS for 607 phenotypes spanning diagnoses, clinical lab tests, and medication prescriptions. When added to a baseline set of predictive features, MRS significantly improved the imputation of 139 outcomes, whereas the PRS improved only 22 (median improvement for methylation 10.74%, 141.52%, and 15.46% in medications, labs, and diagnosis codes, respectively, whereas genotypes only improved the labs at a median increase of 18.42%). We added significant MRS to state-of-the-art EHR imputation methods that leverage the entire set of medical records, and found that including MRS as a medical feature in the algorithm significantly improves EHR imputation in 37% of lab tests examined (median R(2) increase 47.6%). Finally, we replicated several MRS in multiple external studies of methylation (minimum p-value of 2.72 × 10(−7)) and replicated 22 of 30 tested MRS internally in two separate cohorts of different ethnicity. Our publicly available results and weights show promise for methylation risk scores as clinical and scientific tools.
format Online
Article
Text
id pubmed-9411568
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-94115682022-08-27 Methylation risk scores are associated with a collection of phenotypes within electronic health record systems Thompson, Mike Hill, Brian L. Rakocz, Nadav Chiang, Jeffrey N. Geschwind, Daniel Sankararaman, Sriram Hofer, Ira Cannesson, Maxime Zaitlen, Noah Halperin, Eran NPJ Genom Med Article Inference of clinical phenotypes is a fundamental task in precision medicine, and has therefore been heavily investigated in recent years in the context of electronic health records (EHR) using a large arsenal of machine learning techniques, as well as in the context of genetics using polygenic risk scores (PRS). In this work, we considered the epigenetic analog of PRS, methylation risk scores (MRS), a linear combination of methylation states. We measured methylation across a large cohort (n = 831) of diverse samples in the UCLA Health biobank, for which both genetic and complete EHR data are available. We constructed MRS for 607 phenotypes spanning diagnoses, clinical lab tests, and medication prescriptions. When added to a baseline set of predictive features, MRS significantly improved the imputation of 139 outcomes, whereas the PRS improved only 22 (median improvement for methylation 10.74%, 141.52%, and 15.46% in medications, labs, and diagnosis codes, respectively, whereas genotypes only improved the labs at a median increase of 18.42%). We added significant MRS to state-of-the-art EHR imputation methods that leverage the entire set of medical records, and found that including MRS as a medical feature in the algorithm significantly improves EHR imputation in 37% of lab tests examined (median R(2) increase 47.6%). Finally, we replicated several MRS in multiple external studies of methylation (minimum p-value of 2.72 × 10(−7)) and replicated 22 of 30 tested MRS internally in two separate cohorts of different ethnicity. Our publicly available results and weights show promise for methylation risk scores as clinical and scientific tools. Nature Publishing Group UK 2022-08-25 /pmc/articles/PMC9411568/ /pubmed/36008412 http://dx.doi.org/10.1038/s41525-022-00320-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Thompson, Mike
Hill, Brian L.
Rakocz, Nadav
Chiang, Jeffrey N.
Geschwind, Daniel
Sankararaman, Sriram
Hofer, Ira
Cannesson, Maxime
Zaitlen, Noah
Halperin, Eran
Methylation risk scores are associated with a collection of phenotypes within electronic health record systems
title Methylation risk scores are associated with a collection of phenotypes within electronic health record systems
title_full Methylation risk scores are associated with a collection of phenotypes within electronic health record systems
title_fullStr Methylation risk scores are associated with a collection of phenotypes within electronic health record systems
title_full_unstemmed Methylation risk scores are associated with a collection of phenotypes within electronic health record systems
title_short Methylation risk scores are associated with a collection of phenotypes within electronic health record systems
title_sort methylation risk scores are associated with a collection of phenotypes within electronic health record systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9411568/
https://www.ncbi.nlm.nih.gov/pubmed/36008412
http://dx.doi.org/10.1038/s41525-022-00320-1
work_keys_str_mv AT thompsonmike methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT hillbrianl methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT rakocznadav methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT chiangjeffreyn methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT geschwinddaniel methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT sankararamansriram methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT hoferira methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT cannessonmaxime methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT zaitlennoah methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems
AT halperineran methylationriskscoresareassociatedwithacollectionofphenotypeswithinelectronichealthrecordsystems