Cargando…

Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk

The benefits of using electronic health records (EHRs) for disease risk screening and personalized health-care decisions are being increasingly recognized. Here we present a computationally feasible statistical approach with which to address the methodological challenges involved in utilizing histor...

Descripción completa

Detalles Bibliográficos
Autores principales: Paige, Ellie, Barrett, Jessica, Stevens, David, Keogh, Ruth H, Sweeting, Michael J, Nazareth, Irwin, Petersen, Irene, Wood, Angela M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030927/
https://www.ncbi.nlm.nih.gov/pubmed/29584812
http://dx.doi.org/10.1093/aje/kwy018
_version_ 1783337222580207616
author Paige, Ellie
Barrett, Jessica
Stevens, David
Keogh, Ruth H
Sweeting, Michael J
Nazareth, Irwin
Petersen, Irene
Wood, Angela M
author_facet Paige, Ellie
Barrett, Jessica
Stevens, David
Keogh, Ruth H
Sweeting, Michael J
Nazareth, Irwin
Petersen, Irene
Wood, Angela M
author_sort Paige, Ellie
collection PubMed
description The benefits of using electronic health records (EHRs) for disease risk screening and personalized health-care decisions are being increasingly recognized. Here we present a computationally feasible statistical approach with which to address the methodological challenges involved in utilizing historical repeat measures of multiple risk factors recorded in EHRs to systematically identify patients at high risk of future disease. The approach is principally based on a 2-stage dynamic landmark model. The first stage estimates current risk factor values from all available historical repeat risk factor measurements via landmark-age–specific multivariate linear mixed-effects models with correlated random intercepts, which account for sporadically recorded repeat measures, unobserved data, and measurement errors. The second stage predicts future disease risk from a sex-stratified Cox proportional hazards model, with estimated current risk factor values from the first stage. We exemplify these methods by developing and validating a dynamic 10-year cardiovascular disease risk prediction model using primary-care EHRs for age, diabetes status, hypertension treatment, smoking status, systolic blood pressure, total cholesterol, and high-density lipoprotein cholesterol in 41,373 persons from 10 primary-care practices in England and Wales contributing to The Health Improvement Network (1997–2016). Using cross-validation, the model was well-calibrated (Brier score = 0.041, 95% confidence interval: 0.039, 0.042) and had good discrimination (C-index = 0.768, 95% confidence interval: 0.759, 0.777).
format Online
Article
Text
id pubmed-6030927
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60309272018-07-10 Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk Paige, Ellie Barrett, Jessica Stevens, David Keogh, Ruth H Sweeting, Michael J Nazareth, Irwin Petersen, Irene Wood, Angela M Am J Epidemiol Practice of Epidemiology The benefits of using electronic health records (EHRs) for disease risk screening and personalized health-care decisions are being increasingly recognized. Here we present a computationally feasible statistical approach with which to address the methodological challenges involved in utilizing historical repeat measures of multiple risk factors recorded in EHRs to systematically identify patients at high risk of future disease. The approach is principally based on a 2-stage dynamic landmark model. The first stage estimates current risk factor values from all available historical repeat risk factor measurements via landmark-age–specific multivariate linear mixed-effects models with correlated random intercepts, which account for sporadically recorded repeat measures, unobserved data, and measurement errors. The second stage predicts future disease risk from a sex-stratified Cox proportional hazards model, with estimated current risk factor values from the first stage. We exemplify these methods by developing and validating a dynamic 10-year cardiovascular disease risk prediction model using primary-care EHRs for age, diabetes status, hypertension treatment, smoking status, systolic blood pressure, total cholesterol, and high-density lipoprotein cholesterol in 41,373 persons from 10 primary-care practices in England and Wales contributing to The Health Improvement Network (1997–2016). Using cross-validation, the model was well-calibrated (Brier score = 0.041, 95% confidence interval: 0.039, 0.042) and had good discrimination (C-index = 0.768, 95% confidence interval: 0.759, 0.777). Oxford University Press 2018-07 2018-03-23 /pmc/articles/PMC6030927/ /pubmed/29584812 http://dx.doi.org/10.1093/aje/kwy018 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Practice of Epidemiology
Paige, Ellie
Barrett, Jessica
Stevens, David
Keogh, Ruth H
Sweeting, Michael J
Nazareth, Irwin
Petersen, Irene
Wood, Angela M
Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk
title Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk
title_full Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk
title_fullStr Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk
title_full_unstemmed Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk
title_short Landmark Models for Optimizing the Use of Repeated Measurements of Risk Factors in Electronic Health Records to Predict Future Disease Risk
title_sort landmark models for optimizing the use of repeated measurements of risk factors in electronic health records to predict future disease risk
topic Practice of Epidemiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030927/
https://www.ncbi.nlm.nih.gov/pubmed/29584812
http://dx.doi.org/10.1093/aje/kwy018
work_keys_str_mv AT paigeellie landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk
AT barrettjessica landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk
AT stevensdavid landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk
AT keoghruthh landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk
AT sweetingmichaelj landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk
AT nazarethirwin landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk
AT petersenirene landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk
AT woodangelam landmarkmodelsforoptimizingtheuseofrepeatedmeasurementsofriskfactorsinelectronichealthrecordstopredictfuturediseaserisk