Cargando…

Parameterizing time in electronic health record studies

Background Fields like nonlinear physics offer methods for analyzing time series, but many methods require that the time series be stationary—no change in properties over time. Objective Medicine is far from stationary, but the challenge may be able to be ameliorated by reparameterizing time because...

Descripción completa

Detalles Bibliográficos
Autores principales: Hripcsak, George, Albers, David J, Perotte, Adler
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6169471/
https://www.ncbi.nlm.nih.gov/pubmed/25725004
http://dx.doi.org/10.1093/jamia/ocu051
_version_ 1783360524651593728
author Hripcsak, George
Albers, David J
Perotte, Adler
author_facet Hripcsak, George
Albers, David J
Perotte, Adler
author_sort Hripcsak, George
collection PubMed
description Background Fields like nonlinear physics offer methods for analyzing time series, but many methods require that the time series be stationary—no change in properties over time. Objective Medicine is far from stationary, but the challenge may be able to be ameliorated by reparameterizing time because clinicians tend to measure patients more frequently when they are ill and are more likely to vary. Methods We compared time parameterizations, measuring variability of rate of change and magnitude of change, and looking for homogeneity of bins of temporal separation between pairs of time points. We studied four common laboratory tests drawn from 25 years of electronic health records on 4 million patients. Results We found that sequence time—that is, simply counting the number of measurements from some start—produced more stationary time series, better explained the variation in values, and had more homogeneous bins than either traditional clock time or a recently proposed intermediate parameterization. Sequence time produced more accurate predictions in a single Gaussian process model experiment. Conclusions Of the three parameterizations, sequence time appeared to produce the most stationary series, possibly because clinicians adjust their sampling to the acuity of the patient. Parameterizing by sequence time may be applicable to association and clustering experiments on electronic health record data. A limitation of this study is that laboratory data were derived from only one institution. Sequence time appears to be an important potential parameterization.
format Online
Article
Text
id pubmed-6169471
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61694712018-10-09 Parameterizing time in electronic health record studies Hripcsak, George Albers, David J Perotte, Adler J Am Med Inform Assoc Research and Applications Background Fields like nonlinear physics offer methods for analyzing time series, but many methods require that the time series be stationary—no change in properties over time. Objective Medicine is far from stationary, but the challenge may be able to be ameliorated by reparameterizing time because clinicians tend to measure patients more frequently when they are ill and are more likely to vary. Methods We compared time parameterizations, measuring variability of rate of change and magnitude of change, and looking for homogeneity of bins of temporal separation between pairs of time points. We studied four common laboratory tests drawn from 25 years of electronic health records on 4 million patients. Results We found that sequence time—that is, simply counting the number of measurements from some start—produced more stationary time series, better explained the variation in values, and had more homogeneous bins than either traditional clock time or a recently proposed intermediate parameterization. Sequence time produced more accurate predictions in a single Gaussian process model experiment. Conclusions Of the three parameterizations, sequence time appeared to produce the most stationary series, possibly because clinicians adjust their sampling to the acuity of the patient. Parameterizing by sequence time may be applicable to association and clustering experiments on electronic health record data. A limitation of this study is that laboratory data were derived from only one institution. Sequence time appears to be an important potential parameterization. Oxford University Press 2015-07 2015-02-27 /pmc/articles/PMC6169471/ /pubmed/25725004 http://dx.doi.org/10.1093/jamia/ocu051 Text en © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Research and Applications
Hripcsak, George
Albers, David J
Perotte, Adler
Parameterizing time in electronic health record studies
title Parameterizing time in electronic health record studies
title_full Parameterizing time in electronic health record studies
title_fullStr Parameterizing time in electronic health record studies
title_full_unstemmed Parameterizing time in electronic health record studies
title_short Parameterizing time in electronic health record studies
title_sort parameterizing time in electronic health record studies
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6169471/
https://www.ncbi.nlm.nih.gov/pubmed/25725004
http://dx.doi.org/10.1093/jamia/ocu051
work_keys_str_mv AT hripcsakgeorge parameterizingtimeinelectronichealthrecordstudies
AT albersdavidj parameterizingtimeinelectronichealthrecordstudies
AT perotteadler parameterizingtimeinelectronichealthrecordstudies