Cargando…

Use of attribute association error probability estimates to evaluate quality of medical record geocodes

BACKGROUND: The utility of patient attributes associated with the spatiotemporal analysis of medical records lies not just in their values but also the strength of association between them. Estimating the extent to which a hierarchy of conditional probability exists between patient attribute associa...

Descripción completa

Detalles Bibliográficos
Autores principales: Klaus, Christian A., Carrasco, Luis E., Goldberg, Daniel W., Henry, Kevin A., Sherman, Recinda L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4570180/
https://www.ncbi.nlm.nih.gov/pubmed/26370237
http://dx.doi.org/10.1186/s12942-015-0019-3
_version_ 1782390159653732352
author Klaus, Christian A.
Carrasco, Luis E.
Goldberg, Daniel W.
Henry, Kevin A.
Sherman, Recinda L.
author_facet Klaus, Christian A.
Carrasco, Luis E.
Goldberg, Daniel W.
Henry, Kevin A.
Sherman, Recinda L.
author_sort Klaus, Christian A.
collection PubMed
description BACKGROUND: The utility of patient attributes associated with the spatiotemporal analysis of medical records lies not just in their values but also the strength of association between them. Estimating the extent to which a hierarchy of conditional probability exists between patient attribute associations such as patient identifying fields, patient and date of diagnosis, and patient and address at diagnosis is fundamental to estimating the strength of association between patient and geocode, and patient and enumeration area. We propose a hierarchy for the attribute associations within medical records that enable spatiotemporal relationships. We also present a set of metrics that store attribute association error probability (AAEP), to estimate error probability for all attribute associations upon which certainty in a patient geocode depends. METHODS: A series of experiments were undertaken to understand how error estimation could be operationalized within health data and what levels of AAEP in real data reveal themselves using these methods. Specifically, the goals of this evaluation were to (1) assess if the concept of our error assessment techniques could be implemented by a population-based cancer registry; (2) apply the techniques to real data from a large health data agency and characterize the observed levels of AAEP; and (3) demonstrate how detected AAEP might impact spatiotemporal health research. RESULTS: We present an evaluation of AAEP metrics generated for cancer cases in a North Carolina county. We show examples of how we estimated AAEP for selected attribute associations and circumstances. We demonstrate the distribution of AAEP in our case sample across attribute associations, and demonstrate ways in which disease registry specific operations influence the prevalence of AAEP estimates for specific attribute associations. CONCLUSIONS: The effort to detect and store estimates of AAEP is worthwhile because of the increase in confidence fostered by the attribute association level approach to the assessment of uncertainty in patient geocodes, relative to existing geocoding related uncertainty metrics.
format Online
Article
Text
id pubmed-4570180
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45701802015-09-16 Use of attribute association error probability estimates to evaluate quality of medical record geocodes Klaus, Christian A. Carrasco, Luis E. Goldberg, Daniel W. Henry, Kevin A. Sherman, Recinda L. Int J Health Geogr Methodology BACKGROUND: The utility of patient attributes associated with the spatiotemporal analysis of medical records lies not just in their values but also the strength of association between them. Estimating the extent to which a hierarchy of conditional probability exists between patient attribute associations such as patient identifying fields, patient and date of diagnosis, and patient and address at diagnosis is fundamental to estimating the strength of association between patient and geocode, and patient and enumeration area. We propose a hierarchy for the attribute associations within medical records that enable spatiotemporal relationships. We also present a set of metrics that store attribute association error probability (AAEP), to estimate error probability for all attribute associations upon which certainty in a patient geocode depends. METHODS: A series of experiments were undertaken to understand how error estimation could be operationalized within health data and what levels of AAEP in real data reveal themselves using these methods. Specifically, the goals of this evaluation were to (1) assess if the concept of our error assessment techniques could be implemented by a population-based cancer registry; (2) apply the techniques to real data from a large health data agency and characterize the observed levels of AAEP; and (3) demonstrate how detected AAEP might impact spatiotemporal health research. RESULTS: We present an evaluation of AAEP metrics generated for cancer cases in a North Carolina county. We show examples of how we estimated AAEP for selected attribute associations and circumstances. We demonstrate the distribution of AAEP in our case sample across attribute associations, and demonstrate ways in which disease registry specific operations influence the prevalence of AAEP estimates for specific attribute associations. CONCLUSIONS: The effort to detect and store estimates of AAEP is worthwhile because of the increase in confidence fostered by the attribute association level approach to the assessment of uncertainty in patient geocodes, relative to existing geocoding related uncertainty metrics. BioMed Central 2015-09-15 /pmc/articles/PMC4570180/ /pubmed/26370237 http://dx.doi.org/10.1186/s12942-015-0019-3 Text en © Klaus et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Klaus, Christian A.
Carrasco, Luis E.
Goldberg, Daniel W.
Henry, Kevin A.
Sherman, Recinda L.
Use of attribute association error probability estimates to evaluate quality of medical record geocodes
title Use of attribute association error probability estimates to evaluate quality of medical record geocodes
title_full Use of attribute association error probability estimates to evaluate quality of medical record geocodes
title_fullStr Use of attribute association error probability estimates to evaluate quality of medical record geocodes
title_full_unstemmed Use of attribute association error probability estimates to evaluate quality of medical record geocodes
title_short Use of attribute association error probability estimates to evaluate quality of medical record geocodes
title_sort use of attribute association error probability estimates to evaluate quality of medical record geocodes
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4570180/
https://www.ncbi.nlm.nih.gov/pubmed/26370237
http://dx.doi.org/10.1186/s12942-015-0019-3
work_keys_str_mv AT klauschristiana useofattributeassociationerrorprobabilityestimatestoevaluatequalityofmedicalrecordgeocodes
AT carrascoluise useofattributeassociationerrorprobabilityestimatestoevaluatequalityofmedicalrecordgeocodes
AT goldbergdanielw useofattributeassociationerrorprobabilityestimatestoevaluatequalityofmedicalrecordgeocodes
AT henrykevina useofattributeassociationerrorprobabilityestimatestoevaluatequalityofmedicalrecordgeocodes
AT shermanrecindal useofattributeassociationerrorprobabilityestimatestoevaluatequalityofmedicalrecordgeocodes