Cargando…

Secondary Use of EHR: Data Quality Issues and Informatics Opportunities

Given the large-scale deployment of Electronic Health Records (EHR), secondary use of EHR data will be increasingly needed in all kinds of health services or clinical research. This paper reports some data quality issues we encountered in a survival analysis of pancreatic cancer patients. Using the...

Descripción completa

Detalles Bibliográficos
Autores principales: Botsis, Taxiarchis, Hartvigsen, Gunnar, Chen, Fei, Weng, Chunhua
Formato: Texto
Lenguaje:English
Publicado: American Medical Informatics Association 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041534/
https://www.ncbi.nlm.nih.gov/pubmed/21347133
_version_ 1782198441099657216
author Botsis, Taxiarchis
Hartvigsen, Gunnar
Chen, Fei
Weng, Chunhua
author_facet Botsis, Taxiarchis
Hartvigsen, Gunnar
Chen, Fei
Weng, Chunhua
author_sort Botsis, Taxiarchis
collection PubMed
description Given the large-scale deployment of Electronic Health Records (EHR), secondary use of EHR data will be increasingly needed in all kinds of health services or clinical research. This paper reports some data quality issues we encountered in a survival analysis of pancreatic cancer patients. Using the clinical data warehouse at Columbia University Medical Center in the City of New York, we mined EHR data elements collected between 1999 and 2009 for a cohort of pancreatic cancer patients. Of the 3068 patients who had ICD-9-CM diagnoses for pancreatic cancer, only 1589 had corresponding disease documentation in pathology reports. Incompleteness was the leading data quality issue; many study variables had missing values to various degrees. Inaccuracy and inconsistency were the next common problems. In this paper, we present the manifestations of these data quality issues and discuss some strategies for using emerging informatics technologies to solve these problems.
format Text
id pubmed-3041534
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher American Medical Informatics Association
record_format MEDLINE/PubMed
spelling pubmed-30415342011-02-23 Secondary Use of EHR: Data Quality Issues and Informatics Opportunities Botsis, Taxiarchis Hartvigsen, Gunnar Chen, Fei Weng, Chunhua Summit on Translat Bioinforma Articles Given the large-scale deployment of Electronic Health Records (EHR), secondary use of EHR data will be increasingly needed in all kinds of health services or clinical research. This paper reports some data quality issues we encountered in a survival analysis of pancreatic cancer patients. Using the clinical data warehouse at Columbia University Medical Center in the City of New York, we mined EHR data elements collected between 1999 and 2009 for a cohort of pancreatic cancer patients. Of the 3068 patients who had ICD-9-CM diagnoses for pancreatic cancer, only 1589 had corresponding disease documentation in pathology reports. Incompleteness was the leading data quality issue; many study variables had missing values to various degrees. Inaccuracy and inconsistency were the next common problems. In this paper, we present the manifestations of these data quality issues and discuss some strategies for using emerging informatics technologies to solve these problems. American Medical Informatics Association 2010-03-01 /pmc/articles/PMC3041534/ /pubmed/21347133 Text en ©2010 AMIA - All rights reserved. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose
spellingShingle Articles
Botsis, Taxiarchis
Hartvigsen, Gunnar
Chen, Fei
Weng, Chunhua
Secondary Use of EHR: Data Quality Issues and Informatics Opportunities
title Secondary Use of EHR: Data Quality Issues and Informatics Opportunities
title_full Secondary Use of EHR: Data Quality Issues and Informatics Opportunities
title_fullStr Secondary Use of EHR: Data Quality Issues and Informatics Opportunities
title_full_unstemmed Secondary Use of EHR: Data Quality Issues and Informatics Opportunities
title_short Secondary Use of EHR: Data Quality Issues and Informatics Opportunities
title_sort secondary use of ehr: data quality issues and informatics opportunities
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041534/
https://www.ncbi.nlm.nih.gov/pubmed/21347133
work_keys_str_mv AT botsistaxiarchis secondaryuseofehrdataqualityissuesandinformaticsopportunities
AT hartvigsengunnar secondaryuseofehrdataqualityissuesandinformaticsopportunities
AT chenfei secondaryuseofehrdataqualityissuesandinformaticsopportunities
AT wengchunhua secondaryuseofehrdataqualityissuesandinformaticsopportunities