Cargando…

Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study

OBJECTIVE: Data errors are a well-documented part of clinical datasets as is their potential to confound downstream analysis. In this study, we explore the reliability of manually transcribed data across different pathology fields in a prostate cancer database and also measure error rates attributab...

Descripción completa

Detalles Bibliográficos
Autores principales: Hong, Matthew K H, Yao, Henry H I, Pedersen, John S, Peters, Justin S, Costello, Anthony J, Murphy, Declan G, Hovens, Christopher M, Corcoran, Niall M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3657671/
https://www.ncbi.nlm.nih.gov/pubmed/23793682
http://dx.doi.org/10.1136/bmjopen-2012-002406
_version_ 1782270161778114560
author Hong, Matthew K H
Yao, Henry H I
Pedersen, John S
Peters, Justin S
Costello, Anthony J
Murphy, Declan G
Hovens, Christopher M
Corcoran, Niall M
author_facet Hong, Matthew K H
Yao, Henry H I
Pedersen, John S
Peters, Justin S
Costello, Anthony J
Murphy, Declan G
Hovens, Christopher M
Corcoran, Niall M
author_sort Hong, Matthew K H
collection PubMed
description OBJECTIVE: Data errors are a well-documented part of clinical datasets as is their potential to confound downstream analysis. In this study, we explore the reliability of manually transcribed data across different pathology fields in a prostate cancer database and also measure error rates attributable to the source data. DESIGN: Descriptive study. SETTING: Specialist urology service at a single centre in metropolitan Victoria in Australia. PARTICIPANTS: Between 2004 and 2011, 1471 patients underwent radical prostatectomy at our institution. In a large proportion of these cases, clinicopathological variables were recorded by manual data-entry. In 2011, we obtained electronic versions of the same printed pathology reports for our cohort. The data were electronically imported in parallel to any existing manual entry record enabling direct comparison between them. OUTCOME MEASURES: Error rates of manually entered data compared with electronically imported data across clinicopathological fields. RESULTS: 421 patients had at least 10 comparable pathology fields between the electronic import and manual records and were selected for study. 320 patients had concordant data between manually entered and electronically populated fields in a median of 12 pathology fields (range 10–13), indicating an outright accuracy in manually entered pathology data in 76% of patients. Across all fields, the error rate was 2.8%, while individual field error ranges from 0.5% to 6.4%. Fields in text formats were significantly more error-prone than those with direct measurements or involving numerical figures (p<0.001). 971 cases were available for review of error within the source data, with figures of 0.1–0.9%. CONCLUSIONS: While the overall rate of error was low in manually entered data, individual pathology fields were variably prone to error. High-quality pathology data can be obtained for both prospective and retrospective parts of our data repository and the electronic checking of source pathology data for error is feasible.
format Online
Article
Text
id pubmed-3657671
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-36576712013-05-21 Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study Hong, Matthew K H Yao, Henry H I Pedersen, John S Peters, Justin S Costello, Anthony J Murphy, Declan G Hovens, Christopher M Corcoran, Niall M BMJ Open Health Informatics OBJECTIVE: Data errors are a well-documented part of clinical datasets as is their potential to confound downstream analysis. In this study, we explore the reliability of manually transcribed data across different pathology fields in a prostate cancer database and also measure error rates attributable to the source data. DESIGN: Descriptive study. SETTING: Specialist urology service at a single centre in metropolitan Victoria in Australia. PARTICIPANTS: Between 2004 and 2011, 1471 patients underwent radical prostatectomy at our institution. In a large proportion of these cases, clinicopathological variables were recorded by manual data-entry. In 2011, we obtained electronic versions of the same printed pathology reports for our cohort. The data were electronically imported in parallel to any existing manual entry record enabling direct comparison between them. OUTCOME MEASURES: Error rates of manually entered data compared with electronically imported data across clinicopathological fields. RESULTS: 421 patients had at least 10 comparable pathology fields between the electronic import and manual records and were selected for study. 320 patients had concordant data between manually entered and electronically populated fields in a median of 12 pathology fields (range 10–13), indicating an outright accuracy in manually entered pathology data in 76% of patients. Across all fields, the error rate was 2.8%, while individual field error ranges from 0.5% to 6.4%. Fields in text formats were significantly more error-prone than those with direct measurements or involving numerical figures (p<0.001). 971 cases were available for review of error within the source data, with figures of 0.1–0.9%. CONCLUSIONS: While the overall rate of error was low in manually entered data, individual pathology fields were variably prone to error. High-quality pathology data can be obtained for both prospective and retrospective parts of our data repository and the electronic checking of source pathology data for error is feasible. BMJ Publishing Group 2013-05-17 /pmc/articles/PMC3657671/ /pubmed/23793682 http://dx.doi.org/10.1136/bmjopen-2012-002406 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/3.0/ and http://creativecommons.org/licenses/by-nc/3.0/legalcode
spellingShingle Health Informatics
Hong, Matthew K H
Yao, Henry H I
Pedersen, John S
Peters, Justin S
Costello, Anthony J
Murphy, Declan G
Hovens, Christopher M
Corcoran, Niall M
Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study
title Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study
title_full Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study
title_fullStr Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study
title_full_unstemmed Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study
title_short Error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study
title_sort error rates in a clinical data repository: lessons from the transition to electronic data transfer—a descriptive study
topic Health Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3657671/
https://www.ncbi.nlm.nih.gov/pubmed/23793682
http://dx.doi.org/10.1136/bmjopen-2012-002406
work_keys_str_mv AT hongmatthewkh errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy
AT yaohenryhi errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy
AT pedersenjohns errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy
AT petersjustins errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy
AT costelloanthonyj errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy
AT murphydeclang errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy
AT hovenschristopherm errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy
AT corcoranniallm errorratesinaclinicaldatarepositorylessonsfromthetransitiontoelectronicdatatransferadescriptivestudy