Cargando…

Identifying clinical features in primary care electronic health record studies: methods for codelist development

OBJECTIVE: Analysis of routinely collected electronic health record (EHR) data from primary care is reliant on the creation of codelists to define clinical features of interest. To improve scientific rigour, transparency and replicability, we describe and demonstrate a standardised reproducible meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Watson, Jessica, Nicholson, Brian D, Hamilton, Willie, Price, Sarah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5719324/
https://www.ncbi.nlm.nih.gov/pubmed/29170293
http://dx.doi.org/10.1136/bmjopen-2017-019637
_version_ 1783284468108230656
author Watson, Jessica
Nicholson, Brian D
Hamilton, Willie
Price, Sarah
author_facet Watson, Jessica
Nicholson, Brian D
Hamilton, Willie
Price, Sarah
author_sort Watson, Jessica
collection PubMed
description OBJECTIVE: Analysis of routinely collected electronic health record (EHR) data from primary care is reliant on the creation of codelists to define clinical features of interest. To improve scientific rigour, transparency and replicability, we describe and demonstrate a standardised reproducible methodology for clinical codelist development. DESIGN: We describe a three-stage process for developing clinical codelists. First, the clear definition a priori of the clinical feature of interest using reliable clinical resources. Second, development of a list of potential codes using statistical software to comprehensively search all available codes. Third, a modified Delphi process to reach consensus between primary care practitioners on the most relevant codes, including the generation of an ‘uncertainty’ variable to allow sensitivity analysis. SETTING: These methods are illustrated by developing a codelist for shortness of breath in a primary care EHR sample, including modifiable syntax for commonly used statistical software. PARTICIPANTS: The codelist was used to estimate the frequency of shortness of breath in a cohort of 28 216 patients aged over 18 years who received an incident diagnosis of lung cancer between 1 January 2000 and 30 November 2016 in the Clinical Practice Research Datalink (CPRD). RESULTS: Of 78 candidate codes, 29 were excluded as inappropriate. Complete agreement was reached for 44 (90%) of the remaining codes, with partial disagreement over 5 (10%). 13 091 episodes of shortness of breath were identified in the cohort of 28 216 patients. Sensitivity analysis demonstrates that codes with the greatest uncertainty tend to be rarely used in clinical practice. CONCLUSIONS: Although initially time consuming, using a rigorous and reproducible method for codelist generation ‘future-proofs’ findings and an auditable, modifiable syntax for codelist generation enables sharing and replication of EHR studies. Published codelists should be badged by quality and report the methods of codelist generation including: definitions and justifications associated with each codelist; the syntax or search method; the number of candidate codes identified; and the categorisation of codes after Delphi review.
format Online
Article
Text
id pubmed-5719324
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-57193242017-12-08 Identifying clinical features in primary care electronic health record studies: methods for codelist development Watson, Jessica Nicholson, Brian D Hamilton, Willie Price, Sarah BMJ Open Health Services Research OBJECTIVE: Analysis of routinely collected electronic health record (EHR) data from primary care is reliant on the creation of codelists to define clinical features of interest. To improve scientific rigour, transparency and replicability, we describe and demonstrate a standardised reproducible methodology for clinical codelist development. DESIGN: We describe a three-stage process for developing clinical codelists. First, the clear definition a priori of the clinical feature of interest using reliable clinical resources. Second, development of a list of potential codes using statistical software to comprehensively search all available codes. Third, a modified Delphi process to reach consensus between primary care practitioners on the most relevant codes, including the generation of an ‘uncertainty’ variable to allow sensitivity analysis. SETTING: These methods are illustrated by developing a codelist for shortness of breath in a primary care EHR sample, including modifiable syntax for commonly used statistical software. PARTICIPANTS: The codelist was used to estimate the frequency of shortness of breath in a cohort of 28 216 patients aged over 18 years who received an incident diagnosis of lung cancer between 1 January 2000 and 30 November 2016 in the Clinical Practice Research Datalink (CPRD). RESULTS: Of 78 candidate codes, 29 were excluded as inappropriate. Complete agreement was reached for 44 (90%) of the remaining codes, with partial disagreement over 5 (10%). 13 091 episodes of shortness of breath were identified in the cohort of 28 216 patients. Sensitivity analysis demonstrates that codes with the greatest uncertainty tend to be rarely used in clinical practice. CONCLUSIONS: Although initially time consuming, using a rigorous and reproducible method for codelist generation ‘future-proofs’ findings and an auditable, modifiable syntax for codelist generation enables sharing and replication of EHR studies. Published codelists should be badged by quality and report the methods of codelist generation including: definitions and justifications associated with each codelist; the syntax or search method; the number of candidate codes identified; and the categorisation of codes after Delphi review. BMJ Publishing Group 2017-11-22 /pmc/articles/PMC5719324/ /pubmed/29170293 http://dx.doi.org/10.1136/bmjopen-2017-019637 Text en © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted. This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY 4.0) license, which permits others to distribute, remix, adapt and build upon this work, for commercial use, provided the original work is properly cited. See: http://creativecommons.org/licenses/by/4.0/
spellingShingle Health Services Research
Watson, Jessica
Nicholson, Brian D
Hamilton, Willie
Price, Sarah
Identifying clinical features in primary care electronic health record studies: methods for codelist development
title Identifying clinical features in primary care electronic health record studies: methods for codelist development
title_full Identifying clinical features in primary care electronic health record studies: methods for codelist development
title_fullStr Identifying clinical features in primary care electronic health record studies: methods for codelist development
title_full_unstemmed Identifying clinical features in primary care electronic health record studies: methods for codelist development
title_short Identifying clinical features in primary care electronic health record studies: methods for codelist development
title_sort identifying clinical features in primary care electronic health record studies: methods for codelist development
topic Health Services Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5719324/
https://www.ncbi.nlm.nih.gov/pubmed/29170293
http://dx.doi.org/10.1136/bmjopen-2017-019637
work_keys_str_mv AT watsonjessica identifyingclinicalfeaturesinprimarycareelectronichealthrecordstudiesmethodsforcodelistdevelopment
AT nicholsonbriand identifyingclinicalfeaturesinprimarycareelectronichealthrecordstudiesmethodsforcodelistdevelopment
AT hamiltonwillie identifyingclinicalfeaturesinprimarycareelectronichealthrecordstudiesmethodsforcodelistdevelopment
AT pricesarah identifyingclinicalfeaturesinprimarycareelectronichealthrecordstudiesmethodsforcodelistdevelopment