Cargando…

Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records

BACKGROUND: Patients’ smoking status is routinely collected by General Practitioners (GP) in UK primary health care. There is an abundance of Read codes pertaining to smoking, including those relating to smoking cessation therapy, prescription, and administration codes, in addition to the more regul...

Descripción completa

Detalles Bibliográficos
Autores principales: Atkinson, Mark D., Kennedy, Jonathan I., John, Ann, Lewis, Keir E., Lyons, Ronan A., Brophy, Sinead T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5217540/
https://www.ncbi.nlm.nih.gov/pubmed/28056955
http://dx.doi.org/10.1186/s12911-016-0400-6
_version_ 1782492125880909824
author Atkinson, Mark D.
Kennedy, Jonathan I.
John, Ann
Lewis, Keir E.
Lyons, Ronan A.
Brophy, Sinead T.
author_facet Atkinson, Mark D.
Kennedy, Jonathan I.
John, Ann
Lewis, Keir E.
Lyons, Ronan A.
Brophy, Sinead T.
author_sort Atkinson, Mark D.
collection PubMed
description BACKGROUND: Patients’ smoking status is routinely collected by General Practitioners (GP) in UK primary health care. There is an abundance of Read codes pertaining to smoking, including those relating to smoking cessation therapy, prescription, and administration codes, in addition to the more regularly employed smoking status codes. Large databases of primary care data are increasingly used for epidemiological analysis; smoking status is an important covariate in many such analyses. However, the variable definition is rarely documented in the literature. METHODS: The Secure Anonymised Information Linkage (SAIL) databank is a repository for a national collection of person-based anonymised health and socio-economic administrative data in Wales, UK. An exploration of GP smoking status data from the SAIL databank was carried out to explore the range of codes available and how they could be used in the identification of different categories of smokers, ex-smokers and never smokers. An algorithm was developed which addresses inconsistencies and changes in smoking status recording across the life course and compared with recorded smoking status as recorded in the Welsh Health Survey (WHS), 2013 and 2014 at individual level. However, the WHS could not be regarded as a “gold standard” for validation. RESULTS: There were 6836 individuals in the linked dataset. Missing data were more common in GP records (6%) than in WHS (1.1%). Our algorithm assigns ex-smoker status to 34% of never-smokers, and detects 30% more smokers than are declared in the WHS data. When distinguishing between current smokers and non-smokers, the similarity between the WHS and GP data using the nearest date of comparison was κ = 0.78. When temporal conflicts had been accounted for, the similarity was κ = 0.64, showing the importance of addressing conflicts. CONCLUSIONS: We present an algorithm for the identification of a patient’s smoking status using GP self-reported data. We have included sufficient details to allow others to replicate this work, thus increasing the standards of documentation within this research area and assessment of smoking status in routine data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0400-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5217540
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52175402017-01-09 Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records Atkinson, Mark D. Kennedy, Jonathan I. John, Ann Lewis, Keir E. Lyons, Ronan A. Brophy, Sinead T. BMC Med Inform Decis Mak Research Article BACKGROUND: Patients’ smoking status is routinely collected by General Practitioners (GP) in UK primary health care. There is an abundance of Read codes pertaining to smoking, including those relating to smoking cessation therapy, prescription, and administration codes, in addition to the more regularly employed smoking status codes. Large databases of primary care data are increasingly used for epidemiological analysis; smoking status is an important covariate in many such analyses. However, the variable definition is rarely documented in the literature. METHODS: The Secure Anonymised Information Linkage (SAIL) databank is a repository for a national collection of person-based anonymised health and socio-economic administrative data in Wales, UK. An exploration of GP smoking status data from the SAIL databank was carried out to explore the range of codes available and how they could be used in the identification of different categories of smokers, ex-smokers and never smokers. An algorithm was developed which addresses inconsistencies and changes in smoking status recording across the life course and compared with recorded smoking status as recorded in the Welsh Health Survey (WHS), 2013 and 2014 at individual level. However, the WHS could not be regarded as a “gold standard” for validation. RESULTS: There were 6836 individuals in the linked dataset. Missing data were more common in GP records (6%) than in WHS (1.1%). Our algorithm assigns ex-smoker status to 34% of never-smokers, and detects 30% more smokers than are declared in the WHS data. When distinguishing between current smokers and non-smokers, the similarity between the WHS and GP data using the nearest date of comparison was κ = 0.78. When temporal conflicts had been accounted for, the similarity was κ = 0.64, showing the importance of addressing conflicts. CONCLUSIONS: We present an algorithm for the identification of a patient’s smoking status using GP self-reported data. We have included sufficient details to allow others to replicate this work, thus increasing the standards of documentation within this research area and assessment of smoking status in routine data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0400-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-05 /pmc/articles/PMC5217540/ /pubmed/28056955 http://dx.doi.org/10.1186/s12911-016-0400-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Atkinson, Mark D.
Kennedy, Jonathan I.
John, Ann
Lewis, Keir E.
Lyons, Ronan A.
Brophy, Sinead T.
Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records
title Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records
title_full Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records
title_fullStr Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records
title_full_unstemmed Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records
title_short Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records
title_sort development of an algorithm for determining smoking status and behaviour over the life course from uk electronic primary care records
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5217540/
https://www.ncbi.nlm.nih.gov/pubmed/28056955
http://dx.doi.org/10.1186/s12911-016-0400-6
work_keys_str_mv AT atkinsonmarkd developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords
AT kennedyjonathani developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords
AT johnann developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords
AT lewiskeire developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords
AT lyonsronana developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords
AT brophysineadt developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords
AT developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords