Cargando…
Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records
BACKGROUND: Patients’ smoking status is routinely collected by General Practitioners (GP) in UK primary health care. There is an abundance of Read codes pertaining to smoking, including those relating to smoking cessation therapy, prescription, and administration codes, in addition to the more regul...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5217540/ https://www.ncbi.nlm.nih.gov/pubmed/28056955 http://dx.doi.org/10.1186/s12911-016-0400-6 |
_version_ | 1782492125880909824 |
---|---|
author | Atkinson, Mark D. Kennedy, Jonathan I. John, Ann Lewis, Keir E. Lyons, Ronan A. Brophy, Sinead T. |
author_facet | Atkinson, Mark D. Kennedy, Jonathan I. John, Ann Lewis, Keir E. Lyons, Ronan A. Brophy, Sinead T. |
author_sort | Atkinson, Mark D. |
collection | PubMed |
description | BACKGROUND: Patients’ smoking status is routinely collected by General Practitioners (GP) in UK primary health care. There is an abundance of Read codes pertaining to smoking, including those relating to smoking cessation therapy, prescription, and administration codes, in addition to the more regularly employed smoking status codes. Large databases of primary care data are increasingly used for epidemiological analysis; smoking status is an important covariate in many such analyses. However, the variable definition is rarely documented in the literature. METHODS: The Secure Anonymised Information Linkage (SAIL) databank is a repository for a national collection of person-based anonymised health and socio-economic administrative data in Wales, UK. An exploration of GP smoking status data from the SAIL databank was carried out to explore the range of codes available and how they could be used in the identification of different categories of smokers, ex-smokers and never smokers. An algorithm was developed which addresses inconsistencies and changes in smoking status recording across the life course and compared with recorded smoking status as recorded in the Welsh Health Survey (WHS), 2013 and 2014 at individual level. However, the WHS could not be regarded as a “gold standard” for validation. RESULTS: There were 6836 individuals in the linked dataset. Missing data were more common in GP records (6%) than in WHS (1.1%). Our algorithm assigns ex-smoker status to 34% of never-smokers, and detects 30% more smokers than are declared in the WHS data. When distinguishing between current smokers and non-smokers, the similarity between the WHS and GP data using the nearest date of comparison was κ = 0.78. When temporal conflicts had been accounted for, the similarity was κ = 0.64, showing the importance of addressing conflicts. CONCLUSIONS: We present an algorithm for the identification of a patient’s smoking status using GP self-reported data. We have included sufficient details to allow others to replicate this work, thus increasing the standards of documentation within this research area and assessment of smoking status in routine data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0400-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5217540 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52175402017-01-09 Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records Atkinson, Mark D. Kennedy, Jonathan I. John, Ann Lewis, Keir E. Lyons, Ronan A. Brophy, Sinead T. BMC Med Inform Decis Mak Research Article BACKGROUND: Patients’ smoking status is routinely collected by General Practitioners (GP) in UK primary health care. There is an abundance of Read codes pertaining to smoking, including those relating to smoking cessation therapy, prescription, and administration codes, in addition to the more regularly employed smoking status codes. Large databases of primary care data are increasingly used for epidemiological analysis; smoking status is an important covariate in many such analyses. However, the variable definition is rarely documented in the literature. METHODS: The Secure Anonymised Information Linkage (SAIL) databank is a repository for a national collection of person-based anonymised health and socio-economic administrative data in Wales, UK. An exploration of GP smoking status data from the SAIL databank was carried out to explore the range of codes available and how they could be used in the identification of different categories of smokers, ex-smokers and never smokers. An algorithm was developed which addresses inconsistencies and changes in smoking status recording across the life course and compared with recorded smoking status as recorded in the Welsh Health Survey (WHS), 2013 and 2014 at individual level. However, the WHS could not be regarded as a “gold standard” for validation. RESULTS: There were 6836 individuals in the linked dataset. Missing data were more common in GP records (6%) than in WHS (1.1%). Our algorithm assigns ex-smoker status to 34% of never-smokers, and detects 30% more smokers than are declared in the WHS data. When distinguishing between current smokers and non-smokers, the similarity between the WHS and GP data using the nearest date of comparison was κ = 0.78. When temporal conflicts had been accounted for, the similarity was κ = 0.64, showing the importance of addressing conflicts. CONCLUSIONS: We present an algorithm for the identification of a patient’s smoking status using GP self-reported data. We have included sufficient details to allow others to replicate this work, thus increasing the standards of documentation within this research area and assessment of smoking status in routine data. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-016-0400-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-05 /pmc/articles/PMC5217540/ /pubmed/28056955 http://dx.doi.org/10.1186/s12911-016-0400-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Atkinson, Mark D. Kennedy, Jonathan I. John, Ann Lewis, Keir E. Lyons, Ronan A. Brophy, Sinead T. Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records |
title | Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records |
title_full | Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records |
title_fullStr | Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records |
title_full_unstemmed | Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records |
title_short | Development of an algorithm for determining smoking status and behaviour over the life course from UK electronic primary care records |
title_sort | development of an algorithm for determining smoking status and behaviour over the life course from uk electronic primary care records |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5217540/ https://www.ncbi.nlm.nih.gov/pubmed/28056955 http://dx.doi.org/10.1186/s12911-016-0400-6 |
work_keys_str_mv | AT atkinsonmarkd developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords AT kennedyjonathani developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords AT johnann developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords AT lewiskeire developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords AT lyonsronana developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords AT brophysineadt developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords AT developmentofanalgorithmfordeterminingsmokingstatusandbehaviouroverthelifecoursefromukelectronicprimarycarerecords |