Cargando…

Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources

OBJECTIVES: Develop a novel algorithm to categorise alcohol consumption using primary care electronic health records (EHRs) and asses its reliability by comparing this classification with self-reported alcohol consumption data obtained from the UK Biobank (UKB) cohort. DESIGN: Cross-sectional study....

Descripción completa

Detalles Bibliográficos
Autores principales: Fraile-Navarro, David, Azcoaga-Lorenzo, Amaya, Agrawal, Utkarsh, Jani, Bhautesh, Fagbamigbe, Adeniyi, Currie, Dorothy, Baldacchino, Alexander, Sullivan, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8808438/
https://www.ncbi.nlm.nih.gov/pubmed/35105585
http://dx.doi.org/10.1136/bmjopen-2021-054376
_version_ 1784643887412805632
author Fraile-Navarro, David
Azcoaga-Lorenzo, Amaya
Agrawal, Utkarsh
Jani, Bhautesh
Fagbamigbe, Adeniyi
Currie, Dorothy
Baldacchino, Alexander
Sullivan, Frank
author_facet Fraile-Navarro, David
Azcoaga-Lorenzo, Amaya
Agrawal, Utkarsh
Jani, Bhautesh
Fagbamigbe, Adeniyi
Currie, Dorothy
Baldacchino, Alexander
Sullivan, Frank
author_sort Fraile-Navarro, David
collection PubMed
description OBJECTIVES: Develop a novel algorithm to categorise alcohol consumption using primary care electronic health records (EHRs) and asses its reliability by comparing this classification with self-reported alcohol consumption data obtained from the UK Biobank (UKB) cohort. DESIGN: Cross-sectional study. SETTING: The UKB, a population-based cohort with participants aged between 40 and 69 years recruited across the UK between 2006 and 2010. PARTICIPANTS: UKB participants from Scotland with linked primary care data. PRIMARY AND SECONDARY OUTCOME MEASURES: Create a rule-based multiclass algorithm to classify alcohol consumption reported by Scottish UKB participants and compare it with their classification using data present in primary care EHRs based on Read Codes. We evaluated agreement metrics (simple agreement and kappa statistic). RESULTS: Among the Scottish UKB participants, 18 838 (69%) had at least one Read Code related to alcohol consumption and were used in the classification. The agreement of alcohol consumption categories between UKB and primary care data, including assessments within 5 years was 59.6%, and kappa was 0.23 (95% CI 0.21 to 0.24). Differences in classification between the two sources were statistically significant (p<0.001); More individuals were classified as ‘sensible drinkers’ and in lower alcohol consumption levels in primary care records compared with the UKB. Agreement improved slightly when using only numerical values (k=0.29; 95% CI 0.27 to 0.31) and decreased when using qualitative descriptors only (k=0.18;95% CI 0.16 to 0.20). CONCLUSION: Our algorithm classifies alcohol consumption recorded in Primary Care EHRs into discrete meaningful categories. These results suggest that alcohol consumption may be underestimated in primary care EHRs. Using numerical values (alcohol units) may improve classification when compared with qualitative descriptors.
format Online
Article
Text
id pubmed-8808438
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-88084382022-02-09 Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources Fraile-Navarro, David Azcoaga-Lorenzo, Amaya Agrawal, Utkarsh Jani, Bhautesh Fagbamigbe, Adeniyi Currie, Dorothy Baldacchino, Alexander Sullivan, Frank BMJ Open Health Informatics OBJECTIVES: Develop a novel algorithm to categorise alcohol consumption using primary care electronic health records (EHRs) and asses its reliability by comparing this classification with self-reported alcohol consumption data obtained from the UK Biobank (UKB) cohort. DESIGN: Cross-sectional study. SETTING: The UKB, a population-based cohort with participants aged between 40 and 69 years recruited across the UK between 2006 and 2010. PARTICIPANTS: UKB participants from Scotland with linked primary care data. PRIMARY AND SECONDARY OUTCOME MEASURES: Create a rule-based multiclass algorithm to classify alcohol consumption reported by Scottish UKB participants and compare it with their classification using data present in primary care EHRs based on Read Codes. We evaluated agreement metrics (simple agreement and kappa statistic). RESULTS: Among the Scottish UKB participants, 18 838 (69%) had at least one Read Code related to alcohol consumption and were used in the classification. The agreement of alcohol consumption categories between UKB and primary care data, including assessments within 5 years was 59.6%, and kappa was 0.23 (95% CI 0.21 to 0.24). Differences in classification between the two sources were statistically significant (p<0.001); More individuals were classified as ‘sensible drinkers’ and in lower alcohol consumption levels in primary care records compared with the UKB. Agreement improved slightly when using only numerical values (k=0.29; 95% CI 0.27 to 0.31) and decreased when using qualitative descriptors only (k=0.18;95% CI 0.16 to 0.20). CONCLUSION: Our algorithm classifies alcohol consumption recorded in Primary Care EHRs into discrete meaningful categories. These results suggest that alcohol consumption may be underestimated in primary care EHRs. Using numerical values (alcohol units) may improve classification when compared with qualitative descriptors. BMJ Publishing Group 2022-02-01 /pmc/articles/PMC8808438/ /pubmed/35105585 http://dx.doi.org/10.1136/bmjopen-2021-054376 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Health Informatics
Fraile-Navarro, David
Azcoaga-Lorenzo, Amaya
Agrawal, Utkarsh
Jani, Bhautesh
Fagbamigbe, Adeniyi
Currie, Dorothy
Baldacchino, Alexander
Sullivan, Frank
Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources
title Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources
title_full Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources
title_fullStr Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources
title_full_unstemmed Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources
title_short Development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from UK Biobank and primary care electronic health data sources
title_sort development of an algorithm to classify primary care electronic health records of alcohol consumption: experience using data linkage from uk biobank and primary care electronic health data sources
topic Health Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8808438/
https://www.ncbi.nlm.nih.gov/pubmed/35105585
http://dx.doi.org/10.1136/bmjopen-2021-054376
work_keys_str_mv AT frailenavarrodavid developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources
AT azcoagalorenzoamaya developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources
AT agrawalutkarsh developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources
AT janibhautesh developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources
AT fagbamigbeadeniyi developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources
AT curriedorothy developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources
AT baldacchinoalexander developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources
AT sullivanfrank developmentofanalgorithmtoclassifyprimarycareelectronichealthrecordsofalcoholconsumptionexperienceusingdatalinkagefromukbiobankandprimarycareelectronichealthdatasources