Cargando…

Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care

BACKGROUND: Most antibiotics dispensed by community pharmacies in Canada are prescribed by family physicians, but using the prescribing information contained within primary care electronic medical records (EMRs) for secondary purposes can be challenging owing to variable data quality. We used antibi...

Descripción completa

Detalles Bibliográficos
Autores principales: Garies, Stephanie, Taylor, Matt, Soos, Boglarka, Lindeman, Cliff, Drummond, Neil, Pham, Anh, Aponte-Hao, Zhi, Williamson, Tyler
Formato: Online Artículo Texto
Lenguaje:English
Publicado: CMA Impact Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10620009/
https://www.ncbi.nlm.nih.gov/pubmed/37907215
http://dx.doi.org/10.9778/cmajo.20220235
_version_ 1785130115790798848
author Garies, Stephanie
Taylor, Matt
Soos, Boglarka
Lindeman, Cliff
Drummond, Neil
Pham, Anh
Aponte-Hao, Zhi
Williamson, Tyler
author_facet Garies, Stephanie
Taylor, Matt
Soos, Boglarka
Lindeman, Cliff
Drummond, Neil
Pham, Anh
Aponte-Hao, Zhi
Williamson, Tyler
author_sort Garies, Stephanie
collection PubMed
description BACKGROUND: Most antibiotics dispensed by community pharmacies in Canada are prescribed by family physicians, but using the prescribing information contained within primary care electronic medical records (EMRs) for secondary purposes can be challenging owing to variable data quality. We used antibiotic medications as an exemplar to validate a machine-learning approach for cleaning and coding medication data in a pan-Canadian primary care EMR database. METHODS: The Canadian Primary Care Sentinel Surveillance Network database contained an estimated 42 million medication records, which we mapped to an Anatomic Therapeutic Chemical (ATC) code by applying a semisupervised classification model developed using reference standard labels derived from the Health Canada Drug Product Database. We validated the resulting ATC codes in a subset of antibiotic records (16 119 unique strings) to determine whether the algorithm correctly classified the medication according to manual review of the original medication record. RESULTS: In the antibiotic subset, the algorithm showed high validity (sensitivity 99.5%, specificity 92.4%, positive predictive value 98.6%, negative predictive value 97.0%) in classifying whether the medication was an antibiotic. INTERPRETATION: Our machine-learning algorithm classified unstructured antibiotic medication data from primary care with a high degree of accuracy. Access to cleaned EMR data can support important secondary uses, including community-based antibiotic prescribing surveillance and practice improvement.
format Online
Article
Text
id pubmed-10620009
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher CMA Impact Inc.
record_format MEDLINE/PubMed
spelling pubmed-106200092023-11-02 Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care Garies, Stephanie Taylor, Matt Soos, Boglarka Lindeman, Cliff Drummond, Neil Pham, Anh Aponte-Hao, Zhi Williamson, Tyler CMAJ Open Research BACKGROUND: Most antibiotics dispensed by community pharmacies in Canada are prescribed by family physicians, but using the prescribing information contained within primary care electronic medical records (EMRs) for secondary purposes can be challenging owing to variable data quality. We used antibiotic medications as an exemplar to validate a machine-learning approach for cleaning and coding medication data in a pan-Canadian primary care EMR database. METHODS: The Canadian Primary Care Sentinel Surveillance Network database contained an estimated 42 million medication records, which we mapped to an Anatomic Therapeutic Chemical (ATC) code by applying a semisupervised classification model developed using reference standard labels derived from the Health Canada Drug Product Database. We validated the resulting ATC codes in a subset of antibiotic records (16 119 unique strings) to determine whether the algorithm correctly classified the medication according to manual review of the original medication record. RESULTS: In the antibiotic subset, the algorithm showed high validity (sensitivity 99.5%, specificity 92.4%, positive predictive value 98.6%, negative predictive value 97.0%) in classifying whether the medication was an antibiotic. INTERPRETATION: Our machine-learning algorithm classified unstructured antibiotic medication data from primary care with a high degree of accuracy. Access to cleaned EMR data can support important secondary uses, including community-based antibiotic prescribing surveillance and practice improvement. CMA Impact Inc. 2023-10-31 /pmc/articles/PMC10620009/ /pubmed/37907215 http://dx.doi.org/10.9778/cmajo.20220235 Text en © 2023 CMA Impact Inc. or its licensors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed in accordance with the terms of the Creative Commons Attribution (CC BY-NC-ND 4.0) licence, which permits use, distribution and reproduction in any medium, provided that the original publication is properly cited, the use is noncommercial (i.e., research or educational use), and no modifications or adaptations are made. See: https://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle Research
Garies, Stephanie
Taylor, Matt
Soos, Boglarka
Lindeman, Cliff
Drummond, Neil
Pham, Anh
Aponte-Hao, Zhi
Williamson, Tyler
Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care
title Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care
title_full Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care
title_fullStr Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care
title_full_unstemmed Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care
title_short Using machine learning to standardize medication records in a pan-Canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care
title_sort using machine learning to standardize medication records in a pan-canadian electronic medical record database: a data-driven algorithm study focused on antibiotics prescribed in primary care
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10620009/
https://www.ncbi.nlm.nih.gov/pubmed/37907215
http://dx.doi.org/10.9778/cmajo.20220235
work_keys_str_mv AT gariesstephanie usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare
AT taylormatt usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare
AT soosboglarka usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare
AT lindemancliff usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare
AT drummondneil usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare
AT phamanh usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare
AT apontehaozhi usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare
AT williamsontyler usingmachinelearningtostandardizemedicationrecordsinapancanadianelectronicmedicalrecorddatabaseadatadrivenalgorithmstudyfocusedonantibioticsprescribedinprimarycare