Cargando…

A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids

Background Value sets are lists of terms (e.g., opioid medication names) and their corresponding codes from standard clinical vocabularies (e.g., RxNorm) created with the intent of supporting health information exchange and research. Value sets are manually-created and often exhibit errors. Objecti...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Linyi, Grando, Adela, Sarker, Abeed
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Georg Thieme Verlag KG 2021
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716187/ https://www.ncbi.nlm.nih.gov/pubmed/34965602 http://dx.doi.org/10.1055/s-0041-1740358

_version_	1784624269286703104
author	Li, Linyi Grando, Adela Sarker, Abeed
author_facet	Li, Linyi Grando, Adela Sarker, Abeed
author_sort	Li, Linyi
collection	PubMed
description	Background Value sets are lists of terms (e.g., opioid medication names) and their corresponding codes from standard clinical vocabularies (e.g., RxNorm) created with the intent of supporting health information exchange and research. Value sets are manually-created and often exhibit errors. Objectives The aim of the study is to develop a semi-automatic, data-centric natural language processing (NLP) method to assess medication-related value set correctness and evaluate it on a set of opioid medication value sets. Methods We developed an NLP algorithm that utilizes value sets containing mostly true positives and true negatives to learn lexical patterns associated with the true positives, and then employs these patterns to identify potential errors in unseen value sets. We evaluated the algorithm on a set of opioid medication value sets, using the recall, precision and F (1) -score metrics. We applied the trained model to assess the correctness of unseen opioid value sets based on recall. To replicate the application of the algorithm in real-world settings, a domain expert manually conducted error analysis to identify potential system and value set errors. Results Thirty-eight value sets were retrieved from the Value Set Authority Center, and six (two opioid, four non-opioid) were used to develop and evaluate the system. Average precision, recall, and F (1) -score were 0.932, 0.904, and 0.909, respectively on uncorrected value sets; and 0.958, 0.953, and 0.953, respectively after manual correction of the same value sets. On 20 unseen opioid value sets, the algorithm obtained average recall of 0.89. Error analyses revealed that the main sources of system misclassifications were differences in how opioids were coded in the value sets—while the training value sets had generic names mostly, some of the unseen value sets had new trade names and ingredients. Conclusion The proposed approach is data-centric, reusable, customizable, and not resource intensive. It may help domain experts to easily validate value sets.
format	Online Article Text
id	pubmed-8716187
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Georg Thieme Verlag KG
record_format	MEDLINE/PubMed
spelling	pubmed-87161872021-12-30 A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids Li, Linyi Grando, Adela Sarker, Abeed Methods Inf Med Background Value sets are lists of terms (e.g., opioid medication names) and their corresponding codes from standard clinical vocabularies (e.g., RxNorm) created with the intent of supporting health information exchange and research. Value sets are manually-created and often exhibit errors. Objectives The aim of the study is to develop a semi-automatic, data-centric natural language processing (NLP) method to assess medication-related value set correctness and evaluate it on a set of opioid medication value sets. Methods We developed an NLP algorithm that utilizes value sets containing mostly true positives and true negatives to learn lexical patterns associated with the true positives, and then employs these patterns to identify potential errors in unseen value sets. We evaluated the algorithm on a set of opioid medication value sets, using the recall, precision and F (1) -score metrics. We applied the trained model to assess the correctness of unseen opioid value sets based on recall. To replicate the application of the algorithm in real-world settings, a domain expert manually conducted error analysis to identify potential system and value set errors. Results Thirty-eight value sets were retrieved from the Value Set Authority Center, and six (two opioid, four non-opioid) were used to develop and evaluate the system. Average precision, recall, and F (1) -score were 0.932, 0.904, and 0.909, respectively on uncorrected value sets; and 0.958, 0.953, and 0.953, respectively after manual correction of the same value sets. On 20 unseen opioid value sets, the algorithm obtained average recall of 0.89. Error analyses revealed that the main sources of system misclassifications were differences in how opioids were coded in the value sets—while the training value sets had generic names mostly, some of the unseen value sets had new trade names and ingredients. Conclusion The proposed approach is data-centric, reusable, customizable, and not resource intensive. It may help domain experts to easily validate value sets. Georg Thieme Verlag KG 2021-12-24 /pmc/articles/PMC8716187/ /pubmed/34965602 http://dx.doi.org/10.1055/s-0041-1740358 Text en The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. ( https://creativecommons.org/licenses/by-nc-nd/4.0/ ) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited.
spellingShingle	Li, Linyi Grando, Adela Sarker, Abeed A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids
title	A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids
title_full	A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids
title_fullStr	A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids
title_full_unstemmed	A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids
title_short	A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids
title_sort	data-driven iterative approach for semi-automatically assessing the correctness of medication value sets: a proof of concept based on opioids
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8716187/ https://www.ncbi.nlm.nih.gov/pubmed/34965602 http://dx.doi.org/10.1055/s-0041-1740358
work_keys_str_mv	AT lilinyi adatadriveniterativeapproachforsemiautomaticallyassessingthecorrectnessofmedicationvaluesetsaproofofconceptbasedonopioids AT grandoadela adatadriveniterativeapproachforsemiautomaticallyassessingthecorrectnessofmedicationvaluesetsaproofofconceptbasedonopioids AT sarkerabeed adatadriveniterativeapproachforsemiautomaticallyassessingthecorrectnessofmedicationvaluesetsaproofofconceptbasedonopioids AT lilinyi datadriveniterativeapproachforsemiautomaticallyassessingthecorrectnessofmedicationvaluesetsaproofofconceptbasedonopioids AT grandoadela datadriveniterativeapproachforsemiautomaticallyassessingthecorrectnessofmedicationvaluesetsaproofofconceptbasedonopioids AT sarkerabeed datadriveniterativeapproachforsemiautomaticallyassessingthecorrectnessofmedicationvaluesetsaproofofconceptbasedonopioids

A Data-Driven Iterative Approach for Semi-automatically Assessing the Correctness of Medication Value Sets: A Proof of Concept Based on Opioids

Ejemplares similares