Cargando…

Using natural language processing methods to classify use status of dietary supplements in clinical notes

BACKGROUND: Despite widespread use, the safety of dietary supplements is open to doubt due to the fact that they can interact with prescribed medications, leading to dangerous clinical outcomes. Electronic health records (EHRs) provide a potential way for active pharmacovigilance on dietary suppleme...

Descripción completa

Detalles Bibliográficos
Autores principales: Fan, Yadan, Zhang, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6069512/
https://www.ncbi.nlm.nih.gov/pubmed/30066648
http://dx.doi.org/10.1186/s12911-018-0626-6
_version_ 1783343514046693376
author Fan, Yadan
Zhang, Rui
author_facet Fan, Yadan
Zhang, Rui
author_sort Fan, Yadan
collection PubMed
description BACKGROUND: Despite widespread use, the safety of dietary supplements is open to doubt due to the fact that they can interact with prescribed medications, leading to dangerous clinical outcomes. Electronic health records (EHRs) provide a potential way for active pharmacovigilance on dietary supplements since a fair amount of dietary supplement information, especially those on use status, can be found in clinical notes. Extracting such information is extremely significant for subsequent supplement safety research. METHODS: In this study, we collected 2500 sentences for 25 commonly used dietary supplements and annotated into four classes: Continuing (C), Discontinued (D), Started (S) and Unclassified (U). Both rule-based and machine learning-based classifiers were developed on the same training set and evaluated using the hold-out test set. The performances of the two classifiers were also compared. RESULTS: The rule-based classifier achieved F-measure of 0.90, 0.85, 0.90, and 0.86 in C, D, S, and U status, respectively. The optimal machine learning-based classifier (Maximum Entropy) achieved F-measure of 0.90, 0.92, 0.91 and 0.88 in C, D, S, and U status, respectively. The comparison result shows that the machine learning-based classifier has a better performance, which is more efficient and scalable especially when the sample size doubles. CONCLUSIONS: Machine learning-based classifier outperforms rule-based classifier in categorization of the use status of dietary supplements in clinical notes. Future work includes applying deep learning methods and developing a hybrid system to approach use status classification task.
format Online
Article
Text
id pubmed-6069512
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60695122018-08-03 Using natural language processing methods to classify use status of dietary supplements in clinical notes Fan, Yadan Zhang, Rui BMC Med Inform Decis Mak Research BACKGROUND: Despite widespread use, the safety of dietary supplements is open to doubt due to the fact that they can interact with prescribed medications, leading to dangerous clinical outcomes. Electronic health records (EHRs) provide a potential way for active pharmacovigilance on dietary supplements since a fair amount of dietary supplement information, especially those on use status, can be found in clinical notes. Extracting such information is extremely significant for subsequent supplement safety research. METHODS: In this study, we collected 2500 sentences for 25 commonly used dietary supplements and annotated into four classes: Continuing (C), Discontinued (D), Started (S) and Unclassified (U). Both rule-based and machine learning-based classifiers were developed on the same training set and evaluated using the hold-out test set. The performances of the two classifiers were also compared. RESULTS: The rule-based classifier achieved F-measure of 0.90, 0.85, 0.90, and 0.86 in C, D, S, and U status, respectively. The optimal machine learning-based classifier (Maximum Entropy) achieved F-measure of 0.90, 0.92, 0.91 and 0.88 in C, D, S, and U status, respectively. The comparison result shows that the machine learning-based classifier has a better performance, which is more efficient and scalable especially when the sample size doubles. CONCLUSIONS: Machine learning-based classifier outperforms rule-based classifier in categorization of the use status of dietary supplements in clinical notes. Future work includes applying deep learning methods and developing a hybrid system to approach use status classification task. BioMed Central 2018-07-23 /pmc/articles/PMC6069512/ /pubmed/30066648 http://dx.doi.org/10.1186/s12911-018-0626-6 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Fan, Yadan
Zhang, Rui
Using natural language processing methods to classify use status of dietary supplements in clinical notes
title Using natural language processing methods to classify use status of dietary supplements in clinical notes
title_full Using natural language processing methods to classify use status of dietary supplements in clinical notes
title_fullStr Using natural language processing methods to classify use status of dietary supplements in clinical notes
title_full_unstemmed Using natural language processing methods to classify use status of dietary supplements in clinical notes
title_short Using natural language processing methods to classify use status of dietary supplements in clinical notes
title_sort using natural language processing methods to classify use status of dietary supplements in clinical notes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6069512/
https://www.ncbi.nlm.nih.gov/pubmed/30066648
http://dx.doi.org/10.1186/s12911-018-0626-6
work_keys_str_mv AT fanyadan usingnaturallanguageprocessingmethodstoclassifyusestatusofdietarysupplementsinclinicalnotes
AT zhangrui usingnaturallanguageprocessingmethodstoclassifyusestatusofdietarysupplementsinclinicalnotes