Extracting and standardizing medication information in clinical text – the MedEx-UIMA system
Extraction of medication information embedded in clinical text is important for research using electronic health records (EHRs). However, most current medication information extraction systems identify drug and signature entities without mapping them to a standard representation. In this study, we...
Main authors: | Jiang, Min; Wu, Yonghui; Shah, Anushi; Priyanka, Priyanka; Denny, Joshua C.; Xu, Hua |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | American Medical Informatics Association, 2014 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4419757/ https://www.ncbi.nlm.nih.gov/pubmed/25954575 |
_version_ | 1782369635694280704 |
---|---|
author | Jiang, Min Wu, Yonghui Shah, Anushi Priyanka, Priyanka Denny, Joshua C. Xu, Hua |
author_facet | Jiang, Min Wu, Yonghui Shah, Anushi Priyanka, Priyanka Denny, Joshua C. Xu, Hua |
author_sort | Jiang, Min |
collection | PubMed |
description | Extraction of medication information embedded in clinical text is important for research using electronic health records (EHRs). However, most current medication information extraction systems identify drug and signature entities without mapping them to a standard representation. In this study, we introduced the open source Java implementation of MedEx, an existing high-performance medication information extraction system, based on the Unstructured Information Management Architecture (UIMA) framework. In addition, we developed new encoding modules in the MedEx-UIMA system, which map an extracted drug name/dose/form to both generalized and specific RxNorm concepts and translate drug frequency information to an ISO standard. We processed 826 documents with both systems and, by comparing their results, verified that MedEx-UIMA and MedEx (the Python version) performed similarly. Using two manually annotated test sets, one containing 300 drug entries from medication lists and the other 300 drug entries from narrative reports, the MedEx-UIMA system achieved F-measures of 98.5% and 97.5%, respectively, for encoding drug names to the corresponding RxNorm generic drug ingredients, and F-measures of 85.4% and 88.1%, respectively, for mapping drug names/dose/form to the most specific RxNorm concepts. It also achieved an F-measure of 90.4% for normalizing frequency information to an ISO standard. The open source MedEx-UIMA system is freely available online at http://code.google.com/p/medex-uima/. |
format | Online Article Text |
id | pubmed-4419757 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | American Medical Informatics Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-4419757 2015-05-07 AMIA Jt Summits Transl Sci Proc Articles American Medical Informatics Association 2014-04-07 /pmc/articles/PMC4419757/ /pubmed/25954575 Text en ©2014 AMIA - All rights reserved. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose |
spellingShingle | Articles Jiang, Min Wu, Yonghui Shah, Anushi Priyanka, Priyanka Denny, Joshua C. Xu, Hua Extracting and standardizing medication information in clinical text – the MedEx-UIMA system |
title | Extracting and standardizing medication information in clinical text – the MedEx-UIMA system |
title_full | Extracting and standardizing medication information in clinical text – the MedEx-UIMA system |
title_fullStr | Extracting and standardizing medication information in clinical text – the MedEx-UIMA system |
title_full_unstemmed | Extracting and standardizing medication information in clinical text – the MedEx-UIMA system |
title_short | Extracting and standardizing medication information in clinical text – the MedEx-UIMA system |
title_sort | extracting and standardizing medication information in clinical text – the medex-uima system |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4419757/ https://www.ncbi.nlm.nih.gov/pubmed/25954575 |