Cargando…
DLI-IT: a deep learning approach to drug label identification through image and text embedding
BACKGROUND: Drug label, or packaging insert play a significant role in all the operations from production through drug distribution channels to the end consumer. Image of the label also called Display Panel or label could be used to identify illegal, illicit, unapproved and potentially dangerous dru...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7158001/ https://www.ncbi.nlm.nih.gov/pubmed/32293428 http://dx.doi.org/10.1186/s12911-020-1078-3 |
_version_ | 1783522448902193152 |
---|---|
author | Liu, Xiangwen Meehan, Joe Tong, Weida Wu, Leihong Xu, Xiaowei Xu, Joshua |
author_facet | Liu, Xiangwen Meehan, Joe Tong, Weida Wu, Leihong Xu, Xiaowei Xu, Joshua |
author_sort | Liu, Xiangwen |
collection | PubMed |
description | BACKGROUND: Drug label, or packaging insert play a significant role in all the operations from production through drug distribution channels to the end consumer. Image of the label also called Display Panel or label could be used to identify illegal, illicit, unapproved and potentially dangerous drugs. Due to the time-consuming process and high labor cost of investigation, an artificial intelligence-based deep learning model is necessary for fast and accurate identification of the drugs. METHODS: In addition to image-based identification technology, we take advantages of rich text information on the pharmaceutical package insert of drug label images. In this study, we developed the Drug Label Identification through Image and Text embedding model (DLI-IT) to model text-based patterns of historical data for detection of suspicious drugs. In DLI-IT, we first trained a Connectionist Text Proposal Network (CTPN) to crop the raw image into sub-images based on the text. The texts from the cropped sub-images are recognized independently through the Tesseract OCR Engine and combined as one document for each raw image. Finally, we applied universal sentence embedding to transform these documents into vectors and find the most similar reference images to the test image through the cosine similarity. RESULTS: We trained the DLI-IT model on 1749 opioid and 2365 non-opioid drug label images. The model was then tested on 300 external opioid drug label images, the result demonstrated our model achieves up-to 88% of the precision in drug label identification, which outperforms previous image-based or text-based identification method by up-to 35% improvement. CONCLUSION: To conclude, by combining Image and Text embedding analysis under deep learning framework, our DLI-IT approach achieved a competitive performance in advancing drug label identification. |
format | Online Article Text |
id | pubmed-7158001 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-71580012020-04-20 DLI-IT: a deep learning approach to drug label identification through image and text embedding Liu, Xiangwen Meehan, Joe Tong, Weida Wu, Leihong Xu, Xiaowei Xu, Joshua BMC Med Inform Decis Mak Research Article BACKGROUND: Drug label, or packaging insert play a significant role in all the operations from production through drug distribution channels to the end consumer. Image of the label also called Display Panel or label could be used to identify illegal, illicit, unapproved and potentially dangerous drugs. Due to the time-consuming process and high labor cost of investigation, an artificial intelligence-based deep learning model is necessary for fast and accurate identification of the drugs. METHODS: In addition to image-based identification technology, we take advantages of rich text information on the pharmaceutical package insert of drug label images. In this study, we developed the Drug Label Identification through Image and Text embedding model (DLI-IT) to model text-based patterns of historical data for detection of suspicious drugs. In DLI-IT, we first trained a Connectionist Text Proposal Network (CTPN) to crop the raw image into sub-images based on the text. The texts from the cropped sub-images are recognized independently through the Tesseract OCR Engine and combined as one document for each raw image. Finally, we applied universal sentence embedding to transform these documents into vectors and find the most similar reference images to the test image through the cosine similarity. RESULTS: We trained the DLI-IT model on 1749 opioid and 2365 non-opioid drug label images. The model was then tested on 300 external opioid drug label images, the result demonstrated our model achieves up-to 88% of the precision in drug label identification, which outperforms previous image-based or text-based identification method by up-to 35% improvement. CONCLUSION: To conclude, by combining Image and Text embedding analysis under deep learning framework, our DLI-IT approach achieved a competitive performance in advancing drug label identification. BioMed Central 2020-04-15 /pmc/articles/PMC7158001/ /pubmed/32293428 http://dx.doi.org/10.1186/s12911-020-1078-3 Text en © The Author(s). 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Liu, Xiangwen Meehan, Joe Tong, Weida Wu, Leihong Xu, Xiaowei Xu, Joshua DLI-IT: a deep learning approach to drug label identification through image and text embedding |
title | DLI-IT: a deep learning approach to drug label identification through image and text embedding |
title_full | DLI-IT: a deep learning approach to drug label identification through image and text embedding |
title_fullStr | DLI-IT: a deep learning approach to drug label identification through image and text embedding |
title_full_unstemmed | DLI-IT: a deep learning approach to drug label identification through image and text embedding |
title_short | DLI-IT: a deep learning approach to drug label identification through image and text embedding |
title_sort | dli-it: a deep learning approach to drug label identification through image and text embedding |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7158001/ https://www.ncbi.nlm.nih.gov/pubmed/32293428 http://dx.doi.org/10.1186/s12911-020-1078-3 |
work_keys_str_mv | AT liuxiangwen dliitadeeplearningapproachtodruglabelidentificationthroughimageandtextembedding AT meehanjoe dliitadeeplearningapproachtodruglabelidentificationthroughimageandtextembedding AT tongweida dliitadeeplearningapproachtodruglabelidentificationthroughimageandtextembedding AT wuleihong dliitadeeplearningapproachtodruglabelidentificationthroughimageandtextembedding AT xuxiaowei dliitadeeplearningapproachtodruglabelidentificationthroughimageandtextembedding AT xujoshua dliitadeeplearningapproachtodruglabelidentificationthroughimageandtextembedding |