Cargando…

High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion

Due to the significant resemblance in visual appearance, pill misuse is prevalent and has become a critical issue, responsible for one-third of all deaths worldwide. Pill identification, thus, is a crucial concern that needs to be investigated thoroughly. Recently, several attempts have been made to...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Anh Duy, Pham, Huy Hieu, Trung, Huynh Thanh, Nguyen, Quoc Viet Hung, Truong, Thao Nguyen, Nguyen, Phi Le
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10538799/
https://www.ncbi.nlm.nih.gov/pubmed/37768910
http://dx.doi.org/10.1371/journal.pone.0291865
_version_ 1785113379501768704
author Nguyen, Anh Duy
Pham, Huy Hieu
Trung, Huynh Thanh
Nguyen, Quoc Viet Hung
Truong, Thao Nguyen
Nguyen, Phi Le
author_facet Nguyen, Anh Duy
Pham, Huy Hieu
Trung, Huynh Thanh
Nguyen, Quoc Viet Hung
Truong, Thao Nguyen
Nguyen, Phi Le
author_sort Nguyen, Anh Duy
collection PubMed
description Due to the significant resemblance in visual appearance, pill misuse is prevalent and has become a critical issue, responsible for one-third of all deaths worldwide. Pill identification, thus, is a crucial concern that needs to be investigated thoroughly. Recently, several attempts have been made to exploit deep learning to tackle the pill identification problem. However, most published works consider only single-pill identification and fail to distinguish hard samples with identical appearances. Also, most existing pill image datasets only feature single pill images captured in carefully controlled environments under ideal lighting conditions and clean backgrounds. In this work, we are the first to tackle the multi-pill detection problem in real-world settings, aiming at localizing and identifying pills captured by users during pill intake. Moreover, we also introduce a multi-pill image dataset taken in unconstrained conditions. To handle hard samples, we propose a novel method for constructing heterogeneous a priori graphs incorporating three forms of inter-pill relationships, including co-occurrence likelihood, relative size, and visual semantic correlation. We then offer a framework for integrating a priori with pills’ visual features to enhance detection accuracy. Our experimental results have proved the robustness, reliability, and explainability of the proposed framework. Experimentally, it outperforms all detection benchmarks in terms of all evaluation metrics. Specifically, our proposed framework improves COCO mAP metrics by 9.4% over Faster R-CNN and 12.0% compared to vanilla YOLOv5. Our study opens up new opportunities for protecting patients from medication errors using an AI-based pill identification solution.
format Online
Article
Text
id pubmed-10538799
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105387992023-09-29 High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion Nguyen, Anh Duy Pham, Huy Hieu Trung, Huynh Thanh Nguyen, Quoc Viet Hung Truong, Thao Nguyen Nguyen, Phi Le PLoS One Research Article Due to the significant resemblance in visual appearance, pill misuse is prevalent and has become a critical issue, responsible for one-third of all deaths worldwide. Pill identification, thus, is a crucial concern that needs to be investigated thoroughly. Recently, several attempts have been made to exploit deep learning to tackle the pill identification problem. However, most published works consider only single-pill identification and fail to distinguish hard samples with identical appearances. Also, most existing pill image datasets only feature single pill images captured in carefully controlled environments under ideal lighting conditions and clean backgrounds. In this work, we are the first to tackle the multi-pill detection problem in real-world settings, aiming at localizing and identifying pills captured by users during pill intake. Moreover, we also introduce a multi-pill image dataset taken in unconstrained conditions. To handle hard samples, we propose a novel method for constructing heterogeneous a priori graphs incorporating three forms of inter-pill relationships, including co-occurrence likelihood, relative size, and visual semantic correlation. We then offer a framework for integrating a priori with pills’ visual features to enhance detection accuracy. Our experimental results have proved the robustness, reliability, and explainability of the proposed framework. Experimentally, it outperforms all detection benchmarks in terms of all evaluation metrics. Specifically, our proposed framework improves COCO mAP metrics by 9.4% over Faster R-CNN and 12.0% compared to vanilla YOLOv5. Our study opens up new opportunities for protecting patients from medication errors using an AI-based pill identification solution. Public Library of Science 2023-09-28 /pmc/articles/PMC10538799/ /pubmed/37768910 http://dx.doi.org/10.1371/journal.pone.0291865 Text en © 2023 Nguyen et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Nguyen, Anh Duy
Pham, Huy Hieu
Trung, Huynh Thanh
Nguyen, Quoc Viet Hung
Truong, Thao Nguyen
Nguyen, Phi Le
High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion
title High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion
title_full High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion
title_fullStr High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion
title_full_unstemmed High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion
title_short High accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion
title_sort high accurate and explainable multi-pill detection framework with graph neural network-assisted multimodal data fusion
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10538799/
https://www.ncbi.nlm.nih.gov/pubmed/37768910
http://dx.doi.org/10.1371/journal.pone.0291865
work_keys_str_mv AT nguyenanhduy highaccurateandexplainablemultipilldetectionframeworkwithgraphneuralnetworkassistedmultimodaldatafusion
AT phamhuyhieu highaccurateandexplainablemultipilldetectionframeworkwithgraphneuralnetworkassistedmultimodaldatafusion
AT trunghuynhthanh highaccurateandexplainablemultipilldetectionframeworkwithgraphneuralnetworkassistedmultimodaldatafusion
AT nguyenquocviethung highaccurateandexplainablemultipilldetectionframeworkwithgraphneuralnetworkassistedmultimodaldatafusion
AT truongthaonguyen highaccurateandexplainablemultipilldetectionframeworkwithgraphneuralnetworkassistedmultimodaldatafusion
AT nguyenphile highaccurateandexplainablemultipilldetectionframeworkwithgraphneuralnetworkassistedmultimodaldatafusion