Cargando…

Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study

PURPOSE: The COVID-19 pandemic has drastically disrupted global healthcare systems. With the higher demand for healthcare and misinformation related to COVID-19, there is a need to explore alternative models to improve communication. Artificial Intelligence (AI) and Natural Language Processing (NLP)...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Lily Wei Yun, Ng, Wei Yan, Lei, Xiaofeng, Tan, Shaun Chern Yuan, Wang, Zhaoran, Yan, Ming, Pargi, Mohan Kashyap, Zhang, Xiaoman, Lim, Jane Sujuan, Gunasekeran, Dinesh Visva, Tan, Franklin Chee Ping, Lee, Chen Ee, Yeo, Khung Keong, Tan, Hiang Khoon, Ho, Henry Sun Sien, Tan, Benedict Wee Bor, Wong, Tien Yin, Kwek, Kenneth Yung Chiang, Goh, Rick Siow Mong, Liu, Yong, Ting, Daniel Shu Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9968846/
https://www.ncbi.nlm.nih.gov/pubmed/36860378
http://dx.doi.org/10.3389/fpubh.2023.1063466
_version_ 1784897585441406976
author Yang, Lily Wei Yun
Ng, Wei Yan
Lei, Xiaofeng
Tan, Shaun Chern Yuan
Wang, Zhaoran
Yan, Ming
Pargi, Mohan Kashyap
Zhang, Xiaoman
Lim, Jane Sujuan
Gunasekeran, Dinesh Visva
Tan, Franklin Chee Ping
Lee, Chen Ee
Yeo, Khung Keong
Tan, Hiang Khoon
Ho, Henry Sun Sien
Tan, Benedict Wee Bor
Wong, Tien Yin
Kwek, Kenneth Yung Chiang
Goh, Rick Siow Mong
Liu, Yong
Ting, Daniel Shu Wei
author_facet Yang, Lily Wei Yun
Ng, Wei Yan
Lei, Xiaofeng
Tan, Shaun Chern Yuan
Wang, Zhaoran
Yan, Ming
Pargi, Mohan Kashyap
Zhang, Xiaoman
Lim, Jane Sujuan
Gunasekeran, Dinesh Visva
Tan, Franklin Chee Ping
Lee, Chen Ee
Yeo, Khung Keong
Tan, Hiang Khoon
Ho, Henry Sun Sien
Tan, Benedict Wee Bor
Wong, Tien Yin
Kwek, Kenneth Yung Chiang
Goh, Rick Siow Mong
Liu, Yong
Ting, Daniel Shu Wei
author_sort Yang, Lily Wei Yun
collection PubMed
description PURPOSE: The COVID-19 pandemic has drastically disrupted global healthcare systems. With the higher demand for healthcare and misinformation related to COVID-19, there is a need to explore alternative models to improve communication. Artificial Intelligence (AI) and Natural Language Processing (NLP) have emerged as promising solutions to improve healthcare delivery. Chatbots could fill a pivotal role in the dissemination and easy accessibility of accurate information in a pandemic. In this study, we developed a multi-lingual NLP-based AI chatbot, DR-COVID, which responds accurately to open-ended, COVID-19 related questions. This was used to facilitate pandemic education and healthcare delivery. METHODS: First, we developed DR-COVID with an ensemble NLP model on the Telegram platform (https://t.me/drcovid_nlp_chatbot). Second, we evaluated various performance metrics. Third, we evaluated multi-lingual text-to-text translation to Chinese, Malay, Tamil, Filipino, Thai, Japanese, French, Spanish, and Portuguese. We utilized 2,728 training questions and 821 test questions in English. Primary outcome measurements were (A) overall and top 3 accuracies; (B) Area Under the Curve (AUC), precision, recall, and F1 score. Overall accuracy referred to a correct response for the top answer, whereas top 3 accuracy referred to an appropriate response for any one answer amongst the top 3 answers. AUC and its relevant matrices were obtained from the Receiver Operation Characteristics (ROC) curve. Secondary outcomes were (A) multi-lingual accuracy; (B) comparison to enterprise-grade chatbot systems. The sharing of training and testing datasets on an open-source platform will also contribute to existing data. RESULTS: Our NLP model, utilizing the ensemble architecture, achieved overall and top 3 accuracies of 0.838 [95% confidence interval (CI): 0.826–0.851] and 0.922 [95% CI: 0.913–0.932] respectively. For overall and top 3 results, AUC scores of 0.917 [95% CI: 0.911–0.925] and 0.960 [95% CI: 0.955–0.964] were achieved respectively. We achieved multi-linguicism with nine non-English languages, with Portuguese performing the best overall at 0.900. Lastly, DR-COVID generated answers more accurately and quickly than other chatbots, within 1.12–2.15 s across three devices tested. CONCLUSION: DR-COVID is a clinically effective NLP-based conversational AI chatbot, and a promising solution for healthcare delivery in the pandemic era.
format Online
Article
Text
id pubmed-9968846
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99688462023-02-28 Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study Yang, Lily Wei Yun Ng, Wei Yan Lei, Xiaofeng Tan, Shaun Chern Yuan Wang, Zhaoran Yan, Ming Pargi, Mohan Kashyap Zhang, Xiaoman Lim, Jane Sujuan Gunasekeran, Dinesh Visva Tan, Franklin Chee Ping Lee, Chen Ee Yeo, Khung Keong Tan, Hiang Khoon Ho, Henry Sun Sien Tan, Benedict Wee Bor Wong, Tien Yin Kwek, Kenneth Yung Chiang Goh, Rick Siow Mong Liu, Yong Ting, Daniel Shu Wei Front Public Health Public Health PURPOSE: The COVID-19 pandemic has drastically disrupted global healthcare systems. With the higher demand for healthcare and misinformation related to COVID-19, there is a need to explore alternative models to improve communication. Artificial Intelligence (AI) and Natural Language Processing (NLP) have emerged as promising solutions to improve healthcare delivery. Chatbots could fill a pivotal role in the dissemination and easy accessibility of accurate information in a pandemic. In this study, we developed a multi-lingual NLP-based AI chatbot, DR-COVID, which responds accurately to open-ended, COVID-19 related questions. This was used to facilitate pandemic education and healthcare delivery. METHODS: First, we developed DR-COVID with an ensemble NLP model on the Telegram platform (https://t.me/drcovid_nlp_chatbot). Second, we evaluated various performance metrics. Third, we evaluated multi-lingual text-to-text translation to Chinese, Malay, Tamil, Filipino, Thai, Japanese, French, Spanish, and Portuguese. We utilized 2,728 training questions and 821 test questions in English. Primary outcome measurements were (A) overall and top 3 accuracies; (B) Area Under the Curve (AUC), precision, recall, and F1 score. Overall accuracy referred to a correct response for the top answer, whereas top 3 accuracy referred to an appropriate response for any one answer amongst the top 3 answers. AUC and its relevant matrices were obtained from the Receiver Operation Characteristics (ROC) curve. Secondary outcomes were (A) multi-lingual accuracy; (B) comparison to enterprise-grade chatbot systems. The sharing of training and testing datasets on an open-source platform will also contribute to existing data. RESULTS: Our NLP model, utilizing the ensemble architecture, achieved overall and top 3 accuracies of 0.838 [95% confidence interval (CI): 0.826–0.851] and 0.922 [95% CI: 0.913–0.932] respectively. For overall and top 3 results, AUC scores of 0.917 [95% CI: 0.911–0.925] and 0.960 [95% CI: 0.955–0.964] were achieved respectively. We achieved multi-linguicism with nine non-English languages, with Portuguese performing the best overall at 0.900. Lastly, DR-COVID generated answers more accurately and quickly than other chatbots, within 1.12–2.15 s across three devices tested. CONCLUSION: DR-COVID is a clinically effective NLP-based conversational AI chatbot, and a promising solution for healthcare delivery in the pandemic era. Frontiers Media S.A. 2023-02-13 /pmc/articles/PMC9968846/ /pubmed/36860378 http://dx.doi.org/10.3389/fpubh.2023.1063466 Text en Copyright © 2023 Yang, Ng, Lei, Tan, Wang, Yan, Pargi, Zhang, Lim, Gunasekeran, Tan, Lee, Yeo, Tan, Ho, Tan, Wong, Kwek, Goh, Liu and Ting. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Yang, Lily Wei Yun
Ng, Wei Yan
Lei, Xiaofeng
Tan, Shaun Chern Yuan
Wang, Zhaoran
Yan, Ming
Pargi, Mohan Kashyap
Zhang, Xiaoman
Lim, Jane Sujuan
Gunasekeran, Dinesh Visva
Tan, Franklin Chee Ping
Lee, Chen Ee
Yeo, Khung Keong
Tan, Hiang Khoon
Ho, Henry Sun Sien
Tan, Benedict Wee Bor
Wong, Tien Yin
Kwek, Kenneth Yung Chiang
Goh, Rick Siow Mong
Liu, Yong
Ting, Daniel Shu Wei
Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study
title Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study
title_full Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study
title_fullStr Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study
title_full_unstemmed Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study
title_short Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study
title_sort development and testing of a multi-lingual natural language processing-based deep learning system in 10 languages for covid-19 pandemic crisis: a multi-center study
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9968846/
https://www.ncbi.nlm.nih.gov/pubmed/36860378
http://dx.doi.org/10.3389/fpubh.2023.1063466
work_keys_str_mv AT yanglilyweiyun developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT ngweiyan developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT leixiaofeng developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT tanshaunchernyuan developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT wangzhaoran developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT yanming developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT pargimohankashyap developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT zhangxiaoman developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT limjanesujuan developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT gunasekerandineshvisva developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT tanfranklincheeping developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT leechenee developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT yeokhungkeong developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT tanhiangkhoon developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT hohenrysunsien developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT tanbenedictweebor developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT wongtienyin developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT kwekkennethyungchiang developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT gohricksiowmong developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT liuyong developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy
AT tingdanielshuwei developmentandtestingofamultilingualnaturallanguageprocessingbaseddeeplearningsystemin10languagesforcovid19pandemiccrisisamulticenterstudy