Cargando…

Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers

Background: Due to its convenience, wide availability, low usage cost, neural machine translation (NMT) has increasing applications in diverse clinical settings and web-based self-diagnosis of diseases. Given the developing nature of NMT tools, this can pose safety risks to multicultural communities...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Wenxiu, Ji, Meng, Zhao, Mengdan, Zhou, Tianqi, Yang, Fan, Qian, Xiaobo, Chow, Chi-Yin, Lam, Kam-Yiu, Hao, Tianyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8566668/
https://www.ncbi.nlm.nih.gov/pubmed/34744846
http://dx.doi.org/10.3389/fpsyt.2021.771562
_version_ 1784594063955066880
author Xie, Wenxiu
Ji, Meng
Zhao, Mengdan
Zhou, Tianqi
Yang, Fan
Qian, Xiaobo
Chow, Chi-Yin
Lam, Kam-Yiu
Hao, Tianyong
author_facet Xie, Wenxiu
Ji, Meng
Zhao, Mengdan
Zhou, Tianqi
Yang, Fan
Qian, Xiaobo
Chow, Chi-Yin
Lam, Kam-Yiu
Hao, Tianyong
author_sort Xie, Wenxiu
collection PubMed
description Background: Due to its convenience, wide availability, low usage cost, neural machine translation (NMT) has increasing applications in diverse clinical settings and web-based self-diagnosis of diseases. Given the developing nature of NMT tools, this can pose safety risks to multicultural communities with limited bilingual skills, low education, and low health literacy. Research is needed to scrutinise the reliability, credibility, usability of automatically translated patient health information. Objective: We aimed to develop high-performing Bayesian machine learning classifiers to assist clinical professionals and healthcare workers in assessing the quality and usability of NMT on depressive disorders. The tool did not require any prior knowledge from frontline health and medical professionals of the target language used by patients. Methods: We used Relevance Vector Machine (RVM) to increase generalisability and clinical interpretability of classifiers. It is a typical sparse Bayesian classifier less prone to overfitting with small training datasets. We optimised RVM by leveraging automatic recursive feature elimination and expert feature refinement from the perspective of health linguistics. We evaluated the diagnostic utility of the Bayesian classifier under different probability cut-offs in terms of sensitivity, specificity, positive and negative likelihood ratios against clinical thresholds for diagnostic tests. Finally, we illustrated interpretation of RVM tool in clinic using Bayes' nomogram. Results: After automatic and expert-based feature optimisation, the best-performing RVM classifier (RVM_DUFS12) gained the highest AUC (0.8872) among 52 competing models with distinct optimised, normalised features sets. It also had statistically higher sensitivity and specificity compared to other models. We evaluated the diagnostic utility of the best-performing model using Bayes' nomogram: it had a positive likelihood ratio (LR+) of 4.62 (95% C.I.: 2.53, 8.43), and the associated posterior probability (odds) was 83% (5.0) (95% C.I.: 73%, 90%), meaning that approximately 10 in 12 English texts with positive test are likely to contain information that would cause clinically significant conceptual errors if translated by Google; it had a negative likelihood ratio (LR-) of 0.18 (95% C.I.: 0.10,0.35) and associated posterior probability (odds) was 16% (0.2) (95% C.I: 10%, 27%), meaning that about 10 in 12 English texts with negative test can be safely translated using Google.
format Online
Article
Text
id pubmed-8566668
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85666682021-11-05 Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers Xie, Wenxiu Ji, Meng Zhao, Mengdan Zhou, Tianqi Yang, Fan Qian, Xiaobo Chow, Chi-Yin Lam, Kam-Yiu Hao, Tianyong Front Psychiatry Psychiatry Background: Due to its convenience, wide availability, low usage cost, neural machine translation (NMT) has increasing applications in diverse clinical settings and web-based self-diagnosis of diseases. Given the developing nature of NMT tools, this can pose safety risks to multicultural communities with limited bilingual skills, low education, and low health literacy. Research is needed to scrutinise the reliability, credibility, usability of automatically translated patient health information. Objective: We aimed to develop high-performing Bayesian machine learning classifiers to assist clinical professionals and healthcare workers in assessing the quality and usability of NMT on depressive disorders. The tool did not require any prior knowledge from frontline health and medical professionals of the target language used by patients. Methods: We used Relevance Vector Machine (RVM) to increase generalisability and clinical interpretability of classifiers. It is a typical sparse Bayesian classifier less prone to overfitting with small training datasets. We optimised RVM by leveraging automatic recursive feature elimination and expert feature refinement from the perspective of health linguistics. We evaluated the diagnostic utility of the Bayesian classifier under different probability cut-offs in terms of sensitivity, specificity, positive and negative likelihood ratios against clinical thresholds for diagnostic tests. Finally, we illustrated interpretation of RVM tool in clinic using Bayes' nomogram. Results: After automatic and expert-based feature optimisation, the best-performing RVM classifier (RVM_DUFS12) gained the highest AUC (0.8872) among 52 competing models with distinct optimised, normalised features sets. It also had statistically higher sensitivity and specificity compared to other models. We evaluated the diagnostic utility of the best-performing model using Bayes' nomogram: it had a positive likelihood ratio (LR+) of 4.62 (95% C.I.: 2.53, 8.43), and the associated posterior probability (odds) was 83% (5.0) (95% C.I.: 73%, 90%), meaning that approximately 10 in 12 English texts with positive test are likely to contain information that would cause clinically significant conceptual errors if translated by Google; it had a negative likelihood ratio (LR-) of 0.18 (95% C.I.: 0.10,0.35) and associated posterior probability (odds) was 16% (0.2) (95% C.I: 10%, 27%), meaning that about 10 in 12 English texts with negative test can be safely translated using Google. Frontiers Media S.A. 2021-10-21 /pmc/articles/PMC8566668/ /pubmed/34744846 http://dx.doi.org/10.3389/fpsyt.2021.771562 Text en Copyright © 2021 Xie, Ji, Zhao, Zhou, Yang, Qian, Chow, Lam and Hao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychiatry
Xie, Wenxiu
Ji, Meng
Zhao, Mengdan
Zhou, Tianqi
Yang, Fan
Qian, Xiaobo
Chow, Chi-Yin
Lam, Kam-Yiu
Hao, Tianyong
Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers
title Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers
title_full Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers
title_fullStr Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers
title_full_unstemmed Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers
title_short Detecting Symptom Errors in Neural Machine Translation of Patient Health Information on Depressive Disorders: Developing Interpretable Bayesian Machine Learning Classifiers
title_sort detecting symptom errors in neural machine translation of patient health information on depressive disorders: developing interpretable bayesian machine learning classifiers
topic Psychiatry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8566668/
https://www.ncbi.nlm.nih.gov/pubmed/34744846
http://dx.doi.org/10.3389/fpsyt.2021.771562
work_keys_str_mv AT xiewenxiu detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT jimeng detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT zhaomengdan detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT zhoutianqi detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT yangfan detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT qianxiaobo detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT chowchiyin detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT lamkamyiu detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers
AT haotianyong detectingsymptomerrorsinneuralmachinetranslationofpatienthealthinformationondepressivedisordersdevelopinginterpretablebayesianmachinelearningclassifiers