Cargando…

Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure

Machine learning (ML) has been extensively involved in assistant disease diagnosis and prediction systems to emancipate the serious dependence on medical resources and improve healthcare quality. Moreover, with the booming of pre-training language models (PLMs), the application prospect and promotio...

Descripción completa

Detalles Bibliográficos
Autores principales: Dong, Bowen, Wang, Zhuo, Li, Zhenyu, Duan, Zhichao, Xu, Jiacheng, Pan, Tengyu, Zhang, Rui, Liu, Ning, Li, Xiuxing, Wang, Jie, Liu, Caiyan, Dong, Liling, Mao, Chenhui, Gao, Jing, Wang, Jianyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10400680/
https://www.ncbi.nlm.nih.gov/pubmed/37537202
http://dx.doi.org/10.1038/s41598-023-39543-2
_version_ 1785084497663885312
author Dong, Bowen
Wang, Zhuo
Li, Zhenyu
Duan, Zhichao
Xu, Jiacheng
Pan, Tengyu
Zhang, Rui
Liu, Ning
Li, Xiuxing
Wang, Jie
Liu, Caiyan
Dong, Liling
Mao, Chenhui
Gao, Jing
Wang, Jianyong
author_facet Dong, Bowen
Wang, Zhuo
Li, Zhenyu
Duan, Zhichao
Xu, Jiacheng
Pan, Tengyu
Zhang, Rui
Liu, Ning
Li, Xiuxing
Wang, Jie
Liu, Caiyan
Dong, Liling
Mao, Chenhui
Gao, Jing
Wang, Jianyong
author_sort Dong, Bowen
collection PubMed
description Machine learning (ML) has been extensively involved in assistant disease diagnosis and prediction systems to emancipate the serious dependence on medical resources and improve healthcare quality. Moreover, with the booming of pre-training language models (PLMs), the application prospect and promotion potential of machine learning methods in the relevant field have been further inspired. PLMs have recently achieved tremendous success in diverse text processing tasks, whereas limited by the significant semantic gap between the pre-training corpus and the structured electronic health records (EHRs), PLMs cannot converge to anticipated disease diagnosis and prediction results. Unfortunately, establishing connections between PLMs and EHRs typically requires the extraction of curated predictor variables from structured EHR resources, which is tedious and labor-intensive, and even discards vast implicit information.In this work, we propose an Input Prompting and Discriminative language model with the Mixture-of-experts framework (IPDM) by promoting the model’s capabilities to learn knowledge from heterogeneous information and facilitating the feature-aware ability of the model. Furthermore, leveraging the prompt-tuning mechanism, IPDM can inherit the impacts of the pre-training in downstream tasks exclusively through minor modifications. IPDM remarkably outperforms existing models, proved by experiments on one disease diagnosis task and two disease prediction tasks. Finally, experiments with few-feature and few-sample demonstrate that IPDM achieves significant stability and impressive performance in predicting chronic diseases with unclear early-onset characteristics or sudden diseases with insufficient data, which verifies the superiority of IPDM over existing mainstream methods, and reveals the IPDM can powerfully address the aforementioned challenges via establishing a stable and low-resource medical diagnostic system for various clinical scenarios.
format Online
Article
Text
id pubmed-10400680
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104006802023-08-05 Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure Dong, Bowen Wang, Zhuo Li, Zhenyu Duan, Zhichao Xu, Jiacheng Pan, Tengyu Zhang, Rui Liu, Ning Li, Xiuxing Wang, Jie Liu, Caiyan Dong, Liling Mao, Chenhui Gao, Jing Wang, Jianyong Sci Rep Article Machine learning (ML) has been extensively involved in assistant disease diagnosis and prediction systems to emancipate the serious dependence on medical resources and improve healthcare quality. Moreover, with the booming of pre-training language models (PLMs), the application prospect and promotion potential of machine learning methods in the relevant field have been further inspired. PLMs have recently achieved tremendous success in diverse text processing tasks, whereas limited by the significant semantic gap between the pre-training corpus and the structured electronic health records (EHRs), PLMs cannot converge to anticipated disease diagnosis and prediction results. Unfortunately, establishing connections between PLMs and EHRs typically requires the extraction of curated predictor variables from structured EHR resources, which is tedious and labor-intensive, and even discards vast implicit information.In this work, we propose an Input Prompting and Discriminative language model with the Mixture-of-experts framework (IPDM) by promoting the model’s capabilities to learn knowledge from heterogeneous information and facilitating the feature-aware ability of the model. Furthermore, leveraging the prompt-tuning mechanism, IPDM can inherit the impacts of the pre-training in downstream tasks exclusively through minor modifications. IPDM remarkably outperforms existing models, proved by experiments on one disease diagnosis task and two disease prediction tasks. Finally, experiments with few-feature and few-sample demonstrate that IPDM achieves significant stability and impressive performance in predicting chronic diseases with unclear early-onset characteristics or sudden diseases with insufficient data, which verifies the superiority of IPDM over existing mainstream methods, and reveals the IPDM can powerfully address the aforementioned challenges via establishing a stable and low-resource medical diagnostic system for various clinical scenarios. Nature Publishing Group UK 2023-08-03 /pmc/articles/PMC10400680/ /pubmed/37537202 http://dx.doi.org/10.1038/s41598-023-39543-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Dong, Bowen
Wang, Zhuo
Li, Zhenyu
Duan, Zhichao
Xu, Jiacheng
Pan, Tengyu
Zhang, Rui
Liu, Ning
Li, Xiuxing
Wang, Jie
Liu, Caiyan
Dong, Liling
Mao, Chenhui
Gao, Jing
Wang, Jianyong
Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure
title Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure
title_full Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure
title_fullStr Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure
title_full_unstemmed Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure
title_short Toward a stable and low-resource PLM-based medical diagnostic system via prompt tuning and MoE structure
title_sort toward a stable and low-resource plm-based medical diagnostic system via prompt tuning and moe structure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10400680/
https://www.ncbi.nlm.nih.gov/pubmed/37537202
http://dx.doi.org/10.1038/s41598-023-39543-2
work_keys_str_mv AT dongbowen towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT wangzhuo towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT lizhenyu towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT duanzhichao towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT xujiacheng towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT pantengyu towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT zhangrui towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT liuning towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT lixiuxing towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT wangjie towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT liucaiyan towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT dongliling towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT maochenhui towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT gaojing towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure
AT wangjianyong towardastableandlowresourceplmbasedmedicaldiagnosticsystemviaprompttuningandmoestructure