Cargando…

Predicting medical specialty from text based on a domain-specific pre-trained BERT

BACKGROUND: Owing to the prevalence of the coronavirus disease (COVID-19), coping with clinical issues at the individual level has become important to the healthcare system. Accordingly, precise initiation of treatment after a hospital visit is required for expedited processes and effective diagnose...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Yoojoong, Kim, Jong-Ho, Kim, Young-Min, Song, Sanghoun, Joo, Hyung Joon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Authors. Published by Elsevier B.V. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9731829/
https://www.ncbi.nlm.nih.gov/pubmed/36512987
http://dx.doi.org/10.1016/j.ijmedinf.2022.104956
_version_ 1784845988176855040
author Kim, Yoojoong
Kim, Jong-Ho
Kim, Young-Min
Song, Sanghoun
Joo, Hyung Joon
author_facet Kim, Yoojoong
Kim, Jong-Ho
Kim, Young-Min
Song, Sanghoun
Joo, Hyung Joon
author_sort Kim, Yoojoong
collection PubMed
description BACKGROUND: Owing to the prevalence of the coronavirus disease (COVID-19), coping with clinical issues at the individual level has become important to the healthcare system. Accordingly, precise initiation of treatment after a hospital visit is required for expedited processes and effective diagnoses of outpatients. To achieve this, artificial intelligence in medical natural language processing (NLP), such as a healthcare chatbot or a clinical decision support system, can be suitable tools for an advanced clinical system. Furthermore, support for decisions on the medical specialty from the initial visit can be helpful. MATERIALS AND METHODS: In this study, we propose a medical specialty prediction model from patient-side medical question text based on pre-trained bidirectional encoder representations from transformers (BERT). The dataset comprised pairs of medical question texts and labeled specialties scraped from a website for the medical question-and-answer service. The model was fine-tuned for predicting the required medical specialty labels among 27 labels from medical question texts. To demonstrate the feasibility, we conducted experiments on a real-world dataset and elaborately evaluated the predictive performance compared with four deep learning NLP models through cross-validation and test set evaluation. RESULTS: The proposed model showed improved performance compared with competitive models in terms of overall specialties. In addition, we demonstrate the usefulness of the proposed model by performing case studies for visualization applications. CONCLUSION: The proposed model can benefit hospital patient management and reasonable recommendations for specialties for patients.
format Online
Article
Text
id pubmed-9731829
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Authors. Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-97318292022-12-09 Predicting medical specialty from text based on a domain-specific pre-trained BERT Kim, Yoojoong Kim, Jong-Ho Kim, Young-Min Song, Sanghoun Joo, Hyung Joon Int J Med Inform Article BACKGROUND: Owing to the prevalence of the coronavirus disease (COVID-19), coping with clinical issues at the individual level has become important to the healthcare system. Accordingly, precise initiation of treatment after a hospital visit is required for expedited processes and effective diagnoses of outpatients. To achieve this, artificial intelligence in medical natural language processing (NLP), such as a healthcare chatbot or a clinical decision support system, can be suitable tools for an advanced clinical system. Furthermore, support for decisions on the medical specialty from the initial visit can be helpful. MATERIALS AND METHODS: In this study, we propose a medical specialty prediction model from patient-side medical question text based on pre-trained bidirectional encoder representations from transformers (BERT). The dataset comprised pairs of medical question texts and labeled specialties scraped from a website for the medical question-and-answer service. The model was fine-tuned for predicting the required medical specialty labels among 27 labels from medical question texts. To demonstrate the feasibility, we conducted experiments on a real-world dataset and elaborately evaluated the predictive performance compared with four deep learning NLP models through cross-validation and test set evaluation. RESULTS: The proposed model showed improved performance compared with competitive models in terms of overall specialties. In addition, we demonstrate the usefulness of the proposed model by performing case studies for visualization applications. CONCLUSION: The proposed model can benefit hospital patient management and reasonable recommendations for specialties for patients. The Authors. Published by Elsevier B.V. 2023-02 2022-12-09 /pmc/articles/PMC9731829/ /pubmed/36512987 http://dx.doi.org/10.1016/j.ijmedinf.2022.104956 Text en © 2022 The Authors Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Kim, Yoojoong
Kim, Jong-Ho
Kim, Young-Min
Song, Sanghoun
Joo, Hyung Joon
Predicting medical specialty from text based on a domain-specific pre-trained BERT
title Predicting medical specialty from text based on a domain-specific pre-trained BERT
title_full Predicting medical specialty from text based on a domain-specific pre-trained BERT
title_fullStr Predicting medical specialty from text based on a domain-specific pre-trained BERT
title_full_unstemmed Predicting medical specialty from text based on a domain-specific pre-trained BERT
title_short Predicting medical specialty from text based on a domain-specific pre-trained BERT
title_sort predicting medical specialty from text based on a domain-specific pre-trained bert
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9731829/
https://www.ncbi.nlm.nih.gov/pubmed/36512987
http://dx.doi.org/10.1016/j.ijmedinf.2022.104956
work_keys_str_mv AT kimyoojoong predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert
AT kimjongho predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert
AT kimyoungmin predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert
AT songsanghoun predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert
AT joohyungjoon predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert