Cargando…
Predicting medical specialty from text based on a domain-specific pre-trained BERT
BACKGROUND: Owing to the prevalence of the coronavirus disease (COVID-19), coping with clinical issues at the individual level has become important to the healthcare system. Accordingly, precise initiation of treatment after a hospital visit is required for expedited processes and effective diagnose...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Authors. Published by Elsevier B.V.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9731829/ https://www.ncbi.nlm.nih.gov/pubmed/36512987 http://dx.doi.org/10.1016/j.ijmedinf.2022.104956 |
_version_ | 1784845988176855040 |
---|---|
author | Kim, Yoojoong Kim, Jong-Ho Kim, Young-Min Song, Sanghoun Joo, Hyung Joon |
author_facet | Kim, Yoojoong Kim, Jong-Ho Kim, Young-Min Song, Sanghoun Joo, Hyung Joon |
author_sort | Kim, Yoojoong |
collection | PubMed |
description | BACKGROUND: Owing to the prevalence of the coronavirus disease (COVID-19), coping with clinical issues at the individual level has become important to the healthcare system. Accordingly, precise initiation of treatment after a hospital visit is required for expedited processes and effective diagnoses of outpatients. To achieve this, artificial intelligence in medical natural language processing (NLP), such as a healthcare chatbot or a clinical decision support system, can be suitable tools for an advanced clinical system. Furthermore, support for decisions on the medical specialty from the initial visit can be helpful. MATERIALS AND METHODS: In this study, we propose a medical specialty prediction model from patient-side medical question text based on pre-trained bidirectional encoder representations from transformers (BERT). The dataset comprised pairs of medical question texts and labeled specialties scraped from a website for the medical question-and-answer service. The model was fine-tuned for predicting the required medical specialty labels among 27 labels from medical question texts. To demonstrate the feasibility, we conducted experiments on a real-world dataset and elaborately evaluated the predictive performance compared with four deep learning NLP models through cross-validation and test set evaluation. RESULTS: The proposed model showed improved performance compared with competitive models in terms of overall specialties. In addition, we demonstrate the usefulness of the proposed model by performing case studies for visualization applications. CONCLUSION: The proposed model can benefit hospital patient management and reasonable recommendations for specialties for patients. |
format | Online Article Text |
id | pubmed-9731829 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Authors. Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97318292022-12-09 Predicting medical specialty from text based on a domain-specific pre-trained BERT Kim, Yoojoong Kim, Jong-Ho Kim, Young-Min Song, Sanghoun Joo, Hyung Joon Int J Med Inform Article BACKGROUND: Owing to the prevalence of the coronavirus disease (COVID-19), coping with clinical issues at the individual level has become important to the healthcare system. Accordingly, precise initiation of treatment after a hospital visit is required for expedited processes and effective diagnoses of outpatients. To achieve this, artificial intelligence in medical natural language processing (NLP), such as a healthcare chatbot or a clinical decision support system, can be suitable tools for an advanced clinical system. Furthermore, support for decisions on the medical specialty from the initial visit can be helpful. MATERIALS AND METHODS: In this study, we propose a medical specialty prediction model from patient-side medical question text based on pre-trained bidirectional encoder representations from transformers (BERT). The dataset comprised pairs of medical question texts and labeled specialties scraped from a website for the medical question-and-answer service. The model was fine-tuned for predicting the required medical specialty labels among 27 labels from medical question texts. To demonstrate the feasibility, we conducted experiments on a real-world dataset and elaborately evaluated the predictive performance compared with four deep learning NLP models through cross-validation and test set evaluation. RESULTS: The proposed model showed improved performance compared with competitive models in terms of overall specialties. In addition, we demonstrate the usefulness of the proposed model by performing case studies for visualization applications. CONCLUSION: The proposed model can benefit hospital patient management and reasonable recommendations for specialties for patients. The Authors. Published by Elsevier B.V. 2023-02 2022-12-09 /pmc/articles/PMC9731829/ /pubmed/36512987 http://dx.doi.org/10.1016/j.ijmedinf.2022.104956 Text en © 2022 The Authors Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Kim, Yoojoong Kim, Jong-Ho Kim, Young-Min Song, Sanghoun Joo, Hyung Joon Predicting medical specialty from text based on a domain-specific pre-trained BERT |
title | Predicting medical specialty from text based on a domain-specific pre-trained BERT |
title_full | Predicting medical specialty from text based on a domain-specific pre-trained BERT |
title_fullStr | Predicting medical specialty from text based on a domain-specific pre-trained BERT |
title_full_unstemmed | Predicting medical specialty from text based on a domain-specific pre-trained BERT |
title_short | Predicting medical specialty from text based on a domain-specific pre-trained BERT |
title_sort | predicting medical specialty from text based on a domain-specific pre-trained bert |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9731829/ https://www.ncbi.nlm.nih.gov/pubmed/36512987 http://dx.doi.org/10.1016/j.ijmedinf.2022.104956 |
work_keys_str_mv | AT kimyoojoong predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert AT kimjongho predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert AT kimyoungmin predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert AT songsanghoun predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert AT joohyungjoon predictingmedicalspecialtyfromtextbasedonadomainspecificpretrainedbert |