Cargando…

Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT

Knowledge extraction from rich text in online health communities can supplement and improve the existing knowledge base, supporting evidence-based medicine and clinical decision making. The extracted time series health management data of users can help users with similar conditions when managing the...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yanli, Li, Xinmiao, Yang, Yu, Wang, Tao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9779596/
https://www.ncbi.nlm.nih.gov/pubmed/36554472
http://dx.doi.org/10.3390/ijerph192416590
_version_ 1784856648275197952
author Zhang, Yanli
Li, Xinmiao
Yang, Yu
Wang, Tao
author_facet Zhang, Yanli
Li, Xinmiao
Yang, Yu
Wang, Tao
author_sort Zhang, Yanli
collection PubMed
description Knowledge extraction from rich text in online health communities can supplement and improve the existing knowledge base, supporting evidence-based medicine and clinical decision making. The extracted time series health management data of users can help users with similar conditions when managing their health. By annotating four relationships, this study constructed a deep learning model, BERT-BiGRU-ATT, to extract disease–medication relationships. A Chinese-pretrained BERT model was used to generate word embeddings for the question-and-answer data from online health communities in China. In addition, the bidirectional gated recurrent unit, combined with an attention mechanism, was employed to capture sequence context features and then to classify text related to diseases and drugs using a softmax classifier and to obtain the time series data provided by users. By using various word embedding training experiments and comparisons with classical models, the superiority of our model in relation to extraction was verified. Based on the knowledge extraction, the evolution of a user’s disease progression was analyzed according to the time series data provided by users to further analyze the evolution of the user’s disease progression. BERT word embedding, GRU, and attention mechanisms in our research play major roles in knowledge extraction. The knowledge extraction results obtained are expected to supplement and improve the existing knowledge base, assist doctors’ diagnosis, and help users with dynamic lifecycle health management, such as user disease treatment management. In future studies, a co-reference resolution can be introduced to further improve the effect of extracting the relationships among diseases, drugs, and drug effects.
format Online
Article
Text
id pubmed-9779596
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-97795962022-12-23 Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT Zhang, Yanli Li, Xinmiao Yang, Yu Wang, Tao Int J Environ Res Public Health Article Knowledge extraction from rich text in online health communities can supplement and improve the existing knowledge base, supporting evidence-based medicine and clinical decision making. The extracted time series health management data of users can help users with similar conditions when managing their health. By annotating four relationships, this study constructed a deep learning model, BERT-BiGRU-ATT, to extract disease–medication relationships. A Chinese-pretrained BERT model was used to generate word embeddings for the question-and-answer data from online health communities in China. In addition, the bidirectional gated recurrent unit, combined with an attention mechanism, was employed to capture sequence context features and then to classify text related to diseases and drugs using a softmax classifier and to obtain the time series data provided by users. By using various word embedding training experiments and comparisons with classical models, the superiority of our model in relation to extraction was verified. Based on the knowledge extraction, the evolution of a user’s disease progression was analyzed according to the time series data provided by users to further analyze the evolution of the user’s disease progression. BERT word embedding, GRU, and attention mechanisms in our research play major roles in knowledge extraction. The knowledge extraction results obtained are expected to supplement and improve the existing knowledge base, assist doctors’ diagnosis, and help users with dynamic lifecycle health management, such as user disease treatment management. In future studies, a co-reference resolution can be introduced to further improve the effect of extracting the relationships among diseases, drugs, and drug effects. MDPI 2022-12-09 /pmc/articles/PMC9779596/ /pubmed/36554472 http://dx.doi.org/10.3390/ijerph192416590 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Yanli
Li, Xinmiao
Yang, Yu
Wang, Tao
Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT
title Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT
title_full Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT
title_fullStr Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT
title_full_unstemmed Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT
title_short Disease- and Drug-Related Knowledge Extraction for Health Management from Online Health Communities Based on BERT-BiGRU-ATT
title_sort disease- and drug-related knowledge extraction for health management from online health communities based on bert-bigru-att
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9779596/
https://www.ncbi.nlm.nih.gov/pubmed/36554472
http://dx.doi.org/10.3390/ijerph192416590
work_keys_str_mv AT zhangyanli diseaseanddrugrelatedknowledgeextractionforhealthmanagementfromonlinehealthcommunitiesbasedonbertbigruatt
AT lixinmiao diseaseanddrugrelatedknowledgeextractionforhealthmanagementfromonlinehealthcommunitiesbasedonbertbigruatt
AT yangyu diseaseanddrugrelatedknowledgeextractionforhealthmanagementfromonlinehealthcommunitiesbasedonbertbigruatt
AT wangtao diseaseanddrugrelatedknowledgeextractionforhealthmanagementfromonlinehealthcommunitiesbasedonbertbigruatt