Cargando…

Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models

BACKGROUND: Efficient exploration of knowledge for the treatment of recurrent glioblastoma (GBM) is critical for both clinicians and researchers. However, due to the large number of clinical trials and published articles, searching for this knowledge is very labor-intensive. In the current study, us...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jang, Bum-Sup, Park, Andrew J., Kim, In Ah
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9559267/ https://www.ncbi.nlm.nih.gov/pubmed/36229835 http://dx.doi.org/10.1186/s12911-022-02003-4

_version_	1784807616999849984
author	Jang, Bum-Sup Park, Andrew J. Kim, In Ah
author_facet	Jang, Bum-Sup Park, Andrew J. Kim, In Ah
author_sort	Jang, Bum-Sup
collection	PubMed
description	BACKGROUND: Efficient exploration of knowledge for the treatment of recurrent glioblastoma (GBM) is critical for both clinicians and researchers. However, due to the large number of clinical trials and published articles, searching for this knowledge is very labor-intensive. In the current study, using natural language processing (NLP), we analyzed medical research corpora related to recurrent glioblastoma to find potential targets and treatments. METHODS: We fine-tuned the ‘SAPBERT’, which was pretrained on biomedical ontologies, to perform question/answering (QA) and name entity recognition (NER) tasks for medical corpora. The model was fine-tuned with the SQUAD2 dataset and multiple NER datasets designed for QA task and NER task, respectively. Corpora were collected by searching the terms “recurrent glioblastoma” and “drug target”, published from 2000 to 2020 in the Web of science (N = 288 articles). Also, clinical trial corpora were collected from ‘clinicaltrial.gov’ using the searching term of ‘recurrent glioblastoma” (N = 587 studies). RESULTS: For the QA task, the model showed an F1 score of 0.79. For the NER task, the model showed F1 scores of 0.90 and 0.76 for drug and gene name recognition, respectively. When asked what the molecular targets were promising for recurrent glioblastoma, the model answered that RTK inhibitors or LPA-1 antagonists were promising. From collected clinical trials, the model summarized them in the order of bevacizumab, temozolomide, lomustine, and nivolumab. Based on published articles, the model found the many drug-gene pairs with the NER task, and we presented them with a circus plot and related summarization (https://github.com/bigwiz83/NLP_rGBM). CONCLUSION: Using NLP deep learning models, we could explore potential targets and treatments based on medical research and clinical trial corpora. The knowledge found by the models may be used for treating recurrent glioblastoma.
format	Online Article Text
id	pubmed-9559267
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-95592672022-10-14 Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models Jang, Bum-Sup Park, Andrew J. Kim, In Ah BMC Med Inform Decis Mak Research BACKGROUND: Efficient exploration of knowledge for the treatment of recurrent glioblastoma (GBM) is critical for both clinicians and researchers. However, due to the large number of clinical trials and published articles, searching for this knowledge is very labor-intensive. In the current study, using natural language processing (NLP), we analyzed medical research corpora related to recurrent glioblastoma to find potential targets and treatments. METHODS: We fine-tuned the ‘SAPBERT’, which was pretrained on biomedical ontologies, to perform question/answering (QA) and name entity recognition (NER) tasks for medical corpora. The model was fine-tuned with the SQUAD2 dataset and multiple NER datasets designed for QA task and NER task, respectively. Corpora were collected by searching the terms “recurrent glioblastoma” and “drug target”, published from 2000 to 2020 in the Web of science (N = 288 articles). Also, clinical trial corpora were collected from ‘clinicaltrial.gov’ using the searching term of ‘recurrent glioblastoma” (N = 587 studies). RESULTS: For the QA task, the model showed an F1 score of 0.79. For the NER task, the model showed F1 scores of 0.90 and 0.76 for drug and gene name recognition, respectively. When asked what the molecular targets were promising for recurrent glioblastoma, the model answered that RTK inhibitors or LPA-1 antagonists were promising. From collected clinical trials, the model summarized them in the order of bevacizumab, temozolomide, lomustine, and nivolumab. Based on published articles, the model found the many drug-gene pairs with the NER task, and we presented them with a circus plot and related summarization (https://github.com/bigwiz83/NLP_rGBM). CONCLUSION: Using NLP deep learning models, we could explore potential targets and treatments based on medical research and clinical trial corpora. The knowledge found by the models may be used for treating recurrent glioblastoma. BioMed Central 2022-10-13 /pmc/articles/PMC9559267/ /pubmed/36229835 http://dx.doi.org/10.1186/s12911-022-02003-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Research Jang, Bum-Sup Park, Andrew J. Kim, In Ah Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models
title	Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models
title_full	Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models
title_fullStr	Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models
title_full_unstemmed	Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models
title_short	Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models
title_sort	exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9559267/ https://www.ncbi.nlm.nih.gov/pubmed/36229835 http://dx.doi.org/10.1186/s12911-022-02003-4
work_keys_str_mv	AT jangbumsup explorationofbiomedicalknowledgeforrecurrentglioblastomausingnaturallanguageprocessingdeeplearningmodels AT parkandrewj explorationofbiomedicalknowledgeforrecurrentglioblastomausingnaturallanguageprocessingdeeplearningmodels AT kiminah explorationofbiomedicalknowledgeforrecurrentglioblastomausingnaturallanguageprocessingdeeplearningmodels

Exploration of biomedical knowledge for recurrent glioblastoma using natural language processing deep learning models

Ejemplares similares