Cargando…

Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework

BACKGROUND: Liver cancer is a substantial disease burden in China. As one of the primary diagnostic tools for detecting liver cancer, dynamic contrast-enhanced computed tomography provides detailed evidences for diagnosis that are recorded in free-text radiology reports. OBJECTIVE: The aim of our st...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Honglei, Zhang, Zhiqiang, Xu, Yan, Wang, Ni, Huang, Yanqun, Yang, Zhenghan, Jiang, Rui, Chen, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7837998/
https://www.ncbi.nlm.nih.gov/pubmed/33433395
http://dx.doi.org/10.2196/19689
_version_ 1783643073438285824
author Liu, Honglei
Zhang, Zhiqiang
Xu, Yan
Wang, Ni
Huang, Yanqun
Yang, Zhenghan
Jiang, Rui
Chen, Hui
author_facet Liu, Honglei
Zhang, Zhiqiang
Xu, Yan
Wang, Ni
Huang, Yanqun
Yang, Zhenghan
Jiang, Rui
Chen, Hui
author_sort Liu, Honglei
collection PubMed
description BACKGROUND: Liver cancer is a substantial disease burden in China. As one of the primary diagnostic tools for detecting liver cancer, dynamic contrast-enhanced computed tomography provides detailed evidences for diagnosis that are recorded in free-text radiology reports. OBJECTIVE: The aim of our study was to apply a deep learning model and rule-based natural language processing (NLP) method to identify evidences for liver cancer diagnosis automatically. METHODS: We proposed a pretrained, fine-tuned BERT (Bidirectional Encoder Representations from Transformers)-based BiLSTM-CRF (Bidirectional Long Short-Term Memory-Conditional Random Field) model to recognize the phrases of APHE (hyperintense enhancement in the arterial phase) and PDPH (hypointense in the portal and delayed phases). To identify more essential diagnostic evidences, we used the traditional rule-based NLP methods for the extraction of radiological features. APHE, PDPH, and other extracted radiological features were used to design a computer-aided liver cancer diagnosis framework by random forest. RESULTS: The BERT-BiLSTM-CRF predicted the phrases of APHE and PDPH with an F1 score of 98.40% and 90.67%, respectively. The prediction model using combined features had a higher performance (F1 score, 88.55%) than those using APHE and PDPH (84.88%) or other extracted radiological features (83.52%). APHE and PDPH were the top 2 essential features for liver cancer diagnosis. CONCLUSIONS: This work was a comprehensive NLP study, wherein we identified evidences for the diagnosis of liver cancer from Chinese radiology reports, considering both clinical knowledge and radiology findings. The BERT-based deep learning method for the extraction of diagnostic evidence achieved state-of-the-art performance. The high performance proves the feasibility of the BERT-BiLSTM-CRF model in information extraction from Chinese radiology reports. The findings of our study suggest that the deep learning–based method for automatically identifying evidences for diagnosis can be extended to other types of Chinese clinical texts.
format Online
Article
Text
id pubmed-7837998
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-78379982021-01-29 Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework Liu, Honglei Zhang, Zhiqiang Xu, Yan Wang, Ni Huang, Yanqun Yang, Zhenghan Jiang, Rui Chen, Hui J Med Internet Res Original Paper BACKGROUND: Liver cancer is a substantial disease burden in China. As one of the primary diagnostic tools for detecting liver cancer, dynamic contrast-enhanced computed tomography provides detailed evidences for diagnosis that are recorded in free-text radiology reports. OBJECTIVE: The aim of our study was to apply a deep learning model and rule-based natural language processing (NLP) method to identify evidences for liver cancer diagnosis automatically. METHODS: We proposed a pretrained, fine-tuned BERT (Bidirectional Encoder Representations from Transformers)-based BiLSTM-CRF (Bidirectional Long Short-Term Memory-Conditional Random Field) model to recognize the phrases of APHE (hyperintense enhancement in the arterial phase) and PDPH (hypointense in the portal and delayed phases). To identify more essential diagnostic evidences, we used the traditional rule-based NLP methods for the extraction of radiological features. APHE, PDPH, and other extracted radiological features were used to design a computer-aided liver cancer diagnosis framework by random forest. RESULTS: The BERT-BiLSTM-CRF predicted the phrases of APHE and PDPH with an F1 score of 98.40% and 90.67%, respectively. The prediction model using combined features had a higher performance (F1 score, 88.55%) than those using APHE and PDPH (84.88%) or other extracted radiological features (83.52%). APHE and PDPH were the top 2 essential features for liver cancer diagnosis. CONCLUSIONS: This work was a comprehensive NLP study, wherein we identified evidences for the diagnosis of liver cancer from Chinese radiology reports, considering both clinical knowledge and radiology findings. The BERT-based deep learning method for the extraction of diagnostic evidence achieved state-of-the-art performance. The high performance proves the feasibility of the BERT-BiLSTM-CRF model in information extraction from Chinese radiology reports. The findings of our study suggest that the deep learning–based method for automatically identifying evidences for diagnosis can be extended to other types of Chinese clinical texts. JMIR Publications 2021-01-12 /pmc/articles/PMC7837998/ /pubmed/33433395 http://dx.doi.org/10.2196/19689 Text en ©Honglei Liu, Zhiqiang Zhang, Yan Xu, Ni Wang, Yanqun Huang, Zhenghan Yang, Rui Jiang, Hui Chen. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 12.01.2021. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Liu, Honglei
Zhang, Zhiqiang
Xu, Yan
Wang, Ni
Huang, Yanqun
Yang, Zhenghan
Jiang, Rui
Chen, Hui
Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_full Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_fullStr Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_full_unstemmed Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_short Use of BERT (Bidirectional Encoder Representations from Transformers)-Based Deep Learning Method for Extracting Evidences in Chinese Radiology Reports: Development of a Computer-Aided Liver Cancer Diagnosis Framework
title_sort use of bert (bidirectional encoder representations from transformers)-based deep learning method for extracting evidences in chinese radiology reports: development of a computer-aided liver cancer diagnosis framework
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7837998/
https://www.ncbi.nlm.nih.gov/pubmed/33433395
http://dx.doi.org/10.2196/19689
work_keys_str_mv AT liuhonglei useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT zhangzhiqiang useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT xuyan useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT wangni useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT huangyanqun useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT yangzhenghan useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT jiangrui useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework
AT chenhui useofbertbidirectionalencoderrepresentationsfromtransformersbaseddeeplearningmethodforextractingevidencesinchineseradiologyreportsdevelopmentofacomputeraidedlivercancerdiagnosisframework