Cargando…

Applying deep matching networks to Chinese medical question answering: a study and a dataset

BACKGROUND: Medical and clinical question answering (QA) is highly concerned by researchers recently. Though there are remarkable advances in this field, the development in Chinese medical domain is relatively backward. It can be attributed to the difficulty of Chinese text processing and the lack o...

Descripción completa

Detalles Bibliográficos
Autores principales:	He, Junqing, Fu, Mingming, Tu, Manshu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454599/ https://www.ncbi.nlm.nih.gov/pubmed/30961607 http://dx.doi.org/10.1186/s12911-019-0761-8

_version_	1783409568116637696
author	He, Junqing Fu, Mingming Tu, Manshu
author_facet	He, Junqing Fu, Mingming Tu, Manshu
author_sort	He, Junqing
collection	PubMed
description	BACKGROUND: Medical and clinical question answering (QA) is highly concerned by researchers recently. Though there are remarkable advances in this field, the development in Chinese medical domain is relatively backward. It can be attributed to the difficulty of Chinese text processing and the lack of large-scale datasets. To bridge the gap, this paper introduces a Chinese medical QA dataset and proposes effective methods for the task. METHODS: We first construct a large scale Chinese medical QA dataset. Then we leverage deep matching neural networks to capture semantic interaction between words in questions and answers. Considering that Chinese Word Segmentation (CWS) tools may fail to identify clinical terms, we design a module to merge the word segments and produce a new representation. It learns the common compositions of words or segments by using convolutional kernels and selects the strongest signals by windowed pooling. RESULTS: The best performer among popular CWS tools on our dataset is found. In our experiments, deep matching models substantially outperform existing methods. Results also show that our proposed semantic clustered representation module improves the performance of models by up to 5.5% Precision at 1 and 4.9% Mean Average Precision. CONCLUSIONS: In this paper, we introduce a large scale Chinese medical QA dataset and cast the task into a semantic matching problem. We also compare different CWS tools and input units. Among the two state-of-the-art deep matching neural networks, MatchPyramid performs better. Results also show the effectiveness of the proposed semantic clustered representation module.
format	Online Article Text
id	pubmed-6454599
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-64545992019-04-19 Applying deep matching networks to Chinese medical question answering: a study and a dataset He, Junqing Fu, Mingming Tu, Manshu BMC Med Inform Decis Mak Research BACKGROUND: Medical and clinical question answering (QA) is highly concerned by researchers recently. Though there are remarkable advances in this field, the development in Chinese medical domain is relatively backward. It can be attributed to the difficulty of Chinese text processing and the lack of large-scale datasets. To bridge the gap, this paper introduces a Chinese medical QA dataset and proposes effective methods for the task. METHODS: We first construct a large scale Chinese medical QA dataset. Then we leverage deep matching neural networks to capture semantic interaction between words in questions and answers. Considering that Chinese Word Segmentation (CWS) tools may fail to identify clinical terms, we design a module to merge the word segments and produce a new representation. It learns the common compositions of words or segments by using convolutional kernels and selects the strongest signals by windowed pooling. RESULTS: The best performer among popular CWS tools on our dataset is found. In our experiments, deep matching models substantially outperform existing methods. Results also show that our proposed semantic clustered representation module improves the performance of models by up to 5.5% Precision at 1 and 4.9% Mean Average Precision. CONCLUSIONS: In this paper, we introduce a large scale Chinese medical QA dataset and cast the task into a semantic matching problem. We also compare different CWS tools and input units. Among the two state-of-the-art deep matching neural networks, MatchPyramid performs better. Results also show the effectiveness of the proposed semantic clustered representation module. BioMed Central 2019-04-09 /pmc/articles/PMC6454599/ /pubmed/30961607 http://dx.doi.org/10.1186/s12911-019-0761-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research He, Junqing Fu, Mingming Tu, Manshu Applying deep matching networks to Chinese medical question answering: a study and a dataset
title	Applying deep matching networks to Chinese medical question answering: a study and a dataset
title_full	Applying deep matching networks to Chinese medical question answering: a study and a dataset
title_fullStr	Applying deep matching networks to Chinese medical question answering: a study and a dataset
title_full_unstemmed	Applying deep matching networks to Chinese medical question answering: a study and a dataset
title_short	Applying deep matching networks to Chinese medical question answering: a study and a dataset
title_sort	applying deep matching networks to chinese medical question answering: a study and a dataset
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454599/ https://www.ncbi.nlm.nih.gov/pubmed/30961607 http://dx.doi.org/10.1186/s12911-019-0761-8
work_keys_str_mv	AT hejunqing applyingdeepmatchingnetworkstochinesemedicalquestionansweringastudyandadataset AT fumingming applyingdeepmatchingnetworkstochinesemedicalquestionansweringastudyandadataset AT tumanshu applyingdeepmatchingnetworkstochinesemedicalquestionansweringastudyandadataset

Applying deep matching networks to Chinese medical question answering: a study and a dataset

Ejemplares similares