Cargando…

Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent

OBJECTIVES: Anaphora recognition is a process to identify exactly which noun has been used previously and relates to a pronoun that is included in a specific sentence later. Therefore, anaphora recognition is an essential element of a dialogue agent system. In the current study, all the merits of ru...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Junggi, Lee, Youngho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Medical Informatics 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4231177/
https://www.ncbi.nlm.nih.gov/pubmed/25405063
http://dx.doi.org/10.4258/hir.2014.20.4.272
_version_ 1782344397880295424
author Yang, Junggi
Lee, Youngho
author_facet Yang, Junggi
Lee, Youngho
author_sort Yang, Junggi
collection PubMed
description OBJECTIVES: Anaphora recognition is a process to identify exactly which noun has been used previously and relates to a pronoun that is included in a specific sentence later. Therefore, anaphora recognition is an essential element of a dialogue agent system. In the current study, all the merits of rule-based, machine learning-based, semantic-based anaphora recognition systems were combined to design and realize a new hybrid-type anaphora recognition system with an optimum capacity. METHODS: Anaphora recognition rules were encoded on the basis of the internal traits of referred expressions and adjacent contexts to realize a rule-based system and to serve as a baseline. A semantic database, related to predicate instances of sentences including referred expressions, was constructed to identify semantic co-relationships between the referent candidates (to which semantic tags were attached) and the semantic information of predicates. This approach would upgrade the anaphora recognition system by reducing the number of referent candidates. Additionally, to realize a machine learning-based system, an anaphora recognition model was developed on the basis of training data, which indicated referred expressions and referents. The three methods were further combined to develop a new single hybrid-based anaphora recognition system. RESULTS: The precision rate of the rule-based systems was 54.9%. However, the precision rate of the hybrid-based system was 63.7%, proving it to be the most efficient method. CONCLUSIONS: The hybrid-based method, developed by the combination of rule-based and machine learning-based methods, represents a new system with enhanced functional capabilities as compared to other pre-existing individual methods.
format Online
Article
Text
id pubmed-4231177
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Korean Society of Medical Informatics
record_format MEDLINE/PubMed
spelling pubmed-42311772014-11-17 Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent Yang, Junggi Lee, Youngho Healthc Inform Res Original Article OBJECTIVES: Anaphora recognition is a process to identify exactly which noun has been used previously and relates to a pronoun that is included in a specific sentence later. Therefore, anaphora recognition is an essential element of a dialogue agent system. In the current study, all the merits of rule-based, machine learning-based, semantic-based anaphora recognition systems were combined to design and realize a new hybrid-type anaphora recognition system with an optimum capacity. METHODS: Anaphora recognition rules were encoded on the basis of the internal traits of referred expressions and adjacent contexts to realize a rule-based system and to serve as a baseline. A semantic database, related to predicate instances of sentences including referred expressions, was constructed to identify semantic co-relationships between the referent candidates (to which semantic tags were attached) and the semantic information of predicates. This approach would upgrade the anaphora recognition system by reducing the number of referent candidates. Additionally, to realize a machine learning-based system, an anaphora recognition model was developed on the basis of training data, which indicated referred expressions and referents. The three methods were further combined to develop a new single hybrid-based anaphora recognition system. RESULTS: The precision rate of the rule-based systems was 54.9%. However, the precision rate of the hybrid-based system was 63.7%, proving it to be the most efficient method. CONCLUSIONS: The hybrid-based method, developed by the combination of rule-based and machine learning-based methods, represents a new system with enhanced functional capabilities as compared to other pre-existing individual methods. Korean Society of Medical Informatics 2014-10 2014-10-31 /pmc/articles/PMC4231177/ /pubmed/25405063 http://dx.doi.org/10.4258/hir.2014.20.4.272 Text en © 2014 The Korean Society of Medical Informatics http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Yang, Junggi
Lee, Youngho
Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent
title Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent
title_full Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent
title_fullStr Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent
title_full_unstemmed Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent
title_short Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent
title_sort korean anaphora recognition system to develop healthcare dialogue-type agent
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4231177/
https://www.ncbi.nlm.nih.gov/pubmed/25405063
http://dx.doi.org/10.4258/hir.2014.20.4.272
work_keys_str_mv AT yangjunggi koreananaphorarecognitionsystemtodevelophealthcaredialoguetypeagent
AT leeyoungho koreananaphorarecognitionsystemtodevelophealthcaredialoguetypeagent