Cargando…
A Graph-Based Keyphrase Extraction Model with Three-Way Decision
Keyphrase extraction has been a popular research topic in the field of natural language processing in recent years. But how to extract keyphrases precisely and effectively is still a challenge. The mainstream methods are supervised learning methods and graph-based methods. Generally, the effects of...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338155/ http://dx.doi.org/10.1007/978-3-030-52705-1_8 |
_version_ | 1783554620723822592 |
---|---|
author | Chen, Tianlei Miao, Duoqian Zhang, Yuebing |
author_facet | Chen, Tianlei Miao, Duoqian Zhang, Yuebing |
author_sort | Chen, Tianlei |
collection | PubMed |
description | Keyphrase extraction has been a popular research topic in the field of natural language processing in recent years. But how to extract keyphrases precisely and effectively is still a challenge. The mainstream methods are supervised learning methods and graph-based methods. Generally, the effects of supervised methods are better than unsupervised methods. However, there are many problems in supervised methods such as the difficulty in obtaining training data, the cost of labeling and the limitation of the classification function trained by training data. In recent years, the development of the graph-based method has made great progress and its performance of extraction is getting closer and closer to the supervised method, so the graph-based method of keyphrase extraction has got a wide concern from researchers. In this paper, we propose a new model that applies the three-way decision theory to graph-based keyphrase extraction model. In our model, we propose algorithms dividing the set of candidate phrases into the positive domain, the boundary domain and the negative domain depending on graph-based attributes, and combining candidate phrases in the positive domain and the boundary domain qualified by graph-based attributes and non- graph-based attributes to get keyphrases. Experimental results show that our model can effectively improve the extraction precision compared with baseline methods. |
format | Online Article Text |
id | pubmed-7338155 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-73381552020-07-07 A Graph-Based Keyphrase Extraction Model with Three-Way Decision Chen, Tianlei Miao, Duoqian Zhang, Yuebing Rough Sets Article Keyphrase extraction has been a popular research topic in the field of natural language processing in recent years. But how to extract keyphrases precisely and effectively is still a challenge. The mainstream methods are supervised learning methods and graph-based methods. Generally, the effects of supervised methods are better than unsupervised methods. However, there are many problems in supervised methods such as the difficulty in obtaining training data, the cost of labeling and the limitation of the classification function trained by training data. In recent years, the development of the graph-based method has made great progress and its performance of extraction is getting closer and closer to the supervised method, so the graph-based method of keyphrase extraction has got a wide concern from researchers. In this paper, we propose a new model that applies the three-way decision theory to graph-based keyphrase extraction model. In our model, we propose algorithms dividing the set of candidate phrases into the positive domain, the boundary domain and the negative domain depending on graph-based attributes, and combining candidate phrases in the positive domain and the boundary domain qualified by graph-based attributes and non- graph-based attributes to get keyphrases. Experimental results show that our model can effectively improve the extraction precision compared with baseline methods. 2020-06-10 /pmc/articles/PMC7338155/ http://dx.doi.org/10.1007/978-3-030-52705-1_8 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Chen, Tianlei Miao, Duoqian Zhang, Yuebing A Graph-Based Keyphrase Extraction Model with Three-Way Decision |
title | A Graph-Based Keyphrase Extraction Model with Three-Way Decision |
title_full | A Graph-Based Keyphrase Extraction Model with Three-Way Decision |
title_fullStr | A Graph-Based Keyphrase Extraction Model with Three-Way Decision |
title_full_unstemmed | A Graph-Based Keyphrase Extraction Model with Three-Way Decision |
title_short | A Graph-Based Keyphrase Extraction Model with Three-Way Decision |
title_sort | graph-based keyphrase extraction model with three-way decision |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338155/ http://dx.doi.org/10.1007/978-3-030-52705-1_8 |
work_keys_str_mv | AT chentianlei agraphbasedkeyphraseextractionmodelwiththreewaydecision AT miaoduoqian agraphbasedkeyphraseextractionmodelwiththreewaydecision AT zhangyuebing agraphbasedkeyphraseextractionmodelwiththreewaydecision AT chentianlei graphbasedkeyphraseextractionmodelwiththreewaydecision AT miaoduoqian graphbasedkeyphraseextractionmodelwiththreewaydecision AT zhangyuebing graphbasedkeyphraseextractionmodelwiththreewaydecision |