Cargando…

Entity perception of Two-Step-Matching framework for public opinions()

Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by B...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Ren-De, Ma, Hao-Tian, Wang, Zi-Yi, Guo, Qiang, Liu, Jian-Guo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324343/
http://dx.doi.org/10.1016/j.jnlssr.2020.06.005
_version_ 1783551920350167040
author Li, Ren-De
Ma, Hao-Tian
Wang, Zi-Yi
Guo, Qiang
Liu, Jian-Guo
author_facet Li, Ren-De
Ma, Hao-Tian
Wang, Zi-Yi
Guo, Qiang
Liu, Jian-Guo
author_sort Li, Ren-De
collection PubMed
description Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions.
format Online
Article
Text
id pubmed-7324343
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd.
record_format MEDLINE/PubMed
spelling pubmed-73243432020-06-30 Entity perception of Two-Step-Matching framework for public opinions() Li, Ren-De Ma, Hao-Tian Wang, Zi-Yi Guo, Qiang Liu, Jian-Guo Journal of Safety Science and Resilience Article Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions. China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. 2020-09 2020-06-30 /pmc/articles/PMC7324343/ http://dx.doi.org/10.1016/j.jnlssr.2020.06.005 Text en © 2022 China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Li, Ren-De
Ma, Hao-Tian
Wang, Zi-Yi
Guo, Qiang
Liu, Jian-Guo
Entity perception of Two-Step-Matching framework for public opinions()
title Entity perception of Two-Step-Matching framework for public opinions()
title_full Entity perception of Two-Step-Matching framework for public opinions()
title_fullStr Entity perception of Two-Step-Matching framework for public opinions()
title_full_unstemmed Entity perception of Two-Step-Matching framework for public opinions()
title_short Entity perception of Two-Step-Matching framework for public opinions()
title_sort entity perception of two-step-matching framework for public opinions()
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324343/
http://dx.doi.org/10.1016/j.jnlssr.2020.06.005
work_keys_str_mv AT lirende entityperceptionoftwostepmatchingframeworkforpublicopinions
AT mahaotian entityperceptionoftwostepmatchingframeworkforpublicopinions
AT wangziyi entityperceptionoftwostepmatchingframeworkforpublicopinions
AT guoqiang entityperceptionoftwostepmatchingframeworkforpublicopinions
AT liujianguo entityperceptionoftwostepmatchingframeworkforpublicopinions