Cargando…
Entity perception of Two-Step-Matching framework for public opinions()
Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by B...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324343/ http://dx.doi.org/10.1016/j.jnlssr.2020.06.005 |
_version_ | 1783551920350167040 |
---|---|
author | Li, Ren-De Ma, Hao-Tian Wang, Zi-Yi Guo, Qiang Liu, Jian-Guo |
author_facet | Li, Ren-De Ma, Hao-Tian Wang, Zi-Yi Guo, Qiang Liu, Jian-Guo |
author_sort | Li, Ren-De |
collection | PubMed |
description | Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions. |
format | Online Article Text |
id | pubmed-7324343 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-73243432020-06-30 Entity perception of Two-Step-Matching framework for public opinions() Li, Ren-De Ma, Hao-Tian Wang, Zi-Yi Guo, Qiang Liu, Jian-Guo Journal of Safety Science and Resilience Article Entity perception of ambiguous user comments is a critical problem of target identification for huge amount of public opinions. In this paper, a Two-Step-Matching method is proposed to identify the precise target entity from multiple entities mentioned. Firstly, potential entities are extracted by BiLSTM-CRF model and characteristic words by TF-IDF model from public comments. Secondly, the first matching is implemented between potential entities and an official business directory by Jaro–Winkler distance algorithm. Then, in order to find the precise one, an industry-characteristic dictionary is developed into the second matching process. The precise entity is identified according to the count of characteristic words matching to industry-characteristic dictionary. In addition, associated rate (global indicator) and accuracy rate (sample indicator) are defined for evaluation of matching accuracy. The results for three data sets of public opinions about major public health events show that the highest associated rate and accuracy rate arrive at 0.93 and 0.95, averagely enhanced by 32% and 30% above the case of using the first matching process alone. This framework provides the method to find the true target entity of really wanted expression from public opinions. China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. 2020-09 2020-06-30 /pmc/articles/PMC7324343/ http://dx.doi.org/10.1016/j.jnlssr.2020.06.005 Text en © 2022 China Science Publishing & Media Ltd. Publishing Services by Elsevier B.V. on behalf of KeAi Communications Co. Ltd. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Li, Ren-De Ma, Hao-Tian Wang, Zi-Yi Guo, Qiang Liu, Jian-Guo Entity perception of Two-Step-Matching framework for public opinions() |
title | Entity perception of Two-Step-Matching framework for public opinions() |
title_full | Entity perception of Two-Step-Matching framework for public opinions() |
title_fullStr | Entity perception of Two-Step-Matching framework for public opinions() |
title_full_unstemmed | Entity perception of Two-Step-Matching framework for public opinions() |
title_short | Entity perception of Two-Step-Matching framework for public opinions() |
title_sort | entity perception of two-step-matching framework for public opinions() |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324343/ http://dx.doi.org/10.1016/j.jnlssr.2020.06.005 |
work_keys_str_mv | AT lirende entityperceptionoftwostepmatchingframeworkforpublicopinions AT mahaotian entityperceptionoftwostepmatchingframeworkforpublicopinions AT wangziyi entityperceptionoftwostepmatchingframeworkforpublicopinions AT guoqiang entityperceptionoftwostepmatchingframeworkforpublicopinions AT liujianguo entityperceptionoftwostepmatchingframeworkforpublicopinions |