Cargando…
Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988042/ https://www.ncbi.nlm.nih.gov/pubmed/35402902 http://dx.doi.org/10.3389/frai.2022.801564 |
_version_ | 1784682875155644416 |
---|---|
author | Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq |
author_facet | Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq |
author_sort | Kölbl, Max |
collection | PubMed |
description | The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a quasi-metric. We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets. |
format | Online Article Text |
id | pubmed-8988042 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-89880422022-04-08 Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq Front Artif Intell Artificial Intelligence The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a quasi-metric. We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets. Frontiers Media S.A. 2022-03-24 /pmc/articles/PMC8988042/ /pubmed/35402902 http://dx.doi.org/10.3389/frai.2022.801564 Text en Copyright © 2022 Kölbl, Kyogoku, Philipp, Richter, Rietdorf and Yousef. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution |
title | Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution |
title_full | Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution |
title_fullStr | Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution |
title_full_unstemmed | Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution |
title_short | Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution |
title_sort | beyond the failure of direct-matching in keyword evaluation: a sketch of a graph based solution |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988042/ https://www.ncbi.nlm.nih.gov/pubmed/35402902 http://dx.doi.org/10.3389/frai.2022.801564 |
work_keys_str_mv | AT kolblmax beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT kyogokuyuki beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT philippjnathanael beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT richtermichael beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT rietdorfclements beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT youseftariq beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution |