Cargando…

Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution

The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing...

Descripción completa

Detalles Bibliográficos
Autores principales: Kölbl, Max, Kyogoku, Yuki, Philipp, J. Nathanael, Richter, Michael, Rietdorf, Clements, Yousef, Tariq
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988042/
https://www.ncbi.nlm.nih.gov/pubmed/35402902
http://dx.doi.org/10.3389/frai.2022.801564
_version_ 1784682875155644416
author Kölbl, Max
Kyogoku, Yuki
Philipp, J. Nathanael
Richter, Michael
Rietdorf, Clements
Yousef, Tariq
author_facet Kölbl, Max
Kyogoku, Yuki
Philipp, J. Nathanael
Richter, Michael
Rietdorf, Clements
Yousef, Tariq
author_sort Kölbl, Max
collection PubMed
description The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a quasi-metric. We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets.
format Online
Article
Text
id pubmed-8988042
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89880422022-04-08 Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq Front Artif Intell Artificial Intelligence The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a quasi-metric. We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets. Frontiers Media S.A. 2022-03-24 /pmc/articles/PMC8988042/ /pubmed/35402902 http://dx.doi.org/10.3389/frai.2022.801564 Text en Copyright © 2022 Kölbl, Kyogoku, Philipp, Richter, Rietdorf and Yousef. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Kölbl, Max
Kyogoku, Yuki
Philipp, J. Nathanael
Richter, Michael
Rietdorf, Clements
Yousef, Tariq
Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_full Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_fullStr Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_full_unstemmed Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_short Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_sort beyond the failure of direct-matching in keyword evaluation: a sketch of a graph based solution
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988042/
https://www.ncbi.nlm.nih.gov/pubmed/35402902
http://dx.doi.org/10.3389/frai.2022.801564
work_keys_str_mv AT kolblmax beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution
AT kyogokuyuki beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution
AT philippjnathanael beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution
AT richtermichael beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution
AT rietdorfclements beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution
AT youseftariq beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution