Cargando…

Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution

The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kölbl, Max, Kyogoku, Yuki, Philipp, J. Nathanael, Richter, Michael, Rietdorf, Clements, Yousef, Tariq
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988042/ https://www.ncbi.nlm.nih.gov/pubmed/35402902 http://dx.doi.org/10.3389/frai.2022.801564

_version_	1784682875155644416
author	Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq
author_facet	Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq
author_sort	Kölbl, Max
collection	PubMed
description	The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a quasi-metric. We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets.
format	Online Article Text
id	pubmed-8988042
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-89880422022-04-08 Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq Front Artif Intell Artificial Intelligence The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a quasi-metric. We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets. Frontiers Media S.A. 2022-03-24 /pmc/articles/PMC8988042/ /pubmed/35402902 http://dx.doi.org/10.3389/frai.2022.801564 Text en Copyright © 2022 Kölbl, Kyogoku, Philipp, Richter, Rietdorf and Yousef. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Artificial Intelligence Kölbl, Max Kyogoku, Yuki Philipp, J. Nathanael Richter, Michael Rietdorf, Clements Yousef, Tariq Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title	Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_full	Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_fullStr	Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_full_unstemmed	Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_short	Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution
title_sort	beyond the failure of direct-matching in keyword evaluation: a sketch of a graph based solution
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8988042/ https://www.ncbi.nlm.nih.gov/pubmed/35402902 http://dx.doi.org/10.3389/frai.2022.801564
work_keys_str_mv	AT kolblmax beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT kyogokuyuki beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT philippjnathanael beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT richtermichael beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT rietdorfclements beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution AT youseftariq beyondthefailureofdirectmatchinginkeywordevaluationasketchofagraphbasedsolution

Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution

Ejemplares similares