Cargando…

What’s a good prediction? Challenges in evaluating an agent’s knowledge

Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remain an open challenge. The most common approaches to evaluating models is to assess their accuracy with respect to obse...

Descripción completa

Detalles Bibliográficos
Autores principales: Kearney, Alex, Koop, Anna J, Pilarski, Patrick M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240643/
https://www.ncbi.nlm.nih.gov/pubmed/37284424
http://dx.doi.org/10.1177/10597123221095880
_version_ 1785053813159231488
author Kearney, Alex
Koop, Anna J
Pilarski, Patrick M
author_facet Kearney, Alex
Koop, Anna J
Pilarski, Patrick M
author_sort Kearney, Alex
collection PubMed
description Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remain an open challenge. The most common approaches to evaluating models is to assess their accuracy with respect to observable values. However, the prevailing reliance on estimator accuracy as a proxy for the usefulness of the knowledge has the potential to lead us astray. We demonstrate the conflict between accuracy and usefulness through a series of illustrative examples including both a thought experiment and an empirical example in Minecraft, using the General Value Function framework (GVF). Having identified challenges in assessing an agent’s knowledge, we propose an alternate evaluation approach that arises naturally in the online continual learning setting: we recommend evaluation by examining internal learning processes, specifically the relevance of a GVF’s features to the prediction task at hand. This paper contributes a first look into evaluation of predictions through their use, an integral component of predictive knowledge which is as of yet unexplored.
format Online
Article
Text
id pubmed-10240643
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-102406432023-06-06 What’s a good prediction? Challenges in evaluating an agent’s knowledge Kearney, Alex Koop, Anna J Pilarski, Patrick M Adapt Behav Articles Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remain an open challenge. The most common approaches to evaluating models is to assess their accuracy with respect to observable values. However, the prevailing reliance on estimator accuracy as a proxy for the usefulness of the knowledge has the potential to lead us astray. We demonstrate the conflict between accuracy and usefulness through a series of illustrative examples including both a thought experiment and an empirical example in Minecraft, using the General Value Function framework (GVF). Having identified challenges in assessing an agent’s knowledge, we propose an alternate evaluation approach that arises naturally in the online continual learning setting: we recommend evaluation by examining internal learning processes, specifically the relevance of a GVF’s features to the prediction task at hand. This paper contributes a first look into evaluation of predictions through their use, an integral component of predictive knowledge which is as of yet unexplored. SAGE Publications 2022-06-09 2023-06 /pmc/articles/PMC10240643/ /pubmed/37284424 http://dx.doi.org/10.1177/10597123221095880 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Articles
Kearney, Alex
Koop, Anna J
Pilarski, Patrick M
What’s a good prediction? Challenges in evaluating an agent’s knowledge
title What’s a good prediction? Challenges in evaluating an agent’s knowledge
title_full What’s a good prediction? Challenges in evaluating an agent’s knowledge
title_fullStr What’s a good prediction? Challenges in evaluating an agent’s knowledge
title_full_unstemmed What’s a good prediction? Challenges in evaluating an agent’s knowledge
title_short What’s a good prediction? Challenges in evaluating an agent’s knowledge
title_sort what’s a good prediction? challenges in evaluating an agent’s knowledge
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240643/
https://www.ncbi.nlm.nih.gov/pubmed/37284424
http://dx.doi.org/10.1177/10597123221095880
work_keys_str_mv AT kearneyalex whatsagoodpredictionchallengesinevaluatinganagentsknowledge
AT koopannaj whatsagoodpredictionchallengesinevaluatinganagentsknowledge
AT pilarskipatrickm whatsagoodpredictionchallengesinevaluatinganagentsknowledge