Cargando…
What’s a good prediction? Challenges in evaluating an agent’s knowledge
Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remain an open challenge. The most common approaches to evaluating models is to assess their accuracy with respect to obse...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240643/ https://www.ncbi.nlm.nih.gov/pubmed/37284424 http://dx.doi.org/10.1177/10597123221095880 |
_version_ | 1785053813159231488 |
---|---|
author | Kearney, Alex Koop, Anna J Pilarski, Patrick M |
author_facet | Kearney, Alex Koop, Anna J Pilarski, Patrick M |
author_sort | Kearney, Alex |
collection | PubMed |
description | Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remain an open challenge. The most common approaches to evaluating models is to assess their accuracy with respect to observable values. However, the prevailing reliance on estimator accuracy as a proxy for the usefulness of the knowledge has the potential to lead us astray. We demonstrate the conflict between accuracy and usefulness through a series of illustrative examples including both a thought experiment and an empirical example in Minecraft, using the General Value Function framework (GVF). Having identified challenges in assessing an agent’s knowledge, we propose an alternate evaluation approach that arises naturally in the online continual learning setting: we recommend evaluation by examining internal learning processes, specifically the relevance of a GVF’s features to the prediction task at hand. This paper contributes a first look into evaluation of predictions through their use, an integral component of predictive knowledge which is as of yet unexplored. |
format | Online Article Text |
id | pubmed-10240643 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-102406432023-06-06 What’s a good prediction? Challenges in evaluating an agent’s knowledge Kearney, Alex Koop, Anna J Pilarski, Patrick M Adapt Behav Articles Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remain an open challenge. The most common approaches to evaluating models is to assess their accuracy with respect to observable values. However, the prevailing reliance on estimator accuracy as a proxy for the usefulness of the knowledge has the potential to lead us astray. We demonstrate the conflict between accuracy and usefulness through a series of illustrative examples including both a thought experiment and an empirical example in Minecraft, using the General Value Function framework (GVF). Having identified challenges in assessing an agent’s knowledge, we propose an alternate evaluation approach that arises naturally in the online continual learning setting: we recommend evaluation by examining internal learning processes, specifically the relevance of a GVF’s features to the prediction task at hand. This paper contributes a first look into evaluation of predictions through their use, an integral component of predictive knowledge which is as of yet unexplored. SAGE Publications 2022-06-09 2023-06 /pmc/articles/PMC10240643/ /pubmed/37284424 http://dx.doi.org/10.1177/10597123221095880 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Articles Kearney, Alex Koop, Anna J Pilarski, Patrick M What’s a good prediction? Challenges in evaluating an agent’s knowledge |
title | What’s a good prediction? Challenges in evaluating an agent’s knowledge |
title_full | What’s a good prediction? Challenges in evaluating an agent’s knowledge |
title_fullStr | What’s a good prediction? Challenges in evaluating an agent’s knowledge |
title_full_unstemmed | What’s a good prediction? Challenges in evaluating an agent’s knowledge |
title_short | What’s a good prediction? Challenges in evaluating an agent’s knowledge |
title_sort | what’s a good prediction? challenges in evaluating an agent’s knowledge |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10240643/ https://www.ncbi.nlm.nih.gov/pubmed/37284424 http://dx.doi.org/10.1177/10597123221095880 |
work_keys_str_mv | AT kearneyalex whatsagoodpredictionchallengesinevaluatinganagentsknowledge AT koopannaj whatsagoodpredictionchallengesinevaluatinganagentsknowledge AT pilarskipatrickm whatsagoodpredictionchallengesinevaluatinganagentsknowledge |