Cargando…

Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies

In recent years, Reinforcement learning (RL), especially Deep RL (DRL), has shown outstanding performance in video games from Atari, Mario, to StarCraft. However, little evidence has shown that DRL can be successfully applied to real-life human-centric tasks such as education or healthcare. Differen...

Descripción completa

Detalles Bibliográficos
Autores principales: Sanz Ausin, Markel, Maniktala, Mehak, Barnes, Tiffany, Chi, Min
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334180/
http://dx.doi.org/10.1007/978-3-030-52237-7_38
_version_ 1783553883115618304
author Sanz Ausin, Markel
Maniktala, Mehak
Barnes, Tiffany
Chi, Min
author_facet Sanz Ausin, Markel
Maniktala, Mehak
Barnes, Tiffany
Chi, Min
author_sort Sanz Ausin, Markel
collection PubMed
description In recent years, Reinforcement learning (RL), especially Deep RL (DRL), has shown outstanding performance in video games from Atari, Mario, to StarCraft. However, little evidence has shown that DRL can be successfully applied to real-life human-centric tasks such as education or healthcare. Different from classic game-playing where the RL goal is to make an agent smart, in human-centric tasks the ultimate RL goal is to make the human-agent interactions productive and fruitful. Additionally, in many real-life human-centric tasks, data can be noisy and limited. As a sub-field of RL, batch RL is designed for handling situations where data is limited yet noisy, and building simulations is challenging. In two consecutive classroom studies, we investigated applying batch DRL to the task of pedagogical policy induction for an Intelligent Tutoring System (ITS), and empirically evaluated the effectiveness of induced pedagogical policies. In Fall 2018 (F18), the DRL policy is compared against an expert-designed baseline policy and in Spring 2019 (S19), we examined the impact of explaining the batch DRL-induced policy with student decisions and the expert baseline policy. Our results showed that 1) while no significant difference was found between the batch RL-induced policy and the expert policy in F18, the batch RL-induced policy with simple explanations significantly improved students’ learning performance more than the expert policy alone in S19; and 2) no significant differences were found between the student decision making and the expert policy. Overall, our results suggest that pairing simple explanations with induced RL policies can be an important and effective technique for applying RL to real-life human-centric tasks.
format Online
Article
Text
id pubmed-7334180
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73341802020-07-06 Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies Sanz Ausin, Markel Maniktala, Mehak Barnes, Tiffany Chi, Min Artificial Intelligence in Education Article In recent years, Reinforcement learning (RL), especially Deep RL (DRL), has shown outstanding performance in video games from Atari, Mario, to StarCraft. However, little evidence has shown that DRL can be successfully applied to real-life human-centric tasks such as education or healthcare. Different from classic game-playing where the RL goal is to make an agent smart, in human-centric tasks the ultimate RL goal is to make the human-agent interactions productive and fruitful. Additionally, in many real-life human-centric tasks, data can be noisy and limited. As a sub-field of RL, batch RL is designed for handling situations where data is limited yet noisy, and building simulations is challenging. In two consecutive classroom studies, we investigated applying batch DRL to the task of pedagogical policy induction for an Intelligent Tutoring System (ITS), and empirically evaluated the effectiveness of induced pedagogical policies. In Fall 2018 (F18), the DRL policy is compared against an expert-designed baseline policy and in Spring 2019 (S19), we examined the impact of explaining the batch DRL-induced policy with student decisions and the expert baseline policy. Our results showed that 1) while no significant difference was found between the batch RL-induced policy and the expert policy in F18, the batch RL-induced policy with simple explanations significantly improved students’ learning performance more than the expert policy alone in S19; and 2) no significant differences were found between the student decision making and the expert policy. Overall, our results suggest that pairing simple explanations with induced RL policies can be an important and effective technique for applying RL to real-life human-centric tasks. 2020-06-09 /pmc/articles/PMC7334180/ http://dx.doi.org/10.1007/978-3-030-52237-7_38 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Sanz Ausin, Markel
Maniktala, Mehak
Barnes, Tiffany
Chi, Min
Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies
title Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies
title_full Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies
title_fullStr Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies
title_full_unstemmed Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies
title_short Exploring the Impact of Simple Explanations and Agency on Batch Deep Reinforcement Learning Induced Pedagogical Policies
title_sort exploring the impact of simple explanations and agency on batch deep reinforcement learning induced pedagogical policies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334180/
http://dx.doi.org/10.1007/978-3-030-52237-7_38
work_keys_str_mv AT sanzausinmarkel exploringtheimpactofsimpleexplanationsandagencyonbatchdeepreinforcementlearninginducedpedagogicalpolicies
AT maniktalamehak exploringtheimpactofsimpleexplanationsandagencyonbatchdeepreinforcementlearninginducedpedagogicalpolicies
AT barnestiffany exploringtheimpactofsimpleexplanationsandagencyonbatchdeepreinforcementlearninginducedpedagogicalpolicies
AT chimin exploringtheimpactofsimpleexplanationsandagencyonbatchdeepreinforcementlearninginducedpedagogicalpolicies