Cargando…

Teaching Multiple Inverse Reinforcement Learners

In this paper, we propose the first machine teaching algorithm for multiple inverse reinforcement learners. As our initial contribution, we formalize the problem of optimally teaching a sequential task to a heterogeneous class of learners. We then contribute a theoretical analysis of such problem, i...

Descripción completa

Detalles Bibliográficos
Autores principales:	Melo, Francisco S., Lopes, Manuel
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8482012/ https://www.ncbi.nlm.nih.gov/pubmed/34604737 http://dx.doi.org/10.3389/frai.2021.625183

_version_	1784576804427661312
author	Melo, Francisco S. Lopes, Manuel
author_facet	Melo, Francisco S. Lopes, Manuel
author_sort	Melo, Francisco S.
collection	PubMed
description	In this paper, we propose the first machine teaching algorithm for multiple inverse reinforcement learners. As our initial contribution, we formalize the problem of optimally teaching a sequential task to a heterogeneous class of learners. We then contribute a theoretical analysis of such problem, identifying conditions under which it is possible to conduct such teaching using the same demonstration for all learners. Our analysis shows that, contrary to other teaching problems, teaching a sequential task to a heterogeneous class of learners with a single demonstration may not be possible, as the differences between individual agents increase. We then contribute two algorithms that address the main difficulties identified by our theoretical analysis. The first algorithm, which we dub SplitTeach, starts by teaching the class as a whole until all students have learned all that they can learn as a group; it then teaches each student individually, ensuring that all students are able to perfectly acquire the target task. The second approach, which we dub JointTeach, selects a single demonstration to be provided to the whole class so that all students learn the target task as well as a single demonstration allows. While SplitTeach ensures optimal teaching at the cost of a bigger teaching effort, JointTeach ensures minimal effort, although the learners are not guaranteed to perfectly recover the target task. We conclude by illustrating our methods in several simulation domains. The simulation results agree with our theoretical findings, showcasing that indeed class teaching is not possible in the presence of heterogeneous students. At the same time, they also illustrate the main properties of our proposed algorithms: in all domains, SplitTeach guarantees perfect teaching and, in terms of teaching effort, is always at least as good as individualized teaching (often better); on the other hand, JointTeach attains minimal teaching effort in all domains, even if sometimes it compromises the teaching performance.
format	Online Article Text
id	pubmed-8482012
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-84820122021-10-01 Teaching Multiple Inverse Reinforcement Learners Melo, Francisco S. Lopes, Manuel Front Artif Intell Artificial Intelligence In this paper, we propose the first machine teaching algorithm for multiple inverse reinforcement learners. As our initial contribution, we formalize the problem of optimally teaching a sequential task to a heterogeneous class of learners. We then contribute a theoretical analysis of such problem, identifying conditions under which it is possible to conduct such teaching using the same demonstration for all learners. Our analysis shows that, contrary to other teaching problems, teaching a sequential task to a heterogeneous class of learners with a single demonstration may not be possible, as the differences between individual agents increase. We then contribute two algorithms that address the main difficulties identified by our theoretical analysis. The first algorithm, which we dub SplitTeach, starts by teaching the class as a whole until all students have learned all that they can learn as a group; it then teaches each student individually, ensuring that all students are able to perfectly acquire the target task. The second approach, which we dub JointTeach, selects a single demonstration to be provided to the whole class so that all students learn the target task as well as a single demonstration allows. While SplitTeach ensures optimal teaching at the cost of a bigger teaching effort, JointTeach ensures minimal effort, although the learners are not guaranteed to perfectly recover the target task. We conclude by illustrating our methods in several simulation domains. The simulation results agree with our theoretical findings, showcasing that indeed class teaching is not possible in the presence of heterogeneous students. At the same time, they also illustrate the main properties of our proposed algorithms: in all domains, SplitTeach guarantees perfect teaching and, in terms of teaching effort, is always at least as good as individualized teaching (often better); on the other hand, JointTeach attains minimal teaching effort in all domains, even if sometimes it compromises the teaching performance. Frontiers Media S.A. 2021-09-16 /pmc/articles/PMC8482012/ /pubmed/34604737 http://dx.doi.org/10.3389/frai.2021.625183 Text en Copyright © 2021 Melo and Lopes. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Artificial Intelligence Melo, Francisco S. Lopes, Manuel Teaching Multiple Inverse Reinforcement Learners
title	Teaching Multiple Inverse Reinforcement Learners
title_full	Teaching Multiple Inverse Reinforcement Learners
title_fullStr	Teaching Multiple Inverse Reinforcement Learners
title_full_unstemmed	Teaching Multiple Inverse Reinforcement Learners
title_short	Teaching Multiple Inverse Reinforcement Learners
title_sort	teaching multiple inverse reinforcement learners
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8482012/ https://www.ncbi.nlm.nih.gov/pubmed/34604737 http://dx.doi.org/10.3389/frai.2021.625183
work_keys_str_mv	AT melofranciscos teachingmultipleinversereinforcementlearners AT lopesmanuel teachingmultipleinversereinforcementlearners

Teaching Multiple Inverse Reinforcement Learners

Ejemplares similares