Cargando…

Representing, learning, and controlling complex object interactions

We present a framework for representing scenarios with complex object interactions, where a robot cannot directly interact with the object it wishes to control and must instead influence it via intermediate objects. For instance, a robot learning to drive a car can only change the car’s pose indirec...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yilun, Burchfiel, Benjamin, Konidaris, George
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417403/
https://www.ncbi.nlm.nih.gov/pubmed/30956402
http://dx.doi.org/10.1007/s10514-018-9740-7
_version_ 1783403567134015488
author Zhou, Yilun
Burchfiel, Benjamin
Konidaris, George
author_facet Zhou, Yilun
Burchfiel, Benjamin
Konidaris, George
author_sort Zhou, Yilun
collection PubMed
description We present a framework for representing scenarios with complex object interactions, where a robot cannot directly interact with the object it wishes to control and must instead influence it via intermediate objects. For instance, a robot learning to drive a car can only change the car’s pose indirectly via the steering wheel, and must represent and reason about the relationship between its own grippers and the steering wheel, and the relationship between the steering wheel and the car. We formalize these interactions as chains and graphs of Markov decision processes (MDPs) and show how such models can be learned from data. We also consider how they can be controlled given known or learned dynamics. We show that our complex model can be collapsed into a single MDP and solved to find an optimal policy for the combined system. Since the resulting MDP may be very large, we also introduce a planning algorithm that efficiently produces a potentially suboptimal policy. We apply these models to two systems in which a robot uses learning from demonstration to achieve indirect control: playing a computer game using a joystick, and using a hot water dispenser to heat a cup of water.
format Online
Article
Text
id pubmed-6417403
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-64174032019-04-03 Representing, learning, and controlling complex object interactions Zhou, Yilun Burchfiel, Benjamin Konidaris, George Auton Robots Article We present a framework for representing scenarios with complex object interactions, where a robot cannot directly interact with the object it wishes to control and must instead influence it via intermediate objects. For instance, a robot learning to drive a car can only change the car’s pose indirectly via the steering wheel, and must represent and reason about the relationship between its own grippers and the steering wheel, and the relationship between the steering wheel and the car. We formalize these interactions as chains and graphs of Markov decision processes (MDPs) and show how such models can be learned from data. We also consider how they can be controlled given known or learned dynamics. We show that our complex model can be collapsed into a single MDP and solved to find an optimal policy for the combined system. Since the resulting MDP may be very large, we also introduce a planning algorithm that efficiently produces a potentially suboptimal policy. We apply these models to two systems in which a robot uses learning from demonstration to achieve indirect control: playing a computer game using a joystick, and using a hot water dispenser to heat a cup of water. Springer US 2018-04-30 2018 /pmc/articles/PMC6417403/ /pubmed/30956402 http://dx.doi.org/10.1007/s10514-018-9740-7 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Zhou, Yilun
Burchfiel, Benjamin
Konidaris, George
Representing, learning, and controlling complex object interactions
title Representing, learning, and controlling complex object interactions
title_full Representing, learning, and controlling complex object interactions
title_fullStr Representing, learning, and controlling complex object interactions
title_full_unstemmed Representing, learning, and controlling complex object interactions
title_short Representing, learning, and controlling complex object interactions
title_sort representing, learning, and controlling complex object interactions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6417403/
https://www.ncbi.nlm.nih.gov/pubmed/30956402
http://dx.doi.org/10.1007/s10514-018-9740-7
work_keys_str_mv AT zhouyilun representinglearningandcontrollingcomplexobjectinteractions
AT burchfielbenjamin representinglearningandcontrollingcomplexobjectinteractions
AT konidarisgeorge representinglearningandcontrollingcomplexobjectinteractions