Cargando…

Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading

One of the key challenges in implementing reinforcement learning methods for real-world robotic applications is the design of a suitable reward function. In field robotics, the absence of abundant datasets, limited training time, and high variation of environmental conditions complicate the task fur...

Descripción completa

Detalles Bibliográficos
Autores principales:	Strokina, Nataliya, Yang, Wenyan, Pajarinen, Joni, Serbenyuk, Nikolay, Kämäräinen, Joni, Ghabcheloo, Reza
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9194444/ https://www.ncbi.nlm.nih.gov/pubmed/35712549 http://dx.doi.org/10.3389/frobt.2022.838059

_version_	1784726728999960576
author	Strokina, Nataliya Yang, Wenyan Pajarinen, Joni Serbenyuk, Nikolay Kämäräinen, Joni Ghabcheloo, Reza
author_facet	Strokina, Nataliya Yang, Wenyan Pajarinen, Joni Serbenyuk, Nikolay Kämäräinen, Joni Ghabcheloo, Reza
author_sort	Strokina, Nataliya
collection	PubMed
description	One of the key challenges in implementing reinforcement learning methods for real-world robotic applications is the design of a suitable reward function. In field robotics, the absence of abundant datasets, limited training time, and high variation of environmental conditions complicate the task further. In this paper, we review reward learning techniques together with visual representations commonly used in current state-of-the-art works in robotics. We investigate a practical approach proposed in prior work to associate the reward with the stage of the progress in task completion based on visual observation. This approach was demonstrated in controlled laboratory conditions. We study its potential for a real-scale field application, autonomous pile loading, tested outdoors in three seasons: summer, autumn, and winter. In our framework, the cumulative reward combines the predictions about the process stage and the task completion (terminal stage). We use supervised classification methods to train prediction models and investigate the most common state-of-the-art visual representations. We use task-specific contrastive features for terminal stage prediction.
format	Online Article Text
id	pubmed-9194444
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-91944442022-06-15 Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading Strokina, Nataliya Yang, Wenyan Pajarinen, Joni Serbenyuk, Nikolay Kämäräinen, Joni Ghabcheloo, Reza Front Robot AI Robotics and AI One of the key challenges in implementing reinforcement learning methods for real-world robotic applications is the design of a suitable reward function. In field robotics, the absence of abundant datasets, limited training time, and high variation of environmental conditions complicate the task further. In this paper, we review reward learning techniques together with visual representations commonly used in current state-of-the-art works in robotics. We investigate a practical approach proposed in prior work to associate the reward with the stage of the progress in task completion based on visual observation. This approach was demonstrated in controlled laboratory conditions. We study its potential for a real-scale field application, autonomous pile loading, tested outdoors in three seasons: summer, autumn, and winter. In our framework, the cumulative reward combines the predictions about the process stage and the task completion (terminal stage). We use supervised classification methods to train prediction models and investigate the most common state-of-the-art visual representations. We use task-specific contrastive features for terminal stage prediction. Frontiers Media S.A. 2022-05-31 /pmc/articles/PMC9194444/ /pubmed/35712549 http://dx.doi.org/10.3389/frobt.2022.838059 Text en Copyright © 2022 Strokina, Yang, Pajarinen, Serbenyuk, Kämäräinen and Ghabcheloo. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Robotics and AI Strokina, Nataliya Yang, Wenyan Pajarinen, Joni Serbenyuk, Nikolay Kämäräinen, Joni Ghabcheloo, Reza Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading
title	Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading
title_full	Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading
title_fullStr	Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading
title_full_unstemmed	Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading
title_short	Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading
title_sort	visual rewards from observation for sequential tasks: autonomous pile loading
topic	Robotics and AI
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9194444/ https://www.ncbi.nlm.nih.gov/pubmed/35712549 http://dx.doi.org/10.3389/frobt.2022.838059
work_keys_str_mv	AT strokinanataliya visualrewardsfromobservationforsequentialtasksautonomouspileloading AT yangwenyan visualrewardsfromobservationforsequentialtasksautonomouspileloading AT pajarinenjoni visualrewardsfromobservationforsequentialtasksautonomouspileloading AT serbenyuknikolay visualrewardsfromobservationforsequentialtasksautonomouspileloading AT kamarainenjoni visualrewardsfromobservationforsequentialtasksautonomouspileloading AT ghabchelooreza visualrewardsfromobservationforsequentialtasksautonomouspileloading

Visual Rewards From Observation for Sequential Tasks: Autonomous Pile Loading

Ejemplares similares