Cargando…

Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †

We consider a robot that must sort objects transported by a conveyor belt into different classes. Multiple observations must be performed before taking a decision on the class of each object, because the imperfect sensing sometimes detects the incorrect object class. The objective is to sort the seq...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mezei, Ady-Daniel, Tamás, Levente, Buşoniu, Lucian
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7249059/ https://www.ncbi.nlm.nih.gov/pubmed/32349393 http://dx.doi.org/10.3390/s20092481

_version_	1783538515751993344
author	Mezei, Ady-Daniel Tamás, Levente Buşoniu, Lucian
author_facet	Mezei, Ady-Daniel Tamás, Levente Buşoniu, Lucian
author_sort	Mezei, Ady-Daniel
collection	PubMed
description	We consider a robot that must sort objects transported by a conveyor belt into different classes. Multiple observations must be performed before taking a decision on the class of each object, because the imperfect sensing sometimes detects the incorrect object class. The objective is to sort the sequence of objects in a minimal number of observation and decision steps. We describe this task in the framework of partially observable Markov decision processes, and we propose a reward function that explicitly takes into account the information gain of the viewpoint selection actions applied. The DESPOT algorithm is applied to solve the problem, automatically obtaining a sequence of observation viewpoints and class decision actions. Observations are made either only for the object on the first position of the conveyor belt or for multiple adjacent positions at once. The performance of the single- and multiple-position variants is compared, and the impact of including the information gain is analyzed. Real-life experiments with a Baxter robot and an industrial conveyor belt are provided.
format	Online Article Text
id	pubmed-7249059
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-72490592020-06-10 Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards † Mezei, Ady-Daniel Tamás, Levente Buşoniu, Lucian Sensors (Basel) Article We consider a robot that must sort objects transported by a conveyor belt into different classes. Multiple observations must be performed before taking a decision on the class of each object, because the imperfect sensing sometimes detects the incorrect object class. The objective is to sort the sequence of objects in a minimal number of observation and decision steps. We describe this task in the framework of partially observable Markov decision processes, and we propose a reward function that explicitly takes into account the information gain of the viewpoint selection actions applied. The DESPOT algorithm is applied to solve the problem, automatically obtaining a sequence of observation viewpoints and class decision actions. Observations are made either only for the object on the first position of the conveyor belt or for multiple adjacent positions at once. The performance of the single- and multiple-position variants is compared, and the impact of including the information gain is analyzed. Real-life experiments with a Baxter robot and an industrial conveyor belt are provided. MDPI 2020-04-27 /pmc/articles/PMC7249059/ /pubmed/32349393 http://dx.doi.org/10.3390/s20092481 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Mezei, Ady-Daniel Tamás, Levente Buşoniu, Lucian Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †
title	Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †
title_full	Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †
title_fullStr	Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †
title_full_unstemmed	Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †
title_short	Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †
title_sort	sorting objects from a conveyor belt using pomdps with multiple-object observations and information-gain rewards †
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7249059/ https://www.ncbi.nlm.nih.gov/pubmed/32349393 http://dx.doi.org/10.3390/s20092481
work_keys_str_mv	AT mezeiadydaniel sortingobjectsfromaconveyorbeltusingpomdpswithmultipleobjectobservationsandinformationgainrewards AT tamaslevente sortingobjectsfromaconveyorbeltusingpomdpswithmultipleobjectobservationsandinformationgainrewards AT busoniulucian sortingobjectsfromaconveyorbeltusingpomdpswithmultipleobjectobservationsandinformationgainrewards

Sorting Objects from a Conveyor Belt Using POMDPs with Multiple-Object Observations and Information-Gain Rewards †

Ejemplares similares