Cargando…

A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots

Object detection and classification have countless applications in human–robot interacting systems. It is a necessary skill for autonomous robots that perform tasks in household scenarios. Despite the great advances in deep learning and computer vision, social robots performing non-trivial tasks usu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gutiérrez, Marco A., Manso, Luis J., Pandya, Harit, Núñez, Pedro
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2017
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5335998/ https://www.ncbi.nlm.nih.gov/pubmed/28208671 http://dx.doi.org/10.3390/s17020353

_version_	1782512138589306880
author	Gutiérrez, Marco A. Manso, Luis J. Pandya, Harit Núñez, Pedro
author_facet	Gutiérrez, Marco A. Manso, Luis J. Pandya, Harit Núñez, Pedro
author_sort	Gutiérrez, Marco A.
collection	PubMed
description	Object detection and classification have countless applications in human–robot interacting systems. It is a necessary skill for autonomous robots that perform tasks in household scenarios. Despite the great advances in deep learning and computer vision, social robots performing non-trivial tasks usually spend most of their time finding and modeling objects. Working in real scenarios means dealing with constant environment changes and relatively low-quality sensor data due to the distance at which objects are often found. Ambient intelligence systems equipped with different sensors can also benefit from the ability to find objects, enabling them to inform humans about their location. For these applications to succeed, systems need to detect the objects that may potentially contain other objects, working with relatively low-resolution sensor data. A passive learning architecture for sensors has been designed in order to take advantage of multimodal information, obtained using an RGB-D camera and trained semantic language models. The main contribution of the architecture lies in the improvement of the performance of the sensor under conditions of low resolution and high light variations using a combination of image labeling and word semantics. The tests performed on each of the stages of the architecture compare this solution with current research labeling techniques for the application of an autonomous social robot working in an apartment. The results obtained demonstrate that the proposed sensor architecture outperforms state-of-the-art approaches.
format	Online Article Text
id	pubmed-5335998
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-53359982017-03-16 A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots Gutiérrez, Marco A. Manso, Luis J. Pandya, Harit Núñez, Pedro Sensors (Basel) Article Object detection and classification have countless applications in human–robot interacting systems. It is a necessary skill for autonomous robots that perform tasks in household scenarios. Despite the great advances in deep learning and computer vision, social robots performing non-trivial tasks usually spend most of their time finding and modeling objects. Working in real scenarios means dealing with constant environment changes and relatively low-quality sensor data due to the distance at which objects are often found. Ambient intelligence systems equipped with different sensors can also benefit from the ability to find objects, enabling them to inform humans about their location. For these applications to succeed, systems need to detect the objects that may potentially contain other objects, working with relatively low-resolution sensor data. A passive learning architecture for sensors has been designed in order to take advantage of multimodal information, obtained using an RGB-D camera and trained semantic language models. The main contribution of the architecture lies in the improvement of the performance of the sensor under conditions of low resolution and high light variations using a combination of image labeling and word semantics. The tests performed on each of the stages of the architecture compare this solution with current research labeling techniques for the application of an autonomous social robot working in an apartment. The results obtained demonstrate that the proposed sensor architecture outperforms state-of-the-art approaches. MDPI 2017-02-11 /pmc/articles/PMC5335998/ /pubmed/28208671 http://dx.doi.org/10.3390/s17020353 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Gutiérrez, Marco A. Manso, Luis J. Pandya, Harit Núñez, Pedro A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots
title	A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots
title_full	A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots
title_fullStr	A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots
title_full_unstemmed	A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots
title_short	A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots
title_sort	passive learning sensor architecture for multimodal image labeling: an application for social robots
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5335998/ https://www.ncbi.nlm.nih.gov/pubmed/28208671 http://dx.doi.org/10.3390/s17020353
work_keys_str_mv	AT gutierrezmarcoa apassivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots AT mansoluisj apassivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots AT pandyaharit apassivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots AT nunezpedro apassivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots AT gutierrezmarcoa passivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots AT mansoluisj passivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots AT pandyaharit passivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots AT nunezpedro passivelearningsensorarchitectureformultimodalimagelabelinganapplicationforsocialrobots

A Passive Learning Sensor Architecture for Multimodal Image Labeling: An Application for Social Robots

Ejemplares similares