Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition

This paper presents a novel method for integration of industrially-oriented human-robot speech communication and vision-based object recognition. Such integration is necessary to provide context for task-oriented voice commands. Context-based speech communication is easier, the commands are shorter,...

Bibliographic Details
Main Authors:	Rogowski, Adam, Bieliszczuk, Krzysztof, Rapcewicz, Jerzy
Format:	Online Article Text
Language:	English
Published:	MDPI 2020
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7767307/ https://www.ncbi.nlm.nih.gov/pubmed/33353038 http://dx.doi.org/10.3390/s20247287

_version_	1783628928030605312
author	Rogowski, Adam Bieliszczuk, Krzysztof Rapcewicz, Jerzy
author_facet	Rogowski, Adam Bieliszczuk, Krzysztof Rapcewicz, Jerzy
author_sort	Rogowski, Adam
collection	PubMed
description	This paper presents a novel method for integration of industrially-oriented human-robot speech communication and vision-based object recognition. Such integration is necessary to provide context for task-oriented voice commands. Context-based speech communication is easier, the commands are shorter, hence their recognition rate is higher. In recent years, significant research was devoted to integration of speech and gesture recognition. However, little attention was paid to vision-based identification of objects in industrial environment (like workpieces or tools) represented by general terms used in voice commands. There are no reports on any methods facilitating the abovementioned integration. Image and speech recognition systems usually operate on different data structures, describing reality on different levels of abstraction, hence development of context-based voice control systems is a laborious and time-consuming task. The aim of our research was to solve this problem. The core of our method is extension of Voice Command Description (VCD) format describing syntax and semantics of task-oriented commands, as well as its integration with Flexible Editable Contour Templates (FECT) used for classification of contours derived from image recognition systems. To the best of our knowledge, it is the first solution that facilitates development of customized vision-based voice control applications for industrial robots.
format	Online Article Text
id	pubmed-7767307
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-77673072020-12-28 Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition Rogowski, Adam Bieliszczuk, Krzysztof Rapcewicz, Jerzy Sensors (Basel) Article This paper presents a novel method for integration of industrially-oriented human-robot speech communication and vision-based object recognition. Such integration is necessary to provide context for task-oriented voice commands. Context-based speech communication is easier, the commands are shorter, hence their recognition rate is higher. In recent years, significant research was devoted to integration of speech and gesture recognition. However, little attention was paid to vision-based identification of objects in industrial environment (like workpieces or tools) represented by general terms used in voice commands. There are no reports on any methods facilitating the abovementioned integration. Image and speech recognition systems usually operate on different data structures, describing reality on different levels of abstraction, hence development of context-based voice control systems is a laborious and time-consuming task. The aim of our research was to solve this problem. The core of our method is extension of Voice Command Description (VCD) format describing syntax and semantics of task-oriented commands, as well as its integration with Flexible Editable Contour Templates (FECT) used for classification of contours derived from image recognition systems. To the best of our knowledge, it is the first solution that facilitates development of customized vision-based voice control applications for industrial robots. MDPI 2020-12-18 /pmc/articles/PMC7767307/ /pubmed/33353038 http://dx.doi.org/10.3390/s20247287 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Rogowski, Adam Bieliszczuk, Krzysztof Rapcewicz, Jerzy Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition
title	Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition
title_full	Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition
title_fullStr	Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition
title_full_unstemmed	Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition
title_short	Integration of Industrially-Oriented Human-Robot Speech Communication and Vision-Based Object Recognition
title_sort	integration of industrially-oriented human-robot speech communication and vision-based object recognition
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7767307/ https://www.ncbi.nlm.nih.gov/pubmed/33353038 http://dx.doi.org/10.3390/s20247287
work_keys_str_mv	AT rogowskiadam integrationofindustriallyorientedhumanrobotspeechcommunicationandvisionbasedobjectrecognition AT bieliszczukkrzysztof integrationofindustriallyorientedhumanrobotspeechcommunicationandvisionbasedobjectrecognition AT rapcewiczjerzy integrationofindustriallyorientedhumanrobotspeechcommunicationandvisionbasedobjectrecognition

Similar Items