Cargando…
Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture
Intelligent agents, such as robots, have to serve a multitude of autonomous functions. Examples are, e.g., collision avoidance, navigation and route planning, active sensing of its environment, or the interaction and non-verbal communication with people in the extended reach space. Here, we focus on...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5360715/ https://www.ncbi.nlm.nih.gov/pubmed/28381998 http://dx.doi.org/10.3389/fnbot.2017.00013 |
_version_ | 1782516636372172800 |
---|---|
author | Layher, Georg Brosch, Tobias Neumann, Heiko |
author_facet | Layher, Georg Brosch, Tobias Neumann, Heiko |
author_sort | Layher, Georg |
collection | PubMed |
description | Intelligent agents, such as robots, have to serve a multitude of autonomous functions. Examples are, e.g., collision avoidance, navigation and route planning, active sensing of its environment, or the interaction and non-verbal communication with people in the extended reach space. Here, we focus on the recognition of the action of a human agent based on a biologically inspired visual architecture of analyzing articulated movements. The proposed processing architecture builds upon coarsely segregated streams of sensory processing along different pathways which separately process form and motion information (Layher et al., 2014). Action recognition is performed in an event-based scheme by identifying representations of characteristic pose configurations (key poses) in an image sequence. In line with perceptual studies, key poses are selected unsupervised utilizing a feature-driven criterion which combines extrema in the motion energy with the horizontal and the vertical extendedness of a body shape. Per class representations of key pose frames are learned using a deep convolutional neural network consisting of 15 convolutional layers. The network is trained using the energy-efficient deep neuromorphic networks (Eedn) framework (Esser et al., 2016), which realizes the mapping of the trained synaptic weights onto the IBM Neurosynaptic System platform (Merolla et al., 2014). After the mapping, the trained network achieves real-time capabilities for processing input streams and classify input images at about 1,000 frames per second while the computational stages only consume about 70 mW of energy (without spike transduction). Particularly regarding mobile robotic systems, a low energy profile might be crucial in a variety of application scenarios. Cross-validation results are reported for two different datasets and compared to state-of-the-art action recognition approaches. The results demonstrate, that (I) the presented approach is on par with other key pose based methods described in the literature, which select key pose frames by optimizing classification accuracy, (II) compared to the training on the full set of frames, representations trained on key pose frames result in a higher confidence in class assignments, and (III) key pose representations show promising generalization capabilities in a cross-dataset evaluation. |
format | Online Article Text |
id | pubmed-5360715 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-53607152017-04-05 Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture Layher, Georg Brosch, Tobias Neumann, Heiko Front Neurorobot Neuroscience Intelligent agents, such as robots, have to serve a multitude of autonomous functions. Examples are, e.g., collision avoidance, navigation and route planning, active sensing of its environment, or the interaction and non-verbal communication with people in the extended reach space. Here, we focus on the recognition of the action of a human agent based on a biologically inspired visual architecture of analyzing articulated movements. The proposed processing architecture builds upon coarsely segregated streams of sensory processing along different pathways which separately process form and motion information (Layher et al., 2014). Action recognition is performed in an event-based scheme by identifying representations of characteristic pose configurations (key poses) in an image sequence. In line with perceptual studies, key poses are selected unsupervised utilizing a feature-driven criterion which combines extrema in the motion energy with the horizontal and the vertical extendedness of a body shape. Per class representations of key pose frames are learned using a deep convolutional neural network consisting of 15 convolutional layers. The network is trained using the energy-efficient deep neuromorphic networks (Eedn) framework (Esser et al., 2016), which realizes the mapping of the trained synaptic weights onto the IBM Neurosynaptic System platform (Merolla et al., 2014). After the mapping, the trained network achieves real-time capabilities for processing input streams and classify input images at about 1,000 frames per second while the computational stages only consume about 70 mW of energy (without spike transduction). Particularly regarding mobile robotic systems, a low energy profile might be crucial in a variety of application scenarios. Cross-validation results are reported for two different datasets and compared to state-of-the-art action recognition approaches. The results demonstrate, that (I) the presented approach is on par with other key pose based methods described in the literature, which select key pose frames by optimizing classification accuracy, (II) compared to the training on the full set of frames, representations trained on key pose frames result in a higher confidence in class assignments, and (III) key pose representations show promising generalization capabilities in a cross-dataset evaluation. Frontiers Media S.A. 2017-03-22 /pmc/articles/PMC5360715/ /pubmed/28381998 http://dx.doi.org/10.3389/fnbot.2017.00013 Text en Copyright © 2017 Layher, Brosch and Neumann. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Layher, Georg Brosch, Tobias Neumann, Heiko Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture |
title | Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture |
title_full | Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture |
title_fullStr | Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture |
title_full_unstemmed | Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture |
title_short | Real-Time Biologically Inspired Action Recognition from Key Poses Using a Neuromorphic Architecture |
title_sort | real-time biologically inspired action recognition from key poses using a neuromorphic architecture |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5360715/ https://www.ncbi.nlm.nih.gov/pubmed/28381998 http://dx.doi.org/10.3389/fnbot.2017.00013 |
work_keys_str_mv | AT layhergeorg realtimebiologicallyinspiredactionrecognitionfromkeyposesusinganeuromorphicarchitecture AT broschtobias realtimebiologicallyinspiredactionrecognitionfromkeyposesusinganeuromorphicarchitecture AT neumannheiko realtimebiologicallyinspiredactionrecognitionfromkeyposesusinganeuromorphicarchitecture |