Cargando…

Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition

Real-world applications such as first-person video activity recognition require intelligent edge devices. However, size, weight, and power constraints of the embedded platforms cannot support resource intensive state-of-the-art algorithms. Machine learning lite algorithms, such as reservoir computin...

Descripción completa

Detalles Bibliográficos
Autores principales: Soures, Nicholas, Kudithipudi, Dhireesha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6621912/
https://www.ncbi.nlm.nih.gov/pubmed/31333404
http://dx.doi.org/10.3389/fnins.2019.00686
_version_ 1783434129375756288
author Soures, Nicholas
Kudithipudi, Dhireesha
author_facet Soures, Nicholas
Kudithipudi, Dhireesha
author_sort Soures, Nicholas
collection PubMed
description Real-world applications such as first-person video activity recognition require intelligent edge devices. However, size, weight, and power constraints of the embedded platforms cannot support resource intensive state-of-the-art algorithms. Machine learning lite algorithms, such as reservoir computing, with shallow 3-layer networks are computationally frugal as only the output layer is trained. By reducing network depth and plasticity, reservoir computing minimizes computational power and complexity, making the algorithms optimal for edge devices. However, as a trade-off for their frugal nature, reservoir computing sacrifices computational power compared to state-of-the-art methods. A good compromise between reservoir computing and fully supervised networks are the proposed deep-LSM networks. The deep-LSM is a deep spiking neural network which captures dynamic information over multiple time-scales with a combination of randomly connected layers and unsupervised layers. The deep-LSM processes the captured dynamic information through an attention modulated readout layer to perform classification. We demonstrate that the deep-LSM achieves an average of 84.78% accuracy on the DogCentric video activity recognition task, beating state-of-the-art. The deep-LSM also shows up to 91.13% memory savings and up to 91.55% reduction in synaptic operations when compared to similar recurrent neural network models. Based on these results we claim that the deep-LSM is capable of overcoming limitations of traditional reservoir computing, while maintaining the low computational cost associated with reservoir computing.
format Online
Article
Text
id pubmed-6621912
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-66219122019-07-22 Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition Soures, Nicholas Kudithipudi, Dhireesha Front Neurosci Neuroscience Real-world applications such as first-person video activity recognition require intelligent edge devices. However, size, weight, and power constraints of the embedded platforms cannot support resource intensive state-of-the-art algorithms. Machine learning lite algorithms, such as reservoir computing, with shallow 3-layer networks are computationally frugal as only the output layer is trained. By reducing network depth and plasticity, reservoir computing minimizes computational power and complexity, making the algorithms optimal for edge devices. However, as a trade-off for their frugal nature, reservoir computing sacrifices computational power compared to state-of-the-art methods. A good compromise between reservoir computing and fully supervised networks are the proposed deep-LSM networks. The deep-LSM is a deep spiking neural network which captures dynamic information over multiple time-scales with a combination of randomly connected layers and unsupervised layers. The deep-LSM processes the captured dynamic information through an attention modulated readout layer to perform classification. We demonstrate that the deep-LSM achieves an average of 84.78% accuracy on the DogCentric video activity recognition task, beating state-of-the-art. The deep-LSM also shows up to 91.13% memory savings and up to 91.55% reduction in synaptic operations when compared to similar recurrent neural network models. Based on these results we claim that the deep-LSM is capable of overcoming limitations of traditional reservoir computing, while maintaining the low computational cost associated with reservoir computing. Frontiers Media S.A. 2019-07-04 /pmc/articles/PMC6621912/ /pubmed/31333404 http://dx.doi.org/10.3389/fnins.2019.00686 Text en Copyright © 2019 Soures and Kudithipudi. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Soures, Nicholas
Kudithipudi, Dhireesha
Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition
title Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition
title_full Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition
title_fullStr Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition
title_full_unstemmed Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition
title_short Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition
title_sort deep liquid state machines with neural plasticity for video activity recognition
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6621912/
https://www.ncbi.nlm.nih.gov/pubmed/31333404
http://dx.doi.org/10.3389/fnins.2019.00686
work_keys_str_mv AT souresnicholas deepliquidstatemachineswithneuralplasticityforvideoactivityrecognition
AT kudithipudidhireesha deepliquidstatemachineswithneuralplasticityforvideoactivityrecognition