Cargando…
Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition
Real-world applications such as first-person video activity recognition require intelligent edge devices. However, size, weight, and power constraints of the embedded platforms cannot support resource intensive state-of-the-art algorithms. Machine learning lite algorithms, such as reservoir computin...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6621912/ https://www.ncbi.nlm.nih.gov/pubmed/31333404 http://dx.doi.org/10.3389/fnins.2019.00686 |
_version_ | 1783434129375756288 |
---|---|
author | Soures, Nicholas Kudithipudi, Dhireesha |
author_facet | Soures, Nicholas Kudithipudi, Dhireesha |
author_sort | Soures, Nicholas |
collection | PubMed |
description | Real-world applications such as first-person video activity recognition require intelligent edge devices. However, size, weight, and power constraints of the embedded platforms cannot support resource intensive state-of-the-art algorithms. Machine learning lite algorithms, such as reservoir computing, with shallow 3-layer networks are computationally frugal as only the output layer is trained. By reducing network depth and plasticity, reservoir computing minimizes computational power and complexity, making the algorithms optimal for edge devices. However, as a trade-off for their frugal nature, reservoir computing sacrifices computational power compared to state-of-the-art methods. A good compromise between reservoir computing and fully supervised networks are the proposed deep-LSM networks. The deep-LSM is a deep spiking neural network which captures dynamic information over multiple time-scales with a combination of randomly connected layers and unsupervised layers. The deep-LSM processes the captured dynamic information through an attention modulated readout layer to perform classification. We demonstrate that the deep-LSM achieves an average of 84.78% accuracy on the DogCentric video activity recognition task, beating state-of-the-art. The deep-LSM also shows up to 91.13% memory savings and up to 91.55% reduction in synaptic operations when compared to similar recurrent neural network models. Based on these results we claim that the deep-LSM is capable of overcoming limitations of traditional reservoir computing, while maintaining the low computational cost associated with reservoir computing. |
format | Online Article Text |
id | pubmed-6621912 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-66219122019-07-22 Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition Soures, Nicholas Kudithipudi, Dhireesha Front Neurosci Neuroscience Real-world applications such as first-person video activity recognition require intelligent edge devices. However, size, weight, and power constraints of the embedded platforms cannot support resource intensive state-of-the-art algorithms. Machine learning lite algorithms, such as reservoir computing, with shallow 3-layer networks are computationally frugal as only the output layer is trained. By reducing network depth and plasticity, reservoir computing minimizes computational power and complexity, making the algorithms optimal for edge devices. However, as a trade-off for their frugal nature, reservoir computing sacrifices computational power compared to state-of-the-art methods. A good compromise between reservoir computing and fully supervised networks are the proposed deep-LSM networks. The deep-LSM is a deep spiking neural network which captures dynamic information over multiple time-scales with a combination of randomly connected layers and unsupervised layers. The deep-LSM processes the captured dynamic information through an attention modulated readout layer to perform classification. We demonstrate that the deep-LSM achieves an average of 84.78% accuracy on the DogCentric video activity recognition task, beating state-of-the-art. The deep-LSM also shows up to 91.13% memory savings and up to 91.55% reduction in synaptic operations when compared to similar recurrent neural network models. Based on these results we claim that the deep-LSM is capable of overcoming limitations of traditional reservoir computing, while maintaining the low computational cost associated with reservoir computing. Frontiers Media S.A. 2019-07-04 /pmc/articles/PMC6621912/ /pubmed/31333404 http://dx.doi.org/10.3389/fnins.2019.00686 Text en Copyright © 2019 Soures and Kudithipudi. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Soures, Nicholas Kudithipudi, Dhireesha Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition |
title | Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition |
title_full | Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition |
title_fullStr | Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition |
title_full_unstemmed | Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition |
title_short | Deep Liquid State Machines With Neural Plasticity for Video Activity Recognition |
title_sort | deep liquid state machines with neural plasticity for video activity recognition |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6621912/ https://www.ncbi.nlm.nih.gov/pubmed/31333404 http://dx.doi.org/10.3389/fnins.2019.00686 |
work_keys_str_mv | AT souresnicholas deepliquidstatemachineswithneuralplasticityforvideoactivityrecognition AT kudithipudidhireesha deepliquidstatemachineswithneuralplasticityforvideoactivityrecognition |