Cargando…

A shared synapse architecture for efficient FPGA implementation of autoencoders

This paper proposes a shared synapse architecture for autoencoders (AEs), and implements an AE with the proposed architecture as a digital circuit on a field-programmable gate array (FPGA). In the proposed architecture, the values of the synapse weights are shared between the synapses of an input an...

Descripción completa

Detalles Bibliográficos
Autores principales: Suzuki, Akihiro, Morie, Takashi, Tamukoh, Hakaru
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5854352/
https://www.ncbi.nlm.nih.gov/pubmed/29543909
http://dx.doi.org/10.1371/journal.pone.0194049
_version_ 1783306899333054464
author Suzuki, Akihiro
Morie, Takashi
Tamukoh, Hakaru
author_facet Suzuki, Akihiro
Morie, Takashi
Tamukoh, Hakaru
author_sort Suzuki, Akihiro
collection PubMed
description This paper proposes a shared synapse architecture for autoencoders (AEs), and implements an AE with the proposed architecture as a digital circuit on a field-programmable gate array (FPGA). In the proposed architecture, the values of the synapse weights are shared between the synapses of an input and a hidden layer, and between the synapses of a hidden and an output layer. This architecture utilizes less of the limited resources of an FPGA than an architecture which does not share the synapse weights, and reduces the amount of synapse modules used by half. For the proposed circuit to be implemented into various types of AEs, it utilizes three kinds of parameters; one to change the number of layers’ units, one to change the bit width of an internal value, and a learning rate. By altering a network configuration using these parameters, the proposed architecture can be used to construct a stacked AE. The proposed circuits are logically synthesized, and the number of their resources is determined. Our experimental results show that single and stacked AE circuits utilizing the proposed shared synapse architecture operate as regular AEs and as regular stacked AEs. The scalability of the proposed circuit and the relationship between the bit widths and the learning results are also determined. The clock cycles of the proposed circuits are formulated, and this formula is used to estimate the theoretical performance of the circuit when the circuit is used to construct arbitrary networks.
format Online
Article
Text
id pubmed-5854352
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58543522018-03-28 A shared synapse architecture for efficient FPGA implementation of autoencoders Suzuki, Akihiro Morie, Takashi Tamukoh, Hakaru PLoS One Research Article This paper proposes a shared synapse architecture for autoencoders (AEs), and implements an AE with the proposed architecture as a digital circuit on a field-programmable gate array (FPGA). In the proposed architecture, the values of the synapse weights are shared between the synapses of an input and a hidden layer, and between the synapses of a hidden and an output layer. This architecture utilizes less of the limited resources of an FPGA than an architecture which does not share the synapse weights, and reduces the amount of synapse modules used by half. For the proposed circuit to be implemented into various types of AEs, it utilizes three kinds of parameters; one to change the number of layers’ units, one to change the bit width of an internal value, and a learning rate. By altering a network configuration using these parameters, the proposed architecture can be used to construct a stacked AE. The proposed circuits are logically synthesized, and the number of their resources is determined. Our experimental results show that single and stacked AE circuits utilizing the proposed shared synapse architecture operate as regular AEs and as regular stacked AEs. The scalability of the proposed circuit and the relationship between the bit widths and the learning results are also determined. The clock cycles of the proposed circuits are formulated, and this formula is used to estimate the theoretical performance of the circuit when the circuit is used to construct arbitrary networks. Public Library of Science 2018-03-15 /pmc/articles/PMC5854352/ /pubmed/29543909 http://dx.doi.org/10.1371/journal.pone.0194049 Text en © 2018 Suzuki et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Suzuki, Akihiro
Morie, Takashi
Tamukoh, Hakaru
A shared synapse architecture for efficient FPGA implementation of autoencoders
title A shared synapse architecture for efficient FPGA implementation of autoencoders
title_full A shared synapse architecture for efficient FPGA implementation of autoencoders
title_fullStr A shared synapse architecture for efficient FPGA implementation of autoencoders
title_full_unstemmed A shared synapse architecture for efficient FPGA implementation of autoencoders
title_short A shared synapse architecture for efficient FPGA implementation of autoencoders
title_sort shared synapse architecture for efficient fpga implementation of autoencoders
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5854352/
https://www.ncbi.nlm.nih.gov/pubmed/29543909
http://dx.doi.org/10.1371/journal.pone.0194049
work_keys_str_mv AT suzukiakihiro asharedsynapsearchitectureforefficientfpgaimplementationofautoencoders
AT morietakashi asharedsynapsearchitectureforefficientfpgaimplementationofautoencoders
AT tamukohhakaru asharedsynapsearchitectureforefficientfpgaimplementationofautoencoders
AT suzukiakihiro sharedsynapsearchitectureforefficientfpgaimplementationofautoencoders
AT morietakashi sharedsynapsearchitectureforefficientfpgaimplementationofautoencoders
AT tamukohhakaru sharedsynapsearchitectureforefficientfpgaimplementationofautoencoders