Cargando…
A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration
Deep Neural Networks (DNNs) deployment for IoT Edge applications requires strong skills in hardware and software. In this paper, a novel design framework fully automated for Edge applications is proposed to perform such a deployment on System-on-Chips. Based on a high-level Python interface that mim...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467982/ https://www.ncbi.nlm.nih.gov/pubmed/34577258 http://dx.doi.org/10.3390/s21186050 |
_version_ | 1784573542909607936 |
---|---|
author | Belabed, Tarek Ramos Gomes da Silva, Vitor Quenon, Alexandre Valderamma, Carlos Souani, Chokri |
author_facet | Belabed, Tarek Ramos Gomes da Silva, Vitor Quenon, Alexandre Valderamma, Carlos Souani, Chokri |
author_sort | Belabed, Tarek |
collection | PubMed |
description | Deep Neural Networks (DNNs) deployment for IoT Edge applications requires strong skills in hardware and software. In this paper, a novel design framework fully automated for Edge applications is proposed to perform such a deployment on System-on-Chips. Based on a high-level Python interface that mimics the leading Deep Learning software frameworks, it offers an easy way to implement a hardware-accelerated DNN on an FPGA. To do this, our design methodology covers the three main phases: (a) customization: where the user specifies the optimizations needed on each DNN layer, (b) generation: the framework generates on the Cloud the necessary binaries for both FPGA and software parts, and (c) deployment: the SoC on the Edge receives the resulting files serving to program the FPGA and related Python libraries for user applications. Among the study cases, an optimized DNN for the MNIST database can speed up more than 60× a software version on the ZYNQ 7020 SoC and still consume less than [Formula: see text] [Formula: see text]. A comparison with the state-of-the-art frameworks demonstrates that our methodology offers the best trade-off between throughput, power consumption, and system cost. |
format | Online Article Text |
id | pubmed-8467982 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84679822021-09-27 A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration Belabed, Tarek Ramos Gomes da Silva, Vitor Quenon, Alexandre Valderamma, Carlos Souani, Chokri Sensors (Basel) Article Deep Neural Networks (DNNs) deployment for IoT Edge applications requires strong skills in hardware and software. In this paper, a novel design framework fully automated for Edge applications is proposed to perform such a deployment on System-on-Chips. Based on a high-level Python interface that mimics the leading Deep Learning software frameworks, it offers an easy way to implement a hardware-accelerated DNN on an FPGA. To do this, our design methodology covers the three main phases: (a) customization: where the user specifies the optimizations needed on each DNN layer, (b) generation: the framework generates on the Cloud the necessary binaries for both FPGA and software parts, and (c) deployment: the SoC on the Edge receives the resulting files serving to program the FPGA and related Python libraries for user applications. Among the study cases, an optimized DNN for the MNIST database can speed up more than 60× a software version on the ZYNQ 7020 SoC and still consume less than [Formula: see text] [Formula: see text]. A comparison with the state-of-the-art frameworks demonstrates that our methodology offers the best trade-off between throughput, power consumption, and system cost. MDPI 2021-09-09 /pmc/articles/PMC8467982/ /pubmed/34577258 http://dx.doi.org/10.3390/s21186050 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Belabed, Tarek Ramos Gomes da Silva, Vitor Quenon, Alexandre Valderamma, Carlos Souani, Chokri A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration |
title | A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration |
title_full | A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration |
title_fullStr | A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration |
title_full_unstemmed | A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration |
title_short | A Novel Automate Python Edge-to-Edge: From Automated Generation on Cloud to User Application Deployment on Edge of Deep Neural Networks for Low Power IoT Systems FPGA-Based Acceleration |
title_sort | novel automate python edge-to-edge: from automated generation on cloud to user application deployment on edge of deep neural networks for low power iot systems fpga-based acceleration |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467982/ https://www.ncbi.nlm.nih.gov/pubmed/34577258 http://dx.doi.org/10.3390/s21186050 |
work_keys_str_mv | AT belabedtarek anovelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT ramosgomesdasilvavitor anovelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT quenonalexandre anovelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT valderammacarlos anovelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT souanichokri anovelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT belabedtarek novelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT ramosgomesdasilvavitor novelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT quenonalexandre novelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT valderammacarlos novelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration AT souanichokri novelautomatepythonedgetoedgefromautomatedgenerationoncloudtouserapplicationdeploymentonedgeofdeepneuralnetworksforlowpoweriotsystemsfpgabasedacceleration |