Cargando…
Research on OpenCL optimization for FPGA deep learning application
In recent years, with the development of computer science, deep learning is held as competent enough to solve the problem of inference and learning in high dimensional space. Therefore, it has received unprecedented attention from both the academia and the business community. Compared with CPU/GPU,...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786543/ https://www.ncbi.nlm.nih.gov/pubmed/31600218 http://dx.doi.org/10.1371/journal.pone.0222984 |
_version_ | 1783458082587672576 |
---|---|
author | Zhang, Shuo Wu, Yanxia Men, Chaoguang He, Hongtao Liang, Kai |
author_facet | Zhang, Shuo Wu, Yanxia Men, Chaoguang He, Hongtao Liang, Kai |
author_sort | Zhang, Shuo |
collection | PubMed |
description | In recent years, with the development of computer science, deep learning is held as competent enough to solve the problem of inference and learning in high dimensional space. Therefore, it has received unprecedented attention from both the academia and the business community. Compared with CPU/GPU, FPGA has attracted much attention for its high-energy efficiency, short development cycle and reconfigurability in the aspect of deep learning algorithm. However, because of the limited research on OpenCL optimization on FPGA of deep learning algorithms, OpenCL tools and models applied to CPU/GPU cannot be directly used on FPGA. This makes it difficult for software programmers to use FPGA when implementing deep learning algorithms for a rewarding performance. To solve this problem, this paper proposed an OpenCL computational model based on FPGA template architecture to optimize the time-consuming convolution layer in deep learning. The comparison between the program applying the computational model and the corresponding optimization program provided by Xilinx indicates that the former is 8-40 times higher than the latter in terms of performance. |
format | Online Article Text |
id | pubmed-6786543 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-67865432019-10-19 Research on OpenCL optimization for FPGA deep learning application Zhang, Shuo Wu, Yanxia Men, Chaoguang He, Hongtao Liang, Kai PLoS One Research Article In recent years, with the development of computer science, deep learning is held as competent enough to solve the problem of inference and learning in high dimensional space. Therefore, it has received unprecedented attention from both the academia and the business community. Compared with CPU/GPU, FPGA has attracted much attention for its high-energy efficiency, short development cycle and reconfigurability in the aspect of deep learning algorithm. However, because of the limited research on OpenCL optimization on FPGA of deep learning algorithms, OpenCL tools and models applied to CPU/GPU cannot be directly used on FPGA. This makes it difficult for software programmers to use FPGA when implementing deep learning algorithms for a rewarding performance. To solve this problem, this paper proposed an OpenCL computational model based on FPGA template architecture to optimize the time-consuming convolution layer in deep learning. The comparison between the program applying the computational model and the corresponding optimization program provided by Xilinx indicates that the former is 8-40 times higher than the latter in terms of performance. Public Library of Science 2019-10-10 /pmc/articles/PMC6786543/ /pubmed/31600218 http://dx.doi.org/10.1371/journal.pone.0222984 Text en © 2019 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Zhang, Shuo Wu, Yanxia Men, Chaoguang He, Hongtao Liang, Kai Research on OpenCL optimization for FPGA deep learning application |
title | Research on OpenCL optimization for FPGA deep learning application |
title_full | Research on OpenCL optimization for FPGA deep learning application |
title_fullStr | Research on OpenCL optimization for FPGA deep learning application |
title_full_unstemmed | Research on OpenCL optimization for FPGA deep learning application |
title_short | Research on OpenCL optimization for FPGA deep learning application |
title_sort | research on opencl optimization for fpga deep learning application |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786543/ https://www.ncbi.nlm.nih.gov/pubmed/31600218 http://dx.doi.org/10.1371/journal.pone.0222984 |
work_keys_str_mv | AT zhangshuo researchonopencloptimizationforfpgadeeplearningapplication AT wuyanxia researchonopencloptimizationforfpgadeeplearningapplication AT menchaoguang researchonopencloptimizationforfpgadeeplearningapplication AT hehongtao researchonopencloptimizationforfpgadeeplearningapplication AT liangkai researchonopencloptimizationforfpgadeeplearningapplication |