Cargando…

High Performance Implementation of 3D Convolutional Neural Networks on a GPU

Convolutional neural networks have proven to be highly successful in applications such as image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which constitutes a 3D input an...

Descripción completa

Detalles Bibliográficos
Autores principales: Lan, Qiang, Wang, Zelong, Wen, Mei, Zhang, Chunyuan, Wang, Yijie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698830/
https://www.ncbi.nlm.nih.gov/pubmed/29250109
http://dx.doi.org/10.1155/2017/8348671
_version_ 1783280836237328384
author Lan, Qiang
Wang, Zelong
Wen, Mei
Zhang, Chunyuan
Wang, Yijie
author_facet Lan, Qiang
Wang, Zelong
Wen, Mei
Zhang, Chunyuan
Wang, Yijie
author_sort Lan, Qiang
collection PubMed
description Convolutional neural networks have proven to be highly successful in applications such as image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which constitutes a 3D input and requires far larger amounts of memory and much more computation. FFT based methods can reduce the amount of computation, but this generally comes at the cost of an increased memory requirement. On the other hand, the Winograd Minimal Filtering Algorithm (WMFA) can reduce the number of operations required and thus can speed up the computation, without increasing the required memory. This strategy was shown to be successful for 2D neural networks. We implement the algorithm for 3D convolutional neural networks and apply it to a popular 3D convolutional neural network which is used to classify videos and compare it to cuDNN. For our highly optimized implementation of the algorithm, we observe a twofold speedup for most of the 3D convolution layers of our test network compared to the cuDNN version.
format Online
Article
Text
id pubmed-5698830
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-56988302017-12-17 High Performance Implementation of 3D Convolutional Neural Networks on a GPU Lan, Qiang Wang, Zelong Wen, Mei Zhang, Chunyuan Wang, Yijie Comput Intell Neurosci Research Article Convolutional neural networks have proven to be highly successful in applications such as image classification, object tracking, and many other tasks based on 2D inputs. Recently, researchers have started to apply convolutional neural networks to video classification, which constitutes a 3D input and requires far larger amounts of memory and much more computation. FFT based methods can reduce the amount of computation, but this generally comes at the cost of an increased memory requirement. On the other hand, the Winograd Minimal Filtering Algorithm (WMFA) can reduce the number of operations required and thus can speed up the computation, without increasing the required memory. This strategy was shown to be successful for 2D neural networks. We implement the algorithm for 3D convolutional neural networks and apply it to a popular 3D convolutional neural network which is used to classify videos and compare it to cuDNN. For our highly optimized implementation of the algorithm, we observe a twofold speedup for most of the 3D convolution layers of our test network compared to the cuDNN version. Hindawi 2017 2017-11-08 /pmc/articles/PMC5698830/ /pubmed/29250109 http://dx.doi.org/10.1155/2017/8348671 Text en Copyright © 2017 Qiang Lan et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lan, Qiang
Wang, Zelong
Wen, Mei
Zhang, Chunyuan
Wang, Yijie
High Performance Implementation of 3D Convolutional Neural Networks on a GPU
title High Performance Implementation of 3D Convolutional Neural Networks on a GPU
title_full High Performance Implementation of 3D Convolutional Neural Networks on a GPU
title_fullStr High Performance Implementation of 3D Convolutional Neural Networks on a GPU
title_full_unstemmed High Performance Implementation of 3D Convolutional Neural Networks on a GPU
title_short High Performance Implementation of 3D Convolutional Neural Networks on a GPU
title_sort high performance implementation of 3d convolutional neural networks on a gpu
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5698830/
https://www.ncbi.nlm.nih.gov/pubmed/29250109
http://dx.doi.org/10.1155/2017/8348671
work_keys_str_mv AT lanqiang highperformanceimplementationof3dconvolutionalneuralnetworksonagpu
AT wangzelong highperformanceimplementationof3dconvolutionalneuralnetworksonagpu
AT wenmei highperformanceimplementationof3dconvolutionalneuralnetworksonagpu
AT zhangchunyuan highperformanceimplementationof3dconvolutionalneuralnetworksonagpu
AT wangyijie highperformanceimplementationof3dconvolutionalneuralnetworksonagpu