An Implementation of a Convolutional Accelerator based on a GPGPU for a Deep Learning

Jeon, Hee-Kyeong; Lee, Kwang-yeob; Kim, Chi-yong

doi:10.7471/ikeee.2016.20.3.303

Journal of IKEEE (전기전자학회논문지)

Volume 20 Issue 3 / Pages 303-306 / 2016 / 1226-7244 (pISSN) / 2288-243X (eISSN)

Institute of Korean Electrical and Electronics Engineers (한국전기전자학회)

Deep Learning을 위한 GPGPU 기반 Convolution 가속기 구현

Jeon, Hee-Kyeong (Dept. of Computer Engineering, Seokyeong University) ;
Lee, Kwang-yeob (Dept. of Computer Engineering, Seokyeong University) ;
Kim, Chi-yong (Dept. of Computer Science, Seokyeong University)

Received : 2016.09.13
Accepted : 2016.09.29
Published : 2016.09.30

https://doi.org/10.7471/ikeee.2016.20.3.303

Abstract

In this paper, we propose a method for accelerating a convolutional neural network (CNN) on a GPGPU. A CNN is a type of neural network that learns the features of images and classifies them, which makes it well suited to image processing tasks that require training on large amounts of data. The convolutional layers of a conventional CNN require a large number of multiplications, which makes real-time operation in an embedded environment difficult. To address this drawback, we reduce the number of multiplications by using the Winograd convolution operation and process the convolution in parallel on the SIMT architecture of the GPGPU. Experiments were conducted using ModelSim and TestDrive, and the results showed that the processing time improved by about 17% compared with conventional convolution.
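
To illustrate how Winograd convolution can reduce the number of multiplications, the following is a minimal sketch of the standard 1D minimal filtering algorithm F(2,3), which computes two outputs of a 3-tap filter with 4 multiplications instead of 6. The abstract does not state which tile size or data layout the accelerator uses, so the function names and the choice of F(2,3) here are assumptions made for illustration only, not the authors' implementation.

```python
def direct_conv_1d(d, g):
    """Direct method: two outputs of a 3-tap filter g over four inputs d.

    Uses 6 multiplications (3 per output).
    """
    return [
        d[0] * g[0] + d[1] * g[1] + d[2] * g[2],
        d[1] * g[0] + d[2] * g[1] + d[3] * g[2],
    ]


def winograd_f23(d, g):
    """Winograd F(2,3): the same two outputs with 4 multiplications.

    Illustrative sketch only; names are hypothetical.
    """
    # Filter transform. It depends only on the filter, so it can be
    # computed once and reused for every input tile.
    u0 = g[0]
    u1 = (g[0] + g[1] + g[2]) / 2
    u2 = (g[0] - g[1] + g[2]) / 2
    u3 = g[2]

    # Input transform (additions only) followed by the 4 multiplications.
    m1 = (d[0] - d[2]) * u0
    m2 = (d[1] + d[2]) * u1
    m3 = (d[2] - d[1]) * u2
    m4 = (d[1] - d[3]) * u3

    # Output transform (additions only).
    return [m1 + m2 + m3, m2 - m3 - m4]


if __name__ == "__main__":
    d = [1, 2, 3, 4]
    g = [2, 4, 6]
    print(direct_conv_1d(d, g))
    print(winograd_f23(d, g))
```

Both functions return the same values for the same input tile. Because each tile is transformed and multiplied independently of the others, tiles could in principle be assigned to separate threads on a SIMT processor, which is the kind of parallelism the abstract refers to.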

Cited by

The training of convolution neural network for advanced driver assistant system vol.4, pp.4, 2016, https://doi.org/10.17703/IJACT2016.4.4.23
