Encoding convolutional neural networks for efficient hardware acceleration효율적인 하드웨어 가속을 위한 CNN 인코딩

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 150
  • Download : 0
Recent many industrial and commercial fields are making an effort to deploy the DCNN on mobile devices. However, the layer depth of DCNNs deepens as user-applications become more complicated and require higher accuracy, thus resulting in an increasing number of computation and parameter size. Two solutions for mitigating this issue is quantization of the parameters of DCNNs and data encoding to lower the complexity of computing units and memory footprint with a little accuracy drop. In this thesis, to mitigate the above problems in two quantized networks, which are extremely-quantized weight CNNs, multi-bit quantized CNNs, data encoding schemes are proposed. And then, customized hardware accelerators are designed to verify the efficiency of these proposed schemes. As a result, in extremely-quantized weight CNNs, effective bit per weight is reduced to 0.67-0.80 bit achieving 4.52-7.70x and 1.52-2.21x improvement of performance and energy efficiency respectively, with higher accuracy compared to previous binary weight CNN work. In multi-bit quantized CNNs, about 12.7-26.0\% energy saving is achieved from iso-accuracy comparison, so that a design option, that can be the most efficient compared to the baselines considering accuracy-energy trade-off, is provided.
Advisors
Kim, Lee-Supresearcher김이섭researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2020
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2020.2,[iv, 41 p. :]

Keywords

CNN▼aquantization▼aEncoding▼aHardware acceleration; CNN▼a양자화▼a인코딩▼a하드웨어 가속

URI
http://hdl.handle.net/10203/284744
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=911362&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0