(An) energy-efficient CNN accelerator with similar feature skipping for face recognition in mobile devices모바일 얼굴 인식을 위한 에너지 효율적인 유사 특징 연산 생략 심층 신경망 가속기

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 270
  • Download : 0
A low power face recognition (FR) convolutional neural network (CNN) processor is proposed with high power efficiency to achieve always-on FR in mobile devices. Three key features enable a power-efficient FR CNN. First, tile-based clustering (THC) is proposed for reducing the computation overhead of hierarchical clustering Second, a low latency tile-based hierarchical clustering core is proposed. It supports an approximated clustering method that removes distance updates and increases pipeline utilization by up to 98.6%. Finally, a similar feature skipping binary convolution core with separated accumulation cores is proposed to increase power efficiency through skipping redundant MAC operation between duplicated input features and weights. It can reduce 35.8% of the total computation considering both CNN and THC. The proposed features enable the proposed processor to operate FR CNN with 17.3 TOPS/W power efficiency, which is 1.3× higher compared to the previous state-of-the-art FR CNN processor. Implemented in 65nm CMOS technology, the 6mm2 FR CNN processor shows 0.5mW power consumption at 1 frames-per-second always-on face recognition.
Advisors
Yoo, Hoi-Junresearcher유회준researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2020
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2020.2,[1책 :]

Keywords

Face recognition▼aConvolutional neural network▼aTile-based processing▼aHierarchical clustering▼aActivation skipping; 얼굴 인식▼a심층신경망▼a타일 기반 연산▼a클러스터링▼a입력 기반 연산 생략

URI
http://hdl.handle.net/10203/283802
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=901534&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0