(A) NAND flash-based on-die processing architecture for stochastic optimization of large scale DNNs거대 신경망의 확률적 최적화를 위한 낸드 플래시 기반 온-다이 처리 구조

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 59
  • Download : 0
Training deep neural network (DNN) models is a resource-intensive, iterative process. For this reason, nowadays, complex optimizers like Adam are widely adopted as it increases the speed and efficiency of training. These optimizers, however, employ additional variables and raise the memory demand 2x to 3x of model parameters itself, worsening the memory capacity bottleneck when training large scale DNN models. Moreover, as the size of DNN models is projected to grow even further, it is not practical to assume that the future models will fit in accelerator memory. This has triggered various efforts to offload models to flash-based storage. However, when the model, especially the optimizer is offloaded to flash, the limited I/O bandwidth becomes a bottleneck, severely slowing down the overall training process. To this end, a solid-state drive (SSD) system with on-die processing (ODP) flash memory architecture is presented for gradient descent-based DNN models. Proposed scheme accelerates, or enables the training of large scale models by processing optimization stage in the storage device, specifically inside the flash dies. ODP capability of proposed architecture eliminates the heavy data movement over external interconnect and internal flash channels. In addition, proposed scheme involves customized flash translation layer (FTL) to support efficient memory management. Overall, it achieves, on average, a 2.8x speedup and a 3.6x improved energy efficiency in the weight update stage over baseline SSD offloading.
Advisors
Kim, Lee-Supresearcher김이섭researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2022.8,[iv, 41 p. :]

Keywords

Near-data processing▼aDNN optimization▼aSolid state drives▼aIn-storage processing; 데이터-근접 처리▼a딥-뉴럴 네트워크 최적화▼a고체 상태 드라이브▼a인-스토리지 처리

URI
http://hdl.handle.net/10203/309837
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1008472&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0