Automatic parallelization for efficient distributed deep learning효율적인 분산 딥 러닝을 위한 자동 병렬화 알고리즘

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 309
  • Download : 0
Deep learning is increasingly popular as it is applied to many problems of various fields with excellent performance. Unfortunately, it often takes a large amount of time to train a practical deep model for real-world quality. While distributed deep learning that harnesses multiple GPUs in parallel can potentially reduce training time significantly, it is not only onerous but also error-prone to manually write the logic for distributed training of each model. Some research works have tackled the problem by proposing algorithms that automatically parallelize deep learning models given a single-GPU model. However, they often support only basic functions for distributed training, and misses a number of important factors for efficient distributed deep learning.In this paper, we propose an automatic parallelization for efficient distributed training beyond basic parallelization of a deep learning model. First, we minimize the overhead in switching the device set to a new configuration, which allows fast re-adjustment of aggregate resource of a GPU cluster. Second, it enables training with a large batch size even with a single GPU, which leads to better utilization of cluster resource. Finally, it realizes near-uniform load balancing, reducing the variation of processing latency and better resource utilization. Evaluation shows that our auto-parallelization expands a device set up to 6 times faster and shrink a device set almost instantly, and improves the scalability up to 2.97 times, compared to existing methods.
Advisors
Park, KyoungSooresearcher박경수researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2019
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2019.2,[iii, 29 p. :]

Keywords

Deep learning▼adistributed deep learning▼aauto-parallelization▼aperformance optimization; 딥 러닝▼a분산 딥 러닝▼a자동 병렬화▼a성능 최적화

URI
http://hdl.handle.net/10203/266784
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=843384&flag=dissertation
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0