DSpace at KOASAS: Automatic parallelization for efficient distributed deep learning

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

Automatic parallelization for efficient distributed deep learning효율적인 분산 딥 러닝을 위한 자동 병렬화 알고리즘

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 314
Download : 0

Export

Kim, TaeHyun

Deep learning is increasingly popular as it is applied to many problems of various fields with excellent performance. Unfortunately, it often takes a large amount of time to train a practical deep model for real-world quality. While distributed deep learning that harnesses multiple GPUs in parallel can potentially reduce training time significantly, it is not only onerous but also error-prone to manually write the logic for distributed training of each model. Some research works have tackled the problem by proposing algorithms that automatically parallelize deep learning models given a single-GPU model. However, they often support only basic functions for distributed training, and misses a number of important factors for efficient distributed deep learning.In this paper, we propose an automatic parallelization for efficient distributed training beyond basic parallelization of a deep learning model. First, we minimize the overhead in switching the device set to a new configuration, which allows fast re-adjustment of aggregate resource of a GPU cluster. Second, it enables training with a large batch size even with a single GPU, which leads to better utilization of cluster resource. Finally, it realizes near-uniform load balancing, reducing the variation of processing latency and better resource utilization. Evaluation shows that our auto-parallelization expands a device set up to 6 times faster and shrink a device set almost instantly, and improves the scalability up to 2.97 times, compared to existing methods.

Advisors: Park, KyoungSoo researcher; 박경수 researcher

Description: 한국과학기술원 :전기및전자공학부,

Publisher: 한국과학기술원

Issue Date: 2019

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2019.2,[iii, 29 p. :]

Keywords: Deep learning▼adistributed deep learning▼aauto-parallelization▼aperformance optimization; 딥 러닝▼a분산 딥 러닝▼a자동 병렬화▼a성능 최적화

URI: http://hdl.handle.net/10203/266784

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=843384&flag=dissertation

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Automatic parallelization for efficient distributed deep learning효율적인 분산 딥 러닝을 위한 자동 병렬화 알고리즘

KOASAS

Communities & Collections