Efficient GPU resource scheduler for accelerating artificial intelligence applications인공지능 애플리케이션 가속을 위한 효율적인 GPU 자원 스케줄러

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 79
  • Download : 0
Modern deep learning models used to implement high-performance artificial intelligence applications typically require a large amount of computation for training and inference, and GPUs are commonly utilized to reduce the time required for this. Unfortunately, even the state-of-the-art deep learning platforms often waste a substantial amount of GPU resources due to inefficient scheduling of computational operations, which ends up bloating up the completion time of the applications. We tackle this issue by designing an efficient scheduler and implementing a system optimized for it, especially for (1) a job-level scheduler of a shared GPU cluster for multi-tenant users who train their own deep learning models independently (2) an operator-level scheduler of one or more GPUs that co-operate to run training or inference of a single model in parallel. We present the design and the full implementation of the proposed systems and show that they bring a substantial reduction of overall execution time of real-world deep learning workloads over existing state-of-the-art systems.
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2022
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기및전자공학부, 2022.2,[iv, 80 p. :]

Keywords

Artificial intelligence▼aDeep learning▼aGPU▼aScheduler; 인공지능▼a딥러닝▼aGPU▼a스케줄러

URI
http://hdl.handle.net/10203/309113
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1000291&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0