FSHMEM : partitioned global address space support for FPGA hardware accelerators at scaleFSHMEM: 대규모 FPGA 기반 하드웨어 가속기 활용을 위한 Partitioned Global Address Space 하드웨어-소프트웨어 개발

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 77
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Joo Young-
dc.contributor.advisor김주영-
dc.contributor.authorArthanto, Yashael Faith-
dc.date.accessioned2023-06-26T19:33:48Z-
dc.date.available2023-06-26T19:33:48Z-
dc.date.issued2022-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1008381&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/309862-
dc.description학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2022.8,[iii, 25 p. :]-
dc.description.abstractPartitioned Global Address Space (PGAS), a parallel programming model that uses a globally shared memory and one-sided communication, has shown promising potential for High-Performance Computing (HPC) applications. At the same time, the HPC community has started to adopt FPGA as an alternative computing framework to the conventional CPU and GPU. However, the exploration of using PGAS along with FPGAs is in a preliminary stage, unlike the message passing model. This work proposes FSHMEM, a software/hardware framework that brings the PGAS programming model to FPGAs. We implement GASNet, a middleware that enables the PGAS programming model, on hardware to provide native PGAS integration on FPGA and the software interface with high compatibility to legacy PGAS programs. Our communication benchmark shows that FSHMEM outperforms prior works by 9.5× by attaining a peak bandwidth of 3813 MB/s, which is 95% of the theoretical maximum. Moreover, the remote write and read operation in FSHMEM has a low latency with an average of 0.35 μs and 0.59 μs, respectively. Lastly, we conducted a case study on two FSHMEM devices implemented on Intel D5005 PACs that contain Intel Deep Learning Accelerator (DLA). This system shows an excellent scalability potential by achieving 1.94× and 1.98× performance gain for parallel matrix-matrix multiplication and convolution, respectively.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectParallel Programming Model▼aPGAS▼aFPGA▼aGASNet▼aSHMEM-
dc.subject병렬 프로그래밍 모델▼aPGAS▼aFPGA▼aGASNet▼aSHMEM-
dc.titleFSHMEM-
dc.title.alternativeFSHMEM: 대규모 FPGA 기반 하드웨어 가속기 활용을 위한 Partitioned Global Address Space 하드웨어-소프트웨어 개발-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전기및전자공학부,-
dc.contributor.alternativeauthor아산토 야샤엘 페이트-
dc.title.subtitlepartitioned global address space support for FPGA hardware accelerators at scale-
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0