Efficient task-mapping of parallel applications using a space-filling curve

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 72
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKwon, Oh-Kyoungko
dc.contributor.authorKang, Ji-Hoonko
dc.contributor.authorLee, Seungchulko
dc.contributor.authorKim, Wonjungko
dc.contributor.authorSong, Junehwako
dc.date.accessioned2023-09-13T03:03:04Z-
dc.date.available2023-09-13T03:03:04Z-
dc.date.created2023-09-13-
dc.date.issued2022-10-
dc.identifier.citation31st International Conference on Parallel Architectures and Compilation Techniques, PACT 2022, pp.384 - 397-
dc.identifier.urihttp://hdl.handle.net/10203/312570-
dc.description.abstractImproving the communication performance of parallel programs is an important but difcult problem in a large-scale distributed memory-based cluster. Efforts to improve parallel scalability often face severe huddles in managing communication overheads. This paper proposes a framework of a space-fling curve(SFC)-based task-remapping for communication intensive parallel applications. An SFC-based mapping, when applied for task-mapping of parallel applications preserves locality in terms of communications and produce a less fragmented task-mapping, reducing communication overheads. The framework also provides tools for performance analysis to see if the proposed task-mapping is appropriate for a given application running on a target system. It further develops a binary classifer as a predictor to decide whether or not to apply the proposed mapping before run-time. We evaluate the framework with three communication intensive applications in Cartesian coordinates: P3DFFT solver and Channel code using 2D domain decomposition model, and Poisson solver using 3D domain decomposition. The evaluation is conducted on a large-scale cluster system of fat-tree topology with up to 1,024 compute nodes. The proposed task-mapping achieves the overall performance improvement ranging from ∼30% to ∼66% over the baseline approach depending on the workloads. Also, when used in combination with the binary classifer-based predictor, it achieves the expected performance gains from 4% to 8%.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleEfficient task-mapping of parallel applications using a space-filling curve-
dc.typeConference-
dc.identifier.scopusid2-s2.0-85147330600-
dc.type.rimsCONF-
dc.citation.beginningpage384-
dc.citation.endingpage397-
dc.citation.publicationname31st International Conference on Parallel Architectures and Compilation Techniques, PACT 2022-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationChicago, IL-
dc.identifier.doi10.1145/3559009.3569657-
dc.contributor.localauthorSong, Junehwa-
dc.contributor.nonIdAuthorKwon, Oh-Kyoung-
dc.contributor.nonIdAuthorKang, Ji-Hoon-
dc.contributor.nonIdAuthorLee, Seungchul-
dc.contributor.nonIdAuthorKim, Wonjung-
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0