NeuMMU: Architectural Support for Efficient Address Translations in Neural Processing Units

Cited 23 times in Web of Science; cited 12 times in Scopus
To satisfy the compute and memory demands of deep neural networks (DNNs), neural processing units (NPUs) are being widely adopted to accelerate DNNs. Just as GPUs evolved from slave devices into a mainstream processor architecture, NPUs are likely to become first-class citizens in this fast-evolving heterogeneous architecture space. This paper makes a case for enabling address translation in NPUs to decouple the virtual and physical memory address spaces. Through a careful, data-driven application characterization study, we root-cause several limitations of prior GPU-centric address translation schemes and propose a memory management unit (MMU) tailored for NPUs. Compared to an oracular MMU design point, our proposal incurs an average performance overhead of only 0.06%.
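To illustrate the address translation the abstract refers to, below is a minimal, generic sketch of virtual-to-physical translation through a TLB with a page-table fallback. This is not the paper's NPU-specific MMU design; the page size, table layout, and names here are illustrative assumptions only.

```python
PAGE_SIZE = 4096  # 4 KiB pages: a common choice, not necessarily the paper's

# Toy page table mapping virtual page numbers (VPNs) to physical frame numbers (PFNs).
page_table = {0x0: 0x7, 0x1: 0x3, 0x2A: 0x11}

tlb = {}  # VPN -> PFN cache; a real MMU uses a small associative hardware structure

def translate(vaddr):
    """Translate a virtual address: TLB lookup first, page-table walk on a miss."""
    vpn, offset = divmod(vaddr, PAGE_SIZE)
    if vpn in tlb:                # TLB hit: fast path
        pfn = tlb[vpn]
    else:                         # TLB miss: walk the page table, then fill the TLB
        pfn = page_table[vpn]     # raises KeyError on an unmapped page (page fault)
        tlb[vpn] = pfn
    return pfn * PAGE_SIZE + offset

# The page offset is preserved; only the page number is remapped.
assert translate(0x2A * PAGE_SIZE + 0x10) == 0x11 * PAGE_SIZE + 0x10
```

The cost of the miss path (the page-table walk) is exactly what an MMU design tries to hide; the paper's characterization targets how NPU memory-access patterns stress this path differently than GPU workloads do.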
Publisher
ACM
Issue Date
2020-03-20
Language
English
Citation

The 25th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-25), pp. 1109-1124

DOI
10.1145/3373376.3378494
URI
http://hdl.handle.net/10203/276229
Appears in Collection
EE-Conference Papers (Conference Papers)
Files in This Item
There are no files associated with this item.