Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks

Cited 101 times in Web of Science · Cited 0 times in Scopus
  • Hits: 163
  • Downloads: 0
DC Field | Value | Language
dc.contributor.author | Rhu, Minsoo | ko
dc.contributor.author | O'Connor, Mike | ko
dc.contributor.author | Chatterjee, Niladrish | ko
dc.contributor.author | Pool, Jeff | ko
dc.contributor.author | Kwon, Youngeun | ko
dc.contributor.author | Keckler, Steve | ko
dc.date.accessioned | 2018-12-20T02:18:32Z | -
dc.date.available | 2018-12-20T02:18:32Z | -
dc.date.created | 2018-11-29 | -
dc.date.issued | 2018-02-26 | -
dc.identifier.citation | 24th IEEE International Symposium on High Performance Computer Architecture, HPCA 2018, pp.78 - 91 | -
dc.identifier.issn | 1530-0897 | -
dc.identifier.uri | http://hdl.handle.net/10203/247539 | -
dc.description.abstract | Popular deep learning frameworks require users to fine-tune their memory usage so that the training data of a deep neural network (DNN) fits within the GPU physical memory. Prior work tries to address this restriction by virtualizing the memory usage of DNNs, enabling both CPU and GPU memory to be utilized for memory allocations. Despite its merits, virtualizing memory can incur significant performance overheads when the time needed to copy data back and forth from CPU memory is higher than the latency to perform DNN computations. We introduce a high-performance virtualization strategy based on a 'compressing DMA engine' (cDMA) that drastically reduces the size of the data structures that are targeted for CPU-side allocations. The cDMA engine offers an average 2.6x (maximum 13.8x) compression ratio by exploiting the sparsity inherent in offloaded data, improving the performance of virtualized DNNs by an average 53% (maximum 79%) when evaluated on an NVIDIA Titan Xp. | -
dc.language | English | -
dc.publisher | IEEE Computer Society | -
dc.title | Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks | -
dc.type | Conference | -
dc.identifier.wosid | 000440297700007 | -
dc.identifier.scopusid | 2-s2.0-85046798314 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 78 | -
dc.citation.endingpage | 91 | -
dc.citation.publicationname | 24th IEEE International Symposium on High Performance Computer Architecture, HPCA 2018 | -
dc.identifier.conferencecountry | AU | -
dc.identifier.conferencelocation | Hotel Pyramide Congress Center, Vienna | -
dc.identifier.doi | 10.1109/HPCA.2018.00017 | -
dc.contributor.localauthor | Rhu, Minsoo | -
dc.contributor.nonIdAuthor | O'Connor, Mike | -
dc.contributor.nonIdAuthor | Chatterjee, Niladrish | -
dc.contributor.nonIdAuthor | Pool, Jeff | -
dc.contributor.nonIdAuthor | Keckler, Steve | -
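
The abstract above attributes cDMA's gains to the sparsity of the activation maps offloaded to CPU memory: ReLU outputs are largely zeros, so a compressed payload can cross the CPU-GPU link instead of the dense tensor. As a rough illustration of that idea only, the sketch below applies a simple zero-value (bitmask-plus-nonzeros) compression to a synthetic activation tensor in NumPy; the function names and the bitmask layout are illustrative assumptions, not the paper's actual hardware compression scheme.

```python
import numpy as np

def zvc_compress(activations: np.ndarray):
    """Zero-value compression sketch: keep a 1-bit-per-element mask plus
    the nonzero values. Offloaded ReLU activations are mostly zeros, so
    the (mask, nonzeros) pair is much smaller than the dense tensor."""
    flat = activations.ravel()
    mask = flat != 0                      # which elements are nonzero
    packed_mask = np.packbits(mask)       # 8 mask bits per byte
    nonzeros = flat[mask]                 # only nonzero values are kept
    return packed_mask, nonzeros, activations.shape

def zvc_decompress(packed_mask, nonzeros, shape, dtype=np.float32):
    """Rebuild the dense activation tensor from the mask and nonzeros."""
    n = int(np.prod(shape))
    mask = np.unpackbits(packed_mask)[:n].astype(bool)
    flat = np.zeros(n, dtype=dtype)
    flat[mask] = nonzeros
    return flat.reshape(shape)

if __name__ == "__main__":
    # Simulate a sparse ReLU activation map (roughly 70% zeros).
    acts = np.maximum(np.random.randn(64, 128).astype(np.float32) - 0.5, 0)
    mask, vals, shape = zvc_compress(acts)
    assert np.array_equal(zvc_decompress(mask, vals, shape), acts)
    ratio = acts.nbytes / (mask.nbytes + vals.nbytes)
    print(f"compression ratio: {ratio:.2f}x")
```

For a float32 tensor with about 70% zeros, the mask costs n/8 bytes and the values about 0.3·4n bytes, i.e. roughly a 3x reduction, which is in the same range as the 2.6x average the abstract reports; the real cDMA engine performs the (de)compression in hardware on the GPU's DMA path rather than in software.
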
Appears in Collection
EE-Conference Papers (학술회의논문)
Files in This Item
There are no files associated with this item.
