EPU: An Energy-Efficient Explainable AI Accelerator With Sparsity-Free Computation and Heat Map Compression/Pruning

DC Field | Value | Language
dc.contributor.author | Kim, Junsoo | ko
dc.contributor.author | Han, Seunghee | ko
dc.contributor.author | Ko, Geonwoo | ko
dc.contributor.author | Kim, Ji-Hoon | ko
dc.contributor.author | Lee, Changha | ko
dc.contributor.author | Kim, Taewoo | ko
dc.contributor.author | Youn, Chan-Hyun | ko
dc.contributor.author | Kim, Joo-Young | ko
dc.date.accessioned | 2024-06-21T05:00:15Z | -
dc.date.available | 2024-06-21T05:00:15Z | -
dc.date.created | 2024-06-18 | -
dc.date.issued | 2024-03 | -
dc.identifier.citation | IEEE JOURNAL OF SOLID-STATE CIRCUITS, v.59, no.3, pp.830 - 841 | -
dc.identifier.issn | 0018-9200 | -
dc.identifier.uri | http://hdl.handle.net/10203/319917 | -
dc.description.abstract | Deep neural networks (DNNs) have recently gained significant prominence in real-world applications such as image recognition, natural language processing, and autonomous vehicles. However, due to their black-box nature, the mechanisms behind their inference results remain opaque to users. To address this challenge, researchers have developed explainable artificial intelligence (AI) algorithms, which aim to provide clear, human-understandable explanations of a model's decisions and thereby build more reliable systems. The explanation task differs from the well-known inference and training processes, however, because it involves interaction with the user. Consequently, existing inference and training accelerators are inefficient when processing explainable AI on edge devices. This article introduces the explainable processing unit (EPU), the first hardware accelerator designed for explainable AI workloads. The EPU uses a novel data compression format for output heat maps and intermediate gradients, which improves overall system performance by reducing both the memory footprint and external memory access. Its sparsity-free computing core handles input sparsity with negligible control overhead, boosting throughput by up to 9.48x. The EPU also employs dynamic workload scheduling with a customized on-chip network for the distinct inference and explanation tasks, maximizing internal data reuse and reducing external memory access by 63.7%. Furthermore, it incorporates point-wise gradient pruning (PGP), which reduces the size of heat maps by a factor of 7.01x when combined with the proposed compression format. Finally, the EPU chip, fabricated in a 28 nm CMOS process, achieves a heat map generation rate of 367 frames/s for ResNet-34 while maintaining state-of-the-art area and energy efficiency of 112.3 GOPS/mm² and 26.55 TOPS/W, respectively. | -
dc.language | English | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.title | EPU: An Energy-Efficient Explainable AI Accelerator With Sparsity-Free Computation and Heat Map Compression/Pruning | -
dc.type | Article | -
dc.identifier.wosid | 001166565600001 | -
dc.identifier.scopusid | 2-s2.0-85184015939 | -
dc.type.rims | ART | -
dc.citation.volume | 59 | -
dc.citation.issue | 3 | -
dc.citation.beginningpage | 830 | -
dc.citation.endingpage | 841 | -
dc.citation.publicationname | IEEE JOURNAL OF SOLID-STATE CIRCUITS | -
dc.identifier.doi | 10.1109/jssc.2023.3346913 | -
dc.contributor.localauthor | Youn, Chan-Hyun | -
dc.contributor.localauthor | Kim, Joo-Young | -
dc.contributor.nonIdAuthor | Han, Seunghee | -
dc.contributor.nonIdAuthor | Ko, Geonwoo | -
dc.contributor.nonIdAuthor | Kim, Ji-Hoon | -
dc.contributor.nonIdAuthor | Kim, Taewoo | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Artificial intelligence | -
dc.subject.keywordAuthor | Heat maps | -
dc.subject.keywordAuthor | Training | -
dc.subject.keywordAuthor | Convolutional neural networks | -
dc.subject.keywordAuthor | Task analysis | -
dc.subject.keywordAuthor | Labeling | -
dc.subject.keywordAuthor | Semantics | -
dc.subject.keywordAuthor | Convolutional neural network (CNN) | -
dc.subject.keywordAuthor | deep neural network (DNN) | -
dc.subject.keywordAuthor | explainable artificial intelligence (XAI) | -
dc.subject.keywordAuthor | multiple DNN acceleration | -
dc.subject.keywordAuthor | neural processing unit (NPU) | -
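
To make the abstract's heat-map pruning and compression ideas concrete, below is a minimal software sketch, not the paper's hardware implementation. It generates a Grad-CAM-style heat map for ResNet-34 (a representative explainable-AI workload of the kind the abstract describes), applies a point-wise magnitude-based pruning step, and estimates the footprint saving of a simple bitmap compression layout. The choice of Grad-CAM, the layer4 hook, the 0.1-of-maximum threshold, the fp16 payload, and the bitmap format are all illustrative assumptions; the paper's actual PGP criterion and compression format are not specified in this record.

    # conceptual_pgp_sketch.py -- illustrative only, not the EPU datapath
    import torch
    import torchvision.models as models

    model = models.resnet34(weights=None).eval()

    # Capture the last conv stage's activation, as Grad-CAM commonly does for ResNet.
    feats = []
    hook = model.layer4.register_forward_hook(lambda mod, inp, out: feats.append(out))

    x = torch.randn(1, 3, 224, 224)        # stand-in input image
    logits = model(x)
    hook.remove()

    # Gradient of the top-class score with respect to the captured feature map.
    score = logits[0, logits.argmax()]
    grad = torch.autograd.grad(score, feats[0])[0]               # (1, C, H, W)

    # Grad-CAM heat map: gradient-pooled channel weights times the activations.
    weights = grad.mean(dim=(2, 3), keepdim=True)                # (1, C, 1, 1)
    heat = torch.relu((weights * feats[0]).sum(dim=1)).detach()  # (1, H, W)

    # Point-wise pruning, assuming a simple magnitude criterion: zero weak points.
    tau = 0.1 * heat.max()                                       # threshold is an assumption
    pruned = torch.where(heat > tau, heat, torch.zeros_like(heat))

    # Toy bitmap compression: 1 bit per element flags nonzeros; values stored densely.
    mask = pruned > 0
    bitmap_bytes = mask.numel() / 8
    payload_bytes = int(mask.sum()) * 2                          # assume fp16 values
    dense_bytes = pruned.numel() * 2
    print(f"compression ratio ~ {dense_bytes / (bitmap_bytes + payload_bytes):.2f}x")

The ratio printed by this toy format depends on how sparse the pruned heat map is; the 7.01x figure quoted in the abstract refers to PGP combined with the paper's own compression format, not this sketch.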
Appears in Collection
EE-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.
