DSpace at KOASAS: Agamotto: A Performance Optimization Framework for CNN Accelerator With Row Stationary Dataflow

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Journal Papers(저널논문)

Agamotto: A Performance Optimization Framework for CNN Accelerator With Row Stationary Dataflow

Cited 4 time in

Cited 0 time in

Hit : 177
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Kim, Donghyuk	ko
dc.contributor.author	Jeong, Sanghyun	ko
dc.contributor.author	Kim, Joo-Young	ko
dc.date.accessioned	2023-06-07T07:00:42Z	-
dc.date.available	2023-06-07T07:00:42Z	-
dc.date.created	2023-04-17	-
dc.date.issued	2023-06	-
dc.identifier.citation	IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, v.70, no.6, pp.2487 - 2496	-
dc.identifier.issn	1549-8328	-
dc.identifier.uri	http://hdl.handle.net/10203/307089	-
dc.description.abstract	We propose a software/hardware co-design framework called Agamotto for the complete design automation and performance optimization of the row stationary-based CNN accelerator. We design a scalable accelerator template whose critical design parameters can be configured. Based on the hardware template, Agamotto estimates the performance of the numerous possible hardware implementations for the target FPGA device and CNN model using the latency modeling tool. It chooses the best hardware design and generates the instructions and optimal runtime variables for each target CNN layer. As a result, Agamotto can generate the best hardware design within 61.67 seconds, achieving up to 2.8x higher hardware utilization than the original accelerator. In addition, experimental results show that the performance estimation is accurate, showing only 4.8% difference against the FPGA runtime for the end-to-end CNN model execution. The accelerator implemented on the Xilinx VCU118 evaluation board achieves 402 giga operations per second (GOPS) at 200 MHz, resulting in 13 frames per second (FPS) for the end-to-end execution of VGG-16. It is flexible enough to run more complex CNN models such as ResNet-50 and DarkNet-53, achieving 29.3 FPS and 16.9 FPS, respectively.	-
dc.language	English	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	Agamotto: A Performance Optimization Framework for CNN Accelerator With Row Stationary Dataflow	-
dc.type	Article	-
dc.identifier.wosid	000958818900001	-
dc.identifier.scopusid	2-s2.0-85151491443	-
dc.type.rims	ART	-
dc.citation.volume	70	-
dc.citation.issue	6	-
dc.citation.beginningpage	2487	-
dc.citation.endingpage	2496	-
dc.citation.publicationname	IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS	-
dc.identifier.doi	10.1109/TCSI.2023.3258411	-
dc.contributor.localauthor	Kim, Joo-Young	-
dc.contributor.nonIdAuthor	Jeong, Sanghyun	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Hardware	-
dc.subject.keywordAuthor	Convolutional neural networks	-
dc.subject.keywordAuthor	Convolution	-
dc.subject.keywordAuthor	Computational modeling	-
dc.subject.keywordAuthor	Field programmable gate arrays	-
dc.subject.keywordAuthor	Data models	-
dc.subject.keywordAuthor	Arrays	-
dc.subject.keywordAuthor	CNN	-
dc.subject.keywordAuthor	row stationary dataflow	-
dc.subject.keywordAuthor	mapping strategy	-
dc.subject.keywordAuthor	performance optimization	-
dc.subject.keywordAuthor	software	-
dc.subject.keywordAuthor	hardware co-design	-

Appears in Collection: EE-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 4 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Agamotto: A Performance Optimization Framework for CNN Accelerator With Row Stationary Dataflow

This item is cited by other documents in WoS

KOASAS

Communities & Collections