Quantization-Error-Robust Deep Neural Network for Embedded Accelerators

Cited 2 time in webofscience Cited 0 time in scopus
  • Hit : 156
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorJung, Youngbeomko
dc.contributor.authorKim, Hyeonukko
dc.contributor.authorChoi, Yeongjaeko
dc.contributor.authorKim, Lee-Supko
dc.date.accessioned2022-02-14T06:41:20Z-
dc.date.available2022-02-14T06:41:20Z-
dc.date.created2021-11-25-
dc.date.created2021-11-25-
dc.date.created2021-11-25-
dc.date.issued2022-02-
dc.identifier.citationIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, v.69, no.2, pp.609 - 613-
dc.identifier.issn1549-7747-
dc.identifier.urihttp://hdl.handle.net/10203/292219-
dc.description.abstractQuantization with low precision has become an essential technique for adopting deep neural networks in energy-and memory-constrained devices. However, there is a limit to the reducing precision by the inevitable loss of accuracy due to the quantization error. To overcome this obstacle, we propose methods reforming and quantizing a network that achieves high accuracy even at low precision without any runtime overhead in embedded accelerators. Our proposition consists of two analytical approaches: 1) network optimization to find the most error-resilient equivalent network in the precision-constrained environment and 2) quantization exploiting adaptive rounding offset control. The experimental results show accuracies of up to 98.31% and 99.96% of floating-point results in 6-bit and 8-bit quantization networks, respectively. Besides, our methods allow the lower precision accelerator design, reducing the energy consumption by 8.5%.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleQuantization-Error-Robust Deep Neural Network for Embedded Accelerators-
dc.typeArticle-
dc.identifier.wosid000748372000074-
dc.identifier.scopusid2-s2.0-85124311225-
dc.type.rimsART-
dc.citation.volume69-
dc.citation.issue2-
dc.citation.beginningpage609-
dc.citation.endingpage613-
dc.citation.publicationnameIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS-
dc.identifier.doi10.1109/TCSII.2021.3103192-
dc.contributor.localauthorKim, Lee-Sup-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorQuantization (signal)Adaptive systemsTrainingSignal resolutionNeural networksDeep learningCircuits and systemsDeep neural networkacceleratorquantizationrescaling equivalentadaptive rounding offset control-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0