A Low-Power Deep Neural Network Online Learning Processor for Real-Time Object Tracking Application

Cited 32 time in webofscience Cited 26 time in scopus
  • Hit : 506
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorHan, Donghyeonko
dc.contributor.authorLee, Jinsuko
dc.contributor.authorLee, Jinmookko
dc.contributor.authorYoo, Hoi-Junko
dc.date.accessioned2019-05-15T13:25:23Z-
dc.date.available2019-05-15T13:25:23Z-
dc.date.created2019-05-13-
dc.date.created2019-05-13-
dc.date.issued2019-05-
dc.identifier.citationIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, v.66, no.5, pp.1794 - 1804-
dc.identifier.issn1549-8328-
dc.identifier.urihttp://hdl.handle.net/10203/261860-
dc.description.abstractA deep neural network (DNN) online learning processor is proposed with high throughput and low power consumption to achieve real-time object tracking in mobile devices. Four key features enable a low-power DNN online learning. First, a proposed processor is designed with a unified core architecture and it achieves 1.33x higher throughput than the previous state-of-the-art DNN learning processor. Second, the new algorithms, binary feedback alignment (BFA), and dynamic fixed-point based run-length compression (RLC), are proposed and reduce power consumption through the reduction of external memory accesses (EMA). The BFA and dynamic fixed-point-based RLC reduce the EMA by 11.4% and 32.5%, respectively. Third, the new data feeding units, including an integral RLC (iRLC) decoder and a transpose RLC (tRLC) decoder, are co-designed to maximize throughput alongside the proposed algorithms. Finally, a dropout controller in this processor reduces redundant power consumption coming from the unified core and the data feeding architecture by the proposed dynamic clock-gating scheme. This enables the proposed processor to operate DNN online learning with 38.1% lower power consumption. Implemented with 65 nm CMOS technology, the 3.52 mm(2) DNN online learning processor shows 126 mW power consumption and the processor achieves 30.4 frames-per-second throughput in the object tracking application.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleA Low-Power Deep Neural Network Online Learning Processor for Real-Time Object Tracking Application-
dc.typeArticle-
dc.identifier.wosid000465305700013-
dc.identifier.scopusid2-s2.0-85057824082-
dc.type.rimsART-
dc.citation.volume66-
dc.citation.issue5-
dc.citation.beginningpage1794-
dc.citation.endingpage1804-
dc.citation.publicationnameIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS-
dc.identifier.doi10.1109/TCSI.2018.2880363-
dc.contributor.localauthorYoo, Hoi-Jun-
dc.contributor.nonIdAuthorLee, Jinsu-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle; Proceedings Paper-
dc.subject.keywordAuthorDeep neural network-
dc.subject.keywordAuthoronline learning-
dc.subject.keywordAuthorobject tracking-
dc.subject.keywordAuthorfeedback alignment-
dc.subject.keywordAuthorrun-length compression-
dc.subject.keywordAuthordynamic fixed-point representation-
dc.subject.keywordAuthordropout-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 32 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0