Image-Optimized Rolling Cache: Reducing the Miss Penalty for Memory-Intensive Vision Algorithms

Cited 2 time in webofscience Cited 2 time in scopus
  • Hit : 403
  • Download : 4
DC FieldValueLanguage
dc.contributor.authorKim, Young-Geunko
dc.contributor.authorKweon, In-Soko
dc.date.accessioned2014-08-27T01:10:16Z-
dc.date.available2014-08-27T01:10:16Z-
dc.date.created2013-10-11-
dc.date.created2013-10-11-
dc.date.issued2014-03-
dc.identifier.citationIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, v.24, no.3, pp.539 - 551-
dc.identifier.issn1051-8215-
dc.identifier.urihttp://hdl.handle.net/10203/187303-
dc.description.abstractIn real-time memory-intensive image processing and vision applications, increasing image resolution requires the use of external SDR/DDR memories. However, the arbitrary pixel access patterns used in most algorithms reduce their memory throughput as a result of increasing access latency. Efficient cache design is paramount in real-time memory-intensive applications. Its effectiveness depends on the spatial and temporal locality of data access. In image processing, the spatial locality denotes the neighboring pixels, located horizontally and vertically in 2-D. However, the conventional caches used in general processors cannot define the vertical locality. We propose a rolling cache optimized for image format and algorithms, a method to reduce the miss penalty by moving the cache horizontally and vertically, and a parallel processing architecture with interpolation, multilevel and multiple caches. To support our idea, we compare it with other types of caches and show that the average memory access time and the memory bandwidth are decreased by 28% and 74%, respectively, for a 2048x2048 image. Its performance is greater than that of the 16-way set associative cache, but the tag memory size is a bit larger than that of the direct-mapped cache. Using two different applications, we show that the proposed architecture is applicable to a number of algorithms if data access follows an arbitrary curve or block-wise pattern, which is the usual case with image processing and vision algorithms. If an application is based on local data access in resource-limited systems, it is possible to achieve high performance with lower operational frequency using the proposed architecture.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.subjectREAL-TIME-
dc.subjectBARREL DISTORTION-
dc.subjectARCHITECTURE-
dc.subjectPERFORMANCE-
dc.subjectTRACKING-
dc.subjectFPGA-
dc.titleImage-Optimized Rolling Cache: Reducing the Miss Penalty for Memory-Intensive Vision Algorithms-
dc.typeArticle-
dc.identifier.wosid000334522000016-
dc.identifier.scopusid2-s2.0-84896532866-
dc.type.rimsART-
dc.citation.volume24-
dc.citation.issue3-
dc.citation.beginningpage539-
dc.citation.endingpage551-
dc.citation.publicationnameIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY-
dc.identifier.doi10.1109/TCSVT.2013.2278144-
dc.embargo.liftdate9999-12-31-
dc.embargo.terms9999-12-31-
dc.contributor.localauthorKweon, In-So-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorCache-
dc.subject.keywordAuthorcomputer vision-
dc.subject.keywordAuthorhigh performance system-
dc.subject.keywordAuthorhomogeneous transformation-
dc.subject.keywordAuthorimage processing-
dc.subject.keywordAuthorinterpolation-
dc.subject.keywordAuthorobject tracking-
dc.subject.keywordAuthorpipelined architecture-
dc.subject.keywordAuthorrolling cache-
dc.subject.keywordPlusREAL-TIME-
dc.subject.keywordPlusBARREL DISTORTION-
dc.subject.keywordPlusTRACKING-
dc.subject.keywordPlusARCHITECTURE-
dc.subject.keywordPlusPERFORMANCE-
dc.subject.keywordPlusFPGA-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0