Image-optimized rolling cache and parallel processing architecture for real-time memory-intensive vision algorithms실시간 메모리 집중적인 비전 알고리즘들을 위한 영상 최적화된 롤링 캐쉬 및 병렬 처리 하드웨어 구조

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 914
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKweon, In-So-
dc.contributor.advisor권인소-
dc.contributor.authorKim, Young-Geun-
dc.contributor.author김영근-
dc.date.accessioned2015-04-23T07:07:11Z-
dc.date.available2015-04-23T07:07:11Z-
dc.date.issued2013-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=561893&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/197127-
dc.description학위논문(박사) - 한국과학기술원 : 로봇공학학제전공, 2013.8, [ viii, 91 p. ]-
dc.description.abstractFPGAs are often used as customized hardware accelerators for real-time image processing and vision applications. However, increasing image resolution requires the use of external SDR/DDR memories, and the arbitrary pixel access patterns used in most algorithms reduce their throughput as a result of increasing access latency. Efficient cache design is paramount in real-time memory-intensive applications. Its effectiveness depends on the spatial and temporal locality of data access. In image processing, the spatial locality denotes the neighboring pixels, located horizontally and vertically in 2-D. However, the conventional caches used in general processors cannot define the vertical locality. We present a rolling cache optimized for image format and algorithms, a method to reduce the miss penalty by moving the cache horizontally and vertically, and a parallel processing architecture with interpolation, multi-level and multiple caches. To support our idea, we compare it with other types of caches and show that the average memory access time and the memory bandwidth are decreased by 28% and 74%, respectively, for a 2048 x 2048 image. Its performance is greater than that of the 16-way set associative cache, but the tag memory size is slightly larger than that of the direct-mapped cache.Using three different experiments, we show that the proposed architecture is applicable to a number of algorithms, if data access follows an arbitrary curve or block-wise pattern, which is the usual case with image processing and vision algorithms. If an application is based on local data access in resource-limited systems such as mobile devices, it is possible to achieve high performance with lower operational frequency using the proposed architecture.To prove the effectiveness of the rolling cache-based parallel processing architecture for camera markets, three major applications are chosen for experiment.The first is random homogeneous transformation for CCTV markets, where the n...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectrolling cache-
dc.subject왜곡 보정-
dc.subjectCMOS 영상센서-
dc.subject영상 안정화-
dc.subject물체 추적-
dc.subject영상 변환-
dc.subjectcomputer vision-
dc.subjecthigh performance system-
dc.subjectpipelined architecture-
dc.subjectInterpolation-
dc.subjecthomogeneous transformation-
dc.subjectobject tracking-
dc.subjectimage stabilization-
dc.subjectCMOS image sensor-
dc.subjectdistortion correction-
dc.subject롤링 캐쉬-
dc.subject컴퓨터 비전-
dc.subject고성능 시스템-
dc.subject파이프라인 구조-
dc.subject영상 보간-
dc.titleImage-optimized rolling cache and parallel processing architecture for real-time memory-intensive vision algorithms-
dc.title.alternative실시간 메모리 집중적인 비전 알고리즘들을 위한 영상 최적화된 롤링 캐쉬 및 병렬 처리 하드웨어 구조-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN561893/325007 -
dc.description.department한국과학기술원 : 로봇공학학제전공, -
dc.identifier.uid020085032-
dc.contributor.localauthorKweon, In-So-
dc.contributor.localauthor권인소-
Appears in Collection
RE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0