Bandwidth-efficient mobile geometry processor with tessellation functionality and power-saving techniques (A study on a tessellation-capable mobile geometry processor for efficient memory bandwidth use and reduced power consumption)

DC Field | Value | Language
dc.contributor.advisor | Kim, Lee-Sup | -
dc.contributor.advisor | 김이섭 | -
dc.contributor.author | Chung, Kyu-Sik | -
dc.contributor.author | 정규식 | -
dc.date.accessioned | 2011-12-14 | -
dc.date.available | 2011-12-14 | -
dc.date.issued | 2009 | -
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=327765&flag=dissertation | -
dc.identifier.uri | http://hdl.handle.net/10203/35532 | -
dc.description | Thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical Engineering, 2009. 8., [ ix, 122 p. ] | -
dc.description.abstract | 3D graphics hardware for mobile multimedia devices should be implemented within limited memory bandwidth, area, and power budgets. Among various bandwidth-saving techniques, tessellation reduces the amount of geometry data transfer by generating highly detailed geometry from coarse meshes inside the 3D graphics hardware. Despite its obvious effectiveness, only a few high-performance gaming systems have integrated dedicated tessellators with an additional floating-point datapath and complex control logic. In this thesis, we propose the architecture of a shader-based tessellator for mobile 3D graphics. The proposed tessellator is implemented with a negligible hardware penalty because the floating-point computations of tessellation are accelerated by the existing GPU pipeline and only tessellation-specific control logic is handled by an additional hardware unit. Tightly coupled with a vertex shader, the additional unit dynamically produces topological configurations and parametric coordinates of refinement patterns in the form of indexed triangle strips for object-level adaptive tessellation. The crack-free topological configurations improve the efficiency of a vertex cache so as to avoid redundant shader operations. In addition to the tessellation functionality, the shader architecture is enhanced for area and energy efficiency as well as higher performance. The latency of the floating-point datapath is reduced by adopting fast DP4 units. The floating-point computations of the special function unit are also performed by the DP4 units to improve area efficiency. Clock gating, applied through both a tool-based automatic method and manual clock-gating cell insertion, reduces unnecessary power dissipation of idle modules. We additionally reduce redundant on-chip memory accesses by utilizing the operational characteristics of the multi-threaded shader architecture and reducing the size of frequently accessed general-purpose registers. The proposed geometry processor is fabricated on three chips... | eng
dc.language | eng | -
dc.publisher | Korea Advanced Institute of Science and Technology (KAIST) | -
dc.subject | 3D graphics | -
dc.subject | GPU | -
dc.subject | shader | -
dc.subject | tessellation | -
dc.subject | VLSI | -
dc.subject | 3차원 그래픽스 | -
dc.subject | 쉐이더 | -
dc.subject | 테셀레이션 | -
dc.title | Bandwidth-efficient mobile geometry processor with tessellation functionality and power-saving techniques | -
dc.title.alternative | A study on a tessellation-capable mobile geometry processor for efficient memory bandwidth use and reduced power consumption | -
dc.type | Thesis(Ph.D) | -
dc.identifier.CNRN | 327765/325007 | -
dc.description.department | Korea Advanced Institute of Science and Technology (KAIST) : Department of Electrical Engineering | -
dc.identifier.uid | 020035271 | -
dc.contributor.localauthor | Kim, Lee-Sup | -
dc.contributor.localauthor | 김이섭 | -
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.
