DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, Hong-Yun | ko |
dc.contributor.author | Kim, Young-Jun | ko |
dc.contributor.author | Kim, Lee-Sup | ko |
dc.date.accessioned | 2013-03-09T20:14:38Z | - |
dc.date.available | 2013-03-09T20:14:38Z | - |
dc.date.created | 2012-04-06 | - |
dc.date.issued | 2012-02 | - |
dc.identifier.citation | IEEE JOURNAL OF SOLID-STATE CIRCUITS, v.47, no.2, pp.518 - 535 | - |
dc.identifier.issn | 0018-9200 | - |
dc.identifier.uri | http://hdl.handle.net/10203/97367 | - |
dc.description.abstract | This paper presents a mobile ray tracing processor (MRTP) with reconfigurable stream multi-processors (RSMPs) for high datapath utilization. The MRTP includes three RSMPs that operate asynchronously in multiple instruction multiple data (MIMD) mode to exploit instruction-level parallelism. Each RSMP is based on a single instruction multiple thread (SIMT) architecture to exploit thread-level parallelism. An RSMP consists of twelve scalar processing elements (SPEs) that run multiple threads synchronously in parallel: twelve scalar threads or four vector threads, depending on the operating mode. The low datapath utilization caused by branch divergence in the SIMT architecture is improved by 19.9% on average by reconfiguring the twelve SPEs between scalar SIMT and vector SIMT, with 0.1% area overhead. Special function instructions account for only 2% to 8% of kernel instructions, so a partial special function unit (PSFU) is implemented instead of a large dedicated SFU. Access conflicts at the look-up table (LUT) caused by concurrent accesses from the twelve SPEs are reduced by a table loader (TBLD). The TBLD monitors concurrent requests from the twelve SPEs and reduces the number of LUT accesses by distributing a coefficient to multiple SPEs with only one read access to the LUT. The MRTP, with an area of 4 × 4 mm², has been fabricated in 0.13 µm CMOS technology. It achieves a peak performance of 673 K rays per second while consuming 156 mW at 100 MHz with V_DD = 1.2 V. | - |
dc.language | English | - |
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | - |
dc.title | MRTP: Mobile Ray Tracing Processor With Reconfigurable Stream Multi-Processors for High Datapath Utilization | - |
dc.type | Article | - |
dc.identifier.wosid | 000299724500014 | - |
dc.identifier.scopusid | 2-s2.0-84856467368 | - |
dc.type.rims | ART | - |
dc.citation.volume | 47 | - |
dc.citation.issue | 2 | - |
dc.citation.beginningpage | 518 | - |
dc.citation.endingpage | 535 | - |
dc.citation.publicationname | IEEE JOURNAL OF SOLID-STATE CIRCUITS | - |
dc.identifier.doi | 10.1109/JSSC.2011.2171417 | - |
dc.contributor.localauthor | Kim, Lee-Sup | - |
dc.type.journalArticle | Article | - |
dc.subject.keywordAuthor | Many-core system | - |
dc.subject.keywordAuthor | mobile processor | - |
dc.subject.keywordAuthor | ray tracing | - |
dc.subject.keywordAuthor | SIMD | - |
dc.subject.keywordAuthor | 3D graphics | - |
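
The table loader (TBLD) behaviour described in the abstract, one LUT read shared among all SPEs that request the same coefficient, can be illustrated with a small software sketch. This is not the paper's hardware design: the function name `coalesce_lut_requests`, the list-based LUT, and the use of `None` for an idle SPE are all assumptions made for illustration. Only the count of twelve SPEs comes from the abstract.

```python
from collections import defaultdict

NUM_SPES = 12  # from the abstract: twelve SPEs per RSMP


def coalesce_lut_requests(requests, lut):
    """Serve per-SPE LUT requests with one read per distinct index.

    requests: list of LUT indices, one entry per SPE (None = no request).
    lut:      list of coefficients (hypothetical stand-in for the hardware LUT).
    Returns (results, reads): the coefficient delivered to each SPE and the
    number of LUT reads actually performed.
    """
    # Group requesting SPEs by the index they ask for.
    spes_by_index = defaultdict(list)
    for spe_id, index in enumerate(requests):
        if index is not None:
            spes_by_index[index].append(spe_id)

    results = [None] * len(requests)
    reads = 0
    for index, spe_ids in spes_by_index.items():
        coeff = lut[index]  # a single read for this index
        reads += 1
        for spe_id in spe_ids:  # broadcast to every SPE that asked for it
            results[spe_id] = coeff
    return results, reads


if __name__ == "__main__":
    lut = [10, 20, 30, 40]
    requests = [0, 0, 1, None, 1, 1, 3, 3, 3, 3, 0, None]
    assert len(requests) == NUM_SPES
    results, reads = coalesce_lut_requests(requests, lut)
    print(results, reads)
```

In this sketch, ten requesting SPEs that touch three distinct indices cost three reads rather than ten. That is the kind of access-count reduction the abstract attributes to the TBLD, although the actual arbitration logic in the chip may differ.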
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.