MRTP: Mobile Ray Tracing Processor With Reconfigurable Stream Multi-Processors for High Datapath Utilization

Cited 14 time in webofscience Cited 0 time in scopus
  • Hit : 306
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Hong-Yunko
dc.contributor.authorKim, Young-Junko
dc.contributor.authorKim, Lee-Supko
dc.date.accessioned2013-03-09T20:14:38Z-
dc.date.available2013-03-09T20:14:38Z-
dc.date.created2012-04-06-
dc.date.created2012-04-06-
dc.date.issued2012-02-
dc.identifier.citationIEEE JOURNAL OF SOLID-STATE CIRCUITS, v.47, no.2, pp.518 - 535-
dc.identifier.issn0018-9200-
dc.identifier.urihttp://hdl.handle.net/10203/97367-
dc.description.abstractThis paper presents a mobile ray tracing processor (MRTP) with reconfigurable stream multi-processors (RSMPs) for high datapath utilization. The MRTP includes three RSMPs that operate in multiple instruction multiple data (MIMD) mode asynchronously to exploit instruction-level parallelism. Each RSMP is based on single instruction multiple thread (SIMT) architecture to exploit thread-level parallelism. An RSMP consists of twelve scalar processing elements (SPEs) that run multiple threads in parallel synchronously: twelve scalar threads or four vector threads depending on an operating mode. A low datapath utilization caused by a branch divergence in SIMT architecture is improved by 19.9% on average by reconfiguring twelve SPEs between scalar SIMT and vector SIMT with 0.1% area overheads. Special function instructions occupy only 2% similar to 8% of kernel instructions so that a partial special function unit (PSFU) is implemented instead of a large dedicated SFU. The access conflicts with a look-up table (LUT) caused by concurrent accesses of twelve SPEs are reduced by a table loader (TBLD). The TBLD monitors concurrent requests from twelve SPEs and reduces an access count to LUT by distributing a coefficient to multiple SPEs with only one read-access to LUT. MRTP with area of 4 x 4 mm(2) has been fabricated in 0.13 mu m CMOS technology. MRTP achieves a peak performance of 673 K rays per second while consuming 156 mW at 100 MHz with V-DD = 1.2 V.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleMRTP: Mobile Ray Tracing Processor With Reconfigurable Stream Multi-Processors for High Datapath Utilization-
dc.typeArticle-
dc.identifier.wosid000299724500014-
dc.identifier.scopusid2-s2.0-84856467368-
dc.type.rimsART-
dc.citation.volume47-
dc.citation.issue2-
dc.citation.beginningpage518-
dc.citation.endingpage535-
dc.citation.publicationnameIEEE JOURNAL OF SOLID-STATE CIRCUITS-
dc.identifier.doi10.1109/JSSC.2011.2171417-
dc.contributor.localauthorKim, Lee-Sup-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorMany-core system-
dc.subject.keywordAuthormobile processor-
dc.subject.keywordAuthorray tracing-
dc.subject.keywordAuthorSIMD-
dc.subject.keywordAuthor3D graphics-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 14 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0