Location-Aware Computation Mapping for Manycore Processors

Cited 0 time in webofscience Cited 10 time in scopus
  • Hit : 143
  • Download : 0
Employing an on-chip network in a manycore system (to improve scalability) makes the latencies of data accesses issued by a core non-uniform, which significant impact application performance. This paper presents a compiler strategy which involves exposing architecture information to the compiler to enable optimized computation-to-core mapping. Our scheme takes into account the relative positions of (and distances between) cores, last-level caches (LLCs) and memory controllers (MCs) in a manycore system, and generates a mapping of computations to cores with the goal of minimizing the on-chip network traffic. Our experiments of 12 multi-threaded applications reveal that, on average, our approach reduces the on-chip network latency in a 6x6 manycore system by 49.5% in the case of private LLCs and 52.7% in the case of shared LLCs. These improvements translate to the corresponding execution time improvements of 14.8% and 15.2% for the private LLC and shared LLC based systems.
Publisher
ACM and IEEE Computer Society
Issue Date
2017-09-09
Language
English
Citation

26th International Conference on Parallel Architectures and Compilation Techniques, PACT 2017, pp.138 - 139

DOI
10.1109/PACT.2017.20
URI
http://hdl.handle.net/10203/269609
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0