Eliminating Barrier Synchronization for Compiler-Parallelized Codes on Software DSMs

Cited 10 times in Web of Science; cited 13 times in Scopus
Software distributed-shared-memory (DSM) systems provide an appealing target for parallelizing compilers due to their flexibility. Previous studies demonstrate such systems can provide performance comparable to message-passing compilers for dense-matrix kernels. However, synchronization and load imbalance are significant sources of overhead. In this paper, we investigate the impact of compilation techniques for eliminating barrier synchronization overhead in software DSMs. Our compile-time barrier elimination algorithm extends previous techniques in three ways: (1) we perform inexpensive communication analysis through local subscript analysis when using chunk iteration partitioning for parallel loops; (2) we exploit delayed updates in lazy-release-consistency DSMs to eliminate barriers guarding only anti-dependences; (3) when possible we replace barriers with customized nearest-neighbor synchronization. Experiments on an IBM SP-2 indicate these techniques can improve parallel performance by 20% on average and by up to 60% for some applications.
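As an illustration of technique (3), the sketch below replaces a global barrier with nearest-neighbor synchronization for a three-point stencil whose iterations are chunk-partitioned. It is not the paper's algorithm or runtime: threads stand in for DSM nodes, and the names `chunk_bounds`, `stencil_reference`, and `stencil_neighbor_sync` are hypothetical. The assumption is that, because each chunk reads only one element beyond its own bounds, a worker needs to wait only for its left and right neighbors before starting the next step.

```python
import threading


def chunk_bounds(n, nprocs, p):
    """Contiguous block [lo, hi) of iterations owned by worker p."""
    size = (n + nprocs - 1) // nprocs
    lo = min(p * size, n)
    return lo, min(lo + size, n)


def stencil_reference(data, steps):
    """Sequential three-point averaging stencil with fixed end values."""
    old = list(data)
    for _ in range(steps):
        new = list(old)
        for i in range(1, len(old) - 1):
            new[i] = (old[i - 1] + old[i] + old[i + 1]) / 3.0
        old = new
    return old


def stencil_neighbor_sync(data, steps, nprocs):
    """Same stencil, run by nprocs threads that wait only on neighbors."""
    n = len(data)
    bufs = [list(data), list(data)]  # double buffering; end values never change
    done = [0] * nprocs              # done[p] = steps completed by worker p
    cond = threading.Condition()

    def neighbors_reached(p, t):
        # Worker p may start step t once both neighbors have completed t steps.
        left_ok = p == 0 or done[p - 1] >= t
        right_ok = p == nprocs - 1 or done[p + 1] >= t
        return left_ok and right_ok

    def worker(p):
        lo, hi = chunk_bounds(n, nprocs, p)
        for t in range(steps):
            with cond:
                cond.wait_for(lambda: neighbors_reached(p, t))
            old, new = bufs[t % 2], bufs[(t + 1) % 2]
            for i in range(max(lo, 1), min(hi, n - 1)):
                new[i] = (old[i - 1] + old[i] + old[i + 1]) / 3.0
            with cond:
                done[p] = t + 1
                cond.notify_all()

    threads = [threading.Thread(target=worker, args=(p,)) for p in range(nprocs)]
    for th in threads:
        th.start()
    for th in threads:
        th.join()
    return bufs[steps % 2]


if __name__ == "__main__":
    data = [float(i * i % 7) for i in range(16)]
    result = stencil_neighbor_sync(data, steps=5, nprocs=4)
    print(result == stencil_reference(data, steps=5))
```

In this sketch, waiting for both neighbors to finish step t covers the true dependence (their boundary values are ready to read) and the anti-dependence (they have finished reading the buffer about to be overwritten). Workers that are not adjacent can drift apart by more than one step, which a global barrier would not allow.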
Publisher
Springer/Plenum Publishers
Issue Date
1998-10
Language
English
Article Type
Article
Citation

International Journal of Parallel Programming, vol. 26, no. 5, pp. 591-612

ISSN
0885-7458
DOI
10.1023/A:1018724631720
URI
http://hdl.handle.net/10203/70997
Appears in Collection
RIMS Journal Papers