Efficient Compiler and Run-Time Support for Parallel Irregular Reductions

Cited 17 time in webofscience Cited 22 time in scopus
  • Hit : 362
  • Download : 0
Many scientific applications are comprised of irregular reductions on large data sets. In shared-memory parallel programs, these irregular reductions are typically computed in parallel using replicated buffers, then combined using synchronization. We develop LOCALWRITE, a new technique which partitions irregular reductions so that each processor computes values only for locally assigned data, eliminating the need for buffers or synchronized writes. Computation is replicated if its results are needed on multiple processors. We experimentally evaluate its performance for three irregular codes on a software DSM running on a distributed-memory multiprocessor and two shared-memory multiprocessors while varying connectivity, locality, and adaptivity. Results show LOCALWRITE improves performance significantly compared to using replicated buffers, and can match or exceed explicit message-passing gather/scatter for applications with low locality or high adaptivity. (C) 2000 Elsevier Science B.V. All rights reserved.
Publisher
Elsevier Science Bv
Issue Date
2000-12
Language
English
Article Type
Article
Citation

PARALLEL COMPUTING, v.26, no.13-14, pp.1861 - 1887

ISSN
0167-8191
DOI
10.1016/S0167-8191(00)00062-4
URI
http://hdl.handle.net/10203/70255
Appears in Collection
RIMS Journal Papers
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 17 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0