Amortised deep parameter optimisation of GPGPU work group size for OpenCV

Cited 4 time in webofscience Cited 0 time in scopus
  • Hit : 32
  • Download : 0
GPGPU (General Purpose computing on Graphics Processing Units) enables massive parallelism by taking advantage of the Single Instruction Multiple Data (SIMD) architecture of the large number of cores found on modern graphics cards. A parameter called local work group size controls how many work items are concurrently executed on a single compute unit. Though critical to the performance, there is no deterministic way to tune it, leaving developers to manual trial and error. This paper applies amortised optimisation to determine the best local work group size for GPGPU implementations of OpenCV template matching feature. The empirical evaluation shows that optimised local work group size can outperform the default value with large effect sizes.
Publisher
Springer Verlag
Issue Date
2016-10
Language
English
Citation

8th International Symposium on Search Based Software Engineering, SSBSE 2016, pp.211 - 217

ISSN
0302-9743
DOI
10.1007/978-3-319-47106-8_14
URI
http://hdl.handle.net/10203/313472
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 4 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0