Performance of heterogeneous computing with graphics processing unit and many integrated core for hartree potential calculations on a numerical grid

Cited 8 time in webofscience Cited 0 time in scopus
  • Hit : 561
  • Download : 0
We investigated the performance of heterogeneous computing with graphics processing units (GPUs) and many integrated core (MIC) with 20 CPU cores (20xCPU). As a practical example toward large scale electronic structure calculations using grid-based methods, we evaluated the Hartree potentials of silver nanoparticles with various sizes (3.1, 3.7, 4.9, 6.1, and 6.9 nm) via a direct integral method supported by the sinc basis set. The so-called work stealing scheduler was used for efficient heterogeneous computing via the balanced dynamic distribution of workloads between all processors on a given architecture without any prior information on their individual performances. 20xCPU+1GPU was up to approximate to 1.5 and approximate to 3.1 times faster than 1GPU and 20xCPU, respectively. 20xCPU+2GPU was approximate to 4.3 times faster than 20xCPU. The performance enhancement by CPU+MIC was considerably lower than expected because of the large initialization overhead of MIC, although its theoretical performance is similar with that of CPU+GPU. (c) 2016 Wiley Periodicals, Inc.
Publisher
WILEY-BLACKWELL
Issue Date
2016-09
Language
English
Article Type
Article
Keywords

DENSITY-FUNCTIONAL THEORY; MULTIRESOLUTION QUANTUM-CHEMISTRY; ELECTRONIC-STRUCTURE CALCULATIONS; COUPLED-CLUSTER; KOHN-SHAM; GPU; FOCK; APPROXIMATION; ACCELERATION; DYNAMICS

Citation

JOURNAL OF COMPUTATIONAL CHEMISTRY, v.37, no.24, pp.2193 - 2201

ISSN
0192-8651
DOI
10.1002/jcc.24443
URI
http://hdl.handle.net/10203/213224
Appears in Collection
CH-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 8 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0