Energy-Efficient Task Partitioning for CNN-based Object Detection in Heterogeneous Computing Environment

Cited 4 time in webofscience Cited 0 time in scopus
  • Hit : 188
  • Download : 0
Along with the high accuracy and the various use-cases of CNN, the number of services which are based on CNN continues to grow. Thanks to the development of GPU, a hardware accelerator for parallel processing, CNN have become powerful despite of its large amount of computations. In recent year, many studies have suggested using FPGA as a CNN accelerator due to its advantages over GPU. However, using these two accelerators together can greatly improve the processing performance because GPU and FPGA have complementary characteristics. Although there are some scheduling algorithms in the literature for the heterogeneous platform, they do not consider power efficiency and compliance with the deadline of an application at the same time. This paper found that the most power efficient accelerator is different for each sub-layers of CNN. It confirmed that task partitioning in the unit of sub-layers can improve the energy-efficiency of the system. Based on this finding this paper proposes an energy-efficient adaptive task partitioning scheme for CNN-based service. Experimental results show that the proposed scheduling consumes less energy than EDP method while satisfying the requested deadline of tasks.
Publisher
The korean institue of communications and information sciences (KICS)
Issue Date
2018-10-17
Language
English
Citation

9th International Conference on Information and Communication Technology Convergence (ICTC), pp.31 - 36

ISSN
2162-1233
DOI
10.1109/ICTC.2018.8539528
URI
http://hdl.handle.net/10203/247316
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 4 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0