Layer-wise hint-based training for knowledge transfer in a teacher-student framework

We devise a layer-wise hint training method that improves the existing hint-based knowledge distillation (KD) approach used for knowledge transfer in a teacher-student framework built on residual networks (ResNets). The proposed method first trains the student ResNet iteratively, incrementally transferring hint-based information from several hint layers of the pretrained teacher ResNet to the corresponding guided layers of the student. Next, conventional softening-factor-based KD training is performed using the previously estimated hint-based information. We compare the recognition accuracy of the proposed approach with that of KD training without hints, hint-based KD training, and ResNet-based layer-wise pretraining on widely used benchmark datasets: CIFAR-10, CIFAR-100, and MNIST. With the selected multiple pieces of hint-based information and their layer-wise transfer, the student ResNet trained by the proposed method reflects the pretrained teacher ResNet's rich information more accurately than those trained by the baseline methods, on all the benchmark datasets considered in this study.
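
The two-stage procedure described above, layer-wise hint training followed by softening-factor-based KD, pairs a feature-matching loss with a temperature-softened distillation loss. The following is a minimal PyTorch sketch of those two loss terms, not the paper's implementation; the regressor module, temperature T, and balancing weight alpha are illustrative assumptions.

```python
import torch.nn.functional as F

# --- Stage 1: layer-wise hint training (FitNets-style) ---
# A small regressor maps the student's guided-layer features onto the
# shape of the teacher's hint-layer features; the student is trained to
# minimize the feature-matching error, one hint/guided pair at a time.
def hint_loss(student_feat, teacher_feat, regressor):
    return F.mse_loss(regressor(student_feat), teacher_feat)

# --- Stage 2: softening-factor-based KD (Hinton-style) ---
# Temperature T softens both logit distributions; alpha balances the
# distillation term against the ground-truth cross-entropy term.
def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.9):
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # scale by T^2 to keep gradient magnitudes comparable
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

Under the abstract's description, Stage 1 would minimize hint_loss for each hint/guided layer pair in turn, and Stage 2 would then fine-tune the whole student with kd_loss.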
Publisher
WILEY
Issue Date
2019-04
Language
English
Article Type
Article
Citation
ETRI JOURNAL, v.41, no.2, pp. 242-253
ISSN
1225-6463
DOI
10.4218/etrij.2018-0152
URI
http://hdl.handle.net/10203/262120
Appears in Collection
EE-Journal Papers (Journal Papers)
Files in This Item
000466377100010.pdf (939.32 kB)