Specific initialization method for endomorphism-like operators in residual networksResidual network들의 자기 사상 (endomorphism) 과 유사한 연산자들의 초기화 방법에 대한 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 252
  • Download : 0
Residual neural networks have changed the way we approach machine learning, and deep learning specifically, giving a new scope of training, and allowing deeper neural networks to exist. However, whereas their contribution are large, there are still many unknowns on the reason why networks designed this way perform so well. On this paper, we provide a set of analysis of residual networks in opposition to their non residual counterpart, arguing that one of the strength of this type of architecture is to give a nice initialization space that allows the network to efficiently train. This analysis leads us toward a new type of initialization that specifically target endomorphism-type layers, which we explain how to select on different network architectures. We give results of zero and low variance initialization for those targeted layer in many different training setting. We then provide a set of analysis of the possible better understanding of neural networks this approach could give, using a recently introduced similarity measure designed for neural networks post activation comparison, Central Kernel Alignment. Using this similarity measure, we conclude that one of the major merits of this approach is helping de-correlating targeted layers during training.
Advisors
Yun, Se-Youngresearcher윤세영researcher
Description
한국과학기술원 :지식서비스공학대학원,
Publisher
한국과학기술원
Issue Date
2020
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 지식서비스공학대학원, 2020.8,[v, 37 p. :]

Keywords

Residual Neural Networks▼aConvolutional Neural Networks▼aLoss Space Analysis▼aNeural Networks initialization scheme▼aCentral Kernel Alignment; 잔류 신경망▼a콘볼루션 신경망▼a손실 공간 분석▼a신경망 초기화 체계▼a중앙 커널 정렬

URI
http://hdl.handle.net/10203/284911
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=925064&flag=dissertation
Appears in Collection
KSE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0