Magnitude and angle dynamics in training single ReLU neurons

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 16
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLee, Sangminko
dc.contributor.authorSim, Byeongsuko
dc.contributor.authorYe, Jong Chulko
dc.date.accessioned2024-08-29T02:00:09Z-
dc.date.available2024-08-29T02:00:09Z-
dc.date.created2024-07-25-
dc.date.issued2024-10-
dc.identifier.citationNEURAL NETWORKS, v.178-
dc.identifier.issn0893-6080-
dc.identifier.urihttp://hdl.handle.net/10203/322448-
dc.description.abstractUnderstanding the training dynamics of deep ReLU networks is a significant area of interest in deep learning. However, there remains a lack of complete elucidation regarding the weight vector dynamics, even for single ReLU neurons. To bridge this gap, our study delves into the training dynamics of the gradient flow w(t) for single ReLU neurons under the square loss, dissecting it into its magnitude ||w(t)|| and angle phi(t) components. Through this decomposition, we establish upper and lower bounds on these components to elucidate the convergence dynamics. Furthermore, we demonstrate the empirical extension of our findings to general two-layer multi-neuron networks. All theoretical results are generalized to the gradient descent method and rigorously verified through experiments.-
dc.languageEnglish-
dc.publisherPERGAMON-ELSEVIER SCIENCE LTD-
dc.titleMagnitude and angle dynamics in training single ReLU neurons-
dc.typeArticle-
dc.identifier.wosid001266414300001-
dc.identifier.scopusid2-s2.0-85197494452-
dc.type.rimsART-
dc.citation.volume178-
dc.citation.publicationnameNEURAL NETWORKS-
dc.identifier.doi10.1016/j.neunet.2024.106435-
dc.contributor.localauthorYe, Jong Chul-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorSingle ReLU neurons-
dc.subject.keywordAuthorMagnitude and angle dynamics-
dc.subject.keywordAuthorGradient flow-
Appears in Collection
AI-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0