Trajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 48
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorSong, Minhakko
dc.contributor.authorYun, Chulheeko
dc.date.accessioned2024-02-04T15:00:21Z-
dc.date.available2024-02-04T15:00:21Z-
dc.date.created2024-02-04-
dc.date.issued2023-12-13-
dc.identifier.citation37th Annual Conference on Neural Information Processing Systems-
dc.identifier.urihttp://hdl.handle.net/10203/317998-
dc.description.abstractCohen et al. (2021) empirically study the evolution of the largest eigenvalue of the loss Hessian, also known as sharpness, along the gradient descent (GD) trajectory and observe a phenomenon called the Edge of Stability (EoS). The sharpness increases at the early phase of training (referred to as progressive sharpening), and eventually saturates close to the threshold of 2/(step size). In this paper, we start by demonstrating through empirical studies that when the EoS phenomenon occurs, different GD trajectories (after a proper reparameterization) align on a specific bifurcation diagram independent of initialization. We then rigorously prove this trajectory alignment phenomenon for a two-layer fully-connected linear network and a single-neuron nonlinear network trained with a single data point. Our trajectory alignment analysis establishes both progressive sharpening and EoS phenomena, encompassing and extending recent findings in the literature.-
dc.languageEnglish-
dc.publisherNeural Information Processing Systems-
dc.titleTrajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory-
dc.typeConference-
dc.type.rimsCONF-
dc.citation.publicationname37th Annual Conference on Neural Information Processing Systems-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationNew Orleans, LA-
dc.contributor.localauthorYun, Chulhee-
dc.contributor.nonIdAuthorSong, Minhak-
Appears in Collection
AI-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0