SGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 27
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorCho, Hanseulko
dc.contributor.authorYun, Chulheeko
dc.date.accessioned2023-12-07T23:00:31Z-
dc.date.available2023-12-07T23:00:31Z-
dc.date.created2023-12-07-
dc.date.issued2023-05-03-
dc.identifier.citation11th International Conference on Learning Representations, ICLR 2023-
dc.identifier.urihttp://hdl.handle.net/10203/316024-
dc.description.abstractStochastic gradient descent-ascent (SGDA) is one of the main workhorses for solving finite-sum minimax optimization problems. Most practical implementations of SGDA randomly reshuffle components and sequentially use them (i.e., without-replacement sampling); however, there are few theoretical results on this approach for minimax algorithms, especially outside the easier-to-analyze (strongly-)monotone setups. To narrow this gap, we study the convergence bounds of SGDA with random reshuffling (SGDA-RR) for smooth nonconvex-nonconcave objectives with Polyak-{\L}ojasiewicz (P{\L}) geometry. We analyze both simultaneous and alternating SGDA-RR for nonconvex-P{\L} and primal-P{\L}-P{\L} objectives, and obtain convergence rates faster than with-replacement SGDA. Our rates extend to mini-batch SGDA-RR, recovering known rates for full-batch gradient descent-ascent (GDA). Lastly, we present a comprehensive lower bound for GDA with an arbitrary step-size ratio, which matches the full-batch upper bound for the primal-P{\L}-P{\L} case.-
dc.languageEnglish-
dc.publisherInternational Conference on Learning Representations (ICLR)-
dc.titleSGDA with shuffling: faster convergence for nonconvex-PŁ minimax optimization-
dc.typeConference-
dc.type.rimsCONF-
dc.citation.publicationname11th International Conference on Learning Representations, ICLR 2023-
dc.identifier.conferencecountryRW-
dc.identifier.conferencelocationKigali-
dc.contributor.localauthorYun, Chulhee-
Appears in Collection
AI-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0