Theoretical insights into mixup: perspectives on decision boundaries믹스업에 대한 이론적 통찰: 결정 경계의 관점으로

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 5
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisor윤철희-
dc.contributor.authorOh, Junsoo-
dc.contributor.author오준수-
dc.date.accessioned2024-07-30T19:30:41Z-
dc.date.available2024-07-30T19:30:41Z-
dc.date.issued2024-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1096073&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/321368-
dc.description학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2024.2,[iv, 63 p. :]-
dc.description.abstractperhaps surprisingly, the task of finding optimal decision boundaries becomes harder for more separable distributions. For Mixup training, we show that Mixup mitigates this problem by significantly reducing the sample complexity. To this end, we develop new concentration results applicable to $n^2$ pair-wise augmented data points constructed from $n$ independent data, by carefully dealing with dependencies between overlapping pairs. Lastly, we study other masking-based Mixup-style techniques and show that they can distort the training loss and make its minimizer converge to a suboptimal classifier in terms of test accuracy.-
dc.description.abstractWe investigate how pair-wise data augmentation techniques like Mixup affect the sample complexity of finding optimal decision boundaries in a binary linear classification problem. For a family of data distributions with a separability constant~$\kappa$, we analyze how well the optimal classifier in terms of training loss aligns with the optimal one in test accuracy (i.e., Bayes optimal classifier). For vanilla training without augmentation, we uncover an interesting phenomenon named the \emph{curse of separability}. As we increase $\kappa$ to make the data distribution more separable, the sample complexity of vanilla training increases exponentially in $\kappa$-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject믹스업▼a최적의 결정 경계▼a샘플 복잡도▼a이론▼a일반화-
dc.subjectMixup▼aOptimal decision boundary▼aSample complexity▼aTheory▼aGeneralization-
dc.titleTheoretical insights into mixup: perspectives on decision boundaries-
dc.title.alternative믹스업에 대한 이론적 통찰: 결정 경계의 관점으로-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :김재철AI대학원,-
dc.contributor.alternativeauthorYun, Chulhee-
Appears in Collection
AI-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0