TranSort: transformer for differentiable sorting미분가능한 정렬을 위한 트랜스포머

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 3
  • Download : 0
Differentiable Sorting Algorithm is used in end-to-end differentiable frameworks, enabling gradient-based optimization of models that involve sorting operations. The Differentiable Sorting Network, the most recent state-of-the-art Differentiable Sorting Algorithm, necessitates an equal gap between input scalars for accurate sorting. We consider the sorting operation as a seq2seq generation task, where the input sequence consists of unsorted scalars, and the output sequence represents the argsort result of the unsorted scalars. From that perspective, we present TranSort, a transformer architecture proposed as an alternative to Differentiable Sorting Algorithm. TranSort demonstrates stable sorting performance on various distribution of input scalars, distinguishing itself from Differentiable Sorting Network. Moreover, we present empirical evidence highlighting the enhanced performance of end-to-end learning tasks when utilizing TranSort compared to previous Differentiable Sorting Algorithms.
Advisors
김우창researcher
Description
한국과학기술원 :데이터사이언스대학원,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 데이터사이언스대학원, 2023.8,[iii, 20 p. :]

Keywords

미분가능한 정렬▼a트랜스포머▼a종단 간 학습▼a시퀀스-투-시퀀스 생성; Differentiable sorting▼aTransformer▼aEnd-to-end learning▼aseq2seq generation

URI
http://hdl.handle.net/10203/320560
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045748&flag=dissertation
Appears in Collection
IE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0