Leveraging Large Language Models With Vocabulary Sharing For Sign Language Translation

DC Field | Value | Language
dc.contributor.author | LEE, HUIJE | ko
dc.contributor.author | Kim, Jung-Ho | ko
dc.contributor.author | Hwang, Euijun | ko
dc.contributor.author | Kim, Jaewoo | ko
dc.contributor.author | Park, Jong-Cheol | ko
dc.date.accessioned | 2023-11-14T03:01:46Z | -
dc.date.available | 2023-11-14T03:01:46Z | -
dc.date.created | 2023-11-13 | -
dc.date.issued | 2023-06-08 | -
dc.identifier.citation | 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023 | -
dc.identifier.uri | http://hdl.handle.net/10203/314596 | -
dc.description.abstract | Sign language translation (SLT) is a task that provides translation between spoken and sign languages used in the same country, which tend to show high lexical similarity but low syntactic similarity. The recent emergence of large language models (LLMs) has been remarkable for all downstream tasks in natural language processing, but they have yet to be applied to SLT. In this paper, we explore how to use an LLM with vocabulary sharing for two gloss-based SLT tasks (text-to-gloss (T2G) and gloss-to-text (G2T)) on the NIASL2021 dataset, which consists of 180,848 preprocessed Korean and Korean Sign Language (KSL) sentence pairs. The experimental results showed that Ko-GPT-Trinity-1.2B+VS, a GPT-3-based SLT model with vocabulary sharing, outperformed other SLT models, achieving BLEU-4 scores of 22.06 and 45.89 on T2G and G2T tasks, respectively. We expect that the adoption of an LLM with vocabulary sharing will significantly lessen the resource scarcity problem of SLT. | -
dc.language | English | -
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | -
dc.title | Leveraging Large Language Models With Vocabulary Sharing For Sign Language Translation | -
dc.type | Conference | -
dc.identifier.wosid | 001046933700148 | -
dc.identifier.scopusid | 2-s2.0-85168242667 | -
dc.type.rims | CONF | -
dc.citation.publicationname | 2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023 | -
dc.identifier.conferencecountry | GR | -
dc.identifier.conferencelocation | Rhodes Island | -
dc.identifier.doi | 10.1109/ICASSPW59220.2023.10193533 | -
dc.contributor.localauthor | Park, Jong-Cheol | -
dc.contributor.nonIdAuthor | Kim, Jung-Ho | -
dc.contributor.nonIdAuthor | Kim, Jaewoo | -
Appears in Collection
CS-Conference Papers(학술회의논문)
