DSpace at KOASAS: Leveraging Large Language Models With Vocabulary Sharing For Sign Language Translation

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Conference Papers(학술회의논문)

Leveraging Large Language Models With Vocabulary Sharing For Sign Language Translation

Cited 0 time in webofscience

Cited 0 time in

Hit : 49
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	LEE, HUIJE	ko
dc.contributor.author	Kim, Jung-Ho	ko
dc.contributor.author	Hwang, Euijun	ko
dc.contributor.author	Kim, Jaewoo	ko
dc.contributor.author	Park, Jong-Cheol	ko
dc.date.accessioned	2023-11-14T03:01:46Z	-
dc.date.available	2023-11-14T03:01:46Z	-
dc.date.created	2023-11-13	-
dc.date.issued	2023-06-08	-
dc.identifier.citation	2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023	-
dc.identifier.uri	http://hdl.handle.net/10203/314596	-
dc.description.abstract	Sign language translation (SLT) is a task that provides translation between spoken and sign languages used in the same country, which tend to show high lexical similarity but low syntactic similarity. The recent emergence of large language models (LLMs) has been remarkable for all downstream tasks in natural language processing, but they have yet to be applied to SLT. In this paper, we explore how to use an LLM with vocabulary sharing for two gloss-based SLT tasks (text-to-gloss (T2G) and gloss-to-text (G2T)) on the NIASL2021 dataset, which consists of 180,848 preprocessed Korean and Korean Sign Language (KSL) sentence pairs. The experimental results showed that Ko-GPT-Trinity-1.2B+VS, a GPT-3-based SLT model with vocabulary sharing, outperformed other SLT models, achieving BLEU-4 scores of 22.06 and 45.89 on T2G and G2T tasks, respectively. We expect that the adoption of an LLM with vocabulary sharing will significantly lessen the resource scarcity problem of SLT.	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Leveraging Large Language Models With Vocabulary Sharing For Sign Language Translation	-
dc.type	Conference	-
dc.identifier.wosid	001046933700148	-
dc.identifier.scopusid	2-s2.0-85168242667	-
dc.type.rims	CONF	-
dc.citation.publicationname	2023 IEEE International Conference on Acoustics, Speech and Signal Processing Workshops, ICASSPW 2023	-
dc.identifier.conferencecountry	GR	-
dc.identifier.conferencelocation	Rhodes Island	-
dc.identifier.doi	10.1109/ICASSPW59220.2023.10193533	-
dc.contributor.localauthor	Park, Jong-Cheol	-
dc.contributor.nonIdAuthor	Kim, Jung-Ho	-
dc.contributor.nonIdAuthor	Kim, Jaewoo	-

Appears in Collection: CS-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Leveraging Large Language Models With Vocabulary Sharing For Sign Language Translation

KOASAS

Communities & Collections