Synergy with Translation Artifacts for Training and Inference in Multilingual Tasks

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 213
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorOh, Jaehoonko
dc.contributor.authorKo, Jongwooko
dc.contributor.authorYun, Seyoungko
dc.date.accessioned2022-12-14T07:00:17Z-
dc.date.available2022-12-14T07:00:17Z-
dc.date.created2022-12-03-
dc.date.created2022-12-03-
dc.date.issued2022-12-09-
dc.identifier.citationConference on Empirical Methods in Natural Language Processing, EMNLP 2022, pp.6747 - 6754-
dc.identifier.urihttp://hdl.handle.net/10203/302993-
dc.description.abstractTranslation has played a crucial role in improving the performance on multilingual tasks: (1) to generate the target language data from the source language data for training and (2) to generate the source language data from the target language data for inference. However, prior works have not considered the use of both translations simultaneously. This paper shows that combining them can synergize the results on various multilingual sentence classification tasks. We empirically find that translation artifacts stylized by translators are the main factor of the performance gain. Based on this analysis, we adopt two training methods, SupCon and MixUp, considering translation artifacts. Furthermore, we propose a cross-lingual fine-tuning algorithm called MUSC, which uses SupCon and MixUp jointly and improves the performance. Our code is available at https://github.com/jongwooko/MUSC.-
dc.languageEnglish-
dc.publisherEMNLP-
dc.titleSynergy with Translation Artifacts for Training and Inference in Multilingual Tasks-
dc.typeConference-
dc.type.rimsCONF-
dc.citation.beginningpage6747-
dc.citation.endingpage6754-
dc.citation.publicationnameConference on Empirical Methods in Natural Language Processing, EMNLP 2022-
dc.identifier.conferencecountryAR-
dc.identifier.conferencelocationAbu Dhabi-
dc.contributor.localauthorYun, Seyoung-
Appears in Collection
AI-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0