DSpace at KOASAS: Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models

DSpace at KOASAS

College of Natural Sciences(자연과학대학)Dept. of Mathematical Sciences(수리과학과)MA-Conference Papers(학술회의논문)

Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 278
Download : 0

Export

Lee, Cheolhyoung / Cho, Kyunghyun / Kang, Wanmo researcher

In natural language processing, it has been observed recently that generalization could be greatly improved by finetuning a large-scale language model pretrained on a large unlabeled corpus. Despite its recent success and wide adoption, finetuning a large pretrained language model on a downstream task is prone to degenerate performance when there are only a small number of training instances available. In this paper, we introduce a new regularization technique, to which we refer as “mixout”, motivated by dropout. Mixout stochastically mixes the parameters of two models. We show that our mixout technique regularizes learning to minimize the deviation from one of the two models and that the strength of regularization adapts along the optimization trajectory. We empirically evaluate the proposed mixout and its variants on finetuning a pretrained language model on downstream tasks. More specifically, we demonstrate that the stability of finetuning and the average accuracy greatly increase when we use the proposed approach to regularize finetuning of BERT on downstream tasks in GLUE.

Publisher: International Conference on Learning Representations

Issue Date: 2020-04-30

Language: English

Citation: International Conference on Learning Representations (ICLR)

URI: http://hdl.handle.net/10203/286240

Appears in Collection: MA-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models

KOASAS

Communities & Collections