Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation

Cited 5 time in webofscience Cited 0 time in scopus
  • Hit : 163
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKang, Minkiko
dc.contributor.authorHan, Moonsuko
dc.contributor.authorHwang, Sung Juko
dc.date.accessioned2021-01-28T06:07:11Z-
dc.date.available2021-01-28T06:07:11Z-
dc.date.created2020-12-03-
dc.date.created2020-12-03-
dc.date.created2020-12-03-
dc.date.issued2020-11-16-
dc.identifier.citationConference on Empirical Methods in Natural Language Processing (EMNLP), pp.6102 - 6120-
dc.identifier.urihttp://hdl.handle.net/10203/280128-
dc.description.abstractWe propose a method to automatically generate a domain- and task-adaptive maskings of the given text for self-supervised pre-training, such that we can effectively adapt the language model to a particular target task (e.g. question answering). Specifically, we present a novel reinforcement learning-based framework which learns the masking policy, such that using the generated masks for further pre-training of the target language model helps improve task performance on unseen texts. We use off-policy actor-critic with entropy regularization and experience replay for reinforcement learning, and propose a Transformer-based policy network that can consider the relative importance of words in a given text. We validate our Neural Mask Generator (NMG) on several question answering and text classification datasets using BERT and DistilBERT as the language models, on which it outperforms rule-based masking strategies, by automatically learning optimal adaptive maskings.-
dc.languageEnglish-
dc.publisherACL-
dc.titleNeural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation-
dc.typeConference-
dc.identifier.wosid000855160706026-
dc.identifier.scopusid2-s2.0-85101667429-
dc.type.rimsCONF-
dc.citation.beginningpage6102-
dc.citation.endingpage6120-
dc.citation.publicationnameConference on Empirical Methods in Natural Language Processing (EMNLP)-
dc.identifier.conferencecountryIT-
dc.identifier.conferencelocationVirtual-
dc.contributor.localauthorHwang, Sung Ju-
dc.contributor.nonIdAuthorKang, Minki-
Appears in Collection
AI-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 5 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0