Continually updating generative retrieval on dynamic corpora동적 환경에서의 생성 검색 방법론

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 2
  • Download : 0
Generative retrieval has recently gained significant amount of recognition by the research community for its simplicity, high performance, and the ability to fully leverage the power of autoregressive models. However, the majority of prior work on generative retrieval does not consider realistic applications where temporal knowledge is accumulated over time. In this paper, we present DynamicGR, a parameter-efficient continual pre-training method that integrates dynamically changing corpora into the generative retrieval model. We conduct a comprehensive evaluation of the performance and efficiency of generative retrieval models against strong bi-encoder baselines on the StreamingQA benchmark. With the DynamicGR pretraining strategy, we demonstrate a promising performance in generative retrieval, showing an improvement of 7% over conventional parameter-efficient updates with low-rank adaptation (LoRA) on attention parameters. Furthermore, we show that DynamicGR can make generative retrieval as competitive as the bi-encoder approaches when considering performance and efficiency in dynamic scenarios. Our work will be open-sourced.
Advisors
서민준researcher
Description
한국과학기술원 :김재철AI대학원,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2023.8,[iv, 20 p. :]

Keywords

정보 검색▼a생성 기반 검색▼a시간적 정보 검색▼a지식 업데이트▼a효율적인 사전 학습▼a점진적 학습▼a동적 말뭉치▼a검색 효율성; Information retrieval▼aGenerative retrieval▼aTemporal information retrieval▼aKnowledge update▼aParameter-efficient pre-training▼aIncremental learning▼aDynamic corpora▼aSearch efficiency

URI
http://hdl.handle.net/10203/320520
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045708&flag=dissertation
Appears in Collection
AI-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0