Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues

Jang, Youngsoo / Lee, Jongmin / Kim, Kee-Eung

We consider a strategic dialogue task in which the ability to infer the other agent's goal is critical to the success of the conversational agent. Although this problem can be naturally formulated as Bayesian planning, it is known to be very difficult because its enormous search space consists of all possible utterances. In this paper, we propose an efficient Bayes-adaptive planning algorithm for goal-oriented dialogues that combines RNN-based dialogue generation with Bayesian planning based on Monte-Carlo tree search (MCTS) in a novel way, leading to robust decision-making under uncertainty about the other agent's goal. We then introduce reinforcement learning for the dialogue agent that uses MCTS as a strong policy improvement operator, casting reinforcement learning as an iterative alternation of planning and supervised learning on self-generated dialogues. In the experiments, we demonstrate that our Bayes-adaptive dialogue planning agent significantly outperforms the state of the art in a negotiation dialogue domain. We also show that reinforcement learning via MCTS further improves end-task performance without diverging from human language.
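
The idea of planning under uncertainty about the other agent's goal can be illustrated with a toy sketch. This is not the authors' algorithm: it replaces tree search with flat Monte-Carlo evaluation using root sampling, uses no RNN, and every name, goal label, and payoff below (GOALS, update_belief, simulate, plan, the acceptance probabilities) is a hypothetical assumption made for illustration only.

```python
import random
from collections import defaultdict

# Hypothetical hidden goals the partner might have (not from the paper).
GOALS = ["wants_books", "wants_hats"]

def update_belief(belief, likelihoods):
    """Bayes rule: posterior(g) is proportional to prior(g) * P(observation | g)."""
    unnorm = {g: belief[g] * likelihoods[g] for g in belief}
    total = sum(unnorm.values())
    return {g: p / total for g, p in unnorm.items()}

def simulate(action, goal, rng):
    """Toy rollout: return 1.0 if the partner accepts the offer, else 0.0."""
    # Assumed payoffs: an offer matching the partner's goal is accepted more often.
    wanted_item = goal.split("_")[1]
    accept_prob = 0.9 if action == "offer_" + wanted_item else 0.2
    return 1.0 if rng.random() < accept_prob else 0.0

def plan(belief, actions, n_sims=2000, seed=0):
    """Root sampling: draw a goal from the belief in each simulation,
    then pick the action with the highest average simulated return."""
    rng = random.Random(seed)
    totals = defaultdict(float)
    counts = defaultdict(int)
    goals = list(belief)
    weights = [belief[g] for g in goals]
    for _ in range(n_sims):
        goal = rng.choices(goals, weights=weights)[0]
        action = rng.choice(actions)
        totals[action] += simulate(action, goal, rng)
        counts[action] += 1
    return max(actions, key=lambda a: totals[a] / max(counts[a], 1))

# Start from a uniform prior, observe an utterance that is four times
# more likely under "wants_books", then plan against the posterior.
prior = {g: 1.0 / len(GOALS) for g in GOALS}
posterior = update_belief(prior, {"wants_books": 0.8, "wants_hats": 0.2})
best_action = plan(posterior, ["offer_books", "offer_hats"])
```

In this sketch the belief over goals is updated once from an observed utterance and the planner averages over goals sampled from that belief, so the chosen action hedges against the goals that remain plausible. A full Bayes-adaptive tree search would additionally branch on future utterances and belief updates.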

Publisher: NeurIPS Workshop on Conversational AI (ConvAI)

Issue Date: 2019-12-14

Language: English

Citation: NeurIPS Workshop on Conversational AI (ConvAI)

URI: http://hdl.handle.net/10203/270556

Appears in Collection: RIMS Conference Papers

Files in This Item: There are no files associated with this item.
