DSpace at KOASAS: Alphago for belief space planning

DSpace at KOASAS

College of Engineering(공과대학)Kim Jaechul Graduate School of AI(김재철AI대학원)AI-Theses_Master(석사논문)

Alphago for belief space planning알파고의 직관을 이용한 믿음 공간 계획법

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 168
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Kim, Beomjoon	-
dc.contributor.advisor	김범준	-
dc.contributor.author	Ahn, Jiyong	-
dc.date.accessioned	2023-06-22T19:31:11Z	-
dc.date.available	2023-06-22T19:31:11Z	-
dc.date.issued	2023	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1032324&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/308178	-
dc.description	학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2023.2,[iii, 18 p. :]	-
dc.description.abstract	In robotics, the ability to make decisions in an environment that includes uncertainty is essential. The robot should infer the current state through a series of actions and observations and plan to achieve the goal based on them. This problem can be modeled as Partially Observable Markov Decision Process (POMDP). However, high-dimensional state, action, and observation spaces in long horizon make a POMDP problem more complicated. A large number of simulations are needed to make a plan for this complex POMDP problem using previous online planning algorithms which do not guide a planner. In this paper, we learn policy network and value network by imitating prior experience data generated from simulations. Then use these networks to guide online planning algorithm to resolve complex POMDP problems more efficiently. We model the Light-Dark Room domain, one of localization problem in robotics, as a continuous POMDP. Our guided planning algorithm achieves higher success rates in this problem with less number of simulations.	-
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.subject	Partially observable Markov decision process(POMDP)▼aOnline planning▼aImitaion learning	-
dc.subject	Partially observable Markov decision process(POMDP)▼a온라인 계획법▼a모방 학습	-
dc.title	Alphago for belief space planning	-
dc.title.alternative	알파고의 직관을 이용한 믿음 공간 계획법	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :김재철AI대학원,	-
dc.contributor.alternativeauthor	안지용	-

Appears in Collection: AI-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Alphago for belief space planning알파고의 직관을 이용한 믿음 공간 계획법

KOASAS

Communities & Collections