Tree search in partially observable markovian decision process with preference learning for object manipulation물체 조작을 위한 연속 공간 부분 관측 마르코프 결정 프로세스 및 선호 학습을 이용한 트리 탐색

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 1
  • Download : 0
To be deployed in a household environment to assist a human, a robot must be able to make decisions under partial observability. A robot can cope with the uncertainty of its observation by making sensing actions such as clearing an object in front of a shelf that occludes the target behind and estimating its shape. Partially Observable Markovian Decision Process (POMDP) is the principled framework that enables the robot to perform such information-gathering actions. However, robot manipulation domains involve high-dimensional and continuous observation and action spaces, yet most POMDP solvers have been limited to discrete spaces. Recently, POMCPOW, a continuous POMDP solver that resorts to sampling and progressive widening, has been proposed, yet it is too slow to be practical in robot manipulation problems that incorporate camera observations and multiple objects. To facilitate a more efficient search, we propose a framework that learns to guide the search from past planning experience. Our method specifically adopts preference learning that can leverage both success and failure trajectories to learn effective heuristics for search even with a small number of past experiences. We demonstrate the efficacy of our framework in several continuous partially observable robotics domains, such as light-dark room domain and real-world robot manipulation.
Advisors
김범준researcher
Description
한국과학기술원 :김재철AI대학원,
Publisher
한국과학기술원
Issue Date
2024
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2024.2,[iii, 23 p. :]

Keywords

부분 관측 마르코프 결정 프로세스▼a선호학습▼a물체 조작; POMDP▼aPOMCPOW▼aPreference learning▼aObject manipulation

URI
http://hdl.handle.net/10203/321376
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1096081&flag=dissertation
Appears in Collection
AI-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0