DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Yun, Seyoung | - |
dc.contributor.advisor | 윤세영 | - |
dc.contributor.author | Oh, Jihwan | - |
dc.date.accessioned | 2023-06-22T19:31:18Z | - |
dc.date.available | 2023-06-22T19:31:18Z | - |
dc.date.issued | 2023 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1032332&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/308197 | - |
dc.description | Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : Kim Jaechul Graduate School of AI, 2023.2, [iv, 28 p.] | - |
dc.description.abstract | The multi-agent setting is intricate and unpredictable, since the behaviors of multiple agents influence one another. To address this environmental uncertainty, distributional reinforcement learning algorithms, which capture uncertainty through distributional outputs, have been integrated with multi-agent reinforcement learning methods, achieving state-of-the-art performance. However, distributional multi-agent reinforcement learning algorithms still rely on traditional ε-greedy exploration, which does not take cooperative strategy into account. In this paper, we present a risk-based exploration method that leads to collaboratively optimistic behavior by shifting the sampling region of the distribution. Initially, we take expectations over the upper quantiles of the state-action value distributions, which yields optimistic actions, and then gradually shift the sampling region toward the full distribution for exploitation. By exposing every agent to the same level of risk, we drive them to take cooperatively optimistic actions. By virtue of this risk property of quantile regression, our method shows remarkable performance in multi-agent settings that require cooperative exploration. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Distributional reinforcement learning; Exploration; Multi-agent learning; Risk; Uncertainty | - |
dc.subject | 분포강화학습; 탐색; 다중 에이전트 학습; 위험도; 불확실성 | - |
dc.title | Toward risk-based optimistic exploration for cooperative multi-agent reinforcement learning | - |
dc.title.alternative | 협력적 다중 에이전트 강화학습을 위한 위험도 기반의 낙천적 탐색방법 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 : 김재철AI대학원 | - |
dc.contributor.alternativeauthor | 오지환 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
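The abstract describes selecting actions by averaging only the upper quantiles of each state-action value distribution, then annealing the sampling region toward the full distribution. A minimal sketch of that idea follows; all names (`risk_level`, `optimistic_action`) and the linear annealing schedule are hypothetical illustrations, not the thesis's actual implementation.

```python
def risk_level(step, total_steps, start=0.5, end=0.0):
    """Linearly anneal the lower bound of the sampled quantile region
    from `start` (optimistic: upper half only) to `end` (full distribution).
    Sharing one risk level across agents keeps their optimism aligned."""
    frac = min(step / total_steps, 1.0)
    return start + (end - start) * frac

def optimistic_action(quantiles_per_action, lower):
    """Pick the action maximizing the mean of the quantiles above `lower`.

    quantiles_per_action: per-action lists of sorted quantile estimates
    lower: fraction of the distribution discarded from the bottom
    """
    scores = []
    for qs in quantiles_per_action:
        cut = int(len(qs) * lower)
        upper = qs[cut:] or qs[-1:]  # guard against an empty slice
        scores.append(sum(upper) / len(upper))
    return max(range(len(scores)), key=scores.__getitem__)

# Early in training (lower=0.5) a risky action with high upper quantiles
# is preferred; once the region covers the full distribution (lower=0.0)
# the choice falls back to the plain expected value.
qs = [[1.0, 1.0, 1.0, 1.0], [-3.0, -1.0, 2.0, 4.0]]
print(optimistic_action(qs, risk_level(0, 100)))    # optimistic phase
print(optimistic_action(qs, risk_level(100, 100)))  # full distribution
```

Under this sketch, the risky second action (mean 0.5 but upper-half mean 3.0) wins during the optimistic phase, while the safe first action (mean 1.0) wins once the full distribution is used.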