DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 이동환 | - |
dc.contributor.author | Park, Kihong | - |
dc.contributor.author | 박기홍 | - |
dc.date.accessioned | 2024-07-30T19:31:27Z | - |
dc.date.available | 2024-07-30T19:31:27Z | - |
dc.date.issued | 2024 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097165&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/321593 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[iii, 20 p. :] | - |
dc.description.abstract | With the recent advancements in deep neural networks, reinforcement learning has demonstrated remarkable performance in various fields such as games, language models, and robotics. However, currently prevalent reinforcement learning algorithms employ the target network to address the double sampling issue, which necessitates an additional Q-network and delays the update. In this thesis, we tackle the aforementioned problem by training the dynamics model instead of using the target network, aiming to resolve the double sampling issue. Specifically, our approach modified deep Q-network by sampling another independent next state from the learned dynamics model and introducing a new loss function that takes into account the double sampling issue. With the proposed method, we aim to optimize the Q-network through a more precise gradient closer to the true gradient of mean squared Bellman error. In experiments, the proposed algorithm robustly achieved higher undiscounted returns and predicted action-values more stably compared to deep Q-network. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | 강화학습▼a모델기반 강화학습▼a심층 큐 네트워크▼a이중 샘플링 문제 | - |
dc.subject | Reinforcement learning▼aModel-based reinforcement learning▼aDeep Q-network▼aDouble sampling issue | - |
dc.title | Addressing double sampling issue by learning dynamics model | - |
dc.title.alternative | 모델 학습을 통한 이중 샘플링 문제 해소 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전기및전자공학부, | - |
dc.contributor.alternativeauthor | Lee, Donghwan | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.