Reinforcement Learning with Action-Free Pre-Training from Videos

Cited 9 times in Web of Science; 0 times in Scopus.
Abstract
Recent unsupervised pre-training methods have been shown to be effective in language and vision domains, learning representations that are useful for multiple downstream tasks. In this paper, we investigate whether such unsupervised pre-training methods can also be effective for vision-based reinforcement learning (RL). To this end, we introduce a framework that learns representations useful for understanding dynamics via generative pre-training on videos. Our framework consists of two phases: we first pre-train an action-free latent video prediction model, and then utilize the pre-trained representations to efficiently learn action-conditional world models on unseen environments. To incorporate action inputs during fine-tuning, we introduce a new architecture that stacks an action-conditional latent prediction model on top of the pre-trained action-free prediction model. Moreover, for better exploration, we propose a video-based intrinsic bonus that leverages the pre-trained representations. We demonstrate that our framework significantly improves both the final performance and sample efficiency of vision-based RL on a variety of manipulation and locomotion tasks. Code is available at https://github.com/younggyoseo/apv.
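The two-phase design described in the abstract can be pictured with a short sketch. The following minimal PyTorch sketch is not the paper's implementation (APV builds on recurrent state-space world models); every name in it (ActionFreeLatentModel, ActionConditionalStack, knn_intrinsic_bonus, the dimensions, and the choice of k) is a hypothetical stand-in for illustration. It shows an action-free recurrent latent model whose hidden state feeds an action-conditional model stacked on top, plus a k-nearest-neighbor novelty score in the pre-trained representation space as one plausible form of a video-based intrinsic bonus.

    import torch
    import torch.nn as nn

    class ActionFreeLatentModel(nn.Module):
        """Phase 1: recurrent latent model pre-trained on videos, no action inputs.
        Linear layers stand in for the conv encoder/decoder of a real model."""
        def __init__(self, obs_dim, latent_dim, hidden_dim):
            super().__init__()
            self.encoder = nn.Linear(obs_dim, latent_dim)
            self.rnn = nn.GRUCell(latent_dim, hidden_dim)   # action-free transition
            self.decoder = nn.Linear(hidden_dim, obs_dim)   # frame-prediction head

        def step(self, obs, h):
            h = self.rnn(self.encoder(obs), h)
            return h, self.decoder(h)                       # new hidden state, predicted frame

    class ActionConditionalStack(nn.Module):
        """Phase 2: an action-conditional latent model stacked on top of the
        pre-trained action-free model, so actions can be injected without
        discarding the pre-trained dynamics representation."""
        def __init__(self, pretrained, action_dim, hidden_dim):
            super().__init__()
            self.lower = pretrained                         # initialized from video pre-training
            self.upper = nn.GRUCell(hidden_dim + action_dim, hidden_dim)

        def step(self, obs, action, h_lower, h_upper):
            h_lower, _ = self.lower.step(obs, h_lower)      # action-free representation
            h_upper = self.upper(torch.cat([h_lower, action], dim=-1), h_upper)
            return h_lower, h_upper

    def knn_intrinsic_bonus(reps, k=3):
        """Novelty bonus in representation space: log-distance to the k-th
        nearest neighbor within the batch (one plausible form of a
        video-representation-based exploration bonus; the paper's exact
        formulation may differ)."""
        dists = torch.cdist(reps, reps)                      # (B, B) pairwise distances
        kth = dists.topk(k + 1, largest=False).values[:, -1] # index 0 is the self-distance
        return torch.log(kth + 1.0)

    # Toy usage with flattened 64-dim "frames" and 4-dim actions.
    obs, act = torch.randn(8, 64), torch.randn(8, 4)
    lower = ActionFreeLatentModel(obs_dim=64, latent_dim=32, hidden_dim=128)
    model = ActionConditionalStack(lower, action_dim=4, hidden_dim=128)
    h_lo, h_up = torch.zeros(8, 128), torch.zeros(8, 128)
    h_lo, h_up = model.step(obs, act, h_lo, h_up)
    bonus = knn_intrinsic_bonus(h_lo)                        # per-sample intrinsic reward

The design choice the abstract motivates is visible in the stacking: action inputs enter only the upper model, so the lower, pre-trained action-free module can be reused across environments without architectural changes.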
Publisher
JMLR (Journal of Machine Learning Research)
Issue Date
2022-07-20
Language
English
Citation
39th International Conference on Machine Learning (ICML), pp. 19561-19579
ISSN
2640-3498
URI
http://hdl.handle.net/10203/306680
Appears in Collection
AI-Conference Papers (conference papers)
Files in This Item
There are no files associated with this item.
