Reinforcement Learning with Action-Free Pre-Training from Videos

Cited 9 time in webofscience Cited 0 time in scopus
  • Hit : 230
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorSeo, Younggyoko
dc.contributor.authorLee, Kiminko
dc.contributor.authorJames, Stephenko
dc.contributor.authorAbbeel, Pieterko
dc.date.accessioned2023-05-10T10:00:35Z-
dc.date.available2023-05-10T10:00:35Z-
dc.date.created2023-05-03-
dc.date.created2023-05-03-
dc.date.issued2022-07-20-
dc.identifier.citation38th International Conference on Machine Learning (ICML), pp.19561 - 19579-
dc.identifier.issn2640-3498-
dc.identifier.urihttp://hdl.handle.net/10203/306680-
dc.description.abstractRecent unsupervised pre-training methods have shown to be effective on language and vision domains by learning useful representations for multiple downstream tasks. In this paper, we investigate if such unsupervised pre-training methods can also be effective for vision-based reinforcement learning (RL). To this end, we introduce a framework that learns representations useful for understanding the dynamics via generative pre-training on videos. Our framework consists of two phases: we pre-train an action-free latent video prediction model, and then utilize the pre-trained representations for efficiently learning action-conditional world models on unseen environments. To incorporate additional action inputs during fine-tuning, we introduce a new architecture that stacks an action-conditional latent prediction model on top of the pre-trained action-free prediction model. Moreover, for better exploration, we propose a video-based intrinsic bonus that leverages pre-trained representations. We demonstrate that our framework significantly improves both final performances and sample-efficiency of vision-based RL in a variety of manipulation and locomotion tasks. Code is available at https://github.com/younggyoseo/apv.-
dc.languageEnglish-
dc.publisherJMLR-JOURNAL MACHINE LEARNING RESEARCH-
dc.titleReinforcement Learning with Action-Free Pre-Training from Videos-
dc.typeConference-
dc.identifier.wosid000900130200026-
dc.type.rimsCONF-
dc.citation.beginningpage19561-
dc.citation.endingpage19579-
dc.citation.publicationname38th International Conference on Machine Learning (ICML)-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationBaltimore, MD-
dc.contributor.localauthorLee, Kimin-
dc.contributor.nonIdAuthorJames, Stephen-
dc.contributor.nonIdAuthorAbbeel, Pieter-
Appears in Collection
AI-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 9 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0