Selective Data Augmentation for Improving the Performance of Offline Reinforcement Learning

DC Field | Value | Language
dc.contributor.author | Han, Jungwoo | ko
dc.contributor.author | Kim, Jinwhan | ko
dc.date.accessioned | 2023-02-10T02:03:17Z | -
dc.date.available | 2023-02-10T02:03:17Z | -
dc.date.created | 2023-02-09 | -
dc.date.issued | 2022-11 | -
dc.identifier.citation | 22nd International Conference on Control, Automation and Systems, ICCAS 2022, pp. 222-226 | -
dc.identifier.issn | 1598-7833 | -
dc.identifier.uri | http://hdl.handle.net/10203/305141 | -
dc.description.abstract | This study proposes a new data augmentation technique for offline reinforcement learning (RL). Rather than randomly choosing data points to carry out the data augmentation, our methodology selectively chooses data from sparse subspaces of the dataset to effectively augment the data region that is insufficient in the original dataset. For the augmentation, the subspaces of the dataset would be represented in the latent space created by the variational autoencoder (VAE). Data is then sampled from the latent space and converted back to the original space by using the decoder of the VAE so that the augmented data can be added to the original dataset. By using the VAE, virtual data that does not severely deviate from the original data could be generated because the VAE creates new data points by using the latent space that captures the original data distribution. We evaluate the performance of our methodology using several offline RL datasets generated from OpenAI Gym benchmark control simulations which mainly use state-based inputs. | -
dc.language | English | -
dc.publisher | IEEE Computer Society | -
dc.title | Selective Data Augmentation for Improving the Performance of Offline Reinforcement Learning | -
dc.type | Conference | -
dc.identifier.wosid | 000927498500042 | -
dc.identifier.scopusid | 2-s2.0-85146552146 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 222 | -
dc.citation.endingpage | 226 | -
dc.citation.publicationname | 22nd International Conference on Control, Automation and Systems, ICCAS 2022 | -
dc.identifier.conferencecountry | KO | -
dc.identifier.conferencelocation | BEXCO, Busan | -
dc.identifier.doi | 10.23919/ICCAS55662.2022.10003747 | -
dc.contributor.localauthor | Kim, Jinwhan | -
dc.contributor.nonIdAuthor | Han, Jungwoo | -
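
The pipeline outlined in the abstract (encode the dataset, find sparse regions of the latent space, sample there, decode, append) can be illustrated with a rough sketch. This is not the authors' implementation: the record gives no architecture, sparsity measure, or hyperparameters. To stay self-contained, a linear PCA projection stands in for the trained VAE encoder and decoder, sparsity is approximated by the mean distance to the k nearest latent neighbours, and every name and parameter (`knn_sparsity`, `selective_augment`, `sparse_frac`, `noise`) is hypothetical.

```python
import numpy as np


def knn_sparsity(z, k=5):
    """Mean distance to the k nearest neighbours; larger means sparser."""
    d = np.linalg.norm(z[:, None, :] - z[None, :, :], axis=-1)
    d.sort(axis=1)  # column 0 is each point's zero distance to itself
    return d[:, 1:k + 1].mean(axis=1)


def selective_augment(data, latent_dim=2, n_new=50, k=5,
                      sparse_frac=0.05, noise=0.1, seed=0):
    """Append n_new synthetic rows generated around sparse latent regions.

    ASSUMPTION: a PCA projection replaces the VAE described in the abstract.
    """
    rng = np.random.default_rng(seed)
    mean = data.mean(axis=0)

    # Linear stand-in for the VAE encoder/decoder pair.
    _, _, vt = np.linalg.svd(data - mean, full_matrices=False)
    basis = vt[:latent_dim]  # shape (latent_dim, data_dim)
    z = (data - mean) @ basis.T  # "encode"

    # Select the sparsest fraction of points in the latent space.
    score = knn_sparsity(z, k)
    n_sparse = max(1, int(sparse_frac * len(z)))
    sparse_idx = np.argsort(score)[-n_sparse:]

    # Sample near the selected latent codes, then "decode" back.
    picks = rng.choice(sparse_idx, size=n_new)
    jitter = noise * z.std(axis=0) * rng.standard_normal((n_new, latent_dim))
    new_rows = (z[picks] + jitter) @ basis + mean

    return np.vstack([data, new_rows]), sparse_idx


# Toy dataset: a dense cluster (rows 0..199) and a sparse one (rows 200..219).
rng = np.random.default_rng(1)
dense = rng.normal(0.0, 0.05, size=(200, 4))
sparse = rng.normal(3.0, 1.0, size=(20, 4))
dataset = np.vstack([dense, sparse])

augmented, sparse_idx = selective_augment(dataset)
print(augmented.shape)
```

In an offline RL setting each row would presumably be a flattened transition, which is again an assumption here. Replacing the PCA projection with a trained VAE would change only the encode and decode steps; the selection logic would stay the same.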
Appears in Collection
ME-Conference Papers (Academic Conference Papers)