Monte Carlo Tree Search in Continuous Spaces Using Voronoi Optimistic Optimization with Regret Bounds

| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Kim, Beomjoon | ko |
| dc.contributor.author | Lee, Kyungjae | ko |
| dc.contributor.author | Lim, Sungbin | ko |
| dc.contributor.author | Kaelbling, Leslie | ko |
| dc.contributor.author | Lozano-Perez, Tomas | ko |
| dc.date.accessioned | 2021-02-04T05:50:23Z | - |
| dc.date.available | 2021-02-04T05:50:23Z | - |
| dc.date.created | 2021-02-04 | - |
| dc.date.issued | 2020-02 | - |
| dc.identifier.citation | The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), pp.9916 - 9924 | - |
| dc.identifier.uri | http://hdl.handle.net/10203/280568 | - |
| dc.description.abstract | Many important applications, including robotics, data-center management, and process control, require planning action sequences in domains with continuous state and action spaces and discontinuous objective functions. Monte Carlo tree search (MCTS) is an effective strategy for planning in discrete action spaces. We provide a novel MCTS algorithm (VOOT) for deterministic environments with continuous action spaces, which, in turn, is based on a novel black-box function-optimization algorithm (VOO) to efficiently sample actions. The VOO algorithm uses Voronoi partitioning to guide sampling, and is particularly efficient in high-dimensional spaces. The VOOT algorithm has an instance of VOO at each node in the tree. We provide regret bounds for both algorithms and demonstrate their empirical effectiveness in several high-dimensional problems including two difficult robotics planning problems. | - |
| dc.language | English | - |
| dc.publisher | Association for the Advancement of Artificial Intelligence (AAAI) | - |
| dc.title | Monte Carlo Tree Search in Continuous Spaces Using Voronoi Optimistic Optimization with Regret Bounds | - |
| dc.type | Conference | - |
| dc.type.rims | CONF | - |
| dc.citation.beginningpage | 9916 | - |
| dc.citation.endingpage | 9924 | - |
| dc.citation.publicationname | The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20) | - |
| dc.identifier.conferencecountry | US | - |
| dc.identifier.conferencelocation | Hilton New York Midtown | - |
| dc.identifier.doi | 10.1609/aaai.v34i06.6546 | - |
| dc.contributor.localauthor | Kim, Beomjoon | - |
| dc.contributor.nonIdAuthor | Lee, Kyungjae | - |
| dc.contributor.nonIdAuthor | Lim, Sungbin | - |
| dc.contributor.nonIdAuthor | Kaelbling, Leslie | - |
| dc.contributor.nonIdAuthor | Lozano-Perez, Tomas | - |

Appears in Collection
RIMS Conference Papers
