Estimating the level of inference using an order-mimic agent

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 226
  • Download : 0
Multi-agent reinforcement learning (RL) considers problems of learning policies and predicting values through interactions with multiple opponents. To make the solutions feasible, one assumes single-type opponents. However, this may not hold in most real-world situations. Interactions with a mixture of different types of agents make it extremely hard to learn. This study examines the hypothesis that when the potential types of agents are unknown, the level of agent inference can act as a proxy for characterizing the opponents. We present a computational framework to estimate the level of agent's inference using a deterministic and stochastic order-mimic agent. We then propose a calibration method for unbiased estimation, which offsets the adverse effect of order-mimic agents on the environment's order estimation. Finally, to generalize the method to a wide range of contexts, we proposed iterative inference level estimation. We demonstrate the feasibility of the proposed method in computer simulations with agents mimicking agents' behavior with various inference levels. Our framework can estimate the learning capacity of various algorithms and humans; therefore it can be used to design high-level inference models that can effectively handle the complexity of multi-agent learning problems.
Publisher
Korea University Institute for Artificial Intelligence, Korea Brain Education Society
Issue Date
2021-11-10
Language
English
Citation

the 6th Asian Conference on Pattern Recognition (ACPR2021), pp.116 - 126

ISSN
0302-9743
DOI
10.1007/978-3-031-02444-3_9
URI
http://hdl.handle.net/10203/291737
Appears in Collection
BC-Conference Papers(학술대회논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0