Reinforcement with Fading Memories

DC Field | Value | Language
dc.contributor.author | Xu, Kuang | ko
dc.contributor.author | Yun, Se-Young | ko
dc.date.accessioned | 2020-03-19T02:37:00Z | -
dc.date.available | 2020-03-19T02:37:00Z | -
dc.date.created | 2019-12-03 | -
dc.date.issued | 2018-06-12 | -
dc.identifier.citation | ACM SIGMETRICS, pp. 90-92 | -
dc.identifier.uri | http://hdl.handle.net/10203/272758 | -
dc.description.abstract | We study the effect of imperfect memory on decision making in the context of a stochastic sequential action-reward problem. An agent chooses a sequence of actions which generate discrete rewards at different rates. She is allowed to make new choices at rate β, while past rewards disappear from her memory at rate μ. We focus on a family of decision rules where the agent makes a new choice by randomly selecting an action with a probability approximately proportional to the amount of past rewards associated with each action in her memory. We provide closed-form formulae for the agent's steady-state choice distribution in the regime where the memory span is large (μ -> 0), and show that the agent's success critically depends on how quickly she updates her choices relative to the speed of memory decay. If β >> μ, the agent almost always chooses the best action, i.e., the one with the highest reward rate. Conversely, if β << μ, the agent chooses an action with a probability roughly proportional to its reward rate. | -
dc.language | English | -
dc.publisher | Association for Computing Machinery (ACM) | -
dc.title | Reinforcement with Fading Memories | -
dc.type | Conference | -
dc.identifier.scopusid | 2-s2.0-85052024617 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 90 | -
dc.citation.endingpage | 92 | -
dc.citation.publicationname | ACM SIGMETRICS | -
dc.identifier.conferencecountry | US | -
dc.identifier.conferencelocation | Beckman Center, Irvine | -
dc.identifier.doi | 10.1145/3292040.3219653 | -
dc.contributor.localauthor | Yun, Se-Young | -
dc.contributor.nonIdAuthor | Xu, Kuang | -
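
The decision rule described in the abstract can be illustrated with a small continuous-time simulation. The sketch below is not the authors' code; it is one possible reading of the abstract under several assumptions: only the currently selected action generates rewards (as a Poisson process at its reward rate), each remembered reward fades independently at rate μ, and a hypothetical smoothing constant `alpha` is added to each action's memory count so that an action with no remembered rewards can still be chosen (one way to make the rule "approximately" proportional). The function name `simulate` and all parameter names are illustrative.

```python
import random


def simulate(reward_rates, beta, mu, horizon, alpha=1.0, seed=0):
    """Return the fraction of time spent on each action up to `horizon`.

    Assumptions (not taken from the paper): Poisson rewards from the current
    action only, independent per-reward forgetting at rate `mu`, and choice
    weights equal to (remembered rewards + alpha).
    """
    rng = random.Random(seed)
    k = len(reward_rates)
    memory = [0] * k        # rewards currently remembered, per action
    time_on = [0.0] * k     # total time each action has been selected
    current = rng.randrange(k)
    t = 0.0
    while True:
        total_mem = sum(memory)
        r_reward = reward_rates[current]  # current action yields a reward
        r_choice = beta                   # agent makes a new choice
        r_forget = mu * total_mem         # some remembered reward fades
        total = r_reward + r_choice + r_forget
        dt = rng.expovariate(total)       # time until the next event
        if t + dt >= horizon:
            time_on[current] += horizon - t
            break
        time_on[current] += dt
        t += dt
        u = rng.random() * total
        if u < r_reward:
            memory[current] += 1
        elif u < r_reward + r_choice:
            # choose proportionally to remembered rewards, smoothed by alpha
            weights = [m + alpha for m in memory]
            current = rng.choices(range(k), weights=weights)[0]
        elif total_mem > 0:
            # each remembered reward is equally likely to be the one forgotten
            forgotten = rng.choices(range(k), weights=memory)[0]
            memory[forgotten] -= 1
    return [x / horizon for x in time_on]


if __name__ == "__main__":
    rates = [1.0, 2.0]
    # Fast updating relative to memory decay (beta >> mu)
    print(simulate(rates, beta=1.0, mu=0.01, horizon=2000.0))
    # Slow updating relative to memory decay (beta << mu)
    print(simulate(rates, beta=0.01, mu=1.0, horizon=2000.0))
```

Running the two calls with different ratios of `beta` to `mu` gives a rough, finite-horizon feel for the two regimes the abstract contrasts; the closed-form steady-state results in the paper hold in the limit μ -> 0 and are not reproduced by this sketch.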
Appears in Collection
RIMS Conference Papers
