Observational and reinforcement pattern-learning: An exploratory study

Relations

This document is related to:
info:eu-repo/semantics/altIdentifier/doi/10.1016/j.euroecorev.2018.01.009

Collection

Open archives

Licence

info:eu-repo/semantics/OpenAccess




Cite this document

Nobuyuki Hanaki et al., "Observational and reinforcement pattern-learning: An exploratory study", HAL SHS (Sciences de l’Homme et de la Société), ID: 10.1016/j.euroecorev.2018.01.009


Abstract

Understanding how individuals learn in an unknown environment is an important problem in economics. We model and examine experimentally behavior in a very simple multi-armed bandit framework in which participants do not know the inter-temporal payoff structure. We propose a baseline reinforcement learning model that allows for pattern-recognition and change in the strategy space. We also analyse three augmented versions that accommodate observational learning from the actions and/or payoffs of another player. The models successfully reproduce the distributional properties of observed discovery times and total payoffs. Our study further shows that when one of the pair discovers the hidden pattern, observing another's actions and/or payoffs improves discovery time compared to the baseline case.
