Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 60,240 bioRxiv papers from 267,843 authors.

Mapping the unknown: The spatially correlated multi-armed bandit

By Charley M. Wu, Eric Schulz, Maarten Speekenbrink, Jonathan D. Nelson, Björn Meder

Posted 06 Feb 2017
bioRxiv DOI: 10.1101/106286

We introduce the spatially correlated multi-armed bandit as a task coupling function learning with the exploration-exploitation trade-off. Participants interacted with bi-variate reward functions on a two-dimensional grid, with the goal of either gaining the largest average score or finding the largest payoff. By providing an opportunity to learn the underlying reward function through spatial correlations, we model to what extent people form beliefs about unexplored payoffs and how that guides search behavior. Participants adapted to assigned payoff conditions, performed better in smooth than in rough environments, and--surprisingly--sometimes performed equally well in short as in long search horizons. Our modeling results indicate a preference for local search options, which when accounted for, still suggests participants were best-described as forming local inferences about unexplored regions, combined with a search strategy that directly traded off between exploiting high expected rewards and exploring to reduce uncertainty about the spatial structure of rewards.

Download data

  • Downloaded 1,029 times
  • Download rankings, all-time:
    • Site-wide: 6,541 out of 60,240
    • In animal behavior and cognition: 68 out of 946
  • Year to date:
    • Site-wide: 37,241 out of 60,240
  • Since beginning of last month:
    • Site-wide: 41,897 out of 60,240

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide

Sign up for the Rxivist weekly newsletter! (Click here for more details.)