
Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 52,519 bioRxiv papers from 243,473 authors.

Generalization guides human exploration in vast decision spaces

By Charley M. Wu, Eric Schulz, Maarten Speekenbrink, Jonathan D. Nelson, Björn Meder

Posted 01 Aug 2017
bioRxiv DOI: 10.1101/171371 (published DOI: 10.1038/s41562-018-0467-4)

From foraging for food to learning complex games, many aspects of human behaviour can be framed as a search problem with a vast space of possible actions. Under finite search horizons, optimal solutions are generally unobtainable. Yet how do humans navigate vast problem spaces, which require intelligent exploration of unobserved actions? Using a variety of bandit tasks with up to 121 arms, we study how humans search for rewards under limited search horizons, where the spatial correlation of rewards (in both generated and natural environments) provides traction for generalization. Across a variety of different probabilistic and heuristic models, we find evidence that Gaussian Process function learning, combined with an optimistic Upper Confidence Bound sampling strategy, provides a robust account of how people use generalization to guide search. Our modelling results and parameter estimates are recoverable, and can be used to simulate human-like performance, providing novel insights about human behaviour in complex environments.
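To make the combination named in the abstract concrete, the following is a minimal sketch of Gaussian Process regression paired with Upper Confidence Bound sampling on a 121-arm bandit. It is an illustration under stated assumptions, not the authors' implementation: the 11 x 11 grid layout, the RBF kernel, the values of `length_scale`, `noise_var`, `beta` and `horizon`, the single-peak toy reward surface, and all function names are hypothetical choices made for this example.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=2.0):
    """RBF similarity between every row of a and every row of b (assumed kernel)."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-d2 / (2.0 * length_scale ** 2))


def gp_posterior(x_obs, y_obs, x_all, length_scale=2.0, noise_var=1e-2):
    """Posterior mean and standard deviation at x_all under a zero-mean GP prior."""
    k_oo = rbf_kernel(x_obs, x_obs, length_scale) + noise_var * np.eye(len(x_obs))
    k_ao = rbf_kernel(x_all, x_obs, length_scale)
    mean = k_ao @ np.linalg.solve(k_oo, y_obs)
    # Prior variance is 1 for the RBF kernel; subtract what the data explains.
    var = 1.0 - np.einsum("ij,ji->i", k_ao, np.linalg.solve(k_oo, k_ao.T))
    return mean, np.sqrt(np.clip(var, 0.0, None))


def ucb_choice(mean, sd, beta=0.5):
    """Pick the arm with the highest optimistic value: mean + beta * sd."""
    return int(np.argmax(mean + beta * sd))


def run_search(rewards, arms, horizon=20, beta=0.5, seed=0):
    """Search for `horizon` trials, starting from one randomly chosen arm."""
    rng = np.random.default_rng(seed)
    chosen = [int(rng.integers(len(arms)))]
    observed = [float(rewards[chosen[0]])]
    for _ in range(horizon - 1):
        mean, sd = gp_posterior(arms[chosen], np.array(observed), arms)
        nxt = ucb_choice(mean, sd, beta)
        chosen.append(nxt)
        observed.append(float(rewards[nxt]))
    return chosen, observed


# Toy environment (assumption): 121 arms on an 11 x 11 grid with one smooth
# reward peak, so that nearby arms have similar rewards.
side = 11
arms = np.array([(i, j) for i in range(side) for j in range(side)], dtype=float)
peak = np.array([8.0, 3.0])  # hypothetical peak location
rewards = np.exp(-((arms - peak) ** 2).sum(axis=1) / (2.0 * 3.0 ** 2))

chosen, observed = run_search(rewards, arms)
```

Because the kernel ties each arm's expected reward to that of its neighbours, a single observation updates beliefs about many unobserved arms, which is the sense in which generalization can guide search. In this sketch a larger `beta` weights uncertainty more heavily and so favours exploring unobserved regions.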

Download data

  • Downloaded 1,772 times
  • Download rankings, all-time:
    • Site-wide: 2,263 out of 52,519
    • In animal behavior and cognition: 21 out of 816
  • Year to date:
    • Site-wide: 4,004 out of 52,519
  • Since beginning of last month:
    • Site-wide: 4,772 out of 52,519

