Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 65,724 bioRxiv papers from 291,101 authors.

Prefrontal cortex as a meta-reinforcement learning system

By Jane X Wang, Zeb Kurth-Nelson, Dharshan Kumaran, Dhruva Tirumala, Hubert Soyer, Joel Z Leibo, Demis Hassabis, Matthew Botvinick

Posted 06 Apr 2018
bioRxiv DOI: 10.1101/295964 (published DOI: 10.1038/s41593-018-0147-8)

Over the past twenty years, neuroscience research on reward-based learning has converged on a canonical model, under which the neurotransmitter dopamine 'stamps in' associations between situations, actions and rewards by modulating the strength of synaptic connections between neurons. However, a growing number of recent findings have placed this standard model under strain. In the present work, we draw on recent advances in artificial intelligence to introduce a new theory of reward-based learning. Here, the dopamine system trains another part of the brain, the prefrontal cortex, to operate as its own free-standing learning system. This new perspective accommodates the findings that motivated the standard model, but also deals gracefully with a wider range of observations, providing a fresh foundation for future research.

Download data

  • Downloaded 28,100 times
  • Download rankings, all-time:
    • Site-wide: 16 out of 65,724
    • In neuroscience: 4 out of 11,770
  • Year to date:
    • Site-wide: 38 out of 65,724
  • Since beginning of last month:
    • Site-wide: 283 out of 65,724
