When models matter: Environmental demand guides the arbitration between model-based and model-free control

  • Leslie K. Held
  • , Elise Lesage
  • , Wouter Kool
  • , Senne Braem

Research output: Contribution to journalArticlepeer-review

Abstract

As humans, we often repeat previously rewarded actions without thinking, but we also possess the ability to plan ahead and simulate actions based on an internal model of the environment. These two types of control are commonly conceptualized as model-free versus model-based control. While there is a body of research on interindividual differences in using either strategy, we aimed to test whether people can learn to regulate which strategy to use based on environmental demand. We used a two-stage decision-making task where participants tracked the drifting rewards associated with two second-stage states. Each trial started with one of two possible first-stage states, each offering two choices that deterministically led to one of the second-stage states. Successful generalization between first-stage options indicated model-based control, while mere repetition of previously rewarded choices reflected model-free behavior. We manipulated how often participants (n = 140) were exposed to alternations versus repetitions of first-stage states. When these states frequently repeat, there is a reduced need to consult the transition structure, because it pays off to adopt model-free control and simply retake previously rewarded actions. Conversely, when first-stage states frequently alternate, it is more beneficial to adopt model-based control, considering the transition structure and generalizing reward outcomes between them. In line with our hypothesis, we show that participants exposed to more first-stage state alternations were more model-based in a test phase than participants exposed to more first-stage state repetitions. These findings suggest that people learn to arbitrate between different reinforcement-learning strategies consistent with a cost–benefit analysis sensitive to environmental demands.

Original languageEnglish
JournalCognitive, Affective and Behavioral Neuroscience
DOIs
StateAccepted/In press - 2025

Keywords

  • Dual-system RL
  • Model-based
  • Model-free
  • Reinforcement learning
  • Two-step task

Fingerprint

Dive into the research topics of 'When models matter: Environmental demand guides the arbitration between model-based and model-free control'. Together they form a unique fingerprint.

Cite this