Decision threshold learning in the Basal Ganglia for multiple alternatives

Griffith, Thom, Baker, Sophie-Anne and Lepora, Nathan F. 2025. Decision threshold learning in the Basal Ganglia for multiple alternatives. Neural Computation 37 (7) , pp. 1256-1287. 10.1162/neco_a_01760

[thumbnail of Neural_Computation_June_2025__Neurips_template_for_openaccess_ (1).pdf]

Preview

PDF - Accepted Post-Print Version
Download (5MB) | Preview

Official URL: https://doi.org/10.1162/neco_a_01760

Abstract

In recent years, researchers have integrated the historically separate, reinforcement learning (RL), and evidence-accumulation-to-bound approaches to decision modeling. A particular outcome of these efforts has been the RL-DDM, a model that combines value learning through reinforcement with a diffusion decision model (DDM). While the RL-DDM is a conceptually elegant extension of the original DDM, it faces a similar problem to the DDM in that it does not scale well to decisions with more than two options. Furthermore, in its current form, the RL-DDM lacks flexibility when it comes to adapting to rapid, context-cued changes in the reward environment. The question of how to best extend combined RL and DDM models so they can handle multiple choices remains open. Moreover, it is currently unclear how these algorithmic solutions should map to neurophysical processes in the brain, particularly in relation to so-called go/no-go-type models of decision making in the basal ganglia. Here, we propose a solution that addresses these issues by combining a previously proposed decision model based on the multichoice sequential probability ratio test (MSPRT), with a dual-pathway model of decision threshold learning in the basal ganglia region of the brain. Our model learns decision thresholds to optimize the trade-off between time cost and the cost of errors and so efficiently allocates the amount of time for decision deliberation. In addition, the model is context dependent and hence flexible to changes to the speed-accuracy trade-off (SAT) in the environment. Furthermore, the model reproduces the magnitude effect, a phenomenon seen experimentally in value-based decisions and is agnostic to the types of evidence and so can be used on perceptual decisions, value-based decisions, and other types of modeled evidence. The broader significance of the model is that it contributes to the active research area of how learning systems interact by linking the previously separate models of RL-DDM to dopaminergic models of motivation and risk taking in the basal ganglia, as well as scaling to multiple alternatives.

Item Type:	Article
Date Type:	Publication
Status:	Published
Schools:	Schools > Psychology
Publisher:	Massachusetts Institute of Technology Press
ISSN:	0899-7667
Date of First Compliant Deposit:	17 June 2025
Date of Acceptance:	9 February 2025
Last Modified:	02 Jul 2025 13:30
URI:	https://orca.cardiff.ac.uk/id/eprint/178799

Actions (repository staff only)

Edit Item

Dimensions

Altmetric

Download Statistics

Downloads

Downloads per month over past year

View more statistics

CORE (COnnecting REpositories)