Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T. Issue 15 (21st July 2022)
- Record Type:
- Journal Article
- Title:
- Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T. Issue 15 (21st July 2022)
- Main Title:
- Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T
- Authors:
- Colas, Jaron T.
Dundon, Neil M.
Gerraty, Raphael T.
Saragosa‐Harris, Natalie M.
Szymula, Karol P.
Tanwisuth, Koranis
Tyszka, J. Michael
van Geen, Camilla
Ju, Harang
Toga, Arthur W.
Gold, Joshua I.
Bassett, Dani S.
Hartley, Catherine A.
Shohamy, Daphna
Grafton, Scott T.
O'Doherty, John P. - Abstract:
- Abstract: The model‐free algorithms of "reinforcement learning" (RL) have gained clout across disciplines, but so too have model‐based alternatives. The present study emphasizes other dimensions of this model space in consideration of associative or discriminative generalization across states and actions. This "generalized reinforcement learning" (GRL) model, a frugal extension of RL, parsimoniously retains the single reward‐prediction error (RPE), but the scope of learning goes beyond the experienced state and action. Instead, the generalized RPE is efficiently relayed for bidirectional counterfactual updating of value estimates for other representations. Aided by structural information but as an implicit rather than explicit cognitive map, GRL provided the most precise account of human behavior and individual differences in a reversal‐learning task with hierarchical structure that encouraged inverse generalization across both states and actions. Reflecting inference that could be true, false (i.e., overgeneralization), or absent (i.e., undergeneralization), state generalization distinguished those who learned well more so than action generalization. With high‐resolution high‐field fMRI targeting the dopaminergic midbrain, the GRL model's RPE signals (alongside value and decision signals) were localized within not only the striatum but also the substantia nigra and the ventral tegmental area, including specific effects of generalization that also extend to the hippocampus.Abstract: The model‐free algorithms of "reinforcement learning" (RL) have gained clout across disciplines, but so too have model‐based alternatives. The present study emphasizes other dimensions of this model space in consideration of associative or discriminative generalization across states and actions. This "generalized reinforcement learning" (GRL) model, a frugal extension of RL, parsimoniously retains the single reward‐prediction error (RPE), but the scope of learning goes beyond the experienced state and action. Instead, the generalized RPE is efficiently relayed for bidirectional counterfactual updating of value estimates for other representations. Aided by structural information but as an implicit rather than explicit cognitive map, GRL provided the most precise account of human behavior and individual differences in a reversal‐learning task with hierarchical structure that encouraged inverse generalization across both states and actions. Reflecting inference that could be true, false (i.e., overgeneralization), or absent (i.e., undergeneralization), state generalization distinguished those who learned well more so than action generalization. With high‐resolution high‐field fMRI targeting the dopaminergic midbrain, the GRL model's RPE signals (alongside value and decision signals) were localized within not only the striatum but also the substantia nigra and the ventral tegmental area, including specific effects of generalization that also extend to the hippocampus. Factoring in generalization as a multidimensional process in value‐based learning, these findings shed light on complexities that, while challenging classic RL, can still be resolved within the bounds of its core computations. Abstract : This "generalized reinforcement learning" (GRL) model, a frugal extension of RL, parsimoniously retains the single reward‐prediction error (RPE), but the scope of learning goes beyond the experienced state and action. Aided by structural information but as an implicit rather than explicit cognitive map, GRL provided the most precise account of human behavior and individual differences in a reversal‐learning task with hierarchical structure that encouraged inverse generalization across both states and actions. With high‐resolution high‐field fMRI targeting the dopaminergic midbrain, the GRL model's RPE signals (alongside value and decision signals) were localized within not only the striatum but also the substantia nigra and the ventral tegmental area, including specific effects of generalization that also extend to the hippocampus. … (more)
- Is Part Of:
- Human brain mapping. Volume 43:Issue 15(2022)
- Journal:
- Human brain mapping
- Issue:
- Volume 43:Issue 15(2022)
- Issue Display:
- Volume 43, Issue 15 (2022)
- Year:
- 2022
- Volume:
- 43
- Issue:
- 15
- Issue Sort Value:
- 2022-0043-0015-0000
- Page Start:
- 4750
- Page End:
- 4790
- Publication Date:
- 2022-07-21
- Subjects:
- cognitive map -- counterfactual learning -- dopaminergic midbrain -- generalization -- hippocampus -- individual differences -- model‐free and model‐based -- multifield fMRI -- reinforcement learning -- striatum
Brain mapping -- Periodicals
611.81 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1097-0193 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/hbm.25988 ↗
- Languages:
- English
- ISSNs:
- 1065-9471
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4336.031000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 23935.xml