TY - JOUR
T1 - Stable Representations of Decision Variables for Flexible Behavior
AU - Bari, Bilal A.
AU - Grossman, Cooper D.
AU - Lubin, Emily E.
AU - Rajagopalan, Adithya E.
AU - Cressy, Jianna I.
AU - Cohen, Jeremiah Y.
N1 - Publisher Copyright:
© 2019 Elsevier Inc.
PY - 2019/9/4
Y1 - 2019/9/4
N2 - Decisions occur in dynamic environments. In the framework of reinforcement learning, the probability of performing an action is influenced by decision variables. Discrepancies between predicted and obtained rewards (reward prediction errors) update these variables, but they are otherwise stable between decisions. Although reward prediction errors have been mapped to midbrain dopamine neurons, it is unclear how the brain represents decision variables themselves. We trained mice on a dynamic foraging task in which they chose between alternatives that delivered reward with changing probabilities. Neurons in the medial prefrontal cortex, including projections to the dorsomedial striatum, maintained persistent firing rate changes over long timescales. These changes stably represented relative action values (to bias choices) and total action values (to bias response times) with slow decay. In contrast, decision variables were weakly represented in the anterolateral motor cortex, a region necessary for generating choices. Thus, we define a stable neural mechanism to drive flexible behavior. Flexible behavior requires a memory of previous interactions with the environment. The medial prefrontal cortex persistently represents value-based decision variables, bridging the time between choices. These decision variables are sent to the dorsomedial striatum to bias action selection.
AB - Decisions occur in dynamic environments. In the framework of reinforcement learning, the probability of performing an action is influenced by decision variables. Discrepancies between predicted and obtained rewards (reward prediction errors) update these variables, but they are otherwise stable between decisions. Although reward prediction errors have been mapped to midbrain dopamine neurons, it is unclear how the brain represents decision variables themselves. We trained mice on a dynamic foraging task in which they chose between alternatives that delivered reward with changing probabilities. Neurons in the medial prefrontal cortex, including projections to the dorsomedial striatum, maintained persistent firing rate changes over long timescales. These changes stably represented relative action values (to bias choices) and total action values (to bias response times) with slow decay. In contrast, decision variables were weakly represented in the anterolateral motor cortex, a region necessary for generating choices. Thus, we define a stable neural mechanism to drive flexible behavior. Flexible behavior requires a memory of previous interactions with the environment. The medial prefrontal cortex persistently represents value-based decision variables, bridging the time between choices. These decision variables are sent to the dorsomedial striatum to bias action selection.
UR - http://www.scopus.com/inward/record.url?scp=85070739650&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85070739650&partnerID=8YFLogxK
U2 - 10.1016/j.neuron.2019.06.001
DO - 10.1016/j.neuron.2019.06.001
M3 - Article
C2 - 31280924
AN - SCOPUS:85070739650
SN - 0896-6273
VL - 103
SP - 922-933.e7
JO - Neuron
JF - Neuron
IS - 5
ER -