ArticlePDF Available

Reward salience but not spatial attention dominates the value representation in the orbitofrontal cortex

Authors:

Abstract and Figures

The orbitofrontal cortex (OFC) encodes value and plays a key role in value-based decision-making. However, the attentional modulation of the OFC’s value encoding is poorly understood. We trained two monkeys to detect a luminance change at a cued location between a pair of visual stimuli, which were over-trained pictures associated with different amounts of juice reward and, thus, different reward salience. Both the monkeys’ behavior and the dorsolateral prefrontal cortex neuronal activities indicated that the monkeys actively directed their spatial attention toward the cued stimulus during the task. However, the OFC’s neuronal responses were dominated by the stimulus with higher reward salience and encoded its value. The value of the less salient stimulus was only weakly represented regardless of spatial attention. The results demonstrate that reward and spatial attention are distinctly represented in the prefrontal cortex and the OFC maintains a stable representation of reward salience minimally affected by attention.
Behavioral tasks and monkeys’ performance a Monkey D had to detect when any of the two stimuli changed its luminance. The location of the luminance change was indicated by a cue (90% valid) that appeared on the side opposite to the luminance change. When the cue was invalid (10% of trials), the luminance change occurred on the same side of the cue. The monkey reported the luminance change by making an eye movement to the eye-movement target located above the fixation point. The stimuli were associated with different amounts of juice reward (inset). Random pairings of stimuli were selected for each trial. The stimulus with a larger associated reward is referred to as the salient stimulus, and the stimulus where attention should be direct to is the cued stimulus. The computer randomly selected one stimulus from the pair and delivered its associated reward for a correct response. b The proportion of correct responses when the cue was valid and invalid and when the luminance change occurred at the salient and the non-salient stimulus. c The reaction times when the cue was valid and invalid and when the luminance change occurred at the salient and the non-salient stimulus. d Monkey G was required to detect the luminance change at the target location, indicated with the cue frame (target trials, 80%), and ignore the change at the un-cued location (distractor trials, 20%). In half of the distractor trials, another luminance change happened at the cued location after the distractor changed its luminance (distractor+target trials, 10%). The stimuli were associated with different amounts of juice reward (inset). e The proportion of correct responses in the three trial types and when the luminance change occurred at the salient and the non-salient stimulus. f The reaction time of detecting the target and the distractor luminance change and when the luminance change occurred at the salient and the non-salient stimulus. The error bars are SEM across sessions (monkey D: n = 71; monkey G: n = 100), * denotes p < 0.01, ** denotes p < 0.001, two-tailed Wilcoxon signed-rank test.
… 
DLPFC encoded spatial attention a The responses of an example DLPFC neuron. The red line indicates trials when the cue was on the right side and the blue line indicates trials when the cue was on the left side. The shaded areas around each line denote SEM across trials. b The posterior probability of the attention location decoded from DLPFC pseudo-population ensemble activities. Both the training and the testing of the LDA decoder used sliding windows of 25 ms at 10 ms steps. c The posterior probability of the attention location from the decoder that was trained and tested with the DLPFC neuronal responses at the same time (the diagonal line in b). Significance was assessed with two-tailed paired t-tests (p < 0.01 with FDR corrections for multiple comparisons), tested against the data with shuffled cue location labels. The black solid trace indicates the posterior probability calculated from the actual data. The black dashed trace indicates the posterior probability calculated from the shuffled data. The black segments at the top indicate when the actual data performed significantly better than the shuffled data. Thin gray lines represent SEM across trials. d. Same as b except that the decoder was trained with the mean activities at 50–200 ms before the stimulus onset. The black segments at the bottom indicate when the performance was significantly lower than the control, indicating the flip of attention to the opposite side. Significance was assessed with two-tailed paired t-tests (p < 0.01 with FDR corrections for multiple comparisons). Thin gray lines represent SEM across trials. e The posterior probability of the attention location decoded from the fast (black line) and the slow trials (gray line). The trials were divided by the median reaction time in each neuron. Thin gray lines represent SEM across trials. Significance was assessed with two-tailed paired t-tests (fast versus slow, at p < 0.01 with FDR corrections for multiple comparisons).
… 
Attentional modulation of the OFC responses a The responses of an example OFC neuron in trials when the same rewards were associated with the two stimuli, aligned to the stimulus onset (left) and the luminance change (right). The reward sizes are indicated by different shades of gray. Shading areas indicate SEM across trials. b The population response of the positively tuned OFC neurons (n = 73) in trials when the same rewards were associated with the two stimuli. Shading areas indicate SEM across neurons. c Same as b, but for the negatively tuned OFC neurons (n = 53). d The responses of the example neuron with the trials grouped by SV. The reward sizes are indicated by different shades of yellow. Shading areas indicate SEM across trials. e The responses of the positively tuned OFC neurons with the trials grouped by SV. The reward sizes are indicated by different shades of yellow. Shading areas indicate SEM across neurons. f Same as (d), but for the negatively tuned OFC neurons. g The average responses of the example neuron during the period between the stimulus onset and the luminance change. The black trace is based on the trials with the same-reward stimulus pairs. The yellow trace is based on the trials grouped by SV. The error bars indicate SEM across trials. Two-way ANOVA (group: F1413 = 0.79, p = 0.37; value: F4413 = 40.40, p ≪ 0.001). h The average responses of the positively tuned OFC neurons during the period between the stimulus onset and the luminance change. The black trace is based on the trials with the same-reward stimulus pairs. The yellow trace is based on the trials grouped by SV. The error bars indicate SEM across neurons. Two-way ANOVA (group: F1724 = 3.22, p = 0.07; value: F4724 = 77.71, p ≪ 0.001). i Same as (h), but for the negatively tuned OFC neurons. Two-way ANOVA (group: F1524 = 1.97, p = 0.16; value: F4524 = 46.03, p ≪ 0.001). j Same as g, but trials are grouped by CV (red) and NCV (purple). Two-way ANOVA (group: F1690 = 0, p = 0.99; value: F4690 = 21.04,p ≪ 0.001). k Same as h, but trials are grouped by CV (red) and NCV (purple). Two-way ANOVA (group: F1724 = 0.16, p = 0.68; value: F4724 = 104.56, p ≪ 0.001). l Same as i, but trials are grouped by CV (red) and NCV (purple). Two-way ANOVA (group: F1524 = 0.01, p = 0.92; value: F4524 = 88.54, p ≪ 0.001).
… 
OFC neuronal responses dominantly encoded the salient value a OFC neuron (n = 349) firing rates were regressed against attention cue location, SV, and NSV. Plotted is the time course of the population average coefficients of partial determination (CPD). Significance was assessed with two-tailed paired t-tests (p < 0.005, with FDR corrections for multiple comparisons) compared to a baseline computed with the CPD between 0 and 200 ms before the cue onset averaged across different regressors. The blue, yellow, and green bars at the top indicate the significant CPDs of cue location, SV, and NSV, respectively, and the black bar indicates significant differences between SV and NSV. The error bars indicate SEM across neurons. b The CPDs of SV against the CPDs of NSV for individual value-selective OFC neurons (n = 118). The yellow data points are the neurons with non-zero coefficients for SV only, the green data points are the neurons with non-zero coefficients for NSV only, and the black data points are the neurons that encoded both SV and NSV (one-sample t-test, p < 0.05, without multiple comparisons). Two-tailed paired t-test was conducted to compare the mean CPDs of SV and NSV of all the OFC neurons (n = 349). c Top: the distribution of the CPDs for SV of the value-selective OFC neurons (n = 118). Bottom: the distribution of the OFC neurons’ CPDs for NSV. Vertical dashed lines indicate the mean. Filled bars indicate significant neurons. d Same as a, but for CV and UCV (n = 349). e Same as b, but for CV and UCV (n = 127). f Top: the distribution of the CPDs for CV of the value-selective OFC neurons (n = 127). Bottom: the distribution of the OFC neurons’ CPDs for UCV. Vertical dashed lines indicate the means. Filled bars indicate significant neurons.
… 
This content is subject to copyright. Terms and conditions apply.
Article https://doi.org/10.1038/s41467-022-34084-0
Reward salience but not spatial attention
dominates the value representation
in the orbitofrontal cortex
Wenyi Zhang
1,2,3
,YangXie
1,2,3
& Tianming Yang
1
The orbitofrontal cortex (OFC) encodes value and plays a key role in value-
based decision-making. However, the attentional modulation of the OFCs
value encoding is poorly understood. We trained two monkeys to detect a
luminance change at a cued location between a pair of visual stimuli, which
were over-trained pictures associated with different amounts of juice reward
and, thus, different reward salience. Both the monkeysbehavior and the
dorsolateral prefrontal cortex neuronal activities indicated that the monkeys
actively directed their spatial attention toward the cued stimulus during the
task. However, the OFCs neuronal responses were dominated by the stimulus
with higher reward salience and encoded its value. The value of the less salient
stimulus was only weakly represented regardless of spatial attention. The
results demonstrate that reward and spatial attention are distinctly repre-
sented in the prefrontal cortex and the OFC maintains a stable representation
of reward salience minimally affected by attention.
Imagine that you are in a bookstore trying to nd an ideal gift for your
young child. While you direct your search in the childrens book sec-
tion, an eye-catching poster of your favorite writer in the bestseller
section would divert your attention and lead you to check out the
books over there. While task demands or behavior context often lead
to internally generated attention, commonly referred to as top-down
attention, salient stimuli, either based on physical or reward salience,
may capture attention via a bottom-up mechanism16. Attention from
top-down and bottom-up sources may be integrated or compete
against each other in the brain and affect decision making.
Previous studies suggested that attention may affect value-based
decision making by modifying choice optionssubjective value7,8.
Moreover, attention, both in the overt form with gaze shifts and in the
covert form without eye movements, has been shown to modulate the
representation of value in the orbitofrontal cortex (OFC)911, which is a
key brain area involved in value-based decision making and adaptive
behavior1216.Inaddition,theOFCneuronalactivitiesencodedeach
choice options value alternately during value-based decision making,
which might be related to attentional shifts17,18. Based on these studies,
it has been proposed that the OFC activity reects the value of the
attended item while multiple items are presented.
However, in these studies, the researchers inferred the location of
attention either from reward or visual salience11 or from gaze
location9,10. Without clear behavior or neurophysiology markers of
attention, these studies only provided indirect evidence for the pro-
posed theory. Especially when the behavior task and the stimulus or
reward salience direct attention to distinct locations, an experiment
that allows one to measure the attention location directly is necessary
for investigating how attention from different sources modulates the
OFCs value encoding.
To this end, we trained two macaque monkeys to perform a visual
detection task with a Posner cueing paradigm19. A pair of visual stimuli
were presented simultaneously, and a cue indicated which of the sti-
muli would change its luminance. The monkeys had to direct their
attention toward the cued stimulus to detect the luminance change
better. In addition, the stimuli were well-trained shapes that were
associated with different amounts of reward. They acquired different
levels of salience through over-training and couldcapture attention via
Received: 11 August 2021
Accepted: 11 October 2022
Check for updates
1
Institute of Neuroscience, Key Laboratory of Primate Neurobiology, Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of
Sciences, Shanghai 200031, China.
2
University of Chinese Academy of Sciences, Beijing 100049, China.
3
These authors contributed equally: Wenyi Zhang,
Yang Xie. e-mail: tyang@ion.ac.cn
Nature Communications | (2022) 13:6306 1
1234567890():,;
1234567890():,;
Content courtesy of Springer Nature, terms of use apply. Rights reserved
a bottom-up mechanism. We were able to verify that the monkeys
directed their attention according to the cue based on both the
monkeysbehavior and the dorsolateral prefrontal cortex (DLPFC)
neuronal activities. However, contrary to the proposed theory, we
observed that the value encoding in the OFC was dominated by reward
salience, regardless of where the cue and the attention were. The
results indicate that the OFC maintains a stable salience-based repre-
sentation of value that is only weakly modulated by spatial attention.
Results
Behavioral task and subject performance
We trained two monkeys to perform a visual detection task (Fig. 1a, d).
In this task, the monkeys were instructed to covertly pay attention to
one of the two stimuli to detect a luminance change at a random delay.
The stimulus with the luminance change was cued by a frame that was
presented on the side opposite to the change location 200 ms before
the onset of the stimulus pair. The monkeys were required to report
the change by making a saccade toward an eye-movement target
located above the xation spot to get a juice reward. For monkey D, the
cue was valid in 90% of the trials. In the remaining trials (invalid-cue
trials), the change location was on the same side of the frame (un-cued
location). Monkey G was also trained to detect the luminance change
at the cued location (target trials: 80%). In addition, to encourage
monkey G to direct its attention appropriately, we required it to ignore
the luminance change at the un-cued location and not to make a
response (distractor trials: 20%). In half of the distractor trials, another
luminance change would happen at the cued location after the dis-
tractor(distractor+target trials: 10%), and monkey G was rewarded for
detecting the change. In the other half of the distractor trials (dis-
tractor-only trials: 10%), there was no luminance change at the target
stimulus, and the monkey had to hold its xation till the endof the trial
to receive a reward.
The stimulus set included 5 pictures for each animal. Each picture
was associated with a juice reward of a different size (Fig. 1a, d, inset).
The reward for correct responses was randomly chosen between the
rewards associated with the two stimuli. Therefore, the reward out-
comes were only 50% certain. The initial frame cue, as well as the
stimulus where the luminance occurred, did not convey information
about which stimuluss reward would be delivered. We initially trained
the monkeys withsingle-stimulus trials, in which only one stimulus was
presented, and the frame always appeared on its opposite side. Its
associated reward would be delivered for a correct response, and there
was no ambiguity about the reward outcome.
For the ease of the discussion, between the two stimuli presented
in each trial, we refer to the stimulus that is cued to have a luminance
change as the cued stimulus, its value as cued value (CV). The other
stimulus is referred to as the un-cued stimulus and its value as un-cued
value (UCV). In addition, as the monkeys were over-trained with this
task and were familiar with the stimuli, the stimuli acquired different
levels of salience due to their reward associations: high salience for
stimuli associated with larger rewards and low salience for stimuli
associated with smaller rewards. Accordingly, we refer to the stimulus
a
Monkey D
Fixation
Attention cue on
Visual stimulus on
Attention cue off
Luminance change
1000 ms
200 ms
200 ms
400-1800 ms
Valid-cue
Invalid-cue
~0 1 2 4 8
Stimulus set:
Juice (drops):
90%
10%
Cued Salient
d
Monkey G
Fixation
Attention cue on
Visual stimulus on
Attention cue off
Luminance change
1000 ms
200 ms
200 ms
400-2300 ms
Target
Distractor-only
~0 1 2 4 8
Stimulus set:
Juice (drops):
Distractor
+ target
80%
10%
10%
Cued Salient
p=0.081
Valid Invalid
150
200
250
Reaction time (ms)
**
Valid Invalid
0.5
0.6
0.7
0.8
0.9
1
Hit rate
Lum change on salient stim
Lum change on non-salient stim
** p=0.25
**
bc
Target (hit)
Distractor-only/distractor+target
(corr rej)
Distractor+target (hit)
0.5
0.6
0.7
0.8
0.9
1
Hit/correct rejection rate
**
**
**
150
200
250
300
350
Reaction time (ms)
Target (hit)
Distractor-only/distractor+target
(false alarm)
Distractor+target (hit)
**
*
**
** **
ef
p=0.37
Fig. 1 | Behavioral tasks and monkeysperformance. a Monkey D had to detect
when any of the two stimuli changed its luminance. The location of the luminance
changewas indicated bya cue (90% valid) thatappeared on theside oppositeto the
luminance change. When the cue was invalid (10% of trials), the luminance change
occurred on the same side of the cue. The monkey reported the luminance change
by makingan eye movementto the eye-movement targetlocated abovethe xation
point. The stimuli were associated with different amounts of juice reward (inset).
Random pairings of stimuli were selected for each trial. The stimulus with a larger
associated reward is referred to as the salient stimulus, and the stimulus where
attention shouldbe direct to is thecued stimulus.The computer randomly selected
one stimulus from the pair and delivered its associated reward for a correct
response. bThe proportion of correct responses whenthe cue was validand invalid
and when the luminance change occurred at the salient and the non-salient sti-
mulus. cThe reaction times when the cue was valid and invalid and when the
luminance changeoccurred at the salient and thenon-salient stimulus. dMonkey G
was requir ed to detect the luminance change at the ta rget location, ind icated with
the cue frame (target trials, 80%), and ignore the change at the un-cued location
(distractor trials, 20%). In half of the distractor trials, another luminance change
happened at the cued location after the distractor changed its luminance (dis-
tractor+target trials, 10%). The stimuli were associated with different amounts of
juice reward (inset). eThe proportion of correct responses in the three trial types
and when the luminance change occurred at the salient and the non-salient sti-
mulus. fThe reaction time of detecting the target and the distractor luminance
change and when the luminancechange occurred atthe salient and the non-salient
stimulus. The error bars are SEM across sessions (monkey D: n= 71; monkey G:
n= 100), * denotes p< 0.01, ** denotes p< 0.001, two-tailed Wilcoxon signed-
rank test.
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 2
Content courtesy of Springer Nature, terms of use apply. Rights reserved
associated with a larger reward as the salient stimulus, and the other
one as the non-salient stimulus. Their values are termed salient value
(SV) and non-salient value (NSV). SV and NSV are the same in trials with
a pair of identical stimuli.
Both monkeys learned the task and used the cue appropriately to
direct their attention. When the cue was valid, monkey D were more
accurate (valid: mean hit rate = 94.2%, SEM = 0.2%; invalid: mean hit
rate = 90.3%, SEM = 0.4%. p0.001, two-tailed Wilcoxon signed-rank
test. SEMs are across sessions) and responded faster (valid: mean
reaction time (RT) = 224.5 ms, SEM = 2.9 ms; invalid: mean RT = 234.1
ms, SEM = 3.6 ms. p0.001, two-tailed Wilcoxon signed-rank test).
Similarly, Monkey G accurately detected the luminance change at the
target (target: mean hit rate = 92.2%, SEM = 0.3%; distractor+target:
mean hit rate = 90.0%, SEM = 0.6%) and ignored the distractorslumi-
nance change (distractor trials: mean correct rejection rate = 81.4%,
SEM =0.6%). The response latency of the target was shorter than that
of the distractor in the false alarm trials (target responses in target
trials: mean RT = 278.2 ms, SEM = 0.9ms; target responses indistractor
+target trials: mean RT = 271.2 ms, SEM = 1.1 ms; distractor responses:
mean RT = 297.8 ms, SEM = 1.0 ms. p« 0.001 for the difference
between the target response in either target or distractor+ target trials
and the distractor responses, two-tailed Wilcoxon signed-rank test).
The stimulus salience also affected the monkeysperformance,
although to a much lesser degree than the cue. The monkeys were
more likely to detect a luminance change of a salient stimulusthan that
of a non-salient stimulus (monkey D: valid salient: mean hit rate =
94.8%, SEM = 0.2%; valid non-salient: mean hit rate = 93.9%, SEM =
0.2%; monkey G: target salient: mean hit rate = 94.0%, SEM = 0.3%;
target non-salient: mean hit rate = 91.2%, SEM = 0.4%. p0.001 for
both monkeys, two-tailed Wilcoxon signed-rank test, Fig. 1b, e). Mon-
key G also responded faster when the target stimulus was the salient
stimulus (target salient: 273.8 ms, SEM=1.0ms; target non-salient:
282.2 ms, SEM = 1.0 ms. p0.001, two-tailed Wilcoxon signed-rank
test, Fig. 1f), while such an improvement was not seen in monkey D
(Fig. 1c). These behavior improvements caused by reward salience
were however much smaller than those caused by the cue (monkey D:
valid non-salient: mean RT = 225.4 ms, SEM = 2.9 ms; invalid salient
mean RT = 236.3 ms, SEM = 3.1 ms; monkey G: target non-salient: mean
RT = 282.2 ms, SEM = 1.0 ms; distractor salient: mean RT = 294.4 ms,
SEM = 1.2 ms. p0.001, two-tailed Wilcoxon signed-rank test,
Fig. 1c, f). As the reward for correct responses was randomly chosen
between the two stimuli, paying attention to the salient stimulus did
not bring any behavior benets to the monkeys or provide information
about the expected reward. Correspondingly, the behavior of the
monkeys indicated that their attention was dominated by the cue.
DLPFC population encodes spatial attention shifts
In addition to the evidence from the monkeysbehavior, we recorded
single-unit activity from the DLPFC neurons to further conrm that the
monkeysattention was directed to the cued location. A total of 406
DLPFC single units (monkey D: 240; monkey G: 166) from Walkers
areas 8a, 8b, 46d, and 46v (Supplementary Fig. 1) were recorded, and
their activities were used to decode the location of attention20,21.
One important feature of our task is that the frame serving as the
cue was displayed on the side opposite to where the attention needed
to be directed. The onset ofthe frame might rst capture the monkeys
attention to a wrong location via a bottom-up process. Afterward, the
monkeys had to shift their attention to the opposite side where the
luminance change was expected. This attention shift was captured by
many DLPFC neurons that encoded spatial attention location. In Fig. 2a
we show such an example neuron. The neuron had larger responses
initially when the frame appeared to the left of the xation point (blue
trace). After the cue offset, its response became greater in trials that
had the cue frame on the right but the cued stimulus on the left (red
trace). The neurons responses reected the attention shift from the
side of the frame to the opposite side where the luminance change was
expected.
Population analyses conrmed that the DLPFC neurons encoded
spatial attention location and reected the attentional shift. To pool
neurons with different spatial selectivity together, we considered the
whole population of recorded neurons as a high-dimensional repre-
sentation of task-related variables17,22,23 and trained a decoder based
on the linear discriminant analysis (LDA) algorithm with a pseudo
neuronal ensemble to decode the frame location. The pseudo neuro-
nal ensemble was composed in such a way that each neuron had the
same number of left and right frame trials (see Methods). Thedecoders
based on the LDA provided the posterior probabilities of the frame
location.
We trained and tested the decoders with DLPFC neurons
responses using a sliding window (size: 25 ms, step: 10 ms) and plotted
the decodersperformance as a heat map (Fig. 2b). When the decoders
were trained and tested with the responses at the same time point,
indicated by the diagonal in Fig. 2b, the cue location can be reliably
decoded shortlyafter the onset of the attention cue (Fig. 2c, left) until
after the luminance change and the animalsresponse(Fig.2c, right).
During the period from ~200 ms after the stimulus onset until after the
luminance change, the encoding of the attentionlocation was stable so
that the cross-temporal decoding performance was maintained
(Fig. 2b). Furthermore, the cross-temporal decoders performance
revealed the attention shift after the shape onset. We trained a decoder
with the neuronal responses 50200 ms before the stimuli onset when
only the frame but not the stimuli were on the screen. The decoders
performance was then tested with the responses at the other time
points in a trial (Fig. 2d). The decoders performance quickly dropped
below the shufed level after the stimulus onset and reached its
minimum after the luminance change. Lower-than-shufed perfor-
mance indicates that the attention location shifted to the side opposite
to where it was when the decoder was trained. This attention shift can
be also observed in Fig. 2b in the dark blue regionsan indication of
the performance below the chance levelwhere the decoders were
trained and tested at different time points when the frame was pre-
sented and when the stimuli were presented.
The encoding of attention location in the DLPFC correlated with
the monkeysperformance. We divided the trials by the monkeysRT
into two halves and tested the decoders performance in each half of
the trials. The decoder performed signicantly better before the onset
of the luminance in trials when the monkeys reacted faster to the
luminance change, suggesting that the attention was more likely to be
allocated correctly in those trials (Fig. 2e, Supplementary Fig. 2). In
addition, we were able to decode attention in the correct trials better
than in the error trials (Supplementary Fig. 3).
OFC responses were dominated by the salient value
After verifying that the monkeys directed their attention to the proper
location, we investigated how the OFC neuronsvalue encoding was
modulated by attention.
We recorded the activities of 349 OFC neurons (monkey D: 212;
monkey G: 137) from Walkers areas 11 and 13 (Supplementary Fig. 1).
Many of them encoded stimulus values. An example neuron is shown
in Fig. 3a, d, g, j. We rst used trials with two identical stimuli to
measure the neurons value selectivity. In these trials, the monkeys
should be certain of the reward they would get for a correct response,
and our previous study indicated that the OFC neurons responded
similarly to a pair of identical stimuli and to the same stimulus pre-
sented alone11. The example neuron encoded the value of the stimuli;
its responses were larger when the value was larger (Fig. 3a). To study
the neurons responses to a pair of distinct stimuli, we rst groupedthe
trials by SV. The neuronsresponsesreected SV stably during
the stimulus period well until after the luminance change (Fig. 3d).
We calculated the neuronsring rates between the stimulus onset and
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 3
Content courtesy of Springer Nature, terms of use apply. Rights reserved
the luminance change and plotted them against SV (Fig. 3g). The
responses to SV were similar to the responses when a pair of the same
stimuli were presented. The non-salient stimulus was largely ignored.
In comparison, the neuron exhibited similar responses when the trials
were grouped by CV and when the trials were grouped by the UCV
(Fig. 3j). Both CV and UCV contributed to the neurons responses
similarly. The results suggested that the reward salience but not the
cue dominated the example neurons responses.
The OFC population exhibited the same trend. Among the 349 OFC
neurons from which we recorded, 126 neurons were selective to value
(Supplementary Table 1). We divided these neurons into positively
tuned neurons and negatively tuned neurons based on their value
tuning (see Methods). The positively tuned neurons had larger
responses when the values were higher (Fig. 3b, 73 out of 126 value-
tuned neurons), and the negatively tuned ones had the opposite tuning
(Fig. 3c, 53 out of 126 value-tuned neurons). Both groups of neurons
encoded SV stably during the stimulus period (Fig. 3e, f). Their ring
rates sorted by SV were similar to when a pair of the stimuli with the
same value as SV were presented (Fig. 3h, i and Supplementary Fig. 4),
and their responses did not distinguish between CV and NCV (Fig. 3k, l
and Supplementary Fig. 5). The results were consistent across the two
individual monkeys (Supplementary Fig. 6). Finally, to study the inter-
action between cue and reward salience, we grouped the trials by cued-
SV, un-cued-SV, cued-NSV, and un-cued-NSV and plotted the OFC neu-
ronsresponses to each group condition. The responses were mostly
divided between the salient and non-salient groups, while the cued and
the un-cued groups exhibited similar responses, indicating little inter-
action between cue and reward salience. (Supplementary Fig. 7).
b
d
-400 -200 0 200 400 600 -400 -200 0 200 400
0
0.2
0.4
0.6
0.8
1
Posterior probability
Time from stim on (ms)
-400
-200
0
200
400
600
Time from stim on (ms) Time from lum change (ms)
-400 -200 0 200 400
Time from lum change (ms)
Training
Testing
0.2
0.3
0.4
0.5
0.6
0.7
0.8
Posterior
probability
-400 -200 0 200 400 600
-400
-200
0
200
400 Cue on Cue off
Cue on Cue off
Stim on Luminance change
Luminance change
Stim on
c
-400 -200 0 200 400 600 -400 -200 0 200 400
0
0.2
0.4
0.6
0.8
1
Posterior probability
Attention cue loc
Control
Cue on Cue offStim on Luminance change
Cue on Cue offStim on Luminance change
e
-400 -200 0 200 400 600 -400 -200 0 200 400
Time from stim on (ms)
0
0.2
0.4
0.6
0.8
1
Posterior probability
Quick
Slow
Time from lum change (ms)
Cue on Cue offStim on Luminance change
-400 -200 0 200 400 600 -400 -200 0 200 400
0
5
10
15
20
25
30
35
Firing rates (spikes/s)
Right attention cue
Left attention cue
Time from stim on (ms)
Time from lum change (ms)
Cue on Cue offStim on Luminance change
a
Time from stim on (ms)
Time from lum change (ms)
Time from stim on (ms)
Time from lum change (ms)
Attention cue loc
Control
Fig. 2 | DLPFC encoded spatial attention. a The responses of an example DLPFC
neuron.The red line indicates trials when thecue was on the right side and the blue
line indicates trials when thecue was on the left side.The shaded areasaround each
line denote SEM across trials. bThe posterior probability of the attention location
decodedfrom DLPFC pseudo-population ensemble activities.Both the training and
the testing of the LDA decoder used sliding windows of 25 ms at 10ms steps. cThe
posterior probability of the attention location from the decoder that was trained
and testedwith the DLPFCneuronal responses atthe same time (thediagonal linein
b). Signicance was assessed with two-tailed paired t-tests (p<0.01 with FDR cor-
rections for multiple comparisons), tested against the data with shufed cue
location labels. The black solid trace indicates the posterior probability calculated
from the actual data. The black dashed trace indicates the posterior probability
calculated from the shufed data.The black segments at the top indicate when the
actual data performed signicantly better than the shufed data. Thin gray lines
represent SEM across trials. d.Sameasbexcept that the decoder was trained with
the mean activities at 50200 ms before the stimulus onset. The black segments at
the bottomindicate whenthe performance was signicantly lower thanthe control,
indicating the ip of attention to the opposite side. Signicance was assessed with
two-tailed paired t-tests (p<0.01 with FDR corrections for multiple comparisons).
Thin gray lines represent SEM across trials. eThe posterior probability of the
attention location decoded from the fast (black line) and the slow trials (gray line).
The trialswere divided by the median reaction time in each neuron. Thin gray lines
represent SEM across trials.Signicance was assessed with two-tailed paired t-tests
(fast versus slow, at p< 0.01 with FDR corrections for multiple comparisons).
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 4
Content courtesy of Springer Nature, terms of use apply. Rights reserved
We further quantied the contributions of SV and NSV to the OFC
neuronal responses with a linear regression model that contained both
value variables as well as a binary term that indicated the frame loca-
tion. We calculated the coefcient of partial determination (CPD) to
measure how much variance was explained by each variable (see
Methods). The average CPD of SV rose signicantly above the baseline
shortly after the stimulus onset (Fig. 4a). In contrast, the CPD of NSV
remained low and was not signicantly different from the baseline in
most time bins. The paired t-tests that compared the CPDs between SV
and NSV indicated that the OFC neurons encoded the SV much better
in almost all the time bins after the stimulus onset (p< 0.005 with FDR
correction for multiple comparisons). The frame location was tran-
siently encoded when the frame was presented and after the lumi-
nance change but was not signicantly encoded during most of the
stimulus presentation period.
This SV dominance is also observed at the level of individual
neurons. We calculated each neurons average responses during the
stimulus period and used them to determine the CPD for SV and NSV
a
-1
-0.5
0
0.5
1
1.5
2
Normalized firing rates (spikes/s)
~0 drop
1 drop
2 drops
4 drops
8 drops
-400 -200 0 200 400 600
Time from stim on (ms)
-400-200 0 200 400
Time from lum change (ms)
Cue on Cue offStim on Luminance change
(n=73)
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
-400 -200 0 200 400 600
Time from stim on (ms)
-400 -200 0 200 400
Time from lum change (ms)
Cue on Cue offStim on Luminance change
b
(n=53)
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
-400 -200 0 200 400 600
Time from stim on (ms)
-400 -200 0 200 400
Time from lum change (ms)
Cue on Cue offStim on Luminance change
c
Same
value
-1
-0.5
0
0.5
1.5
2
Normalized firing rates (spikes/s)
-400 -200 0 200 400 600
Time from stim on (ms)
~0 drop
1 drop
2 drops
4 drops
8 drops
-400-200 0 200 400
Time from lum change (ms)
d
Cue on Cue offStim on Luminance change
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
-400 -200 0 200 400 600
Time from stim on (ms)
-400 -200 0 200 400
Time from lum change (ms)
Cue on Cue off
Stim on Luminance change
e
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
-400 -200 0 200 400 600
Time from stim on (ms)
-400 -200 0 200 400
Time from lum change (ms)
Cue on Cue offStim on Luminance change
f
Value size
~0 248
-1.5
-1
-0.5
0
0.5
1
1.5
2
Normalized firing rates (spikes/s)
Salient value
Same value
1
g
~0 2 4 8
Value size
-1.5
-1
-0.5
0
0.5
1
1.5
2
Normalized firing rates (spikes/s)
Cued value
Uncued value
1
j
~0 1 2 4 8
Value size
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
h
~0 1 2 4 8
Value size
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
k
~0 1 2 4 8
Val ue si ze
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
i
~0 1 2 4 8
Value size
-0.4
-0.2
0
0.2
0.4
0.6
Normalized firing rates (spikes/s)
l
Salient
value
Same value
versus
salient value
Cued value
versus
uncued value
Example neuron
Positive-tuned
neurons
Negative-tuned
neurons
(n=73)
(n=73)
(n=73)
(n=53)
(n=53)
(n=53)
1
-1.5
-1.5
Fig. 3 | Attentional modulation of the OFC responses. a The responses of an
example OFC neuron intrials when the same rewards wereassociated with the two
stimuli, aligned to the stimulus onset (left) and the luminance change (right). The
reward sizes are indicated by different shades of gray. Shading areas indicate SEM
across trials. bThe population response of the positively tuned OFC neurons
(n= 73) in trials when the same rewards were associated with the two stimuli.
Shading areas indicate SEM across neurons. cSame as b, but for the negatively
tuned OFC neurons (n=53).dThe responses of the example neuron with the trials
grouped by SV. The reward sizes are indicated by different shades of yellow.
Shading areas indicate SEM across trials. eThe responses of the positively tuned
OFC neurons with the trials grouped by SV. The reward sizes are indicated by
different shades of yellow. Shading areas indicate SEM across neurons. fSame as
(d), but for the negatively tuned OFC neurons. gThe average responses of the
example neuron during the period between the stimulus onset and the luminance
change. The black trace is based on the trials with the same-reward stimulus pairs.
The yellow trace is based on the trials grouped by SV. The error bars indicate SEM
across trials. Two-way ANOVA (group: F
1413
=0.79,p=0.37;value: F
4413
= 40.40,
p0.001). hThe average responses of the positively tuned OFC neurons during
the period betweenthe stimulusonset and the luminance change. The blacktrace is
based on the trials with the same-reward stimulus pairs. The yellow trace is based
on the trials grouped by SV. The error bars indicate SEM across neurons. Two-way
ANOVA (group: F
1724
=3.22, p=0.07;value:F
4724
= 77.71, p0.001). iSame as (h),
but for the negatively tuned OFC neurons. Two-way ANOVA (group: F
1524
=1.97,
p=0.16; value: F
4524
=46.03,p0.001). jSame as g, but trials are grouped by CV
(red) and NCV (purple). Two-way ANOVA (group: F
1690
=0, p= 0.99; value:
F
4690
=21.04
,
p0.001). kSame as h, but trials are grouped by CV (red) and NCV
(purple). Two-way ANOVA (group: F
1724
=0.16, p= 0.68; value: F
4724
=104.56,
p0.001). lSame as i, but trials are grouped by CV (red) and NCV (purple). Two-
way ANOVA (group: F
1524
=0.01, p=0.92;value:F
4524
=88.54,p0.001).
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 5
Content courtesy of Springer Nature, terms of use apply. Rights reserved
for each value-selective neuron (Fig. 4b). The data points of the neu-
rons spread along the axis of the CPD for SV (SV: 105/349 signicant,
p< 0.05, linear regression). Their CPDs for NSV, on the other hand,
clustered within a small range (NSV: 34/349 signicant, p< 0.05, linear
regression). The distribution of the neuronsCPD for SV was much
more extended in the range than that for NSV, and the mean of the
CPDs for the SV was larger than that of UCV (t= 6.95, p=1.8e11, two-
tailed paired t-test) (Fig. 4c).
By contrast, when we regressed the OFC neuronal responses
against CV and UCV, we obtained overall similar CPDs, suggesting that
CV and UCV contributed similarly to the responses (Fig. 4d). Other
than a short period after the stimulus onset and another one after the
luminancechange,thedifferencebetweenCVsandUCVsCPDswas
not signicant during most of the stimulus period. During the period
when the CPD of CV was larger, there were extra visual inputs, the
frame initially and the luminance change in the end. When we plotted
the neuronsCPDs for CV and those for UCV, they tended to lie closely
to the diagonal (Fig. 4e). Nevertheless, the CPDs for CV were slightly
but signicantly higher than those for NCV (t=3.09, p=2.2e3, two-
tailed paired t-test). These results indicate that the cue and the spatial
attention weakly and transiently modulated the neuronsvalue tuning.
These results were again consistent across the two individualmonkeys
(Supplementary Fig. 8). Further regression analyses based on the four
combinations of SV, NSV, CV, and NSV also point to the same con-
clusion (Supplementary Fig. 9).
Finally, to conrm that the weak attentional modulation was not
due to a washout effect by the many trials in which CV and UCV had
similar values, we studied the trial conditions in which we would
expect the largest modulation on the neuronsresponses by spatial
attention. According to our previous study11, the cue-induced mod-
ulation might be most evident when the cue directed the attention
toward the non-salient picture when the SV was the greatest (8 drops
of juice). In this condition, the cue shifted the attention away from the
most salient picture. This would provide us an opportunity to observe
any potential modulation of the neuronsresponses by the cue.
Therefore, in these trials, we compared the neuronsresponses when
the cue directed the attention toward the 8-drops-of-juice cue (CV = 8)
and when the cue directed the attention away from it (CV =~0, 1, 2, 4,
or 8 and UCV=8). Again, we looked at the positively and negatively
tuned OFC neurons separately. When the attention was on the 8-drops-
of-juice stimulus, the cue and the reward salience were consistent, and
the OFC neurons largely ignored the value of the less salient stimulus
(linear regression, H
0
:slope=0:p= 0.44, positively tuned neurons;
p= 0.30, negatively tuned neurons). The responses were highest for
the positively tuned neurons and lowest for the negativelytuned ones,
both similar to their responses when a pair of 8-drops-of-juice stimuli
were presented (Fig. 5a, b). When the cue directed the attention away
from the 8-drops-of-juice stimulus, it lowered the responses of posi-
tively tuned neurons very slightly (F
1723
=3.74, p= 0.05) and failed to
increase the responses of the negatively tuned neurons (F
1524
= 1.18,
p= 0.28). A complete shift of responses toward the less salient sti-
mulus would have produced responses similar to those when a pair of
the less salient stimuli were presented (black trace in Fig. 5a, b).
Attending away from the most salient stimulus did not abolish the
-400 -200 0 200 400 600 -400 -200 0 200 400
Time from lum change (ms)
0
0.01
0.02
0.03
0.04
CPD
Attention cue loc
Salient value
Non-salient value
-400 -200 0 200 400 600 -400 -200 0 200 400
Time from lum change (ms)
0
0.01
0.02
0.03
0.04
CPD
Attention cue loc
Cued value
Uncued value
0 0.1 0.2 0.3 0.4
CPD of salient value
0
0.1
0.2
0.3
0.4
CPD of non-salient value
(p = 1.8e-11)
0 0.1 0.2 0.3 0.4
CPD
0
10
20
30
40
50
Number of units
0.074
Sig salient value
Non-sig salient value
0
20
40
60
80
100
Number of units
0.014
Sig non-salient value
Non-sig non-salient value
0 0.1 0.2 0.3 0.4
CPD of cued value
0
0.1
0.2
0.3
0.4
CPD of uncued value
(p = 2.2e-3)
0
10
20
30
40
50
Number of units
0.054
Sig cued value
Non-sig cued value
0 0.1 0.2 0.3 0.4
CPD
0
20
40
60
80
100
Number of units
0.043
Sig uncued value
Non-sig uncued value
Time from stim on (ms)
Cue on Cue offStim on Luminance change
Time from stim on (ms)
Cue on Cue offStim on Luminance change
a
d
c
f
b
e
n=349 n=118n=118
n=127
n=349n=127
Fig. 4 | OFC neuronal responses dominantly encoded the salient value. a OFC
neuron (n= 349) ring rates were regressed against attention cue location, SV, and
NSV. Plotted is the time course of the population average coefcients of partial
determination (CPD). Signicance was assessed with two-tailed paired t-tests
(p< 0.005,with FDR corrections for multiple comparisons) compared to a baseline
computed with the CPD between 0 and 200ms before the cue onset averaged
across different regressors. The blue, yellow,and green bars at the top indicate the
signicant CPDs of cue location, SV, and NSV, respectively, and the black bar
indicates signicant differences between SV and NSV. The error bars indicate SEM
across neurons. bThe CPDs of SV against the CPDs of NSV for individual value-
selective OFC neurons (n= 118). The yellow data points are the neurons with non-
zero coefcients for SV only, the green data points are the neurons with non-zero
coefcients for NSV only, and the black data points are the neurons that encoded
both SV and NSV (one-samplet-test, p< 0.05, without multiple comparisons). Two-
tailed paired t-test was conducted to compare the mean CPDs of SV and NSV of all
the OFC neurons (n=349).cTop: the distribution of the CPDs for SV of the value-
selective OFC neurons (n= 118).Bottom: the distribution of the OFC neuronsCPDs
for NSV. Vertical dashed lines indicate the mean. Filled bars indicate signicant
neurons. dSame as a, but for CV and UCV (n=349).eSameas b, but for CV and UCV
(n= 127). fTop: the distribution of the CPDs for CV of the value-selective OFC
neurons (n= 127). B ottom: the distr ibution of the OFC neuronsCPDs for UCV.
Vertical dashed lines indicate the means. Filled bars indicate signicant neurons.
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 6
Content courtesy of Springer Nature, terms of use apply. Rights reserved
dominant representation of SV by the OFC, and the overall neuronal
responses were very similar regardless of whether the cue was on the
most salient stimulus or not. The results were conrmed in the two
individual monkeys (Supplementary Fig. 10).
DLPFCs value encoding was also dominated by reward salience
It has been suggested that the DLPFC sits downstream of the OFC in
the processing of value information. We wonder whether the dom-
inance of the value encoding in OFC by reward salience may be
inherited by the DLPFC. On the other hand, the DLPFC has a strong
spatial attention signal that is lacking in the OFC (Supplementary
Fig. 11). One may expect that spatial cue and attention may play a more
substantial role in DLPFCs value encoding.
We carried out the analyses on DLPFC neuronal responses parallel
totheanalysesontheOFC.Theresults,whicharesummarizedinFig.6
and Supplementary Figs. 1214, indicated that the DLPFCsvalue
encoding was similarly dominated by reward salience, and spatial
attention did not affect DLPFCs value encoding signicantly. The
regression analyses similar to Fig. 4but done to the DLPFC neuronal
responses reveal that the population average CPD of SV was sig-
nicantly above the baseline right after the stimulus onset and lasted
until the end of the trials (Fig. 6a). The CPD of NSV was low, although
initially signicant, and did not last until the luminance change.
Importantly, the CPDs of SV were consistently larger than the CPDs of
NSV during the whole stimulus period (Fig. 6a). The CPDs of individual
units for SV were also signicantly greater than those for NSV (Fig. 6b,
c, t=7.27, p=1.9e12, two-tailed paired t-test). The dominance of SV
was strong in monkey G (Supplementary Fig.13gi). Monkey Ds DLPFC
neurons encoded SV relatively weakly, yet the encoding was still
stronger than that of NSV (Supplementary Fig. 13ac). In contrast, the
encodings of CV and UCV in the DLPFC were similar, both when in the
combined data (Fig. 6df, t=1.56,p= 0.12, two-tailed paired t-test) and
in the individual monkeys (Supplementary Fig. 13df, jl). Therefore,
despite that a strong spatial attention signal is present in the DLPFC
(blue traces in Fig. 6a, d), its modulation of the value encoding
was weak.
Comparison between OFC and DLPFC
Finally, to compare the representations of spatial attention and value
in the OFC and the DLPFC, we plotted the CPD for attention cue
location against the CPD for SV for each neuron from the two areas
(Fig. 7). The CPDs were calculated from regression models similar to
those used in Figs. 4b, 6b, but the average neuronal responses were
computed with the period between the frame cue offset and the
luminance change, during which spatialattention wasmostly drivenby
the top-down mechanism and on the opposite side of the cue. The
CPDs of OFC neurons are largely distributed along the SV axis, while
the CPDs of DLPFC neurons were more evenly distributed along both
axes. Furthermore, for the OFC neurons, the distribution of the neu-
ronsdifference between the two CPDs strongly skewed toward SV
(Fig. 7insert, mean ΔCPD = 0.072, p=1.7e9, two-tailed paired t-test).
In contrast, the DLPFC neurons did not exhibit such a difference (mean
ΔCPD = 0.0031, p= 0.43, two-tailed paired t-test). The analyses con-
rmed that SV dominatedthe OFC responses, while bothSV and spatial
attention were well represented in the DLPFC. Similar results were
observed in individual monkeys (Supplementary Fig. 15).
Discussion
Here, we demonstrated that while both the behavior and the DLPFC
neuronal responses indicated that the monkeys directed their atten-
tion to the cued location, the OFC neuronal responses were never-
theless dominated by stimulus reward salience and only weakly
affected by spatial attention. The results argue against the previously
proposed theory that attention serves a s a selection mechanism for the
OFCs value encoding when multiple items with different value asso-
ciations are presented11.
The dominance of reward salience that we observed cannot be
attributed to spatial attention, decision making, or eye movements.
First, unlike reward salience, which was constant in a trial, the cue that
indicated the luminance change was on the opposite side of the
luminance change, and the spatial attention needed to switch sides.
Therefore, spatial attention could be clearly identied with the switch
of location and be distinguished. Second, the monkeys had to report
their decision by making a saccade to a target above the xation point.
The motor preparation signals were thereby dissociated from the
attention and salience signals. Third, the reward that the monkey
received was randomlychosen between the two stimuli. Neither the CV
nor the SVwas the expected rewardoutcome. Lastly, no decisions were
guided by the values of the stimuli, so potential attention shifts
accompanying the value-based decision-making process were mini-
mized with the current behavior paradigm.
Spatial attention in our paradigm was dictated by the attention
cue via a top-down mechanism, while reward salience may create
attention from a bottom-up subcortical circuitry, which includes
~0 12 4
8
Value size
-0.6
-0.4
-0.2
0
0.2
0.4
Normalized firing rates (spikes/s)
(n =73)
Cued value = 8
Uncued value = 8
Same value
~0 12 4
8
Value size
-0.6
-0.4
-0.2
0
0.2
0.4
Normalized firing rates (spikes/s)
(n =53)
ab
Fig. 5 | Spatial attention failed to switch the value encoding in the OFC to the
non-salient stimulus. a The positively tuned OFC neurons(n= 73) r esponses to an
8-drops-of-juice stimulus (SV) paired witha stimulus associated with~0, 1, 2, 4, or 8
drops of juice (NSV). The red line denotes trials when the attention was toward the
8-drops-of-juice stimulus, and thepurple line denotes trialswhen the attention was
away from the 8-drops-of-juice stimulus. A two-way ANOVA (attention location:
F
1723
=3.74,p=0.05;value:F
4723
=3.45,p=8.4e3) indicatesa subtle but signicant
attentional modulation. A complete switch of the value encoding to NSV would
produce responses close to the black line, which are the neuronsresponses to the
trials of a pair of stimuli with the same reward. All error bars indicate SEM across
neurons. bSame as a, but for the negatively tuned OFC neurons (n= 53). Two-way
ANOVA with group (CV = 8/UCV = 8) and value was performed (attention location:
F
1524
=1.18, p=0.28;value:F
4524
=3.68, p=5.8e3).
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 7
Content courtesy of Springer Nature, terms of use apply. Rights reserved
superior colliculus, pulvinar, ventral midbrain, and amygdala24,25.
Although reward salience is distinct from the kind of bottom-up
attention based on physical salience, interestingly, we also observed
bottom-up physical salience signals in the OFC. First, the OFC neurons
robustly, although only briey, encoded the onset of the frame (Fig. 4a,
d, blue trace), which was a salient bottom-up visual signal. Second,
there was a slightly stronger representation of the cued stimulus than
that of the un-cued after the luminance change (Fig. 4d), which,
although subtle, might also capture the attention via a bottom-up
mechanism. Therefore, while the OFC activities were minimally affec-
ted by spatial attention from the top-down source, inuences from the
bottom-up sources, no matter whether they were based on physical or
reward salience, were readily reected in the OFC.
The dominance of reward salience, which was almost all-or-
none, is in stark contrast to how neurons in the visual cortex are
modulated by spatial attention26. The responses of the visual neurons
to multiple stimuli presented in their receptive eld can be well
described with normalization models that perform a weighted aver-
age of the responses to each stimulus presented alone. Attention
modulates how the responses are weighted by adding bias toward
the attended stimulus, and the neuronsstimulus preference plays an
important role. Whether attended or not, a preferred stimulus pre-
sented in a neurons receptive eld has a signicant inuence on its
responses27. OFC neurons, however, only encode the value of the
most salient stimulus. That is true even for the negatively tuned
neurons, which prefer the non-salient stimulus. Less salient stimuli
are largely ignored by OFC neurons, and spatial attention, when
directed toward a less salient stimulus, does not enhance its encod-
ing very much in the OFC.
One might argue that the weak representation of NSV in the OFC
was because of the task design, in which the reward for a correct
response was randomly chosen between the two stimuli. Yet, the value
that was relevant to the monkeys in this taskthe mean value of the
two stimuliwas also not represented by the OFC. For that to be true,
one would expect that SV and NSV be similarly represented. Yet, the
ndings of the dominance of SV encoding and the weak attentional
modulation may not generalize to other species. Notably, the OFC in
rodents was reported to be modulated by task context28. Given the fact
that the OFCs in rodents and primates are not evolutionarily homo-
logues, such a discrepancy may highlight an interesting difference
previously unknown between the two species.
Finally, our results suggest that the representations of spatial
attention and reward expectancy are dissociable in the brain. The
modulation effects of attention and reward on neuronal activities are
similar, and many previous studies have not clearly distinguished
attention and reward in the experiment design2. Here, we demonstrated
dissociable representations of reward salience and spatial attention in
the prefrontal cortex. Spatial attention was well represented only in the
DLPFC. The reward salience signal remains stable in both the OFC and
the DLPFC even when spatial attention shifted sides via the top-down
mechanism. Such a representation of the salience signal independent of
top-down control may be necessary for the brain to evaluate the
potential targets toward which spatial attention could be directed.
Future experiments could be designed to investigate this hypothesis.
Methods
Subjects
Two male rhesus monkeys (Macaca mulatta) were used. They weighed
9.3 kg (subject D) and 6.8 kg (subject G) at the beginning of the
training. All procedures followed the protocol approved by the Animal
Care Committee of Shanghai Institutes for Biological Sciences, Chinese
Academy of Sciences (CEBSIT-2021004).
0 0.1 0.2 0.3
CPD of salient value
0
0.05
0.1
0.15
0.2
0.25
0.3
CPD of non-salient value
(p = 1.9e-12)
0
10
20
30
40
50
Number of units
0.045
Sig salient value
Non-sig salient value
0 0.1 0.2 0.3
CPD
0
20
40
60
80
100
120
Number of units
0.011 Sig non-salient value
Non-sig non-salient value
0 0.1 0.2 0.3
CPD of cued value
0
0.05
0.1
0.15
0.2
0.25
0.3
CPD of uncued value
(p = 0.12)
0
10
20
30
40
50
60
Number of units
0.039
Sig cued value
Non-sig cued value
0 0.1 0.2 0.3
CPD
0
10
20
30
40
50
60
Number of units
0.032
Sig uncued value
Non-sig uncued value
a
d
c
f
b
e
(n=141) (n=141)
(n=143)
(n=143)
-400 -200 0 200 400 600
0
0.005
0.01
0.015
0.02
0.025
0.03
0.035
CPD
Time from lum change (ms)Time from stim on (ms)
-400 -200 0 200 400
Attention cue loc
Salient value
Non-salient value
Cue on Stim on Cue off Luminance change
(n=406)
0
0.005
0.01
0.015
0.02
0.025
0.03
0.035
CPD
Attention cue loc
Cued value
Uncued value
-400 -200 0 200 400 600
Time from lum change (ms)Time from stim on (ms)
-400 -200 0 200 400
Cue on Stim on Cue off Luminance change
(n=406)
Fig. 6 | DLPFC neuronal responses were dominated by the salient value.
aDLPFC neuron (n= 406) ringrates were regressedagainst attention cue location,
SV, and NSV. Plotted is the time course of the population average coefcients of
partialdetermination(CPD). Conventions as in Fig. 4a.bThe CPDs of SV against the
CPDs of NSVfor individual value-selective DLPFC neurons (n= 141).Conventions as
in Fig. 4b. cTop: the distribution of the CPDs for SV of the value-selective DLPFC
neurons (n= 141). B ottom: the distr ibution of the DLPFC neuronsCPDs for NSV.
Vertical dashed lines indicate the mean. Filled bars indicate signicant neurons.
dSame as (a), but for CV and UCV (n=406).eSame as b, but for CV and UCV
(n= 143). fTop: the distribution of the CPDs for CV of the value-selective DLPFC
neurons (n= 143). Bottom: the distribution of the DLPFC neuronsCPDs for UCV.
Vertical dashed lines indicate the means. Filled bars indicate signicant neurons.
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 8
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Behavioral task and materials
Head-restrained monkeys were seated in a primate chair facing a
23.6 inch computer monitor at 60cm away from their eyes. The center
of the screen was adjusted to align to the midpoint of the two eyes.
Behavioral tasks were run with the MATLAB based software
MonkeyLogic29. Subjectseye position and pupil dilation were tracked
with an infrared oculomotor system at a sampling rate of 500 Hz
(EyeLink 1000). Juice was delivered by a computer-controlled
solenoid.
We trained two monkeys to perform a visual detection task
(Fig. 1a). The monkeys had to hold their gaze ata central xation point
on the screen within a wide window. After maintaining xation for
0.5 s, a yellow saccade target was presented 6.0° (monkey D) or 7.6°
(monkey G) above the xation point. The saccade target remained on
the screen until the end of the trial. After 0.5 s, a square frame
(attention cue) 2.4° wide was presented on the left or right (monkey D:
7.0°; monkey G: 7.6°) of the xation point. After another 0.2 s, two
visual stimuli of 2.4° size appeared on the left and right sides (monkey
D: 7.0°; monkey G: 7.6°) of the xation point. The stimulus locations
were rotated slightly (2 counter-clockwise) for monkey G to mini-
mize its left-right bias. The attention cue disappeared 0.2 s after the
stimulus onset. After a random variable delay, one of the two stimuli
would change its luminance.
For monkey D, there were 90% of trials (valid-cue trials) the
luminancechangewasontheoppositesideoftheattentioncue(cued
location). In the remaining trials (invalid-cue trials), the change wason
the same side of the cue (un-cued location). In both cases, the monkey
was required toreport the change within0.10.4 s by making a saccade
toward the eye movement target to receive a juice reward. The onset
latency of the luminance change was subject to an exponential decay
function (tau = 2.5 s, cut off at 1.4 s) plus 0.4 s.
To encourage monkey G to direct the attention appropriately
according to the cue, it was trained to report only the luminance
change at the cued location (target) but ignore the change at the un-
cued location (distractor). In 80% of trials, the luminance change
occurred at the cued location. Among the rest of 20% of trials, half of
them had a luminance change at the un-cuedlocation only (distractor-
only trials, 10%), and the other half had rst a luminance change at the
un-cued location and then another at the cued location (distractor +
target trials, 10%). The latencies of the luminance change at the target
and the distractor locations were subject to their respective expo-
nential decay functions (target: tau = 2.5 s, cut off at 1.9 s; distractor:
tau =1.25 s, cut off at 1.9 s) plus 0.4s. The monkey was rewarded when
they made a timely response to the target in both the target and the
distractor+target trials, and when they held their xation till the end of
the trial in the distractor-only trials.
The visual stimuli were associated with different amounts of juice
that the monkeys might get for a correct response. There were ve
stimuli for each monkey, each was associated with 1 small drop
(0.033ml), 1, 2, 4, and 8 standard drops (0.10, 0.16, 0.29, and 0.55ml)
of juice, respectively. For convenience, we label the stimulus with a
small drop of juice as ~0 standard drops in the gures and use 0 in the
regression models. The stimuli in each trial were randomly selected
from the stimulus set, and their locations (left or right) were coun-
terbalanced. For correct responses, one of the stimuli was randomly
selected, and the monkey would receive the reward associated with
that stimulus. Therefore, neither the cue nor the reward salience
provided information on which stimuluss associated reward would
be given.
During the initial training, only one stimulus was presented on the
screen, and the monkeys were rewarded with its associated juice
amount if they detected the luminance change correctly. The attention
cue always appeared on the opposite side of the xation point (always
valid). During the recording sessions, the single-stimulus and double-
stimulus trials were interleaved. There were 7.1% and 10.1% of single-
stimulus trials for monkey D and monkey G, and the rest were double-
stimuli trials.
Surgery and MRI
Before the behavioral training, both monkeys received a chronic
implant of a titanium headpost. After a 2-month recovery, they were
trained to perform the behavioral task until they achieved satisfactory
performance. Then, the monkeys received structural Magnetic Reso-
nance Imaging (MRI) scans for us todetermine the recording chamber
location. After the chamber implant surgery, a manganese-enhanced
MRI scan was conducted to verify the chamber placement. The scans
were carried out in a Siemens 3 T scanner.
During the surgery, the monkeys were sedated with ketamine
hydrochloride (10 mg/kg), and anesthesia was then induced and
maintained with isourane gas (1.52%, to effect). Body temperature,
heart rate, blood-oxygen concentration, and expired CO
2
were mon-
itored throughout the surgical procedures.
Neuronal recordings
A 2 cm * 1.5 cm chamber was implanted on the surface of the left
(subject G) and the right (subject D) prefrontal cortex, centered 31.5
(subject G) and 27.5 (subject D) mm anterior to the interaural line
(Supplementary Fig. 1). We recorded extracellular single-unit activities
with tungsten microelectrodes (FHC: 0.32MΩ;AlphaOmega:
0.53MΩ). Each electrode was driven by an independent microdrive
(AlphaOmega EPS) through a stainless-steel guide tub e. The guide tube
was placed within a grid with holes 1 mm apart. The depth of
the penetration was conrmed by the transitions between gray and
0 0.1 0.2 0.3 0.4
CPD of attention cue location
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
CPD of salient value
OFC
DLPFC
-0.2 0.2
Δ CPD
-0.0031
-0.072
0
0.02
0.04
0.06
Proportion of units
0
(n=225)
Fig. 7 | Comparison of the spatial attention and value signals between OFC and
DLPFC neurons. The CPDs of the attention cue location are plotted against the
CPDs of SV for individual OFC (orange) and DLPFC (magenta) neurons. The dotted
line is the diagonal. Only the neurons with non-zero coefcients for the attention
cue location or SV were plotted (n=225,p< 0.01, one-sample t-test without mul-
tiple comparisons). The insert shows the distribution of the CPD differences
between the two CPDs. Vertical dashed lines indicate the mean.
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 9
Content courtesy of Springer Nature, terms of use apply. Rights reserved
white matter. At mostfour electrodes were lowered at a time. Neuronal
signals were recorded with an AlphaOmega SnR system at a sampling
rate of 44 kHz. We recorded neurons from the OFCand the DLPFC. The
OFC recording locations were between the lateral and medial orbital
sulci in Walkers areas 11 and 13. The DLPFC neurons were recorded
from the area rostral to the arcuate sulcus (8a and 8b), including both
the dorsal and the ventral bank of the principal sulcus (46d and 46 v).
There were 171 recording sessions overall, 71 from monkey D and
100 from monkey G. Each recording session contained on average
1381 trials (monkey D: 1668 trials; monkey G: 1177 trials), of which 885
were correct trials (monkey D: 905 trials; monkey G: 871 trials).
Neurons with <100 valid-cue (target) trials were excluded from the
analyses.
Voltage signals of putative single neurons were isolated ofine
manually with Plexon Ofine Sorter (Plexon, Dallas, TX). Neurons with
poor isolation or with a lower than 1Hz response rate were excluded.
There were no additional selection criteria for neurons.
Behavioral analyses
All the behavior analyses were based on the monkeysperformance
during the recording sessions.
Accuracy and reaction time
For monkey D, an eyemovement to the saccade target within 0.10.4 s
after the luminance change is considered a hit. Otherwise, it would be
considered as a miss. For monkey G, hits and misses are similarly
dened, except that any eye movements to the saccade target within
the 0.10.4 s time window on the distractors luminance change are
counted as false alarms. The RT is dened as the time from the lumi-
nance change to the eye-movement initiation in the hit and false alarm
trials.
Neuronal selectivity
To determine how the value information was encoded by the neurons,
linear regressions (tlm function in Matlab Statistical Toolbox) were
performed for each task variable for each individual unit in three task
epochs:
Rnr,tðÞ=β0,ntðÞ+β1,ntðÞ*variable rðÞ+ε,ð1Þ
where R
n
(r, t) was the average neural response of unit nfor a given trial
rwithin time window t. In the single-stimulus trials, the only relevant
task variable is stimulus value (V
sin
). In the double-stimuli trials, the
task variablesinclude left stimulusvalue (LV), right stimulus value (RV),
difference between LV and RV (LV-RV), CV, UCV, difference between
CV and UCV (CV-UCV), SV, NSV, difference between SV and NSV (SV-
NSV), total value (TV) and attention cues location (att cue loc). εwas
independent Gaussian noise. The task epochs were the cue-stimulus
epoch (0.00.2 s after the stimulus onset), the early stimulus epoch
(0.20.6 s after the stimulus onset), and the late stimulus epoch
(0.00.4 s before the luminance change). A neuron is selective to a
variable if β
1,n
is signicantly different than 0 (p< 0.005).
A neuron is considered as value selective if it is selective to any
value variables above during any task epochs (cue-stimulus, early sti-
mulus,andlatestimulus).Wefurtherdividevalueselectiveneurons
into positively tuned and negatively tuned neurons. Neurons with
contradictory tuning are not included in Figs. 3,5, and Supplementary
Table1(e.g.,positivelytunedtoonevaluevariablebutnegatively
tuned to another value variable or positively tuned during one epoch
but negatively tuned during another epoch).
Decoding spatial attention
Linear discriminant analyses (LDA, classify function in Matlab Statis-
tical Toolbox) were used to decode spatial attention location from the
neuronal responses of DLPFC neurons. LDA aims to classify an
observation into one of the K classes through modeling the posterior
probability pðB=bA=aÞ,whereAis the observation (neural respon-
ses), and Bis the predictor (left attention cue or right attention cue).
The posterior probability is calculated through Bayestheorem which
requires prior probability p(B) and conditional probability density
functions pðA=aB=bÞ. LDA assumes that the probability density
function given each condition is a normal distribution. Because the left
attention cue and the right attention cue were equally likely, the prior
was set to be 0.5.
Before conducting the LDA, we z-scored the responses of each
neuron across all the trials and all time points (400600 ms around
stimulus onset and 400400 ms around luminance change). The
normalized responses were further smoothed in 100ms windows with
a Gaussian kernel (sigma = 50 ms). The LDA was performed using a
sliding window of 25 ms with 10 ms steps.
We constructed pseudo-neuron ensembles as follows. To balance
the contribution from each neuron, we rst selected the neurons with
more than 100 trials in both the left attention cue condition and the
right attention cue condition. We randomly chose 100 trials without
replacements from each condition for each neuron and constructed
the confusion matrix X2RM×T×Nwith the sampled trials, where Mis
the number of trials (M= 200, 100 trials in each cue condition), Tis the
number of time bins, and Nis the number of neurons (subject D:
N= 102; subject G: N=93).Toreducenoise,werantheprincipal
component analysis (PCA) and kept the rst Pcomponents that cap-
tured at least 70% of the variance for the LDA.
To estimate how spatial attention signals uctuated with time
(Fig. 2c, e), PCAs were run on the neuron dimension of Xat each time
bin. Only the rst Pprincipalcomponentsthatcapturedatleast70%of
the total variance were used to reconstruct the subspace
(Y2RM×T×P). The decoder was trained and tested with the recon-
structed subspace at each time bin. The posterior probability of
attention cue location given the neuronal responses was used to
quantify the decoder performance. The results were based on 200
independent leave-one-out cross-validations. Signicance was estab-
lishedby two-sided paired t-tests comparingthe actual data against the
posterior probabilities calculated in the same procedure but with the
shufed data.
In Fig. 2b, d, a PCA was run on the neuron dimension of the
confusion matrix that combined all time bins (X02RMT ×N). This
ensured that the neural representation at each time bin shared the
same subspace after PCA. Only the rst Qprincipal components that
captured at least 70% of thetotal variance were used to reconstructthe
subspace (Y02RMT ×Q). Y0was reshaped into a RM×T×Qmatrix to
enable cross-temporal decoding. The decoder was trained with the
neural responses in one time bin (Fig. 2b: 25 ms time bin, stepped by
10 ms; Fig. 2d: 50200 ms before stimulus onset), and tested with the
responses in all time bins.
Linear regression models
In Figs. 4a, d and 6a, d, linear regression models were ttoaverage
ring rates calculated with a 25 ms time window, aligned to the sti-
mulus onset and the luminance change, respectively. We constructed
two linear regression models:
Rnr,tðÞ=β0,ntðÞ+β1,ntðÞattention cue loc rðÞ+β2,ntðÞSV rðÞ+β3,ntðÞNSV rðÞ+ε,
ð2Þ
Rnr,tðÞ=β0,ntðÞ+β1,ntðÞattention cue loc rðÞ+β2,ntðÞCV rðÞ+β3,ntðÞUCV rðÞ+ε,
ð3Þ
where Rnr,tðÞis the average ring rate of neuron nin trial rat time t,
and the predictors were attention cue location(coded as 1 or 1), SV (0,
1,2,4,or8),NSV(0,1,2,4,or8),CV(0,1,2,4,or8),UCV(0,1,2,4,or8).
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 10
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Each predictor was z-scored. The CPD was calculated as:
CPDi,ntðÞ=SSEi,nðtÞSSEall,nðtÞ
SSEi,nðtÞ,ð4Þ
where CPDi,ntðÞis the CPD of variable iin neuron n, SSEi,nðtÞis the
residual sum of squares of the regression model without variable i,
SSEall,nðtÞis the residual sum of squares of the full model. The sig-
nicance of CPDi,ntðÞ is tested against the baseline (paired t-test,
p< 0.005 with FDR correction for multiple comparisons), which is the
CPD calculated with the ring rate in a 200 ms time window before the
attention cue onset averaged across different regressors:
CPDbaseline,n=1
3X
3
i=1
CPDi,nðt0Þ,ð5Þ
TheCPDsinFigs.4b, d, e, f and 6b, d, e, f were calculated similarly
but with the average ring rate between the stimulus onset and the
luminance change. To specically study the attention from the top-
down source, CPDs in Fig. 7were calculated with the average ring rate
between the frame cue offset and the luminance change.
Individual monkey analyses
All results presented in the main text are based on the combined data
from both monkeys. Analyses based on each monkey individually can
be found in Supplementary Figs. 2, 6, 8, 10, 13, 15 corresponding to
Figs. 27in the main text. The results are consistent.
Reporting summary
Further information on research design is available in the Nature
Research Reporting Summary linked to this article.
Data availability
The data used in this study are available at https://doi.org/10.5281/
zenodo.7090240.
Code availability
The custom codes supporting the ndings of this study are availableat
https://github.com/tmyang-lab/reward_salience_in_OFC.
References
1. Connor, C. E., Egeth, H. E. & Yantis, S. Visual attention: Bottom-up
versus top-down. Curr. Biol. 14,850852 (2004).
2. Maunsell, J. H. R. Neuronal representations of cognitive state:
Reward or attention? Trends Cogn. Sci. 8,261265 (2004).
3. Awh, E., Belopolsky, A. V. & Theeuwes, J. Top-down versus bottom-
up attentional control: a failed theoretical dichotomy. Trends Cogn.
Sci. 16,437443 (2012).
4. Chelazzi, L., Perlato, A., Santandrea, E. & Della Libera, C. Rewards
teach visual selective attention. Vis. Res. 85,5872 (2013).
5. Gottlieb,J.,Hayhoe,M.,Hikosaka,O.&Rangel,A.Attention,reward,
and information seeking. J. Neurosci. 34,1549715504 (2014).
6. Anderson, B. A., Laurent, P. A. & Yantis, S. Value-driven attentional
capture. Proc. Natl Acad. Sci. U. S. A. 108,1036710371 (2011).
7. Krajbich, I., Armel, C. & Rangel, A. Visual xations and the compu-
tation and comparison of value in simple choice. Nat. Neurosci. 13,
12921298 (2010).
8. Towal, R. B., Mormann, M. & Koch, C. Simultaneous modeling of
visual saliency and value computation improves predictions of
economic choice. Proc. Natl Acad. Sci. U. S. A. 110,
E3858E3867 (2013).
9. McGinty, V. B., Rangel, A. & Newsome, W. T. Orbitofrontal Cortex
Value Signals Depend on Fixation Location during Free Viewing.
Neuron 90,114 (2016).
10. McGinty, V. B. Overt attention toward appetitive cues enhances
their subjective value, independent of orbitofrontal cortex activity.
eNeuro 6,119 (2019).
11. Xie, Y., Nie, C. & Yang, T. Covert shift of attention modulates the
value encoding in the orbitofrontal cortex. Elife 7,e31507(2018).
12. Wallis, J. D. & Miller, E. K. Neuronal activity in primate dorsolateral
and orbital prefrontal cortex during performance of a reward pre-
ference task. Eur. J. Neurosci. 18,20692081 (2003).
13. Padoa-Schioppa, C. & Assad, J. A. Neurons in the orbitofrontal
cortex encode economic value. Nature 441, 223226 (2006).
14. Ballesta, S., Shi, W., Conen, K. E. & Padoa-Schioppa, C. Values
encoded in orbitofrontal cortex are causally related to economic
choices. Nature 588,450453 (2020).
15. Hunt, L. T. et al. Triple dissociation of attention and decision com-
putations across prefrontal cortex. Nat. Neurosci. 21,
14711481 (2018).
16. Murray, E. A. & Rudebeck, P. H. Specializations for reward-guided
decision-making in the primate ventral prefrontal cortex. Nat. Rev.
Neurosci. 19,404417 (2018).
17. Wallis, J. D. & Rich, E. L. Decoding subjective decisions from orbi-
tofrontal cortex. Nat. Neurosci. 7,973980 (2016).
18. Hayden, B. Y. & Moreno-Bote, R. A neuronal theory of sequential
economic choice. Brain Neurosci. Adv. 2,115 (2018).
19. Posner, M. I. Orienting of attention. Q. J. Exp. Psychol. 32,
325 (1980).
20. Tremblay, S., Pieper, F., Sachs, A. & Martinez-Trujillo, J. Attentional
Filtering of Visual Information by Neuronal Ensembles in the Pri-
mate Lateral Prefrontal Cortex. Neuron 85,202215 (2015).
21. Tremblay,S.,Doucet,G.,Pieper,F.,Sachs,A.&Martinez-Trujillo,J.
Single-Trial Decoding of Visual Attention from Local Field Potentials
in the Primate Lateral Prefrontal Cortex Is Frequency-Dependent. J.
Neurosci. 35,90389049 (2015).
22. Rigotti, M. et al. The importance of mixed selectivity in complex
cognitive tasks. Nature 497,585590 (2013).
23. Mante,V.,Sussillo,D.,Shenoy,K.V.&Newsome,W.T.Context-
dependent computation by recurrent dynamics in prefrontal cor-
tex. Nature 503,7884 (2013).
24. Takakuwa, N., Kato, R., Redgrave, P. & Isa, T. Emergence of visually-
evoked reward expectation signals in dopamine neurons via the
superior colliculus in V1 lesioned monkeys. Elife 6, e24459 (2017).
25. McFadyen, J., Mattingley, J. B. & Garrido, M. I. An afferent white
matter pathway from the pulvinar to the amygdala facilitates fear
recognition. Elife 8,e40766(2019).
26. Maunsell, J. H. R. Neuronal mechanisms of visual attention. Annu.
Rev. Vis. Sci. 1,373391 (2015).
27. Ni,A.M.,Ray,S.&Maunsell,J.H.R.Tunednormalizationexplains
the size of attention modulations. Neuron 73,803813 (2012).
28. Nogueira, R. et al. Lateral orbitofrontal cortex anticipates choices
and integrates prior with current information. Nat. Commun. 8,
113 (2017).
29. Asaad,W.F.,Santhanam,N.,Mcclellan,S.&Freedman,D.J.High-
performance execution of psychophysical tasks with complex
visual stimuli in MATLAB. J. Neurophysiol. 109,249260 (2013).
Acknowledgements
We thank John Maunsell for his comments during the preparation of
the paper. We also thank Ruixin Su, Wei Kong, and Lu Yu for their help
in all phases of the study. This work was supported by the National
Science and Technology Innovation 2030 Major Program (Grant No.
2021ZD0203701), National Key R&D Program of China (Grant No.
2019YFA0709504), Shanghai Municipal Science and Technology
Major Project (Grant No. 2018SHZDZX05), and the Strategic Priority
Research Program of Chinese Academy of Science (Grant No.
XDB32070100).
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 11
Content courtesy of Springer Nature, terms of use apply. Rights reserved
Author contributions
T.Y. conceived the original idea of the study. W.Z. and Y.X. collected the
behavioral and neurophysiological data and performed the analysis.
W.Z. and T.Y. wrote the paper. All authors designed the experiments,
discussed the results and provided the feedback on the paper.
Competing interests
The authors declare no competing interests.
Additional information
Supplementary information The online version contains
supplementary material available at
https://doi.org/10.1038/s41467-022-34084-0.
Correspondence and requests for materials should be addressed to
Tianming Yang.
Peer review information Nature Communications thanks Pieter Roelf-
sema and the other, anonymous, reviewer(s) for their contribution to the
peer review of this work. Peer reviewer reports are available.
Reprints and permissions information is available at
http://www.nature.com/reprints
Publishers note Springer Nature remains neutral with regard to jur-
isdictional claims in published maps and institutional afliations.
Open Access This article is licensed under a Creative Commons
Attribution 4.0 International License, which permits use, sharing,
adaptation, distribution and reproduction in any medium or format, as
long as you give appropriate credit to the original author(s) and the
source, provide a link to the Creative Commons license, and indicate if
changes were made. The images or other third party material in this
article are included in the articles Creative Commons license, unless
indicated otherwise in a credit line to the material. If material is not
included in the articles Creative Commons license and your intended
use is not permitted by statutory regulation or exceeds the permitted
use, you will need to obtain permission directly from the copyright
holder. To view a copy of this license, visit http://creativecommons.org/
licenses/by/4.0/.
© The Author(s) 2022
Article https://doi.org/10.1038/s41467-022-34084-0
Nature Communications | (2022) 13:6306 12
Content courtesy of Springer Nature, terms of use apply. Rights reserved
1.
2.
3.
4.
5.
6.
Terms and Conditions
Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).
Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-
scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By
accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these
purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.
These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal
subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription
(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will
apply.
We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within
ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not
otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as
detailed in the Privacy Policy.
While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may
not:
use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access
control;
use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is
otherwise unlawful;
falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in
writing;
use bots or other automated methods to access the content or redirect messages
override any security feature or exclusionary protocol; or
share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal
content.
In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,
royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal
content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any
other, institutional repository.
These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or
content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature
may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.
To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied
with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,
including merchantability or fitness for any particular purpose.
Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed
from third parties.
If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not
expressly permitted by these Terms, please contact Springer Nature at
onlineservice@springernature.com
... Therefore, head 2's attention weight creates a reward salience map for the objects in the game. Interestingly, similar value-based salience map has been demonstrated in the orbitofrontal cortex [36]. ...
... The first component, computed in the first layer, is driven by the reward saliency of the objects and can be considered as bottom-up. Similar saliency signals have been demonstrated in the prefrontal cortex of the brain [36]. The second component, computed in the second layer, focuses on the interaction between non-Pac-Man objects and Pac-Man itself. ...
Preprint
Full-text available
We proactively direct our eyes and attention to collect information during problem solving and decision making. Understanding gaze patterns is crucial for gaining insights into the computation underlying the problem-solving process. However, there is a lack of interpretable models that can account for how the brain directs the eyes to collect information and utilize it, especially in the context of complex problem solving. In the current study, we analyzed the gaze patterns of two monkeys playing the Pac-Man game. We trained a transformer network to mimic the monkeys' gameplay and found its attention pattern captures the monkeys' eye movements. In addition, the prediction based on the transformer network's attention outperforms the human subjects' predictions. Importantly, we dissected the computation underlying the attention mechanism of the transformer network, revealing its layered structures reflecting a value-based attention component and a component that captures the interactions between Pac-Man and other game objects. Based on these findings, we built a condensed attention model that is not only as accurate as the transformer network but also fully interpretable. Our results highlight the potential of using transformer neural networks to model and understand the cognitive processes underlying complex problem solving in the brain, opening new avenues for investigating the neural basis of cognition.
... Previous studies have primarily employed fMRI to probe the OFC's engagement in emotion regulation, inhibitory control, and reward processing (Klein--Flügge et al., 2022). Non-human primate research has indicated that OFC activation predominantly arises from reward salience rather than top-down attentional mechanisms (Zhang et al., 2022). However, recent investigations have suggested that the OFC plays a role in updating outcome values and integrating perceptual cues to achieve specific goals, hinting at its potential involvement in attentional control. ...
Article
Attentional control, guided by top-down processes, enables selective focus on pertinent information, while habituation, influenced by bottom-up factors and prior experiences, shapes cognitive responses by emphasizing stimulus relevance. These two fundamental processes collaborate to regulate cognitive behavior, with the pre�frontal cortex and its subregions playing a pivotal role. Nevertheless, the intricate neural mechanisms underlying the interaction between attentional control and habituation are still a subject of ongoing exploration. To our knowledge, there is a dearth of comprehensive studies on the functional connectivity between subsystems within the prefrontal cortex during attentional control processes in both primates and humans. Utilizing stereo�electroencephalogram (SEEG) recordings during the Stroop task, we observed top-down dominance effects and corresponding connectivity patterns among the orbitofrontal cortex (OFC), the middle frontal gyrus (MFG), and the inferior frontal gyrus (IFG) during heightened attentional control. These findings highlighting the involvement of OFC in habituation through top-down attention. Our study unveils unique connectivity profiles, shedding light on the neural interplay between top-down and bottom-up attentional control processes, shaping goal-directed attention.
... Precuneus and posterior IPL are key nodes of the (poste rior) default mode network and implicated in internally generated attention and critical regions for spatial attention [31][32][33][34] . It is postulated that attention from these topdown sources competes with bottom up sources using (reward) salience (that is, insula, ACC and OFC) to affect decisionmaking 35 . Prior studies show that the engagement of similar topdown attention neurocircuitry slows down the search for salient targets 36 , and our findings imply that resilience is hampered by the engagement of regions within the topdown attention or default mode network to reward or threat cues but benefits from attention to reward salience and regulatory control of higher order brain regions. ...
... In the present study, we aimed to address the above issues. Accordingly, we focused on the OFC, VS, and midbrain DA neurons that are major cortical and subcortical components of the corticosubcortical reward network (20)(21)(22)(23)(24)(25)(26). We developed an economic decision-making task in which monkeys decided to choose or not to choose an option based on its value. ...
Article
Individuals often assess past decisions by comparing what was gained with what would have been gained had they acted differently. Thoughts of past alternatives that counter what actually happened are called "counterfactuals." Recent theories emphasize the role of the prefrontal cortex in processing counterfactual outcomes in decision-making, although how subcortical regions contribute to this process remains to be elucidated. Here we report a clear distinction among the roles of the orbitofrontal cortex, ventral striatum and midbrain dopamine neurons in processing counterfactual outcomes in monkeys. Our findings suggest that actually gained and counterfactual outcome signals are both processed in the cortico-subcortical network constituted by these regions but in distinct manners and integrated only in the orbitofrontal cortex in a way to compare these outcomes. This study extends the prefrontal theory of counterfactual thinking and provides key insights regarding how the prefrontal cortex cooperates with subcortical regions to make decisions using counterfactual information.
... Precuneus and IPL are key nodes of the (posterior) default mode network and implicated in internally generated or top-down attention and critical regions for spatial attention [39][40][41][42] . It is postulated that attention from these top-down sources competes with bottom-up sources using (reward) salience (i.e., insula, ACC, OFC) to affect decision making 43 . Prior studies show that the engagement of similar top-down attention neurocircuitry slows down the search for salient targets 44 , and our ndings imply that resilience is hampered by the engagement of regions within the top-down attention or default mode network to reward or threat cues, but bene ts from attention to reward salience and regulatory control of higher order brain regions. ...
Preprint
Full-text available
Resilience is a dynamic process of recovery after trauma, but in most studies it is conceptualized as the absence of specific psychopathology following trauma. Using the large emergency department AURORA study (n=1,865, 63% women), we took a longitudinal, dynamic and transdiagnostic approach to define a static resilience (r) factor, that could explain >50% of variance in mental wellbeing 6-months following trauma, and a dynamic r-factor, which represented recovery from initial symptoms. We assessed its neurobiological profile across threat, inhibition, and reward processes using fMRI collected 2-weeks post-trauma. Our neuroimaging results suggest that resilience is promoted by activation of higher-level cognitive functioning and salience network regions in response to reward, whereas resilience is hampered by top-down attention and default mode network activation to threat and reward. These findings serve to redefine the concept of resilience as both dynamic and multifaceted, and identify mechanisms that promote resilience after trauma for early interventions.
... We have shown the moving the eyes to another location, even if empty, washes out the neural 449 memory of the previously seen offer in OFC: only the value of the offer that falls, or immediately fell, in 450 the fovea is strongly encoded. This might imply that OFC does not hold a memory of all previously 451 well-known offers in single-neuron OFC is independent of attentional shifts (Zhang et al., 2022). It is 467 possible that in this study overtraining makes simpler the encoding of value and thus gaze and attention can 468 be decoupled. ...
Preprint
Full-text available
During economic choice, we often consider options in alternation, until we commit to one. Nonetheless, neuroeconomics typically ignores the dynamic aspects of deliberation. We trained macaques to perform a value-based decision-making task in which two risky offers were presented in sequence at different locations of the visual field, each followed by a delay epoch where offers were invisible. Subjects looked at the offers in sequence, as expected. Surprisingly, during the delay epochs, we found that subjects still tend to look at empty locations where the visual offers had previously appeared; moreover, longer fixation to given empty location increases the probability of choosing the associated offer, even after controlling for the offer values. We show that activity in orbitofrontal cortex (OFC) reflects the value of the gazed offer, but also the value of the offer associated with the gazed spatial location, even if it is not the most recently viewed. This reactivation reflects a reevaluation process, as fluctuations in neural spiking during offer stimuli presentation and delays correlate with upcoming choice. Our results suggest that look-at-nothing gazing triggers the reactivation of a previously seen offer for further reevaluation, revealing novel aspects of deliberation.
... Our results are also more in accord with other work showing that fluctuations of covert attention are translated into alternations between encoding one of two presented offers as a function of time, surprisingly independent of gaze (Rich & Wallis, 2016). Recent work where attention is decoupled from stimulus saliency shows that value encoding of well-known offers in single-neuron OFC is independent of attentional shifts (Zhang et al., 2022). It is possible that in this study overtraining makes simpler the encoding of value and thus gaze and attention can be decoupled. ...
Preprint
Full-text available
During economic choice, we often consider options in alternation, until we commit to one. Nonetheless, neuroeconomics typically ignores the dynamic aspects of deliberation. We trained macaques to perform a value-based decision-making task where two risky offers were presented in sequence at different locations of the visual field, each followed by a delay epoch where offers were invisible. Subjects look at the offers in sequence, as expected. Surprisingly, during the delay epochs, the empty locations where the visual offers were formerly presented are looked at; and longer fixation to an empty location increases the probability of choosing the associated offer, even after controlling for the offer values. We show that activity in orbitofrontal (OFC) cortex reflects the value of the offer that is gazed, but also when its empty location is gazed. In particular, the value of the first offer is reactivated when animals gaze to its empty location in the last delay epoch. This reactivation reflects reevaluation, as activity fluctuations correlate with choice. Our results suggest that look-at-nothing gazing triggers the reactivation of a previously seen offer for further reevaluation, revealing novel aspects of deliberation.
Article
Although aversive responses to sensory stimuli are common in autism spectrum disorder (ASD), it remains unknown whether the social relevance of aversive sensory inputs affects their processing. We used functional magnetic resonance imaging (fMRI) to investigate neural responses to mildly aversive nonsocial and social sensory stimuli as well as how sensory over‐responsivity (SOR) severity relates to these responses. Participants included 21 ASD and 25 typically‐developing (TD) youth, aged 8.6–18.0 years. Results showed that TD youth exhibited significant neural discrimination of socially relevant versus irrelevant aversive sensory stimuli, particularly in the amygdala and orbitofrontal cortex (OFC), regions that are crucial for sensory and social processing. In contrast, ASD youth showed reduced neural discrimination of social versus nonsocial stimuli in the amygdala and OFC, as well as overall greater neural responses to nonsocial compared with social stimuli. Moreover, higher SOR in ASD was associated with heightened responses in sensory‐motor regions to socially‐relevant stimuli. These findings further our understanding of the relationship between sensory and social processing in ASD, suggesting limited attention to the social relevance compared with aversiveness level of sensory input in ASD versus TD youth, particularly in ASD youth with higher SOR.
Preprint
Individuals often assess past decisions by comparing what was gained with what would have been gained had they acted differently. Thoughts of past alternatives that counter what actually happened are called “counterfactuals”. Recent theories emphasize the role of the prefrontal cortex in processing counterfactual outcomes in decision-making, although how subcortical regions contribute to this process remains to be elucidated. Here we report a clear distinction among the roles of the orbitofrontal cortex, ventral striatum and midbrain dopamine neurons in processing counterfactual outcomes in monkeys. Our findings suggest that actually-gained and counterfactual outcome signals are both processed in the cortico-subcortical network constituted by these regions but in distinct manners, and integrated only in the orbitofrontal cortex in a way to compare these outcomes. This study extends the prefrontal theory of counterfactual thinking and provides key insights regarding how the prefrontal cortex cooperates with subcortical regions to make decisions using counterfactual information. Teaser Cortical and subcortical systems both contribute to counterfactual thinking of decision outcomes but in distinct manners.
Article
Full-text available
In the eighteenth century, Daniel Bernoulli, Adam Smith and Jeremy Bentham proposed that economic choices rely on the computation and comparison of subjective values¹. This hypothesis continues to inform modern economic theory² and research in behavioural economics³, but behavioural measures are ultimately not sufficient to verify the proposal⁴. Consistent with the hypothesis, when agents make choices, neurons in the orbitofrontal cortex (OFC) encode the subjective value of offered and chosen goods⁵. Value-encoding cells integrate multiple dimensions6–9, variability in the activity of each cell group correlates with variability in choices10,11 and the population dynamics suggests the formation of a decision¹². However, it is unclear whether these neural processes are causally related to choices. More generally, the evidence linking economic choices to value signals in the brain13–15 remains correlational¹⁶. Here we show that neuronal activity in the OFC is causal to economic choices. We conducted two experiments using electrical stimulation in rhesus monkeys (Macaca mulatta). Low-current stimulation increased the subjective value of individual offers and thus predictably biased choices. Conversely, high-current stimulation disrupted both the computation and the comparison of subjective values, and thus increased choice variability. These results demonstrate a causal chain linking subjective values encoded in OFC to valuation and choice.
Article
Full-text available
Neural representations of value underlie many behaviors that are crucial for survival. Previously, we found that value representations in primate orbitofrontal cortex (OFC) are modulated by attention, specifically, by overt shifts of gaze toward or away from reward-associated visual cues (McGinty et al., 2016). Here, we investigate the influence of overt attention on behavior by asking how gaze shifts correlate with reward anticipatory responses and whether activity in OFC mediates this correlation. Macaque monkeys viewed pavlovian conditioned appetitive cues on a visual display, while the fraction of time they spent looking toward or away from the cues was measured using an eye tracker. Also measured during cue presentation were the reward anticipation, indicated by conditioned licking responses (CRs), and single-neuron activity in OFC. In general, gaze allocation predicted subsequent licking responses: the longer the monkeys spent looking at a cue at a given time point in a trial, the more likely they were to produce an anticipatory CR later in that trial, as if the subjective value of the cue were increased. To address neural mechanisms, mediation analysis measured the extent to which the gaze-CR correlation could be statistically explained by the concurrently recorded firing of OFC neurons. The resulting mediation effects were indistinguishable from chance. Therefore, while overt attention may increase the subjective value of reward-associated cues (as revealed by anticipatory behaviors), the underlying mechanism remains unknown, as does the functional significance of gaze-driven modulation of OFC value signals.
Article
Full-text available
Our ability to rapidly detect threats is thought to be subserved by a subcortical pathway that quickly conveys visual information to the amygdala. This neural shortcut has been demonstrated in animals but has rarely been shown in the human brain. Importantly, it remains unclear whether such a pathway might influence neural activity and behavior. We conducted a multimodal neuroimaging study of 622 participants from the Human Connectome Project. We applied probabilistic tractography to diffusion-weighted images, reconstructing a subcortical pathway to the amygdala from the superior colliculus via the pulvinar. We then computationally modeled the flow of haemodynamic activity during a face-viewing task and found evidence for a functionally afferent pulvinar-amygdala pathway. Critically, individuals with greater fibre density in this pathway also had stronger dynamic coupling and enhanced fearful face recognition. Our findings provide converging evidence for the recruitment of an afferent subcortical pulvinar connection to the amygdala that facilitates fear recognition. Editorial note: This article has been through an editorial process in which the authors decide how to respond to the issues raised during peer review. The Reviewing Editor's assessment is that minor issues remain unresolved (see decision letter).
Article
Full-text available
Naturalistic decision-making typically involves sequential deployment of attention to choice alternatives to gather information before a decision is made. Attention filters how information enters decision circuits, thus implying that attentional control may shape how decision computations unfold. We recorded neuronal activity from three subregions of the prefrontal cortex (PFC) while monkeys performed an attention-guided decision-making task. From the first saccade to decision-relevant information, a triple dissociation of decision- and attention-related computations emerged in parallel across PFC subregions. During subsequent saccades, orbitofrontal cortex activity reflected the value comparison between currently and previously attended information. In contrast, the anterior cingulate cortex carried several signals reflecting belief updating in light of newly attended information, the integration of evidence to a decision bound and an emerging plan for what action to choose. Our findings show how anatomically dissociable PFC representations evolve during attention-guided information search, supporting computations critical for value-guided choice.
Article
Full-text available
Results of recent studies point towards a new framework for the neural bases of economic choice. The principles of this framework include the idea that evaluation is limited to a single option within the focus of attention and that we accept or reject that option relative to the entire set of alternatives. Rejection leads attention to a new option, although it can later switch back to a previously rejected one. The option to which a neuron’s firing rate refers is determined dynamically by attention and not stably by labelled lines. Value is always computed relative to the value of rejection. Comparison results not from explicit competition between discrete populations of neurons, but indirectly, as in a horse race, from the fact that the first option whose value crosses a threshold is selected. Consequently, comparison can occur within a single pool of neurons rather than by competition between two or more neuronal populations. The computations that constitute comparison thus occur at multiple levels, including premotor levels, simultaneously (i.e. the brain uses a distributed consensus), and not in discrete stages. This framework suggests a solution to a set of otherwise unresolved neuronal binding problems that result from the need to link options to values, comparisons to actions, and choices to outcomes.
Article
Full-text available
During value-based decision making, we often evaluate the value of each option sequentially by shifting our attention, even when the options are presented simultaneously. The orbitofrontal cortex (OFC) has been suggested to encode value during value-based decision making. Yet it is not known how its activity is modulated by attention shifts. We investigated this question by employing a passive viewing task that allowed us to disentangle effects of attention, value, choice and eye movement. We found that the attention modulated OFC activity through a winner-take-all mechanism. When we attracted the monkeys' attention covertly, the OFC neuronal activity reflected the reward value of the newly attended cue. The shift of attention could be explained by a normalization model. Our results strongly argue for the hypothesis that the OFC neuronal activity represents the value of the attended item. They provide important insights toward understanding the OFC's role in value-based decision making.
Article
Full-text available
Responses of midbrain dopamine (DA) neurons reflecting expected reward from sensory cues are critical for reward-based associative learning. However, critical pathways by which reward-related visual information is relayed to DA neurons remain unclear. To address this question, we investigated Pavlovian conditioning in macaque monkeys with unilateral primary visual cortex (V1) lesions (an animal model of ‘blindsight’). Anticipatory licking responses to obtain juice drops were elicited in response to visual conditioned stimuli (CS) in the affected visual field. Subsequent pharmacological inactivation of the superior colliculus (SC) suppressed the anticipatory licking. Concurrent single unit recordings indicated that DA responses reflecting the reward expectation could be recorded in the absence of V1, and that these responses were also suppressed by SC inactivation. These results indicate that the subcortical visual circuit can relay reward-predicting visual information to DA neurons and integrity of the SC is necessary for visually-elicited classically conditioned responses after V1 lesion.
Article
Full-text available
Adaptive behavior requires integrating prior with current information to anticipate upcoming events. Brain structures related to this computation should bring relevant signals from the recent past into the present. Here we report that rats can integrate the most recent prior information with sensory information, thereby improving behavior on a perceptual decision-making task with outcome-dependent past trial history. We find that anticipatory signals in the orbitofrontal cortex about upcoming choice increase over time and are even present before stimulus onset. These neuronal signals also represent the stimulus and relevant second-order combinations of past state variables. The encoding of choice, stimulus and second-order past state variables resides, up to movement onset, in overlapping populations. The neuronal representation of choice before stimulus onset and its build-up once the stimulus is presented suggest that orbitofrontal cortex plays a role in transforming immediate prior and stimulus information into choices using a compact state-space representation.
Article
The estimated values of choices, and therefore decision-making based on those values, are influenced by both the chance that the chosen items or goods can be obtained (availability) and their current worth (desirability) as well as by the ability to link the estimated values to choices (a process sometimes called credit assignment). In primates, the prefrontal cortex (PFC) has been thought to contribute to each of these processes; however, causal relationships between particular subdivisions of the PFC and specific functions have been difficult to establish. Recent lesion-based research studies have defined the roles of two different parts of the primate PFC — the orbitofrontal cortex (OFC) and the ventral lateral frontal cortex (VLFC) — and their subdivisions in evaluating each of these factors and in mediating credit assignment during reward-based decision-making.
Article
When making a subjective choice, the brain must compute a value for each option and compare those values to make a decision. The orbitofrontal cortex (OFC) is critically involved in this process, but the neural mechanisms remain obscure, in part due to limitations in our ability to measure and control the internal deliberations that can alter the dynamics of the decision process. Here we tracked these dynamics by recovering temporally precise neural states from multidimensional data in OFC. During individual choices, OFC alternated between states associated with the value of two available options, with dynamics that predicted whether a subject would decide quickly or vacillate between the two alternatives. Ensembles of value-encoding neurons contributed to these states, with individual neurons shifting activity patterns as the network evaluated each option. Thus, the mechanism of subjective decision-making involves the dynamic activation of OFC states associated with each choice alternative.