ArticlePDF Available

Reward salience but not spatial attention dominates the value representation in the orbitofrontal cortex

October 2022
Nature Communications 13(1)

October 2022
13(1)

DOI:10.1038/s41467-022-34084-0

License
CC BY 4.0

Authors:

Yang Xie

Institute of Neuroscience

Tianming Yang

Institute of Neuroscience

The orbitofrontal cortex (OFC) encodes value and plays a key role in value-based decision-making. However, the attentional modulation of the OFC’s value encoding is poorly understood. We trained two monkeys to detect a luminance change at a cued location between a pair of visual stimuli, which were over-trained pictures associated with different amounts of juice reward and, thus, different reward salience. Both the monkeys’ behavior and the dorsolateral prefrontal cortex neuronal activities indicated that the monkeys actively directed their spatial attention toward the cued stimulus during the task. However, the OFC’s neuronal responses were dominated by the stimulus with higher reward salience and encoded its value. The value of the less salient stimulus was only weakly represented regardless of spatial attention. The results demonstrate that reward and spatial attention are distinctly represented in the prefrontal cortex and the OFC maintains a stable representation of reward salience minimally affected by attention.

…

DLPFC encoded spatial attention a The responses of an example DLPFC neuron. The red line indicates trials when the cue was on the right side and the blue line indicates trials when the cue was on the left side. The shaded areas around each line denote SEM across trials. b The posterior probability of the attention location decoded from DLPFC pseudo-population ensemble activities. Both the training and the testing of the LDA decoder used sliding windows of 25 ms at 10 ms steps. c The posterior probability of the attention location from the decoder that was trained and tested with the DLPFC neuronal responses at the same time (the diagonal line in b). Significance was assessed with two-tailed paired t-tests (p < 0.01 with FDR corrections for multiple comparisons), tested against the data with shuffled cue location labels. The black solid trace indicates the posterior probability calculated from the actual data. The black dashed trace indicates the posterior probability calculated from the shuffled data. The black segments at the top indicate when the actual data performed significantly better than the shuffled data. Thin gray lines represent SEM across trials. d. Same as b except that the decoder was trained with the mean activities at 50–200 ms before the stimulus onset. The black segments at the bottom indicate when the performance was significantly lower than the control, indicating the flip of attention to the opposite side. Significance was assessed with two-tailed paired t-tests (p < 0.01 with FDR corrections for multiple comparisons). Thin gray lines represent SEM across trials. e The posterior probability of the attention location decoded from the fast (black line) and the slow trials (gray line). The trials were divided by the median reaction time in each neuron. Thin gray lines represent SEM across trials. Significance was assessed with two-tailed paired t-tests (fast versus slow, at p < 0.01 with FDR corrections for multiple comparisons).

…

Attentional modulation of the OFC responses a The responses of an example OFC neuron in trials when the same rewards were associated with the two stimuli, aligned to the stimulus onset (left) and the luminance change (right). The reward sizes are indicated by different shades of gray. Shading areas indicate SEM across trials. b The population response of the positively tuned OFC neurons (n = 73) in trials when the same rewards were associated with the two stimuli. Shading areas indicate SEM across neurons. c Same as b, but for the negatively tuned OFC neurons (n = 53). d The responses of the example neuron with the trials grouped by SV. The reward sizes are indicated by different shades of yellow. Shading areas indicate SEM across trials. e The responses of the positively tuned OFC neurons with the trials grouped by SV. The reward sizes are indicated by different shades of yellow. Shading areas indicate SEM across neurons. f Same as (d), but for the negatively tuned OFC neurons. g The average responses of the example neuron during the period between the stimulus onset and the luminance change. The black trace is based on the trials with the same-reward stimulus pairs. The yellow trace is based on the trials grouped by SV. The error bars indicate SEM across trials. Two-way ANOVA (group: F1413 = 0.79, p = 0.37; value: F4413 = 40.40, p ≪ 0.001). h The average responses of the positively tuned OFC neurons during the period between the stimulus onset and the luminance change. The black trace is based on the trials with the same-reward stimulus pairs. The yellow trace is based on the trials grouped by SV. The error bars indicate SEM across neurons. Two-way ANOVA (group: F1724 = 3.22, p = 0.07; value: F4724 = 77.71, p ≪ 0.001). i Same as (h), but for the negatively tuned OFC neurons. Two-way ANOVA (group: F1524 = 1.97, p = 0.16; value: F4524 = 46.03, p ≪ 0.001). j Same as g, but trials are grouped by CV (red) and NCV (purple). Two-way ANOVA (group: F1690 = 0, p = 0.99; value: F4690 = 21.04,p ≪ 0.001). k Same as h, but trials are grouped by CV (red) and NCV (purple). Two-way ANOVA (group: F1724 = 0.16, p = 0.68; value: F4724 = 104.56, p ≪ 0.001). l Same as i, but trials are grouped by CV (red) and NCV (purple). Two-way ANOVA (group: F1524 = 0.01, p = 0.92; value: F4524 = 88.54, p ≪ 0.001).

…

OFC neuronal responses dominantly encoded the salient value a OFC neuron (n = 349) firing rates were regressed against attention cue location, SV, and NSV. Plotted is the time course of the population average coefficients of partial determination (CPD). Significance was assessed with two-tailed paired t-tests (p < 0.005, with FDR corrections for multiple comparisons) compared to a baseline computed with the CPD between 0 and 200 ms before the cue onset averaged across different regressors. The blue, yellow, and green bars at the top indicate the significant CPDs of cue location, SV, and NSV, respectively, and the black bar indicates significant differences between SV and NSV. The error bars indicate SEM across neurons. b The CPDs of SV against the CPDs of NSV for individual value-selective OFC neurons (n = 118). The yellow data points are the neurons with non-zero coefficients for SV only, the green data points are the neurons with non-zero coefficients for NSV only, and the black data points are the neurons that encoded both SV and NSV (one-sample t-test, p < 0.05, without multiple comparisons). Two-tailed paired t-test was conducted to compare the mean CPDs of SV and NSV of all the OFC neurons (n = 349). c Top: the distribution of the CPDs for SV of the value-selective OFC neurons (n = 118). Bottom: the distribution of the OFC neurons’ CPDs for NSV. Vertical dashed lines indicate the mean. Filled bars indicate significant neurons. d Same as a, but for CV and UCV (n = 349). e Same as b, but for CV and UCV (n = 127). f Top: the distribution of the CPDs for CV of the value-selective OFC neurons (n = 127). Bottom: the distribution of the OFC neurons’ CPDs for UCV. Vertical dashed lines indicate the means. Filled bars indicate significant neurons.

…

Spatial attention failed to switch the value encoding in the OFC to the non-salient stimulus a The positively tuned OFC neurons’ (n = 73) responses to an 8-drops-of-juice stimulus (SV) paired with a stimulus associated with ~0, 1, 2, 4, or 8 drops of juice (NSV). The red line denotes trials when the attention was toward the 8-drops-of-juice stimulus, and the purple line denotes trials when the attention was away from the 8-drops-of-juice stimulus. A two-way ANOVA (attention location: F1723 = 3.74, p = 0.05; value: F4723 = 3.45, p = 8.4e−3) indicates a subtle but significant attentional modulation. A complete switch of the value encoding to NSV would produce responses close to the black line, which are the neurons’ responses to the trials of a pair of stimuli with the same reward. All error bars indicate SEM across neurons. b Same as a, but for the negatively tuned OFC neurons (n = 53). Two-way ANOVA with group (CV = 8/UCV = 8) and value was performed (attention location: F1524 = 1.18, p = 0.28; value: F4524 = 3.68, p = 5.8e–3).

…

Figures - available from: Nature Communications

This content is subject to copyright. Terms and conditions apply.

Access to this full-text is provided by Springer Nature.

Learn more

Content available from Nature Communications

This content is subject to copyright. Terms and conditions apply.

Article https://doi.org/10.1038/s41467-022-34084-0

Reward salience but not spatial attention

dominates the value representation

in the orbitofrontal cortex

Wenyi Zhang

1,2,3

,YangXie

1,2,3

& Tianming Yang

The orbitofrontal cortex (OFC) encodes value and plays a key role in value-

based decision-making. However, the attentional modulation of the OFC’s

value encoding is poorly understood. We trained two monkeys to detect a

luminance change at a cued location between a pair of visual stimuli, which

were over-trained pictures associated with different amounts of juice reward

and, thus, different reward salience. Both the monkeys’behavior and the

dorsolateral prefrontal cortex neuronal activities indicated that the monkeys

actively directed their spatial attention toward the cued stimulus during the

task. However, the OFC’s neuronal responses were dominated by the stimulus

with higher reward salience and encoded its value. The value of the less salient

stimulus was only weakly represented regardless of spatial attention. The

results demonstrate that reward and spatial attention are distinctly repre-

sented in the prefrontal cortex and the OFC maintains a stable representation

of reward salience minimally affected by attention.

Imagine that you are in a bookstore trying to ﬁnd an ideal gift for your

young child. While you direct your search in the children’s book sec-

tion, an eye-catching poster of your favorite writer in the bestseller

section would divert your attention and lead you to check out the

books over there. While task demands or behavior context often lead

to internally generated attention, commonly referred to as top-down

attention, salient stimuli, either based on physical or reward salience,

may capture attention via a bottom-up mechanism1–6. Attention from

top-down and bottom-up sources may be integrated or compete

against each other in the brain and affect decision making.

Previous studies suggested that attention may affect value-based

decision making by modifying choice options’subjective value7,8.

Moreover, attention, both in the overt form with gaze shifts and in the

covert form without eye movements, has been shown to modulate the

representation of value in the orbitofrontal cortex (OFC)9–11, which is a

key brain area involved in value-based decision making and adaptive

behavior12–16.Inaddition,theOFCneuronalactivitiesencodedeach

choice option’s value alternately during value-based decision making,

which might be related to attentional shifts17,18. Based on these studies,

it has been proposed that the OFC activity reﬂects the value of the

attended item while multiple items are presented.

However, in these studies, the researchers inferred the location of

attention either from reward or visual salience11 or from gaze

location9,10. Without clear behavior or neurophysiology markers of

attention, these studies only provided indirect evidence for the pro-

posed theory. Especially when the behavior task and the stimulus or

reward salience direct attention to distinct locations, an experiment

that allows one to measure the attention location directly is necessary

for investigating how attention from different sources modulates the

OFC’s value encoding.

To this end, we trained two macaque monkeys to perform a visual

detection task with a Posner cueing paradigm19. A pair of visual stimuli

were presented simultaneously, and a cue indicated which of the sti-

muli would change its luminance. The monkeys had to direct their

attention toward the cued stimulus to detect the luminance change

better. In addition, the stimuli were well-trained shapes that were

associated with different amounts of reward. They acquired different

levels of salience through over-training and couldcapture attention via

Received: 11 August 2021

Accepted: 11 October 2022

Check for updates

Institute of Neuroscience, Key Laboratory of Primate Neurobiology, Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of

Sciences, Shanghai 200031, China.

University of Chinese Academy of Sciences, Beijing 100049, China.

These authors contributed equally: Wenyi Zhang,

Yang Xie. e-mail: tyang@ion.ac.cn

Nature Communications | (2022) 13:6306 1

1234567890():,;

Content courtesy of Springer Nature, terms of use apply. Rights reserved

a bottom-up mechanism. We were able to verify that the monkeys

directed their attention according to the cue based on both the

monkeys’behavior and the dorsolateral prefrontal cortex (DLPFC)

neuronal activities. However, contrary to the proposed theory, we

observed that the value encoding in the OFC was dominated by reward

salience, regardless of where the cue and the attention were. The

results indicate that the OFC maintains a stable salience-based repre-

sentation of value that is only weakly modulated by spatial attention.

Results

Behavioral task and subject performance

We trained two monkeys to perform a visual detection task (Fig. 1a, d).

In this task, the monkeys were instructed to covertly pay attention to

one of the two stimuli to detect a luminance change at a random delay.

The stimulus with the luminance change was cued by a frame that was

presented on the side opposite to the change location 200 ms before

the onset of the stimulus pair. The monkeys were required to report

the change by making a saccade toward an eye-movement target

located above the ﬁxation spot to get a juice reward. For monkey D, the

cue was valid in 90% of the trials. In the remaining trials (invalid-cue

trials), the change location was on the same side of the frame (un-cued

location). Monkey G was also trained to detect the luminance change

at the cued location (target trials: 80%). In addition, to encourage

monkey G to direct its attention appropriately, we required it to ignore

the luminance change at the un-cued location and not to make a

response (distractor trials: 20%). In half of the distractor trials, another

luminance change would happen at the cued location after the dis-

tractor(distractor+target trials: 10%), and monkey G was rewarded for

detecting the change. In the other half of the distractor trials (dis-

tractor-only trials: 10%), there was no luminance change at the target

stimulus, and the monkey had to hold its ﬁxation till the endof the trial

to receive a reward.

The stimulus set included 5 pictures for each animal. Each picture

was associated with a juice reward of a different size (Fig. 1a, d, inset).

The reward for correct responses was randomly chosen between the

rewards associated with the two stimuli. Therefore, the reward out-

comes were only 50% certain. The initial frame cue, as well as the

stimulus where the luminance occurred, did not convey information

about which stimulus’s reward would be delivered. We initially trained

the monkeys withsingle-stimulus trials, in which only one stimulus was

presented, and the frame always appeared on its opposite side. Its

associated reward would be delivered for a correct response, and there

was no ambiguity about the reward outcome.

For the ease of the discussion, between the two stimuli presented

in each trial, we refer to the stimulus that is cued to have a luminance

change as the cued stimulus, its value as cued value (CV). The other

stimulus is referred to as the un-cued stimulus and its value as un-cued

value (UCV). In addition, as the monkeys were over-trained with this

task and were familiar with the stimuli, the stimuli acquired different

levels of salience due to their reward associations: high salience for

stimuli associated with larger rewards and low salience for stimuli

associated with smaller rewards. Accordingly, we refer to the stimulus

Monkey D

Fixation

Attention cue on

Visual stimulus on

Attention cue off

Luminance change

1000 ms

200 ms

400-1800 ms

Valid-cue

Invalid-cue

~0 1 2 4 8

Stimulus set:

Juice (drops):

90%

10%

Cued Salient

Monkey G

Fixation

Attention cue on

Visual stimulus on

Attention cue off

Luminance change

1000 ms

200 ms

400-2300 ms

Target

Distractor-only

~0 1 2 4 8

Stimulus set:

Juice (drops):

Distractor

+ target

80%

10%

Cued Salient

p=0.081

Valid Invalid

150

200

250

Reaction time (ms)

Valid Invalid

0.5

0.6

0.7

0.8

0.9

Hit rate

Lum change on salient stim

Lum change on non-salient stim

** p=0.25

Target (hit)

Distractor-only/distractor+target

(corr rej)

Distractor+target (hit)

0.5

0.6

0.7

0.8

0.9

Hit/correct rejection rate

150

200

250

300

350

Reaction time (ms)

Target (hit)

Distractor-only/distractor+target

(false alarm)

Distractor+target (hit)

** **

p=0.37

Fig. 1 | Behavioral tasks and monkeys’performance. a Monkey D had to detect

when any of the two stimuli changed its luminance. The location of the luminance

changewas indicated bya cue (90% valid) thatappeared on theside oppositeto the

luminance change. When the cue was invalid (10% of trials), the luminance change

occurred on the same side of the cue. The monkey reported the luminance change

by makingan eye movementto the eye-movement targetlocated abovethe ﬁxation

point. The stimuli were associated with different amounts of juice reward (inset).

Random pairings of stimuli were selected for each trial. The stimulus with a larger

associated reward is referred to as the salient stimulus, and the stimulus where

attention shouldbe direct to is thecued stimulus.The computer randomly selected

one stimulus from the pair and delivered its associated reward for a correct

response. bThe proportion of correct responses whenthe cue was validand invalid

and when the luminance change occurred at the salient and the non-salient sti-

mulus. cThe reaction times when the cue was valid and invalid and when the

luminance changeoccurred at the salient and thenon-salient stimulus. dMonkey G

was requir ed to detect the luminance change at the ta rget location, ind icated with

the cue frame (target trials, 80%), and ignore the change at the un-cued location

(distractor trials, 20%). In half of the distractor trials, another luminance change

happened at the cued location after the distractor changed its luminance (dis-

tractor+target trials, 10%). The stimuli were associated with different amounts of

juice reward (inset). eThe proportion of correct responses in the three trial types

and when the luminance change occurred at the salient and the non-salient sti-

mulus. fThe reaction time of detecting the target and the distractor luminance

change and when the luminancechange occurred atthe salient and the non-salient

stimulus. The error bars are SEM across sessions (monkey D: n= 71; monkey G:

n= 100), * denotes p< 0.01, ** denotes p< 0.001, two-tailed Wilcoxon signed-

rank test.

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 2

Content courtesy of Springer Nature, terms of use apply. Rights reserved

associated with a larger reward as the salient stimulus, and the other

one as the non-salient stimulus. Their values are termed salient value

(SV) and non-salient value (NSV). SV and NSV are the same in trials with

a pair of identical stimuli.

Both monkeys learned the task and used the cue appropriately to

direct their attention. When the cue was valid, monkey D were more

accurate (valid: mean hit rate = 94.2%, SEM = 0.2%; invalid: mean hit

rate = 90.3%, SEM = 0.4%. p≪0.001, two-tailed Wilcoxon signed-rank

test. SEMs are across sessions) and responded faster (valid: mean

reaction time (RT) = 224.5 ms, SEM = 2.9 ms; invalid: mean RT = 234.1

ms, SEM = 3.6 ms. p≪0.001, two-tailed Wilcoxon signed-rank test).

Similarly, Monkey G accurately detected the luminance change at the

target (target: mean hit rate = 92.2%, SEM = 0.3%; distractor+target:

mean hit rate = 90.0%, SEM = 0.6%) and ignored the distractor’slumi-

nance change (distractor trials: mean correct rejection rate = 81.4%,

SEM =0.6%). The response latency of the target was shorter than that

of the distractor in the false alarm trials (target responses in target

trials: mean RT = 278.2 ms, SEM = 0.9ms; target responses indistractor

+target trials: mean RT = 271.2 ms, SEM = 1.1 ms; distractor responses:

mean RT = 297.8 ms, SEM = 1.0 ms. p« 0.001 for the difference

between the target response in either target or distractor+ target trials

and the distractor responses, two-tailed Wilcoxon signed-rank test).

The stimulus salience also affected the monkeys’performance,

although to a much lesser degree than the cue. The monkeys were

more likely to detect a luminance change of a salient stimulusthan that

of a non-salient stimulus (monkey D: valid salient: mean hit rate =

94.8%, SEM = 0.2%; valid non-salient: mean hit rate = 93.9%, SEM =

0.2%; monkey G: target salient: mean hit rate = 94.0%, SEM = 0.3%;

target non-salient: mean hit rate = 91.2%, SEM = 0.4%. p≪0.001 for

both monkeys, two-tailed Wilcoxon signed-rank test, Fig. 1b, e). Mon-

key G also responded faster when the target stimulus was the salient

stimulus (target salient: 273.8 ms, SEM=1.0ms; target non-salient:

282.2 ms, SEM = 1.0 ms. p≪0.001, two-tailed Wilcoxon signed-rank

test, Fig. 1f), while such an improvement was not seen in monkey D

(Fig. 1c). These behavior improvements caused by reward salience

were however much smaller than those caused by the cue (monkey D:

valid non-salient: mean RT = 225.4 ms, SEM = 2.9 ms; invalid salient

mean RT = 236.3 ms, SEM = 3.1 ms; monkey G: target non-salient: mean

RT = 282.2 ms, SEM = 1.0 ms; distractor salient: mean RT = 294.4 ms,

SEM = 1.2 ms. p≪0.001, two-tailed Wilcoxon signed-rank test,

Fig. 1c, f). As the reward for correct responses was randomly chosen

between the two stimuli, paying attention to the salient stimulus did

not bring any behavior beneﬁts to the monkeys or provide information

about the expected reward. Correspondingly, the behavior of the

monkeys indicated that their attention was dominated by the cue.

DLPFC population encodes spatial attention shifts

In addition to the evidence from the monkeys’behavior, we recorded

single-unit activity from the DLPFC neurons to further conﬁrm that the

monkeys’attention was directed to the cued location. A total of 406

DLPFC single units (monkey D: 240; monkey G: 166) from Walker’s

areas 8a, 8b, 46d, and 46v (Supplementary Fig. 1) were recorded, and

their activities were used to decode the location of attention20,21.

One important feature of our task is that the frame serving as the

cue was displayed on the side opposite to where the attention needed

to be directed. The onset ofthe frame might ﬁrst capture the monkeys’

attention to a wrong location via a bottom-up process. Afterward, the

monkeys had to shift their attention to the opposite side where the

luminance change was expected. This attention shift was captured by

many DLPFC neurons that encoded spatial attention location. In Fig. 2a

we show such an example neuron. The neuron had larger responses

initially when the frame appeared to the left of the ﬁxation point (blue

trace). After the cue offset, its response became greater in trials that

had the cue frame on the right but the cued stimulus on the left (red

trace). The neuron’s responses reﬂected the attention shift from the

side of the frame to the opposite side where the luminance change was

expected.

Population analyses conﬁrmed that the DLPFC neurons encoded

spatial attention location and reﬂected the attentional shift. To pool

neurons with different spatial selectivity together, we considered the

whole population of recorded neurons as a high-dimensional repre-

sentation of task-related variables17,22,23 and trained a decoder based

on the linear discriminant analysis (LDA) algorithm with a pseudo

neuronal ensemble to decode the frame location. The pseudo neuro-

nal ensemble was composed in such a way that each neuron had the

same number of left and right frame trials (see Methods). Thedecoders

based on the LDA provided the posterior probabilities of the frame

location.

We trained and tested the decoders with DLPFC neurons’

responses using a sliding window (size: 25 ms, step: 10 ms) and plotted

the decoders’performance as a heat map (Fig. 2b). When the decoders

were trained and tested with the responses at the same time point,

indicated by the diagonal in Fig. 2b, the cue location can be reliably

decoded shortlyafter the onset of the attention cue (Fig. 2c, left) until

after the luminance change and the animal’sresponse(Fig.2c, right).

During the period from ~200 ms after the stimulus onset until after the

luminance change, the encoding of the attentionlocation was stable so

that the cross-temporal decoding performance was maintained

(Fig. 2b). Furthermore, the cross-temporal decoder’s performance

revealed the attention shift after the shape onset. We trained a decoder

with the neuronal responses 50–200 ms before the stimuli onset when

only the frame but not the stimuli were on the screen. The decoder’s

performance was then tested with the responses at the other time

points in a trial (Fig. 2d). The decoder’s performance quickly dropped

below the shufﬂed level after the stimulus onset and reached its

minimum after the luminance change. Lower-than-shufﬂed perfor-

mance indicates that the attention location shifted to the side opposite

to where it was when the decoder was trained. This attention shift can

be also observed in Fig. 2b in the dark blue regions—an indication of

the performance below the chance level—where the decoders were

trained and tested at different time points when the frame was pre-

sented and when the stimuli were presented.

The encoding of attention location in the DLPFC correlated with

the monkeys’performance. We divided the trials by the monkeys’RT

into two halves and tested the decoder’s performance in each half of

the trials. The decoder performed signiﬁcantly better before the onset

of the luminance in trials when the monkeys reacted faster to the

luminance change, suggesting that the attention was more likely to be

allocated correctly in those trials (Fig. 2e, Supplementary Fig. 2). In

addition, we were able to decode attention in the correct trials better

than in the error trials (Supplementary Fig. 3).

OFC responses were dominated by the salient value

After verifying that the monkeys directed their attention to the proper

location, we investigated how the OFC neurons’value encoding was

modulated by attention.

We recorded the activities of 349 OFC neurons (monkey D: 212;

monkey G: 137) from Walker’s areas 11 and 13 (Supplementary Fig. 1).

Many of them encoded stimulus values. An example neuron is shown

in Fig. 3a, d, g, j. We ﬁrst used trials with two identical stimuli to

measure the neuron’s value selectivity. In these trials, the monkeys

should be certain of the reward they would get for a correct response,

and our previous study indicated that the OFC neurons responded

similarly to a pair of identical stimuli and to the same stimulus pre-

sented alone11. The example neuron encoded the value of the stimuli;

its responses were larger when the value was larger (Fig. 3a). To study

the neuron’s responses to a pair of distinct stimuli, we ﬁrst groupedthe

trials by SV. The neuron’sresponsesreﬂected SV stably during

the stimulus period well until after the luminance change (Fig. 3d).

We calculated the neuron’sﬁring rates between the stimulus onset and

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 3

Content courtesy of Springer Nature, terms of use apply. Rights reserved

the luminance change and plotted them against SV (Fig. 3g). The

responses to SV were similar to the responses when a pair of the same

stimuli were presented. The non-salient stimulus was largely ignored.

In comparison, the neuron exhibited similar responses when the trials

were grouped by CV and when the trials were grouped by the UCV

(Fig. 3j). Both CV and UCV contributed to the neuron’s responses

similarly. The results suggested that the reward salience but not the

cue dominated the example neuron’s responses.

The OFC population exhibited the same trend. Among the 349 OFC

neurons from which we recorded, 126 neurons were selective to value

(Supplementary Table 1). We divided these neurons into positively

tuned neurons and negatively tuned neurons based on their value

tuning (see Methods). The positively tuned neurons had larger

responses when the values were higher (Fig. 3b, 73 out of 126 value-

tuned neurons), and the negatively tuned ones had the opposite tuning

(Fig. 3c, 53 out of 126 value-tuned neurons). Both groups of neurons

encoded SV stably during the stimulus period (Fig. 3e, f). Their ﬁring

rates sorted by SV were similar to when a pair of the stimuli with the

same value as SV were presented (Fig. 3h, i and Supplementary Fig. 4),

and their responses did not distinguish between CV and NCV (Fig. 3k, l

and Supplementary Fig. 5). The results were consistent across the two

individual monkeys (Supplementary Fig. 6). Finally, to study the inter-

action between cue and reward salience, we grouped the trials by cued-

SV, un-cued-SV, cued-NSV, and un-cued-NSV and plotted the OFC neu-

rons’responses to each group condition. The responses were mostly

divided between the salient and non-salient groups, while the cued and

the un-cued groups exhibited similar responses, indicating little inter-

action between cue and reward salience. (Supplementary Fig. 7).

-400 -200 0 200 400 600 -400 -200 0 200 400

0.2

0.4

0.6

0.8

Posterior probability

Time from stim on (ms)

-400

-200

200

400

600

Time from stim on (ms) Time from lum change (ms)

-400 -200 0 200 400

Time from lum change (ms)

Training

Testing

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Posterior

probability

-400 -200 0 200 400 600

-400

-200

200

400 Cue on Cue off

Cue on Cue off

Stim on Luminance change

Luminance change

Stim on

-400 -200 0 200 400 600 -400 -200 0 200 400

0.2

0.4

0.6

0.8

Posterior probability

Attention cue loc

Control

Cue on Cue offStim on Luminance change

-400 -200 0 200 400 600 -400 -200 0 200 400

Time from stim on (ms)

0.2

0.4

0.6

0.8

Posterior probability

Quick

Slow

Time from lum change (ms)

Cue on Cue offStim on Luminance change

-400 -200 0 200 400 600 -400 -200 0 200 400

Firing rates (spikes/s)

Right attention cue

Left attention cue

Time from stim on (ms)

Time from lum change (ms)

Cue on Cue offStim on Luminance change

Time from stim on (ms)

Time from lum change (ms)

Time from stim on (ms)

Time from lum change (ms)

Attention cue loc

Control

Fig. 2 | DLPFC encoded spatial attention. a The responses of an example DLPFC

neuron.The red line indicates trials when thecue was on the right side and the blue

line indicates trials when thecue was on the left side.The shaded areasaround each

line denote SEM across trials. bThe posterior probability of the attention location

decodedfrom DLPFC pseudo-population ensemble activities.Both the training and

the testing of the LDA decoder used sliding windows of 25 ms at 10ms steps. cThe

posterior probability of the attention location from the decoder that was trained

and testedwith the DLPFCneuronal responses atthe same time (thediagonal linein

b). Signiﬁcance was assessed with two-tailed paired t-tests (p<0.01 with FDR cor-

rections for multiple comparisons), tested against the data with shufﬂed cue

location labels. The black solid trace indicates the posterior probability calculated

from the actual data. The black dashed trace indicates the posterior probability

calculated from the shufﬂed data.The black segments at the top indicate when the

actual data performed signiﬁcantly better than the shufﬂed data. Thin gray lines

represent SEM across trials. d.Sameasbexcept that the decoder was trained with

the mean activities at 50–200 ms before the stimulus onset. The black segments at

the bottomindicate whenthe performance was signiﬁcantly lower thanthe control,

indicating the ﬂip of attention to the opposite side. Signiﬁcance was assessed with

two-tailed paired t-tests (p<0.01 with FDR corrections for multiple comparisons).

Thin gray lines represent SEM across trials. eThe posterior probability of the

attention location decoded from the fast (black line) and the slow trials (gray line).

The trialswere divided by the median reaction time in each neuron. Thin gray lines

represent SEM across trials.Signiﬁcance was assessed with two-tailed paired t-tests

(fast versus slow, at p< 0.01 with FDR corrections for multiple comparisons).

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 4

Content courtesy of Springer Nature, terms of use apply. Rights reserved

We further quantiﬁed the contributions of SV and NSV to the OFC

neuronal responses with a linear regression model that contained both

value variables as well as a binary term that indicated the frame loca-

tion. We calculated the coefﬁcient of partial determination (CPD) to

measure how much variance was explained by each variable (see

Methods). The average CPD of SV rose signiﬁcantly above the baseline

shortly after the stimulus onset (Fig. 4a). In contrast, the CPD of NSV

remained low and was not signiﬁcantly different from the baseline in

most time bins. The paired t-tests that compared the CPDs between SV

and NSV indicated that the OFC neurons encoded the SV much better

in almost all the time bins after the stimulus onset (p< 0.005 with FDR

correction for multiple comparisons). The frame location was tran-

siently encoded when the frame was presented and after the lumi-

nance change but was not signiﬁcantly encoded during most of the

stimulus presentation period.

This SV dominance is also observed at the level of individual

neurons. We calculated each neuron’s average responses during the

stimulus period and used them to determine the CPD for SV and NSV

-1

-0.5

0.5

1.5

Normalized firing rates (spikes/s)

~0 drop

1 drop

2 drops

4 drops

8 drops

-400 -200 0 200 400 600

Time from stim on (ms)

-400-200 0 200 400

Time from lum change (ms)

Cue on Cue offStim on Luminance change

(n=73)

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

-400 -200 0 200 400 600

Time from stim on (ms)

-400 -200 0 200 400

Time from lum change (ms)

Cue on Cue offStim on Luminance change

(n=53)

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

-400 -200 0 200 400 600

Time from stim on (ms)

-400 -200 0 200 400

Time from lum change (ms)

Cue on Cue offStim on Luminance change

Same

value

-1

-0.5

0.5

1.5

Normalized firing rates (spikes/s)

-400 -200 0 200 400 600

Time from stim on (ms)

~0 drop

1 drop

2 drops

4 drops

8 drops

-400-200 0 200 400

Time from lum change (ms)

Cue on Cue offStim on Luminance change

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

-400 -200 0 200 400 600

Time from stim on (ms)

-400 -200 0 200 400

Time from lum change (ms)

Cue on Cue off

Stim on Luminance change

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

-400 -200 0 200 400 600

Time from stim on (ms)

-400 -200 0 200 400

Time from lum change (ms)

Cue on Cue offStim on Luminance change

Value size

~0 248

-1.5

-1

-0.5

0.5

1.5

Normalized firing rates (spikes/s)

Salient value

Same value

~0 2 4 8

Value size

-1.5

-1

-0.5

0.5

1.5

Normalized firing rates (spikes/s)

Cued value

Uncued value

~0 1 2 4 8

Value size

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

~0 1 2 4 8

Value size

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

~0 1 2 4 8

Val ue si ze

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

~0 1 2 4 8

Value size

-0.4

-0.2

0.2

0.4

0.6

Normalized firing rates (spikes/s)

Salient

value

Same value

versus

salient value

Cued value

versus

uncued value

Example neuron

Positive-tuned

neurons

Negative-tuned

neurons

(n=73)

(n=53)

-1.5

Fig. 3 | Attentional modulation of the OFC responses. a The responses of an

example OFC neuron intrials when the same rewards wereassociated with the two

stimuli, aligned to the stimulus onset (left) and the luminance change (right). The

reward sizes are indicated by different shades of gray. Shading areas indicate SEM

across trials. bThe population response of the positively tuned OFC neurons

(n= 73) in trials when the same rewards were associated with the two stimuli.

Shading areas indicate SEM across neurons. cSame as b, but for the negatively

tuned OFC neurons (n=53).dThe responses of the example neuron with the trials

grouped by SV. The reward sizes are indicated by different shades of yellow.

Shading areas indicate SEM across trials. eThe responses of the positively tuned

OFC neurons with the trials grouped by SV. The reward sizes are indicated by

different shades of yellow. Shading areas indicate SEM across neurons. fSame as

(d), but for the negatively tuned OFC neurons. gThe average responses of the

example neuron during the period between the stimulus onset and the luminance

change. The black trace is based on the trials with the same-reward stimulus pairs.

The yellow trace is based on the trials grouped by SV. The error bars indicate SEM

across trials. Two-way ANOVA (group: F

1413

=0.79,p=0.37;value: F

4413

= 40.40,

p≪0.001). hThe average responses of the positively tuned OFC neurons during

the period betweenthe stimulusonset and the luminance change. The blacktrace is

based on the trials with the same-reward stimulus pairs. The yellow trace is based

on the trials grouped by SV. The error bars indicate SEM across neurons. Two-way

ANOVA (group: F

1724

=3.22, p=0.07;value:F

4724

= 77.71, p≪0.001). iSame as (h),

but for the negatively tuned OFC neurons. Two-way ANOVA (group: F

1524

=1.97,

p=0.16; value: F

4524

=46.03,p≪0.001). jSame as g, but trials are grouped by CV

(red) and NCV (purple). Two-way ANOVA (group: F

1690

=0, p= 0.99; value:

4690

=21.04

p≪0.001). kSame as h, but trials are grouped by CV (red) and NCV

(purple). Two-way ANOVA (group: F

1724

=0.16, p= 0.68; value: F

4724

=104.56,

p≪0.001). lSame as i, but trials are grouped by CV (red) and NCV (purple). Two-

way ANOVA (group: F

1524

=0.01, p=0.92;value:F

4524

=88.54,p≪0.001).

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 5

Content courtesy of Springer Nature, terms of use apply. Rights reserved

for each value-selective neuron (Fig. 4b). The data points of the neu-

rons spread along the axis of the CPD for SV (SV: 105/349 signiﬁcant,

p< 0.05, linear regression). Their CPDs for NSV, on the other hand,

clustered within a small range (NSV: 34/349 signiﬁcant, p< 0.05, linear

regression). The distribution of the neurons’CPD for SV was much

more extended in the range than that for NSV, and the mean of the

CPDs for the SV was larger than that of UCV (t= 6.95, p=1.8e−11, two-

tailed paired t-test) (Fig. 4c).

By contrast, when we regressed the OFC neuronal responses

against CV and UCV, we obtained overall similar CPDs, suggesting that

CV and UCV contributed similarly to the responses (Fig. 4d). Other

than a short period after the stimulus onset and another one after the

luminancechange,thedifferencebetweenCV’sandUCV’sCPDswas

not signiﬁcant during most of the stimulus period. During the period

when the CPD of CV was larger, there were extra visual inputs, the

frame initially and the luminance change in the end. When we plotted

the neurons’CPDs for CV and those for UCV, they tended to lie closely

to the diagonal (Fig. 4e). Nevertheless, the CPDs for CV were slightly

but signiﬁcantly higher than those for NCV (t=3.09, p=2.2e−3, two-

tailed paired t-test). These results indicate that the cue and the spatial

attention weakly and transiently modulated the neurons’value tuning.

These results were again consistent across the two individualmonkeys

(Supplementary Fig. 8). Further regression analyses based on the four

combinations of SV, NSV, CV, and NSV also point to the same con-

clusion (Supplementary Fig. 9).

Finally, to conﬁrm that the weak attentional modulation was not

due to a washout effect by the many trials in which CV and UCV had

similar values, we studied the trial conditions in which we would

expect the largest modulation on the neurons’responses by spatial

attention. According to our previous study11, the cue-induced mod-

ulation might be most evident when the cue directed the attention

toward the non-salient picture when the SV was the greatest (8 drops

of juice). In this condition, the cue shifted the attention away from the

most salient picture. This would provide us an opportunity to observe

any potential modulation of the neurons’responses by the cue.

Therefore, in these trials, we compared the neurons’responses when

the cue directed the attention toward the 8-drops-of-juice cue (CV = 8)

and when the cue directed the attention away from it (CV =~0, 1, 2, 4,

or 8 and UCV=8). Again, we looked at the positively and negatively

tuned OFC neurons separately. When the attention was on the 8-drops-

of-juice stimulus, the cue and the reward salience were consistent, and

the OFC neurons largely ignored the value of the less salient stimulus

(linear regression, H

:slope=0:p= 0.44, positively tuned neurons;

p= 0.30, negatively tuned neurons). The responses were highest for

the positively tuned neurons and lowest for the negativelytuned ones,

both similar to their responses when a pair of 8-drops-of-juice stimuli

were presented (Fig. 5a, b). When the cue directed the attention away

from the 8-drops-of-juice stimulus, it lowered the responses of posi-

tively tuned neurons very slightly (F

1723

=3.74, p= 0.05) and failed to

increase the responses of the negatively tuned neurons (F

1524

= 1.18,

p= 0.28). A complete shift of responses toward the less salient sti-

mulus would have produced responses similar to those when a pair of

the less salient stimuli were presented (black trace in Fig. 5a, b).

Attending away from the most salient stimulus did not abolish the

-400 -200 0 200 400 600 -400 -200 0 200 400

Time from lum change (ms)

0.01

0.02

0.03

0.04

CPD

Attention cue loc

Salient value

Non-salient value

-400 -200 0 200 400 600 -400 -200 0 200 400

Time from lum change (ms)

0.01

0.02

0.03

0.04

CPD

Attention cue loc

Cued value

Uncued value

0 0.1 0.2 0.3 0.4

CPD of salient value

0.1

0.2

0.3

0.4

CPD of non-salient value

(p = 1.8e-11)

0 0.1 0.2 0.3 0.4

CPD

Number of units

0.074

Sig salient value

Non-sig salient value

100

Number of units

0.014

Sig non-salient value

Non-sig non-salient value

0 0.1 0.2 0.3 0.4

CPD of cued value

0.1

0.2

0.3

0.4

CPD of uncued value

(p = 2.2e-3)

Number of units

0.054

Sig cued value

Non-sig cued value

0 0.1 0.2 0.3 0.4

CPD

100

Number of units

0.043

Sig uncued value

Non-sig uncued value

Time from stim on (ms)

Cue on Cue offStim on Luminance change

Time from stim on (ms)

Cue on Cue offStim on Luminance change

（n=349）（n=118）（n=118）

（n=127）

（n=349）（n=127）

Fig. 4 | OFC neuronal responses dominantly encoded the salient value. a OFC

neuron (n= 349) ﬁring rates were regressed against attention cue location, SV, and

NSV. Plotted is the time course of the population average coefﬁcients of partial

determination (CPD). Signiﬁcance was assessed with two-tailed paired t-tests

(p< 0.005,with FDR corrections for multiple comparisons) compared to a baseline

computed with the CPD between 0 and 200ms before the cue onset averaged

across different regressors. The blue, yellow,and green bars at the top indicate the

signiﬁcant CPDs of cue location, SV, and NSV, respectively, and the black bar

indicates signiﬁcant differences between SV and NSV. The error bars indicate SEM

across neurons. bThe CPDs of SV against the CPDs of NSV for individual value-

selective OFC neurons (n= 118). The yellow data points are the neurons with non-

zero coefﬁcients for SV only, the green data points are the neurons with non-zero

coefﬁcients for NSV only, and the black data points are the neurons that encoded

both SV and NSV (one-samplet-test, p< 0.05, without multiple comparisons). Two-

tailed paired t-test was conducted to compare the mean CPDs of SV and NSV of all

the OFC neurons (n=349).cTop: the distribution of the CPDs for SV of the value-

selective OFC neurons (n= 118).Bottom: the distribution of the OFC neurons’CPDs

for NSV. Vertical dashed lines indicate the mean. Filled bars indicate signiﬁcant

neurons. dSame as a, but for CV and UCV (n=349).eSameas b, but for CV and UCV

(n= 127). fTop: the distribution of the CPDs for CV of the value-selective OFC

neurons (n= 127). B ottom: the distr ibution of the OFC neurons’CPDs for UCV.

Vertical dashed lines indicate the means. Filled bars indicate signiﬁcant neurons.

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 6

Content courtesy of Springer Nature, terms of use apply. Rights reserved

dominant representation of SV by the OFC, and the overall neuronal

responses were very similar regardless of whether the cue was on the

most salient stimulus or not. The results were conﬁrmed in the two

individual monkeys (Supplementary Fig. 10).

DLPFC’s value encoding was also dominated by reward salience

It has been suggested that the DLPFC sits downstream of the OFC in

the processing of value information. We wonder whether the dom-

inance of the value encoding in OFC by reward salience may be

inherited by the DLPFC. On the other hand, the DLPFC has a strong

spatial attention signal that is lacking in the OFC (Supplementary

Fig. 11). One may expect that spatial cue and attention may play a more

substantial role in DLPFC’s value encoding.

We carried out the analyses on DLPFC neuronal responses parallel

totheanalysesontheOFC.Theresults,whicharesummarizedinFig.6

and Supplementary Figs. 12–14, indicated that the DLPFC’svalue

encoding was similarly dominated by reward salience, and spatial

attention did not affect DLPFC’s value encoding signiﬁcantly. The

regression analyses similar to Fig. 4but done to the DLPFC neuronal

responses reveal that the population average CPD of SV was sig-

niﬁcantly above the baseline right after the stimulus onset and lasted

until the end of the trials (Fig. 6a). The CPD of NSV was low, although

initially signiﬁcant, and did not last until the luminance change.

Importantly, the CPDs of SV were consistently larger than the CPDs of

NSV during the whole stimulus period (Fig. 6a). The CPDs of individual

units for SV were also signiﬁcantly greater than those for NSV (Fig. 6b,

c, t=7.27, p=1.9e−12, two-tailed paired t-test). The dominance of SV

was strong in monkey G (Supplementary Fig.13g–i). Monkey D’s DLPFC

neurons encoded SV relatively weakly, yet the encoding was still

stronger than that of NSV (Supplementary Fig. 13a–c). In contrast, the

encodings of CV and UCV in the DLPFC were similar, both when in the

combined data (Fig. 6d–f, t=1.56,p= 0.12, two-tailed paired t-test) and

in the individual monkeys (Supplementary Fig. 13d–f, j–l). Therefore,

despite that a strong spatial attention signal is present in the DLPFC

(blue traces in Fig. 6a, d), its modulation of the value encoding

was weak.

Comparison between OFC and DLPFC

Finally, to compare the representations of spatial attention and value

in the OFC and the DLPFC, we plotted the CPD for attention cue

location against the CPD for SV for each neuron from the two areas

(Fig. 7). The CPDs were calculated from regression models similar to

those used in Figs. 4b, 6b, but the average neuronal responses were

computed with the period between the frame cue offset and the

luminance change, during which spatialattention wasmostly drivenby

the top-down mechanism and on the opposite side of the cue. The

CPDs of OFC neurons are largely distributed along the SV axis, while

the CPDs of DLPFC neurons were more evenly distributed along both

axes. Furthermore, for the OFC neurons, the distribution of the neu-

rons’difference between the two CPDs strongly skewed toward SV

(Fig. 7insert, mean ΔCPD = −0.072, p=1.7e−9, two-tailed paired t-test).

In contrast, the DLPFC neurons did not exhibit such a difference (mean

ΔCPD = −0.0031, p= 0.43, two-tailed paired t-test). The analyses con-

ﬁrmed that SV dominatedthe OFC responses, while bothSV and spatial

attention were well represented in the DLPFC. Similar results were

observed in individual monkeys (Supplementary Fig. 15).

Discussion

Here, we demonstrated that while both the behavior and the DLPFC

neuronal responses indicated that the monkeys directed their atten-

tion to the cued location, the OFC neuronal responses were never-

theless dominated by stimulus reward salience and only weakly

affected by spatial attention. The results argue against the previously

proposed theory that attention serves a s a selection mechanism for the

OFC’s value encoding when multiple items with different value asso-

ciations are presented11.

The dominance of reward salience that we observed cannot be

attributed to spatial attention, decision making, or eye movements.

First, unlike reward salience, which was constant in a trial, the cue that

indicated the luminance change was on the opposite side of the

luminance change, and the spatial attention needed to switch sides.

Therefore, spatial attention could be clearly identiﬁed with the switch

of location and be distinguished. Second, the monkeys had to report

their decision by making a saccade to a target above the ﬁxation point.

The motor preparation signals were thereby dissociated from the

attention and salience signals. Third, the reward that the monkey

received was randomlychosen between the two stimuli. Neither the CV

nor the SVwas the expected rewardoutcome. Lastly, no decisions were

guided by the values of the stimuli, so potential attention shifts

accompanying the value-based decision-making process were mini-

mized with the current behavior paradigm.

Spatial attention in our paradigm was dictated by the attention

cue via a top-down mechanism, while reward salience may create

attention from a bottom-up subcortical circuitry, which includes

~0 12 4

Value size

-0.6

-0.4

-0.2

0.2

0.4

Normalized firing rates (spikes/s)

(n =73)

Cued value = 8

Uncued value = 8

Same value

~0 12 4

Value size

-0.6

-0.4

-0.2

0.2

0.4

Normalized firing rates (spikes/s)

(n =53)

Fig. 5 | Spatial attention failed to switch the value encoding in the OFC to the

non-salient stimulus. a The positively tuned OFC neurons’(n= 73) r esponses to an

8-drops-of-juice stimulus (SV) paired witha stimulus associated with~0, 1, 2, 4, or 8

drops of juice (NSV). The red line denotes trials when the attention was toward the

8-drops-of-juice stimulus, and thepurple line denotes trialswhen the attention was

away from the 8-drops-of-juice stimulus. A two-way ANOVA (attention location:

1723

=3.74,p=0.05;value:F

4723

=3.45,p=8.4e−3) indicatesa subtle but signiﬁcant

attentional modulation. A complete switch of the value encoding to NSV would

produce responses close to the black line, which are the neurons’responses to the

trials of a pair of stimuli with the same reward. All error bars indicate SEM across

neurons. bSame as a, but for the negatively tuned OFC neurons (n= 53). Two-way

ANOVA with group (CV = 8/UCV = 8) and value was performed (attention location:

1524

=1.18, p=0.28;value:F

4524

=3.68, p=5.8e–3).

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 7

Content courtesy of Springer Nature, terms of use apply. Rights reserved

superior colliculus, pulvinar, ventral midbrain, and amygdala24,25.

Although reward salience is distinct from the kind of bottom-up

attention based on physical salience, interestingly, we also observed

bottom-up physical salience signals in the OFC. First, the OFC neurons

robustly, although only brieﬂy, encoded the onset of the frame (Fig. 4a,

d, blue trace), which was a salient bottom-up visual signal. Second,

there was a slightly stronger representation of the cued stimulus than

that of the un-cued after the luminance change (Fig. 4d), which,

although subtle, might also capture the attention via a bottom-up

mechanism. Therefore, while the OFC activities were minimally affec-

ted by spatial attention from the top-down source, inﬂuences from the

bottom-up sources, no matter whether they were based on physical or

reward salience, were readily reﬂected in the OFC.

The dominance of reward salience, which was almost all-or-

none, is in stark contrast to how neurons in the visual cortex are

modulated by spatial attention26. The responses of the visual neurons

to multiple stimuli presented in their receptive ﬁeld can be well

described with normalization models that perform a weighted aver-

age of the responses to each stimulus presented alone. Attention

modulates how the responses are weighted by adding bias toward

the attended stimulus, and the neurons’stimulus preference plays an

important role. Whether attended or not, a preferred stimulus pre-

sented in a neuron’s receptive ﬁeld has a signiﬁcant inﬂuence on its

responses27. OFC neurons, however, only encode the value of the

most salient stimulus. That is true even for the negatively tuned

neurons, which prefer the non-salient stimulus. Less salient stimuli

are largely ignored by OFC neurons, and spatial attention, when

directed toward a less salient stimulus, does not enhance its encod-

ing very much in the OFC.

One might argue that the weak representation of NSV in the OFC

was because of the task design, in which the reward for a correct

response was randomly chosen between the two stimuli. Yet, the value

that was relevant to the monkeys in this task—the mean value of the

two stimuli—was also not represented by the OFC. For that to be true,

one would expect that SV and NSV be similarly represented. Yet, the

ﬁndings of the dominance of SV encoding and the weak attentional

modulation may not generalize to other species. Notably, the OFC in

rodents was reported to be modulated by task context28. Given the fact

that the OFCs in rodents and primates are not evolutionarily homo-

logues, such a discrepancy may highlight an interesting difference

previously unknown between the two species.

Finally, our results suggest that the representations of spatial

attention and reward expectancy are dissociable in the brain. The

modulation effects of attention and reward on neuronal activities are

similar, and many previous studies have not clearly distinguished

attention and reward in the experiment design2. Here, we demonstrated

dissociable representations of reward salience and spatial attention in

the prefrontal cortex. Spatial attention was well represented only in the

DLPFC. The reward salience signal remains stable in both the OFC and

the DLPFC even when spatial attention shifted sides via the top-down

mechanism. Such a representation of the salience signal independent of

top-down control may be necessary for the brain to evaluate the

potential targets toward which spatial attention could be directed.

Future experiments could be designed to investigate this hypothesis.

Methods

Subjects

Two male rhesus monkeys (Macaca mulatta) were used. They weighed

9.3 kg (subject D) and 6.8 kg (subject G) at the beginning of the

training. All procedures followed the protocol approved by the Animal

Care Committee of Shanghai Institutes for Biological Sciences, Chinese

Academy of Sciences (CEBSIT-2021004).

0 0.1 0.2 0.3

CPD of salient value

0.05

0.1

0.15

0.2

0.25

0.3

CPD of non-salient value

(p = 1.9e-12)

Number of units

0.045

Sig salient value

Non-sig salient value

0 0.1 0.2 0.3

CPD

100

120

Number of units

0.011 Sig non-salient value

Non-sig non-salient value

0 0.1 0.2 0.3

CPD of cued value

0.05

0.1

0.15

0.2

0.25

0.3

CPD of uncued value

(p = 0.12)

Number of units

0.039

Sig cued value

Non-sig cued value

0 0.1 0.2 0.3

CPD

Number of units

0.032

Sig uncued value

Non-sig uncued value

(n=141) (n=141)

(n=143)

-400 -200 0 200 400 600

0.005

0.01

0.015

0.02

0.025

0.03

0.035

CPD

Time from lum change (ms)Time from stim on (ms)

-400 -200 0 200 400

Attention cue loc

Salient value

Non-salient value

Cue on Stim on Cue off Luminance change

(n=406)

0.005

0.01

0.015

0.02

0.025

0.03

0.035

CPD

Attention cue loc

Cued value

Uncued value

-400 -200 0 200 400 600

Time from lum change (ms)Time from stim on (ms)

-400 -200 0 200 400

Cue on Stim on Cue off Luminance change

(n=406)

Fig. 6 | DLPFC neuronal responses were dominated by the salient value.

aDLPFC neuron (n= 406) ﬁringrates were regressedagainst attention cue location,

SV, and NSV. Plotted is the time course of the population average coefﬁcients of

partialdetermination(CPD). Conventions as in Fig. 4a.bThe CPDs of SV against the

CPDs of NSVfor individual value-selective DLPFC neurons (n= 141).Conventions as

in Fig. 4b. cTop: the distribution of the CPDs for SV of the value-selective DLPFC

neurons (n= 141). B ottom: the distr ibution of the DLPFC neurons’CPDs for NSV.

Vertical dashed lines indicate the mean. Filled bars indicate signiﬁcant neurons.

dSame as (a), but for CV and UCV (n=406).eSame as b, but for CV and UCV

(n= 143). fTop: the distribution of the CPDs for CV of the value-selective DLPFC

neurons (n= 143). Bottom: the distribution of the DLPFC neurons’CPDs for UCV.

Vertical dashed lines indicate the means. Filled bars indicate signiﬁcant neurons.

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 8

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Behavioral task and materials

Head-restrained monkeys were seated in a primate chair facing a

23.6 inch computer monitor at 60cm away from their eyes. The center

of the screen was adjusted to align to the midpoint of the two eyes.

Behavioral tasks were run with the MATLAB based software

MonkeyLogic29. Subjects’eye position and pupil dilation were tracked

with an infrared oculomotor system at a sampling rate of 500 Hz

(EyeLink 1000). Juice was delivered by a computer-controlled

solenoid.

We trained two monkeys to perform a visual detection task

(Fig. 1a). The monkeys had to hold their gaze ata central ﬁxation point

on the screen within a 3° wide window. After maintaining ﬁxation for

0.5 s, a yellow saccade target was presented 6.0° (monkey D) or 7.6°

(monkey G) above the ﬁxation point. The saccade target remained on

the screen until the end of the trial. After 0.5 s, a square frame

(attention cue) 2.4° wide was presented on the left or right (monkey D:

7.0°; monkey G: 7.6°) of the ﬁxation point. After another 0.2 s, two

visual stimuli of 2.4° size appeared on the left and right sides (monkey

D: 7.0°; monkey G: 7.6°) of the ﬁxation point. The stimulus locations

were rotated slightly (23° counter-clockwise) for monkey G to mini-

mize its left-right bias. The attention cue disappeared 0.2 s after the

stimulus onset. After a random variable delay, one of the two stimuli

would change its luminance.

For monkey D, there were 90% of trials (valid-cue trials) the

luminancechangewasontheoppositesideoftheattentioncue(cued

location). In the remaining trials (invalid-cue trials), the change wason

the same side of the cue (un-cued location). In both cases, the monkey

was required toreport the change within0.1–0.4 s by making a saccade

toward the eye movement target to receive a juice reward. The onset

latency of the luminance change was subject to an exponential decay

function (tau = 2.5 s, cut off at 1.4 s) plus 0.4 s.

To encourage monkey G to direct the attention appropriately

according to the cue, it was trained to report only the luminance

change at the cued location (target) but ignore the change at the un-

cued location (distractor). In 80% of trials, the luminance change

occurred at the cued location. Among the rest of 20% of trials, half of

them had a luminance change at the un-cuedlocation only (distractor-

only trials, 10%), and the other half had ﬁrst a luminance change at the

un-cued location and then another at the cued location (distractor +

target trials, 10%). The latencies of the luminance change at the target

and the distractor locations were subject to their respective expo-

nential decay functions (target: tau = 2.5 s, cut off at 1.9 s; distractor:

tau =1.25 s, cut off at 1.9 s) plus 0.4s. The monkey was rewarded when

they made a timely response to the target in both the target and the

distractor+target trials, and when they held their ﬁxation till the end of

the trial in the distractor-only trials.

The visual stimuli were associated with different amounts of juice

that the monkeys might get for a correct response. There were ﬁve

stimuli for each monkey, each was associated with 1 small drop

(0.033ml), 1, 2, 4, and 8 standard drops (0.10, 0.16, 0.29, and 0.55ml)

of juice, respectively. For convenience, we label the stimulus with a

small drop of juice as ~0 standard drops in the ﬁgures and use 0 in the

regression models. The stimuli in each trial were randomly selected

from the stimulus set, and their locations (left or right) were coun-

terbalanced. For correct responses, one of the stimuli was randomly

selected, and the monkey would receive the reward associated with

that stimulus. Therefore, neither the cue nor the reward salience

provided information on which stimulus’s associated reward would

be given.

During the initial training, only one stimulus was presented on the

screen, and the monkeys were rewarded with its associated juice

amount if they detected the luminance change correctly. The attention

cue always appeared on the opposite side of the ﬁxation point (always

valid). During the recording sessions, the single-stimulus and double-

stimulus trials were interleaved. There were 7.1% and 10.1% of single-

stimulus trials for monkey D and monkey G, and the rest were double-

stimuli trials.

Surgery and MRI

Before the behavioral training, both monkeys received a chronic

implant of a titanium headpost. After a 2-month recovery, they were

trained to perform the behavioral task until they achieved satisfactory

performance. Then, the monkeys received structural Magnetic Reso-

nance Imaging (MRI) scans for us todetermine the recording chamber

location. After the chamber implant surgery, a manganese-enhanced

MRI scan was conducted to verify the chamber placement. The scans

were carried out in a Siemens 3 T scanner.

During the surgery, the monkeys were sedated with ketamine

hydrochloride (10 mg/kg), and anesthesia was then induced and

maintained with isoﬂurane gas (1.5–2%, to effect). Body temperature,

heart rate, blood-oxygen concentration, and expired CO

were mon-

itored throughout the surgical procedures.

Neuronal recordings

A 2 cm * 1.5 cm chamber was implanted on the surface of the left

(subject G) and the right (subject D) prefrontal cortex, centered 31.5

(subject G) and 27.5 (subject D) mm anterior to the interaural line

(Supplementary Fig. 1). We recorded extracellular single-unit activities

with tungsten microelectrodes (FHC: 0.3–2MΩ;AlphaOmega:

0.5–3MΩ). Each electrode was driven by an independent microdrive

(AlphaOmega EPS) through a stainless-steel guide tub e. The guide tube

was placed within a grid with holes 1 mm apart. The depth of

the penetration was conﬁrmed by the transitions between gray and

0 0.1 0.2 0.3 0.4

CPD of attention cue location

0.05

0.1

0.15

0.2

0.25

0.3

0.35

0.4

CPD of salient value

OFC

DLPFC

-0.2 0.2

Δ CPD

-0.0031

-0.072

0.02

0.04

0.06

Proportion of units

(n=225)

Fig. 7 | Comparison of the spatial attention and value signals between OFC and

DLPFC neurons. The CPDs of the attention cue location are plotted against the

CPDs of SV for individual OFC (orange) and DLPFC (magenta) neurons. The dotted

line is the diagonal. Only the neurons with non-zero coefﬁcients for the attention

cue location or SV were plotted (n=225,p< 0.01, one-sample t-test without mul-

tiple comparisons). The insert shows the distribution of the CPD differences

between the two CPDs. Vertical dashed lines indicate the mean.

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 9

Content courtesy of Springer Nature, terms of use apply. Rights reserved

white matter. At mostfour electrodes were lowered at a time. Neuronal

signals were recorded with an AlphaOmega SnR system at a sampling

rate of 44 kHz. We recorded neurons from the OFCand the DLPFC. The

OFC recording locations were between the lateral and medial orbital

sulci in Walker’s areas 11 and 13. The DLPFC neurons were recorded

from the area rostral to the arcuate sulcus (8a and 8b), including both

the dorsal and the ventral bank of the principal sulcus (46d and 46 v).

There were 171 recording sessions overall, 71 from monkey D and

100 from monkey G. Each recording session contained on average

1381 trials (monkey D: 1668 trials; monkey G: 1177 trials), of which 885

were correct trials (monkey D: 905 trials; monkey G: 871 trials).

Neurons with <100 valid-cue (target) trials were excluded from the

analyses.

Voltage signals of putative single neurons were isolated ofﬂine

manually with Plexon Ofﬂine Sorter (Plexon, Dallas, TX). Neurons with

poor isolation or with a lower than 1Hz response rate were excluded.

There were no additional selection criteria for neurons.

Behavioral analyses

All the behavior analyses were based on the monkeys’performance

during the recording sessions.

Accuracy and reaction time

For monkey D, an eyemovement to the saccade target within 0.1–0.4 s

after the luminance change is considered a hit. Otherwise, it would be

considered as a miss. For monkey G, hits and misses are similarly

deﬁned, except that any eye movements to the saccade target within

the 0.1–0.4 s time window on the distractor’s luminance change are

counted as false alarms. The RT is deﬁned as the time from the lumi-

nance change to the eye-movement initiation in the hit and false alarm

trials.

Neuronal selectivity

To determine how the value information was encoded by the neurons,

linear regressions (ﬁtlm function in Matlab Statistical Toolbox) were

performed for each task variable for each individual unit in three task

epochs:

Rnr,tðÞ=β0,ntðÞ+β1,ntðÞ*variable rðÞ+ε,ð1Þ

where R

(r, t) was the average neural response of unit nfor a given trial

rwithin time window t. In the single-stimulus trials, the only relevant

task variable is stimulus value (V

sin

). In the double-stimuli trials, the

task variablesinclude left stimulusvalue (LV), right stimulus value (RV),

difference between LV and RV (LV-RV), CV, UCV, difference between

CV and UCV (CV-UCV), SV, NSV, difference between SV and NSV (SV-

NSV), total value (TV) and attention cue’s location (att cue loc). εwas

independent Gaussian noise. The task epochs were the cue-stimulus

epoch (0.0–0.2 s after the stimulus onset), the early stimulus epoch

(0.2–0.6 s after the stimulus onset), and the late stimulus epoch

(0.0–0.4 s before the luminance change). A neuron is selective to a

variable if β

1,n

is signiﬁcantly different than 0 (p< 0.005).

A neuron is considered as value selective if it is selective to any

value variables above during any task epochs (cue-stimulus, early sti-

mulus,andlatestimulus).Wefurtherdividevalueselectiveneurons

into positively tuned and negatively tuned neurons. Neurons with

contradictory tuning are not included in Figs. 3,5, and Supplementary

Table1(e.g.,positivelytunedtoonevaluevariablebutnegatively

tuned to another value variable or positively tuned during one epoch

but negatively tuned during another epoch).

Decoding spatial attention

Linear discriminant analyses (LDA, classify function in Matlab Statis-

tical Toolbox) were used to decode spatial attention location from the

neuronal responses of DLPFC neurons. LDA aims to classify an

observation into one of the K classes through modeling the posterior

probability pðB=b∣A=aÞ,whereAis the observation (neural respon-

ses), and Bis the predictor (left attention cue or right attention cue).

The posterior probability is calculated through Bayes’theorem which

requires prior probability p(B) and conditional probability density

functions pðA=a∣B=bÞ. LDA assumes that the probability density

function given each condition is a normal distribution. Because the left

attention cue and the right attention cue were equally likely, the prior

was set to be 0.5.

Before conducting the LDA, we z-scored the responses of each

neuron across all the trials and all time points (−400–600 ms around

stimulus onset and −400–400 ms around luminance change). The

normalized responses were further smoothed in 100ms windows with

a Gaussian kernel (sigma = 50 ms). The LDA was performed using a

sliding window of 25 ms with 10 ms steps.

We constructed pseudo-neuron ensembles as follows. To balance

the contribution from each neuron, we ﬁrst selected the neurons with

more than 100 trials in both the left attention cue condition and the

right attention cue condition. We randomly chose 100 trials without

replacements from each condition for each neuron and constructed

the confusion matrix X2RM×T×Nwith the sampled trials, where Mis

the number of trials (M= 200, 100 trials in each cue condition), Tis the

number of time bins, and Nis the number of neurons (subject D:

N= 102; subject G: N=93).Toreducenoise,werantheprincipal

component analysis (PCA) and kept the ﬁrst Pcomponents that cap-

tured at least 70% of the variance for the LDA.

To estimate how spatial attention signals ﬂuctuated with time

(Fig. 2c, e), PCAs were run on the neuron dimension of Xat each time

bin. Only the ﬁrst Pprincipalcomponentsthatcapturedatleast70%of

the total variance were used to reconstruct the subspace

(Y2RM×T×P). The decoder was trained and tested with the recon-

structed subspace at each time bin. The posterior probability of

attention cue location given the neuronal responses was used to

quantify the decoder performance. The results were based on 200

independent leave-one-out cross-validations. Signiﬁcance was estab-

lishedby two-sided paired t-tests comparingthe actual data against the

posterior probabilities calculated in the same procedure but with the

shufﬂed data.

In Fig. 2b, d, a PCA was run on the neuron dimension of the

confusion matrix that combined all time bins (X02RMT ×N). This

ensured that the neural representation at each time bin shared the

same subspace after PCA. Only the ﬁrst Qprincipal components that

captured at least 70% of thetotal variance were used to reconstructthe

subspace (Y02RMT ×Q). Y0was reshaped into a RM×T×Qmatrix to

enable cross-temporal decoding. The decoder was trained with the

neural responses in one time bin (Fig. 2b: 25 ms time bin, stepped by

10 ms; Fig. 2d: 50–200 ms before stimulus onset), and tested with the

responses in all time bins.

Linear regression models

In Figs. 4a, d and 6a, d, linear regression models were ﬁttoaverage

ﬁring rates calculated with a 25 ms time window, aligned to the sti-

mulus onset and the luminance change, respectively. We constructed

two linear regression models:

Rnr,tðÞ=β0,ntðÞ+β1,ntðÞattention cue loc rðÞ+β2,ntðÞSV rðÞ+β3,ntðÞNSV rðÞ+ε,

ð2Þ

Rnr,tðÞ=β0,ntðÞ+β1,ntðÞattention cue loc rðÞ+β2,ntðÞCV rðÞ+β3,ntðÞUCV rðÞ+ε,

ð3Þ

where Rnr,tðÞis the average ﬁring rate of neuron nin trial rat time t,

and the predictors were attention cue location(coded as −1 or 1), SV (0,

1,2,4,or8),NSV(0,1,2,4,or8),CV(0,1,2,4,or8),UCV(0,1,2,4,or8).

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 10

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Each predictor was z-scored. The CPD was calculated as:

CPDi,ntðÞ=SSEi,nðtÞSSEall,nðtÞ

SSEi,nðtÞ,ð4Þ

where CPDi,ntðÞis the CPD of variable iin neuron n, SSEi,nðtÞis the

residual sum of squares of the regression model without variable i,

SSEall,nðtÞis the residual sum of squares of the full model. The sig-

niﬁcance of CPDi,ntðÞ is tested against the baseline (paired t-test,

p< 0.005 with FDR correction for multiple comparisons), which is the

CPD calculated with the ﬁring rate in a 200 ms time window before the

attention cue onset averaged across different regressors:

CPDbaseline,n=1

i=1

CPDi,nðt0Þ,ð5Þ

TheCPDsinFigs.4b, d, e, f and 6b, d, e, f were calculated similarly

but with the average ﬁring rate between the stimulus onset and the

luminance change. To speciﬁcally study the attention from the top-

down source, CPDs in Fig. 7were calculated with the average ﬁring rate

between the frame cue offset and the luminance change.

Individual monkey analyses

All results presented in the main text are based on the combined data

from both monkeys. Analyses based on each monkey individually can

be found in Supplementary Figs. 2, 6, 8, 10, 13, 15 corresponding to

Figs. 2–7in the main text. The results are consistent.

Reporting summary

Further information on research design is available in the Nature

Research Reporting Summary linked to this article.

Data availability

The data used in this study are available at https://doi.org/10.5281/

zenodo.7090240.

Code availability

The custom codes supporting the ﬁndings of this study are availableat

https://github.com/tmyang-lab/reward_salience_in_OFC.

References

1. Connor, C. E., Egeth, H. E. & Yantis, S. Visual attention: Bottom-up

versus top-down. Curr. Biol. 14,850–852 (2004).

2. Maunsell, J. H. R. Neuronal representations of cognitive state:

Reward or attention? Trends Cogn. Sci. 8,261–265 (2004).

3. Awh, E., Belopolsky, A. V. & Theeuwes, J. Top-down versus bottom-

up attentional control: a failed theoretical dichotomy. Trends Cogn.

Sci. 16,437–443 (2012).

4. Chelazzi, L., Perlato, A., Santandrea, E. & Della Libera, C. Rewards

teach visual selective attention. Vis. Res. 85,58–72 (2013).

5. Gottlieb,J.,Hayhoe,M.,Hikosaka,O.&Rangel,A.Attention,reward,

and information seeking. J. Neurosci. 34,15497–15504 (2014).

6. Anderson, B. A., Laurent, P. A. & Yantis, S. Value-driven attentional

capture. Proc. Natl Acad. Sci. U. S. A. 108,10367–10371 (2011).

7. Krajbich, I., Armel, C. & Rangel, A. Visual ﬁxations and the compu-

tation and comparison of value in simple choice. Nat. Neurosci. 13,

1292–1298 (2010).

8. Towal, R. B., Mormann, M. & Koch, C. Simultaneous modeling of

visual saliency and value computation improves predictions of

economic choice. Proc. Natl Acad. Sci. U. S. A. 110,

E3858–E3867 (2013).

9. McGinty, V. B., Rangel, A. & Newsome, W. T. Orbitofrontal Cortex

Value Signals Depend on Fixation Location during Free Viewing.

Neuron 90,1–14 (2016).

10. McGinty, V. B. Overt attention toward appetitive cues enhances

their subjective value, independent of orbitofrontal cortex activity.

eNeuro 6,1–19 (2019).

11. Xie, Y., Nie, C. & Yang, T. Covert shift of attention modulates the

value encoding in the orbitofrontal cortex. Elife 7,e31507(2018).

12. Wallis, J. D. & Miller, E. K. Neuronal activity in primate dorsolateral

and orbital prefrontal cortex during performance of a reward pre-

ference task. Eur. J. Neurosci. 18,2069–2081 (2003).

13. Padoa-Schioppa, C. & Assad, J. A. Neurons in the orbitofrontal

cortex encode economic value. Nature 441, 223–226 (2006).

14. Ballesta, S., Shi, W., Conen, K. E. & Padoa-Schioppa, C. Values

encoded in orbitofrontal cortex are causally related to economic

choices. Nature 588,450–453 (2020).

15. Hunt, L. T. et al. Triple dissociation of attention and decision com-

putations across prefrontal cortex. Nat. Neurosci. 21,

1471–1481 (2018).

16. Murray, E. A. & Rudebeck, P. H. Specializations for reward-guided

decision-making in the primate ventral prefrontal cortex. Nat. Rev.

Neurosci. 19,404–417 (2018).

17. Wallis, J. D. & Rich, E. L. Decoding subjective decisions from orbi-

tofrontal cortex. Nat. Neurosci. 7,973–980 (2016).

18. Hayden, B. Y. & Moreno-Bote, R. A neuronal theory of sequential

economic choice. Brain Neurosci. Adv. 2,1–15 (2018).

19. Posner, M. I. Orienting of attention. Q. J. Exp. Psychol. 32,

3–25 (1980).

20. Tremblay, S., Pieper, F., Sachs, A. & Martinez-Trujillo, J. Attentional

Filtering of Visual Information by Neuronal Ensembles in the Pri-

mate Lateral Prefrontal Cortex. Neuron 85,202–215 (2015).

21. Tremblay,S.,Doucet,G.,Pieper,F.,Sachs,A.&Martinez-Trujillo,J.

Single-Trial Decoding of Visual Attention from Local Field Potentials

in the Primate Lateral Prefrontal Cortex Is Frequency-Dependent. J.

Neurosci. 35,9038–9049 (2015).

22. Rigotti, M. et al. The importance of mixed selectivity in complex

cognitive tasks. Nature 497,585–590 (2013).

23. Mante,V.,Sussillo,D.,Shenoy,K.V.&Newsome,W.T.Context-

dependent computation by recurrent dynamics in prefrontal cor-

tex. Nature 503,78–84 (2013).

24. Takakuwa, N., Kato, R., Redgrave, P. & Isa, T. Emergence of visually-

evoked reward expectation signals in dopamine neurons via the

superior colliculus in V1 lesioned monkeys. Elife 6, e24459 (2017).

25. McFadyen, J., Mattingley, J. B. & Garrido, M. I. An afferent white

matter pathway from the pulvinar to the amygdala facilitates fear

recognition. Elife 8,e40766(2019).

26. Maunsell, J. H. R. Neuronal mechanisms of visual attention. Annu.

Rev. Vis. Sci. 1,373–391 (2015).

27. Ni,A.M.,Ray,S.&Maunsell,J.H.R.Tunednormalizationexplains

the size of attention modulations. Neuron 73,803–813 (2012).

28. Nogueira, R. et al. Lateral orbitofrontal cortex anticipates choices

and integrates prior with current information. Nat. Commun. 8,

1–13 (2017).

29. Asaad,W.F.,Santhanam,N.,Mcclellan,S.&Freedman,D.J.High-

performance execution of psychophysical tasks with complex

visual stimuli in MATLAB. J. Neurophysiol. 109,249–260 (2013).

Acknowledgements

We thank John Maunsell for his comments during the preparation of

the paper. We also thank Ruixin Su, Wei Kong, and Lu Yu for their help

in all phases of the study. This work was supported by the National

Science and Technology Innovation 2030 Major Program (Grant No.

2021ZD0203701), National Key R&D Program of China (Grant No.

2019YFA0709504), Shanghai Municipal Science and Technology

Major Project (Grant No. 2018SHZDZX05), and the Strategic Priority

Research Program of Chinese Academy of Science (Grant No.

XDB32070100).

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 11

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Author contributions

T.Y. conceived the original idea of the study. W.Z. and Y.X. collected the

behavioral and neurophysiological data and performed the analysis.

W.Z. and T.Y. wrote the paper. All authors designed the experiments,

discussed the results and provided the feedback on the paper.

Competing interests

The authors declare no competing interests.

Additional information

Supplementary information The online version contains

supplementary material available at

https://doi.org/10.1038/s41467-022-34084-0.

Correspondence and requests for materials should be addressed to

Tianming Yang.

Peer review information Nature Communications thanks Pieter Roelf-

sema and the other, anonymous, reviewer(s) for their contribution to the

peer review of this work. Peer reviewer reports are available.

Reprints and permissions information is available at

http://www.nature.com/reprints

Publisher’s note Springer Nature remains neutral with regard to jur-

isdictional claims in published maps and institutional afﬁliations.

Open Access This article is licensed under a Creative Commons

Attribution 4.0 International License, which permits use, sharing,

adaptation, distribution and reproduction in any medium or format, as

long as you give appropriate credit to the original author(s) and the

source, provide a link to the Creative Commons license, and indicate if

changes were made. The images or other third party material in this

article are included in the article’s Creative Commons license, unless

indicated otherwise in a credit line to the material. If material is not

included in the article’s Creative Commons license and your intended

use is not permitted by statutory regulation or exceeds the permitted

use, you will need to obtain permission directly from the copyright

holder. To view a copy of this license, visit http://creativecommons.org/

licenses/by/4.0/.

Article https://doi.org/10.1038/s41467-022-34084-0

Nature Communications | (2022) 13:6306 12

Content courtesy of Springer Nature, terms of use apply. Rights reserved

Terms and Conditions

Springer Nature journal content, brought to you courtesy of Springer Nature Customer Service Center GmbH (“Springer Nature”).

Springer Nature supports a reasonable amount of sharing of research papers by authors, subscribers and authorised users (“Users”), for small-

scale personal, non-commercial use provided that all copyright, trade and service marks and other proprietary notices are maintained. By

accessing, sharing, receiving or otherwise using the Springer Nature journal content you agree to these terms of use (“Terms”). For these

purposes, Springer Nature considers academic use (by researchers and students) to be non-commercial.

These Terms are supplementary and will apply in addition to any applicable website terms and conditions, a relevant site licence or a personal

subscription. These Terms will prevail over any conflict or ambiguity with regards to the relevant terms, a site licence or a personal subscription

(to the extent of the conflict or ambiguity only). For Creative Commons-licensed articles, the terms of the Creative Commons license used will

apply.

We collect and use personal data to provide access to the Springer Nature journal content. We may also use these personal data internally within

ResearchGate and Springer Nature and as agreed share it, in an anonymised way, for purposes of tracking, analysis and reporting. We will not

otherwise disclose your personal data outside the ResearchGate or the Springer Nature group of companies unless we have your permission as

detailed in the Privacy Policy.

While Users may use the Springer Nature journal content for small scale, personal non-commercial use, it is important to note that Users may

not:

use such content for the purpose of providing other users with access on a regular or large scale basis or as a means to circumvent access

control;

use such content where to do so would be considered a criminal or statutory offence in any jurisdiction, or gives rise to civil liability, or is

otherwise unlawful;

falsely or misleadingly imply or suggest endorsement, approval , sponsorship, or association unless explicitly agreed to by Springer Nature in

writing;

use bots or other automated methods to access the content or redirect messages

override any security feature or exclusionary protocol; or

share the content in order to create substitute for Springer Nature products or services or a systematic database of Springer Nature journal

content.

In line with the restriction against commercial use, Springer Nature does not permit the creation of a product or service that creates revenue,

royalties, rent or income from our content or its inclusion as part of a paid for service or for other commercial gain. Springer Nature journal

content cannot be used for inter-library loans and librarians may not upload Springer Nature journal content on a large scale into their, or any

other, institutional repository.

These terms of use are reviewed regularly and may be amended at any time. Springer Nature is not obligated to publish any information or

content on this website and may remove it or features or functionality at our sole discretion, at any time with or without notice. Springer Nature

may revoke this licence to you at any time and remove access to any copies of the Springer Nature journal content which have been saved.

To the fullest extent permitted by law, Springer Nature makes no warranties, representations or guarantees to Users, either express or implied

with respect to the Springer nature journal content and all parties disclaim and waive any implied warranties or warranties imposed by law,

including merchantability or fitness for any particular purpose.

Please note that these rights do not automatically extend to content, data or other material published by Springer Nature that may be licensed

from third parties.

If you would like to use or distribute our Springer Nature journal content to a wider audience or on a regular basis or in any other manner not

expressly permitted by these Terms, please contact Springer Nature at

onlineservice@springernature.com

Self-Attention in Transformer Networks Explains Monkeys' Gaze Pattern in Pac-Man Game

Preprint

Full-text available

Jun 2024

We proactively direct our eyes and attention to collect information during problem solving and decision making. Understanding gaze patterns is crucial for gaining insights into the computation underlying the problem-solving process. However, there is a lack of interpretable models that can account for how the brain directs the eyes to collect information and utilize it, especially in the context of complex problem solving. In the current study, we analyzed the gaze patterns of two monkeys playing the Pac-Man game. We trained a transformer network to mimic the monkeys' gameplay and found its attention pattern captures the monkeys' eye movements. In addition, the prediction based on the transformer network's attention outperforms the human subjects' predictions. Importantly, we dissected the computation underlying the attention mechanism of the transformer network, revealing its layered structures reflecting a value-based attention component and a component that captures the interactions between Pac-Man and other game objects. Based on these findings, we built a condensed attention model that is not only as accurate as the transformer network but also fully interpretable. Our results highlight the potential of using transformer neural networks to model and understand the cognitive processes underlying complex problem solving in the brain, opening new avenues for investigating the neural basis of cognition.

Attentional Control Influence Habituation through Modulation of Connectivity Patterns within the Prefrontal Cortex: Insights from Stereo-EEG

Article

May 2024
NEUROIMAGE

Attentional control, guided by top-down processes, enables selective focus on pertinent information, while habituation, influenced by bottom-up factors and prior experiences, shapes cognitive responses by emphasizing stimulus relevance. These two fundamental processes collaborate to regulate cognitive behavior, with the pre�frontal cortex and its subregions playing a pivotal role. Nevertheless, the intricate neural mechanisms underlying the interaction between attentional control and habituation are still a subject of ongoing exploration. To our knowledge, there is a dearth of comprehensive studies on the functional connectivity between subsystems within the prefrontal cortex during attentional control processes in both primates and humans. Utilizing stereo�electroencephalogram (SEEG) recordings during the Stroop task, we observed top-down dominance effects and corresponding connectivity patterns among the orbitofrontal cortex (OFC), the middle frontal gyrus (MFG), and the inferior frontal gyrus (IFG) during heightened attentional control. These findings highlighting the involvement of OFC in habituation through top-down attention. Our study unveils unique connectivity profiles, shedding light on the neural interplay between top-down and bottom-up attentional control processes, shaping goal-directed attention.

Defining the r factor for post-trauma resilience and its neural predictors

Article

Apr 2024

Distinct roles of the orbitofrontal cortex, ventral striatum, and dopamine neurons in counterfactual thinking of decision outcomes

Article

Aug 2023

Individuals often assess past decisions by comparing what was gained with what would have been gained had they acted differently. Thoughts of past alternatives that counter what actually happened are called "counterfactuals." Recent theories emphasize the role of the prefrontal cortex in processing counterfactual outcomes in decision-making, although how subcortical regions contribute to this process remains to be elucidated. Here we report a clear distinction among the roles of the orbitofrontal cortex, ventral striatum and midbrain dopamine neurons in processing counterfactual outcomes in monkeys. Our findings suggest that actually gained and counterfactual outcome signals are both processed in the cortico-subcortical network constituted by these regions but in distinct manners and integrated only in the orbitofrontal cortex in a way to compare these outcomes. This study extends the prefrontal theory of counterfactual thinking and provides key insights regarding how the prefrontal cortex cooperates with subcortical regions to make decisions using counterfactual information.

Rethinking resilience using the r-factor: Neurobiological mechanisms of static and dynamic components of resilience following trauma

Preprint

Full-text available

Jun 2023

Resilience is a dynamic process of recovery after trauma, but in most studies it is conceptualized as the absence of specific psychopathology following trauma. Using the large emergency department AURORA study (n=1,865, 63% women), we took a longitudinal, dynamic and transdiagnostic approach to define a static resilience (r) factor, that could explain >50% of variance in mental wellbeing 6-months following trauma, and a dynamic r-factor, which represented recovery from initial symptoms. We assessed its neurobiological profile across threat, inhibition, and reward processes using fMRI collected 2-weeks post-trauma. Our neuroimaging results suggest that resilience is promoted by activation of higher-level cognitive functioning and salience network regions in response to reward, whereas resilience is hampered by top-down attention and default mode network activation to threat and reward. These findings serve to redefine the concept of resilience as both dynamic and multifaceted, and identify mechanisms that promote resilience after trauma for early interventions.

Gaze-centered gating and reactivation of value encoding in orbitofrontal cortex

Preprint

Full-text available

Apr 2023

During economic choice, we often consider options in alternation, until we commit to one. Nonetheless, neuroeconomics typically ignores the dynamic aspects of deliberation. We trained macaques to perform a value-based decision-making task in which two risky offers were presented in sequence at different locations of the visual field, each followed by a delay epoch where offers were invisible. Subjects looked at the offers in sequence, as expected. Surprisingly, during the delay epochs, we found that subjects still tend to look at empty locations where the visual offers had previously appeared; moreover, longer fixation to given empty location increases the probability of choosing the associated offer, even after controlling for the offer values. We show that activity in orbitofrontal cortex (OFC) reflects the value of the gazed offer, but also the value of the offer associated with the gazed spatial location, even if it is not the most recently viewed. This reactivation reflects a reevaluation process, as fluctuations in neural spiking during offer stimuli presentation and delays correlate with upcoming choice. Our results suggest that look-at-nothing gazing triggers the reactivation of a previously seen offer for further reevaluation, revealing novel aspects of deliberation.

Gaze-centered gating and re-activation of value encoding in orbitofrontal cortex

Preprint

Full-text available

Apr 2023

During economic choice, we often consider options in alternation, until we commit to one. Nonetheless, neuroeconomics typically ignores the dynamic aspects of deliberation. We trained macaques to perform a value-based decision-making task where two risky offers were presented in sequence at different locations of the visual field, each followed by a delay epoch where offers were invisible. Subjects look at the offers in sequence, as expected. Surprisingly, during the delay epochs, the empty locations where the visual offers were formerly presented are looked at; and longer fixation to an empty location increases the probability of choosing the associated offer, even after controlling for the offer values. We show that activity in orbitofrontal (OFC) cortex reflects the value of the offer that is gazed, but also when its empty location is gazed. In particular, the value of the first offer is reactivated when animals gaze to its empty location in the last delay epoch. This reactivation reflects reevaluation, as activity fluctuations correlate with choice. Our results suggest that look-at-nothing gazing triggers the reactivation of a previously seen offer for further reevaluation, revealing novel aspects of deliberation.

Sensory over-responsivity and atypical neural responses to socially relevant stimuli in autism

Article

Jul 2024
AUTISM RES

Although aversive responses to sensory stimuli are common in autism spectrum disorder (ASD), it remains unknown whether the social relevance of aversive sensory inputs affects their processing. We used functional magnetic resonance imaging (fMRI) to investigate neural responses to mildly aversive nonsocial and social sensory stimuli as well as how sensory over‐responsivity (SOR) severity relates to these responses. Participants included 21 ASD and 25 typically‐developing (TD) youth, aged 8.6–18.0 years. Results showed that TD youth exhibited significant neural discrimination of socially relevant versus irrelevant aversive sensory stimuli, particularly in the amygdala and orbitofrontal cortex (OFC), regions that are crucial for sensory and social processing. In contrast, ASD youth showed reduced neural discrimination of social versus nonsocial stimuli in the amygdala and OFC, as well as overall greater neural responses to nonsocial compared with social stimuli. Moreover, higher SOR in ASD was associated with heightened responses in sensory‐motor regions to socially‐relevant stimuli. These findings further our understanding of the relationship between sensory and social processing in ASD, suggesting limited attention to the social relevance compared with aversiveness level of sensory input in ASD versus TD youth, particularly in ASD youth with higher SOR.

Distinct roles of the orbitofrontal cortex, ventral striatum, and dopamine neurons in counterfactual thinking of decision outcomes

Preprint

Mar 2023

Individuals often assess past decisions by comparing what was gained with what would have been gained had they acted differently. Thoughts of past alternatives that counter what actually happened are called “counterfactuals”. Recent theories emphasize the role of the prefrontal cortex in processing counterfactual outcomes in decision-making, although how subcortical regions contribute to this process remains to be elucidated. Here we report a clear distinction among the roles of the orbitofrontal cortex, ventral striatum and midbrain dopamine neurons in processing counterfactual outcomes in monkeys. Our findings suggest that actually-gained and counterfactual outcome signals are both processed in the cortico-subcortical network constituted by these regions but in distinct manners, and integrated only in the orbitofrontal cortex in a way to compare these outcomes. This study extends the prefrontal theory of counterfactual thinking and provides key insights regarding how the prefrontal cortex cooperates with subcortical regions to make decisions using counterfactual information. Teaser Cortical and subcortical systems both contribute to counterfactual thinking of decision outcomes but in distinct manners.

Values encoded in orbitofrontal cortex are causally related to economic choices

Article

Full-text available

Dec 2020
NATURE

In the eighteenth century, Daniel Bernoulli, Adam Smith and Jeremy Bentham proposed that economic choices rely on the computation and comparison of subjective values¹. This hypothesis continues to inform modern economic theory² and research in behavioural economics³, but behavioural measures are ultimately not sufficient to verify the proposal⁴. Consistent with the hypothesis, when agents make choices, neurons in the orbitofrontal cortex (OFC) encode the subjective value of offered and chosen goods⁵. Value-encoding cells integrate multiple dimensions6–9, variability in the activity of each cell group correlates with variability in choices10,11 and the population dynamics suggests the formation of a decision¹². However, it is unclear whether these neural processes are causally related to choices. More generally, the evidence linking economic choices to value signals in the brain13–15 remains correlational¹⁶. Here we show that neuronal activity in the OFC is causal to economic choices. We conducted two experiments using electrical stimulation in rhesus monkeys (Macaca mulatta). Low-current stimulation increased the subjective value of individual offers and thus predictably biased choices. Conversely, high-current stimulation disrupted both the computation and the comparison of subjective values, and thus increased choice variability. These results demonstrate a causal chain linking subjective values encoded in OFC to valuation and choice.

Overt Attention toward Appetitive Cues Enhances Their Subjective Value, Independent of Orbitofrontal Cortex Activity

Article

Full-text available

Sep 2019

Vincent B McGinty

Neural representations of value underlie many behaviors that are crucial for survival. Previously, we found that value representations in primate orbitofrontal cortex (OFC) are modulated by attention, specifically, by overt shifts of gaze toward or away from reward-associated visual cues (McGinty et al., 2016). Here, we investigate the influence of overt attention on behavior by asking how gaze shifts correlate with reward anticipatory responses and whether activity in OFC mediates this correlation. Macaque monkeys viewed pavlovian conditioned appetitive cues on a visual display, while the fraction of time they spent looking toward or away from the cues was measured using an eye tracker. Also measured during cue presentation were the reward anticipation, indicated by conditioned licking responses (CRs), and single-neuron activity in OFC. In general, gaze allocation predicted subsequent licking responses: the longer the monkeys spent looking at a cue at a given time point in a trial, the more likely they were to produce an anticipatory CR later in that trial, as if the subjective value of the cue were increased. To address neural mechanisms, mediation analysis measured the extent to which the gaze-CR correlation could be statistically explained by the concurrently recorded firing of OFC neurons. The resulting mediation effects were indistinguishable from chance. Therefore, while overt attention may increase the subjective value of reward-associated cues (as revealed by anticipatory behaviors), the underlying mechanism remains unknown, as does the functional significance of gaze-driven modulation of OFC value signals.

An afferent white matter pathway from the pulvinar to the amygdala facilitates fear recognition

Article

Full-text available

Jan 2019
eLife

Our ability to rapidly detect threats is thought to be subserved by a subcortical pathway that quickly conveys visual information to the amygdala. This neural shortcut has been demonstrated in animals but has rarely been shown in the human brain. Importantly, it remains unclear whether such a pathway might influence neural activity and behavior. We conducted a multimodal neuroimaging study of 622 participants from the Human Connectome Project. We applied probabilistic tractography to diffusion-weighted images, reconstructing a subcortical pathway to the amygdala from the superior colliculus via the pulvinar. We then computationally modeled the flow of haemodynamic activity during a face-viewing task and found evidence for a functionally afferent pulvinar-amygdala pathway. Critically, individuals with greater fibre density in this pathway also had stronger dynamic coupling and enhanced fearful face recognition. Our findings provide converging evidence for the recruitment of an afferent subcortical pulvinar connection to the amygdala that facilitates fear recognition. Editorial note: This article has been through an editorial process in which the authors decide how to respond to the issues raised during peer review. The Reviewing Editor's assessment is that minor issues remain unresolved (see decision letter).

Triple Dissociation of Attention and Decision Computations across Prefrontal Cortex

Article

Full-text available

Oct 2018
NAT NEUROSCI

Naturalistic decision-making typically involves sequential deployment of attention to choice alternatives to gather information before a decision is made. Attention filters how information enters decision circuits, thus implying that attentional control may shape how decision computations unfold. We recorded neuronal activity from three subregions of the prefrontal cortex (PFC) while monkeys performed an attention-guided decision-making task. From the first saccade to decision-relevant information, a triple dissociation of decision- and attention-related computations emerged in parallel across PFC subregions. During subsequent saccades, orbitofrontal cortex activity reflected the value comparison between currently and previously attended information. In contrast, the anterior cingulate cortex carried several signals reflecting belief updating in light of newly attended information, the integration of evidence to a decision bound and an emerging plan for what action to choose. Our findings show how anatomically dissociable PFC representations evolve during attention-guided information search, supporting computations critical for value-guided choice.

A neuronal theory of sequential economic choice

Article

Full-text available

Apr 2018

Results of recent studies point towards a new framework for the neural bases of economic choice. The principles of this framework include the idea that evaluation is limited to a single option within the focus of attention and that we accept or reject that option relative to the entire set of alternatives. Rejection leads attention to a new option, although it can later switch back to a previously rejected one. The option to which a neuron’s firing rate refers is determined dynamically by attention and not stably by labelled lines. Value is always computed relative to the value of rejection. Comparison results not from explicit competition between discrete populations of neurons, but indirectly, as in a horse race, from the fact that the first option whose value crosses a threshold is selected. Consequently, comparison can occur within a single pool of neurons rather than by competition between two or more neuronal populations. The computations that constitute comparison thus occur at multiple levels, including premotor levels, simultaneously (i.e. the brain uses a distributed consensus), and not in discrete stages. This framework suggests a solution to a set of otherwise unresolved neuronal binding problems that result from the need to link options to values, comparisons to actions, and choices to outcomes.

Covert shift of attention modulates the value encoding in the orbitofrontal cortex

Article

Full-text available

Mar 2018
eLife

During value-based decision making, we often evaluate the value of each option sequentially by shifting our attention, even when the options are presented simultaneously. The orbitofrontal cortex (OFC) has been suggested to encode value during value-based decision making. Yet it is not known how its activity is modulated by attention shifts. We investigated this question by employing a passive viewing task that allowed us to disentangle effects of attention, value, choice and eye movement. We found that the attention modulated OFC activity through a winner-take-all mechanism. When we attracted the monkeys' attention covertly, the OFC neuronal activity reflected the reward value of the newly attended cue. The shift of attention could be explained by a normalization model. Our results strongly argue for the hypothesis that the OFC neuronal activity represents the value of the attended item. They provide important insights toward understanding the OFC's role in value-based decision making.

Emergence of visually-evoked reward expectation signals in dopamine neurons via the superior colliculus in V1 lesioned monkeys

Article

Full-text available

Jun 2017
eLife

Responses of midbrain dopamine (DA) neurons reflecting expected reward from sensory cues are critical for reward-based associative learning. However, critical pathways by which reward-related visual information is relayed to DA neurons remain unclear. To address this question, we investigated Pavlovian conditioning in macaque monkeys with unilateral primary visual cortex (V1) lesions (an animal model of ‘blindsight’). Anticipatory licking responses to obtain juice drops were elicited in response to visual conditioned stimuli (CS) in the affected visual field. Subsequent pharmacological inactivation of the superior colliculus (SC) suppressed the anticipatory licking. Concurrent single unit recordings indicated that DA responses reflecting the reward expectation could be recorded in the absence of V1, and that these responses were also suppressed by SC inactivation. These results indicate that the subcortical visual circuit can relay reward-predicting visual information to DA neurons and integrity of the SC is necessary for visually-elicited classically conditioned responses after V1 lesion.

Lateral orbitofrontal cortex anticipates choices and integrates prior with current information

Article

Full-text available

Mar 2017

Adaptive behavior requires integrating prior with current information to anticipate upcoming events. Brain structures related to this computation should bring relevant signals from the recent past into the present. Here we report that rats can integrate the most recent prior information with sensory information, thereby improving behavior on a perceptual decision-making task with outcome-dependent past trial history. We find that anticipatory signals in the orbitofrontal cortex about upcoming choice increase over time and are even present before stimulus onset. These neuronal signals also represent the stimulus and relevant second-order combinations of past state variables. The encoding of choice, stimulus and second-order past state variables resides, up to movement onset, in overlapping populations. The neuronal representation of choice before stimulus onset and its build-up once the stimulus is presented suggest that orbitofrontal cortex plays a role in transforming immediate prior and stimulus information into choices using a compact state-space representation.

Specializations for reward-guided decision-making in the primate ventral prefrontal cortex

Article

May 2018
NAT REV NEUROSCI

The estimated values of choices, and therefore decision-making based on those values, are influenced by both the chance that the chosen items or goods can be obtained (availability) and their current worth (desirability) as well as by the ability to link the estimated values to choices (a process sometimes called credit assignment). In primates, the prefrontal cortex (PFC) has been thought to contribute to each of these processes; however, causal relationships between particular subdivisions of the PFC and specific functions have been difficult to establish. Recent lesion-based research studies have defined the roles of two different parts of the primate PFC — the orbitofrontal cortex (OFC) and the ventral lateral frontal cortex (VLFC) — and their subdivisions in evaluating each of these factors and in mediating credit assignment during reward-based decision-making.

Decoding subjective decisions from orbitofrontal cortex

Article

Jun 2016

When making a subjective choice, the brain must compute a value for each option and compare those values to make a decision. The orbitofrontal cortex (OFC) is critically involved in this process, but the neural mechanisms remain obscure, in part due to limitations in our ability to measure and control the internal deliberations that can alter the dynamics of the decision process. Here we tracked these dynamics by recovering temporally precise neural states from multidimensional data in OFC. During individual choices, OFC alternated between states associated with the value of two available options, with dynamics that predicted whether a subject would decide quickly or vacillate between the two alternatives. Ensembles of value-encoding neurons contributed to these states, with individual neurons shifting activity patterns as the network evaluated each option. Thus, the mechanism of subjective decision-making involves the dynamic activation of OFC states associated with each choice alternative.

Reward salience but not spatial attention dominates the value representation in the orbitofrontal cortex

Abstract and Figures

Recommended publications

Bottom-up but not Top-down Attention Dominates the Value Representation in the Orbitofrontal Cortex

Covert Shift of Attention Modulates the Value Encoding in the Orbitofrontal Cortex

Covert shift of attention modulates the value encoding in the orbitofrontal cortex

Economic Choices under Simultaneous or Sequential Offers Rely on the Same Neural Circuit