ArticlePDF AvailableLiterature Review

Human Extrastriate Visual Cortex and the Perception of Faces, Words, Numbers, and Colors

Authors:

Abstract

Electrophysiological correlates of the processing of visual information were studied in epileptic patients with electrodes chronically implanted on the surface of striate and extrastriate cortex. In separate experiments patients viewed faces, letter strings (words and non-words), numbers, and control stimuli. A negative potential, N200, was evoked by faces, letter strings, and numbers, but not by the control stimuli. N200 was recorded bilaterally from discrete regions of the fusiform and inferior temporal gyri. These category-specific face, letter-string, and number “modules” vary in location. In most cases there was no overlap in the location of face and letter-string modules, suggesting a mosaic of functionally discrete regions. In some cases letter-string and number N200s were recorded from the same location, suggesting that these modules may be less spatially and functionally discrete. Face N200-like potentials can be recorded from temporal scalp, allowing the possibility of studying early face processing in normal subjects. Longer-latency face-specific potentials were recorded from the inferior surface of the anterior temporal lobe. Potentials evoked by colored checkerboards were recorded from a region of the fusiform gyrus posterior to the fusiform region from which category-specific N200s were recorded. These results suggest that there are several processing streams in inferior extrastriate cortex. In addition to object recognition systems previously proposed for faces and words, our preliminary results suggest a separate system dealing with numbers. Postulated systems dealing with larger manipulable objects and animals have not been detected.
Human Extrastriate Visual Cortex
and the Perception
of
Faces,
Words, Numbers, and Colors
Truett Allison, Gregory McCarthy, Anna Nobre, Aim
Puce, and Aysenil Belger
Neuropsychology Laboratory,
VA
Medical Center,
West Haven, Connecticut 06516 and Department of
Neurology and Section of Neurosurgery, Yale
University School of Medicine, New Haven,
Connecticut 06510
Electrophysiological correlates of the processing of vi-
sual information were studied in epileptic patients with
electrodes chronically implanted on the surface of
stri-
ate and extrastriate cortex. In separate experiments pa-
tients viewed faces, letter strings (words and non-
words),
numbers, and control stimuli. A negative
potential,
N200, was evoked by
faces,
letter strings, and
numbers, but not by the control stimuli. N200 was re-
corded bilaterally from discrete regions of the fusiform
and inferior temporal
gyri.
These category-specific
face,
letter-string, and number "modules" vary in lo-
cation.
In most cases there was no overlap in the lo-
cation of face and letter-string modules, suggesting a
mosaic of functionally discrete regions. In some cases
letter-string and number N200s were recorded from the
same location, suggesting that these modules may be
less spatially and functionally discrete. Face N200-like
potentials can be recorded from temporal scalp, allow-
ing the possibility of studying early face processing in
normal subjects. Longer-latency face-specific poten-
tials were recorded from the inferior surface of the an-
terior temporal lobe. Potentials evoked by colored
checkerboards were recorded from a region of the fu-
siform gyrus posterior to the fusiform region from which
category-specific N200s were recorded.
These results suggest that there are several pro-
cessing streams in inferior extrastriate cortex. In ad-
dition to object recognition systems previously pro-
posed for faces and words, our preliminary results
suggest a separate system dealing with numbers. Pos-
tulated systems dealing with larger manipulate ob-
jects and animals have not been detected.
Little is known about the manner in which human
extrastriate cortex participates in the processing of
visual information. The ventral or inferior portion of
this region lies buried at the base of the brain over-
lying the cerebellum. It is not exposed by craniotomy
and has thus not been studied by the usual intraoper-
ative localizing techniques of cortical stimulation (e.g.,
to localize language-related cortex;e.g.,Ojemann et al.,
1989),
or evoked potential recording (e.g., to localize
sensorimotor cortex; e.g., Wood et al., 1988). Most of
what we know comes from the study of patients with
naturally occurring lesions, or from inferences derived
from anatomical and physiological studies in nonhu-
man primates.
In the last few years this situation has changed. Pos-
itron emission tomography (PET) now allows the
study of cortical regions participating in specific sen-
sory and cognitive tasks (e.g., Corbetta et al., 1991)-
Previously inaccessible cortex can now be studied
electrophysiologically in conjunction with chronic im-
plantation of electrodes in patients with medically in-
tractable epilepsy who are being evaluated for possi-
ble surgery (e.g., Fried et al., 1991). Here we review
the results of electrophysiological recordings in an ini-
tial series of 34 patients. The results suggest a consid-
erable degree of modularity in the processing of faces,
words, numbers, and colors, resulting in a mosaic of
functionally discrete regions in inferior extrastriate
cortex.
The Perception of Faces
Damage to the occipitotemporal region of the brain,
usually due to occlusion of a branch of the posterior
cerebral artery, may produce an inability to recognize
familiar faces. This condition is called face agnosia or
prosopagnosia (reviewed by Whitely and Warrington,
1977;
Damasio et al., 1982, 1990). Prosopagnosia is
usually produced by bilateral damage but can be pro-
duced by damage to the right hemisphere alone (e.g.,
Michel et al., 1989). A visual field defect, if present,
involves the upper quadrant of the affected field, dem-
onstrating that the essential lesion is located in infe-
rior extrastriate cortex.
We have been studying this conical region in pa-
tients with electrodes on inferior temporal and occip-
ital cortex. Given the clinical literature demonstrating
that this region of cortex is involved in face recogni-
tion, we asked whether viewing faces would generate
Cerebral Cortex Scp/Oct 1994:5:544-554; 1047-3211/94/J4.OO
at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from at University of North Carolina at Chapel Hill on November 1, 2013http://cercor.oxfordjournals.org/Downloaded from
facescrambled face
carscrambled car
butterfly
Fignre 1. Stimuli used to test fix face-specific neuronal activity.
face-specific neuronal activity. Five types of stimuli
were presented (Fig. 1). Faces were digitally scanned
from a college yearbook, cars were scanned from au-
tomotive magazines, and butterflies were scanned
from a field guide. The patient pressed a key to the
butterflies as a measure of attention. Graphics soft-
ware was used to rearrange the face and car images
to produce equiluminant scrambled images. The im-
ages followed a fixation point, subtended 7.2° X 7.2°
of visual angle, and were displayed for 250 msec at
intervals of
1.8-2.2
sec. Average luminance (101 cd/
m2) was not significantly different for the five types
of images. Each set of 110 stimuli consisted of 25 fac-
es,
cars, and their scrambled equivalents, and 10 but-
terflies. Three stimulus sets were presented; wave-
forms shown were averaged across all stimuli. Field
potentials were recorded from strips of stainless steel
electrodes resting on the cortical surface. Potentials
were recorded (referential to a mastoid) simultaneous-
ly from 32 or 64 locations using filter settings of
0.1
-
100 Hz and digitized at 250 Hz. Electrodes were mag-
netic resonance (MR) imaged to allow precise
localization in relation to sulci and gyri of occipito-
temporal cortex, as illustrated in Figure 2. Electrode
locations were also determined in the coordinate sys-
tem of Talairach and Tournoux (1988).
The most striking result of Initial recordings in 24
patients was that a large-amplitude (up to 200 \LV)
negative potential with a peak latency of about 200
msec (N200) was generated by faces but not by the
other categories of stimuli (Fig. 3; see also Allison et
al.,
1994). N200 was generated bilaterally in regions of
the fusiform and inferior temporal gyri indicated by
stippling in Figure 4. The response characteristics of
Cerebral Cortex Sep/Oct 1994, V 4 N 5 545
AB
Figure 2. MR localization
of
cortical surface electrodes (interelectrode distance, 1 cm): oblique axial M) and coronal (fl) images reconstructed along
a
plane
passing through the long axis of the electrode strip. In this example, electrode strip RMT extended from the medial lingual gyrus (location 1) to the lateral lingual
gyrus (location 2), fusiform gyrus (locations 3 and 4), and inferior temporal gyrus (locations 5-7). The electrode artifact is larger than the electrode itself (exposed
surface, 11 mm diameter). C, cuneus;
CBS,
calcarine sulcus;
CG,
cuneate gyrus;
CoS,
collateral sulcus;
FG,
fusiform gyrus;
ITG,
inferior temporal gyrus;
LLG,
lateral
lingual gyrus; MLG, medial lingual gyrus,
OTS,
occipitotemporal sulcus. Adapted from Allison
et
al. (1993).
scrambled (aces
cars
50J/V
400
MSFC800
Figure 3. Examples
of
locally generated field potentials evoked by the stimuli
of
Figure 1. A, Location of recording sites determined as in Figure 2. B, Initial face-
specific potential (N2C0) followed by a later face-specific positivity at 280 msec. (The face-specific positivity at 150 msec was often not present and when present
was usuaBy not face specific.) Patient
SPS;
Talairach coordinates x-34, y-49.
C,
N200 followed by a later negativity at 310 msec. Patient ONB; Talairach coordinates
x-29,
y-40.
D,
Visual evoked potentials recorded from the occipital pole. Patient AHK; Talairach coordinates
x-7,
y-95.
LG,
lingual gyrus;
PG,
parahippocampal gyrus;
other abbreviations are as in Figure
2.
548 Human Extrastrlate Cortex Allison ct al.
N200 were similar in both hemispheres and in the
fusiform and inferior temporal regions. In a given in-
dividual only a small portion of the total stippled re-
gion was activated; hence, there was considerable in-
terindividual variability in the location of these "face
modules." N200 occurred about 140 msec after the
initial activation of striate cortex by a visual stimulus,
and about 100 msec after initial activation of peristri-
ate cortex (reviewed by Allison et al., 1984). It is the
earliest face-specific activity we have recorded, sug-
gesting that a considerable amount of stimulus pro-
cessing occurs between initial activation of striate cor-
tex and activation of face modules. Cortical
stimulation at sites that generated N200 often pro-
duced a transient prosopagnosia (Allison et al., 1994),
suggesting that face modules are an essential part of
a pathway involved in face recognition.
In PET studies, Sergent et al. (1992) and Haxby et
al.
(1993) used face-matching and face-identification
tasks to identify regions of cortical activation. Of most
relevance in this context are the activated regions of
inferior extrastriate cortex. The centers of activation
in the left and right hemisphere identified by Sergent
et al. (Fig. 4) were close to the centers of the com-
bined fusiform/inferior temporal regions that generate
N200.
The right and left hemisphere anterior fusiform
regions identified by Haxby et
al.
are within the range
of fusiform N200 locations, while their posterior fusi-
form regions are slightly medial to our inferior tem-
poral N200 locations (Fig- 4). Considering the differ-
ences in tasks and techniques among the three
studies, there is good agreement regarding the regions
of inferior cortex involved in face perception. The PET
studies required across-subject averages to determine
regions of activation. It should be possible to image
brain regions involved in face processing in individual
subjects using functional MR techniques (e.g., McCar-
thy et al., 1993). In pilot studies we have imaged the
stippled region of Figure 4 using an echo-planar tech-
nique in conjunction with face and control stimuli
like those of Figure 1. We have not as yet obtained
clear MR activation by faces, perhaps because suscep-
tibility artifacts induced by the ear canals make this
region of inferior cortex difficult to image. Functional
MR studies, particularly in patients who have also
been studied electrophysiologically, will provide an-
other valuable technique for the study of face and ob-
ject recognition.
N200 was followed by later potentials that were
also face specific. There were two types of later activ-
ity: positive potentials with a peak latency of 250-350
msec (Fig. 3/J), or negative potentials with a peak la-
tency of 300-500 msec (Fig. 3C). The behavioral cor-
relates of these potentials are unknown; we surmise
that they will be more affected by attentional and
mnemonic variables than is N200. Visual evoked po-
tentials recorded from striate or peristriate cortex dif-
fered in several respects from the extrastriate poten-
tials:
initial peak latency was shorter (120 msec in the
example shown in Fig. 3-D), an off-potential followed
the initial on-potential by 250 msec (stimulus dura-
tion),
and these potentials were often largest to scram-
Flgure 4. Summary of functional regions
of
inferior extrastnate cortex. Sup-
pling, regions from which category-specific N200 potentials were recorded
to faces (Allison
et
al., 1994), letter strings, and numbers. Crosshatchmg,
regions from which the largest percentage of color potentials were recorded
(Allison et al., 1993).
C,
centers
of
PET activation in a color-attend task (Cor-
betta et
al.,
1991); H, centers of PET activation in a face-matching task (Haxby
et al., 1993);
I
centers
of
PET activation in
a
color task (Lueck
et
al., 1989),
5, centers of PET activation in a face-identification task (Sergent et al., 1992)
bled images, probably due to additional luminance-
contrast borders created by scrambling (Fig- 1)-
There is considerable evidence that the left and
right hemispheres process face information differently
(reviewed by Rhodes, 1993). Faces presented in the
left visual field in the usual upright orientation are
identified more rapidly (e.g., Geffen et al., 1971) and
more accurately (e.g., Leehey et al., 1978) than when
presented in the right visual field. However, the left
visual field (right hemisphere) advantage may disap-
pear when inverted faces are presented (Leehey et al.,
1978),
suggesting that the right hemisphere tends to
use configural features to form a gestalt of the face
whereas the left hemisphere tends to use a feature-
detection strategy Hemispheric differences in face
processing are also seen electrophysiologically (Fig.
5).
In this patient electrodes were located nearly sym-
metrically in the left and right fusiform gyrus. Upright
and inverted faces evoked essentially identical N200s
in the left hemisphere (Fig. 5B). In contrast, N200 re-
corded from the right hemisphere was considerably
smaller and 20 msec later when evoked by inverted
faces compared to upright faces (Fig. 5C). These ef-
fects are not due to inverted images per se, since up-
right and inverted cars evoked similar potentials in
both hemispheres. These effects were obtained with
faces presented centrally; how N200 and other face-
specific activities behave when stimuli are presented
Cerebral Cortex Scp/Oct 1994, V 4 N 5 547
inverted faces
inverted cars
N20050J/V
400800
figure 5. Interhemispheric differences
in
processing upright
and
inverted
faces.
A
Location
of
recording sites. Patient MBN.
B,
Potentials recorded
from
the
left hemisphere (Talairach coordinates x-43, y-51). C, Potentials
re-
corded from
the
right hemisphere (Talairach coordinates x33, y-47).
in the left and right visual fields remains to be deter-
mined.
In monkeys, face-selective neurons in the upper
wall of the superior temporal sulcus also show an in-
crease in latency when activated by inverted faces
compared with upright faces (Perrett et a!., 1988).
A
more direct comparison with our recordings might be
with single-unit recordings in ventral inferior temporal
cortex (area TE or ATT), which on anatomical and
physiological grounds (Desimone,
1991;
Tanaka et al.,
1991;Heywood and Cowey, 1992;Perrett et al., 1992;
Gross et al., 1993) is likely to be the monkey homolog
of the stippled region of Figure
4.
However, face-selec-
tive neurons in the ventral region (e.g., Yamane et al.,
1988;
Tanaka et al., 1991) have apparently not been
faces
scrambled faces
cars
400
MSEC800
figure 6. Face-specific potentials recorded from
the
anterior temporal lobe
A Location
of
recordings sites.
B,
Anterior negative potential. Patient
SGI;
Talairach coordinates x26,
yl
C, Anterior positive potential. Patient CLR;
Ta-
lairach coordinates x24,
y-8.
tested for orientation selectivity. In humans, a scalp-
positive potential in the 170-200 msec latency range
is also smaller and delayed when evoked by inverted
compared with upright faces (Jeffreys,
1993).
This po-
tential, best recorded from the vertex and larger to
face than to nonface stimuli (e.g., Seeck and Griisser,
1992;
Jeffreys,
1993),
may in part reflect the same neu-
ronal activity as does N200. An equivalent dipole
source in inferior cortex generating a surface-negative
potential (in this case N200) would generate an equiv-
alent P2OO as recorded above inferior cortex (in this
case from the upper
scalp),
similar to the topographic
pattern described for
auditor}'
potentials generated in
the superior temporal plane (Wood and Wolpaw,
1982).
In addition to the face-specific potentials recorded
from the stippled region of Figure 4, long-latency face-
specific potentials were recorded from the anterior
portion of inferior temporal cortex
(Fig. 6).
They were
548 Human Extrastriatc Cortex Allison et al.
variable in latency and waveform, and were negative
(Fig.
6ff) or positive
(Tig. 6C).
They were also variable
in location but seem to be concentrated in or near
entorhinal cortex, as in Figure 6, suggesting that they
reflect later stages of mnemonic or associative pro-
cessing of face information. These face-specinc poten-
tials are not P300-like potentials elicited by targets (re-
viewed by McCarthy et al., 1989) since at these
locations such potentials were not evoked by target
butterflies (Fig. 6B,C)
Shlomo Bentin at Hebrew University is studying
scalp-recorded potentials evoked by the stimuli of
Fig-
ure 1. A face-specific negativity at about 170 msec is
recorded from temporal scalp locations T, and T6, and
a slightly later positivity is recorded from the vertex
(C,) at about 180 msec
(Fig. 7).
The negativity is larger
over the right hemisphere (T6), perhaps reflecting the
greater role of the right hemisphere in processing face
information (Geffen et al.,
1971;
Whitely and Warring-
ton, 1977; Leehey et al., 1978; Michel et al., 1989;
Rhodes, 1993). While the scalp negativity and N200
are similar enough to suggest their correspondence,
their relationship is not altogether
clear.
T,
and T6 elec-
trodes are typically located over the posterior middle
temporal gyrus (Homan et al., 1987) and are therefore
not well placed to record potentials generated in in-
ferior cortex. Furthermore, we have never recorded a
face-specific N200 from the lateral surface of the tem-
poral lobe. The positivity recorded from the vertex
may reflect the same scalp-positive potential recorded
in other studies of face processing (e.g., Seeck and
Griisser, 1992; Jeffreys, 1993). The relationship be-
tween N200 and scalp-recorded potentials remains to
be clarified. In any case, the ability to record N200 or
N200-like face-specific potentials from the scalp is im-
portant because it should be possible to study face
processing developmental!^ in patients with proso-
pagnosia or other conditions that affect face recogni-
tion, and in other settings in which intracraniaJ re-
cordings cannot be obtained.
The Perception of Words
We have been studying potentials generated by con-
tent words (e.g., apple), by legal nonwords or pseu-
dowords (e.g., glimp), and by illegal nonwords (e.g.,
xpczt).
Letter strings were displayed for 500 msec at
intervals of 2.1-2.2 sec as red characters on a white
rectangular background, and subtended 5-10° of vi-
sual angle horizontally and 0.5-1.0° vertically. Re-
cording conditions were otherwise as described
above.
A
variety of word-related potentials were seen
in several cortical regions (Nobre and McCarthy,
1994).
Of most relevance to this review, focal nega-
tive potentials generated by word-related stimuli
were recorded from the same regions of the fusiform
and inferior temporal gyri that generated the face
N200.
The word-related N200 was evoked equally
well by content words and by legal and illegal non-
words (Fig. SB), that is, by any letter string. Face and
letter-string N200s were similar in waveform, but let-
ter-string N200s were typically shorter in peak laten-
cy by about 30 msec. Electrodes that recorded a let-
faces
scrambled faces
cars
scrambled cars
10;/V
400
MSEC800
Figure 7. Potentials recorded from the vertex (C,|, and from left (7,) and right
(7,1 posterior temporal scalp, averaged across 10 subjects.
ter-string N200 did not as a rule also record a face
N200,
suggesting a considerable degree of anatomical
and functional specificity of "letter-string modules"
in extrastriate cortex. As in the case of face N200s,
letter-string N200s were often followed by longer-la-
tency letter-string-specific potentials (e.g., Fig. SB,
the negativity at about 300 msec). Long-latency po-
tentials evoked by letter-strings were also recorded
from more anterior regions of the inferior temporal
lobe,
particularly near entorhinal cortex. This activity
is modified by word type and semantic priming (No-
bre and McCarthy, 1994).
The Perception
of
Numbers
Puce et al. (1994) recorded potentials generated in a
task in which patients learned to associate words
(concrete nouns and adjectives) with ideograms (Jap-
anese characters). In a distractor task simple arithme-
tic expressions (e.g., 4 + 3 = ?) were presented. Sur-
prisingly, these equations evoked an N200 in discrete
regions of extrastriate cortex. Words and ideograms
usually did not evoke an N200 at the same locations,
but an arithmetic operation was required and might
have been important in evoking the
potential.
To
char-
acterize this activity more systematically we have pre-
sented single or paired Arabic numbers and six types
of control
stimuli.
Stimuli were displayed for 300 msec
Cerebral Cortex Sep/Oct 1994, V 4 N 5 549
content words
faces
nonwords
pseudowords
arabic numbers
concrete nouns
faces
illegal nonwords
false fonts
+
N200
0400
MSEC
T
50pV
800
Figure 8. Potentials evoked
by
letter strings and numberi
A,
Location
of
recording sites. B, Letter-string N200. Patient
MKY;
Talairach coordinates x-45,
y-55.
C,
Number N200. Patient AFO; Talairach coordinates x-26, y-38.
at intervals
of
1.4-1.6
sec as
white characters
on a
black background, and subtended
1-12° of
visual
an-
gle horizontally
and 1.2°
vertically. Recording condi-
tions were otherwise
as
described above. Figure
8C
shows potentials recorded from
a
location
on the fu-
siform gyms.
A
"number
N2OO"
was recorded
to Ara-
bic numbers
but not to
faces, letter strings,
or
false
fonts (illegal number-like
or
letter-like characters). Ar-
abic numbers presented alone therefore sufficed
to
generate
a
number N200.
In
this recording
the re-
sponse selectivity
to
numbers
is
clear. However,
in
some cases number
and
letter-string N200s were
re-
corded from
the
same location. The degree
of
spatial
and functional differentiation
of
number
and
letter-
string modules remains
to be
determined.
All
locations
to date from which a number N200 has been recorded
were within
the
stippled region
of
Figure
4.
The Perception
of
Color
Damage
to
occipitotemporal cortex
may
produce
a
selective deficit
in the
perception
of
color. This
con-
dition
is
referred
to as a
cerebral
or
central achroma-
topsia
to
indicate that
the
color blindness
is
cerebral
rather than retinal
in
origin (reviewed
by
Damasio
et
al.,
1980; Zeki, 1990). Achromatopsia
is
often,
but not
necessarily, accompanied by
an
upper quadrant defect
in
the
affected half field, demonstrating that
the es-
sential lesion
is
located
in
inferior extrastriate cortex.
We have examined
the
involvement
of
this region
of cortex
in
color vision
by
recording potentials
evoked
by red and
blue checkerboards (Allison
et al.,
1993).
The checkerboards were displayed
at
intervals
of 3.5-4.0
sec in an
adaptation stimulus (1000 msec
duration)-test stimulus
(100
msec duration) design,
and subtended 21°
of
visual
angle horizontally and
16°
vertically. Recording conditions were otherwise
as de-
scribed above. Regions showing
a
high percentage
of
statistically significant color potentials
are
indicated
by crosshatching
in
Figure
4. The
color region
in-
volves
the
posterior portion
of
the fusiform gyrus and
may extend into
the
lateral portion
of the
lingual
gyrus,
regions that have been implicated
in the per-
ception
of
color since Verrey's description
of an ach-
romatopsia
in 1887 (see
Zeki, 1990). Electrical stim-
ulation
of the
same locations that show
a
significant
color effect often produces colored phosphenes
or,
less commonly, desaturation
of
color.
By
contrast,
in
our experience stimulation
of
striate cortex rarely pro-
duces color effects. We have
not
detected longer-la-
tency color-specific potentials
in
more anterior
regions
of
the temporal lobe. With
the
proper exper-
imental conditions such activity might
be
recordable,
for example,
in the
region
of the
left anterior lingual
gyrus thought
by
Damasio
and
Damasio (1983)
to be
involved
in
color naming.
Two PET studies
of
color perception have been
car-
ried out. Lueck
et al.
(1989) presented colored
and
equiluminant gray-scale stimuli,
and
activated
a pos-
terior region
of
fusiform gyrus (Fig.
4)
similar
to the
region
we
found electrophysiologically. Corbetta
et al.
(1991) presented stimuli that varied along several
di-
mensions including color, and found that
an
attend-to-
color condition activated
a
slightly more posterior
re-
gion
(Fig. 4).
Anatomical evidence (Clarke
and
Miklossy,
1990)
also suggests that
the
posterior fusi-
form gyrus
may be, or be a
part
of, the
human
ho-
molog
of
area V4
in
monkeys,
a
region implicated
in
the perception
of
color
and
form (reviewed
by
Zeki,
1990).
Many neurons
in
V4 are selective
for
color rath-
er than wavelength,
in the
sense diat their responses
correlate with
the
human perception
of
color; that
is,
they exhibit color constancy (Zeki, 1990). Responses
of neurons
in VI do not
exhibit this property. Thus,
according
to
Zeki, V4
is the
initial region
of
cortex
in
which color
is
treated
as
color, rather than wave-
length composition.
General Discussion
Early
Visual
Processing in Extrastriate Cortex
Even
at the
preliminary stage
of
investigation
re-
viewed here,
it
appears that four processing streams
559 Human Extra5triatc Cortex
Allison
et a].
can be discerned subsequent to initial processing in
striate and peristriate cortex. Three deal with specific
categories of objects (if faces, letter strings, and num-
bers can be considered "objects"), and the fourth
deals with color.
Faces
The stippled region of Figure 4 appears to be the ini-
tial region of cortex in which faces are treated as fac-
es ("this is a face"), rather than as, for example, an
irregular ellipse containing two ovals, a vertical line,
and a horizontal line. This assertion is based on the
internal evidence of specificity reviewed above, and
on the plausible (albeit unproven) assumption that
the neurons generating N200 are functionally similar
to the neurons in monkey temporal cortex that re-
spond selectively to faces or face components (re-
viewed by Desimone,
1991;
Harries and Perrett, 1991;
Perrett et al., 1992; Gross et al., 1993). Specifically, the
human neurons may be equivalent to neurons in the
anterior ventral portion of monkey inferior temporal
cortex, the first cortical locus along the ventral path-
way that contains face-selective neurons (Tanaka et al.,
1991;
Perrett et al., 1992; Gross et al., 1993). To take
the argument a step farther, face modules may be
functionally similar to the patches of monkey cortex
containing a relatively large percentage of face-selec-
tive neurons (Harries and Perrett, 1991). We cannot
estimate the size of a face module with any certainty
since usually only one electrode strip (oriented me-
diolaterally, or obliquely as in Fig. 2) happens to be
placed on the relevant cortex. However, N200 is usu-
ally recorded from only one electrode (and from not
more than two adjacent electrodes) of a strip, sug-
gesting that in the mediolateral dimension the module
is no larger than 1-2 cm wide.
Letter strings
The stippled region of Figure 4 may be the initial re-
gion of cortex in which letter strings are treated as
words or possible words ("this is a string of letters
that may be a word") rather than as a configuration
of straight and curved lines. Surprisingly, letter-string
modules treat words, pseudowords, and nonwords
equally, at least during the initial activity recorded as
a letter-string N200
(Fig.
8B).
This suggests that lexical
and semantic processing occurs later or, alternatively,
that a letter-string N200 reflects activity at an earlier
"pre-letter-string" stage of processing. Recordings us-
ing progressively simplified and decomposed stimuli
are needed to determine the necessary and sufficient
conditions to evoke a letter-string N200 (for the same
strategy applied to recordings in monkey inferior tem-
poral cortex, see Gross et al., 1972; Tanaka, 1993).
Numbers
Only recently have we begun recording potentials
evoked by numbers and arithmetic equations. How-
ever, preliminary results
(Fig.
8C) suggest that number
N200s have the same general properties as face and
letter-string N200s, and are also generated in the stip-
pled region of Figure
4.
A
number module may be the
first cortical region that treats numbers as numbers
("this is a number or arithmetic expression"). As in
the case of letter strings, additional recordings using
a variety of number and decomposed number-like
stimuli will be required to determine the response
properties of the presumptive number modules.
It is tempting to speculate that some cases of word
alexia and number alexia (reviewed by Damasio and
Damasio, 1983; Levin et al., 1993) are manifestations
of damage to letter-string and number modules, re-
spectively. If the initial processing of faces, letter-
strings, and numbers occurs in the same general re-
gion of extrastriate cortex, as our results suggest, one
would expect lesions of the region to produce cor-
related deficits in these functions. Such cases are the
exception rather than the rule. While it is true that
word and number alexia often occur together, their
association with prosopagnosia is infrequent. For ex-
ample, in 16 patients with alexia studied by Damasio
and Damasio (1983), only two had an associated pro-
sopagnosia. Such dissociations may simply reflect vari-
ability in the location of modules (discussed below).
Further work is needed to clarify the relationship be-
tween the neurophysiological and neuropsychological
results.
Later Visual Processing
Later stages of visual processing are believed to occur
in the anterior temporal lobe, based on several con-
siderations including the anterior projections of the
ventral pathway in monkeys (Van Essen et al., 1992;
Gross et al., 1993), single-unit recordings in humans
(Heit et al., 1988) and monkeys (Rolls, 1992), and PET
studies in humans (Sergent et al., 1992; Mclntosh et
al.,
1994). The anterior category-specific potentials
(e.g.,
Fig.
6) may reflect some portion of this later pro-
cessing. The apparent concentration of such locations
near entorhinal cortex suggests that the activity may
be related to mnemonic processes ("this is Bill Clin-
ton's face"). Later processing may also occur in mod-
ules and may be reflected in the potentials following
N200 (e.g., Fig. 3fi,C). These potentials might be
evoked by later feedforward activity in structures that
project to the fusiform/inferior temporal region, for
example, the human homolog of V4 that in the mon-
key provides the main visual input to inferior tempo-
ral cortex (Boussaoud et al.,
1991;
Gross et al., 1993).
Alternatively they might be evoked by feedback activ-
ity from downstream structures (Van Essen et al.,
1992).
Why Are Object Modules So Variable
in Location?
In a given patient we can predict that face, letter-
string, and number modules, if found, will be located
somewhere within the stippled regions of Figure 4.
But as far as we know there is no way of predicting
where within the region any module might be. Why
is this?
Children begin to deal widi written words and
numbers at the age of three or four, and an argument
could be made that letter-string and number modules
Cerebral Cortex Sep/Oct 1994, V 4 N 5 551
faces
cars
scrambled care
butterflies
5O.yV
400
MSEC800
Fgnre 9. Car-specific potential. A, Location of electrode she. B, Posftivrty at
320 msec evoked by cars. Patient BME; Talairach coordinates x27, y-3.
are formed catch-as-catch-can from "leftover" patches
of fusiform/inferior temporal cortex. But such an ar-
gument fails for face modules. Newborns attend to fac-
es more than to nonface stimuli of equal complexity,
and over their first 6 months infants become increas-
ingly skilled at discriminating faces (reviewed by Ca-
rey, 1992; Ellis, 1992). In addition, inferior temporal
neurons of infant monkeys show the same selectivity
for faces as do neurons of adult monkeys (Rodman et
al.,
1993).
Yet despite the availability of regions for the
establishment of anatomically fixed face modules,
such a deterministic process apparent!)' does not oc-
cur.
No individual would have had letter-string or num-
ber modules until very late (a few thousand years ago)
in human evolution, and few individuals had them un-
til a few hundred years
ago,
before the advent of wide-
spread literacy. In preliterate humans what occupied
the patches of cortex now occupied in most of us by
letter-string and number modules? Perhaps they were
occupied by modules for edible plants, or for danger-
ous animals, or more generally for whatever objects
were important for
survival.
Similarly,
it is possible that
category-specific modules develop in adult humans
depending on their interests and occupation. The
dairy fanner who developed a prosopagnosia and was
also unable to identify his cows (Bornstein et al.,
1969) may have had a "cow module" (operationally
defined as a region of fusiform/inferior temporal cor-
tex that generates a cow-specific N200). There is a
surprising degree of plasticity in adult sensory cortex
(reviewed by Merzenich et al., 1988; Kaas, 1991; Gil-
bert, 1993); perhaps the same principles of dynamic
reorganization apply to an even greater extent in ex-
trastriate cortex.
Of course, it is easy to take this line of reasoning to
improbable lengths. One would not want to posit a
module for each of the hundreds of categories of ob-
jects (typewriters, shoes, hands . . .) that we all rec-
ognize. It may be the case that a fairly small number
of object modules is essential for that object category
but also participates in the recognition of other ob-
jects.
This brings us to our final topic.
How Many Object Recognition Systems
Are There?
In my view, It is a vain pursuit to seek to isolate var-
ious subtypes of visual agnosia, e.g., for colours or for
faces.
But surely, the pinnacle of absurdity Is found in
the alleged dichotomy between an agnosia for inani-
mate as opposed to animate objects.
As
lief might one
seek to distinguish an agnosia for knives as against
forks,
which could equally well be claimed from the
case notes of some patients. ...
I
hazard the plea . .
for the abandonment of the term "agnosia" suggested
in 1891 by Freud. Though at the time it might have
been appropriate, subsequent writers have debased
the currency.—Macdonald Critchley
A persistent issue in the study of object recognition
is the structure of the system or systems involved. In
physiological terms, how many categories of objects
are important enough to be processed by dedicated
pathways and cortical regions? On the one hand are
those who would agree with Critchley (1965) that
there exists but one unitary system of object recog-
nition. On the other hand one cannot fail to be im-
pressed by the category-specific deficits (some per-
haps of the knife and fork variety, but others more
compelling) seen in patients with brain lesions. Over
the past 20 years many anatomical and neurophysio-
logical studies in monkeys demonstrate that after ini-
tial analysis in striate (VI) and peristriate (V2) cortex,
specialized processing of visual information is subcon-
tracted out to various regions of extrastriate cortex.
These processing streams can be conceptualized
broadly,
for example, Ungerleider and Mishkin's(1982)
dorsal "where is it?" stream and ventral "what is it?"
stream, or in baroque profusion, for example, the doz-
ens of specialized areas and hundreds of pathways
summarized by Van Essen et al. (1992). Furthermore,
psychophysical studies in humans suggest the exis-
tence of "channels," hypothetical entities having non-
overlapping sensitivities to classes of visual stimuli
and operating more or less independently (reviewed
by Regan, 1989). Thus, in the quarter century since
Critchley's jeremiad the atmosphere has changed. We
are now accustomed to thinking of parallel pathways
acting on specific subsets of sensory information.
Farah's (1990) review of the visual agnosias led her
to the conclusion that only two systems are required
552 Human Extrastrfcite Cortex Alltson ct al
for object recognition: one is required primarily for
written words and to a lesser extent for other objects,
and another is required primarily for faces and to a
lesser extent for other objects. The letter-string and
face recordings described above support this scheme,
but in addition suggest a third system dealing with
numbers.
If Farah's is the most parsimonious model of object
recognition (given that more than one system is pos-
tulated in the first place), Konorski (1967) by contrast
conceives of nine "gnostic fields" based primarily on
dissociations observed in patients. One of his catego-
ries is faces, consistent with our recordings. Another
category is "signs," which includes words and num-
bers;
our results suggest a subdivision of this category
as noted above. Two of Konorski's categories are "larg-
er partially manipulable objects" including cars, and
"animated objects" including
animals.
We
have record-
ed car-specific potentials in two patients; an example
is shown in Figure 9. These potentials consisted of a
broad, late positivity, and were not recorded from the
stippled region of Figure 4. We have never recorded
an N200 to cars or butterflies despite the fact that
these stimuli were included in all the face and number
recordings and most of the letter-string
recordings.
We
conclude either that "larger partially manipulable ob-
jects"
and "animated objects" do not exist as gnostic
fields (by our operational definition, modules gener-
ating a category-specific N200), or that we have yet
to identify effective objects within these valid fields.
However, the point to be emphasized is that these and
other models of object recognition can be tested em-
pirically with recordings like those reviewed here.
N200 is a recording of postsynaptic potentials in hun-
dreds or thousands of neurons with varying selectivi-
ties,
and in that respect human recordings cannot
achieve the degree of specificity obtainable in single-
unit recordings in monkeys. On the other hand, hu-
man subjects bring to these tasks a tremendous
amount of prior knowledge of objects and language
skills that we hope to exploit to further the under-
standing of object recognition.
Notes
This work
was
supported
by the
Veterans Administration
and
by
NIMH Grant MH-05286.
We
thank J
Claxton.J.
Jasior-
kowski,
and
K. McCarthy
for
assistance. M. Luby kindly
pro-
vided electrode localization.
We
are grateful
to
Drs.
D. D.
and
S. S. Spencer
and the
members
of
the Yale/VA Epilepsy Sur-
gery Program
for
their cooperation
in the
studies reviewed
here.
Correspondence should
be
addressed
to
Dr.
Truett Allison,
Neuropsychology Laboratory
1I6B1,
VA
Medical Center,
West Haven, CT 06516.
References
Allison
T,
Hume AL, Wood CC, Goff WR (1984) Develop-
mental
and
aging changes
in
somatosensory, auditory and
visual evoked potentials. Electroencephalogr Clin Neuro-
physiol 58:14-24.
Allison
T,
Begleiter
A,
McCarthy
G,
Roessler
E,
Nobre
AC,
Spencer DD (1993) Qectrophysiological studies
of
color
processing
in
human visual cortex. Electroencephalogr
Clin Neurophysiol 88:343-355.
Allison
T,
Ginter
H,
McCarthy
G,
Nobre
A,
Puce
A,
Luby
M,
Spencer
DD
(1994) Face recognition
in
human extra-
striate cortex. J Neurophysiol 71:821-825.
Bornstein
B,
Stroka H, Munitz
H
(1969) Prosopagnosia with
animal face agnosia. Cortex 5:164-169.
Boussaoud D, Desimone R, Ungerleider
LG
(1991) Visual
to-
pography
of
area TEO
in the
macaque.) Comp Neurol
306:554-575.
Carey S (1992) Becoming
a
face expert. Philos Trans R
Soc
Lond [Biol] 335:95-103.
Clarke S, Miklossy
J
(1990) Occipital cortex
in
man: orga-
nization
of
callosal connections, related myelo-
and cy-
toarchitecture,
and
putative boundaries
of
functional
vi-
sual
areas.
J
Comp Neurol 298188-214.
Corbetta M, Miezin FM, Dobmeyer S, Shulman GL, Peterson
SE (1991) Selective
and
divided attention during visual
discriminations
of
shape, color,
and
speed: functional
anatomy
by
positron emission tomography.
J
Neurosci
11:2383-2402.
Critchley
M
(1965)
An
appraisal
of
visual (object) agnosia
a tribute
to
Professor Otto Potzl. In: Transactions
of the
8th International Congress on Neurology,
Vol
5
(Konstans
J, ed). Amsterdam: Elsevier. Reprinted
in: The
citadel
of
the senses (Critchley
M,
ed). New York: Raven, 1986.
Damasio AR, Damaslo H (1983) The anatomic basis
of
pure
alexia. Neurology 33:1573-1583.
Damasio
A,
Yamada T, Damasio
D,
Corbett
J,
McKee J (1980)
Central achromatopsia: behavioral, anatomic, and physio-
logic aspects Neurology 30:1064-1071.
Damasio AR, Damasio
H,
Van Hoesen GW (1982) Prosopag-
nosia: anatomic basis
and
behavioral mechanisms
Neu-
rology 32:331-341.
Damasio AR, Tranel D, Damasio
H
(1990) Face agnosia
and
the neural substrates
of
memory. Annu Rev Neurosci
13:
89-109.
Desimone R (1991) Face-selective cells
in the
temporal cor-
tex
of
monkeys. J Cognit Neurosci 3:1-8.
FJlis HD (1992)
The
development
of
face processing skills.
Philos Trans R
Soc
Lond [Biol] 335:105-111.
Farah MJ (1990) Visual agnosia Cambridge,
MA:
MIT Press.
Fried
I,
Katz A, McCarthy
G,
Sass KJ, Williamson P, Spencer
SS,
Spencer
DD
(1991) Functional organization
of hu-
man supplementary motor cortex studied
by
electrical
stimulation. J Neurosci 11:3656-3666.
Geffen G, Bradshaw
JL,
Wallace
G
(1971) Interhemispheric
effects
on
reaction time
to
verbal
and
nonverbal visual
stimuli.
J
Exp
Psychol 87:415-422.
Gilbert CD (1993) Circuitry, architecture, and functional dy-
namics
of
visual cortex. Cereb Cortex 3:373-386.
Gross
CG,
Rocha-Miranda
CE,
Bender
DB
(1972) Visual prop-
erties
of
neurons
in
inferior temporal cortex
of the ma-
caque.
J
Neurophysiol 35:96-111.
Gross CG, Rodman HR, Gochin PM, Colombo
MW
(1993)
Inferior temporal cortex
as a
pattern recognition device.
In: Proceedings
of
the
3d
NEC Conference Research Sym-
posium, Computational learning
and
cognition (Baum
E,
cd),
pp
44-73.
Philadelphia:SIAM.
Harries MH, Perrett
Dl
(1991) Visual processing
of
faces
in
temporal cortex: physiological evidence
for
modular
or-
ganization
and
possible anatomical correlates.
)
Cognit
Neurosci 3:9-24.
Haxby JV, Grady
CL,
Horwitz B, Salerno
J,
Ungcrieider
LG,
Mishkln M, Schapiro MB (1993) Dissociation
of
object
and spatial visual processing pathways
In
human extra-
striate cortex.
In:
Functional organisation
of the
human
visual cortex (Gulyas B, Ottoson D, Roland PE, eds),
pp
329-340. Oxford: Pergamon.
Hcit G, Smith ME, Halgren
E
(1988) Neural encoding
of
in-
dividual words and faces by the human hippocampus and
amygdala. Nature 333:773-775.
Heywood CA, Cowey
A
(1992)
The
role
of the
"face-cell"
area
in the
discrimination
and
recognition
of
faces
by
monkeys. Philos Trans R Soc Lond (Biol] 335:31-38.
Homan
RW,
Herman J, Purdy
P
(1987) Cerebral location
of
international
10-20
system electrode placements. Elec-
troencephalogr Clin Neurophysiol 66:376-382.
Cerebral Cortex Scp/Oct
1994, V 4 N 5 553
Jeffreys DA (1993) The influence of stimulus orientation on
the vertex positive scalp potential evoked by faces. Exp
Brain Res 96:163-172.
Kaas JH (1991) Plasticity of sensory and motor maps in
adult mammals. Annu Rev Neurosci 14:137-167.
Konorski J (1967) Integrative activity of the brain. Chicago:
The University of Chicago.
Leehey S, Carey S, Diamond R, Cahn A (1978) Upright and
inverted faces: the right hemisphere knows the differ-
ence.
Cortex 14:411-419.
Levin HS, Goldstein FC, Spiers PA (1993) Acalculia. In- Clin-
ical neuropsychology (Heilman
KM,
Valenstein
E,
eds), pp
4O9-460. London: Oxford UP
Lueck CJ, Zeki S, Friston KJ, Cope P, Cunningham VJ, Lam-
mertsma
AA,
Kennard C, Frackowiak RSJ (1989) The col-
our centre in the cerebral cortex of man. Nature 340:
386-389.
McCarthy G, Wood CC, Williamson PD, Spencer DD (1989)
Task-dependent field potentials in human hippocampal
formation. J Neurosci 9:4253-4268
McCarthy G, Blamire AM, Rothman DL, Gruetter R, Shulman
RG (1993) Echo-planar magnetic resonance imaging
studies of frontal cortex activation during word genera-
tion in humans. Proc Natl Acad Sci USA 90:4952-4956.
Mclntosh AR, Grady CL, Ungerleider
LG,
Haxby
JV,
Rapoport
SI,
Horwitz B (1994) Network analyse of cortical visual
pathways mapped with
PET.
J
Neurosci 14:655-666.
Merzenich MM, Recanzone
G,
Jenkins WM, Allard TT, Nudo
RJ (1988) Cortical representational plasticity. In: Neuro-
biology of neocortex (Rakic
P,
Singer W, eds), pp 42-67.
Berlin: Wiley.
Michel
F,
Poncet M, Signoret JL (1989) Les lesions respons-
ables de la prosopagnosie sontelles toujours bilateraies?
Rev Neurol (Paris) 146:764-770.
Nobre
AC,
McCarthy G (1994) Language-related field poten-
tials in the anterior-medial temporal
lobe:
effects of word-
type and semantic
priming.
J
Neurosci, in press.
Ojemann
G,
Ojemann J, Lertich E, Berger
M
(1989) Cortical
language localization in left, dominant hemisphere: an
electrical stimulation mapping investigation in 117 pa-
tients.
J
Neurosurg 71:316-326.
Perrett Dl, Mlstlin AJ, Chitty AJ, Smith PAJ, Potter DD, Broen-
nimann R, Harries
M
(1988) Specialized face processing
and hemispheric asymmetry in man and monkey evi-
dence from single unit and reaction time studies. Behav
Brain Res 29:245-258.
Perrett Dl, Hietanen JK, Oram MW, Benson PJ (1992) Orga-
nization and functions of cells responsive to faces in the
temporal cortex. Philos Trans R Soc Lond [Biol] 335:23-
30
Puce A, Nobre
AC,
Allison T, Luby
ML,
McCarthy
KM,
McCar-
thy G (1994) Event-related potential correlates of visual
recognition modules in extrastriate cortex. Paper pre-
sented at the first annual meeting of the Cognitive Neu-
roscience Society, San Francisco.
Regan D (1989) Human brain electrophysiology: evoked po-
tentials and evoked magnetic fields in science and med-
icine New York: Qsevier.
Rhodes G (1993) Configural coding, expertise, and the right
hemisphere advantage for face recognition. Brain Cognit
22:19-41.
Rodman HR, 6 Scalaidhe SR Gross CG (1993) Response
properties of neurons in temporal cortical visual areas of
infant
monkeys.
J
Neurophysiol 70:1115-1136.
Rolls ET (1992) Neurophysiology and functions of the pri-
mate amygdala. In: The amygdala (Aggleton JP, ed), pp
143-165-
New York: Wiley.
Seeck M, Grusser OJ (1992) Category-related components
in visual evoked
potentials:
photographs of faces, persons,
flowers and tools as stimuli. Exp Brain Res 92:338-349.
Sergent J, Ohta S, MacDonald B (1992) Functional neuro-
anatomy of face and object processing: a positron emis-
sion tomography study. Brain 115:15-36.
Talairach J, Tournoux P (1988) Co-planar stereotaxic atlas
of the human brain. New York: Thieme
Tanaka K (1993) Neuronal mechanisms of object recogni-
tion. Science 262:685-688.
Tanaka K, Saito HA, Fukada
Y,
Moriya
M
(1991) Coding the
visual image of objects in the inferior temporal cortex of
the macaque monkey. J Neurophysiol 66:170-189.
Ungerleider LG, Mishkin M (1982) Two cortical visual sys-
tems.
In: Analysis of visual behavior Ongle DJ, Goodale
MA,
Mansfield
RJW,
eds). Cambridge,
MA:
MIT Press.
Van Essen DC, Anderson CH, Felleman DJ (1992) Informa-
tion processing in the primate visual system: an integrat-
ed systems perspective. Science 255:419-423-
WhitelyAM.WarringtonEK (1977) Prosopagnosia:
a
clinical,
psychological, and anatomical study of three patients. J
Neurol Neurosurg Psychiatry 40:395-403-
Wood CC, Wolpaw JR (1982) Scalp distribution of human
auditory evoked potentials. II. Evidence for overlapping
sources and involvement of auditory cortex. FJectroen-
cephalogr Clin Neurophysiol 54:25-38.
Wood
CC,
Spencer
DD,
Allison
T,
McCarthy
G,
Williamson PD,
Goff WR (1988) Localization of human sensorimotor
cortex during surgery by cortical surface recording of
somatosensory evoked potentials. J Neurosurg 68:99-
111.
Yamane S, Kaji S, Kawano K (1988) What facial features
activate face neurons in the inferotemporal cortex? Exp
Brain Res 73-209-214.
Zeki SM (1990) A century of cerebral achromatopsia. Brain
1131721-1777.
554 Human Extrastriate Cortex Allison et a).
... It has been observed that the colouring areas are located in V4 (Lueck et al., 1989;Zeki & Marini, 1998;Hadjikhani et al., 1998). Furthermore, the grapheme perception area is located in the fusiform gyrus (Allison et al., 1994) which is very close to V4. Brang et al. (2010) also showed increased activity of the brain area close to V4 in synesthetes. This suggests the areas associated with the most common form of synesthesia, the perception of graphemes and the perception of colours, are located near each other. ...
Article
Full-text available
Grapheme-color synesthesia (GC-S) is a neurological condition in which the perception of a grapheme elicits the experience of color or even the visual representation of that grapheme as colored. Previous research using transcranial magnetic stimulation on GC-synesthetes demonstrated enhanced excitability of the visual cortex. Consequently, we hypothesized that using anodal "offline" transcranial direct current stimulation on the visual cortex in area V4 followed by visual training could boost cortical excitability in the target areas and thus produce effects similar to GC-S in non-synesthetes. We discovered that after anodal stimulation, participants had a considerably smaller mean change in reaction time on white symbols than after sham and cathodal stimulation. Our findings indirectly support the cross-activation hypothesis.
... is an occipital neural response reflecting the operation of the extra-striate area and fusiform gyrus in the early stage [37]. This pathway in connection with the extra-striate area and fusiform gyrus also processes the early cognitive mechanisms of word, size, and color recognition [38]. Hence, showing a larger P1 is reasonable for managing the additional load added by cycling [7]. ...
Article
Full-text available
Background Various neurocognitive tests have shown that cycling enhances cognitive performance compared to resting. Event-related potentials (ERPs) elicited by an oddball or flanker task have clarified the impact of dual-task cycling on perception and attention. In this study, we investigate the effect of cycling on cognitive recruitment during tasks that involve not only stimulus identification but also semantic processing and memory retention. Methods We recruited 24 healthy young adults (12 males, 12 females; mean age = 22.71, SD = 1.97 years) to perform three neurocognitive tasks (namely color-word matching, arithmetic calculation, and spatial working memory) at rest and while cycling, employing a within-subject design with rest/cycling counterbalancing. Results The reaction time on the spatial working memory task was faster while cycling than at rest at a level approaching statistical significance. The commission error percentage on the color–word matching task was significantly lower at rest than while cycling. Dual-task cycling while responding to neurocognitive tests elicited the following results: (a) a greater ERP P1 amplitude, delayed P3a latency, less negative N4, and less positivity in the late slow wave (LSW) during color-word matching; (b) a greater P1 amplitude during memory encoding and smaller posterior negativity during memory retention on the spatial working memory task; and (c) a smaller P3 amplitude, followed by a more negative N4 and less LSW positivity during arithmetic calculation. Conclusion The encoding of color-word and spatial information while cycling may have resulted in compensatory visual processing and attention allocation to cope with the additional cycling task load. The dual-task cycling and cognitive performance reduced the demands of semantic processing for color-word matching and the cognitive load associated with temporarily suspending spatial information. While dual-tasking may have required enhanced semantic processing to initiate mental arithmetic, a compensatory decrement was noted during arithmetic calculation. These significant neurocognitive findings demonstrate the effect of cycling on semantic-demand and memory retention-demand tasks.
... These regions are considered responsible for auditory and visual functions (Rezk et al., 2020;Robins, Hunyadi, & Schultz, 2009). The inferior temporal gyrus is one component of the higher levels of the ventral stream of audio and visual processing and is associated with the representation of complex object features (Allison, McCarthy, Nobre, Puce, & Belger, 1994;Lewald, Staedtgen, Sparing, & Meister, 2011). The middle temporal gyrus is highly specialized to process visual motion (Sulpizio et al., 2022;Tootell et al., 1995). ...
Article
Full-text available
Objective: This study sought to investigate brain responses to positive and negative events in individuals with internet gaming disorder (IGD) during real gaming as a direct assessment of the neural features of IGD. This investigation reflects the neural deficits in individuals with IGD while playing games, providing direct and effective targets for prevention and treatment of IGD. Methods: Thirty subjects with IGD and fifty-two matched recreational game use (RGU) subjects were scanned while playing an online game. Abnormal brain activities during positive and negative events were detected using a general linear model. Functional connectivity (FC) and correlation analyses between neural features and addiction severity were conducted to provide additional support for the underlying neural features. Results: Compared to the RGU subjects, the IGD subjects exhibited decreased activation in the dorsolateral prefrontal cortex (DLPFC) during positive events and decreased activation in the middle frontal gyrus (MFG), precentral gyrus and postcentral gyrus during negative events. Decreased FC between the DLPFC and putamen during positive events and between the MFG and amygdala during negative events were observed among the IGD subjects. Neural features and addiction severity were significantly correlated. Conclusions: Individuals with IGD exhibited deficits in regulating game craving, maladaptive habitual gaming behaviors and negative emotions when experiencing positive and negative events during real game-playing compared to RGU gamers. These abnormalities in neural substrates during real gaming provide direct evidence for explaining why individuals with IGD uncontrollably and continuously engage in game playing, despite negative consequences.
Article
Do early effects of predictability in visual word recognition reflect prediction error? Electrophysiological research investigating word processing has demonstrated predictability effects in the N1, or first negative component of the event-related potential (ERP). However, findings regarding the magnitude of effects and potential interactions of predictability with lexical variables have been inconsistent. Moreover, past studies have typically used categorical designs with relatively small samples and relied on by-participant analyses. Nevertheless, reports have generally shown that predicted words elicit less negative-going (i.e., lower amplitude) N1s, a pattern consistent with a simple predictive coding account. In our preregistered study, we tested this account via the interaction between prediction magnitude and certainty. A picture-word verification paradigm was implemented in which pictures were followed by tightly matched picture-congruent or picture-incongruent written nouns. The predictability of target (picture-congruent) nouns was manipulated continuously based on norms of association between a picture and its name. ERPs from 68 participants revealed a pattern of effects opposite to that expected under a simple predictive coding framework.
Preprint
Full-text available
Proficient reading in a second language (L2) requires efficient processing of its orthographic symbols. We examined the relationship between a reader’s proficiency in English and electrophysiological markers of efficiency while processing English and Chinese orthographies. We recruited 49 adult late Chinese learners of English, who participated in an implicit single-letter processing task (alphabet letters vs. pseudoletters) and Chinese character processing task (characters vs. pseudocharacters) while an electroencephalogram measured their evoked potentials. The difference between letters and pseudoletters was manifested in the N170 and P2 ranges in a posterior-occipital distribution. The N170 pseudoletter effect in the left hemisphere correlated with L2 proficiency. We concluded that single-letter processing in L2 is still gaining efficiency as they learn to read English as a late-learn language. Future cross-linguistic investigations are necessary to determine whether this L2 effect generalizes to other L2 populations.
Book
Full-text available
In this textbook it is examined in depth the undeniable relationship between cognitive and motor development and the cognitive processes involved in motor learning. Movement is a main feature of human life considering that through movement a person can explore his/her environment, show self-handling skills, expresses himself/herself through sports, arts or even movement can function as an end in itselfDue to this close interrelation between cognitive and motor development, difficulties or disorders in one area affects the other. The theories that are presented in this course book are approached in order to point out this interrelation and the development of the factors that have an effect in it (e.g. heredity–environment). The presence of difficulties or disorders influences all aspects of development in infants, children, adolescents, and later adults. People that deviate from “normal” are affected more in all aspects of development (cognitive, motor, emotional, social etc.). Motor and cognitive evaluation is valuable for professionals to understand better the weaknesses and the abilities of children, especially in those interrelated developmental components. A dynamic intervention model is suggested that considers the aforementioned relationship. The ultimate objective of this intervention model is the development (enhancement) of both knowledge and experience that will help children to scaffold his/her difficulties in everyday living (e.g. free play, playground etc.). Additionally in this textbook the best practices are presented in Developmental and Adapted Physical Education. Several activities are proposed and their aim is not exclusively the motor development but also the cognitive as the child is studied as a whole.
Chapter
Evoked responses (ERs) are a set of event-related small changes in voltage or magnetic field measured from the brain that arise in time- and phase-locked manner to external stimulation. ERs can be directly extracted from the electroencephalography (EEG) and magnetoencephalography (MEG) recordings by averaging across stimuli. In the context of language, ERs can be used to study the different stages of processing by varying the stimulus characteristics, the way the stimuli are presented, or the task requirements of the subjects. The recorded effects in timing and amplitude at different latencies to these manipulations reflect the sensitivity of neural computations to specific processes of language perception and comprehension, from the stage of sensory analysis and perceptual processing of spoken or written linguistic signals up to the level of understanding of the meaning of complex sentences or verbal working memory. In this chapter, we will introduce the main evoked response components that are regularly used in cognitive neuroscience to characterize brain activation related to language processing in the brain. We will also provide a review of the methodological aspects related to the recording and analysis of the main responses used in the study of language, including source modeling algorithms that facilitate the extraction and separation of ER components more accurately than what is possible at the level of MEG/EEG sensors. The aim of this chapter is to provide general understanding of the use of evoked responses in language research, also with practical notions.Key wordsEvoked responses Language processing Language perception N400 Neural components Source modeling Beamformers Data acquisition Data analysis
Article
Full-text available
Young children do not form representations of newly encountered faces as efficiently as do adults. A first step in explaining this difference, like any age-related change, is locating its source. A major source of the improvement is acquisition of knowledge of faces per se, as opposed to age-related changes in general pattern encoding or memorial skills. Two consequences of expertise at individualizing members of classes that share a basic configuration are known: a large inversion effect and a caricature advantage. It is possible that both of these effects reflect increased reliance, with expertise, on configuration distinguishing features. Several phenomena that indicate that inversion interferes with the encoding of configural aspects of faces are reviewed. Finally, developmental data are presented that confirm the suspicion that there are at least two distinct sources of the vulnerability of face encoding to inversion, perhaps reflecting two distinct senses of 'configural encoding' of faces, only one of which is implicated in adult expertise at face encoding.
Article
Full-text available
The notion of a neuron that responds selectively to the image of a particular complex object has been controversial ever since Gross and his colleagues reported neurons in the temporal cortex of monkeys that were selective for the sight of a monkey's hand (Gross, Rocha-Miranda, & Bender, 1972). Since that time, evidence has mounted for neurons in the temporal lobe that respond selectively to faces. The present paper presents a critical analysis of the evidence for face neurons and discusses the implications of these neurons for models of object recognition. The paper also presents some possible reasons for the evolution of face neurons and suggests some analogies with the development of language in humans.
Article
Physiological recordings along the length of the upper bank of the superior temporal sulcus (STS) revealed cells each of which was selectively responsive to a particular view of the head and body. Such cells were grouped in large patches 3-4 mm across. The patches were separated by regions of cortex containing cells responsive to other stimuli. The distribution of cells projecting from temporal cortex to the posterior regions of the inferior parietal lobe was studied with retrogradely transported fluorescent dyes. A strong temporoparietal projection was found originating from the upper bank of the STS. Cells projecting to the parietal cortex occurred in large patches or bands. The size and periodicity of modules defined through anatomical connections matched the functional subdivisions of the STS cortex involved in face processing evident in physiological recordings. It is speculated that the temporoparietal projections could provide a route through which temporal lobe analysis of facial signals about the direction of others' attention can be passed to parietal systems concerned with spatial awareness.
Article
consider the functions of the primate amygdala in the light of the responsiveness of single neurons within the structure / the neuronal responsiveness is most informative when activity is recorded while the amygdala is functioning normally, and in situations in which the amygdala is required / the neurophysiology must thus proceed closely with lesion studies which help assess the functions of the amygdala / it is important to analyse the information processing being performed by neurons in the amygdala, and for this reason some evidence on the input and output connections of the amygdala, and on the neuronal activity in these input and output regions, is considered / this helps to provide an understanding of how the amygdala operates at the systems level of brain function, and in particular how it transforms the inputs it receives, and what effects the results of its computations have on output regions / particular attention is paid to research in nonhuman primates (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
Theory of learning and perception inspired by contemporary neurophysiology. Harvard Book List (edited) 1971 #150 (PsycINFO Database Record (c) 2012 APA, all rights reserved)
Article
The scalp distributions of human auditory evoked potentials (AEPs) between 20 and 250 msec were investigated using non-cephalic reference recordings. AEPs to binaural click stimuli were recorded simultaneously from 20 scalp locations over the right hemisphere in 11 subjects. Computer-generated isovoltage topographic maps at high temporal resolution were used to assess the stability of AEP scalp distributions over time and relate them to major peaks in the AEP wave forms. For potentials between 20 and 60 msec, the results demonstrate a stable scalp distribution of dipolar form that is consistent with sources in primary auditory cortex on the superior temporal plant near the temporoparietal junction. For potentials between 60 and 250 msec, the results demonstrate changes in AEP morphology across electrode locations and changes in scalp distribution over time that lead to two major conclusions. First, AEPs in this latency period are generated by multiple sources which partially overlap in time. Second, one or more regions of auditory cortex contribute significantly to AEPs in this period. Additional data are needed to determine the relative contribution of auditory cortex sources on the superior temporal plane and the lateral temporal surface and to identify AEP sources outside the temporal lobe.