Psychological Review
1982, Vol. 89, No. 5, 573-594
Copyright 1982 by the American Psychological Association, Inc. 0033-295X/82/8905-0573$00.75
An Activation-Verification Model for Letter and Word Recognition: The Word-Superiority Effect

Kenneth R. Paap, Sandra L. Newsome, James E. McDonald, and Roger W. Schvaneveldt
New Mexico State University
An activation-verification model for letter and word recognition yielded predictions of two-alternative forced-choice performance for 864 individual stimuli that were either words, orthographically regular nonwords, or orthographically irregular nonwords. The encoding algorithm (programmed in APL) uses empirically determined confusion matrices to activate units in both an alphabetum and a lexicon. In general, predicted performance is enhanced when decisions are based on lexical information, because activity in the lexicon tends to constrain the identity of test letters more than the activity in the alphabetum. Thus, the model predicts large advantages of words over irregular nonwords, and smaller advantages of words over regular nonwords. The predicted differences are close to those obtained in a number of experiments and clearly demonstrate that the effects of manipulating lexicality and orthography can be predicted on the basis of lexical constraint alone. Furthermore, within each class (word, regular nonword, irregular nonword) there are significant correlations between the simulated and obtained performance on individual items. Our activation-verification model is contrasted with McClelland and Rumelhart's (1981) interactive activation model.
Portions of this research were presented at the meetings of the Psychonomic Society, St. Louis, November 1980; the Southwestern Psychological Association, Houston, April 1981; and the Psychonomic Society, Philadelphia, November 1981. The project was partially supported by Minigrant Award 1-2-02190 from the Arts and Sciences Research Center at New Mexico State University. We would like to thank Ron Noel, Jerry Sue Thompson, and Wayne Whitemore for their contributions to various stages of this research. Also, we appreciate the thoughtful reviews of a first draft of this paper provided by Jay McClelland, Dom Massaro, and Garvin Chastain. Sandra Newsome is now at Rensselaer Polytechnic Institute in Troy, New York. James McDonald is now at IBM in Boulder, Colorado. Requests for reprints should be sent to Kenneth R. Paap, Department of Psychology, Box 3452, New Mexico State University, Las Cruces, New Mexico 88003.

The goal of the activation-verification model is to account for the effects of prior and concurrent context on word and letter recognition in a variety of experimental paradigms (McDonald, 1980; Paap & Newsome, Note 1, Note 2; Paap, Newsome, & McDonald, Note 3; Schvaneveldt & McDonald, Note 4). An interactive activation model, inspired by the same set of sweeping goals, has recently been described by McClelland and Rumelhart (1981). Although the models complement one another nicely with regard to some aspects, we will contrast the two approaches in our final discussion and highlight the very important differences between them.
The verification model was originally developed to account for reaction time data from lexical-decision and naming tasks (Becker, 1976, 1980; Becker & Killion, 1977; McDonald, 1980; Schvaneveldt & McDonald, 1981; Schvaneveldt, Meyer, & Becker, 1976; Becker, Schvaneveldt, & Gomez, Note 5). Although the various discussions of the verification model differ about certain details, there has been general agreement about the basic structure of the model. The basic operations involved in word and letter recognition are encoding, verification, and decision. We refer to the model described in the present paper as the activation-verification model to emphasize the extensive treatment given to encoding processes that are based on activation of letter and word detectors. The activation process shares many features with the logogen model proposed by Morton (1969). In the activation-verification model, we have attempted to formalize earlier verbal statements about the verification model. As we will show, this formalization permits a quantitative evaluation of aspects of the model with data from the word-superiority paradigm.
The activation-verification model consists of encoding, verification, and decision operations. Encoding is used to describe the early operations that lead to the unconscious activation of learned units in memory. In the case of words, the most highly activated lexical entries are referred to as the set of candidate words. Verification follows encoding and usually leads to the conscious recognition of a single lexical entry from the set of candidates. Verification should be viewed as an independent, top-down analysis of the stimulus that is guided by a stored representation of a word. Verification determines whether a refined perceptual representation of the stimulus word is sufficiently similar to a particular word, supported by the evidence of an earlier, less refined analysis of the stimulus. This general definition of verification is sufficient for the current tests of the activation-verification model, but more specific assumptions have been suggested (e.g., Becker, 1980; McDonald, 1980; Schvaneveldt & McDonald, 1981) and could be the focus of future work.
For example, verification has been described as a comparison between a prototypical representation of a candidate word and a holistic representation of the test stimulus. However, within the framework of our model, we could just as easily suggest that verification involves a comparison between the letter information available in an activated word unit and the updated activity of the letter units in the alphabetum.
The verification process has been instantiated in a computer simulation that mimics the real-time processing involved in verification (McDonald, 1980). The simulated verification process is a serial-comparison operation on the set of candidate words generated during encoding. Thus, verification results in a match or mismatch. If the degree of fit between the visual evidence and the candidate word exceeds a decision criterion, then the word is consciously recognized. If the match does not exceed the criterion, then the candidate is rejected and the next candidate is verified. Semantic context affects the definition of the candidate set, whereas word frequency affects the order of verification for words in the candidate set. Those words in the candidate set that are related to the context will be verified before those that are not. If the verification process finds no match among the set of related words, it proceeds to check the remaining candidates in a decreasing order of word frequency. These provisions produce semantic-priming and word-frequency effects in a simulated lexical-decision task. The upper panel of Figure 1 depicts the important structures and processes that are simulated for a typical lexical-decision task that involves normal stimulus durations of 250 msec or more.
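The serial-comparison logic just described can be summarized procedurally. The following minimal sketch (in Python, not the original APL simulation) illustrates only the control flow; the arguments fit, related_to_context, and frequency are hypothetical stand-ins for the simulation's goodness-of-fit computation, semantic-context set, and word-frequency norms.

```python
def verify(candidates, related_to_context, frequency, fit, criterion=0.8):
    """Serially verify candidate words against the refined percept.

    Candidates related to the semantic context are checked first; the
    rest are checked in decreasing order of word frequency. Returns the
    first candidate whose goodness of fit exceeds the decision
    criterion (conscious recognition), or None if all are rejected.
    """
    related = [w for w in candidates if w in related_to_context]
    unrelated = sorted((w for w in candidates if w not in related_to_context),
                       key=lambda w: frequency[w], reverse=True)
    for word in related + unrelated:
        if fit(word) >= criterion:   # match: consciously recognized
            return word
    return None                      # mismatch on every candidate
```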
The factors affecting the speed and accuracy of performance in a particular paradigm depend on whether decisions are based primarily on information from encoding or from verification. Because verification relies on a comparison that involves continuing perceptual analysis of the stimulus, the potential contribution of verification should be severely attenuated whenever a backward mask overwrites or erases the sensory buffer. Thus, paradigms that present masked letter strings offer a potential showcase for the predictive power of our simulated encoding process. The bottom panel of Figure 1 shows the reduced model that is appropriate for very short stimulus durations or stimuli that are masked.
Of primary importance is the model's ability to explain why letters embedded in words are recognized more accurately than letters embedded in nonwords. The current version of the model predicts not only this word-superiority effect (WSE) as a general phenomenon but also the relative performance for any given letter string. The predictions are derived from the following descriptions of the encoding process and the decision rule.
Encoding
Feature Matching
Like many others, we view encoding as a process that involves matching features to various types of units. The model assumes two types of units: whole words stored in a lexicon and individual letters stored in an alphabetum. Each letter of the alphabet is represented by a feature list, with the relative level of activation for each letter unit determined by the number of matching and mismatching features that have been detected. Word units are activated to the extent that their constituent letters are activated in the alphabetum. The model also allows for the possibility that the detection of supraletter features (e.g., word shape or word length) may directly contribute to the activation level of the word units. However, because the present evaluation of the encoding process consists entirely of four-letter uppercase strings, we have assumed that there are no distinctive supraletter features.

Figure 1. The upper panel (titled NORMAL STIMULUS DURATIONS AND NO MASKING) shows the important structures that the model simulates for a typical lexical-decision task that involves normal stimulus durations of 250 msec or more; the lower panel (titled VERY BRIEF STIMULUS DURATIONS AND/OR MASKING) shows the reduced model that is appropriate for very short stimulus durations and/or stimuli that are masked.
It is a straightforward matter to implement a simulation based on feature matching. However, this strategy is not likely to succeed because the selection of the appropriate set of features relies heavily on guesswork. If inappropriate features are used, a bogus set of candidate words will be generated.
Confusion Probabilities as Activation
To avoid the problem of selecting the correct set of features, the activation-verification model uses empirically determined confusion matrices to generate activation levels in the alphabetum and lexicon. Table 1 shows the obtained confusion matrix for the uppercase characters we used. Entries are the percentage of responses (columns) for each letter as a stimulus (rows). The specific procedure used to obtain this matrix has been reported elsewhere (Paap, Newsome, & McDonald, Note 3).
We assume that confusability reflects the degree of feature matching and the appropriate rules for combining matching and mismatching information. This definition of activation emphasizes the role of psychophysical distinctiveness because an identity match does not always lead to the same level of activation. For example, because the probabilities of a correct response given K, S, and V as stimuli (K/K, S/S, and V/V) are .748, .541, and .397, respectively, the model assumes that S, a letter of average confusability, receives less activation than the more distinctive letter K, but more activation than the less distinctive letter V.
Table 1
Confusion Matrix for the Terak Uppercase Letters

          Response
Stimulus   A   B   C   D   E   F   G   H   I   J   K   L   M   N   O   P   Q   R   S   T   U   V   W   X   Y   Z
A         45   3   1   1   1   1   2   2   1   1   1   1   1   0   1   3   4   1   1   0   1   1   1   2   1   2
B          6  61   1   0   4   2   3   2   1   0   0   2   0   0   1   3   2   2   3   1   1   0   2   1   1   2
C          0   0  54   0   0   0   2   0   0   1   1   1   0   0   2   0   1   0   1   0   1   1   1   0   0   1
D          1   2   5  66   1   1   4   1   1   4   1   2   1   2  10   0   6   2   2   2   2   1   1   2   1   1
E          2   4   3   1  65  11   1   0   1   1   1   1   2   3   2   3   3   2   4   3   1   2   2   2   1   3
F          2   2   1   0   6  64   2   1   4   1   1   0   1   0   0   2   1   1   4   2   0   0   1   0   1   2
G          2   3   3   1   2   1  61   2   2   3   3   0   2   1   3   4   8   2   5   1   2   1   2   1   1   3
H          8   2   1   2   3   2   1  73   5   2   2   2   6   3   2   2   3   2   3   4   1   1   8   3   6   3
I          0   1   0   2   0   1   1   0  53   6   0   2   2   1   1   1   1   1   0  13   1   1   1   1   0   2
J          1   1   1   0   1   1   0   1   2  41   0   1   2   1   0   1   0   2   2   1   1   1   2   0   1   3
K          2   2   1   3   2   1   2   2   2   2  75   2   3   2   1   2   1   2   2   3   1   1   2   9   2   3
L          2   3   2   3   3   1   2   1   6   4   2  64   2   1   1   2   1   2   3   3   1   2   2   1   2   5
M          0   0   0   1   0   0   0   1   0   0   1   1  56   1   0   0   0   1   0   1   0   0   1   2   1   0
N          2   1   2   2   1   0   1   1   3   2   3   1  10  76   1   1   3   3   1   1   1   0   8   4   3   3
O          1   2   9   8   0   1   4   1   1   4   0   2   0   1  58   1  13   0   1   0   4   3   0   0   2   1
P          1   2   1   1   1   2   0   1   1   0   0   0   1   1   1  60   1   1   1   0   1   0   1   0   0   1
Q          1   1   2   1   0   0   3   0   1   0   0   0   0   0   6   0  36   0   0   0   1   0   0   0   0   0
R         16   4   3   0   3   3   1   2   2   2   1   2   2   1   1   9   6  69   5   2   2   1   1   2   2   2
S          2   2   3   3   4   2   1   1   1   1   0   1   1   0   1   1   2   1  54   2   0   1   1   1   1   3
T          1   0   1   0   0   1   2   1   6   4   1   2   1   0   1   1   1   1   1  56   2   0   1   1   2  10
U          1   2   2   2   0   0   1   1   2  11   1   5   0   2   2   1   0   4   1   1  64  35   2   1   5   3
V          0   1   0   0   0   0   0   0   1   2   0   1   0   0   0   0   1   0   0   0   5  40   1   0   1   1
W          0   1   0   1   1   1   0   1   1   2   1   2   2   1   0   0   2   0   2   1   3   3  53   2   3   1
X          1   1   1   0   0   0   0   1   1   2   2   1   2   1   1   1   0   0   1   0   0   0   1  61   1   1
Y          1   0   0   0   0   0   1   1   1   0   0   0   1   0   0   1   0   0   .   .   .   .   .   .  57   2
Z          1   0   1   1   1   0   1   1   1   1   0   1   0   0   0   1   1   0   0   1   1   1   1   2   1  39

Note. Entries are the percentages of responses (columns) for each letter as a stimulus (rows). Periods mark entries that are not legible in the available copy.

All of the matrices used to generate predictions are transformations of the matrix shown in Table 1. Transformations are applied to model any variable that is assumed to affect stimulus quality. For example, if the onset asynchrony between stimulus and mask is greater than the 17 msec used to generate the percentages shown in Table 1, then the values on the main diagonal (for correct responses) should be increased, whereas the off-diagonal values (for incorrect responses) are decreased. The particular adjustment used increases each correct response percentage by a percentage of the distance to the ceiling and decreases each incorrect response percentage by a percentage of the distance to the floor. The increments and decrements are such that the rows always sum to 100%. The procedure is reversed when stimulus quality is degraded rather than enhanced.
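This ceiling/floor adjustment can be stated compactly. The sketch below is one reading of the verbal description (again in Python rather than the original APL); the proportion gain is a free parameter, and the off-diagonal decrements are scaled so that each row still sums to 100.

```python
def adjust_quality(matrix, gain):
    """Transform a confusion matrix to model enhanced stimulus quality.

    Each correct (diagonal) percentage moves toward the ceiling of 100
    by the proportion `gain`; the incorrect (off-diagonal) percentages
    move toward the floor of 0 in proportion to their size, scaled so
    every row still sums to 100. A negative `gain` reverses the
    procedure, modeling degraded rather than enhanced quality.
    """
    n = len(matrix)
    out = [row[:] for row in matrix]
    for i in range(n):
        boost = gain * (100.0 - out[i][i])          # distance to ceiling
        out[i][i] += boost
        off_total = sum(out[i][j] for j in range(n) if j != i)
        for j in range(n):
            if j != i and off_total > 0:
                out[i][j] -= boost * out[i][j] / off_total  # shift toward floor
    return out
```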
Another effect that the model can capture by appropriate transformations of the basic matrix is loss of acuity for letters at greater distances from the average fixation point. All of the predictions reported later access separate matrices for each of the four spatial positions.

The extent to which separate matrices improve the model's predictions depends on whether correlations between obtained and predicted data are based on all stimulus items or only those that test the same target position. To demonstrate this we derived a single matrix in which each cell entry was the mean of the four confusion probabilities found in the separate matrices. When the single matrix is used, correlations between predicted and obtained performance are significantly higher for the subsets of stimuli that all share the same target position than across the entire set of stimuli. When separate confusion matrices are used, the correlation for the entire set of stimuli rises to about the same level as the separate correlations on each position.
As an example of how the encoding process uses the confusion matrices, consider the presentation of the input string PORE. As indicated in Figure 2, position-specific units in the alphabetum are assumed to be activated in direct proportion to their confusability. In the first position the input letter P activates the corresponding P unit the most (.538), the R unit more than any other remaining unit (.091), and several other units (G, A, B, H, and L) to lesser extents. Patterns of activation are established in a similar manner for the other three spatial positions.

Figure 2. Encoding the word PORE. (Activation strengths for letter units in the alphabetum are determined by letter-confusion probabilities. Activation strengths for word units in the lexicon are determined by taking the geometric mean of the corresponding letter-confusion probabilities.) [The figure shows the sensory buffer feeding a visual representation that activates position-specific entries in the alphabetum (the strongest in each position: P .54, O .66, R .58, E .39) and, through them, word units in the lexicon (geometric means): PORE .533, PORK .276, GORE .275, BORE .254, LORE .245, POKE .242.]
Activity in the alphabetum continuously feeds into the lexicon. The encoding algorithm estimates the activation strength for each word in the lexicon by taking the geometric mean of the activity levels associated with the constituent letters. One consequence of using the geometric mean is that one very inactive letter unit (close to zero) may prevent activation of a potential word unit that is receiving high levels of activation from three other letter units. This may mirror psychological reality because otherwise identical versions of the model yield poorer fits to the obtained data if the geometric mean is replaced by the arithmetic mean or the square root of the sum of squares (the vector distance between another word and the input word in a space generated from the letter-confusion probabilities).
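A minimal sketch of this encoding step, assuming position-specific confusion matrices indexed as conf[pos][stimulus_letter][letter_unit] and anticipating the word-unit criterion introduced in the next section; the function names are ours, not the simulation's.

```python
from math import prod

def word_activation(stimulus, word, conf):
    """Geometric mean of the letter-confusion probabilities linking each
    stimulus letter to the corresponding letter of `word`."""
    probs = [conf[pos][s][w] for pos, (s, w) in enumerate(zip(stimulus, word))]
    return prod(probs) ** (1.0 / len(probs))

def candidate_set(stimulus, lexicon, conf, criterion=0.24):
    """All word units whose activation exceeds the word-unit criterion."""
    return {w: a for w in lexicon
            if (a := word_activation(stimulus, w, conf)) > criterion}
```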
The Word-Unit Criterion
The decision system does not monitor all of the activity in the lexicon. The model assumes that the activity in a word unit can be accessed by the decision system only if the level of activation exceeds a preset criterion. The predictions reported in this paper are all based on a word-unit criterion of .24. With this criterion word stimuli generate an average of about 3.4 words in the candidate set compared to about 2.1 words for stimuli that are orthographically regular pseudowords. If the word-unit criterion is raised, fewer words will be accessible to the decision system. In our final discussion we will suggest that a high criterion may offer an alternative explanation for the pseudoword-expectancy effect reported by Carr, Davidson, and Hawkins (1978).
For the example illustrated in Figure 2, six word units exceed the criterion for the input word PORE: PORE (.533), PORK (.276), GORE (.275), BORE (.254), LORE (.245), and POKE (.242). Nonwords can also activate the lexicon through the same mechanism. For example, when the pseudoword DORE is input to the simulation, three word units exceed a geometric mean of .240: DONE (.268), LORE (.265), and SORE (.261). Nonwords with lower levels of orthographic structure tend to produce less lexical activity. For example, when EPRO (an anagram of PORE) is presented to the encoding algorithm, no word units exceed the .240 criterion.
Decision
Decision Criterion
If the task requires detection or recognition of a letter from the stimulus, the decision process is assumed to have access to the relative activation levels of all units in the alphabetum and those units in the lexicon that exceed the word-unit criterion. It is further assumed that when total lexical activity exceeds some preset criterion, the decision will be based on lexical rather than alphabetic evidence. This decision criterion is different from the individual word-unit criterion, and the distinction should be kept clearly in mind. Exceeding a word-unit criterion makes that particular lexical entry accessible to the decision system. Exceeding the decision criterion leads to a decision based on lexical activity rather than alphabetic activity.
It is advantageous to base a decision on lexical evidence when there is some minimal amount of activation, because many words can be completely specified on the basis of fewer features than would be necessary to specify their constituent letters when presented in isolation. Accordingly, lexical candidates will tend toward greater veracity than alphabetic candidates whenever decisions are made on the basis of partial information.
The specific decision rules used to predict performance in a two-alternative, forced-choice letter-recognition task are as follows: For any stimulus, the predicted proportion correct (PPC) depends on contributions from both the lexicon and alphabetum. More specifically, PPC is the weighted sum of the probability of a correct response based on lexical evidence and the probability of a correct response based on alphabetic evidence:

PPC = P(L) × P(C/L) + P(A) × P(C/A),   (1)

where P(L) is the probability of a lexically based decision, P(C/L) is the conditional probability of a correct response given that a decision is based on the lexicon, P(A) is the probability of an alphabetically based decision, and P(C/A) is the conditional probability of a correct response based on alphabetic information. Because the decision for each trial is made on the basis of either lexical or alphabetic information, P(A) is equal to 1 - P(L).
Correct Responses From the Lexicon
The probability of a correct response given a decision based on the lexicon is

P(C/L) = 1.0 × (Swc/Sw) + .5 × (Swn/Sw) + 0 × (Swi/Sw),   (2)

where Swc is the activation strength of word units that support the target letter, Swn is the activation strength of word units that support neither the correct nor the incorrect alternative, Swi is the activation strength of word units that support the incorrect alternative, and Sw is the total lexical activity.

The general expression for P(C/L) shown in Equation 2 was selected for reasons of parsimony and programming efficiency. The equation can be viewed as the application of a simple high-threshold model (Luce, 1963) to each lexical entry. When a word unit exceeds the criterion, the decision system will (a) select the correct alternative with a probability of 1.0 whenever the letter in the critical position supports the correct alternative, (b) select the correct alternative with a probability of 0.0 whenever the letter in the critical position supports the incorrect alternative, and (c) guess whenever the critical letter supports neither alternative. The only additional assumption required is that the decision system combine the probabilities from each lexical entry by simply weighting them in proportion to their activation strengths. For the following examples, words had to exceed a criterion of .24 in order to be considered by the decision system.
If the decision for any single trial is based on lexical activity, our underlying process model assumes that something like Equation 2 does apply. That is, we have adopted the working hypothesis that decisions based on unverified lexical evidence involve a weighted strength of the word units supporting each of the two-choice alternatives. Alternatively, P(C/L) could be viewed as the probability of certain word units being the most highly activated units on individual trials. We note as an aside that our general approach has been to find a set of simple algorithms (with plausible psychological underpinnings) that do a good job of predicting performance. An alternative approach is to begin with very specific ideas about the underlying psychological processes and then derive algorithms to suit these particular assumptions. We have shied away from this latter strategy in the belief that both the tests and selection of particular psychological explanations would be easier once we had developed a formal model that predicts performance in several paradigms with a fair amount of success.
The factors that determine the probability of a correct response from the lexicon can be easily understood by examining specific examples. If the stimulus word PORE is presented (see Figure 2) and the third position is probed with the alternatives R and K, we have

P(C/L) = 1 × (1.583/1.825) + .5 × (0/1.825) + 0 = .867.   (3)

This relatively high probability of a correct response is reasonable because five of the highly activated words (BORE, PORK, GORE, LORE, PORE) support the correct alternative, whereas only POKE supports the incorrect alternative. In general, P(C/L) will be .70 or greater for words; but exceptions do occur.
For example, when the word GONE is presented to the simulation, the following words, with their activation strengths in parentheses, exceed the cutoff: DONE (.281), GONE (.549), TONE (.243), BONE (.278), CONE (.256), and LONE (.251). If the first position is probed with the alternatives G and B, we have

P(C/L) = 1 × (.549/1.858) + .5 × (1.031/1.858) + 0 = .57.   (4)

Lower values of P(C/L) tend to occur when there is a highly activated word that supports the incorrect alternative and/or when there are several highly activated words that support neither alternative.
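Equation 2 and the two worked examples reduce to a few lines of code. The sketch below uses the Figure 2 candidate set for PORE; only the function name is invented.

```python
def p_correct_lexical(candidates, pos, correct, incorrect):
    """Equation 2: an activation-weighted high-threshold rule over the
    candidate set; `candidates` maps each word to its activation strength."""
    total = sum(candidates.values())
    p = 0.0
    for word, strength in candidates.items():
        if word[pos] == correct:
            p += 1.0 * strength        # supports the target letter
        elif word[pos] != incorrect:
            p += 0.5 * strength        # supports neither alternative: a guess
        # words supporting the incorrect alternative contribute 0
    return p / total

pore_candidates = {"PORE": .533, "PORK": .276, "GORE": .275,
                   "BORE": .254, "LORE": .245, "POKE": .242}
print(round(p_correct_lexical(pore_candidates, 2, "R", "K"), 3))  # 0.867, as in Equation 3
```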
Correct Responses From the Alphabetum
The probability of a correct response given a decision based on the alphabetum is

P(C/A) = 1.0 × (ac/Sa) + .5 × (San/Sa) + 0 × (ai/Sa),   (5)

where ac is the activation strength of the letter unit corresponding to the correct alternative, San is the activation strength of the letter units that are neither the correct nor the incorrect alternative, ai is the activation strength of the letter unit corresponding to the incorrect alternative, and Sa is the total alphabetic activity. The only difference between the decision rule for the alphabetum and that for the lexicon is that alphabetic activity is not filtered by a criterion.
Assuming that the third position is probed with the alternatives R and K, the P(C/A) for the stimulus word PORE is

P(C/A) = 1 × (.585/1.000) + .5 × (.390/1.000) + 0 = .780.   (6)

This value would, of course, be the same for the pseudoword DORE, the anagram EPRO, or any other stimulus that contains R in the third position.
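Equation 5 admits the same treatment. In the usage line below, the R and pooled "other" activations are the values cited in Equation 6, and the .025 shown for K is simply the remainder that makes the third-position activations sum to 1.000.

```python
def p_correct_alphabetic(activations, correct, incorrect):
    """Equation 5: the high-threshold rule applied to letter units in the
    alphabetum; unlike Equation 2, no criterion filters the units."""
    total = sum(activations.values())
    a_c = activations.get(correct, 0.0)
    a_i = activations.get(incorrect, 0.0)
    return (1.0 * a_c + 0.5 * (total - a_c - a_i) + 0.0 * a_i) / total

third_position = {"R": .585, "K": .025, "other": .390}  # pooled non-alternative units
print(p_correct_alphabetic(third_position, "R", "K"))   # 0.78, as in Equation 6
```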
Probability of a Decision Based on the Lexicon
For any given trial, it is assumed that a decision will be made on the basis of lexical information if total lexical activity exceeds the decision criterion. Given noise introduced by variations in the subject's fixation or attention, and within the visual processing system itself, it is reasonable to assume that a specific stimulus will exceed or fall short of the decision criterion on a probabilistic, rather than an all-or-none, basis. Accordingly, the mathematical instantiation of our verbal model estimates, for each stimulus, the probability that its lexical activity will exceed the decision criterion. This probability will, of course, depend on both the average amount of lexical activity produced by the stimulus in question and the current value of the decision criterion.
The first step in estimating P(L) normalizes the total lexical activity produced by each individual stimulus to that stimulus that produced the greatest amount of lexical activity. Of the 288 words that have been used as input to the encoding algorithm, the word SEAR has produced the greatest number of words above criterion (9) and the greatest amount of total lexical activity (2.779). Thus, normalization involves dividing the total lexical activity for a given stimulus by 2.779. Normalization is simply a convenience to ensure that the amount of lexical activity generated by each stimulus will fall in the range of 0 to 1 and, consequently, that P(L) will also be bounded by 0 and 1. Because this transformation simply involves dividing by a constant, we are not altering the relative lexical strengths that were initially obtained by summing the geometric means of all words above the word-unit criterion.
In any event, we certainly do not mean to imply that subjects must somehow know in advance the greatest amount of lexical activity that they will experience during the course of the experiment. Rather, we simply assume that total lexical activity is one important determiner of P(L).
The contribution of the decision rule to P(L) is reflected by a second step that raises each of the normalized activation levels by a constant power between 0 and 1. This yields the estimated P(L) for each stimulus. Stringent decision criteria can be modeled by using high exponents (near 1). This procedure generates a wide range of P(L) across items, and a decrease in the average P(L). Lax decision criteria can be modeled by using low exponents (near 0). A very lax criterion compresses the range toward the upper boundary and thus causes the mean P(L) to approach 1. Consequently, when a very lax criterion is used, P(L) tends to be quite high for any level of lexical activity. Using an exponential transformation is a convenient way to operationalize decision rules as diverse as "use lexical evidence whenever it is available" (exponents near 0) to "use lexical evidence only for those stimuli that produce substantial amounts of lexical activity" (exponents near 1). All of the predictions discussed later are based on a constant value (.5) for this parameter.
Because P(L) is derived from total lexical activity, it will generally be the case that stimuli like PORE that excite six word units above threshold will have higher probabilities than stimuli like RAMP, which produce only one suprathreshold word unit. In summary, the probability that a decision will be based on lexical evidence is estimated for each stimulus using the following equation:

P(L) = (Wi/Wmax)^n,   (7)

where Wi is the total lexical activity for stimulus i, Wmax is the total lexical activity for the stimulus producing the greatest activity, and the exponent n is a parameter that reflects the stringency of the criterion. P(L) for the stimulus PORE would be

P(L) = (1.825/2.779)^.5 = .810.   (8)
When the exponent n is set to .5, P(L) for word stimuli will range from about .4 to 1.0, with a mean of about .6.
Finally, it is assumed that when total lexical activity is less than the criterion, the decision will, by default, be based on alphabetic information. Accordingly, the probability of an alphabetic decision, P(A), is

P(A) = 1 - P(L).   (9)
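Equations 7-9 collapse to a one-line transformation. A sketch, with Wmax = 2.779 and n = .5 taken from the text:

```python
def p_lexical_decision(total_activity, w_max=2.779, n=0.5):
    """Equations 7 and 9: total lexical activity, normalized to the most
    active stimulus (SEAR), raised to the criterion-stringency exponent n;
    the alphabetic probability is the complement."""
    p_l = (total_activity / w_max) ** n
    return p_l, 1.0 - p_l

print(p_lexical_decision(1.825))  # (~.810, ~.190) for PORE, as in Equation 8
```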
Predicted Probability Correct
Table 2 uses Equation 1 to show the derivation of the overall probability of a correct response for two sets of stimuli. Each set consists of a word, a pseudoword that shares three letters in common with the word, and an anagram of the word. The first set was chosen because it produces predictions that are similar to most sets of words and nonwords and illustrates why the model will yield different mean PPCs for words, pseudowords, and anagrams. The second set is abnormal and illustrates some principles that account for variations within stimulus classes.
As exemplified by PORE, the probability of a correct response based on lexical evidence is usually greater than that based on alphabetic evidence. The overall proportion correct falls somewhere between the lexical and alphabetic probabilities and will approach the lexical value as P(L), the probability of a lexical decision, increases. In general, words should provide better context than nonwords to the extent that (a) P(C/L) > P(C/A) and (b) P(L) is high. Because these conditions are met for the stimulus PORE, the model predicts a 4.2% advantage over the pseudoword DORE and a 6.6% advantage over the anagram EPRO.
The model predicts that some words should actually produce word-inferiority effects. This can only occur, as in the example LEAF, when lexical evidence is poorer than alphabetic evidence. Because the probability of a lexical decision is estimated from total lexical activity, regardless of the veridicality of that information, the model predicts that LEAF will be judged on the basis of the inferior lexical evidence about two thirds of the time. This leads to a predicted 8.4% disadvantage relative to the pseudoword BEAF and a 6.1% disadvantage relative to the anagram ELAF.
Table 2
Simulation of Word, Pseudoword, and Anagram Differences for Two Examples

                                              Simulated values
Class         Stimulus  Alternatives   WSE    SPC  =  P(L)  ×  P(C/L)  +  P(A)   ×  P(C/A)
Typical
  Word          PORE      R, K                .852    .810     .867       .190      .786
  Pseudoword    DORE      R, K       +.042    .810    .535     .831       .465      .786
  Anagram       EPRO      R, K       +.066    .786    .000     .000      1.000      .786
Atypical
  Word          LEAF      F, P                .621    .677     .591       .323      .682
  Pseudoword    BEAF      F, P       -.084    .705    .428     .736       .572      .682
  Anagram       ELAF      F, P       -.061    .682    .000     .000      1.000      .682

Note. WSE = word-superiority effect; SPC is the simulated proportion correct; P(C/L) is the probability of a correct response from the lexicon; P(C/A) is the probability of a correct response from the alphabetum; and P(L) is the probability of basing a decision on lexical information.
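Equation 1 then assembles the pieces. The following sketch reproduces the SPC column for the typical set in Table 2 from the published component values:

```python
def predicted_proportion_correct(p_l, p_c_l, p_c_a):
    """Equation 1: PPC = P(L) x P(C/L) + P(A) x P(C/A), with P(A) = 1 - P(L)."""
    return p_l * p_c_l + (1.0 - p_l) * p_c_a

table2_typical = {"PORE": (.810, .867, .786),
                  "DORE": (.535, .831, .786),
                  "EPRO": (.000, .000, .786)}
for stim, args in table2_typical.items():
    print(stim, round(predicted_proportion_correct(*args), 3))
# PORE 0.852, DORE 0.81, EPRO 0.786 -- the SPC column for the typical set
```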
Test and Evaluation of the Model
The model can be tested at two levels. First, by averaging across stimuli in the same class, the model can be used to predict the magnitude of the WSE for words over pseudowords or words over anagrams. Second, the model should be able to predict item variation within a stimulus class.
Four experiments provide the basis for the following tests (Paap & Newsome, Note 1, Note 2; Paap, Newsome, McDonald, & Schvaneveldt, Note 6). All experiments used the two-alternative, forced-choice letter-recognition task. Each experiment compared performance on a set of 288 four-letter words to a set of 288 nonwords. The nonwords used in two of the experiments were orthographically regular pseudowords. In the remaining two experiments, the nonwords were formed by selecting that anagram for each word stimulus that minimized the amount of orthographic structure. The two alternatives selected for each stimulus both formed words for word stimuli and nonwords for the nonword stimuli.
Word and Pseudoword Advantages
Our first approach to evaluating the model was to use the algorithm described in the introduction to predict the proportion correct for each of the 288 words, pseudowords, and anagrams. The mean output of the model for words, pseudowords, and anagrams is shown in Table 3. The simulation predicts a 2.8% advantage for words (.841) over pseudowords (.813), and an 8.6% advantage for words over anagrams (.755). These differences compare favorably to the obtained WSEs of 2.6% and 8.8%, respectively.
Across all 288 words, the number of lexical entries exceeding the cutoff ranged from 1 to 9, with a mean of 3.4. These word units constrain the identity of the critical letter more effectively than it is constrained by the activity within the alphabetum. Thus, the word advantages predicted by the model occur because lexical information is used 63% of the time and the mean probability of a correct response from the lexicon (.897) is greater than that based on the alphabetum (.758).
The major reason why the model yields lower proportions correct for nonwords than words is not the quality of the available lexical evidence, but rather its frequent absence. That is, the probability of a correct response based on lexical evidence for the 253 pseudowords that produce at least one word above threshold is nearly identical (about .90) to that for the 288 words. Similarly, P(C/L) for the 44 anagrams that produce at least one word above the cutoff is .94. Thus, the quantity and not the quality of lexical information is the basis for the WSE. Orthographically regular pseudowords excite the lexicon almost as much as words (2.1 vs. 3.4 entries) and lead to small word advantages, whereas orthographically irregular anagrams generate much less lexical activity (.2 vs. 3.4 entries) and show much larger word advantages.
Table 3
Simulated Values for Words, Pseudowords, and Anagrams

                       Simulated values
Lexical class     PPC    P(C/L)   P(C/A)   P(L)    NW
Words            .841     .897     .758    .634    3.4
Pseudowords      .813     .791     .758    .415    2.1
Anagrams         .755     .144     .758    .073     .2

Note. PPC is the predicted proportion correct; P(C/L) is the probability of a correct response from the lexicon; P(C/A) is the probability of a correct response from the alphabetum; P(L) is the probability of basing a decision on lexical information; and NW is the number of words that exceeded the criterion.

Item-Specific Effects

The model's ability to predict performance on specific stimuli is limited by the sensitivity and reliability of the data. Our previous work provides two sets of word data and one set for each of the two types of nonwords. Each of the 288 items in a set was presented to 24 different subjects. This means that the obtained proportions correct for individual items vary in steps of .04. Given these limitations, a correlation of data against data provides an index of the maximum amount of variation that could be accounted for by the model. The correlation between the two sets of word data was .56. A similar determination of the reliability of the pseudoword and anagram data yielded correlations of .48 and .39, respectively. However, because only 24 subjects saw each nonword stimulus, these lower correlations are due, in part, to the fact that each half consisted of only 12 observations compared with the 24 available in the word analysis.
Table 4 shows the correlations between the various sets of obtained data and the values generated by the model. Because each correlation is based on a large number (288) of pairs, significant values of r need only exceed .12. For all three stimulus classes, there are significant correlations between the obtained data and (a) the predicted proportion correct, (b) the probability of a correct response from the lexicon, and (c) the probability of a correct response from the alphabetum. The correlations are quite high considering the limitations discussed above. For example, the correlation between the first set of word data and the predicted proportion correct is .30 compared to .56 for data against data. Taking the ratio of the squared values of these correlations (.09 and .31, respectively) leads to the conclusion that the model can account for 29% of the consistent item variation (both correlations are based on 24 observations per data point, and no correction for n is needed).
As a final check on the model's ability to predict variation within words, the 288 words were partitioned into thirds on the basis of their predicted performance, and mean obtained performance was computed for each group. Obtained proportion correct for the upper third was .85 compared to .82 and .78 for the middle and bottom thirds.
The source of the model's success in predicting interitem variation is difficult to trace. Because decisions about word stimuli are made on the basis of lexical evidence more often than on alphabetic evidence, P(L) = .63, it is clear that both the lexicon and alphabetum contribute substantially to the overall PPC, and accordingly, both branches must enjoy some predictive power in order to avoid diluting the overall correlation between obtained and predicted correct. Furthermore, it should be noted that the correlation between P(C/L) and the obtained data is quite sensitive to the word-unit criterion (because this affects the average number of candidate words). This is consistent with the view that the predictive power of the lexical branch primarily depends on getting the correct set of candidate words and is not a simple transformation of alphabetic activity.
The item-specific predictions are far from exact, but they are quite encouraging because our lexicon contains only the 1,600 four-letter words listed in the Kucera and Francis (1967) norms. Because P(C/L) for any item is determined by the activation strengths of visually similar words in the lexicon, substantial variation for a particular item can be introduced if just one highly similar word is either added or deleted from the lexicon.

Table 4
Correlations Between Obtained Proportion Correct and Simulated Values

                          Simulated values
Stimulus type     PPC    P(C/L)   P(C/A)   P(L)     NW
Words
  Set 1          +.30     +.28     +.29    -.05    -.05
  Set 2          +.26     +.23     +.27    +.01     .00
Anagrams         +.37     +.21     +.34    +.17    +.14
Pseudowords      +.35     +.17     +.38    +.15    +.16

Note. PPC is the predicted proportion correct; P(C/L) is the probability of a correct response from the lexicon; P(C/A) is the probability of a correct response from the alphabetum; P(L) is the probability of basing a decision on lexical information; and NW is the number of words that exceeded the criterion.
Lexical Constraint
test words consisted
of the 288
words
used
by
Johnston
(1978)
in his
influential
test
of
sophisticated-guessing theory. Half
of the
words
were
defined
by
Johnston
as
high-con-
straint words,
and the
other half
as
low-con-
straint words.
Johnston
assumed
that
lexical
knowledge
will
constrain
the
identity
of the
critical letter
in
inverse proportion
to the
number
of
different
letters that
will
form
words
given
the
remaining context.
For ex-
ample,
the
context
_ATE
supplies much
less
constraint than
the
context
_RIP
because
10
letters
form
words
in the
former
context,
but
only
three
in the
latter. Johnston rejected
the
hypothesis
that lexical constraint contributes
to the WSE
because performance
on the
high-constraint
words (.77)
was
slightly lower
than
performance
on the
low-constraint
words
(.80).
Our model shows that when the same partial information, in the form of letter-confusion probabilities, is provided to both the alphabetum and lexicon, lexical activity can support the critical letter more often than does the alphabetic activity. This difference between P(C/L) and P(C/A) provides an index of the potential amount of lexical benefit for any word. We view this measure of lexical benefit as an alternative definition for the global concept of lexical constraint. Thus, Johnston's (1978) conclusion that lexical constraint does not contribute to the WSE may have been premature and the product of a less appropriate definition of lexical constraint. Concerns that we have raised previously (Paap & Newsome, 1980a) can now be extended in the context of our model and the alternative definition for lexical constraint.
Johnston (1978) obtained both free-recall and forced-choice responses. First, consider those trials on which the three context letters were correctly reported. The conditional probabilities of a correct critical-letter report given a correct report of all three context letters were .90 and .86 for high- and low-constraint pairs, respectively. This is extremely high performance for free recall, and any significant differences due to lexical constraint may be obscured by a ceiling effect. Moreover, if one assumes that the same stimuli presented to the same subjects under the same conditions would yield performance distributions with some variability, then it would seem quite reasonable to characterize these trials as samples that have been drawn from the upper end of the distribution and that reflect trials on which the level of visual information was unusually high.
When stimulus information is high, the effects of lexical constraint may be low. Our model makes exactly this prediction. If stimulus quality is enhanced by transforming the correct responses in the confusion matrices upward, and the incorrect responses downward, the difference between the lexical and alphabetic branches disappears. For example, if stimulus quality is raised to the extent that the probability of a correct response based on the alphabetum is increased from .758 to .889, the advantage of lexical over alphabetic evidence decreases from 13.9% to -.5%.
When stimulus information is low (when only a few features are detected in each letter location), lexical knowledge should be more beneficial. However, when the subject has only partial information about each letter, Johnston's (1978) procedure for computing lexical constraint (based on complete knowledge of the three context letters and no information about the target) may no longer correlate with the lexical constraint provided by a partial set of features at each letter location.
Our analysis completely supports this hypothesis: Johnston's high-constraint words yield a PPC of .830 compared to .852 for the low-constraint set. Furthermore, the average number of word units exceeding criterion is exactly the same (3.4) for both sets of words. It is clear that there is absolutely no relation between the number of letters that will form a word in the critical position of a test word (Johnston's definition of lexical constraint) and the number of words that are visually similar to that word (the candidate words in the activation-verification model).
In contrast, when lexical constraint is defined as the amount of lexical benefit, the effects of lexical constraint are apparent in the data. For each of the 288 stimuli of each type, we subtracted P(C/A) from P(C/L) and then partitioned the stimuli into thirds on the basis of these differences. For both sets of word data and the pseudoword data, obtained performance on the most highly constrained third is about 5% greater than that on the bottom third. There were no differences for the anagrams, but this is to be expected because our anagrams rarely activate the lexicon. Although the effect of lexical constraint (defined as lexical benefit) is small, it appears in all three data sets where it was predicted to occur. Furthermore, this measure provides a pure index of the predictive power of the lexical branch of our model. This is true because the psychophysical distinctiveness of the target letter is removed by subtracting P(C/A). Differences in lexical constraint are due only to the mixture of candidate words that support the correct, incorrect, or neither alternative.
Another way of appreciating the role of lexical constraint in our data is to compare the high-constraint (top third) and low-constraint (bottom third) words to the high- and low-constraint anagrams. The magnitude of the WSE is about 10% for the high-constraint set compared to only 5% for the low-constraint set. One might speculate that a comparable effect of lexical constraint could be found in Johnston's (1978) data if they were analyzed on the basis of our new measure of lexical constraint.
Orthography
Massaro and his associates (Massaro, 1973, 1979; Massaro, Taylor, Venezky, Jastrzembski, & Lucas, 1980; Massaro, Venezky, & Taylor, 1979) have convincingly advocated a model in which letter recognition is guided by inferences drawn from knowledge of orthographic structure. Our model has no provision for the dynamic use of orthographic rules, nor does it assume a syllabary of commonly occurring letter clusters that could be activated by, or in parallel with, the alphabetum.
Although it is clear that the model does not need any orthographic mechanism in order to predict the advantage of the regular pseudowords over the irregular anagrams, the present experiments offer a large set of stimuli and data to assess the possible contribution of orthography within the word, pseudoword, and anagram classes.
In accordance with the procedure advocated by Massaro, the sum of the logarithms of the bigram frequencies (SLBF) was computed for each stimulus. The correlations between SLBF and the two sets of word data were .11 and .04. Apparently, there is no relation between this measure of orthographic structure and performance on individual items. This is also true for the correlation between SLBF and the pseudoword data (r = .09).
In contrast, the correlation between SLBF and the anagram data is much higher (r = .30). This pattern of correlation is similar to a previous analysis of orthographic structure (Paap & Newsome, 1980b) and further supports our conclusion that orthographic structure will predict performance only when very low levels are compared to somewhat higher levels of structure.
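For concreteness, the SLBF measure itself is a two-line computation. In this sketch the bigram-frequency table is a hypothetical stand-in for the corpus counts used in Massaro's procedure, and every bigram is assumed to have a nonzero count.

```python
from math import log10

def slbf(string, bigram_freq):
    """Sum of the logarithms of the bigram frequencies for a stimulus;
    assumes every bigram of `string` appears in `bigram_freq` with a
    count greater than zero."""
    return sum(log10(bigram_freq[string[i:i + 2]])
               for i in range(len(string) - 1))
```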
Although current data do not permit one to rule out the use of orthographic rules in letter and word recognition, our model shows that both the lexical (advantage of words over well-formed pseudowords) and orthographic (advantage of pseudowords over irregular strings) components of the WSE can be predicted on the basis of lexical constraint alone. Furthermore, lexical access may also account for the apparent effect of orthography on anagram performance.
In the activation-verification model, the contribution of lexical activity is determined by the probability of a decision based on the lexicon, P(L), and the probability of a correct response based on lexical activity, P(C/L). The correlation between orthography (SLBF) for each anagram and its corresponding P(L) is .49. Furthermore, the correlation between SLBF and P(C/L) is also .49. In terms of our model, there is no direct effect of orthographic structure on letter recognition. Rather, it is simply the case that extremely irregular letter strings rarely excite the lexicon and, therefore, cannot benefit from lexical access. On the other hand, less irregular anagrams will occasionally activate a word unit, and that unit is likely to support the correct alternative.
Recently, Massaro (Note 7) conducted simulations of his fuzzy logical model that are similar to the activation-verification model in that top-down evidence (e.g., log bigram frequencies) is combined with an index of visual evidence based on letter-confusion probabilities. For six-letter anagrams visual evidence alone is a poor predictor; the correlation between predicted and observed results for 160 anagrams is only .08. Adding the log-bigram frequency component to the model raises the correlation to .59. Orthography does seem to have a considerable impact and suggests the possibility that perception of longer strings may be influenced by orthographic regularity to a much greater extent than is perception of shorter strings.
On the other hand, it is entirely possible that the activation-verification model may also be able to account for the orthographic effects in Massaro's six-letter anagrams on the basis of lexical access and without recourse to any orthographic mechanism.
The outcome of Massaro's simulation for the 40 six-letter words is less informative. The correlation between obtained data and that predicted from the visual component alone was .48 compared to only .43 for the model that combines both the visual and orthographic components. This suggests that the impact of orthography on the perception of six-letter words may be quite weak, but it may be important to note that performance levels were not at all comparable for the words (90% correct) and anagrams (75% correct).
Comparisons of the Interactive Activation and Activation-Verification Models
McClelland and Rumelhart (1981; Rumelhart & McClelland, 1982) have proposed an interactive activation model that extends to the same wide scope of letter and word recognition paradigms that have been the target of our activation-verification model. Both models share many basic assumptions: (a) that stimulus input activates spatially specific letter units, (b) that activated letter units modulate the activity of word units, and (c) that letter and word recognition are frequently affected by important top-down processes. These generally stated assumptions permit both models to predict and explain the effects of lexicality, orthography, word frequency, and priming. However, the specific operations used to instantiate these general assumptions in McClelland and Rumelhart's computer simulation and in our computational algorithms offer a large number of provocative differences with respect to the specific mechanisms responsible for the various contextual phenomena. Furthermore, the two models are not always equally adept in accounting for the various context effects.
The Word and Pseudoword Advantage
The WSE is often characterized as consisting of two effects. The lexical effect refers to the benefits that accrue from accessing the lexicon and is estimated from the obtained advantage of words over well-formed pseudowords. The orthographic effect refers to the benefits derived from the reader's knowledge of orthographic redundancy and can be estimated from the obtained advantage of pseudowords over irregular nonwords. Both the activation-verification and interactive activation models assume that lexical activation accounts for both lexical and orthographic effects.
In the interactive activation model, lexical access facilitates letter recognition through excitatory feedback from activated word units to their constituent letter units. Word stimuli are very likely to activate word units that reinforce the letters presented, thereby increasing the perceptibility of the letters. In contrast, irregular nonwords will rarely activate a word unit, and accordingly, the persistence of activity in the correct letter units will not be extended by feedback. Because pseudowords share many letters in common with words, they too activate word units that produce excitatory feedback and strengthen the letter units that give rise to them.
Given the detailed encoding assumptions of the interactive activation model and the particular set of parameter values needed to predict the basic pseudoword advantage, McClelland and Rumelhart conclude that the amount of feedback, and hence the amount of facilitation, depends primarily on the activation of word units that share three letters with the stimulus. They call the set of words that share three letters with the stimulus its neighborhood. The amount of facilitation for any particular target letter will be primarily determined by the number of word units in the neighborhood that support the target ("friends") and the number that support some other letter ("enemies").
This generalization provides a good basis for comparing the two models, because the amount of facilitation produced by lexical access in our model will be primarily determined by the number of friends and enemies in the candidate set generated by our encoding algorithm. The set of words in the neighborhood of a particular stimulus is likely to be quite different from the set of candidate words.
One major reason for this (as pointed out earlier in the discussion of the geometric mean as a measure of word-unit activation) is that word units that share three letters with the stimulus will fail to exceed the word-unit criterion if the mismatching letter is not very confusable with the letter actually presented. For example, for the input string SINK with S as the test letter, our encoding algorithm generates only three friends (SING, SINE, and SINK) and four enemies (LINK, WINK, FINK, and RINK). In addition to all of these words, the neighborhood includes five new friends (SICK, SANK, SINS, SILK, and SUNK) and two new enemies (PINK and MINK). Thus, the ratio of friends to enemies is 3:4 for our model compared to 8:6 for their model.
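The SINK example can be reproduced mechanically. The sketch below defines the neighborhood as every lexicon word sharing at least three position-specific letters with the stimulus, following McClelland and Rumelhart's definition as described above; the 14-word lexicon is a hypothetical fragment containing just the words named in the text.

```python
def neighborhood(stimulus, lexicon):
    """Words sharing at least three position-specific letters with the
    stimulus (McClelland & Rumelhart's neighborhood), stimulus included."""
    return [w for w in lexicon
            if sum(a == b for a, b in zip(stimulus, w)) >= 3]

def friends_and_enemies(words, pos, target):
    """Split a word set by whether the probed position supports the
    target letter (friends) or some other letter (enemies)."""
    friends = [w for w in words if w[pos] == target]
    return friends, [w for w in words if w[pos] != target]

lexicon = ["SINK", "SING", "SINE", "LINK", "WINK", "FINK", "RINK",
           "SICK", "SANK", "SINS", "SILK", "SUNK", "PINK", "MINK"]
f, e = friends_and_enemies(neighborhood("SINK", lexicon), 0, "S")
print(len(f), len(e))   # 8 6 -> 8 friends and 6 enemies, as in the text
```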
Using the candidate set generated by our model and the neighborhood defined by a search of our lexicon (the 1,600 four-letter words in the Kucera and Francis, 1967, norms), we computed the proportion of friends for each stimulus according to each of the two models. In order to compare the predictive power of the two models, we then correlated the proportion of friends against the two sets of word data, the anagram data, and the pseudoword data. For all four cases the proportion of friends in the candidate set yielded higher correlations than the proportion of friends in the neighborhood. The average correlation for our model was .24 compared to .14 for the interactive activation model.
In summary, our model seems to have a slight edge in its ability to account for consistent interitem variation that accrues from lexical access.
We were also curious as to the implications that McClelland and Rumelhart's encoding assumptions would have for the average performance on our words, pseudowords, and anagrams. To this end the alphabetic branch of our model was modified so that (a) the activity of each word was boosted by .07 for each matching letter and reduced by .04 for each mismatching letter and (b) the word-unit criterion would be exceeded by all those lexical entries that shared at least three letters in common with the stimulus. The first modification is based on the values of letter-to-word excitation and inhibition used by McClelland and Rumelhart and amounts to assigning a strength of .28 to the word unit corresponding to a word stimulus, and a strength of .17 to all the word units that share three letters with a stimulus.
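A sketch of this first modification follows; the .07 and .04 values come from the text, but the code is an illustration rather than McClelland and Rumelhart's own simulation.

```python
def word_strength(stimulus, word, excite=0.07, inhibit=0.04):
    """Letter-to-word activity: +.07 per matching letter position,
    -.04 per mismatching position (four-letter strings assumed)."""
    matches = sum(a == b for a, b in zip(stimulus, word))
    return excite * matches - inhibit * (len(word) - matches)

print(round(word_strength("SINK", "SINK"), 2))  # 0.28, the stimulus's own unit
print(round(word_strength("SINK", "WINK"), 2))  # 0.17, a three-letter neighbor
```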
The probability of a decision based on the lexicon, P(L), and the probability of a correct response based on lexical access, P(C/L), were then computed as usual. The decision rule was also the same, but it deserves a brief comment. To extend McClelland and Rumelhart's analysis of the neighborhood to predictions of proportion correct in a two-alternative forced-choice task, it is necessary to separate nonaligned neighbors from true enemies. That is, word units in the neighborhood that support the incorrect alternative (true enemies) will have a much more disruptive effect on performance than words that support neither alternative (nonaligned neighbors). This is essentially what is done in Equation 2 for our model when we assume that friends contribute to a correct response with a probability of 1, nonaligned neighbors with a probability of .5, and true enemies with a probability of 0.
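Equation 2 itself appears earlier in the article and is not reproduced here, but the weighting scheme it embodies can be sketched as follows: friends count 1, nonaligned neighbors count .5 (a coin-flip guess), and true enemies count 0.

```python
def p_correct_from_neighbors(friends, nonaligned, enemies):
    """Sketch of the Equation-2-style weighting: the expected probability of
    choosing the correct alternative when the decision is driven by these
    word units."""
    total = friends + nonaligned + enemies
    if total == 0:
        return 0.5  # no lexical evidence at all: a pure guess
    return (1.0 * friends + 0.5 * nonaligned + 0.0 * enemies) / total

# SINK probed with S versus W: the neighborhood holds 8 friends, 5 nonaligned
# neighbors (LINK, FINK, RINK, PINK, MINK), and 1 true enemy (WINK).
print(p_correct_from_neighbors(8, 5, 1))  # 0.75
```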
When a neighborhood based on the characteristics of the interactive activation model is substituted for the candidate set generated by our encoding algorithm, and all other operations are identical, the average predicted performance is .80 for words, .84 for pseudowords, and .74 for anagrams. This will not do at all, because the advantage of words over anagrams is too small and, more importantly, words are predicted to be inferior to pseudowords!
McClelland and Rumelhart have already discussed why pseudowords tend to have a high proportion of friends. We add to their analysis a similar account of why words tend to have a lower proportion of friends.
Experimenters select stimulus words in pairs that differ by only a single letter. This ensures that the two alternatives in the target location will both form words in the remaining context. For example, two of Johnston's (1978) high-constraint words were SINK and WINK, with the first position being probed with the alternatives S and W. One consequence of this is that every word stimulus will have at least one friend (itself) and one true enemy (its mate). Experimenters create pseudowords by substituting one of the context letters from the original word pair. For example, we created the pseudowords SONK and WONK by replacing the I's from SINK and WINK with O's. The consequence of this is that every pseudoword has at least one friend (SINK for SONK and WINK for WONK) but no built-in enemy (WONK is not an enemy of SONK because it is not a word). This systematic bias introduced in the selection of the materials results in the words' neighborhoods averaging only 70% friends compared to 79% for the pseudowords. Thus, models based directly on the composition of the neighborhood will predict an advantage of pseudowords over words.
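The materials bias is easy to see in miniature. A sketch, using the SINK/WINK pair from above:

```python
def substitute(word, pos, letter):
    """Replace one context letter, the way pseudowords are built from a pair."""
    return word[:pos] + letter + word[pos + 1:]

pair = ("SINK", "WINK")
pseudo = tuple(substitute(w, 1, "O") for w in pair)  # ('SONK', 'WONK')

# Each word stimulus carries a built-in friend (itself) and a built-in true
# enemy (its mate). Each pseudoword keeps one built-in friend (its source
# word) but no built-in enemy: WONK is not a word, so it cannot be an enemy
# of SONK.
```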
In fairness to the interactive activation model, it should be clearly pointed out that when its encoding assumptions are placed in the context of its own complete model, rather than our complete model, the simulation shows the correct ordering for the words, pseudowords, and single letters used by McClelland and Johnston (1977). We suspect that their full simulation would also produce the correct ordering of our words, pseudowords, and anagrams. The reason for this is that the complete interactive activation model assumes large (parameter value = .21) amounts of inhibition between competing word units. Thus, when a word is presented, the initial strength of the corresponding word unit (about .28) will quickly dominate the initial activity (about .17) of any potential enemy. The effects of lexical access for word stimuli are therefore almost entirely determined by feedback from the corresponding word unit and no others.
This is an interesting contrast between the two models. We assume that both the word advantage and the pseudoword advantage are mediated by decisions based on the activity of a small set of candidate words. McClelland and Rumelhart assume that the word advantage is mediated by feedback from a single word unit (the lexical entry corresponding to the word presented) but that the pseudoword advantage is mediated by feedback from large neighborhoods.
This inherent difference between words and pseudowords in the interactive activation model produces some undesirable fallout. Specifically, if high levels of interword inhibition permit the stimulus word to dominate any potential competition, then the stimulus-driven differences between various words will be eliminated. In short, high levels of interword inhibition mean that the functional amount of activation produced by the presentation of all words will be about the same. Thus, the significant correlations between obtained performance and that predicted from our model would stand unchallenged by the interactive activation model. It is true that the interactive activation model does predict some variation between words that is not stimulus driven, namely, that the resting levels of word units increase with word frequency, but we will show in a subsequent section that this assumption is not a good one.
Throughout the preceding section we have compared the predictive power of our model's candidate sets to that of McClelland and Rumelhart's neighborhood. Our encoding algorithm, which is highly sensitive to visual-confusability effects, seems to enjoy a consistent advantage in the tests we have conducted. However, this should not be viewed as a permanent disadvantage for the interactive activation model, because the neighborhoods we tested conform to those obtained when their parameter, p, for visual-feature extraction is set to 1.0. If a value lower than 1.0 is used, their model will generate neighborhoods sensitive to visual confusability in a way similar to that of our candidate words. However, one of the difficulties in using the interactive activation model as a heuristic device is its inherent complexity. Accordingly, it is difficult to anticipate the results of simulations that have not been conducted. It should not be presumed in advance that the interactive activation model would accurately predict the relative differences between words, pseudowords, and anagrams when only partial information is gained from each letter location. Furthermore, when the contribution of visual confusability is introduced through the partial sampling of subjectively defined features, it is not as likely to be as predictive as when confusability is based on an empirically derived confusion matrix.
The Pseudoword Expectancy Effect
One potential problem for any model that eschews any direct contribution of orthographic knowledge is that the pseudoword advantage seems to be more susceptible to expectancy effects than the word advantage. Carr, Davidson, and Hawkins (1978) have shown that if subjects do not expect to see any pseudowords, then performance on an unexpected pseudoword will be no better than that obtained with irregular nonwords. In contrast, they showed that the advantage of words over irregular nonwords was the same regardless of whether the subject expected all words or all nonwords.
McClelland and Rumelhart can account for this pattern of expectancy effects by assuming that subjects have strategic control over the degree of inhibition between the alphabetum and lexicon. They assume that if subjects expect only words or only irregular nonwords, they will adopt a large value of letter-to-word inhibition. More specifically, the inhibition parameter in their simulation is set so that the excitation produced by three matching letters will be precisely countered by the inhibition from the remaining mismatch. Accordingly, the only word unit that will produce appreciable feedback to the letter units is the word presented. This means that the word advantage will be about the same as always but that the pseudoword advantage will be eliminated.
Our activation-verification model can also predict the pseudoword expectancy results by assuming that subjects have control over one parameter, namely, the word-unit criterion. All of the predictions reported earlier used a word-unit criterion of .24. The average numbers of candidate words produced by the three classes of stimuli were 3.4 for words, 2.1 for pseudowords, and .2 for anagrams. By adopting this fairly lax criterion, the subject can take advantage of beneficial lexical evidence for both words and, more importantly, pseudowords. However, because the word unit corresponding to a word stimulus would exceed a much stiffer criterion, subjects have no motivation to maintain a low criterion and, therefore, to consider larger sets of word units, unless they expect to see some pseudowords.
The expectancy effect was modeled by raising the word-unit criterion from .24 to .29. This resulted in a reduction of the number of candidate words to 1.4 for word stimuli, .40 for pseudowords, and .04 for anagrams. The effect of this on the predicted proportion correct is negligible for words (.841 versus .856) and anagrams (.755 versus .747) but results in a sizable decrease in pseudoword performance (.813 to .760). In summary, raising the word-unit criterion can result in the elimination of the pseudoword advantage while having very little effect on the word advantage. Although a higher criterion does lead to an increase in P(C/L) for word stimuli, this tends to be countered by a decrease in the total amount of lexical activity and, hence, a decrease in P(L).
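A sketch of the criterion manipulation follows. Only the criterion values .24 and .29 come from the text; the activation values below are hypothetical numbers chosen to illustrate how a stiffer criterion empties the candidate set on a pseudoword trial.

```python
def candidates(activations, criterion):
    """Lexical entries whose activation exceeds the word-unit criterion."""
    return {w for w, a in activations.items() if a > criterion}

# Hypothetical word-unit activations for one pseudoword trial.
acts = {"SINK": .27, "SING": .26, "SILK": .25, "SANK": .245}
print(len(candidates(acts, .24)))  # lax criterion: several candidates
print(len(candidates(acts, .29)))  # stiff criterion: the candidate set empties
```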
Both models can predict the pseudoword expectancy effect reported by Carr et al. (1978). Although introspection is at best a weak test of two opposing theories, we yield to the temptation to point out that it seems to us more natural that a subject-controlled strategy might involve the adjustment of a criterion for considering lexical evidence rather than the adjustment of the amount of inhibition between letter and word detectors.
Word-Frequency Effects for Masked Stimuli
Under normal conditions of stimulus presentation, familiar words can be processed more effectively than less familiar ones. For example, high-frequency words are consistently classified faster than low-frequency words in lexical-decision tasks (Landauer & Freedman, 1968; Rubenstein, Garfield, & Millikan, 1970; Scarborough, Cortese, & Scarborough, 1977). Our complete model captures this familiarity effect by assuming that the order of verification is determined, in part, by word frequency. However, it was assumed that the brief stimulus durations used in the present experiments, together with the masking fields, would prevent verification from taking place.
Two studies have systematically manipulated word frequency under conditions of backward masking. In his first experiment, Manelis (1977) selected 32-word sets from the Kucera and Francis (1967) norms with high (94-895), medium (23-74), and low (2-10) frequency counts. Although the proportion of correct recognitions increased with frequency from .775 to .794 to .800, the differences were not significant. In the second experiment, pairs of high- and low-frequency words shared the same critical letter and as many context letters as possible. Again, there were no differences between common (.762) and rare (.757) words. In a set of three experiments described by Paap and Newsome (1980b), 80 words were selected from the Thorndike-Lorge (1944) count so that there were equal numbers of words with frequencies of 1, 2, 5, 14, and 23 per million. Words in the five frequency classes were matched in terms of the identity and position of the target letter. The proportions of correct responses, in increasing order of frequency, were .67, .62, .65, .66, and .65.
The results described above support our assumption that verification does not occur when stimulus words are followed by a mask.
We have also tested for word-frequency effects in the data we obtained with Johnston's (1978) words. The Kucera and Francis frequency counts were determined for each of the 288 words and correlated against both sets of word data. These correlations are shown in parentheses in Table 5. There are no significant correlations between word frequency and proportion correct, and in fact, the trend is toward poorer performance with higher word frequency. However, when a logarithmic transformation is applied to the frequency counts, positive correlations appear in each of the data sets.
Because many of Johnston's (1978) words are quite uncommon and may not be entered in the subjective lexicon of our typical subject, it is possible that this small word-frequency effect reflects nothing more than the probability of the word appearing in the lexicon. This interpretation was investigated by sequentially removing the words with the lowest frequency from the original set of 288 words. As shown in Table 5, the correlation between the logarithm of word frequency and performance systematically decreases as rare words are removed from the sample. When only words with frequencies greater than three are considered, there is no effect of relative frequency.
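The sequential-removal analysis behind Table 5 can be sketched as follows; the per-item data are hypothetical, and statistics.correlation requires Python 3.10 or later.

```python
import math
from statistics import correlation  # Python 3.10+

def log_freq_correlations(items, cutoffs=(0, 1, 2, 3)):
    """For each cutoff, keep words with frequency > cutoff and correlate
    log frequency with obtained proportion correct, as in Table 5.
    `items` is a list of (frequency, proportion_correct) pairs."""
    results = {}
    for cut in cutoffs:
        kept = [(f, pc) for f, pc in items if f > cut]
        results[cut] = (len(kept),
                        correlation([math.log10(f) for f, _ in kept],
                                    [pc for _, pc in kept]))
    return results

# Example call on hypothetical data:
# log_freq_correlations([(1, .60), (2, .62), (5, .66), (30, .70), (90, .69)])
```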
In order to further support our claim that many of Johnston's (1978) words are unfamiliar to our population of undergraduate subjects, we had 147 students classify each of the words as either (a) a word that I know the meaning of, (b) a word that I don't know the meaning of, or (c) a nonword. Thirteen words were classified as nonwords by a majority of the subjects (LAVE, TING, BOON, CRAG, WHET, JILL, BOLL, WILE, HONE, HEWN, FIFE, BANS, VATS). Furthermore, for many words the responses were distributed quite evenly across the three categories (e.g., FIFE, BANS, VATS, TEEM, HEMP, PENT, WANE, NAVE, SLAT). When we removed the 35 words that are most often classified as nonwords (and whose meaning is known by only a minority of the subjects), there were no significant correlations between the data for the individual words and the logarithm of their frequency. This purging of our lexicon also led to a slight improvement in the correlation between predicted and obtained performance for the 288 words, r = .32.
These tests lead us to conclude that masking almost always prevents verification and that there is no need to build word-frequency effects into our encoding algorithm.
In order to make sure that word frequency could not enhance the ability of our encoding algorithm to predict variation between words, we tried several different ways of having the logarithm of word frequency modulate the activity of the word units. Our basic strategy, like that of McClelland and Rumelhart, was to decrease the stimulus-driven activity of word units in inverse relation to their frequency. Because the correlation between our obtained word data and log word frequency was .16, we searched for a frequency effect that would produce a comparable correlation between our predicted data and log word frequency. The desired impact of word frequency was achieved when the amount of stimulus-driven activity was reduced by about 5% for each half-log unit drop in word frequency. This means that the most common words in our lexicon would receive no reduction in activity, and those with a frequency of only one would be reduced by 40%. Because the word-frequency effect leads to an overall reduction in lexical activity, it was necessary to lower the word-unit criterion substantially (.14) in order to maintain candidate sets of about 3.3 words. Under these conditions the predicted performance for all words was exactly the same (PPC = .84) as that predicted from the original model that has no provision for word-frequency effects.
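The frequency modulation that was tried can be sketched as follows. The 5%-per-half-log-unit figure is from the text; the reference count of 10,000 for the most common word is an illustrative assumption (it is the value under which a frequency-one word loses the 40% reported above).

```python
import math

def scaled_activity(activity, freq, max_freq):
    """Reduce stimulus-driven activity by 5% for each half-log-unit drop
    in frequency below the most common word in the lexicon."""
    half_log_drops = (math.log10(max_freq) - math.log10(freq)) / 0.5
    return activity * (1 - 0.05 * half_log_drops)

# max_freq = 10_000 is an assumed, illustrative count for the most common word.
print(round(scaled_activity(0.30, 10_000, 10_000), 2))  # 0.3: no reduction
print(round(scaled_activity(0.30, 1, 10_000), 2))       # 0.18: reduced by 40%
```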
The question of interest can now be answered: Does word frequency enhance the model's ability to account for variation between words? No, the correlations between predicted data and the two sets of obtained data show that introducing word-frequency effects produces no change for one data set and a decline of .06 for the other.
In summary, we can find no evidence in our data or elsewhere that two-alternative forced-choice performance on masked word displays shows a word-frequency effect. This is consistent with the activation-verification model, because we assume that word frequency does not affect activation of the word units but will affect the order of verification when the stimulus-presentation conditions permit verification to occur.
The magnitude of the word-frequency effects generated by the interactive activation model is not known. Although their model specifically assumes that the resting activity of word units is determined by familiarity, other factors, such as the decision rules adopted for the forced-choice task, may severely attenuate the initial frequency differences between word units and thereby permit the prediction of no word-frequency effect. A fair conclusion with respect to word frequency is that the activation-verification model can correctly predict the magnitude of familiarity effects in both tachistoscopic and reaction time studies and that the interactive activation model may be able to do so.
Reaction Time Studies
As we mentioned in the introduction, the concepts embodied in our activation-verification model were originally developed in the context of reaction time studies using lexical-decision and naming tasks. With this history it is to be expected that the model can handle a variety of reaction time data. There are too many findings to cover in detail here, but it may be useful to review some of this earlier work to provide some idea about the performance of the model. Because the interactive activation model has not been specifically applied to lexical-decision data, we cannot draw specific comparisons. However, the interactive activation model has been used to explain the effects of semantic context and word frequency in other reaction time tasks (e.g., naming tasks), and we will comment on the applicability of analogous explanations of findings from the lexical-decision task.
The interactive activation model and our activation-verification model differ about the nature of the effects of prior semantic context and word frequency when stimuli are presented for normal durations and without masking.
Table 5
Correlations Between Obtained Proportion Correct and Log Word Frequency

                         Word frequencies included
Data set            All          All > 1      All > 2      All > 3
Word set 1          .16 (-.04)   .14 (-.06)   .09 (+.04)   .04 (-.08)
Word set 2          .14 (-.09)   .11 (-.11)   .07 (+.01)   .04 (-.14)
Number of words     288          249          228          220
Critical r (.05)    .12          .13          .13          .13

Note. Correlations between proportion correct and the absolute word-frequency counts are shown in parentheses. "All > 1" means all stimulus words with a frequency greater than 1.
In the interactive activation model, these two factors both have the effect of increasing activation levels in relevant word units. The base activation level of the word units increases as a function of word frequency. Also, word units that are related to the context have increased activity levels relative to word units for unrelated words. Perhaps word units that are inconsistent with the context would have depressed activity levels as well.
In contrast, our activation-verification model places the effects of word frequency subsequent to the activation of word units. Word frequency determines the order in which lexical units are verified in the verification process. The activation-verification model also assumes that context increases the activity level of lexical units that are related to the context, and this activity increase may be high enough to cause the word units to exceed the criterion for inclusion in the candidate set. The verification process is then responsible for the analysis of stimulus information. Thus, verification can prevent a premature response. There appears to be no comparable mechanism in the interactive activation model.
In lexical-decision tasks, there is evidence that context and frequency have different effects on the time required to classify a letter string as a word. Becker and Killion (1977) found that context interacts with the quality of the visual stimulus whereas frequency and visual quality show additive effects. These findings imply that frequency and context exert their influence on performance in different ways, contrary to expectations derived from the interactive activation model.
McDonald (1980) developed a computer simulation of the verification model (which was the precursor to our activation-verification model). McDonald's simulation produced both the additivity of frequency and visual quality and the interaction of context and visual quality. Further, as we discussed earlier, there are apparently no word-frequency effects in the word-superiority paradigm. This result follows naturally from our model because frequency does not affect the activation process, which is the basis of the decision in the word-superiority paradigm.
The activation-verification model is also consistent with findings on effects of context on the classification of nonwords in the lexical-decision task. Several models (including the interactive activation model) handle context effects by inducing a bias in favor of related words. This approach leads to the expectation that nonwords that are very similar to particular words should be erroneously classified as words more often in a related context than in an unrelated context. For example, the nonword NERSE should be misclassified more often following a word related to NURSE (e.g., DOCTOR) than following an unrelated word (e.g., LAMP).
In contrast, our model assumes that lexical decisions are made on the basis of verification rather than activation and that the quality of the verification process is not affected by context. Context affects the availability of lexical units for verification, but not the quality of the verification process itself. Thus, context should have no effect on the likelihood of classifying a nonword as a word.
The evidence on the classification of nonwords supports the predictions of the activation-verification model. Schvaneveldt and McDonald (1981) found no effect of context on classifying nonwords when stimuli remained available until the response occurred. Context did facilitate response time to words in their experiments. Other studies have produced similar results (Antos, 1979; Lapinski, 1979; McDonald, 1977, 1980; Lapinski & Tweedy, Note 8). O'Connor and Forster (1981) concluded that a bias explanation was ruled out by their findings even though one of their experiments showed bias effects. In that experiment, however, error rates were over 35% on the critical items, which is unusually high. In the context of the activation-verification model, such error rates suggest that subjects are responding without verification on a substantial proportion of the trials. If verification is optional, speed-accuracy trade-offs may be partly due to the probability of verification in a particular task. Schvaneveldt and McDonald (1981) also showed bias effects of context with a brief stimulus display followed by a masking stimulus. As we argued earlier, we assume that these stimulus conditions prevent verification.
Overall, the activation-verification model appears to handle a considerable amount of data from reaction time experiments (see Becker, 1980, and McDonald, 1980, for further examples). We believe that one important characteristic of the model lies in the independent top-down analysis of the stimulus (verification) that is sensitive to deviations from the stored representation of a word. These deviations might be further divided into permissible (identity preserving) and illegal (identity transforming) distortions of the stored representation. Verification, then, amounts to determining whether the stimulus impinging on the senses could reasonably be interpreted as a particular word after context or the senses had suggested that the stimulus might be that word.
We have presented our solution to what we perceive as an important theoretical problem in pattern-recognition theory in general and word recognition in particular. That problem is to specify the nature and interaction of bottom-up and top-down information-processing activities in recognition. There seems to be wide acceptance of the necessity for both of these types of processes. There is less agreement about just what they are and how they interact. Our solution to this theoretical problem provides a top-down process that involves comparing stimulus information to prototypes stored in memory. As such, the top-down process may enhance perception of discrepancies rather than induce a perceptual or decision bias in favor of expected stimuli. We believe that the evidence supports our view, but we are eager to pursue the matter further with additional research. We hope that our theoretical analysis and the contrasts of the two theoretical approaches will help to focus further experimentation.
Reference Notes

1. Paap, K. R., & Newsome, S. L. The role of word-shape and lexical constraint in the word superiority effect. In C. Cofer (Chair), Some new perspectives on word recognition. Symposium presented at the meeting of the Southwestern Psychological Association, Houston, April 1981.
2. Paap, K. R., & Newsome, S. L. Lexical constraint: Redefined and resurrected. Paper presented at the meeting of the Psychonomic Society, Philadelphia, November 1981.
3. Paap, K. R., Newsome, S. L., & McDonald, J. E. Further tests of the contribution of perceptual confusions to the WSE. Paper presented at the meeting of the Psychonomic Society, St. Louis, November 1980.
4. Schvaneveldt, R. W., & McDonald, J. E. The verification model of word recognition. In C. Cofer (Chair), Some new perspectives on word recognition. Symposium presented at the meeting of the Southwestern Psychological Association, Houston, April 1981.
5. Becker, C. A., Schvaneveldt, R. W., & Gomez, L. Semantic, graphemic, and phonetic factors in word recognition. Paper presented at the meeting of the Psychonomic Society, St. Louis, November 1973.
6. Paap, K. R., Newsome, S. L., McDonald, J. E., & Schvaneveldt, R. W. The activation-verification model: The effects of cuing, masking, and visual angle. Manuscript in preparation, 1982.
7. Massaro, D. W. Simulating letter and word recognition: A fuzzy logical model of integrating visual information and orthographic structure in reading. Paper presented at the European Conference on Artificial Intelligence, Orsay, France, July 1982.
8. Lapinski, R. H., & Tweedy, J. R. Associate-like nonwords in a lexical-decision task: Paradoxical semantic context effects. Paper presented at the Mathematical Psychology meetings, New York University, August 1976.
References

Antos, S. J. Processing facilitation in a lexical decision task. Journal of Experimental Psychology: Human Perception and Performance, 1979, 5, 527-545.
Becker, C. A. Allocation of attention during visual word recognition. Journal of Experimental Psychology: Human Perception and Performance, 1976, 2, 556-566.
Becker, C. A. Semantic context effects in visual word recognition: An analysis of semantic strategies. Memory and Cognition, 1980, 8, 493-512.
Becker, C. A., & Killion, T. H. Interaction of visual and cognitive effects in word recognition. Journal of Experimental Psychology: Human Perception and Performance, 1977, 3, 389-401.
Carr, T. H., Davidson, B. J., & Hawkins, H. L. Perceptual flexibility in word recognition: Strategies affect orthographic computation but not lexical access. Journal of Experimental Psychology: Human Perception and Performance, 1978, 4, 674-690.
Johnston, J. C. A test of the sophisticated guessing theory of word perception. Cognitive Psychology, 1978, 10, 123-153.
Kucera, H., & Francis, W. N. Computational analysis of present-day American English. Providence, R.I.: Brown University Press, 1967.
Landauer, T., & Freedman, J. Information retrieval from long-term memory: Category size and recognition time. Journal of Verbal Learning and Verbal Behavior, 1968, 7, 291-295.
Lapinski, R. H. Sensitivity and bias in the lexical decision task. Unpublished doctoral dissertation, State University of New York at Stony Brook, 1979.
Luce, R. D. A threshold theory for simple detection experiments. Psychological Review, 1963, 70, 61-79.
Manelis, J. Frequency and meaningfulness in tachistoscopic word perception. American Journal of Psychology, 1977, 99, 269-280.
Massaro, D. W. Perception of letters, words, and nonwords. Journal of Experimental Psychology, 1973, 100, 349-353.
Massaro, D. W. Letter information and orthographic context in word perception. Journal of Experimental Psychology: Human Perception and Performance, 1979, 5, 595-609.
Massaro, D. W., Venezky, R. L., & Taylor, G. A. Orthographic regularity, positional frequency, and visual processing of letter strings. Journal of Experimental Psychology: General, 1979, 108, 107-124.
Massaro, D. W., Taylor, G. A., Venezky, R. L., Jastrzembski, J. E., & Lucas, P. A. Letter and word perception: Orthographic structure and visual processing in reading. Amsterdam: North Holland, 1980.
McClelland, J. L., & Johnston, J. C. The role of familiar units in perception of words and nonwords. Perception & Psychophysics, 1977, 22, 249-261.
McClelland, J. L., & Rumelhart, D. E. An interactive activation model of context effects in letter perception: Part 1. An account of basic findings. Psychological Review, 1981, 88, 375-407.
McDonald, J. E. Strategy in a lexical decision task. Unpublished master's thesis, New Mexico State University, 1977.
McDonald, J. E. An information processing analysis of word recognition. Unpublished doctoral dissertation, New Mexico State University, 1980.
Morton, J. Interaction of information in word recognition. Psychological Review, 1969, 76, 165-178.
O'Connor, R. E., & Forster, K. I. Criterion bias and search sequence bias in word recognition. Memory & Cognition, 1981, 9, 78-92.
Paap, K. R., & Newsome, S. L. Do small visual angles produce a word superiority effect or differential lateral masking? Memory & Cognition, 1980, 8, 1-14. (a)
Paap, K. R., & Newsome, S. L. A perceptual-confusion account of the WSE in the target search paradigm. Perception & Psychophysics, 1980, 27, 444-456. (b)
Rubenstein, H., Garfield, L., & Millikan, J. Homographic entries in the internal lexicon. Journal of Verbal Learning and Verbal Behavior, 1970, 9, 487-494.
Rumelhart, D. E., & McClelland, J. L. An interactive activation model of context effects in letter perception: Part 2. The contextual enhancement effect and some tests and extensions of the model. Psychological Review, 1982, 89, 60-94.
Scarborough, D. L., Cortese, C., & Scarborough, H. S. Frequency and repetition effects in lexical memory. Journal of Experimental Psychology: Human Perception and Performance, 1977, 3, 1-17.
Schvaneveldt, R. W., & McDonald, J. E. Semantic context and the encoding of words: Evidence for two modes of stimulus analysis. Journal of Experimental Psychology: Human Perception and Performance, 1981, 7, 673-687.
Schvaneveldt, R. W., Meyer, D. E., & Becker, C. A. Lexical ambiguity, semantic context, and visual word recognition. Journal of Experimental Psychology: Human Perception and Performance, 1976, 2, 243-256.
Thorndike, E. L., & Lorge, I. The teacher's word book of 30,000 words. New York: Teacher's College Press, 1944.
Received October 21, 1981
Revision received January 28, 1982