ArticlePDF Available

An Examination of Stimulus Control in Fluency-Based Strategies: SAFMEDS and Generalization

September 2013
Journal of Behavioral Education 22(3)

September 2013
22(3)

DOI:10.1007/s10864-013-9172-6

Authors:

James N Meindl

The University of Memphis

Jonathan W Ivy

Pennsylvania State University

Neal Miller

The University of Memphis

Nancy A Neef

The Ohio State University

Show all 5 authorsHide

Fluency-based strategies such as Say All Fast a Minute Each Day Shuffled (SAFMEDS) effectively promote fluent responding (i.e., high rate and accuracy). It is possible, however, that the stimulus control developed through these activities inhibits stimulus generalization. We investigated this concern in a two-part study with college students. Study 1 assessed generalization of rates of responding from training with SAFMEDS to a novel set of equivalent SAFMEDS flashcards. Results indicate that SAFMEDS promoted fluent responding, but rates of responding decreased during generalization probes. Furthermore, higher rates of responding during training were correlated with a greater decrease in rates of responding during generalization probes. This may indicate that students attend to irrelevant stimulus features of SAFMEDS during training. Study 2 examined the effects of embedding multiple-exemplar training within SAFMEDS. Results indicate that multiple-exemplar training can promote generalization of accurate and high-rate responding when incorporated in a SAFMEDS activity.

Scatterplot depicting the relation between student fluency on the final training timing and the change in fluency from the final timing to the generalization test

…

sual diagram depicting format of training and generalization flashcard sets

…

Average change in fluency and accurate responding

…

Figures - uploaded by James N Meindl

Content may be subject to copyright.

Content uploaded by James N Meindl

Content may be subject to copyright.

ORIGINAL PAPER

An Examination of Stimulus Control in Fluency-Based

Strategies: SAFMEDS and Generalization

James N. Meindl •Jonathan W. Ivy •Neal Miller •

Nancy A. Neef •Robert L. Williamson

ÓSpringer Science+Business Media New York 2013

Abstract Fluency-based strategies such as Say All Fast a Minute Each Day

Shufﬂed (SAFMEDS) effectively promote ﬂuent responding (i.e., high rate and

accuracy). It is possible, however, that the stimulus control developed through these

activities inhibits stimulus generalization. We investigated this concern in a two-

part study with college students. Study 1 assessed generalization of rates of

responding from training with SAFMEDS to a novel set of equivalent SAFMEDS

ﬂashcards. Results indicate that SAFMEDS promoted ﬂuent responding, but rates of

responding decreased during generalization probes. Furthermore, higher rates

of responding during training were correlated with a greater decrease in rates of

responding during generalization probes. This may indicate that students attend to

irrelevant stimulus features of SAFMEDS during training. Study 2 examined the

effects of embedding multiple-exemplar training within SAFMEDS. Results indi-

cate that multiple-exemplar training can promote generalization of accurate and

high-rate responding when incorporated in a SAFMEDS activity.

Keywords SAFMEDS Generalization Stimulus control Fluency-based

strategies Multiple-exemplar training College instruction

J. N. Meindl (&)N. Miller R. L. Williamson

The University of Memphis, 400A Ball Hall, Memphis, TN 38152, USA

e-mail: jnmeindl@memphis.edu

J. W. Ivy

Mercyhurst University, Erie, PA, USA

N. A. Neef

The Ohio State University, Columbus, OH, USA

123

J Behav Educ

DOI 10.1007/s10864-013-9172-6

Introduction

A common concern among educators is increasing the ﬂuency with which a student

can perform a targeted skill (Binder 1996; Dougherty and Johnston 1996; Doughty

et al. 2004). Fluency, which is a measure of the number of correct responses per unit

of time, has been suggested to be a key measure of proﬁciency in multiple domains,

including reading (Chafouleas et al. 2004; Weinstein and Cooke 1992), writing

(Van Houten et al. 1974), math facts (Miller et al. 1995; Codding et al. 2010), and

speaking a foreign language (Shinamune and Jitsumori 1999). The goal of ﬂuency-

based strategies is to engage in a high rate of correct responses in a short amount of

time. Several authors have argued that responding in this manner should result in

improved educational outcomes including increased retention, endurance, and

application of skills (Binder 1996; Brady and Kubina 2010; Haughton 1980;

Johnson and Layng 1996; Weiss 2001).

One common method for increasing ﬂuency is the use of ﬂashcards. Research has

suggested that students routinely utilize ﬂashcards as a way of repeatedly practicing

an academic skill (Golding et al. 2012; Kornell and Bjork 2008). The efﬁcacy of

ﬂashcards has been examined across a wide range of academic content (e.g.,

MacQuarrie et al. 2002; Mangundayao et al. 2013), and with students ranging from

elementary school age (Volpe et al. 2011) to adults (Schmidmaier et al. 2011). Say

All Fast a Minute Each Day Shufﬂed (SAFMEDS) ﬂashcards is one speciﬁc way

that educators have used ﬂashcards to explicitly promote ﬂuent performance

(Korinek and Wolking 1984). In a SAFMEDS activity, students are presented

ﬂashcards on which a discriminative stimulus is printed on one side and the

associated correct response is presented on the other side. If teaching term-deﬁnition

pairs, for example, the term is printed on the opposite side of the card from the

deﬁnition. The student would then read each deﬁnition and attempt to provide the

term as quickly as possible during a timed trial. At the end of the trial, the rate of

correct and incorrect responding would be calculated, the deck shufﬂed, and the

student would attempt to improve upon the previous rate of responding. This

strategy is routinely used when teaching functionally equivalent stimulus–stimulus

pairs. SAFMEDS has been shown to be effective in building ﬂuency across a range

of subject areas, including teaching key concepts to undergraduate psychology

majors (Bower and Orgel 1981), teaching terminology to undergraduate education

majors (Eshleman 1985), and teaching the names of inﬂuential authors to graduate

students in special education (Korinek and Wolking 1984).

Although strategies such as SAFMEDS may promote fast and accurate responding,

it is not entirely clear that this will result in all of the desired outcomes, such as

retention, persistence, and generalization that authors have hypothesized to result from

such performance (Doughty et al. 2004). In fact, because practice of this kind involves

developing stimulus control over responding, there is a risk that irrelevant features of

the stimulus (e.g., word placement and format) may come to control responding

(Chafouleas et al. 2004). When ﬁrst learning a term-deﬁnition pair, for example,

students typically read the entire deﬁnition before being able to provide the correct

term. That is to say, the stimulus controlling the correct response is the entire

deﬁnition. As students become more proﬁcient with the SAFMEDS cards and are able

J Behav Educ

123

to provide a correct response in less time, it is possible for stimulus control to shift from

the entire deﬁnition to some other aspect of the card. Students may begin to attend to

unique features of the deﬁnition such as key words or phrases, or the general

appearance of the deﬁnition or physical shape of the deﬁnition. As the unique stimuli

controlling the response in this case are educationally irrelevant, this may be

detrimental to future learning. Although performance may reﬂect a high degree of

ﬂuency, the range of stimuli controlling behavior may be restricted (Litrownik et al.

1978). A response controlled by irrelevant, or restricted, features of a stimulus is

unlikely to generalize to different environmental conditions where such features are

likely to be absent (e.g., when a student comes across a real-world example of a

concept or reads the deﬁnition of the concept in a journal article). Responding to

stimuli that share the same functional property but that differ from those previously

taught describes the concept of stimulus generalization.

The risk of developing restricted stimulus control is most likely the reason for the

requirement that students shufﬂe ﬂashcards after each time practicing a SAFMEDS

activity. Shufﬂing the cards ensures that student responses are not controlled by one

possible irrelevant aspect of the card, namely its sequence or position relative to other

cards. However, to date, no published research has explicitly investigated the relation

between SAFMEDS ﬂuency and stimulus generalization. If restricted stimulus control

is indeed being promoted through the SAFMEDS activity, it may be necessary to alter

the activity in some way in order to promote stimulus control by the appropriate

features of the stimulus, and thus facilitate generalization. One potential solution to

this problem would be to embed a generalization promoting strategy (e.g., multiple-

exemplar training) within the SAFMEDS activity. Fluency-based strategies are likely

to produce control of responding by a relatively narrow range of stimuli. Strategies that

promote generalization, on the other hand, are designed to ensure that responding is

under the control of a broader class of stimuli (Stokes and Baer 1977). One such

strategy is the use of multiple-exemplars, which involves teaching a skill with more

than one example from the range of possible stimuli that you want to evoke a given

response (e.g., Carr 2003; Ducharme and Feldman 1992; Reeve et al. 2007). This type

of generalized responding may be beneﬁcial in a classroom setting. In the case of

learning term-deﬁnition pairs, for example, generalized responding may enable the

student to provide a correct response to a variety of deﬁnitions all associated with the

same term. In this two-part study, we examined following questions:

1. As students become ﬂuent with one set of ﬂashcards in a SAFMEDS activity, is

responding controlled in part by irrelevant aspects of the ﬂashcard?

2. As students become ﬂuent with one set of ﬂashcards in a SAFMEDS activity, to

what extent does responding (both ﬂuency and accuracy) generalize to a novel

set of ﬂashcards with differently worded content?

3. Is there a relation between the level of ﬂuency achieved on one set of ﬂashcards

and the degree to which this ﬂuency generalizes to a novel set of ﬂashcards with

differently worded content?

4. If stimulus generalization does not occur naturally, does the inclusion of

multiple-exemplar training into a SAFMEDS activity promote such an

outcome?

J Behav Educ

123

Study 1

Method

Participants and Setting

One male and ﬁve female graduate students enrolled in a 10-week single-subject

design course at a large midwestern university participated in this study. Classes

were held once per week. The participants came from a variety of academic

programs including special education, school psychology, and sports ﬁtness.

Although the procedures of the study were considered part of typical classroom

instruction and were therefore mandatory for all students, all six students gave

consent for their data to be used by the authors, and the university’s Institutional

Review Board (IRB) approved the research.

All experimental sessions were conducted in a university classroom. The room

was furnished with enough desks and chairs to accommodate approximately 30

students. This room was used for the purposes of the study as well as to deliver

course content. At least two researchers were present during each day of study.

Materials

Two sets of ﬂashcards (Set A and Set B) were created by the authors prior to the

start of the course. Both Set A and B contained the same 45 term-deﬁnition pairs,

taken from the course textbook (Cooper et al. 2007). The difference between Set A

and Set B was in formatting only; whereas Set A deﬁnitions were single spaced and

centered, Set B deﬁnitions had 1.5 spacing, were left justiﬁed, and had widened

margins. Thus, although both Set A and Set B contained identically worded

deﬁnitions, the visual appearance of the deﬁnitions was different.

Both Set A and Set B were divided into three subsets of 15 ﬂashcards: sets A1,

A2, A3, and sets B1, B2, and B3. The subsets from A and B contained matching

term-deﬁnition pairs, so that A1 matched B1, A2 matched B2, and A3 matched B3.

During the ﬁrst training block, A1 was used as the training set, and B1 as the

generalization set. During the second training block, B2 was the training set, and A2

was the generalization set. During the third training block, A3 served as the training

set, and B3 as the generalization set.

Additional materials included timers, which were used on testing days, as well as

an Internet timer that visually displayed a countdown display and sounded a loud

tone when the timer reached zero.

Procedures

Each student received a new set of training ﬂashcards (A1, B2, A3) on the ﬁrst,

fourth, and seventh day of class. Generalization testing occurred after training on the

third, sixth, and ninth day of class, giving each student three class sessions of

training prior to each testing. The three-day card distribution and generalization

testing schedule was used as this allowed for three testings within a 10-week course.

J Behav Educ

123

At the start of the course, students were instructed in the proper use of the

ﬂashcards. Students were instructed to ensure all cards faced the same direction (i.e.,

correctly oriented, all deﬁnitions facing the same direction), to shufﬂe all cards

before and after any training, to read the deﬁnition silently and attempt to provide

the term, and to attempt to complete the set as quickly as possible. In addition,

students were instructed to check the accuracy of each response during training by

immediately ﬂipping the card over to compare their response with the correct term

on the back, and to then sort the cards into ‘‘correct’’ and ‘‘incorrect’’ piles. Students

were told that if they did not know an answer they were to place the card in the

‘‘incorrect’’ pile. These instructions were repeated periodically throughout the

study.

Training Days At the start of each class, students engaged in approximately

15 min of training with the ﬂashcards. Training days consisted of 2 min of

individual review, two timings during solo practice with the training ﬂashcards, two

timings during paired student practice, followed by two ﬁnal timings with the

researcher. Thus, each day of training afforded students a short review session

followed by six timings.

During the independent review, students were instructed to review the training

ﬂashcards at their own pace. They were not instructed to do so as quickly as possible

or to count their correct and incorrect responses. As classes occurred only one time per

week, the purpose of the independent review component of training was to allow

students to familiarize themselves with the cards prior to attempting to respond quickly.

During solo practice, students independently engaged in two timings with the

training ﬂashcards. Whereas traditional SAFMEDS activities involve 1-min timings

(e.g., Eshleman 1985), these practice timings lasted 30 s due to the relatively small

number of cards included in each set. A timer was set for 30 s, students were

instructed to begin training with the ﬂashcards, and the timer was started. When the

timer reached zero, a loud tone sounded and students were told to stop practicing.

They were then instructed to count the cards in the correct pile. After each timing,

the students were asked to raise their hand if they had more correct responses on this

timing than on previous timings. This informal procedure was intended to reinforce

participation in the activity and encourage students to set aims for increasing

ﬂuency. Following solo practice, students engaged in two additional timings with a

partner, which is common practice (see Merbitz et al. 2004, for example). The

paired student practice training procedures were similar to those in the solo practice;

students engaged in two timings and counted correct responses at the end of each

timing. During this practice, a student responded to the SAFMEDS cards, while a

partner watched and ensured the student gave the correct response for each card.

The partner interacted with the student only after that student had completed

responding to the SAFMEDS cards.

Following the paired student practice, students engaged in two ﬁnal timings with

a researcher. Student performance on these timings was recorded by the researcher

and constitutes the data on Train 1 and Train 2 days (see Fig. 1). When the students

practiced with the researcher, rather than countdown from 30 s as during solo and

J Behav Educ

123

Fig. 1 Change in ﬂuency scores across participants. Change in ﬂuency scores is calculated by

subtracting the second timing rate from the ﬁrst timing rate

J Behav Educ

123

paired practice, the timer counted up. Students were instructed to go as quickly as

possible, but there was no upper limit to the amount of time available to complete

the set. This approach to timing the ﬂashcard activity differed from the traditional

1-min timings used in previous descriptions of SAFMEDS (e.g., Eshleman 1985).

However, the count-up procedure was necessary in that it enabled students to

complete the entire set of ﬂashcards and thus contact every card in the training set.

After each timing, the duration of the timing and the number of correct and incorrect

responses were recorded by the researcher.

Testing Days On testing days, students engaged in SAFMEDS practice similar to

practice on training days. The only difference between training and testing days was

in regard to the second timing with the researcher. During the two timings with the

researcher, the ﬁrst timing was identical to that of a training day. On the second

timing, however, students were given a generalization ﬂashcard set, which

contained identical term-deﬁnition pairs that were formatted differently. Students

were handed the generalization ﬂashcard set and told that it contained the same

term-deﬁnition pairs as the training set and that they should attempt to go through

the set as quickly as possible. As during training days with the researcher, a count-

up timing procedure was used, allowing the student to go through the entire

generalization set. Student performance was recorded by the researcher and

constitutes the data on test days (see Fig. 1).

Response Deﬁnition and Measurement

The number of correct and incorrect responses was recorded. A correct response was

deﬁned as a match between the vocal response of the student and the term printed on

the back of the ﬂashcard. This was observable when the student turned the card over

after a response to check his/her own accuracy. Only an identical match was scored as

correct. For example, if the student said ‘‘reinforcement’’ and the correct term was

‘‘negative reinforcement,’’ the response was scored as incorrect. An incorrect response

was recorded whenever the vocal response of the student did not match the term

printed on the card. When a student responded to the SAFMEDS cards, the card was

placed in one of two piles indicating a correct or incorrect response. The researcher

ensured the card was placed in the correct pile. At the end of the timing, the researcher

counted and recorded the number of cards in each pile.

In addition to measuring correct and incorrect responding, the total time it took to

complete the set was recorded. The timer was started when the researcher said

‘‘Go,’’ and was stopped at the moment the student placed the ﬁnal card in either the

correct or incorrect pile. Recording the duration as well as the number of correct and

incorrect responses enabled ﬂuency (correct responses per minute) to be calculated.

Treatment Integrity and Interobserver Agreement (IOA)

An independent observer assessed treatment integrity and interobserver agreement

on 48 % of the testing trials. To assess treatment integrity, the observer answered a

J Behav Educ

123

series of questions that asked whether or not the researcher ran the correct number

of practice timings, and whether the count-up timings were run in the manner

described above. Mean treatment integrity for all sessions was 100 %. In addition,

the observer independently recorded the duration of each testing session as well as

the number of correct and incorrect student responses. IOA was then calculated for

the duration measure by using mean duration-per-occurrence. The two duration

measures for each occurrence were divided (smaller duration/larger duration), and

the resulting numbers added together for each occurrence. This sum was then

divided by the total number of occurrences and multiplied by 100. IOA for correct

and incorrect responses was calculated using exact count. The records of the

researcher and independent observer were compared, and the number of observa-

tions for which both observers recorded the same number for correct and incorrect

responses was counted. This number was then divided by the total number of

observations on which IOA was collected. This resulted in a number indicating the

percentage of the total instances in which both researcher and independent observer

recorded the same number of cards in each pile was recorded to produce correct and

incorrect data measures. A ﬂuency measure was achieved by converting this

measure to rate-per-minute. Changes in ﬂuency from the ﬁrst timing to the second

timing with the researcher were calculated by subtracting the ﬂuency measure of the

second timing from the ﬂuency measure of the ﬁrst timing. A positive number

indicates that the student responded more ﬂuently on the second timing, whereas a

negative number indicates the student was less ﬂuent (i.e., fewer correct responses

per minute) on the second timing. The same method was applied on generalization

test days to determine the change in ﬂuent responding from the training set to the

generalization set. Mean IOA for all sessions was 99.4 % for duration and 98.9 %

for correct and incorrect responses.

Experimental Arrangement

A non-traditional single-subject arrangement was employed to investigate the extent

to which student responding came under the control of irrelevant features of the

ﬂashcard. This experimental arrangement was selected because it simultaneously

adhered to single-subject baseline logic (e.g., consisted of the components of

prediction, veriﬁcation, and replication) and allowed us to meaningfully use the

ﬂashcards to provide instruction to our students. Rates of correct responding

(ﬂuency) were compared under two distinct stimulus conditions (ﬂashcard sets in

the training and generalization format) across three successive ﬂashcard sets.

Fluency was assessed twice each day which enabled us to make a prediction

regarding changes in ﬂuency from the ﬁrst to second timing (an important

consideration as the generalization timing was always conducted as a second

timing) and replicate this change. Further, changes in ﬂuency from training to

generalization ﬂashcard sets were assessed three different times resulting in two

within-subject replications. Although the experimental arrangement was created to

ﬁt our research questions and the classroom needs, the arrangement is quite similar

to a repeated acquisition design (Boren and Devine 1968; Kennedy 2005) as (a) we

used three equivalent learning tasks (ﬂashcard sets A, B, and C), (b) we compare

J Behav Educ

123

performances on one task (training cards) to performances on another (generaliza-

tion cards), and (c) we have two experimental conditions (ﬂashcard formats).

Statistical Analysis

The data were also analyzed statistically. First, a correlation coefﬁcient (Pearson

product-moment) was used to analyze any relative correlation between ﬂuency rates

on the differently formatted ﬂashcards. The Pearson product-moment correlation

coefﬁcient is used to analyze the statistical signiﬁcance of any relative correlation

between two variables but does not indicate a cause and effect relationship.

An additional nonparametric statistical analysis was chosen to further evaluate

changes in ﬂuency rates based on card format. The nonparametric analysis was

chosen due to the nature of the ﬂuency data. Fluency values may not be evaluated as

being derived from a normal distribution and can be considered rank ordered. Thus,

both nonparametric and parametric statistical analyses were required. Speciﬁcally,

the Wilcoxon matched-pairs signed-ranks test was chosen in an effort to evaluate

whether or not the median of the difference in average ﬂuency scores from timing 1

to timing 2 across all training days versus the average change in ﬂuency from the

ﬁrst timing to the generalization timing across all generalization test days equals

zero. This test utilizes paired data from each participant, meaning identical subjects

are measured twice under different conditions (in this case timing one and timing

two), and then the paired data from each subject are tested across all participants. In

this way, group analysis of ﬂuency differences between the ﬁrst timing and the

second timing is carried out as a whole to test for any statistical signiﬁcance of

paired sample results.

Results and Discussion

Figure 1depicts changes in ﬂuency for each participant for each set of training cards

on both training and testing days. Recall that on all Train 1 and 2 days, both the ﬁrst

and second timing were conducted using the training ﬂashcard sets. Although some

students experienced decreases in ﬂuency from timing 1 to timing 2 during training

days (i.e., when the timing was conducted on identical ﬂashcard sets), in general, it

appears that students were most likely to demonstrate a decrease in ﬂuency when the

second timing was conducted with the generalization ﬂashcard set. During

generalization testing across all students, 50 % (9 of 18) of the timings resulted

in a decrease in ﬂuent performance compared to 20 % (6 of 30) of timings

conducted on training days. That is, students emitted fewer correct responses per

minute when tested on the generalization ﬂashcards (i.e., differently formatted) than

on the training ﬂashcards. In general, this decrease in ﬂuent performance during

generalization testing was most pronounced on Flashcard Set 1 (see P1, P4, P5, and

P6, for example) and lessened with each training set.

In addition, we found that the average length of time it took students to complete

the training and generalization ﬂashcards differed. When tested on the training

ﬂashcards for Flashcard Set, it took students an average of 45 s to compete the set

compared to an average of 55 s to complete the generalization ﬂashcards. This

J Behav Educ

123

difference in length gradually diminished across ﬂashcard sets. Average lengths for

Flashcard Set 2 training and generalization timings were 50 and 52 s. Average

lengths for Flashcard Set 3 training and generalization timings were 68 and 67 s.

Although not experimentally validated, it is possible that this change in both ﬂuency

and overall timing length resulted from a learning history—after the ﬁrst

generalization test, students may have learned that the third day of testing would

involve a differently formatted set of cards. This, in turn, may have led students to

study in a different manner and thereby decrease the likelihood that they would

attend to irrelevant stimuli.

We hypothesized that the higher a student’s ﬂuency on a training set of cards, the

lower their ﬂuency would be on a generalization set of cards. Figure 2examines the

relation between ﬂuency and generalization (deﬁned as the change in performance

from the training to generalization set). When these two measures are compared, a

pattern emerges, indicating that students whose performance was highly ﬂuent

(indicated by data points furthest to the right of the graph) showed a greater

decrease in ﬂuency on the generalization test compared to students whose

performance was less ﬂuent (indicated by data points furthest to the left of the

graph). Students who did not achieve high levels of ﬂuency exhibited either a

minimal decrease in ﬂuency on the generalization test or a small increase in ﬂuency.

This relation appeared to generally hold true for all participants.

In an effort to examine any grouped data relationships, the Pearson product-

moment correlation coefﬁcient was computed to assess whether any relationship

existed between the ﬂuency measures taken during the ﬁrst timings on generaliza-

tion testing days and the change in those measures on the generalization timings.

-8

-6

-4

-2

10 12 14 16 18 20 22 24 26

Fluency (Correct per Minute)

Individual Student Change in Fluency by Final Fluency Measure

Fig. 2 Scatterplot depicting the relation between student ﬂuency on the ﬁnal training timing and the

change in ﬂuency from the ﬁnal timing to the generalization test

J Behav Educ

123

Results showed a negative correlation between the two variables (r(16) =-.717,

p=.001). This supports the visual analysis and further indicates that the more

ﬂuent students became with the training set, the greater the decrease in ﬂuency with

the generalization set. The Pearson product-moment correlation coefﬁcient is not

able to indicate any cause and effect relationship and only indicates the signiﬁcance

of any relative relationship between two variables. For that reason, an additional

nonparametric analysis was required.

The nonparametric Wilcoxon matched-pairs signed-ranks statistical analysis was

used to compare the average change in ﬂuency from timing 1 to timing 2 (across all

training days) against the average change in ﬂuency from the ﬁrst timing to the

generalization timing across all generalization testing days. Results showed that

there was a signiﬁcant effect of format change on ﬂuency (F

1,49

=10.579,

p\.005). This provides support to the visual analysis and indicates that overall,

participants experienced a statistically signiﬁcant reduction in ﬂuency on their

second timing if the format of the card changed relative to the ﬁrst timing. This was

signiﬁcant at the p\.005 level.

The purpose of study 1 was to examine the extent to which student responding

would come under the control of stimuli other than the content of the ﬂashcards

when using SAFMEDS. Students were trained with one set of cards, and then speed

and accuracy of responding was assessed on another set of cards that differed only

in the way they were formatted. We found a negative correlation between initial

ﬂuency with the training sets and the size of the decrease in ﬂuency on the

generalization tests—in general, students who responded at a relatively high rate on

the training set responded slower on the generalization set than they had on the

training set. Conversely, students who responded slowly on the training set

responded as fast or faster on the generalization set. This ﬁnding is consistent with

the hypothesis that students may have responded to irrelevant stimuli in the training

set. In order to continually respond faster (increase ﬂuency), students may have

learned to respond to speciﬁc words in the deﬁnition based on their physical position

on the cards or the overall shape of the deﬁnition, rather than responding to the

content per se. When exposed to the generalization sets, the placement of words was

shifted and the overall shape of the deﬁnition was altered. This may have resulted in

slower responding as students could no longer rely on the position of words or the

shape of the writing to determine the correct response. For students who were

slower to respond during training, however, responding may not have come under

the control of these stimuli, making it possible for them to perform at the same rate

on the generalization set. If this interpretation is correct, a potential side effect of

highly ﬂuent responding with SAFMEDS may be stimulus control by irrelevant

features. Although we found an association between initial ﬂuency and loss of

ﬂuency during generalization testing, we did not ﬁnd a decrease in accuracy of

responses. That is, although students were slower during the generalization test, they

were equally accurate on both training and generalization ﬂashcard sets.

In study 1, we found that the higher the accurate responding to one set of cards,

the slower the accurate responding when the form, but not the content, of the card

was changed. If generalization of ﬂuent responding is poor when the format of a

deﬁnition is altered, the prospects for generalization of ﬂuent responding to a novel,

J Behav Educ

123

but equally correct, deﬁnition would be poor. Anecdotally, many of the students

indicated that in order to increase their ﬂuency, they attended to speciﬁc words in

speciﬁc locations on the ﬂashcard. When we changed the format of the card, this

moved the word and thus decreased ﬂuency. Attending to a precise word within a

deﬁnition is counterproductive from an educational standpoint. If a SAFMEDS

activity teaches students to respond ﬂuently to only one set of cards with only one or

two speciﬁc words controlling the response, the utility of this activity seems limited

as students will respond slowly to non-training ﬂashcard stimuli. The purpose of

study 2 was twofold. First, we assessed the extent to which ﬂuency with one set of

deﬁnitions generalized to another set of different, but equally correct, deﬁnitions.

Second, we assessed whether multiple-exemplar training embedded within a

SAFMEDS activity could enhance generalization without degrading ﬂuency with

the SAFMEDS activity.

Study 2

Method

Participants and Setting

Thirteen graduate and undergraduate students (3 males and 10 females) enrolled in a

10-week introductory Applied Behavior Analysis course at a large midwestern

university participated in this study. The class met twice per week. As with study 1, the

participants came from a variety of academic programs including special education,

school psychology, and sports ﬁtness. All 13 students gave consent for their data to be

used by the authors, and the research was given approval by the university’s IRB.

Experimental sessions were conducted in two university classrooms. Both rooms

were furnished with enough desks and chairs to accommodate approximately 30

students. One room was used to conduct training sessions and was the same room

where the class was normally taught. The other room was located in the same

building and was used exclusively for the purposes of running testing sessions. At

least two researchers were present during each day of study 2.

Materials

Prior to the start of the course, the researchers selected 30 key terms from the course

textbook (i.e., Alberto and Troutman 2009). These terms were identiﬁed by the

researchers as important for students to learn in an introductory Applied Behavior

Analysis course. For each of these key terms, three functionally equivalent

deﬁnitions were obtained. These deﬁnitions were taken from three different

textbooks on Applied Behavior Analysis (i.e., Alberto and Troutman 2009; Cooper

et al. 2007; Martin and Pear 2007). Two sources were used to create training

ﬂashcards, and the third source was used to create generalization ﬂashcards. For the

most part, deﬁnitions were taken verbatim from these sources, but were in some

cases altered to ﬁt on a ﬂashcard or to eliminate irrelevant information included in

J Behav Educ

123

the original source (e.g., if the deﬁnition was part of a larger sentence, or included

extraneous examples). Each term and deﬁnition was printed on a ﬂashcard so that

three equivalent ﬂashcards representing one term were printed for each of the 30

terms. These cards were then divided into three distinct sets of 10 terms that were

introduced sequentially in successive training periods over the course of the study.

A training set containing 20 ﬂashcards was assembled for each of the three sets of 10

terms. For ﬁve of the terms contained in each training set, a single deﬁnition (from the

same source) was duplicated twice; the other ﬁve terms had two different deﬁnitions

(drawn from different sources). Thus, within a given training set, each of the 10 terms

was depicted twice, but half of the terms had a single deﬁnition and the other half had

two differently worded deﬁnitions. Each generalization set was comprised of 10 cards

containing the same 10 terms as the corresponding training set, but with deﬁnitions

taken from a third source (see Fig. 3for a visual depiction of the training and

generalization sets). The deﬁnitions appearing in the generalization sets were not taken

from the class textbook. As this was an introductory Applied Behavior Analysis class,

it was assumed that students had not been exposed to the textbooks from which the

deﬁnitions in the generalization sets were drawn.

Fig. 3 Visual diagram depicting format of training and generalization ﬂashcard sets

J Behav Educ

123

For all of the training and generalization cards, a small number was placed in the

lower right corner of the card. This number was recorded by the researchers during

testing days and enabled researchers to determine whether a speciﬁc term contained

a single deﬁnition or multiple deﬁnitions.

Additional materials included timers to be used on testing days as well as an

Internet timer that visually displayed a countdown display and sounded a loud tone

when the timer reached zero.

Procedures

The training and testing procedures used in study 2 were similar to those employed in

study 1. Students receiveda new set of training ﬂashcards on the ﬁrst, fourth, and seventh

day of class and generalization testing occurred after training on the third, sixth, and

ninth day of class. Training, therefore, was conducted over three class sessions, and

testing occurred on the third class session after training. As in study 1, students were

instructed as to the proper use of the ﬂashcards both at the start and during the study.

Training Days Training days in study 2 differed from study 1 in several ways.

First, although students engaged in individual review, solo practice, and paired

student practice as during study 1, timings with the researcher occurred only on the

testing days (discussed below). During training days, data on each student’s

performance on the training set were collected by a partner during the paired student

practice using a researcher-created data sheet that was collected by the researcher at

the end of the SAFMEDS practice. These data were used to determine whether each

student’s ﬂuency was increasing across the training days. Lastly, solo and paired

student practice in this study consisted of shorter timings, lasting 15 s instead of

30 s. The timing length was decreased to prevent students from contacting the

ﬂashcards more than once within a single timing.

Testing Days On testing days, students engaged in SAFMEDS practice in a

fashion similar to practice on training days. In addition, students were brought to a

separate room and engaged in two timed trials with a researcher. The ﬁrst timing

was conducted with the training ﬂashcard set. The number of correct and incorrect

responses and the total duration of the timing were recorded. For the second timing,

the student was handed a generalization ﬂashcard set. The students were told that

the terms that appeared in the new set were identical to the terms in the training set

but that the deﬁnitions might be worded differently. They were again told to provide

responses as quickly as possible. Both timings lasted, however, long it took the

student to complete the entire set of cards.

Response deﬁnitions and measurement procedures were identical to those of

study 1.

Treatment Integrity and Interobserver Agreement (IOA)

A second observer assessed treatment integrity on every day of study 2. Mean

treatment integrity for all sessions was 100 %. In addition, a second observer sat in

J Behav Educ

123

on 40 % of testing sessions and independently recorded the duration of each session

as well as the number of correct and incorrect student responses. IOA was

calculated in an identical manner to study 1. Mean IOA for all sessions was 95.8 %

for duration and 100 % for correct and incorrect responses.

Experimental Arrangement

A non-traditional single-subject arrangement (similar to the arrangement in study 1)

was employed to investigate the extent to which multiple-exemplar training

embedded in a SAFMEDS activity would promote generalization to a novel set of

ﬂashcards containing the same terms. As with study 1, our experimental

arrangement relied upon baseline logic to enhance internal validity. Students

practiced with their training ﬂashcards across three training days, followed by a

generalization testing day. On this testing day, each student’s ﬂuency and accuracy

were measured on training and generalization ﬂashcards. Further, we conducted

three testing days which resulted in two replications of the phenomenon. As with

study 1, the arrangement in study 2 is quite similar to a repeated acquisition design

(Boren and Devine 1968; Kennedy 2005).

Statistical Analysis

Due to the nature of study 2, accuracy rates were analyzed statistically in order to

evaluate any signiﬁcant difference in overall accuracy across study participants

between single-exemplar and multiple-exemplar conditions. Accuracy measures the

overall percent correct, and thus a simple parametric ttest analysis was conducted to

evaluate result.

Results and Discussion

Figure 4shows the average change in both ﬂuency and accuracy from the training to

generalization ﬂashcards for each testing day. During each testing day, cards on

which a student correctly responded were recorded. This allowed for a determi-

nation of whether a speciﬁc term in the generalization set had been trained with

single or multiple deﬁnitions. In addition, separate measures of accuracy and

ﬂuency could be calculated for single and multiple deﬁnition terms in the training

sets.

During the training set timings, students were generally quite accurate and ﬂuent

on both the multiple and single deﬁnitions cards. On average across all three

training sets, students provided an accurate response to 94.3 % of the single

deﬁnition terms and to 92.7 % of the multiple deﬁnition terms. Fluency was

similarly high and averaged 22.8 accurate responses per minute for the single

deﬁnition terms and 23.2 accurate responses per minute for the multiple deﬁnition

terms. During the generalization test, accuracy and ﬂuency decreased for both single

and multiple deﬁnition terms. However, compared to the single deﬁnition terms,

students were more accurate on the multiple deﬁnition terms. On average, student

accuracy on the generalization set was 55.8 % for the single deﬁnition terms and

J Behav Educ

123

77.1 % for the multiple deﬁnition terms. A statistical analysis (ttest) showed that

there was a signiﬁcant effect for generalization between multiple- versus single-

exemplar ﬂashcard sets [t(36) =4.563, p\.001]. Results indicated that, overall,

training with multiple-exemplar ﬂashcards resulted in signiﬁcantly higher general

accuracy during the generalization tests at the p\.001 level. With the exception of

set 2, student ﬂuency (correct responses per minute) decreased by approximately an

equal amount for multiple and single deﬁnition terms during the generalization tests.

As with study 1, the average length of time it took for students to complete both the

training and generalization timings differed. Average training and generalization

timings were 61 and 75 s for Flashcard Set 1, 48 and 72 s for Flashcard Set 2, and

51 and 58 s for Flashcard Set 3. Although across all sets it took students longer on

average to complete the generalization set, this slowing of performance was

primarily related to performance with the cards trained with a single deﬁnition as is

clear from Fig. 4.

Figure 5shows the change in accurate responding from training to generalization

for each student across all three training sets. If a student was absent on a testing

day, his or her data were removed for that day but retained for the other testing days

during which they were present. This resulted in the removal of participant 13’s data

on ﬂashcard set 1 and participant 10’s data on ﬂashcard set 3. It is important to note

that many students responded accurately to 100 % of the deﬁnitions on either the

single or multiple deﬁnition terms during each training. For those students, accuracy

on the generalization test could only either remain the same or decrease; it was

impossible for their performance to improve during the generalization test. When

Fig. 4 Average change in ﬂuency and accurate responding

J Behav Educ

123

considering only students who responded accurately to less than 100 % of the

deﬁnitions on the training sets, however, an interesting ﬁnding emerges. Of the 37

individual tests across the three training sets, there were 16 instances in which a

Fig. 5 Individual data showing change in accuracy for each student across all three ﬂashcard sets

J Behav Educ

123

student responded accurately to less than 100 % of the deﬁnitions on the multiple

deﬁnition terms, and 11 instances on the single deﬁnition terms. Of the 16 instances

with the multiple deﬁnition terms, there were 6 instances in which the students

showed an increase in accuracy on the generalization set compared to the training

set. For all of the 11 instances with the single deﬁnition terms, however, accuracy

either decreased or remained the same on the generalization test. Thus, it appears

that in some cases students actually performed better with the generalization set

than the training set following practice with multiple deﬁnition terms.

The purpose of study 2 was to (a) assess how ﬂuency with one set of terms and

deﬁnitions generalized to another set containing the same terms but with differently

worded deﬁnitions and (b) examine whether generalization was promoted by

incorporating multiple-exemplar training into the SAFMEDS activity. We found

that incorporating multiple-exemplar training into the SAFMEDS activity did not

systematically alter performance during training—students were not systematically

faster on the single deﬁnition terms than on the multiple deﬁnition terms. This

indicates that multiple-exemplar training may be embedded within a SAFMEDS

activity without negatively affecting student performance during training. A second

ﬁnding was that when presented with novel deﬁnitions during the generalization

tests, student speed and accuracy decreased compared to training levels. A decrease

in speed and accuracy was seen regardless of whether the training set contained

single or multiple exemplars. This ﬁnding is consistent with the results of study 1, in

that achieving ﬂuent responding with one set of training ﬂashcards did not

necessarily lead to equally ﬂuent performance with the generalization ﬂashcards that

contained equivalent content. Importantly, however, we found that on average,

students were more accurate on terms that were trained using multiple exemplars

than on those taught with only a single deﬁnition. Furthermore, although speed of

responding decreased on the generalization test regardless of whether the terms

were taught in single- or multiple-exemplar format, ﬂuency for the multiple-

exemplar terms was consistently equal to or greater than ﬂuency for items taught

with a single deﬁnition. Finally, when students responded accurately to less than

100 % of terms in the training set, students improved their accuracy on the

generalization set only for terms trained with multiple deﬁnitions (although this was

only a small subset of individuals). Taken together, these ﬁndings indicate that

embedding multiple-exemplar training into a SAFMEDS activity does not degrade

training performance and may result in faster, more accurate responding to novel

stimuli than single-exemplar training. This is an important ﬁnding as SAFMEDS

typically provides only one term-deﬁnition pair for each term.

General Discussion

In studies 1 and 2 we investigated (a) whether student responding during a

SAFMEDS activity comes under the control (at least in part) of irrelevant aspects of

a ﬂashcard, (b) the extent to which ﬂuency and accuracy generalize from training

ﬂashcards to novel, but equally correct, ﬂashcards, (c) the relation between the level

of ﬂuency achieved on one set of ﬂashcards and the degree to which this ﬂuency

generalizes to a novel set of ﬂashcards, and (d) whether the incorporation of

J Behav Educ

123

multiple-exemplar training into the SAFMEDS activity would promote generaliza-

tion. To investigate these questions, we provided students with ﬂashcard sets and

administered SAFMEDS practice sessions over three consecutive days before

testing for generalization to a novel set of ﬂashcards. In study 1, the novel set of

ﬂashcards contained identically worded but differently formatted deﬁnitions. We

found student ﬂuency decreased more on the generalization ﬂashcards relative to the

training ﬂashcards (see Fig. 1), suggesting some aspect of the deﬁnition format

controlled responding. In study 2, the training ﬂashcards contained some terms with

single deﬁnitions (representing typical SAFMEDS procedure) and some terms with

multiple deﬁnitions (constituting multiple-exemplar training). We found that, when

tested on a set of ﬂashcards containing novel deﬁnitions, students performed more

accurately on the terms learned under multiple-exemplar training (see Figs. 4,5).

Taken together, studies 1 and 2 identify a potentially undesirable aspect of

SAFMEDS activities (namely control of responding by irrelevant stimuli and less

than optimal generalization) and demonstrate the utility of multiple-exemplar

training in minimizing these problems.

Although a term-deﬁnition SAFMEDS activity may increase ﬂuency with the

training stimuli, the restricted stimulus control developed through the activity may

actually decrease the likelihood of stimulus generalization. This problem may

explain why ‘‘training loosely’’ has been advocated as a strategy for promoting

generalization (Stokes and Baer 1977). In the absence of such ‘‘loose training,’’

stimulus control may develop around stimuli that are irrelevant from an educational

standpoint. These stimuli may include unimportant aspects of a deﬁnition such as

word placement, the format of the deﬁnition, or the presence of a speciﬁc word. In

short, the form of a particular ﬂashcard may exert control rather than the content that

the teacher wants the student to associate with the correct answer. In study 1, upon

ﬁrst encountering the generalization ﬂashcard set, many students joked to the

researcher that we were trying to trick them because they had merely memorized a

speciﬁc word which prompted their response. Although this may be an effective

strategy for achieving ﬂuency with one speciﬁc set of ﬂashcards, it may inhibit

stimulus generalization in a way that is ultimately counterproductive to the broader

educational goals held by teachers.

When students engage in a SAFMEDS activity, it is assumed that feedback

regarding improved performance will serve as a reinforcer to increase the rate of

accurate responding. Engaging in the activity in pairs may add a social contingency

that further encourages high rates of responding. These contingencies are designed

so that students will respond to the ﬂashcards as quickly as possible. In order to

achieve this, students are likely to develop strategies to increase their response rate.

For example, students typically hold a set of ﬂashcards in a manner that affords easy

access to the next card, and may begin turning over the next card while saying the

answer for the previous card. A less obvious way students may increase their speed

is to cease reading the entire deﬁnition and respond to just one or two words on the

card. As there are several reinforcers maintaining this behavior, it is unlikely that

students will alter this strategy if they are merely instructed to read the entire

deﬁnition. It is possible that the features of the ﬂashcards that control responding

may shift over time, so that students who were reading the entire deﬁnition initially

J Behav Educ

123

come to attend to other aspects of the ﬂashcard over time. Future research might

investigate ways to alter the rules used during SAFMEDS practice, in order to

discourage such strategies, and increase the likelihood of reading the entire card. For

example, differential observing response procedures could be used to expand the

array of relevant stimulus features controlling a response. In match-to-sample

activities, differential observing responses such as naming the sample stimulus

aloud (Gutowski et al. 1995) or matching all relevant stimuli (e.g., Walpole et al.

2007) have been effective in reducing restricted stimulus control. Such differential

observing response procedures could be incorporated into a SAFMEDS activity to

prevent the development of restricted stimulus control and promote improved

generalization of the skills being practiced. For example, prior to a timed

SAFMEDS practice, students could be required to read the entire deﬁnition aloud

and then provide the associated term. Such an activity may bring the response under

the control of all relevant features of the deﬁnition.

One method of improving stimulus generalization without altering the rules used

during a SAFMEDS activity is to embed multiple-exemplar training into the

SAFMEDS activity. The results of study 2 suggest that incorporating multiple

deﬁnitions into a SAFMEDS activity may promote generalization by exposing the

students to a larger variety of relevant stimuli. Furthermore, including multiple

deﬁnitions does not appear to slow performance during initial training.

Although multiple-exemplar training was associated with increased stimulus

generalization, the speciﬁc features of the stimulus that evoked the correct response

remain unclear. For example, when given two different deﬁnitions of the same

concept (e.g., ‘‘positive reinforcement’’), responding may be controlled by shared

important features of the class (e.g., the presence of the phrases ‘‘presentation of a

stimulus’’ and ‘‘increase in the future’’), shared unimportant features of the class

(e.g., word placement), or unique features that are not shared. To this end, it may be

worthwhile to systematically analyze the controlling stimuli. Halle and Holt (1991)

described an operant methodology whereby the relevant features of the stimulus

class were systematically presented in isolation or in stimulus pairs. With respect to

SAFMEDS, the learner could be exposed to speciﬁc key words, in isolation or

combination, until the controlling stimuli were identiﬁed. An analysis of this sort

could inform attempts to program for generalization. However, identifying all of the

controlling stimuli could prove to be a time-consuming process as there would be

numerous word combinations for each SAFMEDS ﬂashcard. Future research could

evaluate the cost-beneﬁt ratio of such an approach in terms of gathering information

to promote generalization.

When using multiple-exemplar training, an important variable is the extent to

which the training stimuli differ from one another. For example, Birnie-Selwyn and

Guerin (1997) found that training stimuli that differed in one critical aspect

produced better educational outcomes than training stimuli that had multiple

differences. In the current study, the degree of difference between the stimuli used

during multiple-exemplar training was not formally evaluated. It is possible that a

more systematic approach to multiple-exemplar training would have produced even

better outcomes. Controlling for and manipulating the degree of difference in the

stimuli used during multiple-exemplar training may prove useful in producing

J Behav Educ

123

generalized outcomes. It may even be possible to select exemplars for the ﬂashcards

that sample the entire range of stimuli that should control the response, as has been

done in studies using general case analysis (e.g., Sprague and Horner 1984).

There are several limitations related to studies 1 and 2. First, in study 1, the relation

between ﬂuency and change in ﬂuencydepicted in Fig. 2may be skewed simultaneously

by ‘‘ceiling’’ and ‘‘ﬂoor’’ effects. For example, we found that the fastest responders

demonstrated the largest decrease in ﬂuency on the generalization test. The fastest

responders, however, werealso the responders who had the most ﬂuency to lose and the

least they could potentially gain. Put another way, the fast responders were able to

respond much slower on the generalization set than they had on the training set, whereas

the slow students may not have been able to respond any slower than they already were.

However, this explanation does not account for the fact that the slowest students actually

increased their rate of responding on the generalization tests.

A second limitation of both studies is that students wereallowed to take the ﬂashcards

home with them after each class. The extent to which students practiced with these

ﬂashcards between class sessions is unknown. This may have led to signiﬁcant

differences in each student’s exposure to practicewith the ﬂashcards, which may explain

why some students performed better than others during ﬁnal ﬂuency measures with the

training sets. However, as each student’s ﬂuency using the training set was compared to

his or her own ﬂuency on the novel generalization set, the amount of practice each

student received may not have been a critical variable. We can say with certainty that

each participant received at least as much practice with the training cards as described in

our procedures, and no exposure to the generalization cards until testing.

A third limitation is that the current study examined stimulus generalization

within a limited context (i.e., novel ﬂashcards). It is unclear whether other

functionally equivalent stimuli presented in different situations would evoke the

correct response. Responding to a novel deﬁnition within the same SAFMEDS

activity could be conceptualized as an initial form of stimulus generalization. This

outcome may be important in some educational situations. For example, in most

introductory behavior analysis courses, students encounter multiple deﬁnitions of

the same concept—a student who is able to identify a concept when given a

deﬁnition that differs from one previously taught may be more likely to use the

deﬁnition correctly in other situations. Future research in this area should examine

stimulus generation to a broader set of relevant stimuli within applied situations,

such as assessment materials or concepts presented verbally.

Another possible limitation is that during the ﬁnal tests with the training and

generalization cards, we used a count-up timing rather than a count-down timing

procedure. This was to ensure students were able to provide a response (either

accurate or inaccurate) to each ﬂashcard. There was no upper limit to responding

during these timing, and the average length of time it took to complete each set

differed on average. In general, it took students longer to respond to the

generalization set ﬂashcards (with the exception of Flashcard Set 3 in study 1).

However, all timings during both training and generalization lasted longer than the

count-down timings conducted on training days. It is possible that decreases in

student responding were a result of endurance issues. Future research should attempt

to keep the timing procedures consistent across training and testing days.

J Behav Educ

123

Lastly, the researchers acknowledge that attempts to combine results and assert broad

generalization of ﬁndings through statistical analysis may arguably be less justiﬁable,

given the low number of study participants, than single-subject comparisons made

through comparing each participant’s own performance from one condition to another.

Such analysis seemed reasonable, however, in order to give broader support for such

single-subject interpretations of results found across individuals. These broader

interpretations were sought not in an effort to generalize ﬁndings to any speciﬁc greater

population, but were instead intended to serve as an additional analysis of ﬁndings to

strengthen those found through traditional single-subject analysis.

Although multiple-exemplar training embedded within a SAFMEDS activity

appears to improve generalization for these students, in the ‘‘real world,’’ it is

perhaps more important that a student be able to recognize a behavioral principle

based on an example rather than a deﬁnition. Future research should assess

generalization of responding to examples or scenarios rather than merely deﬁnitions.

It may be possible, for example, to embed written examples as well as deﬁnitions

into a SAFMEDS activity and thereby promote stimulus generalization across a

variety of meaningful contexts.

Binder (1996) suggested that ﬂuency-based strategies promote retention,

endurance, and application. The current study examined application, or generaliza-

tion, of responding to novel ﬂashcards in a SAFMEDS activity. Future research

could assess retention and endurance using similar research methodologies. For

example, the design used in study 2 would allow for repeated measures of the target

behavior throughout a course or academic period. Although SAFMEDS appeared to

result in stimulus control by irrelevant features, this degree of stimulus control may

facilitate retention and endurance.

Finally, the results of these two studies, although preliminary, have an important

implication for ﬂuency-based instructional strategies. As argued by Stokes and Baer

(1977), generalization is not a guaranteed outcome of behavioral programming. The

same could be said of ﬂuency-based instructional strategies in that fast and accurate

responding may facilitate generalization (Binder 1996), but it may only be minimal. As

the results of study 1 suggested, ﬂuency-based instructional strategies such as

SAFMEDS may promote restricted stimulus control, thereby inhibiting generalization.

Although multiple-exemplar training is not typically incorporated into SAFMEDS,

practitioners who are using SAFMEDS to develop ﬂuent student responding may want

to incorporate multiple-exemplar training into the activity to promote generalization.

Acknowledgments This research was supported in part by a grant from the U.S.D.E., OSEP,

(H325DO60032; N. A. Neef, Principal Investigator). However, the contents herein do not necessarily

represent the policy of the U.S.D.E., OSEP, and endorsement by the federal government should not be

assumed.

References

Alberto, P. A., & Troutman, A. C. (2009). Applied behavior analysis for teachers (8th ed.). Upper Saddle

River, NJ: Pearson Education, Inc.

Binder, C. (1996). Behavioral ﬂuency: Evolution of a new paradigm. The Behavior Analyst, 19, 163–197.

J Behav Educ

123

Birnie-Selwyn, B., & Guerin, B. (1997). Teaching children to spell: Decreasing consonant cluster errors

by eliminating faulty stimulus control. Journal of Applied Behavior Analysis, 30, 69–91.

Boren, J. J., & Devine, D. D. (1968). The repeated acquisition of behavioral chains. Journal of the

Experimental Analysis of Behavior, 11, 651–660.

Bower, B., & Orgel, R. (1981). To err is divine. Journal of Precision Teaching, 2, 3–12.

Brady, K. K., & Kubina, R. M. (2010). Endurance of multiplication fact ﬂuency for students with

attention deﬁcit hyperactivity disorder. Behavior Modiﬁcation, 34, 79–93.

Carr, D. (2003). Effects of exemplar training in exclusion responding of auditory-visual discrimination

tasks with children with autism. Journal of Applied Behavior Analysis, 36, 507–524.

Chafouleas, S. M., Martens, B. K., Dobson, R. L., Weinstein, K. S., & Gardner, K. B. (2004). Fluent

reading as the improvement of stimulus control: Additive effects of performance-based interventions

to repeated reading on students’ reading and error rates. Journal of Behavioral Education, 13,

67–81.

Codding, R. S., Archer, J., & Connell, J. (2010). A systematic replication and extension of using

incremental rehearsal to improve multiplication skills: An investigation of generalization. Journal of

Behavioral Education, 19, 93–105.

Cooper, J. O., Heron, T. E., & Heward, W. L. (2007). Applied behavior analysis (2nd ed.). Upper Saddle

River, NJ: Pearson Education, Inc.

Dougherty, K. M., & Johnston, J. M. (1996). Overlearning, ﬂuency, and automaticity. The Behavior

Analyst, 19, 289–292.

Doughty, S. S., Chase, P. N., & O’Shields, E. M. (2004). Effects of rate building on ﬂuent performance: A

review and commentary. The Behavior Analyst, 27, 7–23.

Ducharme, J. M., & Feldman, M. A. (1992). Comparison of staff training strategies to promote

generalized teaching skills. Journal of Applied Behavior Analysis, 25, 165–179.

Eshleman, J. W. (1985). Improvement pictures with low celerations: An early foray into the use of

SAFMEDS. Journal of Precision Teaching, 6, 54–63.

Golding, J. M., Wasarhaley, N. E., & Fletcher, B. (2012). The use of ﬂashcards in an Introduction to

Psychology class. Teaching of Psychology, 39, 199–202.

Gutowski, S. J., Geren, M. A., Stromer, R., & Mackay, H. A. (1995). Restricted stimulus control in

delayed matching to complex samples: A preliminary analysis of the role of naming. Experimental

Analysis of Human Behavior Bulletin, 13, 18–24.

Halle, J. W., & Holt, B. (1991). Assessing stimulus control in natural settings: An analysis of stimuli that

acquire control during training. Journal of Applied Behavior Analysis, 24, 579–589.

Haughton, E. C. (1980). Practicing practices: Learning by activity. Journal of Precision Teaching, 1,

3–20.

Johnson, K. R., & Layng, T. V. J. (1996). On terms and procedures: Fluency. The Behavior Analyst, 19,

281–288.

Kennedy, C. H. (2005). Single-case designs for educational research. Boston, MA: Pearson Education.

Korinek, L., & Wolking, B. (1984). Study methods in graduate school. Journal of Precision Teaching, 5,

64–67.

Kornell, N., & Bjork, R. A. (2008). Optimising self-regulated study: The beneﬁts—and costs—of

dropping ﬂashcards. Memory, 16, 125–136.

Litrownik, A. J., McInnis, E. T., Wetzel-Pritchard, A. M., & Filipelli, D. L. (1978). Restricted stimulus

control and inferred attentional deﬁcits in autistic and retarded children. Journal of Abnormal

Psychology, 87, 554–562.

MacQuarrie, L. L., Tucker, J. A., Burns, M. K., & Hartman, B. (2002). Comparison of retention rates

using traditional, drill sandwich, and incremental rehearsal ﬂash card methods. School Psychology

Review, 31, 584–595.

Mangundayao, J., McLaughlin, T. F., Williams, R. L., & Toone, E. (2013). An evaluation of direct

instructions ﬂashcard system on the acquisition and generalization of numerals, shapes, and colors

for preschool-aged students with developmental delays. Journal of Developmental and Physical

Disabilities, 25, doi:10.1007/s10882-012-9326-9.

Martin, G., & Pear, J. (2007). Behavior modiﬁcation: What it is and how to do it (8th ed.). Upper Saddle

River, NJ: Pearson Education, Inc.

Merbitz, C., Vieitez, D., Merbitz, N. H., & Binder, C. (2004). Precision teaching: Applications in

education and beyond. In D. J. Moran & R. W. Malott (Eds.), Evidence-based educational methods.

San Diego, CA: Elsevier Academic Press.

J Behav Educ

123

Miller, A. D., Hall, S. W., & Heward, W. L. (1995). Effects of sequential 1-minute time trials with and

without inter-trial feedback and self-correction on general and special education students’ ﬂuency

with math facts. Journal of Behavioral Education, 5, 319–345.

Reeve, S. A., Reeve, K. F., Townsend, D. B., & Poulson, C. L. (2007). Establishing a generalized

repertoire of helping behavior in children with autism. Journal of Applied Behavior Analysis, 40,

123–136.

Schmidmaier, R., Ebersbach, R., Schiller, M., Hege, I., Holzer, M., & Fischer, M. (2011). Using

electronic ﬂashcards to promote learning in medical students: Retesting versus restudying. Medical

Education, 45, 1101–1110.

Shinamune, S., & Jitsumori, M. (1999). The effects of grammar instruction and ﬂuency training on the

learning of the and a by native speakers of Japanese. The Analysis of Verbal Behavior, 16, 3–16.

Sprague, J. R., & Horner, R. H. (1984). The effects of single instance, multiple instance, and general case

training on generalized vending machine use by moderately and severely handicapped students.

Journal of Applied Behavior Analysis, 17, 273–278.

Stokes, T. F., & Baer, D. M. (1977). An implicit technology of generalization. Journal of Applied

Behavior Analysis, 10, 349–367.

Van Houten, R., Morrison, E., Jarvis, R., & McDonald, M. (1974). The effects of explicit timing on math

performance. Journal of Applied Behavior Analysis, 9, 227–230.

Volpe, R. J., Mule, C. M., Briesch, A. M., Joseph, L. M., & Burns, M. K. (2011). A comparison of two

ﬂashcard drill methods targeting word recognition. Journal of Behavioral Education, 20, 117–137.

Walpole, C. W., Roscoe, E. M., & Dube, W. V. (2007). Use of a differential observing response to expand

restricted stimulus control. Journal of Applied Behavior Analysis, 40, 707–712.

Weinstein, G., & Cooke, N. L. (1992). The effects of two repeated reading interventions on generalization

of ﬂuency. Learning Disability Quarterly, 15, 21–28.

Weiss, M. J. (2001). Expanding ABA interventions in intensive programs for children with autism: The

inclusion of natural environment training and ﬂuency based instruction. The Behavior Analyst

Today, 2, 182–185.

J Behav Educ

123

Fluency and the Maintenance of Skills Related to Sex Laws for Individuals Adjudicated for Illegal Sexual Behavior

Article

Full-text available

Nov 2020

Previous research has evaluated the effects of various commonly used mastery criteria on skill maintenance by directly manipulating the accuracy requirement, as well as the sessions across which these accuracy levels must be demonstrated. The current study extends this literature by including a rate dimension within the mastery criterion with a unique population. We implemented a fluency-oriented treatment package to increase intraverbal skills related to state sex laws using a multiple-baseline design across 3 target sets for 2 individuals adjudicated for illegal sexual behavior. Within this intervention package, we included 2 distinct components of a single mastery criterion: (a) accuracy (i.e., 100% accuracy across 3 consecutive sessions) and (b) speed. We evaluated how each of these measurable dimensions of behavior maintained across time. Results indicate this mastery criterion produced over 80% accuracy during maintenance probes for 10 weeks across all sets for both students. However, this mastery criterion produced idiosyncratic maintenance of rates across students and sets. These results suggest that each of these dimensions of behavior does not necessarily covary and should be conceptualized as distinct clinical targets by applied behavior clinicians.

Assessing the Social Validity of the SAFMEDS Strategy From the Perspective of Teachers and Children

Article

Full-text available

Jul 2021

The Say-All-Fast-Minute-Every-Day-Shuffled (SAFMEDS) strategy promotes fluency across several skills and contexts. However, few studies have reported the social validity key stakeholders associate with using the strategy in schools. Assessing social validity may provide us with some insight into factors that may affect engagement, implementation fidelity, and persistent use of the intervention after the termination of a research study. Study 1 details the findings from a survey completed by teachers who have used the strategy in their schools (N = 55). Using thematic analysis, we identified three themes: 1) factors that promote and limit progress, 2) confidence, and 3) inherent advantages of the SAFMEDS strategy. These themes encapsulate teachers experiences of implementing the strategy under the real-word conditions of the classroom and the accompanying advantages and potential challenges they face. Within study 2, we discuss themes arising from interviews with children (N = 26) about their views and experiences of using the SAFMEDS strategy. These children had used the strategy with their teacher for one academic year to promote fast and accurate recall of arithmetic facts. Analysis of these transcripts revealed five further themes relating to children’s engagement with the strategy: 1) enjoyment, 2) data, 3) sense of achievement, 4) skills, and 5) home use. Collectively these themes have potential impact with regards to future training and support models for the SAFMEDS strategy.

Evaluation of Multiple SAFMEDS Procedures

Article

Apr 2021

Lindsley developed the “say all fast minute every day shuffled” (SAFMEDS) procedure in the late 1970s to enhance the typical use of flash cards (Graf & Auman, 2005). The acronym specifically guides the learner’s behavior when using flash cards. A review of SAFMEDS research indicates its successful use with children, college students, and older adults with and without disabilities. The literature also indicates that SAFMEDS procedures are not well documented and have multiple variations, limiting practitioners’ ability to know what procedures to use and when. The purpose of this study was to evaluate the effects of a basic SAFMEDS procedure and four supplementary SAFMEDS procedures on the rates of correct and incorrect responding to unfamiliar Russian words and Chinese characters in college students. The results of the study suggest that the basic SAFMEDS procedure produced some learning (i.e., increases in correct responding and decreases in incorrect responding), but all of the supplementary procedures led to greater increases in the number of correct responses per 1-min timing. Further research evaluating differences in performance across the supplementary procedures is warranted.

Exploring the Effects of Daily, Timed, and Typed Technical Term Definition Practice on Indicators of Fluency

Article

Nov 2020

Say All Fast Minute Every Day Shuffled (SAFMEDS) is one behaviorally based teaching tactic. Like flash cards, SAFMEDS helps build familiarity with course objectives and can be used to promote fluency in the corresponding verbal repertoire. However, SAFMEDS differs from flash cards in that it follows specific design features and the acronym specifies how to practice flash cards. Students might practice in the traditional see-say learning channel used with SAFMEDS, or they could practice in a see-type learning channel (i.e., Type All Fast Minute Every Day Shuffled [TAFMEDS]), as the precision teaching community has sought to bring digital technology to their teaching, using computerized standard celeration charts and programs that present flash cards in a digital format. The present study explored the use of computerized charting and a see-type learning channel program developed for TAFMEDS in several sections of an undergraduate Introduction to Behavior Principles course. Course instructors explored the correlations between daily TAFMEDS practice with behavior-analytic terminology and student performance. After 3 weeks of daily practice, the study concluded with a culmination of 4 checkouts that examined endurance, application, stability, retention (when possible), and performance in different learning channels. Results indicated a correlation between daily practice and higher daily performance frequencies and longer term outcomes, including maintenance, endurance, stability, application, and generativity. The findings are discussed in terms of bringing frequency-building activities to university settings and the advantages and disadvantages of bringing technological advancements into frequency-based instruction.

The comparative effects of cumulative and unitary SAFMEDS terms in an introductory undergraduate behavior analysis course

Article

Full-text available

Dec 2017

Research in precision teaching (PT) and verbal behavior has implications for undergraduate instruction, but effects of PT for under-graduate coursework are not yet fully understood. The authors examined the differences between learning outcomes for Say All Fast Minute Every Day Shuffled (SAFMEDS) cards between cumulative decks where all terms were practiced (fall semester) and unitary decks where terms were split into 10 units with no more than 10 cards each (spring semester). Results showed that the rate of correct terms recited per minute increased over the course of both semesters, the rate of incorrect terms decreased, and the rate of skips remained variable. The changes in rate occurred more quickly throughout the spring semester and more students reached the frequency aims. Other factors examined included timing intervals, multiple checkouts, definition length, and retention. The results have implications for the use of PT, especially SAFMEDS, in college classrooms.

A technology-enhanced learning intervention for statistics in higher education using bite-sized video-based learning and precision teaching

Article

Full-text available

Jan 2023

Adjustments to life and learning following the COVID-19 pandemic have transformed user acceptance of online learning methods. It is, therefore, imperative to analyse factors relating to user performance and preferences for such interactions. In this study, we combined video-based learning with precision teaching to reinforce previously learnt statistics skills in university students without a mathematical background. We developed a learning design consisting of eight ‘bite-sized’ online learning episodes. Each episode started with a brief learning video followed by a practice phase and an end-of-episode assessment. The practice phase differed in two groups of participants, matched on statistics attainment pre- intervention. A precision-teaching intervention group (N = 19) completed practice guided by a frequency-based approach aiming at building fluency in statistics. A control group (N = 19) completed self-directed practice for the same amount of time as the intervention group. All participants completed a statistics attainment test and a questionnaire on their attitudes towards statistics pre- and post- intervention, and a review of the learning materials post-intervention. The intervention group achieved, consistently, higher scores in all end-of-episode assessments compared to the control group. Both groups showed significant and comparable improvements in statistics attainment post-intervention. Both groups also reported more positive feelings towards statistics post-intervention, while the review of the learning materials suggested that the video-based learning design was well-received by students. Our results suggest that video-based learning has great potential to support, as a supplementary teaching aid, university students in learning statistics. We discuss future research directions and implications of the study.

Recent applications of SAFMEDS in a higher education learning environment

Article

Full-text available

Jul 2017

A pragmatic randomised controlled trial of SAFMEDS to produce fluency in interpretation of electrocardiograms

Article

Full-text available

Mar 2020
BMC Med Educ

Abstract Background SAFMEDS (Say-All-Fast-Minute-Every-Day-Shuffled) is a flashcard-type behavioural instructional methodology, involving one-minute learning trials that function both as practice and assessment, used to facilitate the development of fluency in a behaviour. The primary research question was whether SAFMEDS engenders improvement in performance beyond that conferred by usual teaching. A secondary research question was whether SAFMEDS is an effective method of producing fluency in Electrocardiogram (ECG) interpretation. Methods A pilot study was conducted to determine sample size required to power the pragmatic randomised controlled trial (RCT). For the subsequent RCT, participants were randomly assigned to a “usual teaching” control group (n = 14) or the SAFMEDS intervention group (n = 13), with the recognition of 15 cardiac conditions on ECGs (e.g., atrial fibrillation, complete heart block) targeted. Intervention group participants’ performance was tracked over eight weeks as they worked towards achieving the fluency criterion. Percentage accuracy in ECG interpretation was assessed at baseline and post-test for both groups. An ANCOVA was conducted to assess for differences in the performance of the intervention and control group at post-test while controlling for the baseline performance of participants. At post-test, the numbers of participants achieving fluency within the intervention group was examined. Results A large effect size of SAFMEDS (partial η2 = .67) was identified when controlling for the effects of baseline performance. At post-test, the intervention group significantly outperformed (M = 61.5%; SD = 12.1%) the control group (M = 31.6%; SD = 12.5%, p

Exploring Procedural Manipulations to Enhance Student Performance on SAFMEDS in Undergraduate Introduction to Behavior Analysis Courses

Article

Full-text available

Mar 2021
J Behav Educ

Research into the fluency-building tactic named Say All Fast Minute Every Day Shuffled (SAFMEDS) has implications for instruction at all academic levels. Several researchers have studied the effects of SAFMEDS in undergraduate courses; however, not all of the salient variables, or the relationships among variables, have been fully analyzed. The numerous procedural variations may pose specific challenges for college instructors who wish to implement SAFMEDS in their course design or for students who are attempting to use SAFMEDS on their own. We collected behavioral data across four semesters of undergraduate introductory behavior analysis courses. The effects of two procedural manipulations were evaluated in relation to undergraduate students’ performance frequencies on weekly checkouts with two or four decks of see term/say definition SAFMEDS. The authors describe how SAFMEDS were implemented and modified over the course of four semesters of instruction. SAFMEDS’ procedures were modified each semester based on student outcomes in the prior semester in an effort to improve our student outcomes and to assist other course instructors who might wish to utilize SAFMEDS in their university courses. An overview of the decision-making process and student outcomes allows for the authors to make recommendations for course instructors who employ SAFMEDS in their courses and suggestions for future research to improve our understanding of how SAFMEDS affect student performance.

Examining the maintenance and generalization effects of repeated practice: A comparison of three interventions

Article

Jun 2018
J SCHOOL PSYCHOL

Repeated reading (RR) procedures are consistent with the procedures recommended by Haring and Eaton's (1978) Instructional Hierarchy (IH) for promoting students' fluent responding to newly learned stimuli. It is therefore not surprising that an extensive body of literature exists, which supports RR as an effective practice for promoting students' reading fluency of practiced passages. Less clear, however, is the extent to which RR helps students read the words practiced in an intervention passage when those same words are presented in a new passage. The current study employed randomized control design procedures to examine the maintenance and generalization effects of three interventions that were designed based upon Haring and Eaton's (1978) IH. Across four days, students either practiced reading (a) the same passage seven times (RR+RR), (b) one passage four times and three passages each once (RR+Guided Wide Reading [GWR]), or (c) seven passages each once (GWR+GWR). Students participated in the study across 2weeks, with intervention being provided on a different passage set each week. All passages practiced within a week, regardless of condition, contained four target low frequency and four high frequency words. Across the 130 students for whom data were analyzed, results indicated that increased opportunities to practice words led to greater maintenance effects when passages were read seven days later but revealed minimal differences across conditions in students' reading of target words presented within a generalization passage.

On terms: On terms and procedures - Fluency

Article

Full-text available

Sep 1996

Fluency is a metaphor for flowing, effortless, well-practiced, and accurate performance. Current practice in fluency building involves increasing the frequency of free-operant performances. Free- operant performance is defined as continuous responding in the presence of discriminative stimuli that are either varied or not varied from response to response. Free-operant performance is also distinguished from discrete-trial performance. Frequency-building procedures are also described, including defining the learning channel and stimulus control topography of a component performance (called a pinpoint), selecting an appropriate timing period, and displaying stimuli so that no performance ceilings occur. During frequency building, frequencies of pinpoints are continuously charted on standard celeration charts. Frequencies are increased to empirically derived performance standards, or aims, that predict retention, endurance, stability, application, and adduction of performance. Frequency is also described as a dimension of performance, not simply its measurement. Frequency building is described as possibly facilitating contingency adduction.

A Comparison of Retention Rates Using Traditional, Drill Sandwich, and Incremental Rehearsal Flash Card Methods

Article

Full-text available

Dec 2002
SCHOOL PSYCHOL REV

Research has demonstrated increased retention from drill, but the data regarding drill format are inconsistent. Two commonly used models, Drill Sandwich (DS) and Incremental Rehearsal (IR), were compared to each other and to a traditional flashcard method by individually teaching words from the Esperanto International Language to 25 3rd- and 26 7th-grade students who were screened for receptive vocabulary. Retention of the pronunciation and English translation was tested after 1, 2, 3, 7, and 30 days. A significant main effect was found between the three conditions at each interval. The IR model consistently led to significantly more words retained than the traditional or DS approaches. Correlation coefficients between retention and receptive vocabulary were above .30 at the 1- and 3-day intervals for the traditional model, but little covariance was noted for DS, and even less for IR. Potential implications and suggestions for future research are included.

An Evaluation of a Direct Instructions Flashcard System on the Acquisition and Generalization of Numerals, Shapes, and Colors for Preschool-Aged Students with Developmental Delays

Article

Full-text available

Aug 2013

The purpose of this study was to evaluate the effectiveness of the DI Flashcard system for teaching preliminary mathematic skills to three preschool students. The participants attended a self-contained special education preschool. All three participants’ eligibility category was “developmentally delayed”. A concurrent multiple baseline design across three sets (colors, shapes, and numerals) was use to evaluate the effectiveness of the DI flashcard system. The results indicated that all three participants showed an increase in their performance when DI flashcards were in effect. However, the amount of improvement varied for each participant. The importance of employing evidence-based procedures to teach skills to preschool students with developmental delays was outlined.

The Use of Flashcards in an Introduction to Psychology Class

Article

Full-text available

Jun 2012

Four hundred fifteen undergraduate students in an Introduction to Psychology course voluntarily reported their use of flashcards on three exams as well as answered other questions dealing with flashcard use (e.g., when did a student first use flashcards). Almost 70% of the class used flashcards to study for one or more exams. Students who used flashcards for all three exams had significantly higher exam scores overall than those students who did not use flashcards at all or only used flashcards on one or two exams. These results are discussed in terms of retrieval practice, a specific component of using flashcards.

Applied behavior analysis for teachers

Book

Jan 1999

The Effects of Two Repeated Reading Interventions on Generalization of Fluency

Article

Feb 1992

A multitreatment, single-subject research design was used to compare the effects on gains in fluency of two types of mastery criteria for repeated reading. The subjects were four males with learning disabilities, who were at a beginning reading level. One intervention required the students to reread a passage until they demonstrated three successive improvements; the other intervention required rereading until 90 correct words per minute was reached. Both types of criteria resulted in fluency gains for all students, with a mean gain of 58% under the improvements condition and 62% under the fixed-rate condition. The successive improvements criterion appeared to be the more efficient and showed consistent positive effects of fluency gains on generalization to unpracticed passages.

The repeated acquisition of behavioral chains1

Article

Nov 1968

Monkeys were trained with food reinforcement in a chamber containing four groups of three levers. For each session the monkey's task was to learn a new four-response chain by pressing the correct lever in each group. A stable pattern of learning resulted, and the number of errors reached a steady state from session to session. The technique was then used to determine how various durations of timeouts, following errors, affected the acquisition of new chains. With no timeout, the monkeys made a great many errors, due in large part to superstitious responses within the reinforced chain. Timeout durations ranging from 1 sec to 4 min reduced the number of errors substantially. A second experiment investigated the effects upon acquisition errors of presenting a single light (an “instruction” stimulus) over the correct lever. When this light did not influence the monkeys' responses to the three alternatives, the chains were learned as without it. When the light did control responding, the monkey pressed the appropriate sequence of levers but did not learn the sequence. Thus, when the light was removed, the monkey performed as if learning that sequence for the first time.

An implicit technology of generalization1

Article

Jul 1977
J APPL BEHAV ANAL

Traditionally, discrimination has been understood as an active process, and a technology of its procedures has been developed and practiced extensively. Generalization, by contrast, has been considered the natural result of failing to practice a discrimination technology adequately, and thus has remained a passive concept almost devoid of a technology. But, generalization is equally deserving of an active conceptualization and technology. This review summarizes the structure of the generalization literature and its implicit embryonic technology, categorizing studies designed to assess or program generalization according to nine general headings: Train and Hope; Sequential Modification; Introduce to Natural Maintaining Contingencies; Train Sufficient Exemplars; Train Loosely; Use Indiscriminable Contingencies; Program Common Stimuli; Mediate Generalization; and Train “To Generalize”.

Behavior modification: What it is and how to do it (5th ed.).

Article

Jan 1978

This [textbook] deals equally with both the principles and the tactics (i.e., the rules and guidelines for specific applications) of behavior modification. [It] is addressed to 2 audiences: (a) college and university students taking courses in behavior modification, applied behavior analysis, behavior therapy, the psychology of learning, and related areas; and (b) students and practitioners of various helping professions . . . who are concerned directly with enhancing various forms of behavioral development. (PsycINFO Database Record (c) 2012 APA, all rights reserved)

A Systematic Replication and Extension of Using Incremental Rehearsal to Improve Multiplication Skills: An Investigation of Generalization

Article

Mar 2010

The purpose of this study was to replicate and extend a previous study by Burns (Education and Treatment of Children 28: 237–249, 2005) examining the effectiveness of incremental rehearsal on computation performance. A multiple-probe design across multiplication problem sets was employed for one participant to examine digits correct per minute and percentage of digits correct on problems targeted during treatment and response maintenance sessions. Treatment occurred twice weekly across 12weeks. Retention of target facts was measured prior to each treatment session. Treatment impact on generalized skills including single skill mastery probes, multiplying fractions, and word problems was examined. Results suggested that the participant achieved accurate and fluent performance on the problem sets. This performance was maintained over time and generalized to SSM, fraction, and word problem probes.

An Examination of Stimulus Control in Fluency-Based Strategies: SAFMEDS and Generalization

Abstract and Figures

Recommended publications

Design and development of a vision-based micro-assembly system

Imitation and inhibition of facial expression

FPM: Fine Pose Parts-Based Model with 3D CAD Models

Training Educators to Implement Functional Analyses