ArticlePDF Available

Abstract and Figures

The Animal Verbal Fluency (AVF) and Design Fluency (DF) structured and unstructured test versions were administered to N = 294 healthy native Dutch-speaking children who were aged between 6.56 and 15.85 years. The AVF and DF structured test scores increased linearly as a function of age, whilst the relation between age and the DF unstructured test score was curvilinear (i.e., the improvement in test scores was much more pronounced for younger children than for older children). A higher mean level of parental education was associated with significantly higher AVF and DF structured test scores. Sex was not associated with any of the outcomes. Demographically corrected norms for the AVF and DF tests were established, and an automatic scoring program was provided.
Content may be subject to copyright.
This article was downloaded by: [University of Maastricht]
On: 24 November 2011, At: 00:24
Publisher: Psychology Press
Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer
House, 37-41 Mortimer Street, London W1T 3JH, UK
Journal of Clinical and Experimental
Neuropsychology
Publication details, including instructions for authors and subscription information:
http://www.tandfonline.com/loi/ncen20
Animal Verbal Fluency and Design Fluency in
school-aged children: Effects of age, sex, and
mean level of parental education, and regression-
based normative data
Wim Van der Elst a , Petra Hurks b , Renske Wassenberg c , Celeste Meijs d & Jelle
Jolles e
a Faculty of Medicine and Life Sciences, Maastricht University, Maastricht, The
Netherlands
b Faculty of Psychology and Neuroscience, Department of Neuropsychology &
Psychopharmacology, Maastricht University, Maastricht, The Netherlands
c Maastricht University Medical Centre, Maastricht University, Maastricht, The
Netherlands
d Ruud the Moor Centre, Open University, Heerlen, The Netherlands
e AZIRE Research Institute and Faculty of Psychology & Education, VU Universiteit
Amsterdam, Amsterdam, The Netherlands
Available online: 20 Jul 2011
To cite this article: Wim Van der Elst, Petra Hurks, Renske Wassenberg, Celeste Meijs & Jelle Jolles (2011):
Animal Verbal Fluency and Design Fluency in school-aged children: Effects of age, sex, and mean level of parental
education, and regression-based normative data, Journal of Clinical and Experimental Neuropsychology, 33:9,
1005-1015
To link to this article: http://dx.doi.org/10.1080/13803395.2011.589509
PLEASE SCROLL DOWN FOR ARTICLE
Full terms and conditions of use: http://www.tandfonline.com/page/terms-and-conditions
This article may be used for research, teaching, and private study purposes. Any substantial or
systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution
in any form to anyone is expressly forbidden.
The publisher does not give any warranty express or implied or make any representation that the
contents will be complete or accurate or up to date. The accuracy of any instructions, formulae, and
drug doses should be independently verified with primary sources. The publisher shall not be liable for
any loss, actions, claims, proceedings, demand, or costs or damages whatsoever or howsoever caused
arising directly or indirectly in connection with or arising out of the use of this material.
JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY
2011, 33 (9), 1005–1015
Animal Verbal Fluency and Design Fluency
in school-aged children: Effects of age, sex, and mean
level of parental education, and regression-based
normative data
Wim Van der Elst1,PetraHurks
2, Renske Wassenberg3, Celeste Meijs4, and Jelle Jolles5
1Faculty of Medicine and Life Sciences, Maastricht University, Maastricht, The Netherlands
2Faculty of Psychology and Neuroscience, Department of Neuropsychology & Psychopharmacology,
Maastricht University, Maastricht, The Netherlands
3Maastricht University Medical Centre, Maastricht University, Maastricht, The Netherlands
4Ruud the Moor Centre, Open University, Heerlen, The Netherlands
5AZIRE Research Institute and Faculty of Psychology & Education, VU Universiteit Amsterdam,
Amsterdam, The Netherlands
The Animal Verbal Fluency (AVF) and Design Fluency (DF) structured and unstructured test versions were
administered to N=294 healthy native Dutch-speaking children who were aged between 6.56 and 15.85 years.
The AVF and DF structured test scores increased linearly as a function of age, whilst the relation between age
and the DF unstructured test score was curvilinear (i.e., the improvement in test scores was much more pro-
nounced for younger children than for older children). A higher mean level of parental education was associated
with significantly higher AVF and DF structured test scores. Sex was not associated with any of the outcomes.
Demographically corrected norms for the AVF and DF tests were established, and an automatic scoring program
was provided.
Keywords: Fluency development; Executive functions; Demographic influences; Continuous norms.
Fluencytestsareoftenusedinclinicalandresearchset-
tings (Baron, 2004; Lezak, Howieson, & Loring, 2004;
Van der Elst, Van Boxtel, Van Breukelen, & Jolles,
2006a). Two types of fluency tests are distinguished—that
is, verbal and nonverbal fluency tests. In verbal fluency
tests, people are instructed to generate as many words as
possible that belong to a certain category (e.g., a seman-
tic category such as “animals,” or a phonemic category
such as “words that begin with the letter F”). In nonver-
bal (or design) fluency tests, people are asked to generate
as many novel (i.e., nonrepeating) abstract designs as
possible. Both types of fluency test are mainly used to
assess executive functioning and related abilities, such
as working memory, attention, and inhibition (Baldo,
Shimamura, Delis, Kramer, & Kaplan, 2001; Henry &
Crawford, 2004; Mitrushina, Boone, Razani, & D’Elia,
Address correspondence to Wim Van der Elst, Faculty of Medicine and Life Sciences, Department of Psychiatry
and Neuropsychology, Maastricht University, Dr. Tanslaan 12, 6200 MD, Maastricht, The Netherlands (E-mail:
wim.vanderelst@gmail.com).
2005; Ruff, Light, Parker, & Levin, 1997; Sergeant,
Geurts, & Oosterlaan, 2002; Van der Elst et al., 2006a).
In addition to these executive abilities, verbal fluency
tests tap word knowledge, access to semantic memory,
and vocabulary size (Ruff et al., 1997; Sergeant et al.,
2002), whilst design fluency tests also tap visuoconstruc-
tional abilities, motor planning, and visuomotor abilities
(Klenberg, Korkman, & Lahti-Nuuttila, 2001; Strauss,
Sherman, & Spreen, 2006). A diminished verbal fluency
and design fluency test performance has been associ-
ated with a variety of clinical conditions in both chil-
dren (e.g., fetal alcohol syndrome, closed head injury,
and autism; Korkman, Kirk, & Kemp, 1998; Levin,
Song, Ewing-Cobbs, Chapman, & Mendelsohn, 2001;
Schonfeld, Mattson, Lang, Delis, & Ripley, 2001) and
adults (e.g., Alzheimer’s disease, Parkinson’s disease, and
©2011 Psychology Press, an imprint of the Taylor & Francis Group, an Informa business
http://www.psypress.com/jcen http://dx.doi.org/10.1080/13803395.2011.589509
Downloaded by [University of Maastricht] at 00:24 24 November 2011
1006 VAN DER ELST ET AL.
traumatic brain injury; Diesfeldt, Van der Elst, & Jolles,
2009; Flowers, Robertson, & Sheridan, 1995; Raskin &
Rearick, 1996).
The aim of the present study was to establish the nor-
mal range of performance on the Animal Verbal Fluency
(AVF) and the Design Fluency (DF) tests, using a sample
of N=294 healthy native Dutch-speaking children who
were aged between 6.56 and 15.85 years. Normative data
areanessentialtoolinbothclinicalandresearchsettings,
because they provide an empirical frame of reference to
evaluate the test performance of an individual (Capitani,
1997; Mitrushina et al., 2005; Van der Elst, 2006). The
first step in a normative analysis is to identify the demo-
graphic variables that are associated with test perfor-
mance (e.g., age, sex, and parental education), so that
the normative data can be appropriately corrected for the
relevant demographic influences. Previous research has
suggested that better performance on the AVF and DF
tests in children is mainly associated with a higher age
(Anderson, 1998; Ardila, Rosselli, Matute, & Guajardo,
2005; Brocki & Bohlin, 2004; Cohen, Morgan, Vaughn,
Riccio, & Hall, 1999; Hurks et al., 2010; Jones-Gotman,
1990; Jurado & Rosselli, 2007; Klenberg et al., 2001;
Korkman, Kemp, & Kirk, 2001; Matute, Rosselli, Ardila,
& Morales, 2004; Prigatano, Gray, & Lomay, 2008;
but see also Anderson, Anderson, Northam, Jacobs, &
Catroppa, 2001) and a higher level of parental educa-
tion (Ardila et al., 2005; Hurks et al., 2010; Hurks et al.,
2006; Klenberg et al., 2001). The effect of sex on fluency
test performance remains unclear. Ardila and colleagues
(2005) found that sex differences occurred in fluency test
performance, but the direction of the effect depended on
the type of fluency test that was considered (i.e., boys out-
performed girls in verbal fluency tests, but the opposite
pattern was found for design fluency tests). Other stud-
ies did not observe significant sex differences in fluency
test performance (Brocki & Bohlin, 2004; Riva, Nichelli,
& Devoti, 2000) or only minimal effects (e.g., Prigatano
et al., 2008, reported that gender accounted for only 1.5%
of the variance in verbal fluency performance).
Two different methods are often used to estab-
lish normative data—that is, the “traditional” and the
“regression-based” methods. In the traditional approach,
normative statistics (usually the means and SDs) are
calculated for each relevant demographic subgroup sepa-
rately. Based on these normative statistics, an individual’s
observed test score is converted into an easily inter-
pretable metric (such as a zscore or a tscore). The tra-
ditional normative approach is straightforward and well
known by most psychologists, but there are also some
drawbacks associated with it (for details, see, e.g., Van
Breukelen & Vlaeyen, 2005; Van der Elst, Van Boxtel,
Van Breukelen, & Jolles, 2006b; Testa, Winicki, Pearlson,
Gordon, & Schretlen, 2009). For example, performance
on many (neuro)psychological tests is influenced by
different demographic variables, such as age, sex, and
educational level (Lezak et al., 2004; Mitrushina et al.,
2005). As a result, the total sample has to be subdivided
into many different subgroups. For example, splitting
a sample into two parental education groups and five
age groups results in a total of 10 (=5×2) sub-
groups. The splitting of the data in subgroups yields less
precisely estimated normative statistics (i.e., the discrep-
ancy between the sample statistics and the true popula-
tion parameters increases as the sample size decreases),
and consequently more biased norms are obtained. In
addition, there is a problem with the boundary values
when the traditional normative approach is used. For
example, suppose that the normative data were strati-
fied in age bands that span two years (e.g., 6–8 years,
>8–10 years, etc.). This would imply that children aged
7.99 and 8.01 years (differing only by a few days) would
be evaluated against different normative data, whereas
children aged 6.01 and 7.99 years (differing by almost
two years) would be evaluated against the same norma-
tive data, which is not acceptable (Capitani, 1997). The
boundary problem could be reduced by using narrower
age bands, but then the total sample has to be split into
more subgroups (which increases the bias in the norma-
tive statistics because the subgroup sample sizes decrease;
see above).
In the regression-based normative method, multi-
ple linear regression models are used to compute the
expected test scores of an individual (rather than the sub-
group means that are used in the traditional approach).
Since both continuous (e.g., age in years instead of age
groups) and categorical demographic variables (e.g., level
of parental education) can be handled in the regression-
based approach, there is no need to split the data into
subgroups nor is there a problem with boundary val-
ues. Regression-based normative conversions are highly
individualized (i.e., they take the unique demographic
characteristics of each tested individual into account;
Crawford & Howell, 1998; Heaton, Miller, Taylor, &
Grant, 2004; Van Breukelen & Vlaeyen, 2005; Van der
Elst et al., 2006a, 2006b; Van der Elst, Van Boxtel, Van
Breukelen, & Jolles, 2007), but this advantage also comes
with a disadvantage. Indeed, regression-based norms are
less user-friendly because the user of the normative data
has to actively compute the expected test scores as based
on the unique demographic characteristics of the tested
individual (i.e., it is not possible to provide an exhaus-
tive normative table for each possible combination of the
demographic variables when at least one demographic
variable is continuous). The latter problem can, however,
be easily avoided by establishing simplified normative
tables or by providing a computer program that automat-
ically executes the normative conversions (as was done in
the present study, see below).
Thus, in the present study we used a regression-
based approach to establish the normative data for
the AVF and DF tests (as based on a sample of
N=294 healthy native Dutch-speaking children). The
influence of age, gender, and parental level of education
on test performance was evaluated and corrected for if
necessary.
METHOD
Participants
The data were derived from the COOS (Cognitief
Ontwikkelings Onderzoek bij Schoolgaande kinderen,in
Downloaded by [University of Maastricht] at 00:24 24 November 2011
VERBAL AND DESIGN FLUENCY IN CHILDREN 1007
English: cognitive developmental study in school-aged
children), a large-scale study into “normal” cognitive
development. A total of N=294 children were admin-
istered the AVF and DF tests. None of these children had
repeated or skipped a grade. Medication use and health
status were assessed by means of a parental-report ques-
tionnaire. Children who used medication that is known to
affect cognitive performance (such as Ritalin) or children
who had clinical conditions that are known to affect cog-
nition (such as epilepsy or attention-deficit/hyperactivity
disorder, ADHD) were excluded from the sample. All
children who participated in the study were native Dutch
speakers, and all parents (or caregivers) of the chil-
dren gave consent for their child to participate in the
research.
Basic demographic data for the eligible sample
(N=294) were provided (see Table 1). The age range of
the eligible children was between 6.56 and 15.85 years.
Age was used as a continuous variable in the norma-
tive procedures (see Introduction), but it was categorized
in bands of 1.5 years in Table 1 for descriptive pur-
poses. The educational level of the children’s parents (or
caregivers) was assessed with a commonly used Dutch
educational 8-point rating scale, which ranges from pri-
mary school to university degree (De Bie, 1987). Mean
level of parental education (MLPE) was dichotomized
into low and high groups (after a median split) for par-
ents who had MLPE values that were <5and5on
the 8-point scale, respectively (with 5 =at most junior
vocational education). These two levels of education cor-
respond with a mean (SD) of about 9.88 (2.59) and
14.68 (3.30) years of full-time education, respectively.
The Ethics Committee of the Faculty of Psychology
and Neuroscience of Maastricht University approved the
study protocol, and all data included in this manuscript
were obtained in compliance with the regulations of this
institution.
Procedure and instruments
The fluency tests were administered to each child indi-
vidually. In the Animal Verbal Fluency (AVF) Test, the
following instruction was given: “Next, I’m going to give
you one minute to tell me all the animals you can think
of. They can be any kind of animals that you can think of,
such as birds, fish, etcetera. Any questions? Go as fast as
you can. Ready? Go.” The total number of correct, non-
repeated animal names is scored and is referred to as the
AVF test scores.
The Design Fluency (DF) Test is part of the
Developmental NEuroPSYcological Assessment
(NEPSY; Korkman et al., 1998). In the DF, the children
were asked (following standard NEPSY test instructions)
to draw as many novel designs as possible (in 60 s) by
connecting two or more dots that were arranged in a
structured or an unstructured array. The test material
consisted of a large sheet with 70 squares, with five dots
in each square. In the structured array, the five dots were
arranged in the same spatial configuration as the number
five on a dice. In the unstructured array, the five dots
were randomly distributed in the square (note that all
squares contained the same random pattern). Scoring
followed the standard NEPSY rules. The number of
correctly generated structured designs and unstructured
designs are referred to as the DF structured and DF
unstructured test scores, respectively.
Statistical analyses
In the exploratory data analysis phase, means and
standard deviations were computed for the AVF, the
DF structured, and the DF unstructured test scores,
and Pearson zero-order correlations between the out-
comes and the independent variables were computed.
The effects of the demographic variables on the AVF,
DF structured, and DF unstructured test scores were
further analyzed with multiple linear regression analy-
ses. The full regression models included age, age2,sex,
MLPE, and all two-way interactions as predictors. Age
was centered (=calendar age – mean age in the sample,
=11 years) before computing the quadratic age term to
avoid multicollinearity (Aiken & West, 1991). Sex was
dummy coded as 1 =male and 0 =female. MLPE was
dummy coded as 1 =high and 0 =low. The full regression
models were reduced in a step-down hierarchical proce-
dure by excluding the nonsignificant predictors from the
models.
The assumptions of regression analysis were tested
for each model: normal distribution of the residuals (by
conducting Kolmogorov–Smirnov tests on the residual
TABLE 1
Basic demographic data
Age Gender Mean level of parental education
Age range (years) N M (SD) Male Female Low High
<8 63 7.13 (0.29) 26 37 35 28
8and<9.5 46 9.02 (0.28) 23 23 24 22
9.5 and <11 23 10.34 (0.63) 14 9 10 13
11 and <12.5 44 11.37 (0.20) 11 33 21 23
12.5 and <14 64 13.37 (0.39) 32 32 25 39
14 54 14.81 (0.52) 30 24 25 29
Total 294 11.08 (2.82) 136 158 140 154
Note. N =294.
Downloaded by [University of Maastricht] at 00:24 24 November 2011
1008 VAN DER ELST ET AL.
values), homoscedasticity (by grouping participants into
quartiles of the predicted scores and applying the Levene
test to the residuals), multicollinearity (by calculating the
variance inflation factors, which should not exceed 10;
Belsley, Kuh, & Welsch, 1980), and influential cases (by
calculating Cook’s distances).
Regression-based normative data were established
by means of a four-step procedure (Van Breukelen
& Vlaeyen, 2005; Van der Elst et al., 2006a, 2006b,
2007). First, the expected fluency test scores were com-
puted by means of the final multiple regression models
(=B0+B1X1+...+BnXn,withB
0=the intercept,
Bn=the regression weights for the demographic vari-
ables, and Xn=the values of the demographic variables).
Second, the residuals were calculated (ei=observed score
– expected score). Third, the residuals were standardized:
Zi=ei/SD(residual), with SD(residual) =the stan-
dard deviation of the residuals in the normative sample.
If heteroscedasticity occurred, the SD(residual) values
that were used in the standardization of the residuals
were computed per quartile of the predicted scores (Van
Breukelen & Vlaeyen, 2005; Van der Elst et al., 2007).
Fourth, the standardized residuals were converted into
percentile values (via the standard normal cumulative dis-
tribution function if the model assumption of normality
of the standardized residuals was met in the normative
sample, or via the empirical cumulative distribution func-
tion of the standardized residuals if the standardized
residuals were not normally distributed in the normative
sample).
An alpha level of .01 was chosen in all analyses to
avoid Type I errors due to multiple testing (Bonferroni
correction for the number of dependent variables, critical
p=.05/3
=.01). All analyses were conducted with the
“R” software package Version 2.10.1 for Linux.
RESULTS
Exploratory data analyses
The mean (SD) AVF, DF structured, and DF unstruc-
tured test scores equaled 18.76 (6.09), 14.16 (5.72), and
14.32 (4.75), respectively. As shown in Table 2, the inter-
correlations between all fluency test scores were signif-
icantly positive (all r.45; p<.01). There was a
significantly positive correlation between age and all out-
come measures (all r.52) and between MLPE and the
AVF and the DF structured test scores (all r.20; all
p<.01). Sex was not significantly correlated with any of
the outcome variables (all r.09, all p>.01).
The effects of demographical variables on fluency
test performance
The final multiple linear regression models are shown
in Table 3. The DF structured test score was square
root transformed because preliminary analyses suggested
that heteroscedasticity occurred with the untransformed
scores (i.e., the error variance increased as a function of
the predicted test scores). After this transformation, the
assumptions of multiple linear regression analysis were
met for all final models (i.e., all values of the Levene
statistics 3.055, all p>.01; all Kolmogorov–Smirnov
Z-values 1.496, all p>.01; all Cook’s distance values
0.037; and all Variance Inflation Factors 1.01).
The AVF and DF structured test scores increased lin-
early as a function of age (see Table 3). In addition to
the linear age effect, the DF unstructured test score was
affected by a quadratic age effect. Figure 1 graphically
presents the relative effects of age on the different flu-
ency test scores.1Children who had parents with a high
MLPE obtained higher AVF and DF structured test
scores than children who had parents with a low MLPE.
Sex did not affect any of the outcome measures, and none
of the interaction terms reached significance. The final
regression models explained between 29 and 54% of the
variance in the fluency test scores.
Normative procedure
Norms for the AVF, DF structured, and DF unstruc-
tured test scores are established by means of the four-step
1Figure 1 presents the expected fluency test scores at a given
age relative to the expected test scores at 6.5 years. Relative rather
than raw expected test scores were presented for reasons of clar-
ity and comparability (i.e., the different outcome measures were
expressed in different units—raw values versus square root trans-
formed values—which makes it difficult to directly compare raw
expected test scores). Note also that Figure 1 presents relative
values for children who have parents with a low MLPE, but
the shape of these curves is very similar for children who have
parents with a high MLPE.
TABLE 2
Zero-order correlations between the fluency test scores and the demographic variables
123456
1: Animal Verbal Fluency test score 1.00
2: Design Fluency structured test score .451.00
3: Design Fluency unstructured test score .46.801.00
4: Age .52.66.651.00
5: Gender .09 .07 .03 .08 1.00
6: Mean level of parental education .22.20.10 .09 .04 1.00
p<.01.
Downloaded by [University of Maastricht] at 00:24 24 November 2011
VERBAL AND DESIGN FLUENCY IN CHILDREN 1009
TABLE 3
Final multiple linear regression models for the Animal Verbal Fluency, the Design Fluency structured, and the Design Fluency
unstructured test scores resulting from a step-down hierarchical procedure
Test score Variable B SE B Std. B T SD (residual) R2
Animal Verbal Fluency (constant) 17.60 0.44 40.34
Age 1.09 0.11 0.65 10.17
MLPE 2.00 0.60 0.13 3.325.12 .29
Design Fluency structured (constant) 3.56 0.05 73.20
Age 0.18 0.01 0.50 14.94
MLPE 0.21 0.07 0.16 3.090.57 .54
Design Fluency unstructured (constant) 14.95 0.32 46.86
Age 1.05 0.07 0.64 14.63
Age20.09 0.03 0.13 2.89 3.42 .43
Note. Encoding of the predictors: age =calendar age – 11, age2=(calendar age – 11)2. MLPE (mean level of parental education): low =
0, high =1. R2is adjusted for the number of predictors in the regression equations.
p<.01.
Figure 1. Relative expected Verbal Fluency, Design Fluency
structured, and Design Fluency unstructured test scores as a
function of age. Relative expected test scores were presented
rather than raw expected test scores for reasons of clarity and
comparability (i.e., the different raw test scores were expressed
in different units).
procedure described above. For example, suppose that
a 10-year-old child who has parents with a low MLPE
generated 15 animal names in 60 s. The first step in the
normative procedure is to calculate the expected AVF test
score for this child (by means of the regression models
that were presented in Table 3), which equals 16.51—that
is, 17.6 +[(10 – 11) ×1.09] +(2.00 ×0). Secondly,
the residual is calculated—that is, –1.51 (=15 – 16.51).
Thirdly, the residual is standardized by means of the
SD(residual) value of the AVF model (see Table 3)—that
is, –0.29 (=–1.51/5.12). Fourthly, the standardized resid-
ual is converted into a percentile value by means of the
standard normal cumulative distribution. A standard-
ized residual equal to –0.29 corresponds with a percentile
value of 38. Thus, 38% of the “normal” population of
10-year-old children who have parents with a low MLPE
obtain a VFT score that is lower than 15. The AVF test
performance of this child is thus within normal limits.
User-friendly normative data
The four-step normative procedure provides highly indi-
vidualized norms but lacks user-friendliness because the
users of the norms have to actively make the required
computations (as described in the previous paragraph).
Therefore, we also provided simplified normative tables
that were derived from the four-step normative procedure
described above (see Tables A1–A3 in the Appendix).
Note that Table A2 provides normative data for the
raw DF structured test score rather than for the DF
structured test score, also in view of increasing the user-
friendliness of the norms. The use of the normative
tables is straightforward. For example, Table A1 (in the
Appendix) immediately shows that the VFT scale score
of 15 that was obtained by 10-year-old child from the
example corresponds to a percentile value between 30
and 50.
The normative tables in the Appendix are user-friendly
but lack some accuracy because a child’s age has to be
rounded up if he or she is not exactly 6.5, 7, 7.5,...,
or 15.5 years old, and because only a limited num-
ber of percentile values can be presented in these tables
(due to space limitations). To maximize both the user-
friendliness and the accuracy of the norms, a computer-
based algorithm of the four-step normative procedure
Downloaded by [University of Maastricht] at 00:24 24 November 2011
1010 VAN DER ELST ET AL.
was implemented in a Microsoft Excel worksheet. In this
worksheet, the user of the norms types in the age of the
tested child and the MLPE of the parents of the tested
child together with his or her observed AVF, DF struc-
tured, and DF unstructured test scores. The worksheet
automatically converts the test scores into percentiles by
using the four-step procedure described above and graph-
ically presents the results. This worksheet is available
via the ‘Supplementary’ tab on the article’s online page
(http://dx.doi.org/10.1080/13803395.2011.589509).
DISCUSSION
The main aim of the present study was to establish
the normal range of performance on the AVF, the DF
structured, and the DF unstructured tests in school-aged
children. As a first step in the normative procedure, the
influence of demographic variables on test performance
was evaluated. In contrast to most previous studies, we
used multiple linear regression models that included both
linear and quadratic age effects as predictors of test per-
formance. An advantage of using such models is that
linear age effects can be distinguished from quadratic
age effects. The results showed that the AVF and the
DF structured test scores increased linearly as a func-
tion of age at a rate of approximately 8.5% and 6.5%
units per year, respectively (see Figure 1). In contrast, the
curvilinear relation between age and the DF unstructured
test score suggested that the relative improvement in test
performance was much more pronounced for younger
children than for older children (see Figure 1). For exam-
ple, the DF unstructured test score of a 7.5-year-old child
was about 21.1% higher than the DF unstructured test
score of a 6.5-year-old child, whilst the DF unstructured
test score of a 15.5-year-old child was only about 3.9%
higher than the test score of a 14.5-year-old child.
In addition to age (and age2for the DF unstructured
test score), MLPE profoundly affected the AVF and the
DF structured test scores. For example, children who
had parents with a high MLPE generated on average
two words more than children who had parents with
a low MLPE. A difference of two words is substantial
and corresponded with about two years of development
(i.e., 2.00/1.09 =1.83 years; see Table 3)—for exam-
ple, the expected AVF test score of a 10-year-old child
who had parents with a high MLPE was approximately
equal to the expected AVF test score of a 12-year-old
child who had parents with a low MLPE. Previous stud-
ies also found that MLPE is associated with verbal and
design fluency test performance in children (Ardila et al.,
2005; Hurks et al., 2010; Hurks et al., 2006; Klenberg
et al., 2001), but these studies did not make a distinction
between the structured and the unstructured test versions
of the DF—as was done in the present study. We found
that MLPE had a differential effect on DF test perfor-
mance: MLPE was significantly associated with the DF
structured test score, but not with DF unstructured test
score. This result was unexpected, because it is generally
assumed that the DF structured and the DF unstructured
test versions measure the same underlying construct(s).
However, if this were to be the case, it would be expected
that the scores on both test versions would be influenced
in a similar way by an important demographic variable
such as MLPE—which was not the case.
It is possible that Type I errors accounted for the dif-
ferential effects of MLPE on the DF structured and the
DF unstructured test scores, but this explanation seems
to be unlikely. Indeed, the correlation between MLPE
and the DF unstructured test score was not significantly
different from zero (i.e., r=.10; see Table 2), and the
p-value of the regression weight of MLPE was high when
MLPE was forced to remain in a regression model that
also included age and age2as predictors (p=.43; data
not shown). Another possibility for explaining the differ-
ential effects of MLPE on the DF structured and the DF
unstructured test scores is that both test versions measure
several common constructs (such as executive functions
and visuoconstructional abilities), but also tap constructs
that are unique for each test version. Circumstantial evi-
dence for this claim can be found in the high—but not
extreme—correlation between the DF structured and the
DF unstructured test scores (i.e., r=.80; see Table 2).
For example, it is possible that mental rotation is a con-
struct that is tapped by the DF structured test version,
but not by the DF unstructured test version. Indeed,
the symmetrical organization of the array of dots of the
DF structured test version allows the child to generate
new designs by, for example, mentally rotating existing
designs, but this is not possible in the DF unstructured
test version (because of the nonsymmetrical organization
of its array of dots). Previous research has also shown
that MLPE is associated with the mental rotation abilities
of a child (see, e.g., Noble, McCandliss, & Farah, 2007),
which could thus explain the differential associations of
MLPE with the DF structured and DF unstructured test
scores. It is unknown whether this explanation is correct
(no empirical studies have yet evaluated the association
between mental rotation abilities and performance on
the DF structured and unstructured test versions), but
no matter what explanation accounts for the differen-
tial effects of the demographic variables on fluency test
performance, from a psychometric viewpoint this finding
implies that the normative data should be differentially
corrected for the relevant demographic influences. Thus
the normative data for the VFT and the DF structured
test scores were corrected for linear age-effects and for
MLPE, whilst the normative data for the DF unstruc-
tured test score were corrected for linear and quadratic
age-effects. In agreement with most previous studies (see
Introduction), sex had no significant influence on the
AVF and DF scores. Normative tables were established,
as well as an automatic scoring program that maxi-
mizes both the user-friendliness and the accuracy of the
norms.
There are some limitations of this study that warrant
further discussion. First, the present study involved a
sample of Dutch-speaking children. The question arises
whether normative data that were based on a Dutch-
speaking sample can also be applied to children with
a different native language or a different cultural back-
ground. Research in adults and the elderly has shown
Downloaded by [University of Maastricht] at 00:24 24 November 2011
VERBAL AND DESIGN FLUENCY IN CHILDREN 1011
that linguistic factors significantly affect AVF test perfor-
mance. For example, Kempler and colleagues (Kempler,
Teng, Dick, Taussig, & Davis, 1998) found that Spanish
speakers generated the smallest number of animal names
in comparison to Chinese and English speakers, and
Vietnamese speakers generated most animal names. The
researchers related these differences to differences in the
length of words for animal names in these languages
(with animal names being longest in Spanish and short-
est in Vietnamese). In children, the effects of linguistic
factors on AVF test performance have not yet been eval-
uated, but it is conceivable that similar influences exist.
It is thus not recommended to use the AVF test norms
that were established in the present study to evaluate
the test performance of non-Dutch-speaking children—
at least not until the influence of linguistic factors on
AVF test performance in children has been evaluated in
more detail. Linguistic differences per se are unlikely to
affect the DF test performance, but cross-cultural differ-
ences in test performance may nevertheless arise because
of differences in general factors such as “degree of famil-
iarity with formal testing” (Ardila, 1995). In Western
countries, differences in these factors are minimal and
are unlikely to substantially affect test performance, but
larger differences may exist between children who live
in Western and in non-Western countries. It is therefore
not recommended to use our DF structured and DF
unstructured test norms to evaluate the test performance
of non-Western children.
Second, the regression models (and normative data)
that were established in the present study were based
on the data of a sample of children who were aged
between 6.56 and 15.85 years. The question rises whether
the regression models can also be used to compute the
expected test scores for children whose age falls outside
the age range that was considered during the regression
model building phase. Extrapolating a regression model
requires several assumptions. For example, if we want to
estimate the expected test scores of a 17-year-old child
(as based on the regression models that were presented in
Table 3), we have to assume that the relation between age
and fluency test performance in the age range between
15.85 and 17 years is identical (or at least very similar)
to the relation between age and fluency test performance
in the age range between 6.56 and 15.85 years. If this
assumption is not valid, the resulting normative data may
be severely biased (e.g., a child with a “normal” test
performance may be classified as being “impaired,” or
vice versa). Thus, extrapolation of the regression models
beyond the age range that was considered during the
model building phase is not recommended.
Supplementary material
Supplementary content (an Excel file) is available via
the ‘Supplementary’ tab on the article’s online page
(http://dx.doi.org/10.1080/13803395.2011.589509).
Original manuscript received 22 December 2010
Revised manuscript accepted 14 April 2011
First published online 20 July 2011
REFERENCES
Aiken, L. S., & West, S. G. (1991). Multiple regression: Testing
and interpreting interactions. Newbury Park, CA: Sage.
Anderson, V. (1998). Assessing executive functions in children:
Biological, psychological and developmental considerations.
Neuropsychological Rehabilitation,8, 319–349.
Anderson, V. A., Anderson, P., Northam, E., Jacobs, R., &
Catroppa, C. (2001). Development of executive functions
through late childhood and adolescence in an Australian
sample. Developmental Neuropsychology,20, 385–406.
Ardila, A. (1995). Directions of research in cross-cultural
neuropsychology. Journal of Clinical and Experimental
Neuropsychology,17, 143–150.
Ardila, A., Rosselli, M., Matute, E., & Guajardo, S.
(2005). The influence of the parents’ educational level
on the development of executive functions. Developmental
Neuropsychology,28, 539–560.
Baldo, J. V., Shimamura, A. P., Delis, D. C., Kramer, J., &
Kaplan, E. (2001). Verbal and design fluency in patients
with frontal lobe lesions. Journal of the International
Neuropsychological Society,7, 586–596.
Baron, I. S. (2004). Neuropsychological evaluation of the child.
New York, NY: Oxford University Press.
Belsley, D. A., Kuh, E., & Welsch, R. E. (1980). Regression diag-
nostics: Identifying influential data and sources of collinearity.
New York, NY: Wiley.
Brocki, K. C., & Bohlin, G. (2004). Executive functions in chil-
dren aged 6 to 13: A dimensional and developmental study.
Developmental Neuropsychology,26, 571–593.
Capitani, E. (1997). Normative data and neuropsychologi-
cal assessment. Common problems in clinical practice and
research. Neuropsychological Rehabilitation,7, 295–309.
Cohen, M. J., Morgan, A. M., Vaughn, M., Riccio, C. A., &
Hall, J. (1999). Verbal fluency in children: Developmental
issues and differential validity in distinguishing children
with attention-deficit hyperactivity disorder and two sub-
types of dyslexia. Archives of Clinical Neuropsychology,14,
433–443.
Crawford, J. R., & Howell, D. C. (1998). Regression equations in
clinical neuropsychology: An evaluation of statistical meth-
ods for comparing predicted and obtained scores. Journal of
Clinical and Experimental Neuropsychology,20, 755–762.
De Bie, S. E. (1987). Standaardvragen 1987: Voorstellen voor
uniformering van vraagstellingen naar achtergrondkenmerken
en interviews [Standard questions 1987: Proposal for uni-
formization of questions regarding background variables
and interview] (2nd ed.). Leiden, The Netherlands: Leiden
University Press.
Diesfeldt, H. F. A., Van der Elst, W., & Jolles, J. (2009). Klinische
bruikbaarheid van categoriegebonden woordproductie voor
het onderscheiden van dementie en normale cognitieve
veroudering [Category fluency in normal cognitive aging
and dementia]. Tijdschrift voor Gerontologie en Geriatrie,40,
54–71.
Flowers, K. A., Robertson, C., & Sheridan, M. R. (1995). Some
characteristics of word fluency in Parkinson’s disease. Journal
of Neurolinguistics,9, 33–46.
Heaton, R. K., Miller, S. W., Taylor, M. J., & Grant, I. (2004).
Revised comprehensive norms for an Expanded Halstead–
Reitan Battery: Demographically adjusted neuropsychological
norms for African American and Caucasian adults. Lutz, FL:
Psychological Assessment Resources.
Henry, J. D., & Crawford, J. R. (2004). A meta-analytic review of
verbal fluency performance following focal cortical lesions.
Neuropsychology,18, 284–295.
Hurks, P. P. M., Schrans, D., Meijs, C., Wassenberg, R.,
Feron, F. J. M., & Jolles, J. (2010). Developmental changes
in semantic verbal fluency: Analyses of word productiv-
ity as a function of time, clustering and switching. Child
Neuropsychology,16, 366–387.
Hurks, P. P. M., Vles, J. S. H., Hendriksen, J. G. M., Kalff, A. C.,
Feron, F. J. M., Kroes, M., et al. (2006). Semantic cate-
gory fluency versus initial letter fluency over 60 seconds as
a measure of automatic and controlled processing in healthy
Downloaded by [University of Maastricht] at 00:24 24 November 2011
1012 VAN DER ELST ET AL.
school-aged children. Journal of Clinical and Experimental
Neuropsychology,28, 684–695.
Jones-Gotman, M. (1990). Presurgical psychological assessment
in children: Special tests. Journal of Epilepsy,3, 93–102.
Jurado, M. B., & Rosselli, M. (2007). The elusive nature of
executive functions: A review of our current understanding.
Neuropsychology Review,17, 213–233.
Kempler, D., Teng, E. L., Dick, M., Taussig, M., & Davis, D. S.
(1998). The effects of age, education, and ethnicity on ver-
bal fluency. Journal of the International Neuropsychological
Society,4, 531–538.
Klenberg, L., Korkman, M., & Lahti-Nuuttila, P. (2001).
Differential development of attention and executive func-
tions in 3- to 12-year-old Finnish children. Developmental
Neuropsychology,20, 407–428.
Korkman, M., Kemp, S. L., & Kirk, U. (2001). Effects of age on
neurocognitive measures of children ages 5 to 12: A cross-
sectional study on 800 Children from the United States.
Developmental Neuropsychology, 20, 331–354.
Korkman, M., Kirk, U., & Kemp, S. L. (1998). NEPSY: A devel-
opmental neuropsychological assessment. San Antonio, TX:
Psychological Corporation.
Levin, H. S., Song, J., Ewing-Cobbs, L., Chapman, S. B., &
Mendelsohn, D. (2001). Word fluency in relation to severity
of closed head injury, associated frontal brain lesions, and
age at injury in children. Neuropsychologia,39, 122–131.
Lezak, M. D., Howieson, D. B., & Loring, D. W. (2004).
Neuropsychological assessment (4th ed.). New York, NY:
Oxford University Press.
Matute, E., Rosselli, M., Ardila, A., & Morales, G. (2004).
Verbal and nonverbal fluency in Spanish-speaking children.
Developmental Neuropsychology,26, 647–660.
Mitrushina, M. N., Boone, K. B., Razani, J., & D’Elia, L. F.
(2005). Handbook of normative data for neuropsychological
assessment (2nd ed.). New York, NY: Oxford University
Press.
Noble, K. G., McCandliss, B. D., & Farah, M. J. (2007).
Socioeconomic gradients predict individual differences in
neurocognitive abilities. Developmental Science,10, 464–480.
Prigatano, G. P., Gray, J. A., & Lomay, V. T. (2008).
Verbal (animal) fluency scores in age/grade appropriate
minority children from low socioeconomic backgrounds.
Journal of the International Neuropsychological Society,14,
143–147.
Raskin, S. A., & Rearick, E. (1996). Verbal fluency in individu-
als with mild traumatic brain injury. Neuropsychology,10,
416–422.
Riva, D., Nichelli, F., & Devoti, M. (2000). Developmental
aspects of verbal fluency and confrontation naming in chil-
dren. Brain and Language,71, 267–284.
Ruff, R. M., Light, R. H., Parker, S. B., & Levin, H. S.
(1997). The psychological construct of word fluency. Brain
and Language,57, 394–405.
Schonfeld, A. M., Mattson, S. N., Lang, A. R., Delis, D. C., &
Ripley, E. P. (2001). Verbal and nonverbal fluency in children
with heavy prenatal alcohol exposure. Journal of Studies on
Alcohol,62, 239–246.
Sergeant, J. A., Geurts, H., & Oosterlaan, J. (2002). How spe-
cific is a deficit of executive functioning for attention-deficit
hyperactivity disorder? Behavioural Brain Research,130,
3–28.
Strauss, E., Sherman, E. M. S., & Spreen, O. (2006). A
compendium of neuropsychological tests: Administration,
norms, and commentary (3rd. ed.). New York, NY: Oxford
University Press.
Testa, S. M., Winicki, J. M., Pearlson, G. D., Gordon, B.,
& Schretlen, D. J. (2009). Accounting for estimated IQ in
neuropsychological test performance with regression-based
techniques. Journal of the International Neuropsychological
Society,15, 1012–1022.
Van Breukelen, G. J. P., & Vlaeyen, J. W. S. (2005). Norming
clinical questionnaires with multiple regression: The
Pain Cognition List. Psychological Assessment,17,
336–344.
Van der Elst, W. (2006). The neuropsychometrics of aging.
Normative studies in the Maastricht Aging Study. Maastricht,
The Netherlands, Neuropsy Publishers.
Van der Elst, W., Van Boxtel, M. P. J., Van Breukelen, G. J. P., &
Jolles, J. (2006a). Normative data for the Animal, Profession
and Letter M Naming verbal fluency tests for Dutch speak-
ing participants and the effects of age, education, and sex.
Journal of the International Neuropsychological Society,12,
80–89.
Van der Elst, W., Van Boxtel, M. P. J., Van Breukelen, G. J. P.,
& Jolles, J. (2006b). The Stroop Color-Word Test: Influence
of age, sex, and education; and normative data for a
large sample across the adult age range. Assessment,13,
62–79.
Van der Elst, W., Van Boxtel, M. P. J., Van Breukelen, G. J. P.,
& Jolles, J. (2007). Assessment of information process-
ing in working memory in applied settings: The paper &
pencil memory scanning test. Psychological Medicine,37,
1335–1344.
Downloaded by [University of Maastricht] at 00:24 24 November 2011
VERBAL AND DESIGN FLUENCY IN CHILDREN 1013
TABLE A1
Normative data for the Animal Verbal Fluency Test, stratified by age and mean level of parental education
Age (years)
APPENDIX
Mean level of
parental
education Percentiles 6.5 7 7.5 8 8.5 9 9.5 10 10.5 11 11.5 12 12.5 13 13.5 14 14.5 15 15.5 Percentiles
Low 99 24.6 25.2 25.7 26.2 26.8 27.3 27.9 28.4 29.0 29.5 30.1 30.6 31.1 31.7 32.2 32.8 33.3 33.9 34.4 99
95 21.1 21.7 22.2 22.8 23.3 23.8 24.4 24.9 25.5 26.0 26.6 27.1 27.7 28.2 28.7 29.3 29.8 30.4 30.9 95
90 19.3 19.8 20.3 20.9 21.4 22.0 22.5 23.1 23.6 24.2 24.7 25.3 25.8 26.3 26.9 27.4 28.0 28.5 29.1 90
80 17.0 17.5 18.1 18.6 19.2 19.7 20.3 20.8 21.4 21.9 22.5 23.0 23.5 24.1 24.6 25.2 25.7 26.3 26.8 80
70 15.4 15.9 16.5 17.0 17.6 18.1 18.6 19.2 19.7 20.3 20.8 21.4 21.9 22.5 23.0 23.6 24.1 24.6 25.2 70
50 12.7 13.2 13.8 14.3 14.9 15.4 16.0 16.5 17.1 17.6 18.1 18.7 19.2 19.8 20.3 20.9 21.4 22.0 22.5 50
30 10.0 10.6 11.1 11.6 12.2 12.7 13.3 13.8 14.4 14.9 15.5 16.0 16.6 17.1 17.6 18.2 18.7 19.3 19.8 30
20 8.4 8.9 9.5 10.0 10.6 11.1 11.7 12.2 12.7 13.3 13.8 14.4 14.9 15.5 16.0 16.6 17.1 17.7 18.2 20
10 6.1 6.7 7.2 7.8 8.3 8.9 9.4 9.9 10.5 11.0 11.6 12.1 12.7 13.2 13.8 14.3 14.9 15.4 15.9 10
5 4.3 4.8 5.4 5.9 6.5 7.0 7.5 8.1 8.6 9.2 9.7 10.3 10.8 11.4 11.9 12.4 13.0 13.5 14.1 5
1 1.0 1.3 1.9 2.4 3.0 3.5 4.1 4.6 5.1 5.7 6.2 6.8 7.3 7.9 8.4 9.0 9.5 10.0 10.6 1
High 99 26.6 27.2 27.7 28.2 28.8 29.3 29.9 30.4 31.0 31.5 32.1 32.6 33.1 33.7 34.2 34.8 35.3 35.9 36.4 99
95 23.1 23.7 24.2 24.8 25.3 25.8 26.4 26.9 27.5 28.0 28.6 29.1 29.7 30.2 30.7 31.3 31.8 32.4 32.9 95
90 21.3 21.8 22.3 22.9 23.4 24.0 24.5 25.1 25.6 26.2 26.7 27.3 27.8 28.3 28.9 29.4 30.0 30.5 31.1 90
80 19.0 19.5 20.1 20.6 21.2 21.7 22.3 22.8 23.4 23.9 24.5 25.0 25.5 26.1 26.6 27.2 27.7 28.3 28.8 80
70 17.4 17.9 18.5 19.0 19.6 20.1 20.6 21.2 21.7 22.3 22.8 23.4 23.9 24.5 25.0 25.6 26.1 26.6 27.2 70
50 14.7 15.2 15.8 16.3 16.9 17.4 18.0 18.5 19.1 19.6 20.1 20.7 21.2 21.8 22.3 22.9 23.4 24.0 24.5 50
30 12.0 12.6 13.1 13.6 14.2 14.7 15.3 15.8 16.4 16.9 17.5 18.0 18.6 19.1 19.6 20.2 20.7 21.3 21.8 30
20 10.4 10.9 11.5 12.0 12.6 13.1 13.7 14.2 14.7 15.3 15.8 16.4 16.9 17.5 18.0 18.6 19.1 19.7 20.2 20
10 8.1 8.7 9.2 9.8 10.3 10.9 11.4 11.9 12.5 13.0 13.6 14.1 14.7 15.2 15.8 16.3 16.9 17.4 17.9 10
5 6.3 6.8 7.4 7.9 8.5 9.0 9.5 10.1 10.6 11.2 11.7 12.3 12.8 13.4 13.9 14.4 15.0 15.5 16.1 5
1 2.8 3.3 3.9 4.4 5.0 5.5 6.1 6.6 7.1 7.7 8.2 8.8 9.3 9.9 10.4 11.0 11.5 12.0 12.6 1
Note. The raw test scores corresponding with percentiles 1, 5, 10, 20, 30, 50, 70, 80, 90, 95, and 99 are given.
Downloaded by [University of Maastricht] at 00:24 24 November 2011
1014 VAN DER ELST ET AL.
TABLE A2
Normative data for the Design Fluency structured test score, stratified by age and mean level of parental education
Age (years)
Mean level of
parental
education Percentiles 6.5 7 7.5 8 8.5 9 9.5 10 10.5 11 11.5 12 12.5 13 13.5 14 14.5 15 15.5 Percentiles
Low 99 16.6 17.4 18.1 18.9 19.7 20.5 21.3 22.1 23.0 23.9 24.8 25.7 26.6 27.5 28.5 29.4 30.4 31.4 32.4 99
95 13.6 14.3 15.0 15.7 16.4 17.1 17.9 18.6 19.4 20.2 21.0 21.9 22.7 23.6 24.5 25.4 26.3 27.2 28.2 95
90 12.1 12.7 13.4 14.1 14.7 15.4 16.2 16.9 17.6 18.4 19.2 20.0 20.8 21.6 22.5 23.3 24.2 25.1 26.0 90
80 10.4 11.0 11.6 12.2 12.9 13.5 14.2 14.9 15.6 16.3 17.1 17.8 18.6 19.4 20.2 21.0 21.8 22.7 23.5 80
70 9.3 9.9 10.4 11.0 11.6 12.2 12.9 13.5 14.2 14.9 15.6 16.3 17.0 17.8 18.6 19.4 20.2 21.0 21.8 70
50 7.6 8.1 8.6 9.1 9.7 10.2 10.8 11.4 12.0 12.7 13.3 14.0 14.7 15.4 16.1 16.8 17.6 18.3 19.1 50
30 6.0 6.5 6.9 7.4 7.9 8.4 8.9 9.5 10.1 10.6 11.2 11.8 12.5 13.1 13.8 14.4 15.1 15.8 16.6 30
20 5.2 5.6 6.0 6.5 6.9 7.4 7.9 8.4 8.9 9.5 10.1 10.6 11.2 11.8 12.5 13.1 13.8 14.4 15.1 20
10 4.1 4.5 4.8 5.2 5.7 6.1 6.6 7.0 7.5 8.0 8.5 9.1 9.6 10.2 10.8 11.4 12.0 12.6 13.2 10
5 3.3 3.6 4.0 4.3 4.7 5.1 5.5 6.0 6.4 6.9 7.4 7.9 8.4 8.9 9.4 10.0 10.6 11.2 11.8 5
1 2.0 2.3 2.6 2.9 3.2 3.5 3.9 4.2 4.6 5.0 5.4 5.8 6.3 6.7 7.2 7.7 8.2 8.7 9.3 1
High 99 18.4 19.1 19.9 20.8 21.6 22.4 23.3 24.2 25.1 26.0 26.9 27.8 28.8 29.8 30.8 31.8 32.8 33.8 34.9 99
95 15.2 15.9 16.6 17.4 18.1 18.9 19.7 20.5 21.3 22.2 23.0 23.9 24.8 25.7 26.6 27.5 28.5 29.5 30.4 95
90 13.6 14.3 15.0 15.7 16.4 17.1 17.9 18.7 19.5 20.3 21.1 21.9 22.8 23.6 24.5 25.4 26.3 27.3 28.2 90
80 11.8 12.5 13.1 13.8 14.4 15.1 15.8 16.6 17.3 18.1 18.8 19.6 20.4 21.2 22.1 22.9 23.8 24.7 25.6 80
70 10.6 11.2 11.8 12.5 13.1 13.8 14.4 15.1 15.8 16.6 17.3 18.1 18.8 19.6 20.4 21.2 22.1 22.9 23.8 70
50 8.8 9.3 9.9 10.4 11.0 11.6 12.3 12.9 13.5 14.2 14.9 15.6 16.3 17.1 17.8 18.6 19.4 20.2 21.0 50
30 7.1 7.6 8.1 8.6 9.1 9.7 10.2 10.8 11.4 12.0 12.7 13.3 14.0 14.7 15.4 16.1 16.8 17.6 18.3 30
20 6.2 6.6 7.1 7.6 8.1 8.6 9.1 9.7 10.2 10.8 11.4 12.0 12.7 13.3 14.0 14.7 15.4 16.1 16.8 20
10 5.0 5.4 5.8 6.2 6.7 7.2 7.7 8.2 8.7 9.2 9.8 10.4 11.0 11.6 12.2 12.8 13.5 14.1 14.8 10
5 4.1 4.5 4.9 5.3 5.7 6.1 6.6 7.0 7.5 8.0 8.5 9.1 9.6 10.2 10.8 11.4 12.0 12.6 13.3 5
1 2.7 3.0 3.3 3.6 4.0 4.3 4.7 5.1 5.5 6.0 6.4 6.9 7.4 7.9 8.4 8.9 9.4 10.0 10.6 1
Note. The raw test scores corresponding with percentiles 1, 5, 10, 20, 30, 50, 70, 80, 90, 95, and 99 are given.
Downloaded by [University of Maastricht] at 00:24 24 November 2011
VERBAL AND DESIGN FLUENCY IN CHILDREN 1015
TABLE A3
Normative data for the Design Fluency unstructured test score, stratified by age
Age (years)
Mean level of
parental
education Percentiles 6.5 7 7.5 8 8.5 9 9.5 10 10.5 11 11.5 12 12.5 13 13.5 14 14.5 15 15.5 Percentiles
Low or high 99 16.4 17.3 18.1 18.9 19.7 20.4 21.1 21.8 22.4 22.9 23.4 23.9 24.3 24.6 25.0 25.2 25.5 25.7 25.8 99
95 14.0 14.9 15.8 16.6 17.4 18.1 18.8 19.4 20.0 20.6 21.1 21.5 21.9 22.3 22.6 22.9 23.1 23.3 23.5 95
90 12.8 13.7 14.6 15.4 16.1 16.9 17.6 18.2 18.8 19.3 19.8 20.3 20.7 21.1 21.4 21.7 21.9 22.1 22.2 90
80 11.3 12.2 13.1 13.9 14.6 15.4 16.1 16.7 17.3 17.8 18.3 18.8 19.2 19.6 19.9 20.2 20.4 20.6 20.7 80
70 10.2 11.1 12.0 12.8 13.6 14.3 15.0 15.6 16.2 16.7 17.2 17.7 18.1 18.5 18.8 19.1 19.3 19.5 19.6 70
50 8.4 9.3 10.2 11.0 11.8 12.5 13.2 13.8 14.4 15.0 15.5 15.9 16.3 16.7 17.0 17.3 17.5 17.7 17.9 50
30 6.6 7.5 8.4 9.2 10.0 10.7 11.4 12.0 12.6 13.2 13.7 14.1 14.5 14.9 15.2 15.5 15.7 15.9 16.1 30
20 5.5 6.4 7.3 8.1 8.9 9.6 10.3 10.9 11.5 12.1 12.6 13.0 13.4 13.8 14.1 14.4 14.6 14.8 15.0 20
10 4.0 4.9 5.8 6.6 7.4 8.1 8.8 9.4 10.0 10.6 11.1 11.5 11.9 12.3 12.6 12.9 13.1 13.3 13.5 10
5 2.8 3.7 4.5 5.4 6.1 6.9 7.5 8.2 8.8 9.3 9.8 10.3 10.7 11.1 11.4 11.7 11.9 12.1 12.2 5
1 1.0 1.4 2.2 3.0 3.8 4.5 5.2 5.9 6.4 7.0 7.5 8.0 8.4 8.7 9.1 9.3 9.6 9.8 9.9 1
Note. The raw test scores corresponding with percentiles 1, 5, 10, 20, 30, 50, 70, 80, 90, 95, and 99 are given.
Downloaded by [University of Maastricht] at 00:24 24 November 2011
... Performance on DF tasks is thought to rely on a variety of cognitive abilities, including fine motor skills, visuospatial skills, and visuo-constructive abilities). However, DF tasks are mainly used to assess executive functions, such as attentional control, cognitive flexibility, self-monitoring, and strategy use (Goebel et al., 2009;Hurks, 2013;Lezak et al., 2012;Van der Elst et al., 2011) and associations between DF task performance and performance on other executive function tasks are frequently found (Cattelani et al., 2011;Goebel et al., 2009;Tucha et al., 2012). Impairments in DF performance have been found in pediatric patients with known difficulties in executive functions, such as attention deficit hyperactivity disorder, autism spectrum disorders, dyslexia, prenatal alcohol exposure, and traumatic brain injury (Fournier et al., 2020;Gauthier et al., 2016;Korkman et al., 1998;Levin et al., 2001;Reiter et al., 2005;Robinson & Tripp, 2013;Schonfeld et al., 2001;Mélina;Tessier et al., 2022). ...
... Previous studies have investigated the total number of correct designs in children aged 5-15 years and have found that performance increased with increasing age (Albert et al., 2009;Ardila et al., 2005;Ebersbach & Hagedorn, 2011;Hurks, 2013;Hurks et al., 2010;Klenberg et al., 2001;Korkman et al., 2001;Matute et al., 2004;Regard et al., 1982;Stievano & Scalisi, 2016;Tucha et al., 2012;Van der Elst et al., 2011). Agerelated changes in clustering and switching during the DF task were found between children from age 7 up to approximately age 15, with older children producing more and larger clusters and switching more than younger children (Hurks, 2013;Hurks et al., 2010). ...
... While these studies have shown that age is an important contributor to DF performance, age does not explain all variability in DF performance (Matute et al., 2004). Previous studies suggest that age-extrinsic factors, such as sex and the level of parental education (LPE), may explain another part of the inter-individual differences in performance (Hurks et al., 2010;Klenberg et al., 2001;Resch et al., 2014;Van der Elst et al., 2011). In terms of the total number of correct designs produced on a DF task, most previous studies in children aged 5 to 15 found no differences between boys and girls (Albert et al., 2009;Ardila et al., 2005;Ebersbach & Hagedorn, 2011;Hurks et al., 2010;Klenberg et al., 2001;Korkman et al., 2001;Matute et al., 2004;Regard et al., 1982;Stievano & Scalisi, 2016;Tucha et al., 2012;Van der Elst et al., 2011). ...
... Also, some studies reported no sex/gender effect on the total number of words (e.g., Hurks et al. 2006;Kavé 2005;Van der Elst et al. 2011). Other studies reported a small female advantage in SVF (Olabarrieta-Landa et al. 2017;Koren et al. 2005;Rivera et al. 2021) or even others found that boys outperform girls (Ardila et al. 2005;Prigatano et al. 2008). ...
... In children, few studies investigated the effect of parents' educational level and found that a higher level of parental education is associated with children's better performance in verbal fluency (Aksamovic et al. 2019;Ardila et al. 2005;Hurks et al. 2006; Van der Elst et al. 2011). As for the effect of parental education on clustering and switching strategies, a positive effect of parental education on mean cluster size (a lexico-semantic measure) but not on the number of clusters or the number of switches (executive functioning measures) is reported (Hurks et al. 2010). ...
... Previous results on the effect of parental education have been contradictory. Some studies on children of different ages (Aksamovic et al. 2019;Ardila et al. 2005;Hurks et al. 2006Hurks et al. , 2010Van der Elst et al. 2011) have suggested that parental education mainly affects the NCW and the mean cluster size, which are linked to a large extent to children's lexico-semantic knowledge. These findings are explained by studies on language development showing that children from higher-educated parents also tend to have larger vocabularies (e.g., Hart and Risley 1995;Hoff 2003). ...
Article
Full-text available
Performance in semantic verbal fluency (SVF) tasks, mainly measured by the number of words of a particular semantic category produced within a limited time, is a widely accepted measure of cognitive functioning used in the neuropsychological assessment of children and adults. Two strategic processes, Clustering and Switching (C&S) have been proposed to underlie fluency processes and affect performance in the task. However, few studies have reported on the development of those cognitive strategies and their relationship with word productivity in typically developing children. Even fewer studies have covered a broad developmental period from preschool to adolescence or measured the effect of contextual factors in this relationship. Based on a sample of 472 typically developing Greek-speaking children aged 4;0 to 16;11 years, we investigated the development of SVF performance and reported on the degree to which it is affected by C&S strategies, children’s sex, and level of parental education. Results revealed a large effect of age on word productivity and on the use of C&S strategies. Two switching factors (number of clusters and number of switches) and two clustering factors (mean cluster size and a novel measure, maximum cluster size), appeared to be significantly associated with word productivity, with the largest effect being attributed to the two switching factors. C&S factors, together with children’s age and parental education, predicted 91.7% of the variance in the SVF score. Children’s sex was not found to have a significant effect on either word productivity or C&S strategies. Results are discussed for their theoretical implications on the strategic processes underlying word production in typically developing children.
... Several VFT exist, but semantic and phonological VFT are widely used within the paediatric population. Animals and fruits represent the most common categories used during semantic verbal fluency with children and adolescents (Martins-Oliveira et al., 2016;Olabarrieta-Landa et al., 2017a;Van der Elst et al., 2011). For phonological verbal fluency, cultural and linguistic aspects are taken into consideration to determine the letter to use. ...
... According to the literature, the most important variable associated with children and adolescents' verbal fluency performance is age (i.e., Olabarrieta-Landa et al., 2017b;Van der Elst et al., 2011), showing older children and adolescents perform better on VFT. This is not surprising since verbal fluency requires executive function and frontal lobe maturation, and children do not yet have fully achieved these milestones (Anderson, 2002;Anderson et al., 2001). ...
... It seems that children and adolescents whose parents had higher educational level perform better in VFT compared with those with lower educational level (i.e., Olabarrieta-Landa et al., 2017b;Rivera et al., 2021;Van der Els et al., 2011). Sex has been inconsistently associated with VFT; however, most studies have reported no sex differences (i.e., Malloy-Diniz et al., 2007;Ruffieux et al., 2009;Tallberg et al., 2011;Van der Elst et al., 2011) or when differences arise, its impact was very small compared with age or parents' education (Olabarrieta-Landa et al., 2017b;Rivera et al., 2021). Thus, both age and parents' education level are extremely important variables to consider when assessing verbal fluency in this population. ...
Article
Background: Verbal fluency tests (VFT) are highly sensitive to cognitive deficits. Usually, the score on VFT is based on the number of correct words produced, yet it alone gives little information regarding underlying test performance. The implementation of different strategies (cluster and switching) to perform efficiently during the tasks provide more valuable information. However, normative data for clustering and switching strategies are scarce. Moreover, scoring criteria adapted to Colombian Spanish are missing. Aims: (1) To describe the Colombian adaptation of the scoring system guidelines for clustering and switching strategies in VFT; (2) to determine its reliability; and (3) to provide normative data for Colombian children and adolescents aged 6-17 years. Methods & procedures: A total of 691 children and adolescents from Colombia completed phonological (/f/, /a/, /s/, /m/, /r/ and /p/) and semantic (animals and fruits) VFT, and five scores were calculated: total score (TS), number of clusters (NC), cluster size (CS), mean cluster size (MCS) and number of switches (NS). The intraclass correlation coefficient was used for interrater reliability. Hierarchical multiple regressions were conducted to investigate which strategies were associated with VFT TS. Multiple regressions were conducted for each strategy, including as predictors age, age2 , sex, mean parents' education (MPE), MPE2 and type of school, to generate normative data. Outcomes & results: Reliability indexes were excellent. Age was associated with VFT TS, but weakly compared with strategies. For both VFT TS, NS was the strongest variable, followed by CS and NC. Regarding norms, age was the strongest predictor for all measures, while age2 was relevant for NC (/f/ phoneme) and NS (/m/ phoneme). Participants with higher MPE obtained more NC, and NS, and larger CS in several phonemes and categories. Children and adolescents from private school generated more NC, NS and larger CS in /s/ phoneme. Conclusions & implications: This study provides new scoring guidelines and normative data for clustering and switching strategies for Colombian children and adolescents between 6 and 17 years old. Clinical neuropsychologists should include these measures as part of their everyday practice. What this paper adds: What is already known on the subject VFT are widely used within the paediatric population due to its sensitivity to brain injury. Its score is based on the number of correct words produced; however, TS alone gives little information regarding underlying test performance. Several normative data for VFT TS in the paediatric population exist, but normative data for clustering and switching strategies are scarce. What this paper adds to existing knowledge The present study is the first to describe the Colombian adaptation of the scoring guidelines for clustering and switching strategies, and provided normative data for these strategies for children and adolescents between 6 and 17 years old. What are the potential or actual clinical implications of this work? Knowing VFT's performance, including strategy development and use in healthy children and adolescents, may be useful for clinical settings. We encourage clinicians to include not only TS, but also a careful analysis of strategies that may be more informative of the underlying cognitive processes failure than TS.
... The studies aimed to obtain normative data of fluency tests in languages such as Chinese, Hebrew, and Dutch in children, adolescents, and adults (Chan and Poon, 1999;Kavé et al., 2008;Lee et al., 2002;Van der Elst et al., 2011) reported a significant correlation between age and outcome measures. In addition, education (Lee et al., 2002) and parents' education ( Van der Elst et al., 2011) showed an effect on the test scores. ...
... The studies aimed to obtain normative data of fluency tests in languages such as Chinese, Hebrew, and Dutch in children, adolescents, and adults (Chan and Poon, 1999;Kavé et al., 2008;Lee et al., 2002;Van der Elst et al., 2011) reported a significant correlation between age and outcome measures. In addition, education (Lee et al., 2002) and parents' education ( Van der Elst et al., 2011) showed an effect on the test scores. ...
Article
Full-text available
Background The category Fluency Test (CFT) is included in the MATRICS consensus cognitive battery for schizophrenia (MCCB). This test has several components that allow the exploration of other cognitive functions and could be related to different symptom profiles. Methods a sample of 98 Mexican adolescents was evaluated with the CFT and the following components were analyzed: total words, errors, clusters, switching, related words, and categories. Demographic and clinical differences among them and correlations with other MCCB tests and with Positive and Negative Syndrome Scale (PANSS) factors were explored. Results The CFT components showed significant correlations with each other and with other MCCB tests, particularly with those related to verbal learning, working memory, and speed of processing. In addition, they showed moderate correlations with the total PANSS score and with the negative, positive, and cognitive PANSS factors. Conclusions The analysis of CFT components allows clinicians and investigators to obtain information regarding other cognitive functions and symptom profiles in adolescents with schizophrenia.
... To classify individuals' performance on relevant clinical or educational measures, such as neuropsychological tests or language tests, psychologists and educators need reference values or norms. Three examples of studies that provide normative data are Van der Elst et al. (2006), who derived norms for the three subtasks of the Stroop Color-Word Test, which is a general measure of cognitive flexibility and control, Van der Elst et al. (2011), who normed the Animal Verbal Fluency and the Design Fluency tests in school-aged children, which are aimed to assess vocabulary size and motor planning, respectively, and Dujardin et al. (2021), who provided reference values for several tests to assess vocabulary, reading, and spelling skills in university students. ...
Article
Full-text available
Normative studies are needed to obtain norms for comparing individuals with the reference population on relevant clinical or educational measures. Norms can be obtained in an efficient way by regressing the test score on relevant predictors, such as age and sex. When several measures are normed with the same sample, a multivariate regression-based approach must be adopted for at least two reasons: (1) to take into account the correlations between the measures of the same subject, in order to test certain scientific hypotheses and to reduce misclassification of subjects in clinical practice, and (2) to reduce the number of significance tests involved in selecting predictors for the purpose of norming, thus preventing the inflation of the type I error rate. A new multivariate regression-based approach is proposed that combines all measures for an individual through the Mahalanobis distance, thus providing an indicator of the individual’s overall performance. Furthermore, optimal designs for the normative study are derived under five multivariate polynomial regression models, assuming multivariate normality and homoscedasticity of the residuals, and efficient robust designs are presented in case of uncertainty about the correct model for the analysis of the normative sample. Sample size calculation formulas are provided for the new Mahalanobis distance-based approach. The results are illustrated with data from the Maastricht Aging Study (MAAS).
Article
The trailmaking test (TMT) has an educational bias that makes it inapplicable to people with low levels of education due to its dependence on the alphabet. The TMT standardization is the only one available in Chile, and there is a need for alternative ways of using the TMT that do not depend on the level of education for its applicability. To determine the normative scores of the TMT – black & white (TMT B&W), considering sociodemographic factors in adult and elderly Chilean population. A total sample of 227 participants (133 healthy, 94 cognitively impaired) from the Ageing Mets cohort were recruited from three areas in Chile (Antofagasta, Santiago, and Puerto Montt). The TMT B&W was administered to all participants. A multiple regression model was used to generate normative data only in the cognitively healthy group, considering the effect of age, education and sex. A significant effect of age was found in the score of the TMT B&W Parts A and B. The level of education influenced the part B of the test; however, the completion rate of the TMT B&W parts A and B was over 90% in cognitively healthy people. Norms for the number of errors were obtained, and differences between groups were found after controlling for the effect of age and education. This study is the first to provide normative data for the Chilean version of the TMT B&W and will benefit clinical neuropsychologists by improving the procedures for more accurately assessing executive functions and its impairments.
Article
Full-text available
Introduction : Cognition often remains unassessed in primary care. To improve early diagnosis of neurocognitive disorder (NCD) in Switzerland, the tablet‐based UCSF brain health assessment (BHA) and brain health survey (BHS) were validated. Methods : The German BHA, BHS, and Montreal Cognitive Assessment (MoCA) were administered to 67 patients with mild/major NCD and 50 controls. BHA includes subtests of memory, executive, visuospatial, and language functioning, and informant‐based BHS asks about behavior and motor functioning. Results : The complete instrument (BHA + BHS) was most accurate at detecting mild NCD (AUC = 0.95) and NCD without amyloid pathology (AUC = 0.96), followed by the BHA. All measures were accurate (all AUCs > 0.95) at distinguishing major NCD and NCD with amyloid pathology (Alzheimer's disease [AD]) from controls. Discussion : The German BHA and BHS are more sensitive to mild NCD and non‐AD presentations than the MoCA and thus have a high potential to identify patients with NCD in primary care earlier than currently used screens.
Article
Full-text available
Objective To quantify the evolution, impact, and importance of normative data (ND) calculation by identifying trends in the research literature and what approaches need improvement. Methods A PRISMA-guideline systematic review was performed on literature from 2000 to 2022 in PubMed, Pub-Psych, and Web of Science. Inclusion criteria included scientific articles about ND in neuropsychological tests with clear data analysis, published in any country, and written in English or Spanish. Cross-sectional and longitudinal studies were included. Bibliometric analysis was used to examine the growth, productivity, journal dispersion, and impact of the topic. VOSViewer compared keyword co-occurrence networks between 1952–1999 and 2000–2022. Results Four hundred twelve articles met inclusion and exclusion criteria. The most studied predictors were age, education, and sex. There were a greater number of studies/projects focusing on adults than children. The Verbal Fluency Test (12.7%) was the most studied test, and the most frequently used variable selection strategy was linear regression (49.5%). Regression-based approaches were widely used, whereas the traditional approach was still used. ND were presented mostly in percentiles (44.2%). Bibliometrics showed exponential growth in publications. Three journals (2.41%) were in the Core Zone. VOSViewer results showed small nodes, long distances, and four ND-related topics from 1952 to 1999, and there were larger nodes with short connections from 2000 to 2022, indicating topic spread. Conclusions Future studies should be conducted on children’s ND, and alternative statistical methods should be used over the widely used regression approaches to address limitations and support growth of the field.
Article
Full-text available
Much attention in the field of clinical neuropsychology has focused on adapting to the modern healthcare environment by advancing telehealth and promoting technological innovation in assessment. Perhaps as important (but less discussed) are advances in the development and interpretation of normative neuropsychological test data. These techniques can yield improvement in diagnostic decision-making and treatment planning with little additional cost. Brooks and colleagues (Can Psychol 50: 196–209, 2009) eloquently summarized best practices in normative data creation and interpretation, providing a practical overview of norm development, measurement error, the base rates of low scores, and methods for assessing change. Since the publication of this seminal work, there have been several important advances in research on development and interpretation of normative neuropsychological test data, which may be less familiar to the practicing clinician. Specifically, we provide a review of the literature on regression-based normed scores, item response theory, multivariate base rates, summary/factor scores, cognitive intraindividual variability, and measuring change over time. For each topic, we include (1) an overview of the method, (2) a rapid review of the recent literature, (3) a relevant case example, and (4) a discussion of limitations and controversies. Our goal was to provide a primer for use of normative neuropsychological test data in neuropsychological practice.
Article
Children can anticipate upcoming input in sentences with semantically constraining verbs. In the visual world, the sentence context is used to anticipatorily fixate the only object matching potential sentence continuations. Adults can process even multiple visual objects in parallel when predicting language. This study examined whether young children can also maintain multiple prediction options in parallel during language processing. In addition, we aimed at replicating the finding that children's receptive vocabulary size modulates their prediction. German children (5-6 years, n = 26) and adults (19-40 years, n = 37) listened to 32 subject-verb-object sentences with semantically constraining verbs (e.g., "The father eats the waffle") while looking at visual scenes of four objects. The number of objects being consistent with the verb constraints (e.g., being edible) varied among 0, 1, 3, and 4. A linear mixed effects model on the proportion of target fixations with the effect coded factors condition (i.e., the number of consistent objects), time window, and age group revealed that upon hearing the verb, children and adults anticipatorily fixated the single visual object, or even multiple visual objects, being consistent with the verb constraints, whereas inconsistent objects were fixated less. This provides first evidence that, comparable to adults, young children maintain multiple prediction options in parallel. Moreover, children with larger receptive vocabulary sizes (Peabody Picture Vocabulary Test) anticipatorily fixated potential targets more often than those with smaller ones, showing that verbal abilities affect children's prediction in the complex visual world.
Article
Full-text available
In this paper some methodological problems related to the use of normative data in neuropsychological assessment are analysed. The difference between the concepts of normality and mastery, and the problems encountered with the adjustment of raw scores for age and education are considered. Other arguments discussed and exemplified with real data are the use of inner and outer nonparametric tolerance limits, confidence limits for individual scores, multivariate norms, and discriminant analysis.
Book
The book is a collection of normative data for commonly administered neuropsychological instruments. Included are summary tables of normative data from numerous studies and meta-analytic tables for some tests, detailed critique of the normative studies for each instrument, brief information regarding the tests themselves (history, cognitive constructs, and different test versions), as well as statistical and methodological issues relevant to interpretation of neuropsychological test data. This book represents an updated and expanded edition of the original published in 1999.
Article
Previous research studies have shown that in adults, verbal fluency is impaired after lesion to the frontal lobes and left temporal lobe. More recently, there have been a few studies reported which indicated that in children, like adults, left hemisphere and frontal lesions result in pronounced effects on verbal fluency. The present study examined developmental differences in verbal fluency within a sample of 130 normal children, aged 6 to 12 years. Additionally, the same verbal fluency test was administered to two subgroups of children with developmental dyslexia and a group of children with attention-deficit/hyperactivity disorder (ADHD). Analysis of variance (ANOVA) revealed significant between-group differences by age in the normal children. Further, ANOVA demonstrated that the verbal fluency measure was clinically useful in differentiating the Language Disorder/Dysphonetic Dyslexic subgroup from the Visual-Spatial/Dyseidetic Dyslexic subgroup and the ADHD group, with the latter two groups performing within the average range