A preview of this full-text is provided by American Psychological Association.
Content available from Psychological Methods
This content is subject to copyright. Terms and conditions apply.
Sample Size Calculation and Optimal Design for Regression-Based
Norming of Tests and Questionnaires
Francesco Innocenti
1
, Frans E. S. Tan
1
, Math J. J. M. Candel
1
, and Gerard J. P. van Breukelen
1, 2
1
Department of Methodology and Statistics, Care and Public Health Research Institute (CAPHRI), Maastricht University
2
Department of Methodology and Statistics, Graduate School of Psychology and Neuroscience, Maastricht University
Abstract
To prevent mistakes in psychological assessment, the precision of test norms is important. This can be
achieved by drawing a large normative sample and using regression-based norming. Based on that norming
method, a procedure for sample size planning to make inference on Z-scores and percentile rank scores is
proposed. Sampling variance formulas for these norm statistics are derived and used to obtain the optimal
design, that is, the optimal predictor distribution, for the normative sample, thereby maximizing precision of
estimation. This is done under five regression models with a quantitative and a categorical predictor, differ-
ing in whether they allow for interaction and nonlinearity. Efficient robust designs are given in case of uncer-
tainty about the regression model. Furthermore, formulasare provided to compute the normative sample size
such that individuals’positions relative to the derived norms can be assessed with prespecified power and
precision.
Translational Abstract
Normative studies are needed to derive reference values (or norms) for tests and questionnaires, so that psy-
chologists can use them to assess individuals. Specifically, norms allow psychologists to interpret individu-
als’score on a test by comparing it with the scores of their peers (e.g., individuals with the same sex,age, and
educational level) in the reference population. Because norms are also used to make decisions on individuals,
such as the assignment to clinical treatment or remedial teaching, it is important that norms are precise (i.e.,
not strongly affected by sampling error in the sample on which the norms are based). This article shows how
this goal can be attained in three steps. First, norms are derived using the regression-based approach, which
is more efficient than the traditional approach of splitting the sample into subgroups based on demographic
factors and deriving norms per subgroup. Specifically, the regression-based approach allows researchers to
identify the predictors (e.g., demographic factors) that affect the test score of interest, and to use the whole
sample to derive norms. Second, the design of the normative study (e.g., which age groups to include) is cho-
sen such that the precision of the norms is maximized for a given total sample size for norming. Third, this
total sample size is computed such that a prespecified power and precision are obtained.
Keywords: normative data, optimal design, percentile rank score, sample size calculation, Z-score
Supplemental materials: https://doi.org/10.1037/met0000394.supp
Normative studies provide reference values, also known as
norms, that psychologists can use to compare individuals with the
reference population, for instance, to make decisions about clinical
treatments, school admission or remedial teaching, or selection of
candidates for job vacancies. Examples of normative studies are
Goretti et al. (2014) and Parmenter et al. (2010), who have derived
reference values for two batteries of neuropsychological tests to
assess cognitive function in patients with multiple sclerosis, and
Van der Elst et al. (2006), who have normed the Dutch version of
three verbal fluency tests. Normative studies are of practical im-
portance because they allow psychologists to interpret scores on
the outcome variable of interest by comparing an individual’s test
score with the scores of his or her peers (e.g., individuals of the
same age, sex, and educational level) in the reference population.
For instance, knowing that a highly educated 75-year-old woman
scored 11.5 on the profession naming verbal fluency test is in itself
not informative on whether this score is within the normal range
or exceptional. According to the normative data provided by Van
der Elst et al., (2006, Table A.2), only 10% of her peers (i.e.,
women of the same age and educational level) have a test score equal
Francesco Innocenti https://orcid.org/0000-0001-6113-8992
Math J. J. M. Candel https://orcid.org/0000-0002-2229-1131
Gerard J. P. van Breukelen https://orcid.org/0000-0003-0949-0272
We have no conflict of interest to disclose. A summary of this study was
presented at the 41st Annual Conference of the International Society for
Clinical Biostatistics.
Correspondence concerning this article should be addressed to Francesco
Innocenti, Department of Methodology and Statistics, Care and Public
Health Research Institute (CAPHRI), Maastricht University, P.O. Box 616,
6200 MD, Maastricht, the Netherlands. Email: francesco.innocenti@
maastrichtuniversity.nl
Psychological Methods
©2021 American Psychological Association
ISSN: 1082-989X https://doi.org/10.1037/met0000394
This document is copyrighted by the American Psychological Association or one of its allied publishers.
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
2023, Vol. 28, No. 1, 89–106
This article was published Online First August 12, 2021.
89