ArticlePDF AvailableLiterature Review

Multidimensional screening for predicting pain problems in adults: A systematic review of screening tools and validation studies

September 2019
PAIN Reports 4(5):1

September 2019
4(5):1

DOI:10.1097/PR9.0000000000000775

License
CC BY 4.0

Authors:

Elke Veirman

Ghent University

Dimitri M L Van Ryckeghem

Maastricht University

Annick De Paepe

Ghent University

Olivia Kirtley

KU Leuven

Show all 5 authorsHide

Screening tools allowing to predict poor pain outcomes are widely used. Often these screening tools contain psychosocial risk factors. This review (1) identifies multidimensional screening tools that include psychosocial risk factors for the development or maintenance of pain, pain-related distress, and pain-related disability across pain problems in adults, (2) evaluates the quality of the validation studies using Prediction model Risk Of Bias ASsessment Tool (PROBAST), and (3) synthesizes methodological concerns. We identified 32 articles, across 42 study samples, validating 7 screening tools. All tools were developed in the context of musculoskeletal pain, most often back pain, and aimed to predict the maintenance of pain or pain-related disability, not pain-related distress. Although more recent studies design, conduct, analyze, and report according to best practices in prognosis research, risk of bias was most often moderate. Common methodological concerns were identified, related to participant selection (eg, mixed populations), predictors (eg, predictors were administered differently to predictors in the development study), outcomes (eg, overlap between predictors and outcomes), sample size and participant flow (eg, unknown or inappropriate handling of missing data), and analysis (eg, wide variety of performance measures). Recommendations for future research are provided.

Flow of studies through the review.

…

Summary of included screening tools.

…

Key study and participant characteristics of included validation studies.

…

Methodological quality of included validation studies.

…

Figures - uploaded by Dimitri M L Van Ryckeghem

Content may be subject to copyright.

Content uploaded by Dimitri M L Van Ryckeghem

Content may be subject to copyright.

Downloaded from https://journals.lww.com/painrpts by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsIHo4XMi0hCywCX1AWnYQp/IlQrHD3CO9NTn7Xh2sPM2xidnPh2psfMNhtw0Zvxv6hsOI3res= on 09/15/2019

Downloadedfromhttps://journals.lww.com/painrpts by BhDMf5ePHKav1zEoum1tQfN4a+kJLhEZgbsIHo4XMi0hCywCX1AWnYQp/IlQrHD3CO9NTn7Xh2sPM2xidnPh2psfMNhtw0Zvxv6hsOI3res= on 09/15/2019

General section

Review

Multidimensional screening for predicting pain

problems in adults: a systematic review of

screening tools and validation studies

Elke Veirman

*, Dimitri M. L. Van Ryckeghem

a,b,c

, Annick De Paepe

, Olivia J. Kirtley

, Geert Crombez

Abstract

Screening tools allowing to predict poor pain outcomes are widely used. Often these screening tools contain psychosocial risk

factors. This review (1) identifies multidimensional screening tools that include psychosocial risk factors for the development or

maintenance of pain, pain-related distress, and pain-related disability across pain problems in adults, (2) evaluates the quality ofthe

validation studies using Prediction model Risk Of Bias ASsessment Tool (PROBAST), and (3) synthesizes methodological concerns.

We identified 32 articles, across 42 study samples, validating 7 screening tools. All tools were developed in the context of

musculoskeletal pain, most often back pain, and aimed to predict the maintenance of pain or pain-related disability, not pain-related

distress. Although more recent studies design, conduct, analyze, and report according to best practices in prognosis research, risk

of bias was most often moderate. Common methodological concerns were identified, related to participant selection (eg, mixed

populations), predictors (eg, predictors were administered differently to predictors in the development study), outcomes (eg, overlap

between predictors and outcomes), sample size and participant flow (eg, unknown or inappropriate handling of missing data), and

analysis (eg, wide variety of performance measures). Recommendations for future research are provided.

Keywords: Multidimensional screening, Yellow flags, Pain, Risk of bias

1. Introduction

Chronic pain is a common experience, with a prevalence of between

10% and 20% in the general adult population.

6,7,34,95,114

Often,

chronic pain is disabling and notoriously difficult to treat.

At least 2

strategies are possible to face these challenges. First, we can

develop new and better medical and psychosocial interventions.

Second, we can prevent acute pain from becoming chronic. The

latter requires an understanding of how and why acute pain

becomes chronic, the identification of individuals at risk, and the

timely delivery of preventive actions.

67,126

Evidence has been accumulating that psychosocial variables

are important in the prediction and prevention of chronic pain.

First, available experimental and prospective research reveals the

role of psychosocial factors in explaining pain, distress, and

disability.

The roles of learning, emotions, and cognitive factors

are well established in laboratory studies,

123

and a number of

prospective studies have provided evidence for the role of

psychosocial factors in the development and maintenance of

pain.

3,60,102

For example, Sobol-Kwapinska et al.

106

reviewed

predictors of acute postsurgical pain and found pain catastroph-

izing, optimism, expectation of pain, neuroticism, anxiety (state

and trait), negative affect, and depression to be associated with

acute postsurgical pain. Second, contemporary theoretical

models have provided insight into how acute pain patients with

a particular psychosocial profile may become stuck in a vicious

cascade of further pain, distress, and disability.

13,122

Third,

evidence is increasing that the timely delivery of cognitive-

behavioral interventions can prevent persistent disability.

Taking this evidence into account, Kendall et al.

called for the

routine assessment of psychosocial factors in people experienc-

ing acute pain. They introduced the concept of “yellow flags” as

a method to screen for psychosocial risk factors predicting long-

term disability, a concept that has been adopted by a growing

number of researchers interested in examining the value of

prognostic models.

26,27

This has led to the development of

screening tools that include various psychosocial risk factors and

Sponsorships or competing interests that may be relevant to content are disclosed

at the end of this article.

Department of Experimental Clinical and Health Psychology, Faculty of Psychol-

ogy and Educational Sciences, Ghent University, Ghent, Belgium,

Institute for

Health and Behaviour, INSIDE, Faculty of Language and Literature, Humanities, Arts

and Education, University of Luxembourg, Esch-sur-Alzette, Luxembourg,

Section

Experimental Health Psychology, Clinical Psychological Science, Departments,

Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, the

Netherlands,

Center for Contextual Psychiatry, Department of Neurosciences, KU

Leuven, Leuven, Belgium

*Corresponding author. Address: Department of Experimental Clinical and Health

Psychology, Faculty of Psychology and Educational Sciences, Ghent University, Henri

Dunantlaan 2, B-9000 Gent, Belgium. Tel.: 132 92646392; fax: 132 9 264 64 89.

E-mail address: Elke.Veirman@UGent.be (E. Veirman).

of The International Association for the Study of Pain. This is an open access article

distributed under the Creative Commons Attribution License 4.0 (CCBY), which

permits unrestricted use, distribution, and reproduction in any medium, providedthe

original work is properly cited.

PR9 4 (2019) e775

http://dx.doi.org/10.1097/PR9.0000000000000775

4 (2019) e775 www.painreportsonline.com 1

a recommendation for their use in clinical practice (eg, Keele

STarT Back Screening Tool [STarT Back]

; Preventing the

Inception of Chronic Pain [PICKUP]

115

Several reviews have summarized the predictive performance

of screening tools.

30,42,49,68,99

For instance, in a meta-analysis of

screening tools, Karan et al.

showed that screening tools poorly

predicted pain, but were acceptable and excellent in predicting

disability, and absenteeism, respectively (eg, STarT Back,

OMPSQ). This meta-analysis is of high quality and according to

the highest standards in the field.

71,109

For that reason, our aim

was not to focus upon the actual performance of the screening

tools. Available meta-analyses

49,99

have also noted that the

methodological quality of studies investigating the predictive

performance of screening tools is variable. Nevertheless, these

reviews do not provide details of the methodological problems

and limitations.

For that reason, our review focuses upon the methodological

quality of studies that validate screening tools. First, a detailed

analysis and synthesis of the methodological quality of studies is

largely missing. Indeed, despite being considered fundamental to

guide interpretation of findings, and recommendations for future

research and practice,

available reviews spend little or no

attention to this topic. Second, the methodological quality of the

studies in these reviews, typically described as “risk of bias,”

was assessed using instruments that were not specifically

designed for evaluating the quality of prediction models (eg,

Quality in Prognostic Studies tool [QUIPS]).

36,37

Recently, “Pre-

diction model Risk Of Bias ASsessment Tool” (PROBAST), a tool

for assessing the risk of bias and applicability of diagnostic and

prognostic prediction model studies, has become available and

used.

76,127–129

The aim of this systematic review was 3-fold: (a) to identify

available multidimensional screening tools that include psycho-

social risk factors for poor pain outcomes (development or

maintenance of pain, pain-related distress, and pain-related

disability) across pain problems in adults, (b) to evaluate the

quality of prospective studies validating these screening tools

with up-to-date standards for clinical prediction models, and (c)

to synthesize methodological concerns that may bias the

predictive performance of these screening tools.

2. Methods

2.1. Literature search and eligibility criteria

The literature search comprised 4 steps. First, a search was

performed for studies published in peer-reviewed journals

across relevant electronic databases (MEDLINE, PsychINFO,

and Web of Science) using the following terms in the title, key

words, or abstract: screen*AND(tool OR questionnaire)AND

pain AND risk. Screening of titles, key words, and abstracts

allowed identification of screening tools and eligible studies.

Second, a list of publications was sent to lead authors in the

field of pain research to ask for any other available screening

tools of which they were aware. Third, the reference lists of

relevant systematic reviews were hand-searched for any

articles that were not yielded by our other search methods.

Finally, when only the development article for a tool fulfilling the

inclusion criteria (see below) was identified in the search,

a search was performed for additional articles that fulfilled the

inclusion criteria by screening all publications that cited this

development article.

The following eligibility criteria were used to identify screening

tools for inclusion in this systematic review:

(1) The screening tool is a self-report questionnaire.

(2) The screening tool is multidimensional, containing at least 2

psychosocial risk factors. The report of somatic experiences

such as pain, radiation, or other somatic complaints is not

considered as psychosocial factors.

(3) The screening tool aims to predict the development (,3

months) or maintenance ($3 months) of pain, pain-related

distress, or pain-related disability.

(4) The screening tool is specifically developed in the context of

pain and can target any type of pain (eg, neck pain and low

back pain).

(5) The screening tool is a standalone instrument. Therefore, the

tool should not consist of a battery of questionnaires, as is

often the case for research purposes.

(6) The screening tool is validated in at least 1 independent study,

ie, using data that were not used to develop the screening tool.

Six criteria (listed below) were used to select studies for

inclusion. Some criteria were included to set a minimum quality

(eg, criterion 1), whereas other criteria were applied to narrow the

scope of the review (eg, criterion 2).

(1) The study is a full report published in a peer-reviewed scientific

journal.

(2) The study includes an adult sample (the average age of the

sample was older than 18 years).

(3) At baseline, the study includes patients experiencing no or

(sub)acute pain (,3 months), without restriction in the type of

pain experienced (eg, musculoskeletal pain, neuropathic pain,

and postoperative pain). In line with the development studies

of screening tools, we excluded studies involving only patients

with chronic pain ($3 months). Studies involving mixed

samples with (sub)acute and chronic pain patients were

included. However, when data for separate subsamples were

reported, we only included the samples of interest.

(4) The study includes at least 1 screening tool, which is used in its

original form. Some differences in translations, item order, and

response scale are accepted. Shortened versions are

considered different instruments.

(5) The study includes at least one of the following outcomes

during outcome assessment (,2 years after baseline

assessment): (a) Pain intensity or pain bothersomeness,

assessed using a Visual Analogue Scale (VAS), a Numeric

Rating Scale (NRS), a verbal rating scale, or a Likert scale; (b)

pain-related disability including activity limitations (ie, diffi-

culties in executing a task or an action such as the ability to

walk, eat, shower, or dress) and participation restrictions (ie,

problems relating to the involvement in life situations such as

sick leave or days absent from work or return to work status)

according to the International Classification of Functioning,

Disability, and Health (ICF) framework.

130

Assessment of

these outcomes could be performed with (a subset of

questions from) a self-report questionnaire, single questions,

or data from existing registration systems; and (c) pain-

related distress (eg, anxiety, fear, or low mood), assessed

through self-report measures.

(6) The study is a prospective cohort study including patients

presenting in primary, secondary, and tertiary health care

settings.

Finally, studies were considered ineligible if they aimed to

investigate the impact of stratified care (ie, targeted treatment to

patient subgroups based on the results of the screening tool) or

interventions that specifically targeted psychosocial risk factors

(ie, cognitive behavioral therapy) or they consisted of a random-

ized control trial. We reasoned that the focus of these studies is

on the evaluation of a (psychological) therapeutic intervention

2E. Veirman et al.·4 (2019) e775 PAIN Reports

and not on the investigation of the predictive value of screening

tools.

2.2. Data extraction and risk of bias assessment

The assessment of the quality of studies that validated the

selected screening tools was based upon a prepublication

version of the Prediction model study Risk Of Bias ASsessment

Tool (PROBAST) (personal communication, January 2017, Dr.

Robert Wolff). The PROBAST has been developed by the

Cochrane Prognosis Methods Group using a Delphi process, in

which 40 experts in the fields of prediction research and

systematic review methodology participated.

129

Its use is

recommended by most recent guidelines for performing system-

atic reviews and meta-analyses of prediction model

performance.

Data extraction of eligible validation studies was conducted by

E.V. and O.K. following a customized PROBAST template that

was created for each of the 5 risk of bias assessment areas: (1)

participant selection, (2) predictors, (3) outcomes, (4) sample size

and participant flow, and (5) analysis (details can be retrieved from

the authors upon request).

Extracted data formed the basis for

the risk of bias assessment, where signaling questions across

those 5 important areas were rated as yes,probably yes,

probably no,no,orno information, with yes indicating the

absence of bias and probably no or no indicating the potential for

bias.

For participant selection, elements judged were whether

appropriate inclusion and exclusion criteria were used and

whether patients had a similar state of health at enrollment. For

predictors, questions considered were whether definition and

assessment of predictors were similar across participants, and

whether definition and assessment of predictors were similar

compared with those of the development model. For outcomes,

important elements judged were whether a valid outcome was

used, whether predictors were excluded from the outcome

definition, whether definition and assessment of outcomes were

similar across participants, whether definition and assessment

of outcomes were similar compared with those of the de-

velopment model, and whether outcome assessment was

blinded to predictor data. For sample size and participants

flow,elementsjudgedwerewhetherareasonablenumberof

outcome events were available, whether the time interval

between predictor and outcome assessment was appropriate,

whether all enrolled participants were included in the analyses,

and whether missing data occurred and participants with

missing data were handled appropriately. Finally, for analysis,

evaluated elements focused on whether relevant model

performance measures were evaluated. Domains were sub-

sequently rated as high,moderate,low,orunclear risk of bias.

Risk of bias assessment labels were discussed and assigned

upon agreement among team members (G.C., D.V.R.,

and E.V.).

3. Results

3.1. Study selection

The study selection process was guided by the Preferred

Reporting Items for Systematic Reviews and Meta-Analyses

guidelines (PRISMA),

except for a preregistration of the review.

Electronic databases were searched from the earliest record

available on September 15, 2016, resulting in 1850 records. After

removal of duplicate articles, 2 reviewers (J.C. and E.V.)

independently screened a selection of the titles, key words, and

abstracts for possible study inclusion. First screening resulted in

187 remaining references.

In the second step, full copies of articles were obtained (E.V.).

Full-text reading of these articles resulted in exclusion of several

tools for the following reasons (1) not being a screening tool (eg,

“Amsterdam Preoperative Anxiety and Information Scale”),

(2)

the screening tool was not developed in the context of pain (eg,

“Distress and Risk Assessment Method”),

(3) the screening tool

did not assess any psychosocial factors (eg, “London Fibromyal-

gia Epidemiology Study Screening Questionnaire”),

and (4) the

screening tool assessed only 1 psychosocial factor (eg, “Fear

Avoidance Beliefs Questionnaire”).

For 3 potentially eligible screening tools, items were not

available in the literature and author contact yielded insufficient

access to the tools’ items (“Nijmegen Outcome of Lumbar Disc

surgery Screening-instrument”

; “ABLE Presurgical Assess-

ment Tool”

; and “Psychosocial Risk for Occupational Disability

Scale”

100

Finally, a number of eligible screening tools for which items

were available in the literature were not included in the current

review as no independent validation studies were retrieved from

the electronic database search nor through cited reference

search of the development articles of the screening tools (ie,

“Absenteeism Screening Questionnaire”

116

; “Back Disability Risk

Questionnaire”

103,104

; “Optimal Screening for Prediction of Re-

ferral and Outcome cohort yellow flag assessment tool”

; “Pain

Recovery Inventory of Concerns and Expectations”

105

;

“Screening-Instrument zur Feststellung des Bedarfs an

medizinisch-beruflich orientierter Rehabilitation”

112

; “Traumatic

Injuries Distress Scale”

125

; and “Work and Health

Questionnaire”

In addition to the 27 articles that were considered eligible from

the electronic database search, 2 articles

25,50

were identified

through cited reference search of the development articles of the

screening tools on May 4, 2017, and 3 references

32,53,64

were

retrieved by hand-searching of relevant review

articles

8,30,42,45,49,59,68,86,88,90,99,107

(O.K.), resulting in a total of

32 references fulfilling the inclusion criteria for the current review.

Additional author contact yielded no other tools or studies (see

Figure 1 for a flowchart).

Doubts and disagreements on the inclusion of screening tools

and eligible studies were resolved by discussion within the team

(G.C., D.V.R., E.V., A.D.P., and O.K.) until consensus was

reached. After finalizing the systematic search, all screening tools

and development studies were retrieved to extract essential data

for the risk of bias assessment. During the screening process,

reviewers were not blind to authorship, institution, journal, or

results.

3.2. Study characteristics: screening tools

The 32 included articles contained 42 study samples. Notably,

several articles reported on a similar sample as earlier published

articles, whereas other study samples completed multiple

screening tools. The articles reported on the validation of 7

screening tools:

(1) Acute Low Back Pain Screening Questionnaire (ALBPSQ; 7

studies)

/¨

Orebro Musculoskeletal Pain Screening Question-

naire (OMPSQ; 10 studies)

/¨

Orebro Musculoskeletal Screen-

ing Questionnaire (OMSQ; 3 studies).

The ALBSQ is a 24-

item self-report questionnaire aiming to predict poor progno-

sis—operationalized as accumulated sick leave—in acute and

subacute patients presenting with musculoskeletal pain

4 (2019) e775 www.painreportsonline.com 3

(back, neck, and shoulder pain). A few years after its

development, it was relabeled as the OMPSQ, including an

additional unscored item on employment status. More

recently, the OMSQ broadened the focus of the ALBPSQ to

general musculoskeletal problems and simplified the

questions.

(2) ¨

Orebro Musculoskeletal Pain Screening Questionnaire

short version (OMPSQs; 2 studies).

The OMPSQs is

a 10-item self-report questionnaire designed to predict

disability—operationalized as sick leave—in workers suf-

fering from musculoskeletal pain (back pain).

(3) ¨

Orebro Musculoskeletal Screening Questionnaire short version

(OMSQs; 1 study).

TheOMSQsisa12-itemself-report

questionnaire aiming to predict a wide variety of outcomes—including

problem severity, functional impairment, absenteeism, long-term

absenteeism, cost, and recovery time—in acute and subacute work-

injured patients presenting with musculoskeletal pain (whiplash, low

back pain).

(4) Heidelberger Kurzfragebogen R ¨uckenschmerz (HKF-R10; 1

study).

The HKF-R10 is a 27-item self-report questionnaire

developed to predict the likelihood of chronicity in patients with

acute low back pain.

(5) Pain Belief Screening Instrument (PBSI; 1 study).

The PBSI is

a 7-item self-report questionnaire aiming to predict disability in

subacute and chronic pain patients with musculoskeletal pain

(neck, shoulder, and low back pain).

(6) Keele STarT Back Screening Tool (SBT; 11 studies).

The

SBT is a 9-item self-report questionnaire developed to predict

poor outcome—operationalized as disability—in (sub)acute

and chronic primary care patients with nonspecific low back

pain.

(7) Preventing the Inception of Chronic Pain (PICKUP; 2

studies).

115

The PICKUP is a 5-item self-report questionnaire

aiming to predict the risk of chronic low back pain in patients

with acute low back pain.

An overview of the included instruments and more detailed

characteristics (as described in the base article) can be found in

Table 1.

3.3. Study characteristics: sources and samples

Studies were conducted between 2000 and 2017.

43,50

The majority

of the studies included samples that were collected in Northern

European countries (N 511) or Western European countries (N 5

11). A small number of studies collected data from samples outside

Europe, including Canada (N 51), the United States (N 53),

Australia and New Zealand (N 57), and China (N 51).

Sex and age of participants differed largely between study

samples. The average/median age of participants ranged

between 37.7 years and 53.0 years.

21,64

The sex of participants

varied from 33.7% female participants to 83.0% female

participants.

18,63

Figure 1. Flow of studies through the review.

4E. Veirman et al.·4 (2019) e775 PAIN Reports

Study samples were collected in primary care (83.3%) and

secondary care settings (11.9%), and 1 study included a com-

bined sample of participants from primary and secondary care

units (4.8%).

The terminology used to describe the settings

varied, by reference to providers (eg, general practitioner or

a physical therapist)

and/or type of services (eg, spinal

Table 1

Summary of included screening tools.

Screening tool Development study Summary of instrument Scoring method Cutoff scores/subgrouping,

follow-up

Acute Low Back Pain Screening

Questionnaire (ALBPSQ), later

renamed as ¨

Orebro

Musculoskeletal Pain Screening

Questionnaire (OMPSQ), and

reframed as ¨

Orebro

Musculoskeletal Screening

Questionnaire (OMSQ)

Linton and Hallden,

Sweden

24 items

Risk assessment for poor

prognosis—operationalized as

accumulated sick leave.

In acute and subacute patients

presenting with musculoskeletal pain

(lower back, neck, and shoulder).

In primary care setting.

21 items are scored, covering pain

experience (5 items), physical

functioning (5 items), coping (1 item),

job satisfaction (1 item), anxiety/

stress (1 item), depression (1 item),

fear-avoidance beliefs (3 items),

recovery expectations (2 items),

heavy or monotonous work (1 item),

and sick leave (1 item).

Miscellaneous items relate to age,

sex, and nationality.

Cutoff score of 105.

6-month follow-up.

Orebro Musculoskeletal Pain

Screening Questionnaire short

(OMPSQs)

Linton et al.,

Sweden

10 items

Risk assessment for poor

prognosis—operationalized as sick

leave.

In workers suffering from

musculoskeletal pain (back).

In occupational health care setting.

10 items are scored, covering pain

experience (2 items), self-perceived

function (2 items), distress (2 items),

return to work expectancies

(2 items), and fear avoidance beliefs

(2 items).

Cutoff score of 50.

1-year follow-up.

Orebro Musculoskeletal

Screening Questionnaire short

(OMPQs)

Gabel et al.,

Australia

12 items

Risk assessment for poor

prognosis—operationalized as

problem severity, functional status,

absenteeism, long-term

absenteeism, recovery time, and

cost.

In acute and subacute workers

presenting with musculoskeletal pain

(whiplash and low back pain).

In primary care setting.

12 items are scored, covering pain/

problem experience (3 items),

physical function (2 items), life

satisfaction (1 item), depression

(1 item), anxiety (1 item), fear-

avoidance beliefs (2 items), recovery

expectations (1 item), and other

(1 item).

No optimal cutoff recommended.

6-month follow-up.

Heidelberger Short Early Risk

Assessment Questionnaire (HKF-

R 10)

Neubauer et al.,

Germany

27 items

Risk assessment for chronic low back

pain

In patients with acute low back pain.

In primary care setting.

26 items are scored, covering

sociodemographic information (2

items), pain intensity and duration (4

items), efficacy of massage (1 item),

depression (5 items), catastrophizing

(5 items), and helplessness and

hopelessness (9 items).

An additional item regarding pain

intensity in the past week is present

in the measure, but is not included

within the total score.

No optimal cutoff recommended.

6-month follow-up.

Pain Belief Screening Instrument

(PBSI)

Sandborgh et al.,

Sweden

7 items.

Risk assessment for disability.

In subacute and chronic pain patients

with musculoskeletal pain (neck,

shoulder, and low back)

In primary care setting.

7 items are scored, covering pain

intensity (1 item), functional ability (1

item), fear-avoidance (2 items),

catastrophizing (1 item), and self-

efficacy (2 items).

No optimal cutoff recommended.

No follow-up.

STarT Back Tool (SBT) Hill et al.,

United

Kingdom

9 items.

Risk assessment for pain-related

disability.

In (sub)acute and chronic patients

with nonspecific back pain.

In primary care setting.

9 items are scored, covering

bothersomeness of pain (1 item),

presence of referred pain (1 item),

comorbid pain (1 item), disability (2

items), catastrophizing (1 item), fear

(1 item), anxiety (1 item), and

depression (1 item).

Stratification of patients in low

(overall score 0–3), medium (overall

score .3; psychosocial subscale

score ,4), or high risk (psychosocial

subscale scores $4) categories of

poor clinical outcome, assisting in

decision-making about the specific

course treatment.

6-month follow-up.

Predicting the Inception of

Chronic Pain (PICKUP)

Traeger et al.,

115

Australia

5 items.

Risk assessment for chronic low back

pain.

In patients with acute low back pain.

In primary care setting.

5 items are scored online through

http://pickuptool.neura.edu.au/,

covering pain intensity (1 item), leg

pain (1 item), disability compensation

(1 item), depression (1 item), and

perceived risk (1 item).

Predicted probability risk score in

percentage.

3-month follow-up.

4 (2019) e775 www.painreportsonline.com 5

outpatient clinic).

Although some studies detailed the treatment

patients received (eg, work conditioning program),

others often did

not (eg, treated as usual).

If information about the use of treatments

is reported with insufficient detail, it can potentially bias performance

results of the included screening tools because it does not allow

researchers to evaluate the impact it might have had on the

results.

82,83

Moreover, within the studies that reported on the use of

treatments, none of the studies accounted for treatment use.

Most study samples comprised participants with musculoskeletal

pain. In particular, patients with back pain were overrepresented.

Study samples often also included participants from other populations,

such as those experiencing neck pain, pain between the shoulder

blades,

124

or multisite pain

24,25

(see Table 2 for an overview).

3.4. Risk of bias assessment of included studies

3.4.1. Participant selection

The majority of the study samples consisted of mixed samples

containing both acute and chronic pain patients (59.5%). The

remaining samples comprised patients with acute pain (33.3%) or

samples for which the type of pain (acute, subacute, or chronic

pain) was not clearly described (7.1%) (Table 2).

For the PROBAST “participant selection” domain, the majority of the

study samples were rated as having a moderate risk of bias (51.1%).

Fewer study samples were rated as having low (16.7%) or high risk of

bias (19.0%). For the remaining study samples, the risk of bias was

rated as unclear (Table 3) because the presented information was

insufficient to evaluate the appropriateness of the inclusion criteria or

the state of health of participants. The reasons for increasing the risk of

bias related to the specified inclusion and exclusion criteria and

differences in the state of health of participants at enrollment.

3.4.1.1. Inclusion and exclusion criteria

The eligibility criteria were sometimes inappropriate or unclear.

For example, some studies did not exclude unemployed

participants

or did not report information on employment,

although the screening tools contained work-related questions.

Most studies reported inclusion and exclusion criteria. However,

sometimes the criteria had to be retrieved from descriptive

information

or from a previously published study.

3.4.1.2. Participants’ state of health at enrollment

Although most studies aimed to recruit a homogeneous sample,

other studies did not. Participants were found not to be in a similar

state of health at baseline in cases when studies included patients

with (sub)acute and chronic pain in a single

sample.

5,14,20–22,25,28,41,43,44,50,53,61,63,64,77,78,80,92,98

For in-

stance, despite George and Beneciuk

reported detailed

information about their patients with (sub)acute and chronic

pain, the analyses were based upon the full sample. For

a considerable number of studies, the state of health of the

participants had to be derived from descriptive information. For

example, Margison and French

only reported on average pain

duration in weeks. Sometimes, insufficient information was

available to conclude whether participants were in a similar state

of health at enrolment

(Table 2).

3.4.2. Predictors

The most frequently used screening tool was the ALBPSQ/

OMPSQ/OMSQ. However, we noted that the cutoff score to

identify the high risk group varied substantially, ranging between

72 and 147 (Table 4).

For the PROBAST “predictors” domain, the majority of the

study samples were rated as having a low risk of bias (40.5%).

Only a small number of study samples were rated as having

moderate (19.0%) or high risk of bias (11.9%). For 28.6% of the

study samples, the risk of bias was rated as unclear (Table 3)

because the presented information was insufficient to evaluate

whether differences occurred in the assessment of the

screening tools either across participants or compared with

the development study. The reasons for increasing the risk of

bias related to differences in the assessment of the screening

tools across participants and differences in the assessment of

screening tools compared with the development study.

3.4.2.1. Definition and assessment of predictors across

participants

Study samples that validated the ALBPSQ, the OMPSQ, and the

PICKUP—tools that include work-related questions—sometimes

did not report information on employment status. This could

mean that participants were all employed, or that some of the

participants were unemployed, but it was not reported.

Furthermore, those studies that did report on employment status

did not always administer these tools in a similar way across

participants. For example, Hurley et al. instructed participants to

fill out ALBPSQ work-related questions as best they could, even

when they were unemployed.

43,44

When these questions were

left blank, the mean score of the other questions was used as

replacement. In the study by Grotle et al.,

it is noted that for

participants who were unemployed, OMPSQ work-related

questions were replaced by the mean score of the other

questions.

3.4.2.2. Definition and assessment of predictors compared

with the developmental model

Furthermore, across the included studies, significant variation

was observed in the applied screening tool cutoff points used to

categorize patients. Selective reporting of results based only on

cutoff values other than those specified in the original de-

velopment study for the screening tool, was considered a risk for

underestimation or overestimation of the screening tool’s pre-

dictive accuracy. Moreover, variable use of cutoffs prohibits to

estimate the influence of a given setting on the performance at the

recommended (original) threshold. For example, for the ALBPSQ,

the standard cutoff originates from Linton and Halld ´en,

who

used 105 as their cutoff score for detecting poor prognosis in the

form of sick leave. Hurley et al.

and Vos et al.

124

only reported

results using a cutoff of 112 and 72, respectively, for the outcome

sick leave. In addition, few studies also treated screening tool

scores as continuous without additional reporting of the cutoff

values from the screening tool’s development study

(Table 4).

3.4.3. Outcomes

The majority of study samples assessed one or more outcomes

related to pain (66.7%), activity limitations (54.8%), and participation

restrictions (50.0%). In addition, about half of the study samples

reported also mixed or composite outcomes (40.5%) (Table 4).

For the PROBAST “outcomes” domain, the majority of the

study samples were assigned an unclear risk of bias (40.5%),

mainly due to insufficient information to evaluate blinding, or

a moderate risk of bias (42.9%). None of the study samples were

6E. Veirman et al.·4 (2019) e775 PAIN Reports

Table 2

Key study and participant characteristics of included validation studies.

Study Country; setting Age in years [SD; (IQ-)range] % female Pain type Pain duration Pain intensity

ALBPSQ

Hurley et al.

United Kingdom;

Primary care

Physiotherapy departments and

health centers

M543.19 [range: 17–77] 60 Low back pain ,12 weeks: N 564

.12 weeks: N 550

MPQ

ALBPSQ #112

Med

514.5 [IQ range 512.2;

range 51–54]

MPQ

ALBPSQ .112

Med

527.5 [IQ range 524.5;

range 50–70]

Hurley et al.

United Kingdom;

Primary care

Physiotherapy departments

Med

541.5 [range: 17–77] 60 Low back pain ,12 weeks: 56% MPQ:

Med

519.0 [IQ range: 20.0,

range: 0–70]

Grotle et al.

Norway;

Primary care

General practitioners, chiropractors,

and physical therapists (27%

recruited through advertisement)

M538.9 [SD 510.3] 57 Low back pain with or without

radiation

#3 days: N 541, 34%

4–12 days: N 543, 36%

13–20 days: N 536, 30%

Grotle et al.

Norway;

Primary care

General practitioners, chiropractors,

and physical therapists (27%

recruited through advertisement)

M538.0 [SD 510.1] 54 Low back pain with or without

radiation

M52.3 weeks [SD 52.2] ALBPSQ current pain:

M56.7 (SD 51.8)

ALBPSQ average pain:

M53.0 (SD 52.5)

Grotle et al

Norway;

Primary care

General practitioners, chiropractors,

and physical therapists (27%

recruited through advertisement)

M537.9 [SD 510.1] 55 Low back pain with or without

radiation

M58.1 days [SD 56.6] NRS pain intensity last week:

M56.7 [SD 51.8]

Heneweer et al.

The Netherlands;

Primary care

Physical therapists

Recovered*: M 540.8 [SD 59.2]

Not recovered*: M 543.1

[SD 59.1]

39 Nonspecific low back pain Recovered*:

,4 weeks: N 520, 64.5%

4–6 weeks: N 59, 29.0%

7–12 weeks: N 52, 6.5%

Not recovered*:

,4 weeks: N 59, 36.0%

4–6 weeks: N 56, 24.0%

7–12 weeks: N 510, 40.0%

Vos et al.

124

The Netherlands;

Primary care

General practitioners

Male/female: M 543.2/38.2 64 Neck pain M 52.76 weeks [SD 53.00] ALBPSQ current pain:

M56.5 [SD 51.75]

ALBPSQ average pain:

M53.78 [SD 52.76]

OMPSQ

Linton and Boersma

Sweden;

Primary care

General practitioners and physical

therapists

M541.1 [range: 22–66] 48 Neck and back pain .24 weeks: 43% OMPSQ current pain:

M56.2 [SD 52.1]

OMPSQ average pain:

M55.1 [SD 52.2]

Dunstan et al.

Australia;

Primary care

Occupational injury compensation

database

[range: 18–65] 34 Musculoskeletal pain NA NA

Margison and

French

—Derivation

sample

Canada;

Primary care

Private-sector clinics and

physiotherapy clinics

M541.2 [SD 510.8] 41 Neck, shoulder, upper back, lower

back, arm, wrist, and hand, leg,

ankle, and foot, and other pain

M56.7 weeks [SD 51.7] Pain intensity past 3 months:

M56.8 [SD 52.0]

(continued on next page)

4 (2019) e775 www.painreportsonline.com 7

Table 2 (continued)

Key study and participant characteristics of included validation studies.

Study Country; setting Age in years [SD; (IQ-)range] % female Pain type Pain duration Pain intensity

Margison and

French

—Validation

sample

Canada;

Primary care

Private-sector clinics and

physiotherapy clinics

M541.5 [SD 59.8] 39 Neck, shoulder, upper back, lower

back, arm, wrist, and hand, leg,

ankle, and foot, and other pain

M56.6 weeks [SD 51.5] Pain intensity past 3 months:

M57.1 [SD 52.1]

Maher and

Grotle

—Australasian

sample

Australia and New Zealand;

Primary care

Physiotherapy clinics

M543.3 [SD 512.1] 43 Nonspecific low back pain 6–8 weeks: N 545

9–11 weeks: N 538

12 weeks: N 517

OMPSQ current pain:

M55.2 [SD 51.9]

Maher and

Grotle

—Norwegian

sample

Norway;

Primary care

Doctors and chiropractors

M538.7 [SD 59.7] 56 Low back pain ,1 week: N 550

1–2 weeks: N 521

2–3 weeks: N 529

OMPSQ current pain:

M56.8 [SD 51.8]

Gabel et al.

—OMPSQ Australia;

Primary care

Physiotherapy outpatient clinics

M539 [SD 57; range: 18–58] 42 Lower back, lower back and leg,

lower back and neck, back, neck, and

shoulder pain

M54.0 weeks [SD 58.2]

6% chronic

OMPSQ current pain:

M56.5 [SD 51.8]

OMPSQ average pain:

M56.2 [SD 53.0]

Gabel et al.

—OMSQ Australia;

Primary care

Physiotherapy outpatient clinics

M539 [SD 59; range: 18–58] 43 Neck/back, arm, leg, both sides, and

several areas

M54.1 weeks [SD 58.1]

8% chronic

OMSQ intensity acute:

M56.6 [SD 51.9]

OMSQ severity chronic:

M55.8 [SD 52.7]

Linton et al.

Sweden;

Primary care

M548 83 Nonspecific back or neck pain NA NA

Gabel et al.

Australia;

Primary care

Physiotherapy centers

M538.9 [SD 510.5; range:

18–65]

43 Musculoskeletal pain resulting from

work injury (back, neck, upper limbs,

lower limbs, and multisite pain)

Item 3 OMSQ: M 54.1 [SD: 2.9] OMSQ intensity acute:

M56.3 [SD 52.0]

OMSQ severity chronic:

M56.0 [SD 52.9]

Nonclercq and Berquin

Belgium;

Secondary care

Emergency facility and outpatient

clinic

M542.2 [SD 510.7] 56 Back pain (lumbar pain, cervical pain,

and multisite pain)

,3 weeks: 58% NA

Dagfinrud et al.

Norway;

Primary care

Manual therapists

M544.3 [SD 514.4; range:

18–81]

59 Neck pain and low back pain 0–2 weeks: 23.4%

2–12 weeks: 24.1%

3–12 months: 13.9%

.1 year: 38.6%.

OMPSQ current pain:

M56.36 [SD 53.54]

Gabel et al.

Australia;

Primary care

Physiotherapy centers

M539.3 [SD 59.7] 43 General musculoskeletal pain (spine,

upper and lower limbs)

NA NA

Law et al.

China;

Primary care

Physiotherapy outpatient clinics

M544.2 [SD 511.2] 43 Nonspecific low back pain M 53.0 weeks [SD 51.8]

1–2 weeks: N 5114, 47.3%

3–5 weeks: N 5100, 41.5%

6–10 weeks: N 524, 9.9%

NPRS pain intensity:

M55.8 [SD 52.1]

Riewe et al.

Germany;

Primary, secondary care

Orthopaedic specialists,

rehabilitation facilities, and private

physiotherapy practices

M543 65 Nonspecific back pain .1 week: 94%

.24 weeks: 15%

OMPSQ current pain:

M55.5 [SD 52.1]

OMPSQ average pain:

M54.8 [SD 52.0]

OMPSQs

Linton et al.

Sweden;

Primary care

M548 83 Nonspecific back or neck pain NA NA

Karran et al.

Australia;

Secondary care

Spinal outpatient clinic

M549 [SD 516] 49 Low back pain, with or without leg

symptoms

,3 months: 20.9%

3–6 months: 33.6%

.6 months: 44.6%

NRS pain intensity previous week:

M57.1 [SD 52.2]

(continued on next page)

8E. Veirman et al.·4 (2019) e775 PAIN Reports

Table 2 (continued)

Key study and participant characteristics of included validation studies.

Study Country; setting Age in years [SD; (IQ-)range] % female Pain type Pain duration Pain intensity

OMSQs

Gabel et al.

Australia;

Primary care

Physiotherapy centers

M539.3 [SD 59.7] 43 General musculoskeletal pain (spine,

upper and lower limbs)

NA NA

STarT Back

Hill et al.

United Kingdom;

Primary care

General practice

M545 [SD 59.7] 59 Nonspecific back pain ,1 month: N 583, 17%

1–3 months: N 594, 19%

4–6 months: N 577, 15%

7 months–3 years: N 5125, 25%

.3 years: N 5112, 22%

NRS pain intensity mean (least,

average, current):

Mild (0–5): N 5325, 65%

Moderate (6–7): N 5113, 23%

Severe (8–10): N 554, 11%

Fritz et al.

United States;

Primary care

Outpatient physical therapy clinics

M544.3 [SD 515.8] 57 Low back pain

Med

546 days [IQ range:

18.5–147]

NRS initial pain intensity:

M55.3 [SD 52.3]

Field and Newell

United Kingdom;

Primary care

Chiropractic clinics

Low risk: M 545.4 [SD 515.1]

Medium risk: M 545.9 [SD 515.0]

High risk: M 545.8 [SD 514.1]

Low risk: 55

Medium

risk: 53

High risk:

Nonspecific low back pain ,1 month: 56.2%

1–3 months: 12.4%

.3 months: 31.4%

BQ pain:

Low risk:

Med

55 (range: 4–7)

Medium risk:

Med

57 (range: 6–8)

High risk:

Med

57 (range: 6–9)

Beneciuk et al.

United States;

Primary care

Outpatient physical therapy clinics

M541.1 [SD 513.5] 61 Low back pain

Med

590.0 days [IQ range:

30–365]

#14 days: 11.8%

15–90 days: 39.2%

$90 days: 49.0%

NRS pain intensity mean (current,

best, and worst):

M55.3 [SD 52.0]

Morsø et al.

—UK sample United Kingdom;

Primary care

General practices

Med

546.0 [IQ range 539–53] 59 Nonspecific low back pain ,4 weeks: N 5327, 38.2%

4–12 weeks: N 5221, 25.8%

.12 weeks: N 5285, 33.3%

NRS pain intensity:

Med

55 [IQ range: 3–7]

Mild (0–5): N 5527, 61.6%

Moderate (6–7): N 5196, 22.9%

Severe (8–10): N 5127, 14.8%

Morsø et al.

—Danish

sample

Denmark;

Primary care

General practices and physiotherapy

clinics

Med

550.0 [IQ range 541–59] 58 Nonspecific low back pain ,4 weeks: N 5149, 44.2%

4–12 weeks: N 566, 19.6%

.12 weeks: N 5122, 36.2%

NRS pain intensity:

Med

57 [IQ range: 5–8]

Mild (0–5): N 5130, 38.7%

Moderate (6–7): N 598, 29.2%

Severe (8–10): N 5108, 32.1%

Morsø et al.

—Primary care

sample

Denmark;

Primary care

General practices and physiotherapy

clinics

M552.0 [SD 515.2] 57 Low back pain ,1 month: N 565, 38.9%

1–3 months: N 539, 23.4%

.3 months: N 563, 37.7%

NRS low back pain intensity:

Med

56 (IQ range: 4–7)

NRS leg pain intensity:

Med

53 (IQ range: 0–6)

Morsø et al.

—Secondary

care sample

Denmark;

Secondary care

Spine center

M552.0 [SD 514.1] 54 Low back pain ,1 month: N 547, 5.0%

1–3 months: N 5139, 14.9%

.3 months: N 5746, 80.0%

NRS low back pain intensity:

Med

55 (IQ range: 4–7)

NRS leg pain intensity:

Med

55 (IQ range: 2–7)

Foster et al.

United Kingdom;

Primary care

Family practices

M553.0 [SD 515.0] 55 Nonspecific low back pain ,1 month: N 575, 20%

1–3 months: N 562, 17%

3–6 months: N 575, 20%

6 months–3 years: N 582, 22%

.3 years: N 574, 20%

NRS pain intensity:

M55.3 [SD: 2.4]

George and Beneciuk

United States;

Primary care

Outpatient physical therapy clinics

Med 545, M 543.5 [SD 512.4] 65 Low back pain $90 days: N 553, 47.7% NRS pain intensity mean (current,

best, and worst):

Med

55.3, M 55.4 [SD: 1.9]

(continued on next page)

4 (2019) e775 www.painreportsonline.com 9

Table 2 (continued)

Key study and participant characteristics of included validation studies.

Study Country; setting Age in years [SD; (IQ-)range] % female Pain type Pain duration Pain intensity

Newell et al.

United Kingdom;

Primary care

Chiropractic clinics

M547.8 [SD 513.9] 57 Nonspecific low back pain ,1 month: 43.2%

1–3 months: 10.0%

.3 months: 46.6%

BQ pain:

M56.4 [SD: 2.0]

Kongsted et al.

Denmark;

Primary care

Chiropractic clinics

M543 44 Nonspecific low back pain or lumbar

nerve root involvement

0–2 week: 62%

2–4 weeks: 13%

1–3 months: 11%

.3 months: 14%

NRS low back pain intensity:

M56.5

NRS leg pain intensity:

M52.4

Karran et al.

Australia;

Secondary care

Spinal outpatient clinic

M549 [SD 516] 49 Low back pain, with or without leg

symptoms

,3 months: 20.9%

3–6 months: 33.6%

.6 months: 44.6%

NRS pain intensity previous week:

M57.1 [SD 52.2]

HKF-R10

Riewe et al.

Germany;

Primary, secondary care

Orthopaedic specialists,

rehabilitation facilities, and private

physiotherapy practices

NA 67 Nonspecific back pain .8 days: 88% HKF-R10 pain past week:

M553.14 [SD 522.13]

HKF-R10 pain past week in best

stage:

M527.85 [SD 521.14]

PBSI

Sandborgh et al.

Sweden;

Primary care

Physical therapy departments and

occupational health care organization

M546 [SD 511; range: 19–64] 68 Musculoskeletal pain

Med

512 months (IQ range: 3–59,

range 1–300).

Subacute: N 522, 22%

Chronic: N 5131, 78%

PICKUP

Traeger et al.

115

Australia;

Primary care

General practitioners, pharmacists,

and physiotherapists

M545 [SD 515.8] 46 Low back pain with or without leg

pain

,2 weeks: N 51183, 78%

2–3 weeks: N 5149, 10%

3–4 weeks: N 577, 5%

4–6 weeks: N 5116, 8%

Likert Pain intensity:

None: N 50, 0%

Very mild: N 5290, 19%

Mild: N 5242, 16%

Moderate: N 5565, 37%

Severe: N 5346, 23%

Very severe: N 570, 5%

Karran et al.

Australia;

Secondary care

Spinal outpatient clinic

M549 [SD 516] 49 Low back pain, with or without leg

symptoms

,3 months: 20.9%

3–6 months: 33.6%

.6 months: 44.6%

NRS pain intensity previous week:

M57.1 [SD 52.2]

* Split by the outcome recovery, which is defined as the patient’s individual perception of well-being within the current health state.

BQ, Bournemouth Questionnaire; MPQ, McGill Pain Questionnaire; NA, not available; NRS, Numeric Rating Scale.

10 E. Veirman et al.·4 (2019) e775 PAIN Reports

rated as having low risk of bias. For 16.7% of the study samples,

the risk of bias was not rated because no performance measures

were reported for the outcomes of interest (Table 3). The reasons for

increasing the risk of bias related to the validity of the outcome,

overlap between predictors and outcomes, differences in the

assessment of outcomes across participants, differences in the

assessment of outcomes compared with the development study,

and blinding.

3.4.3.1. Validity of outcome definition

Outcome measures that mixed outcome domains were rated as

inadequate. Also, composite outcomes that combined outcome

measures or outcome domains were considered inadequate.

For example, the 10-item modified version of the Oswestry

Disability Index contains items that assess activity limitations and

participation restrictions.

Mixed or composite outcomes have

the potential to increase the event rate and thus the statistical

power. However, they may be misleading when the outcome

domains included in the outcome differ in importance to patients,

the number of events in the outcome domains of greater

importance is small, and the magnitude of effect differs markedly

across the outcome domains.

3.4.3.2. Exclusion of predictors from outcome definition

Next, overlap between predictor and outcome assessment was

frequently observed and considered as problematic. Several

Table 3

Methodological quality of included validation studies.

Study Participant selection Predictors Outcomes Sample size and participants flow Analysis

ALBPSQ

Hurley et al.

High High Unclear Unclear Moderate

Hurley et al.

High High Unclear Unclear Moderate

Grotle et al.

Moderate Unclear Moderate Unclear Moderate

Grotle et al.

Moderate High Unclear Unclear Moderate

Grotle et al.

Moderate Unclear Moderate Unclear Moderate

Heneweer et al.

Unclear Unclear — Unclear —

Vos et al.

124

Moderate High Unclear Unclear Moderate

OMPSQ

Linton and Boersma

High Unclear Moderate Unclear Moderate

Dunstan et al.

Low Unclear — Unclear —

Margison and French

—Derivation sample Low Moderate Moderate Unclear Moderate

Margison and French

—Validation sample Low Moderate Moderate Unclear Moderate

Maher and Grotle

—Australasian sample Moderate Moderate Unclear Unclear Moderate

Maher and Grotle

—Norwegian sample Low Moderate Unclear Unclear Moderate

Gabel et al.

—OMPSQ Moderate Moderate Moderate Unclear Moderate

Linton et al.

Unclear Low Unclear Unclear Moderate

Nonclercq and Berquin

Moderate Unclear Moderate Unclear Moderate

Dagfinrud et al.

High Unclear Unclear Unclear Moderate

Law et al.

Moderate Unclear Moderate High Moderate

Riewe et al.

High High Moderate Moderate Moderate

OMSQ

Gabel et al.

Moderate Moderate Moderate Unclear Moderate

Gabel et al.

Low Moderate Moderate Unclear Moderate

Gabel et al.

Low Moderate Moderate Unclear Moderate

OMPSQs

Linton et al.

Unclear Low Unclear Unclear Moderate

Karran et al.

High Unclear Unclear Low Low

OMSQs

Gabel et al.

Low Unclear Moderate Unclear Moderate

HKF-R10

Riewe et al.

Moderate Low Moderate Moderate Moderate

PBSI

Sandborgh et al.

High Low Unclear Low Moderate

SBT

Hill et al.

Moderate Low Unclear Unclear Moderate

Fritz et al.

Moderate Low — Unclear —

Field and Newell

Moderate Low — Unclear —

Beneciuk et al.

Moderate Low Moderate Unclear Moderate

Morsø et al.

—UK sample Moderate Low Moderate Unclear Moderate

Morsø et al.

—Danish sample Moderate Low Moderate Unclear Moderate

Morsø et al.

—Primary care sample Moderate Low Unclear Unclear Moderate

Morsø et al.

—Secondary care sample Moderate Low Unclear Unclear Moderate

Foster et al.

Moderate Low — Unclear —

George and Beneciuk

Moderate Low — High Moderate

Newell et al.

Moderate Low — Unclear —

Kongsted et al.

Moderate Low Moderate Low Moderate

Karran et al.

Moderate Low Unclear Low Low

PICKUP

Traeger et al.

115

Moderate Unclear Unclear Low Low

Karran et al.

High Unclear Unclear Low Low

4 (2019) e775 www.painreportsonline.com 11

Table 4

Key predictor, outcome, sample size and participants flow, and analysis characteristics of included validation studies.

Study N at baseline, (follow-up(s); N at

follow-up(s); % at final follow-up)

Outcome (assessment, applied cutoff) Events (N and/or %) Recommended criterion Performance measures

ALBPSQ

Hurley et al.

118 (at treatment discharge; 118; 100%) Pain intensity (MGPQ, NA)

Functional disability (RMDQ, NA)

Return to work (yes/no)

29/15

112 Kendall’s t

Kendall’s t

Mann–Whitney U tests, sensitivity, and

specificity

Hurley et al.

118 (12 months; 90; 76%) Pain intensity (MGPQ, NA)

Functional disability (RMDQ, NA)

Work loss (yes/no)

14/55 (20.2%/79.7%)

112 Kendall’s t

Kendall’s t

Mann–Whitney U tests, sensitivity, and

specificity

Grotle et al.

123 (1, 3 months; 120; 98%) Pain intensity (NRS, NA)

Disability (RMDQ, .4 on both 1 and 3

months)

Sickness absence (NA)

24%

8% at 1 month

6% at 3 months

90 NA

ORs

Grotle et al.

123 (6 and 12 months; 112; 91%) Pain intensity (NRS, score .2)

Disability (RMDQ, .4)

Work loss (disability days, .30 days)

90 (105 for 12 months RMDQ) Specificity, sensitivity, LRs (2/1), AUC,

and ORs (for all outcomes)

Grotle et al.

123 (1, 3, 6, 9, and 12 months; 112; 91%) Pain intensity (NRS, NA)

Disability (RMDQ, .4)

Sickness absence (disability days, NA)

17% at 12 months

12 (11%) at 1 month

10 (9%) at 3 months

7 (7%) at 6 months

7 (8%) at 9 months

9 (9%) at 12 months

112 NA

ORs at 12 months

Heneweer et al.

66 (2, 4, 8, and 12 weeks; 56; 95%) Pain intensity (VAS, NA)

Disability (QBPDS, NA)

Work absenteeism (yes/no)

7/49 (87%/13%) at 12

weeks

Continuous NA

Vos et al.

124

187 (6, 12, 26, and 52 weeks; 180; 96%) Pain intensity (NRS, NA)

Sick leave (.7 days)

31 (22%)

72 NA

Specificity, sensitivity, PPV, NPV, and AUC

OMPSQ

Linton and Boersma

122 (6 months, 107; 88%) Pain intensity (OMPSQ items, $17)

Function (OMPSQ items, $45)

Sick leave (.0 days, .30 days)

48%

60%

60%/23%/17%

90 Specificity, sensitivity, and Wilks’ l(for all

outcomes)

Dunstan et al.

55 (6 months, 55; 100%) Return to work (yes/no) 24/31 Continuous NA

Margison and French

—Derivation

sample

200 (200; 100%) Clinical discharge status (fit/not fit for

return to work)

NA 147 Sensitivity and FPR

Margison and French

—Validation

sample

211 (211; 100%) Clinical discharge status (fit/not fit for

return to work)

195/16 147 Specificity, sensitivity, PPV, and NPV

Maher and Grotle

—Australasian

sample

133 (6 weeks, 3, 12 months; 133; 100%) Pain intensity (OMPSQ item, NA)

Disability (RMDQ, NA)

Continuous Regression coefficients

Regression coefficients

Maher and Grotle

—Norwegian

sample

97 (4 weeks, 3, 12 months; 97; 100%) Pain intensity (OMPSQ item, NA)

Disability (RMDQ, NA)

Continuous Regression coefficients

Regression coefficients

Gabel et al.

—OMPSQ 66 (6 months; 58; 88%) Problem severity (NRS, .1)

Functional status (SFI, $10%)

Absenteeism (PDO, .0 days)

Long-term absenteeism (PDO, .28 days)

113

115

120

Specificity, sensitivity, LRs, and AUC (for all

outcomes)

Gabel et al.

—OMSQ 106 (6 months; 97; 92%) Problem severity (NRS, .1)

Functional status (SFI, $10%)

112

Specificity, sensitivity, LRs, and AUC (for all

outcomes)

(continued on next page)

12 E. Veirman et al.·4 (2019) e775 PAIN Reports

Table 4 (continued)

Key predictor, outcome, sample size and participants flow, and analysis characteristics of included validation studies.

Study N at baseline, (follow-up(s); N at

follow-up(s); % at final follow-up)

Outcome (assessment, applied cutoff) Events (N and/or %) Recommended criterion Performance measures

Absenteeism (PDO, .0 days)

Long-term absenteeism (PDO, .28 days)

116

120

Linton et al. (2011) 183 (12 months; 183; 100%) Sick leave (.14 days of work during past 6

months)

171 90 Specificity, sensitivity, LRs, and AUC

Gabel et al. (2012) 143 (1 month, 6 months; 43; 100%) Problem severity (NRS, .10%)

Functional status (SFI/LLFI, .10%)

Absenteeism (PDO, .0 days)

Long-term absenteeism (PDO, .28 days)

114 Specificity, sensitivity, and LRs (1) (for all

outcomes)

Nonclercq & Berquin (2012) 91 (6 months; 73; 80%) Pain intensity (OMPSQ items, .16)

Function (OMPSQ items, ,45; ODI,

.20%)

Work absence (OMPSQ item, .6 [scores

corresponding to .30 days])

34%

58%; 18%

37%

Low/high

75/97

76/86

75/106

71/106

Specificity, sensitivity, PPV, and AUC (for all

outcomes)

Dagfinrud et al. (2013) 157 (8 weeks; 128; 82%) Functional limitations (ODI/NDI, NA) NA Continuous/105 Regression coefficients

Gabel et al. (2013) 143 (6 months; 143; 100%) Problem severity (NRS, .10%)

Functional status (PRO, .10%)

Absenteeism (PDO, .0 days)

Long-term absenteeism (PDO, .28 days)

126 Specificity, sensitivity, LRs (1), and

tests

Law et al. (2013) 241 (3–4 weeks, 12 months; per outcome:

184, 160, 220, 202; 76%, 66%, 91%,

84%)

Pain intensity (NRS, NA)

Functional disability (RMDQ, NA)

Return to work (yes/no)

Sick leave (.30 days)

171/49 at 12 months

88 at 12 months

105, 130 NA

Specificity, sensitivity, AUC, and ORs (both

outcomes)

Riewe et al. (2016) 241 (6 months; per outcome: 122, 122,

108; 51%, 51%; 45%)

Pain intensity (OMPSQ items, $17)

Function (OMPSQ items, ,45)

Sick leave (.0 days)

84 Specificity, sensitivity, PPV, NPV, LRs

(1/2), and AUC (for all outcomes)

OMPSQs

Linton et al.

183 (12 months; 183; 100%) Sick leave (.14 days of work during past 6

months)

171 50 Specificity, sensitivity, LRs, and AUC

Karran et al.

220 (4 months; 195; 89%) Poor outcome (composite pain/disability

NRS, $3)

Pain intensity (NRS, $3)

Disability (NRS, $3)

High pain (NRS, $5)

High disability (NRS, $5)

164 (84%)

155 (79%)

159 (82%)

129 (66%)

126 (65%)

Lowest 10th through highest 10th decile

of risk

Nagelkerke

, AUC, calibration plot (for

poor outcome), net benefit, post hoc

sensitivity analysis (for poor outcome and

high pain), and AUC (for all outcomes)

OMSQs

Gabel et al.

143 (6 months; 143; 100%) Problem severity (NRS, .10%)

Functional status (PRO, .10%)

Absenteeism (PDO, .0 days)

Long-term absenteeism (PDO, .28 days)

72 Specificity, sensitivity, LRs (1), and

tests

STarT Back

Hill et al.

500 (6 months; 500, 100%) Disability (RMDQ, $7) Low risk: 39 (16.7%)

Medium risk: 99 (53.2%)

High risk: 58 (78.4%)

Low, medium, and high risk groups Sensitivity, specificity, LRs (1/2), and

AUC

Fritz et al.

214 (at each visit; 177, 83%) Pain intensity (NRS, NA)

Disability (DISQ, NA)

NA Low, medium, and high risk groups NA

Field and Newell

404 (14, 30, 90 days; per follow-up per

outcome: 218/204, 123/119, 142/136;

54%/50%, 30%/29%, 35%/34%)

Pain (BQ, NA)

Total (BQ, NA)

Low, medium, and high risk groups NA

(continued on next page)

4 (2019) e775 www.painreportsonline.com 13

Table 4 (continued)

Key predictor, outcome, sample size and participants flow, and analysis characteristics of included validation studies.

Study N at baseline, (follow-up(s); N at

follow-up(s); % at final follow-up)

Outcome (assessment, applied cutoff) Events (N and/or %) Recommended criterion Performance measures

Beneciuk et al.

146 (4 weeks, 6 months; 128, 111; 88%,

76%)

Pain intensity (NRS, NA)

Disability (RODQ, NA)

Continuous Regression coefficients

Regression coefficients

Morsø et al.

—UK sample 856 (3 months; 845, 99%) Pain intensity (NRS, $8)

Activity limitations (RMDQ, .30)

Pain bothersomeness (1 item, severe or

very severe)

36%

Low, medium, and high risk groups AUC

RR, ORs, and AUC

AUC

Morsø et al.

—Danish sample 344 (3 months, 322, 94%) Pain intensity (NRS, $8)

Activity limitations (RMDQ, .30)

Pain bothersomeness (1 item, severe or

very severe)

47%

Low, medium, and high risk groups AUC

RR, ORs, and AUC

AUC

Morsø et al.

—Primary care

sample

172 (6 months; 144, 83%) Pain intensity (NRS, $8)

Activity limitations (RMDQ, .30)

40.2%

Low, medium, and high risk groups AUC

RR, ORs, and AUC

Morsø et al.

—Secondary care

sample

960 (6 months; 960, 100%) Pain intensity (NRS, $8)

Activity limitations (RMDQ, .30)

69.0%

Low, medium, and high risk groups AUC

RR, ORs, and AUC

Foster et al.

368 (2, 6 months; 254 (69%), 233 (63%) Pain intensity (NRS, NA)

Disability (RMDQ, NA)

NA Low, medium, and high risk groups NA

George and Beneciuk

146 (6 months; 111, 76%) Pain intensity (NRS 50)

Disability (RMDQ, #2)

Recovery (NRS 50 and RMDQ #2)

14 (12.6%)

36 (32.4%)

14 (12.6%)

Low, medium, and high risk groups Wilks’ l

Wilks’ l

Newell et al.

Initial treatment/2-days post-initial

treatment: 749/716 (14, 30, 90 days; per

follow-up: 542, 416, 318; 58%

Pain (BQ, NA)

Total (BQ, NA)

NA Low, medium, and high risk groups NA

Kongsted et al.

859 (2 weeks, 3, 12 months; per follow-up:

710, 676, 636; 83%, 79%, 74%)

Pain intensity (NRS, .0)

Disability (RMDQ, .8)

92% at 2 weeks

60% at 3 months

56% at 12 months

79% at 2 weeks

61% at 3 months

57% at 12 months

Low, medium, and high risk groups LR (1/2), AUC, and

Karran et al.

220 (4 months; 195; 89%) Poor outcome (composite pain/disability

NRS, $3)

Pain intensity (NRS, $3)

Disability (NRS, $3)

High pain intensity (NRS, $5)

High disability (NRS, $5)

164 (84%)

155 (79%)

159 (82%)

129 (66%)

126 (65%)

Low, medium, and high risk groups Nagelkerke

, AUC, calibration plot (for

poor outcome), net benefit, post hoc

sensitivity analysis (for poor outcome and

high pain), and AUC (for all outcomes)

HKF-R10

Riewe et al.

242 (6 months; 128; 58%) Pain intensity (HKF-R10 items, $30) 90 37 Specificity, sensitivity, PPV, NPV, LRs

(1/2), and AUC

PBSI

Sandborgh et al.

168 (8 months; 146, 85%) High pain intensity (NRS, $5)

High disability (PDI, $35)

Continuous NA

Specificity, sensitivity, and Wilks’ l

(continued on next page)

14 E. Veirman et al.·4 (2019) e775 PAIN Reports

studies used items of the investigated screening tool, measured

at follow-up, as primary outcome. For instance, Linton and

Boersma

used the OMPSQ in its entirety during the outcome

assessment, selecting the items on pain, activity limitations, and

sick leave. Studies also often included outcomes that showed

overlap with domains assessed by the screening tool items. In the

study by Grotle et al.,

both the activity items of the ALBPSQ and

the items of the Roland–Morris Disability Questionnaire (RMDQ)

outcome measure address activity limitations. This overlap may

lead to overestimation of the predictive performance of the

screening tool.

91,117

3.4.3.3. Definition and assessment of outcomes across

participants

For all studies, outcomes were defined and determined in

a similar way across participants. However, they were not

always defined and determined similarly to those in the

development studies. Indeed, although different outcomes

most probably have different predictors, a number of studies

targeted outcome domains (eg, pain intensity through OMPSQ

items and activity limitations through the RMDQ and not

participation restrictions through accumulated sick leave)

which differed from the development study. Other studies

focused on similar outcome domains, but used other measures

(eg, activity limitations through a NRS and not the RMDQ due to

the large amount of missing data).

3.4.3.4. Definition and assessment of outcomes compared

with the developmental model

In addition, some studies focused on similar outcome domains

and used the same outcome measures as the development

study, but used different cutoff points for the outcome measures

from those used in the development study. For example, large

differences were observed for sick leave. Vos et al.

124

defined

long-term sick leave as .7 days off work, while Linton and

Hallden

initially defined long-term sick leave as being sick listed

for .30 days (Table 4).

3.4.3.5. Determination of outcomes without knowledge of

predictor information

Information on blinding was most often not reported, which could

either mean that the outcome assessment was not blinded or that

it was blinded but not described. In cases where studies reported

on blinding of outcome assessment, researchers usually applied

blinding.

3.4.4. Sample size and participant flow

There was a huge difference between sample sizes of the

validation studies. Sample sizes varied considerably at follow-up,

ranging from ,100 participants,

18,25,38,43,44,64,81

over 500 to

1000 participants,

41,53,77,78,80

to .1500 participants.

115

Also,

the number of outcome events differed largely between studies

ranging from 14 to 291. The most frequently observed time

intervals were 3, 6, and 12 months

(see Table 4 for an

overview).

Few studies were rated as having low (16.7%), moderate

(2.4%), or high (4.8%) risk of bias for the PROBAST “sample size

and participants flow” domain. The majority of studies were

assigned an unclear risk of bias (76.2%; Table 3) because

insufficient information was presented to evaluate the number of

Table 4 (continued)

Key predictor, outcome, sample size and participants flow, and analysis characteristics of included validation studies.

Study N at baseline, (follow-up(s); N at

follow-up(s); % at final follow-up)

Outcome (assessment, applied cutoff) Events (N and/or %) Recommended criterion Performance measures

PICKUP

Traeger et al.

115

1528 (3 months; per outcome: 1528,

1525, 1504; 100%, 99%, 98%)

Pain intensity (Likert, .2)

High pain intensity (Likert, .3)

Disability (Likert, $2)

291 (19%)

162 (10%)

217 (14%)

Calculator Nagelkerke

, AUC, calibration plot

(intercept/slope), net benefit at incidence

rate cutoff, and net number of unnecessary

interventions avoided at 30% risk cutoff (for

all outcomes)

Karran et al.

220 (4 months; 195; 89%) Poor outcome (composite pain/disability

NRS, $3)

Pain intensity (NRS, $3)

Disability (NRS, $3)

High pain intensity (NRS, $5)

High disability (NRS, $5)

164 (84%)

155 (79%)

159 (82%)

129 (66%)

126 (65%)

Lowest 10th through highest 10th decile

of risk

Nagelkerke

, AUC, calibration plot (for

poor outcome), net benefit, post hoc

sensitivity analysis (for poor outcome and

high pain), and AUC (for all outcomes)

AUC, area under the curve; BQ, Bournemouth Questionnaire; DISQ, 10-item modified version of the Oswestry Low Back Pain Disability Questionnaire; FNR, false negative rate; FPR, false positive rate; LLFI, Lower Limb Functional Index; LRs, likelihood ratios; MCID, minimal clinically important difference; MGPQ,

McGill Pain Questionnaire; NA, not available; NDI, Neck Disability Index; NPV, negative predicted value; NRS, Numeric Rating Scale; ODI, Oswestry Disability Index; ORs, odds ratios; PDI, Pain Disability Index; PDO, paid days off; PPV, positivepredicted value;PRO, patient-reported outcome; QBPDS, Quebec Back

Pain Disability Scale; RMDQ, Roland–Morris Disability Questionnaire; RODQ; Revised Oswestry Disability Questionnaire; RR, relative risk; SFI, Spine Functional Index; VAS, Visual Analogue Scale.

4 (2019) e775 www.painreportsonline.com 15

outcome events, the inclusion of enrolled participants, or the

occurrence and handling of missing data. The reasons for

increasing the risk of bias related to the number of outcome

events, the time interval between the assessment of the

screening tools and the outcome assessment, dropout, and

missing data.

3.4.4.1. Number of outcome events

The number of events (ie, the number of individuals with the

outcome event) was not reported in a large number of

studies

5,14,21–25,33,38,64,66,80

and considered inappropriate in 5

studies.

28,31,44,63,81

These studies reported ,20 events, raising

the issue of overfitting (ie, the probability of an event is typically

underestimated in low-risk patients and overestimated in high-

risk patients).

4,85

3.4.4.2. Time interval between predictor assessment and

outcome determination

Studies sometimes performed multiple follow-ups, reporting

results on the predictive validity for one or only a selection of

follow-ups (eg, follow-ups at 2- and 4-week intervals until

discharge or study completion at 6 months, report of results for

6-month follow-up).

Time between screening and outcome

assessment was considered inappropriate when results only

reported on follow-ups of ,3 months, as chronic pain is defined

as pain $3 months (eg, six weeks).

Follow-ups .12 months

were also considered inappropriate, as people’s (mental) health

status changes during the follow-up period and the baseline

information becomes increasingly less accurate as time passes

(none of the studies). In addition, follow-ups that varied across

participants (eg, at treatment discharge, dependent on the

number of therapy treatments)

were deemed inappropriate.

Surprisingly, most studies did not present any theoretical

considerations underpinning the choice of a specific follow-up

timeframe (Table 4).

3.4.4.3. Inclusion of enrolled participants in analysis

Dropout attrition is often poorly reported or presented in a way

that prevents readers from being able to fully understand the

risk of attrition bias. Studies often limit themselves to reporting

the dropout rate. We considered dropout as inappropriate

when .20%

of the participants were lost at follow-

up.

5,20,21,28,44,53,80,92

However, dropout can occur for a num-

ber of reasons that may lead to differential dropout, such as

motivation (participants lost interest), mobility (participants

moved and are no longer able to continue participation),

morbidity (participants experience illness preventing their

participation), or mortality (participants die before study

completion). For example, a low psychosocial risk group may

lose more unmotivated participants—that in turn may have

different outcomes due to being unmotivated—than a high

psychosocial risk assessment group, and this differential

dropout may lead to differences in outcomes measured

among the remaining participants. Reasons for dropout are,

however, rarely specified among the included studies.

Furthermore, although characteristics of dropout (ie, baseline

characteristics: eg, age, sex, pain intensity, and pain duration)

should be available to examine whether systematic differences

exist between those who completed a study and those who

dropped out,

only few studies reported on the differences

between completers and noncompleters.

5,28,44,50,53,80,81,98

Of these studies, some provided a detailed tabulation of the

characteristics and statistical comparison,

whereas other

studies only reported the characteristics for which differences

were found.

Further, numerous studies do not mention

whether differences were examined, which could either mean

that differences were examined for all or some baseline

characteristics but none were found, or no differences were

tested.

3.4.4.4. Handling of missing data

Finally, studies did often not report on missing values or how they

were or would have been handled,

which could either mean

that there were no missing data or that missing data were present

but not described. Missing values were considered inappropri-

ately handled when complete-case analysis was applied.

They

were judged as appropriately handled when multiple imputation

was used.

For example, Karran et al.

used Little’s Missing

Completely at Random test to determine whether values were

missing completely at random and used a maximization algorithm

to impute missing values.

3.4.5. Analyses

Statistics of reported performance measures for pain and related

outcomes varied widely. Many studies report sensitivity and

specificity of screening tools,

whereas others included further

details, reporting area under the curve using receiver operating

characteristics analyses.

Wilk’s lambda for discriminative

validity is also reported in some studies,

28,98

as are the odds

ratios from logistic regression analyses

(see Table 4 for an

overview).

For the PROBAST “analyses” domain, the majority of study

samples were assigned a moderate risk of bias (76%), and

only a few study samples were rated as low risk of bias (9.5%).

For 14.3% of the study samples, no risk of bias labels was

assigned because no performance measures were reported

for the outcomes of interest. The reason for increasing the

risk of bias related to the poor use of the performance

measures.

3.4.5.1. Evaluation of relevant model performance measures

Statistical analyses were found appropriate when they reflected

both calibration (ie, agreement between predicted and observed

event rates) and discrimination (ie, the screening tool’s ability to

distinguish between patients developing and not developing the

outcome of interest) components of predictive validity for pain

and related outcomes.

This was only the case in 2 studies.

50,115

These studies also reported more recently introduced perfor-

mance measures (eg, net benefit). Moreover, not all studies

reported performance measures for pain and related outcomes

despite assessing those outcomes. Some studies reported on

the course of particular pain and related outcomes. For example,

Grotle et al.

reported the course of pain intensity, disability, and

sickness absence from baseline across follow-ups, but reported

no information on the predictive validity of the ALBPSQ for those

outcomes, except for disability where odds ratios were provided.

Other studies reported differences in mean scores on the

screening tool for particular outcomes, used change scores for

particular outcomes, or reported on composite outcomes. For

example, Dunstan et al.

reported differences in mean ALBPSQ

scores between those who did and did not return to work.

Dagfinrud et al.

assessed functional limitations at baseline and

16 E. Veirman et al.·4 (2019) e775 PAIN Reports

follow-up; however, the predictive validity of the OMPSQ was

examined for functional improvement, and the categorization of

those that were improved and those that were not was based on

change scores. Finally, George and Beneciuk

assessed pain

intensity and disability; yet, discriminative validity was only

examined for recovery, a composite pain intensity and disability

outcome. Still others assessed pain and related outcomes, but

only reported performance measures related to outcomes that

were not within the scope of the current review. For instance,

Heneweer et al.

assessed pain intensity, disability, work

absenteeism, and self-reported recovery, but only reported area

under the curve values for the ALBPSQ total and subscale scores

in predicting recovery or nonrecovery at final follow-up (Table 4).

4. General discussion

This review (1) identified multidimensional screening tools that

assess psychosocial risk factors for poor pain outcomes, (2)

appraised the quality of the evidence in prospective studies

validating these tools, and (3) synthesized common methodo-

logical concerns in these validation studies.

Seven screening tools were identified, all developed for use in

primary care settings to predict chronic pain (HKF-R10, PICKUP)

or chronic disability (ALBPSQ/OMPSQ, OMPSQs, OMSQs,

PBSI, and STarT Back) in patients with back pain. Notably, we

found no tools for the prediction of pain-related distress, a key

indicator of health, or for the prediction of acute pain onset,

including postoperative pain. These appear to be significant gaps

in the literature.

101

We assessed the quality of the evidence of 32 studies including

42 study samples aiming to validate the predictive value of

identified screening tools. Overall, studies showed a moderate

risk of bias, which varied largely from domain to domain. Here, we

discuss the most notable methodological problems.

Most screening tools were developed to predict the chron-

ification of pain problems, except for the SBT and the PBSI, which

were developed to support decision-making for a wide range of

patients with pain conditions, regardless of pain duration.

41,97

It is

reasonable to expect that validation studies include similar patient

populations as those from the development study. Surprisingly,

this was often not the case. Indeed, although most tools were

developed to be used in patients with acute pain, a substantial

number of these validation study samples included also patients

with chronic pain. This is concerning for several reasons. First,

these studies do not address the same key question as the

development study. It may also well be that risk factors

developing chronic pain are different from predictors for the

maintenance of chronic pain. Second, it is likely that the recovery

rate of chronic pain is less than the one of acute pain.

Therefore,

the presence of chronic patients with chronic pain may (at least

partly) account for the apparently high performance in predicting

poor pain outcomes. This complicates interpretation of results

and may result in an underestimation or overestimation of the

predictive value of the screening tools. There is a need to define

the inclusion criteria for participants in a more clear and restrictive

way and to align these with the original purpose of the screening

tools.

The success of initial studies revealing the value of psychoso-

cial risk factors in predicting chronic pain problems has boosted

research in this area. However, some of the original studies were

designed with specific (clinical) groups in mind. An example is the

ALBPSQ, which was designed to target a working population.

Some items that are directly related to work (eg, “If you take into

consideration your work routines, management, salary,

promotion possibilities, and workmates, how satisfied are you

with your job?”) are therefore inapplicable to a nonworking

population. The authors have addressed this problem in various

ways. Some replaced the missing scores for those items by the

mean for nonworking patients.

Others asked patients to fill out

those questions related to either current paid or unpaid work.

43,44

Likewise, screening tools were developed for patients with

musculoskeletal, in particular back pain, but studies have also

investigated the value of the tools in other patient groups (eg,

neck pain).

124

Sometimes, items have been adapted accordingly

and/or left out. There is a lack of evidence, however, to suggest

that these changes are appropriate for the populations in

question.

All studies agree that screening tools need to predict poor pain

outcome. However, there is less agreement about what exactly

poor outcome means. Indeed, a gold standard for poor outcome

is lacking. The constructs addressed and the measures and

cutoffs used vary largely between studies. For some, poor

outcome simply means pain, for others not being able to work, or

difficulties in performing physical activities. However, different

outcomes most probably also have different predictors. The

broad use of the umbrella term “disability” brings additional

complications. Indeed, in pain research, “disability” may indicate

difficulties in performing particular physical activities (eg, ability to

walk, eat, shower, or dress) but also problems related to social

role functioning (eg, sick leave, days absent from work, or return

to work status). According to the International Classification of

Functioning (ICF),

130

these are 2 different constructs, ie, activity

limitations and participation restrictions, which should not be

confused. The lack of a gold standard may also explain the

inconsistency in criteria used across studies. For instance, Morsø

et al. defined poor pain outcome as a score greater than 7 on an

11-point NRS,

77,78

whereas George and Beneciuk

defined it as

a score greater than 0. It is obvious that the patients defined as

recovered differ between these studies. The use of an agreed-

upon set of outcome measures may provide a solution.

10,11,89

doing so, we also recommend the selection of measures that are

readily applicable to different contexts—occupational and non-

occupational settings—and to different pain problems. Such

measures already exist, but are underused (eg, Patient-Reported

Outcomes Measurement Information System, PROMIS,

9,118,119

available at www.healthmeasures.net).

Some of the identified screening tools were developed to

screen for psychosocial risk factors (“yellow flags”), or, at least,

are presented as such in studies. Some cautionary notes are

warranted. First, all screening tools also include items that could

be categorized otherwise (eg, pain duration and disability

compensation). Second, screening tools often contain items that

could equally well be the primary outcomes (pain intensity,

disability, and days off work). Although this may be less of

a problem when simply aiming to predict, it is premature to

explain the predictive power of these instruments in terms of

psychosocial processes. Indeed, given that it is generally known

that the best predictor of events in the future is their occurrence in

the present or past, it remains to be investigated whether the

predictive validity of screening tools is due to the overlap between

predictor and outcome.

91,117

To address this problem, one may

examine whether tools are able to predict outcomes, beyond the

predictive power of baseline pain and pain-related disability.

Most studies are not in line with the current guidelines for

reporting measures of performance.

110,111

In fact, there is a large

disparity in reported performance measures. Many studies

reported conventional performance measures, often reporting

either calibration (ie, how close predictions are to observed

4 (2019) e775 www.painreportsonline.com 17

outcomes) or discrimination (ie, screening tool’s ability to

correctly distinguish the 2 outcome classifications of event vs

nonevent). However, the reporting of both performance

measures is crucial. Furthermore, most studies do not

consider the clinical consequences of decisions made using

a screening tool. Therefore, there is the implicit assumption

that false-positive (ie, patient being treated unnecessarily) and

false-negative (ie, patient not getting a treatment that (s)he

would benefit from) predictions are equally harmful (ie, equally

weighted). More recent studies

50,115

do consider the relative

harms or benefits of these alternative clinical outcomes. They

apply novel performance measures such as net benefit (ie, the

expected utility of a decision to treat patients at some

threshold, compared with a decision based on an alternative

policy such as treating nobody)

75,110,111,120,121

(see also

www.decisioncurveanalysis.org).

An assessment of the risk of bias was not possible in

a considerable number of studies because of incomplete

reporting. A balanced evaluation of the risk of bias of studies

may be impeded due to nontransparent reporting. An increased

quality of reporting was observed over time, but there is still room

for improvement and there is a need for guidance. The “Trans-

parent Reporting of a multivariable prediction model for Individual

Prognosis Or Diagnosis” (TRIPOD) statement is particularly

helpful and provides guidance for the reporting of studies that

develop, validate, or update prediction models

12,73

(available

atwww.tripod-statement.org). We encourage researchers to

follow its recommendations. Equally important are the availability

of study protocols and the availability of data sets. Protocol

registration, either through publications, or through open science

applications, may reduce the impact of publication bias.

A large

number of validation studies in our review reported significant

results; yet, only 2 studies mentioned a protocol.

50,115

Protocol

registration may also reduce reporting bias.

It is common

practice to measure several outcomes, but the lack of a readily

accessible research protocol makes these studies vulnerable to

selective reporting of analyses that “worked.”

Another possi-

bility is to make data sets open, ie, available to all researchers.

115

Available data sets provide the opportunity to conduct secondary

analyses that may be informed by advances in theory and

scientific standards in the field.

There are some limitations to our review. First, we used a strict

search strategy. We excluded batteries of questionnaires and

tools that were not originally developed in the context of pain. This

may have resulted in missing instruments that are potentially

valuable. For example, the Amsterdam Preoperative Anxiety and

Information Scale (APAIS) was originally developed to evaluate

patient’s preoperative anxiety and need for preoperative in-

formation regarding the scheduled surgery and anesthesia.

Subsequently, this tool was used to predict postoperative

pain.

46,48

Second, we focused upon multidimensional screening

tools. Otherwise, one may make use of unidimensional ques-

tionnaires assessing single psychosocial risk factors to investi-

gate the predictive power of unique psychosocial variables (eg,

Pain Catastrophizing Scale

113

and Tampa Scale for Kinesiopho-

bia

) for poor pain outcomes. For screening purposes, however,

one should aim to minimize the burden of filling out questionnaires

for participants. The use of large questionnaire batteries should

therefore be avoided. Third, this research field is quickly evolving,

with new validation studies appearing at a fast pace. Since our

search, new instruments have been validated in an independent

study. For instance, the Optimal Screening for Prediction of

Referral and Outcome cohort yellow flag assessment tool was

developed in a cross-sectional cohort in 2016.

Recently,

a validation study was published.

Fourth, clinical prediction

modelling is a dynamic and evolving field

15,47,56,94,108–111

(see

also progress-partnership.org). One should keep in mind that the

present review is an exploratory mapping of this rapidly evolving

field. Assessment of the quality evidence in the included studies

was based upon a prepublication version of the PROBAST. This

version did not yet provide a guideline for scoring the questions.

We constructed, therefore, our own coding system. Now,

PROBAST has been published, with some minor changes from

the prepublication version of the PROBAST (eg, the signaling

questions of the domain “Sample size and participants flow” are

now included in the domain Outcomes and the domain

Analysis).

76,128

Despite this minor changes, the resulting map-

ping fulfills the primary goal of providing an entry point to reduce

risk of bias in this field. Fifth, we did not perform a meta-analysis.

Several meta-analyses are available that synthesize the predictive

value of screening tools. They indicate that (1) the predictive value

of these screening is highly variable depending on the pain

outcome of interest (eg, pain and disability) and (2) substantial

heterogeneity between studies exist.

49,99

Taking into account

methodological differences and quality criteria is therefore crucial

to further our understanding of the predictive value of screening

tools. Our insights have the potential to improve research in this

area and decision-making based on this research.

Disclosures

The authors have no conflict of interest to declare.

Preparation of this article was supported by funding from the

European Union’s Horizon 2020 research and innovation pro-

gram (Grant 633491).

Article history:

Received 11 March 2019

Received in revised form 11 June 2019

Accepted 26 June 2019

References

[1] Abegglen S, Hoffmann-Richter U, Schade V, Znoj HJ. Work and

Health Questionnaire (WHQ): a screening tool for identifying injured

workers at risk for a complicated rehabilitation. J Occup Rehabil

2016;27:268–83.

[2] Althof JE, Beasley BD. Psychosocial management of the foot and ankle

surgery patient. Clin Podiatr Med Surg 2003;20:199–211.

[3] Andersen JH, Haah JP, Frost P. Risk factors for more severe regional

musculoskeletal symptoms: a two-year prospective study of a general

working population. Arthritis Rheum 2007;56:1355–64.

[4] Austin PC, Steyerberg EW. Events per variable (EPV) and the relative

performance of different strategies for estimating the out-of-sample

validity of logistic regression models. Stat Methods Med Res 2017;26:

796–808.

[5] Beneciuk JM, Bishop MD, Fritz JM, Robinson ME, Asal NR, Nisenzon

AN, George SZ. The STarT back screening tool and individual

psychological measures: evaluation of prognostic capabilities for low

back pain clinical outcomes in outpatient physical therapy settings. Phys

Ther 2013;93:321–33.

[6] Blyth MF, March ML, Nicholas KM, Cousins JM. Chronic pain, work

performance and litigation. PAIN 2003;103:41–7.

[7] Breivik H, Collett B, Ventafridda V, Cohen R, Gallacher D. Survey of

chronic pain in Europe: prevalence, impact on daily life, and treatment.

Eur J Pain 2006;10:287–333.

[8] Broadbent E, Wilkes C, Koschwanez H, Weinman J, Norton S, Petrie

KJ. A systematic review and meta-analysis of the Brief Illness Percep tion

Questionnaire. Psychol Health 2015;30:1361–85.

[9] Cella D, Riley W, Stone A. The Patient-Reported Outcomes

Measurement Information System (PROMIS) developed and tested its

first wave of adult self-reported health outcome item banks: 2005–2008.

J Clin Epidemiol 2010;63:1179–94.

18 E. Veirman et al.·4 (2019) e775 PAIN Reports

[10] Chiarotto A, Boers M, Deyo RA, Buchbinder R, Corbin TP, Costa L,

Foster NE, Grotle M, Koes BW, Kovacs FM, Lin CC, Maher CG,

Pearson AM, Peul WC, Schoene ML, Turk DC, van Tulder MW,

Terwee CB, Ostelo RW. Core outcome measurement instruments

for clinical trials in nonspecific low back pain. PAIN 2018;159:

481–95.

[11] Chiarotto A, Ostelo RW, Turk DC, Buchbinder R, Boers M. Core

outcome sets for research and clinical practice. Braz J Phys Ther 2017;

21:77–84.

[12] Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent

reporting of a multivariable prediction model for individual

prognosis or diagnosis (TRIPOD): the TRIPOD statement. Ann Inter

Med 2015;162:55–63.

[13] Crombez G, Eccleston C, Van Damme S, Vlaeyen JWS, Karoly P. Fear-

avoidance model of chronic pain: the next generation. Clin J Pain 2012;

28:475–83.

[14] Dagfinrud H, Storheim K, Magnussen LH, Odegaard T, Hoftaniska I,

Larsen LG, Ringstad PO, Hatlebrekke F, Grotle M. The predictive validity

of the ¨

Orebro Musculoskeletal Pain Questionnaire and the clinicians’

prognostic assessment following manual therapy treatment of patients

with LBP and neck pain. Man Ther 2013;18:124–9.

[15] Debray TPA, Vergouwe Y, Koffijberg H, Nieboer D, Steyerberg EW,

Moons KGM. A new framework to enhance the interpretation of external

validation studies of clinical prediction models. J Clin Epidemiol 2015;

68:279–89.

[16] Debray TPA, Damen JAAG, Snell KIE, Ensor J, Hooft L, Reitsma JB,

Riley RD, Moons KGM. A guide to systematic review and meta-analysis

of prediction model performance. BMJ 2017;356:i6460.

[17] den Boer JJ, Oostendorp RA, Evers AW, Beems T, Borm GF, Munneke

M. The development of a screening instrument to select patients at risk

of residual complaints after lumbar disc surgery. Eur J Phys Rehabil Med

2010;46:497–503.

[18] Dunstan DA, Covic T, Tyson GA, Lennie IG. Does the Orebro

Musculoskeletal Pain Questionnaire predict outcomes following

a work-related compensable injury? Int J Rehabil Res 2005;28:369–70.

[19] Eccleston C, Crombez G. Advancing psychological therapies for

chronic pain. F1000Res 2017;6:461.

[20] Field J, Newell D. Relationship between STarT Back Screening Tool and

prognosis for low back pain patients receiving spinal manipulative

therapy. Chiropr Man Therap 2012;20:17.

[21] Foster NE, Mullis R, Hill JC, Lewis M, Whitehurst DG, Doyle C,

Konstantinou K, Main C, Somerville S, Sowden G, Wathall S, Young J,

Hay EM; IMPaCT Back Study team. Effect of stratified care for low back

pain in family practice (IMPaCT Back): a prospective population-based

sequential comparison. Ann Fam Med 2014;12:102–11.

[22] Fritz JM, Benecuik JM, George SZ. Relationship between categorization

with the STarT back screening tool and prognosis for people receiving

physical therapy for low back pain. Phys Ther 2011;91:722–32.

[23] Gabel CP, Burkett B, Melloh M. The shortened ¨

Orebro Musculoskeletal

Screening Questionnaire: evaluation in a work-injured population. Man

Ther 2013;18:378–85.

[24] Gabel CP, Melloh M, Burkett B, Osborne J, Yelland M. The ¨

Orebro

Musculoskeletal Screening Questionnaire: validation of a modified

primary care musculoskeletal screening tool in an acute work injured

population. Man Ther 2012;17:554–65.

[25] Gabel CP, Melloh M, Yelland M, Burkett B, Roiko A. Predictive ability of

a modified ¨

Orebro Musculoskeletal Pain Questionnaire in an acute/

subacute low back pain working population. Eur Spine J 2011;20:

449–57.

[26] Gatchel RJ, Polatin PB, Kinney RK. Predicting outcome of chronic back

pain using clinical predictors of psychopathology: a prospective

analysis. Health Psychol 1995;14:415–20.

[27] Gatchel J, Polatin P, Mayer T. The dominant role of psychosocial risk

factors in the development of chronic low back pain disability. Spine

1996;20:2702–9.

[28] George SZ, Beneciuk JM. Psychological predictors of recovery from low

back pain: a prospective study. BMC Musculoskelet Disord 2015;16:

49.

[29] George SZ, Beneciuk JM, Lentz TA, Wu SS, Dai Y, Bialosky JE, Zeppieri

G Jr. Optimal screening for prediction of referral and outcome (OSPRO)

for musculoskeletal pain conditions: results from the validation cohort.

J Orthop Sports Phys Ther 2018;48:460–75.

[30] Gray H, Adefolarin AT, Howe TE. A systematic review of instruments for

the assessment of work-related psychosocial factors (Blue Flags) in

individuals with non-specific low back pain. Man Ther 2011;16:531–43.

[31] Grotle M, Brox JI, Glomsrod B, Lonn JH, Vollestad NK. Prognostic

factors in first-time care seekers due to acute low back pain. Eur J Pain

2007;11:290–8.

[32] Grotle M, Brox JI, Veierød MB, Glomsrød B, Lønn JH, Vøllestad NK.

Clinical course and prognostic factors in acute low back pain. Patients

consulting primary care for the first time. Spine 2005;30:976–82.

[33] Grotle M, Vollestad NK, Brox JI. Screening for yellow flags in first-time

acute low back pain: reliability and validity of a Norwegian version of the

acute low back pain screening Questionnaire. Clin J Pain 2006;2:

458–67.

[34] Gureje O, Von Korff M, Simon GE, Gater R. Persistent pain and well-

being: a World Health Organization study in primary care. JAMA 1998;

280:147–51.

[35] Hahn S, Williamson PR, Hutton JL. Investigation of within-study

selective reporting in clinical research: follow-up of applications

submitted to a local research ethics committee. J Eval Clin Pract

2002;8:353–9.

[36] Hayden JA, Cote P, Bombardier C. Evaluation of the quality of prognosis

studies in systematic reviews. Ann Intern Med 2006;44:427–37.

[37] Hayden JA, Van Der Windt DA, Cartwright JL, Cote P, Bombardier C.

Assessing bias in studies of prognostic factors. Ann Intern Med 2013;

158:280–6.

[38] Heneweer H, Aufdemkampe G, van Tulder MW, Kiers H, Stappaerts KH,

Vanhees L. Psychosocial variables in patients with (sub)acute low back

pain: an inception cohort in primary care physical therapy in the

Netherlands. Spine 2007;32:586–92.

[39] Henschke N, Maher CG, Refshauge KM, Herbert RD, Cumming RG,

Bleasel J, York J, Das A, McAuely JH. Prognosis in patients with recent

onset low back pain in Australian primary care: inception cohort study.

BMJ 2008;337:1–7.

[40] Higgins JPT, Altman DG, Sterne JAC, editors. Chapter 8: assessing risk

of bias in included studies. In: Higgins JPT, Green S, editors. Cochrane

handbook for systematic reviews of interventions version 5.1.0 [updated

March 2011]. The Cochrane Collaboration, 2011. Available at: www.

cochrane-handbook.org.Accessed January 26, 2017.

[41] Hill JC, Dunn KM, Lewis M, Mullis R, Main CJ, Foster NE, Hay EM. A

primary care back pain screening tool: identifying patient subgroups for

initial treatment. Arthritis Rheum 2008;59:632–41.

[42] Hockings RL, McAuley JH, Maher CG. A systematic review of the

predictive ability of the Orebro Musculoskeletal Pain Questionnaire.

Spine 2008;33:E494–500.

[43] Hurley DA, Dusoir TE, McDonough SM, Moore AP, Linton SJ, Baxter

GD. Biopsychosocial screening questionnaire for patients with low back

pain: preliminary report of utility in physiotherapy practice in Northern

Ireland. Clin J Pain 2000;16:214–28.

[44] Hurley DA, Dusoir TE, McDonough SM, Moore AP, Baxter GD. How

effective is the acute low back pain screening questionnaire for

predicting 1-year follow-up in patients with low back pain? Clin J Pain

2001;17:256–63.

[45] Iles RA, Davidson M, Taylor NF. Psychosocial predictors of failure to

return to work in non-chronic non-specific low back pain: a systematic

review. J Occup Environ Med 2008;65:507–17.

[46] Janssen KJM, Kalkman CJ, Grobbee D, Bonsel GJ, Moons KGM,

Vergouwe Y. The risk of severe postoperative pain: modification and

validation of a clinical prediction rule. Anesth Analg 2008;107:1330–9.

[47] Justice AC, Covinsky KE, Berlin JA. Assessing the generalizability of

prognostic information. Ann Intern Med 1999;130:515–24.

[48] Kalkman CJ, Visser K, Moen J, Bonsel GJ, Grobbee DE, Moons KG.

Preoperative prediction of severe postoperative pain. PAIN 2003;105:

415–23.

[49] Karran EL, McAuley JH, Traeger AC, Hillier SL, Grabherr L, Russek LN,

Moseley GL. Can screening instruments accurately determine poor

outcome risk in adults with recent onset low back pain? A systematic

review and meta-analysis. BMC Med 2017;15:13.

[50] Karran EL, Traeger AC, McAuley JH, Hillier SL, Yau Y, Moseley GL. The

value of prognostic screening for patients with low back pain in

secondary care. J Pain 2017;18:673–86.

[51] Kendall NAS, Linton SJ, Main CJ. Guide to assessing psychosocial

yellow flags in acute low back pain: Risk factors for long term disability

and work loss. Wellington: Accident Rehabilitation and Compensation

Insurance Corporation of New Zealand and the National Health

Committee, 1997.

[52] Khan KS, Kunz R, Kleijnen J, Antes G. Five steps to conducting

a systematic review. J R Soc Med 2003;96:118–21.

[53] Kongsted A, Andersen CH, Hansen MM, Hestbaek L. Prediction of

outcome in patients with low back pain—a prospective cohort study

comparing clinicians’ predictions with those of the Start Back Tool. Man

Ther 2016;21:120–7.

[54] Lang K, Alexander IM, Simon J, Sussman M, Lin I, Menzin J, Friedman

M, Dutwin D, Bushmakin AG, Thrift-Perry M, Altomare C, Hsu MA. The

impact of multimorbidity on quality of life among midlife women: findings

4 (2019) e775 www.painreportsonline.com 19

from a U.S. nationally representative survey. J Womens Health 2015;24:

374–83.

[55] Law RKY, Lee EWC, Law SW, Chan BKB, Chen PP, Szeto GPY. The

predictive validity of OMPQ on the rehabilitation outcomes for patients

with acute and subacute non-specific LBP in a Chinese population.

J Occup Rehabil 2013;23:361–70.

[56] Lee YH, Bang H, Kim DJ. How to establish clinical prediction models.

Endocrinol Metab 2016;31:38–44.

[57] Leeuw M, Goossens MEJB, Linton SJ, Crombez G, Boersma K, Vlaeyen

JW. The fear-avoidance model of musculoskeletal pain: current state of

scientific evidence. J Behav Med 2007;30:77–94.

[58] Lentz TA, Beneciuk JM, Bialosky JE, Zeppieri G Jr, Dai Y, Wu SS,

George SZ. Development of a yellow flag assessment tool for

orthopaedic physical therapists: results from the optimal screening for

prediction of referral and outcome (OSPRO). J Orthop Sports Phys Ther

2016;5:327–43.

[59] Leysen M, Nijs J, Meeus M, Wilgen CP, Struyf F, Vermandel A, Kuppens

K, Roussel N. Clinimetric properties of illness perception questionnaire

revised (IPQ-R) and brief illness perception questionnaire (Brief IPQ) in

patients with musculoskeletal disorders: a systematic review. Man Ther

2014;20:10–17.

[60] Linton SJ. A review of psychological risk factors in back and neck pain.

Spine (Phila Pa 1976) 2000;25:1148–56.

[61] Linton SJ, Boersma K. Early identification of patients at risk of

developing a persistent back problem: the predictive validity of the

Orebro Musculoskeletal Pain Questionnaire. Clin J Pain 2003;19:80–6.

[62] Linton SJ, Hallden K. Can we screen for problematic back pain? A

screening questionnaire for predicting outcome in acute and subacute

back pain. Clin J Pain 1998;14:209–15.

[63] Linton SJ, Nicolas M, MacDonald S. Development of a short form of the

Orebro musculoskeletal pain screening Questionnaire. Spine 2011;36:

1891–5.

[64] Maher CG, Grotle M. Evaluation of the predictive validity of the Orebro

Musculoskeletal Pain Screening Questionnaire. Clin J Pain 2009;25:

666–70.

[65] Main C, Wood P, Hollis S, Spanswick CC, Waddell G. The Distress and

Risk Assessment Method. A simple patient classification to identify

distress and evaluate the risk of poor outcome. Spine 1992;17:42–52.

[66] Margison DA, French DJ. Predicting treatment failure in the subacute

injury phase using the Orebro Musculoskeletal Pain Questionnaire: an

observational prospective study in a workers’ compensation system.

J Occup Environ Med 2007;49:59–67.

[67] Marhold C, Linton SJ, Melin L. A cognitive–behavioral return-to-work

program: effects on pain patients with a history of long-term versus

short-term sick leave. PAIN 2001;91:155–63.

[68] Melloh M, Elfe ring A, Egli Presland C, Roeder C, Barz T, Rolli Salath ´eC,

Tamcan O, Mueller U, Theis JC. Identification of prognostic factors for

chronicity in patients with low back pain: a review of screening

instruments. Int Orthop 2009;33:301–13.

[69] Miller RP, Kori SH, Todd DD. The Tampa Scale: a measure of

kinisophobia. Clin J Pain 1991;7:51–2.

[70] Moerman N, van Dam FS, Muller MJ, Oosting H. The Amsterdam

preoperative anxiety and information scale (APAIS). Anesth Analg 1996;

82:445–51.

[71] Moher D, Liberati A, Tetzlaff J, Altman DG; PRISMA Group. Preferred

reporting items for systematic reviews and meta-analyses: the PRISMA

statement. PLoS Med 2009;6:e1000097.

[72] Montori VM, Permanyer-Miralda G, Ferreira-Gonz ´alez I, Busse JW,

Pacheco-Huergo V, Bryant D, Alonso J, Akl EA, Domingo-Salvany A,

Mills E, Wu P, Sch ¨unemann HJ, Jaeschke R, Guyatt GH. Validity of

composite end points in clinical trials. BMJ 2005;330:594–6.

[73] Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P,

Steyerberg EW, Vickers AJ, Ransohoff DF, Collins GS. Transparent

reporting of a multivariable prediction model for individual prognosis or

Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med 2015;

162:W1–W73.

[74] Moons KG, de Groot JA, Bouwmeester W, Vergouwe Y, Mallett S,

Altman DG, Reitsma JB, Collins GS. Critical appraisal and data

extraction for systematic reviews of prediction modelling studies: the

CHARMS checklist. PLoS Med 2014;11:e1001744.

[75] Moons KGM, Kengne AP, Grobbee DE, Royston P, Vergouwe Y, Altman

DG, Woodward M. Risk prediction models: II. External validation, model

updating, and impact assessment. Heart 2012;98:691–8.

[76] Moons KG, Wolff RF, Riley RD, Whiting PF, Westwood M, Collins GS,

Reitsma JB, Kleijnen J, Mallett S. PROBAST: a tool to assess risk of bias

and applicability of prediction model studies: explanation and

elaboration. Ann Intern Med 2019;170:W1–W33.

[77] Morsø L, Kent P, Albert HB, Hill JC, Kongsted A, Manniche C. The

predictive and external validity of the STarT Back Tool in Danish primary

care. Eur Spine J 2013;22:1859–67.

[78] Morsø L, Kent P, Manniche C, Albert HB. The predictive ability of the

STarT Back Screening Tool in a Danish secondary care setting. Eur

Spine J 2014;23:120–8.

[79] Neubauer E, Junge A, Pirron P, Seemann H, Schiltenwolf M. HKF-R

10—screening for predicting chronicity in acute low back pain (LBP):

a prospective clinical trial. Eur J Pain 2006;10:559–66.

[80] Newell D, Field J, Pollard D. Using the STarT back tool: does timing of

stratification matter? Man Ther 2015;20:533–9.

[81] Nonclercq O, Berquin A. Predicting chronicity in acute back pain:

validation of a French translation of the Orebro Musculoskeletal Pain

Screening Questionnaire. Ann Phys Rehabil Med 2012;55:263–78.

[82] Pajouheshnia R, Damen JAAG, Groenwold R, Moons KM, Peelen LM.

Treatment use in prognostic model research: a systematic review of

cardiovascular prognostic studies. Diagn Progn Res 2017;1:15.

[83] Pajouheshnia R, Peelen LM, Moons K, Reitsma JB, Groenwold R.

Accounting for treatment use when validating a prognostic model:

a simulation study. BMC Med Res Methodol 2017;17:103.

[84] Peat G, Riley RD, Croft P, Morley KI, Kyzas PA, Moons KG, Perel P,

Steyerberg EW, Schroter S, Altman DG, Hemingway H; PROGRESS

Group. Improving the transparency of prognosis research: the role of

reporting, data sharing, registration, and protocols. PLoS Med 2014;11:

e1001671.

[85] Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A simulation

study of the number of events per variable in logistic regression analysis.

J Clin Epidemiol 1996;49:1373–9.

[86] Pengel LHM, Herbert RD, Maher CG, Refshauge KM. Acute low back

pain: systematic review of its prognosis. BMJ 2003;327:323.

[87] Phillips CJ. The cost and burden of chronic pain. Rev Pain 2009;3:2–5.

[88] Pincus T. A systematic review of psychological factors as predictors of

chronicity/disability in prospective cohorts of low back pain. Spine 2002;

27:E109–20.

[89] Porter ME, Larsson S, Lee TH. Standardizing patient outcomes

measurement. N Engl J Med 2016;374:504–6.

[90] Ramond A, Bouton C, Richard I, Roquelaure Y, Baufreton C, Legrand E,

Huez JF. Psychosocial risk factors for chronic low back pain in primary

care—a systematic review. J Fam Pract 2011;28:12–21.

[91] Reitsma JB, Rutjes AWS, Whiting P, Vlassov VV, Leeflang MMG, Deeks

JJ. Chapter 9: assessing methodological quality. In JJ Deeks, PM

Bossuyt, C Gatsonis, editors. Cochrane handbook of systematic

reviews of diagnostic test accuracy, version 1.0.0. The Cochrane

Collaboration, 2009. Available at: http://srdta.cochrane.org.Accessed

January 26, 2017.

[92] Riewe E, Neubauer E, Pfeifer AC, Schiltenwolf M. Predicting persistent

back symptoms by psychosocial risk factors: validity criteria for the

OMPSQ and the HKF-r 10 in Germany. PLoS One 2016;11:e0158850.

[93] Rodeghero JR, Cook CE, Cleland JA, Mintken PE. Risk stratification of

patients with low back pain seen in physical therapy practice. Man Ther

2015;20:855–60.

[94] Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and

prognostic research: developing a prognostic model. BMJ 2009;338:

1373–7.

[95] Saastamoinen P, Leino-Arjas P, Laaksonen M, Lahelma E. Socio-

economic differences in the prevalence of acute, chronic and disabling

chronic pain among ageing employees. PAIN 2005;114:364–71.

[96] Sackett DL, Straus SE, Richardson WS, Rosenberg W, Haynes RB.

Evidence-based medicine: how to practice and teach EBM. Edinburgh:

Churchill Livingstone, 2000.

[97] Sandborgh M, Lindberg P, Denison E. Pain belief screening instrument:

development and preliminary validation of a screening instrument for

disabling persistent pain. J Rehabil Med 2007;39:461–6.

[98] Sandborgh M, Lindberg P, Denison E. The Pain Belief Screening

Instrument (PBSI): predictive validity for disability status in persistent

musculoskeletal pain. Disabil Rehabil 2008;30:1123–30.

[99] Sattelmayer M, Lorenz T, R ¨oder C, Hilfiker R. Predictive value of the

Acute Low Back Pain Screening Questionnaire and the ¨

Orebro

Musculoskeletal Pain Screening Questionnaire for persisting

problems. Eur Spine J 2011;21(suppl 6):S773–84.

[100] Schultz IZ, Crook J, Berkowitz J, Milner R, Meloche GR. Predicting

return to work after low back injury using the psychosocial risk for

occupational disability instrument: a validation study. J Occup Rehabil

2005;15:365–76.

[101] Scott W, McCracken L. Psychological assessment to identify patients at

risk of postsurgical pain: the need for theory and pragmatism. Br J

Anaesth 2016;117:546–8.

20 E. Veirman et al.·4 (2019) e775 PAIN Reports

[102] Shahidi B, Curran-Everett D, Maluf KS. Psychosocial, physical, and

neurophysiological risk factors for chronic neck pain: a prospective

inception cohort study. J Pain 2015;16:1288–99.

[103] Shaw WS, Chin EH, Nelson CC, Reme SE, Woiszwillo MJ, Verma SK.

What circumstances prompt a workplace discussion in medical

evaluations for back pain? J Occup Rehabil 2013;23:125–34.

[104] Shaw WT, Pransky GS, Patterson WB, Winters T. Early disability risk

factors for low back pain assessed at outpatient occupational health

clinics. Spine 2005;30:572–80.

[105] Shaw WS, Reme SE, Pransky G, Woiszwillo MJ, Steenstra IA, Linton SJ.

The pain recovery inventory of concerns and expectations

a psychosocial screening instrument to identify intervention needs

among patients at elevated risk of back disability. J Occup Environ Med

2013;55:885–94.

[106] Sobol-Kwapinska M, Ba

˛bel P, Plotek W, Stelcer B. Psychological

correlates of acute postsurgical pain: a systematic review and meta-

analysis. Eur J Pain 2016;20:1573–86.

[107] Steenstra I, Verbeek J, Heymans M, Bongers P. Prognostic factors for

duration of sick leave in patients sick listed with acute low back pain:

a systematic review of the literature. Occup Environ Med 2005;62:851–60.

[108] Steyerberg EW. Clinical prediction models: A practical approach to

development, validation, and updating. New York: Springer, 2009.

[109] Steyerberg EW, Moons KG, van der Windt DA, Hayden JA, Perel P,

Schroter S, Riley RD, Hemingway H, Altman DG; PROGRESS Group.

Prognosis Research Strategy (PROGRESS) 3: prognostic model

research. PLoS Med 2013;10:e1001381.

[110] Steyerberg EW, Vergouwe Y. Towards better clinical prediction models:

seven steps for development and an ABCD for validation. Eur Heart J

2014;35:1925–31.

[111] Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski

N, Pencina MJ, Kattan MW. Assessing the performance of prediction

models: a framework for traditional and novel measures. Epidemiology

2010;21:128–38.

[112] Streibelt M,Bethge M.Prospective cohort analysis of the predictive validity of

a screening instrument for severe restrictions of work ability in patients with

musculoskeletal disorders. Am J Phys Med Rehabil 2015;94:617–26.

[113] Sullivan M, Bishop SR, Pivik J. The pain catastrophizing scale:

development and validation. Psychol Assess 1995;7:524–32.

[114] Toth C, Lander J, Wiebe S. The prevalence and impact of chronic pain

with neuropathic pain symptoms in the general population. Pain Med

2009;10:918–29.

[115] Traeger AC, Henschke N, H ¨ubscher M, Williams CM, Kamper SJ, Maher

CG, Moseley GL, McAuley JH. Estimating the risk of chronic pain:

development and validation of a prognostic model (PICKUP) for patients

with acute low back pain. PLoS Med 2016;13:e1002019.

[116] Truchon M, Schmouth ME, Cote D, Fillion L, Rossignol M, Durand MJ.

Absenteeism screening questionnaire (ASQ): a new tool for predicting

long-term absenteeism among workers with low back pain. J Occup

Rehabil 2012;22:27–50.

[117] Tu YK, Gilthorpe MS. Revisiting the relation between change and initial

value: a review and evaluation. Stat Med 2007;26:443–57.

[118] Tucker CA, Cieza A, Riley AW, Stucki G, Lai JS, Bedirhan Ustun T,

Kostanjsek N, Riley W, Cella D, Forrest CB. Concept analysis of the

patient reported outcomes measurement information system

(PROMIS®) and the International classification of functioning, disability

and Health (ICF). Qual Life Res 2014;23:1677.

[119] Tucker CA, Escorpizo R, Cieza A, Lai JS, Stucki G, Ustun TB,

Kostanjsek N, Cella D, Forrest CB. Mapping the content of the

patient-reported outcomes measurement information system

(PROMIS®) using the International classification of functioning, Health

and disability. Qual Life Res 2014;23:2431–8.

[120] Vickers AJ, Elkin EB. Decision curve analysis: a novel method for

evaluating prediction models. Med Decis Making 2006;26:565–74.

[121] Vickers AJ, Van Calster B, Steyerberg EW. Net benefit approaches to

the evaluation of prediction models, molecular markers, and diagnostic

tests. BMJ 2016;352:i6.

[122] Vlaeyen JWS, Linton SJ. Fear-avoidance and its consequences in

chronic musculoskeletal pain: a state of the art. PAIN 2000;85:

317–32.

[123] Vlaeyen JWS, Morley S, Crombez G. The experimental analysis of the

interruptive, interfering, and identity-distorting effects of chronic pain.

Behav Res Ther 2016;86:23–34.

[124] Vos CJ, Verhagen AP, Koes BW. The ability of the acute low back pain

screening Questionnaire to predict sick leave in patients with acute neck

pain. J Manipulative Physiol Ther 2009;32:178–83.

[125] Walton DM, Krebs D, Moulden D, Wade P, Levesque L, Elliott J,

MacDermid JC. The traumatic Injuries distress scale: a new tool that

quantifies distress and Has predictive validity with patient-reported

outcomes. J Orthop Sports Phys Ther 2016;46:920–8.

[126] Wilson JMG, Jungner G. Principles and practice of screening for

disease. Geneva: WHO, 1968. Available at: http://www.who.int/

bulletin/volumes/86/4/07-050112BP.pdf. Accessed January 26,

2017.

[127] Wingbermuhle RW, van Trijffel E, Nelissen PM, Koes B, Verhagen AP.

Few promising multivariable prognostic models exist for recovery of

people with non-specific neck pain in musculoskeletal primary care:

a systematic review. J Physiother 2018;64:16–23.

[128] Wolff RF, Moons KG, Riley RD, Whiting PF, Westwood M, Collins GS,

Reitsma JB, Kleijnen J, Mallett S; PROBAST Group. PROBAST: a tool to

assess the risk of bias and applicability of prediction model studies. Ann

Intern Med 2019;170:51–8.

[129] Wolff R, Whiting P, Mallet S, Riley R, Westwood M, Kleijnen K, Mallet S.

PROBAST—a risk-of-bias tool for prediction-modelling studies.

Abstracts of the global evidence summit, Cape Town, South Africa.

Cochrane Database Syst Rev 2017;9(suppl 1).

[130] World Health Organization. International classification of

functioning, disability and health (ICF) Geneva: World Health

Organization; 2001.

4 (2019) e775 www.painreportsonline.com 21

The validity and reliability of the Turkish version of the 12-item Örebro musculoskeletal screening questionnaire (ÖMSQ-12-TR)

Article

Jun 2022
DISABIL REHABIL

Purpose: The 12-item Örebro Musculoskeletal Screening Questionnaire (ÖMSQ-12) is a multidimensional questionnaire assessing general musculoskeletal problems. This study aimed to investigate its construct validity and reliability. Materials and methods: Confirmatory factor analysis (CFA) was performed for construct validity. The Tampa Scale for Kinesiophobia (TSK) and the SF-12 and Pain Numerical Rating Scale (P-NRS) were used for convergent validity. Reliability (ICC), internal consistency (Cronbach's alpha), reproducibility, and known-group validity were assessed. The cut-off value was measured. Results: A total of n = 378 individuals (aged 35.7 ± 12.4 years, female = 73.3%) with a musculoskeletal problem participated in the study. P-NRS score of the individuals was 5. Results showed that a 3-factor model did fit well under CFA (χ2/df = 2.76 ≤ 3). The questionnaire had good reliability (ICC = 0.865) and internal consistency (α = 0.810). There were no floor or ceiling effects (<%15). Total ÖMSQ-12-TR scores had a correlation with the TSK, SF-12 and P-NRS (r = 0.303-0.609). The AUC for the risk of absenteeism from work was obtained as 0.738 (p < 0.001). The risk of absenteeism was high in individuals with an ÖMSQ-12-TR score of ≥57.5. Conclusions: The ÖMSQ-12-TR is a valid and reliable questionnaire that can be used in determining the risk of absenteeism in musculoskeletal disorders and is convenient for online use. Clinical trial number: NCT04723615.

The effect of psychological factors on pain outcomes: lessons learned for the next generation of research

Article

Full-text available

Nov 2023

Big data and machine learning techniques offer opportunities to investigate the effects of psychological factors on pain outcomes. Nevertheless, these advances can only deliver when the quality of the data is high and the underpinning causal assumptions are considered. We argue that there is room for improvement and identify some challenges in the evidence base concerning the effect of psychological factors on the development and maintenance of chronic pain. As a starting point, 3 basic tenets of causality are taken: (1) cause and effect differ from each other, (2) the cause precedes the effect within reasonable time, and (3) alternative explanations are ruled out. Building on these tenets, potential problems and some lessons learned are provided that the next generation of research should take into account. In particular, there is a need to be more explicit and transparent about causal assumptions in research. This will lead to better research designs, more appropriate statistical analyses, and constructive discussions and productive tensions that improve our science.

Facteurs pronostiques précoces dans le syndrome douloureux régional complexe : une revue systématique

Article

Dec 2023

Contexte et objectif : Plusieurs facteurs de risque associés à l’apparition d’un SDRC ont été découverts, mais les preuves scientifiques concernant les facteurs pronostiques associés à la progression de cette pathologie restent rares. Toutefois, la détection et la prise en charge de ces facteurs sont nécessaires pour élaborer des stratégies de prévention secondaire. L’objectif de cette revue systématique était d’identifier les facteurs pronostiques chez les adultes souffrant d’un SDRC précoce. Base de données et traitement des données : PubMed, Embase, PsycINFO, Cochrane Library et Scopus, publiées entre janvier 1990 et novembre 2021. Deux investigateurs indépendants ont sélectionné les études transversales et longitudinales s’intéressant aux facteurs pronostiques précoces (< 12 semaines après l’apparition de la maladie) de la douleur, du score de sévérité du SDRC, de l’incapacité fonctionnelle, du retour au travail ou de la qualité de vie. L’outil QUIPS (Quality In Prognostic Studies) a été utilisé pour évaluer le risque de biais. Une métasynthèse qualitative a été réalisée. Résultats : Sur 4 652 articles différents, six études répondaient aux critères d’inclusion. Nous avons identifié 21 facteurs précoces associés à un pronostic défavorable dans le SDRC de type I. Six d’entre eux présentaient un niveau de preuves modéré : intensité de la douleur, incapacité fonctionnelle, anxiété, peur du mouvement (kinésiophobie), sexe féminin et intensité du traumatisme physique déclencheur. Seules deux études présentaient un risque de biais globalement faible. Conclusions : Cette étude a révélé un manque important d’informations sur les facteurs pronostiques précoces dans le SDRC. Un seul article s’est intéressé au lien entre le risque de chronicité et les caractéristiques psychologiques. Il est indispensable de réaliser des études de plus grande envergure, avec une population bien définie et des mesures validées.

Epidemiology of chronic pain and opioid use in primary care – a scoping review of big data research

Preprint

Full-text available

Jun 2023

Objective Recent research addressing the opioid use and misuse crisis in patients with chronic non-cancer pain in primary care has focused on traditional cohort studies underpinned by survey data. The advent of electronic health records creates a ′big data′ opportunity for improving our understanding of the epidemiology of chronic non-cancer pain in primary care and opioid use and misuse. This scoping review aimed to map the chronic non-cancer pain patient population in primary care using big data research, investigating the patient characteristics and opioid prescription patterns. Methods Searches of primary electronic databases and grey literature, including OVID, CINAHL, and Scopus, were performed from January 1, 2010 to December 2, 2022. The search strategy was restricted to the English language. Results A total of 1,057 records from databases and 515 records from grey literature were considered. Of these, only three articles met the eligibility criteria, and two articles of these reported an estimated chronic pain prevalence of 3.82% and 10.3% in the primary care setting. Chronic pain patients that presented to primary care providers were predominately female, and common comorbidities were anxiety and depression. An estimated 30% of chronic pain patients used opioids for treatment sourced from general practitioners and family practitioners. Conclusion The use of big data remains underutilized for investigating the epidemiology of chronic pain and opioid use in primary care. This review calls for a greater focus on pain informatics with big data to improve the accuracy of future clinical chronic pain epidemiology studies.

Investigation of the effect of technology and internet addictions on the musculoskeletal system in university students during the post-pandemic period

Article

Full-text available

Dec 2022

Emel Taşvuran Horata

Objective: The COVID-19 pandemic caused the habits of university students to have spent more time with technological devices and the internet. This study is aimed to investigate the effect of technology and internet addictions of university students on the musculoskeletal (MSK) problems during the post-pandemic period. Materials and Methods: This cross-sectional study was conducted with 368 university students. The Nordic Musculoskeletal Questionnaire, Pain Numerical Rating Scale, Technology Addiction Scale (TAS), Young's Internet Addiction Test-Short Form (YIAT-SF), and Örebro Musculoskeletal Screening Questionnaire-12-TR (Örebro-12-TR) were applied. Multiple linear regression analysis was performed to assess the effect of technology addiction and internet addiction on the MSK problem. Results: The mean TAS score of the participants was 45.94±15.46, the mean YIAT-SF score was 24.56±9.52, and the mean Örebro-12-TR score was 35.55±17.14. Technology (p=0.037) and internet addiction (p=0.001) variables had a significant effect on MSK problem. This model can explain 18.4% of the total variance in the risk of developing MSK problems (adjusted R2=0.184). Conclusion: This study showed that internet and technology addictions affected the MSK problems during the post-pandemic period. Interventions and training programs could reduce the risk of MSK problems.

Dealing with Chronic Pain: Tips to Teach Undergraduate Health Sciences Students

Article

Full-text available

Jan 2022

Chronic pain is a significant and costly problem all over the world that negatively impacts the quality of life of sufferers. There are clear discrepancies between the prevalence of chronic pain in society and the low priority assigned to educating future physicians about the complexities of pain. This condition also occurs in other undergraduate health science students, although research in this area has not been studied as much as in medical schools. Based on the International Association for the Study of Pain (IASP) Pain Curriculum Outline, a systematic search of the available literature, and the authors’ own experiences, we highlight some relevant tips to educate health science trainees in the management of patients with chronic pain. These tips highlight current international recommendations for a comprehensive approach to this prevalent problem in society, which should be learnt during the university training of health professionals.

Australian physiotherapists attitudes, perceptions, and behaviours towards psychosocial screening tools: a qualitative interpretive description study

Article

Mar 2024
DISABIL REHABIL

Purpose: Psychosocial factors are a barrier to recovery for people with musculoskeletal pain and psychosocial screening tools are consistently recommended by best practice guidelines to assist in identification. However, many physiotherapists do not use these tools. Presently, the perspectives on psychosocial screening tools of Australian physiotherapists are unknown. Exploration of these factors may create targets for increased uptake. The purpose of this paper is to qualitatively explore Australian physiotherapists' attitudes, perceptions, and behaviours towards psychosocial screening tools for musculoskeletal pain conditions. Materials and methods: An Interpretive description qualitative study design was employed. Seventeen Australian physiotherapists were interviewed about their attitudes, perceptions, and behaviours towards psychosocial screening tools. Interviews were transcribed verbatim and analysed according to interpretive description. Results: Analysis highlighted three major themes: (1) understanding the patient through psychosocial screening, (2) confidence and competence with psychosocial factors, and (3) factors outside of my control influence screening. Conclusions: This study presents a deeper understanding of Australian physiotherapists' diverse attitudes and practices regarding psychosocial screening tools. The research highlights not only the variability in perspectives towards the relevance of psychosocial factors in patient assessments, but also the influence of external elements such as patient demographics and clinic culture on the utilization of these screening methods.

Clinician's Commentary on Robarts et al.1

Article

May 2023

Do lifestyle factors influence pain prognosis? A 1-year follow-up study

Article

Jan 2023

Objective: The aim of this observational longitudinal study was to investigate the impact of lifestyle factors on the prognosis of patients with pain. Methods: This study was part of a large prospective longitudinal study conducted in general practice (GP). Participants completed questionnaires at baseline (T0) and one year later (T1). Outcomes analysed were the EQ-5D index, presence of pain and the ability to perform a light work for 1 hour without difficulty. Results: Among 377 individuals with pain at T0, 294 still reported pain at T1. This subgroup had a significantly higher BMI, more painful sites, higher pain intensity, more sleep problems, poorer general self-rated health (GSRH) and higher Örebro Musculoskeletal Pain Screening Questionnaire (ÖMPSQ) score at T0 than pain-free individuals at T1. There were no differences in age, sex, physical activity and smoking. In multivariable analyses, the number of painful sites, GSRH, sleep problems, pain duration, pain intensity and 2 short-form 10-item Örebro musculoskeletal pain questionnaire (SF-ÖMPSQ) items were independently associated with at least one outcome 1 year later. Only GSRH was strongly associated with all outcomes. The accuracy of GSRH at T0 to classify participants according to dichotomous outcomes was overall moderate (0.7 < AUC <0.8). Conclusions: Lifestyle factors appear to have little influence on the outcome of patients with pain in GP. Conversely, poorer GSRH – which probably integrates the subjects’ perception of several factors – could be considered a negative prognostic factor in patients with pain.

Biological and psychological early prognostic factors in complex regional pain syndrome: A systematic review

Article

Dec 2022

Background and Objective Several risk factors for the onset of CRPS have been found, but evidence for prognostic factors associated with progression of this condition remains sparse. However, detection and management of these factors are necessary to design secondary prevention strategies. The objective of this systematic review was to identify prognostic factors in adult individuals with early CRPS. Database and Data Treatment PubMed, Embase, PsycINFO, Cochrane Library, and Scopus, published between Jan 1990 and Nov 2021. Two independent investigators selected cross-sectional and longitudinal studies looking at early (<12 weeks from onset) prognostic factors for pain, CRPS severity score, disability, return to work, or Quality of life. The Quality In Prognostic Studies (QUIPS) tool was used to assess the risk of bias. A qualitative meta-synthesis was performed. Results Out of 4,652 different articles, six studies met inclusion criteria. We identified 21 early factors associated with poorer prognosis in type I CRPS. We found moderate evidence to support six of them: higher pain intensity, self-rated disability, anxiety, pain-related fear, being a female, and high-energy triggering event. Only two studies had an overall low risk of bias. Conclusions This study showed an important lack of information on early prognostic factors in CRPS. Only one article investigated the link with psychological characteristics. There is a crucial need of larger studies, with well-defined population using validated measures.

PROBAST: A Tool to Assess Risk of Bias and Applicability of Prediction Model Studies: Explanation and Elaboration

Article

Full-text available

Jan 2019

Prediction models in health care use predictors to estimate for an individual the probability that a condition or disease is already present (diagnostic model) or will occur in the future (prognostic model). Publications on prediction models have become more common in recent years, and competing prediction models frequently exist for the same outcome or target population. Health care providers, guideline developers, and policymakers are often unsure which model to use or recommend, and in which persons or settings. Hence, systematic reviews of these studies are increasingly demanded, required, and performed. A key part of a systematic review of prediction models is examination of risk of bias and applicability to the intended population and setting. To help reviewers with this process, the authors developed PROBAST (Prediction model Risk Of Bias ASsessment Tool) for studies developing, validating, or updating (for example, extending) prediction models, both diagnostic and prognostic. PROBAST was developed through a consensus process involving a group of experts in the field. It includes 20 signaling questions across 4 domains (participants, predictors, outcome, and analysis). This explanation and elaboration document describes the rationale for including each domain and signaling question and guides researchers, reviewers, readers, and guideline developers in how to use them to assess risk of bias and applicability concerns. All concepts are illustrated with published examples across different topics. The latest version of the PROBAST checklist, accompanying documents, and filled-in examples can be downloaded from www.probast.org.

PROBAST: A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies

Article

Full-text available

Jan 2019

Clinical prediction models combine multiple predictors to estimate risk for the presence of a particular condition (diagnostic models) or the occurrence of a certain event in the future (prognostic models). PROBAST (Prediction model Risk Of Bias ASsessment Tool), a tool for assessing the risk of bias (ROB) and applicability of diagnostic and prognostic prediction model studies, was developed by a steering group that considered existing ROB tools and reporting guidelines. The tool was informed by a Delphi procedure involving 38 experts and was refined through piloting. PROBAST is organized into the following 4 domains: participants, predictors, outcome, and analysis. These domains contain a total of 20 signaling questions to facilitate structured judgment of ROB, which was defined to occur when shortcomings in study design, conduct, or analysis lead to systematically distorted estimates of model predictive performance. PROBAST enables a focused and transparent approach to assessing the ROB and applicability of studies that develop, validate, or update prediction models for individualized predictions. Although PROBAST was designed for systematic reviews, it can be used more generally in critical appraisal of prediction model studies. Potential users include organizations supporting decision making, researchers and clinicians who are interested in evidence-based medicine or involved in guideline development, journal editors, and manuscript reviewers.

Optimal Screening for Prediction of Referral and Outcome (OSPRO) for Musculoskeletal Pain Conditions: Results From the Validation Cohort

Article

Full-text available

Apr 2018

Study Design Observational, prospective cohort. Background Musculoskeletal pain is a common reason to seek healthcare and earlier non-pharmacological treatment and enhancement of personalized care options are two high priority areas. Validating concise assessment tools is an important step in establishing better care pathways. Objectives To determine the predictive validity of Optimal Screening for Prediction of Referral and Outcome (OSPRO) tools for individuals with neck, low back, shoulder, or knee pain. Methods A convenience sample (n = 440) was gathered by Orthopaedic Physical Therapy-Investigator Network clinics (n = 9). Participants completed questionnaires for demographic, clinical, comorbidity, and the OSPRO tools and were followed for 12-month outcomes in pain intensity, region-specific disability, quality of life, and comorbidity change. Analyses predicted these 12-month outcomes with models that included the OSPRO review of systems and yellow flag tools and planned covariates (accounting for comorbidities and established demographic and clinical factors). Results The 10 item OSPRO yellow flag tool (baseline and 4 week change score) consistently added to predictive models for 12-month pain intensity, region-specific disability, and quality of life. The 10 item OSPRO review of system tool added to a predictive model for quality of life (mental summary score) and 13 additional items of the OSPRO review of system+ tool added to prediction of 12-month comorbidity change. Other consistent predictors included age, race, income, previous episode of pain in same region, comorbidity number, and baseline measure for the outcome of interest. Conclusion The OSPRO review of system and yellow flag tools statistically improved prediction of multiple 12 month outcomes. The additional variance explained was small and future research is necessary to determine if these tools can be used as measurement adjuncts to improve management of musculoskeletal pain. J Orthop Sports Phys Ther, Epub 7 Apr 2018. doi:10.2519/jospt.2018.7811.

Core outcome measurement instruments for clinical trials in non-specific low back pain

Article

Full-text available

Nov 2017

To standardize outcome reporting in clinical trials of patients with non-specific low back pain (LBP), an international multidisciplinary panel recommended physical functioning, pain intensity, and health-related quality of life (HRQoL) as core outcome domains. Given the lack of consensus on measurement instruments for these three domains in patients with LBP, this study aimed to generate such consensus. The measurement properties of 17 patient-reported outcome measures for physical functioning, three for pain intensity, and five for HRQoL were appraised in three systematic reviews following COSMIN methodology. Researchers, clinicians and patients (n = 207) were invited in a two-round Delphi survey to generate consensus ( ≥ 67% agreement among participants) on which instruments to endorse. Response rates were 44% and 41%, respectively. In Round 1, consensus was achieved on the Oswestry Disability Index version 2.1a (ODI 2.1a) for physical functioning (78% agreement) and the Numeric Rating Scale (NRS) for pain intensity (75% agreement). No consensus was achieved on any HRQoL instrument, although the Short Form 12 (SF12) approached the consensus threshold (64% agreement). In Round 2, consensus was reached on a NRS version with a 1-week recall period (96% agreement). Various participants requested one free-to-use instrument per domain. Considering all issues together, recommendations on core instruments were formulated: ODI 2.1a or 24-item Roland-Morris Disability Questionnaire for physical functioning, NRS for pain intensity, SF12 or 10-item PROMIS Global Health form for HRQoL. Further studies need to fill the evidence gaps on the measurement properties of these and other instruments.

Five Steps to Conducting a Systematic Review

Article

Full-text available

Mar 2003

Treatment use in prognostic model research: a systematic review of cardiovascular prognostic studies

Article

Full-text available

Sep 2017

Background Ignoring treatments in prognostic model development or validation can affect the accuracy and transportability of models. We aim to quantify the extent to which the effects of treatment have been addressed in existing prognostic model research and provide recommendations for the handling and reporting of treatment use in future studies. Methods We first describe how and when the use of treatments by individuals in a prognostic study can influence the development or validation of a prognostic model. We subsequently conducted a systematic review of the handling and reporting of treatment use in prognostic model studies in cardiovascular medicine. Data on treatment use (e.g. medications, surgeries, lifestyle interventions), the timing of their use, and the handling of such treatment use in the analyses were extracted and summarised. Results Three hundred two articles were included in the review. Treatment use was not mentioned in 91 (30%) articles. One hundred forty-six (48%) reported specific information about treatment use in their studies; 78 (26%) provided information about multiple treatments. Three articles (1%) reported changes in medication use (“treatment drop-in”) during follow-up. Seventy-nine articles (26%) excluded treated individuals from their analysis, 80 articles (26%) modelled treatment as an outcome, and of the 155 articles that developed a model, 86 (55%) modelled treatment use, almost exclusively at baseline, as a predictor. Conclusions The use of treatments has been partly considered by the majority of CVD prognostic model studies. Detailed accounts including, for example, information on treatment drop-in were rare. Where relevant, the use of treatments should be considered in the analysis of prognostic model studies, particularly when a prognostic model is designed to guide the use of certain treatments and these treatments have been used by the study participants. Future prognostic model studies should clearly report the use of treatments by study participants and consider the potential impact of treatment use on the study findings. Electronic supplementary material The online version of this article (10.1186/s41512-017-0015-0) contains supplementary material, which is available to authorized users.

Accounting for treatment use when validating a prognostic model: A simulation study

Article

Full-text available

Jul 2017
BMC MED RES METHODOL

Background Prognostic models often show poor performance when applied to independent validation data sets. We illustrate how treatment use in a validation set can affect measures of model performance and present the uses and limitations of available analytical methods to account for this using simulated data. Methods We outline how the use of risk-lowering treatments in a validation set can lead to an apparent overestimation of risk by a prognostic model that was developed in a treatment-naïve cohort to make predictions of risk without treatment. Potential methods to correct for the effects of treatment use when testing or validating a prognostic model are discussed from a theoretical perspective.. Subsequently, we assess, in simulated data sets, the impact of excluding treated individuals and the use of inverse probability weighting (IPW) on the estimated model discrimination (c-index) and calibration (observed:expected ratio and calibration plots) in scenarios with different patterns and effects of treatment use. Results Ignoring the use of effective treatments in a validation data set leads to poorer model discrimination and calibration than would be observed in the untreated target population for the model. Excluding treated individuals provided correct estimates of model performance only when treatment was randomly allocated, although this reduced the precision of the estimates. IPW followed by exclusion of the treated individuals provided correct estimates of model performance in data sets where treatment use was either random or moderately associated with an individual's risk when the assumptions of IPW were met, but yielded incorrect estimates in the presence of non-positivity or an unobserved confounder. Conclusions When validating a prognostic model developed to make predictions of risk without treatment, treatment use in the validation set can bias estimates of the performance of the model in future targeted individuals, and should not be ignored. When treatment use is random, treated individuals can be excluded from the analysis. When treatment use is non-random, IPW followed by the exclusion of treated individuals is recommended, however, this method is sensitive to violations of its assumptions. Electronic supplementary material The online version of this article (doi:10.1186/s12874-017-0375-8) contains supplementary material, which is available to authorized users.

Advancing psychological therapies for chronic pain

Article

Full-text available

Apr 2017

There is a strong tradition of therapy development and evaluation in the field of psychological interventions for chronic pain. However, despite this research production, the effects of treatments remain uncertain, and treatment development has stalled. This review summarises the current evidence but focusses on promising areas for improvement. Advancing psychological therapies for chronic pain will come from a radical re-imagining of the content, delivery, place, and control of therapy. The next generation of therapeutic interventions will also need alternative methods of measurement and evaluation, and options are discussed.

Core outcome sets for research and clinical practice

Article

Full-text available

Mar 2017
REV BRAS FISIOTER

Background This masterclass introduces the topic of core outcome sets, describing rationale and methods for developing them, and providing some examples that are relevant for clinical research and practice. Method A core outcome set is a minimum consensus-based set of outcomes that should be measured and reported in all clinical trials for a specific health condition and/or intervention. Issues surrounding outcome assessment, such as selective reporting and inconsistency across studies, can be addressed by the development of a core set. As suggested by key initiatives in this field (i.e. OMERACT and COMET), the development requires achieving consensus on: (1) core outcome domains and (2) core outcome measurement instruments. Different methods can be used to reach consensus, including: literature systematic reviews to inform the process, qualitative research with clinicians and patients, group discussions (e.g. nominal group technique), and structured surveys (e.g. Delphi technique). Various stakeholders should be involved in the process, with particular attention to patients. Results and conclusions Several COSs have been developed for musculoskeletal conditions including a longstanding one for low back pain, IMMPACT recommendations on outcomes for chronic pain, and OMERACT COSs for hip, knee and hand osteoarthritis. There is a lack of COSs for neurological, geriatric, cardio-respiratory and pediatric conditions, therefore, future research could determine the value of developing COSs for these conditions.

Few promising multivariable prognostic models exist for recovery of people with non-specific neck pain in musculoskeletal primary care: A systematic review

Article

Dec 2017

Question: Which multivariable prognostic model(s) for recovery in people with neck pain can be used in primary care? Design: Systematic review of studies evaluating multivariable prognostic models. Participants: People with non-specific neck pain presenting at primary care. Determinants: Baseline characteristics of the participants. Outcome measures: Recovery measured as pain reduction, reduced disability, or perceived recovery at short-term and long-term follow-up. Results: Fifty-three publications were included, of which 46 were derivation studies, four were validation studies, and three concerned combined studies. The derivation studies presented 99 multivariate models, all of which were at high risk of bias. Three externally validated models generated usable models in low risk of bias studies. One predicted recovery in non-specific neck pain, while two concerned participants with whiplash-associated disorders (WAD). Discriminative ability of the non-specific neck pain model was area under the curve (AUC) 0.65 (95% CI 0.59 to 0.71). For the first WAD model, discriminative ability was AUC 0.85 (95% CI 0.79 to 0.91). For the second WAD model, specificity was 99% (95% CI 93 to 100) and sensitivity was 44% (95% CI 23 to 65) for prediction of non-recovery, and 86% (95% CI 73 to 94) and 55% (95% CI 41 to 69) for prediction of recovery, respectively. Initial Neck Disability Index scores and age were identified as consistent prognostic factors in these three models. Conclusion: Three externally validated models were found to be usable and to have low risk of bias, of which two showed acceptable discriminative properties for predicting recovery in people with neck pain. These three models need further validation and evaluation of their clinical impact before their broad clinical use can be advocated. Registration: PROSPERO CRD42016042204. [Wingbermühle RW, van Trijffel E, Nelissen PM, Koes B, Verhagen AP (2018) Few promising multivariable prognostic models exist for recovery of people with non-specific neck pain in musculoskeletal primary care: a systematic review. Journal of Physiotherapy XX: XX-XX].

Multidimensional screening for predicting pain problems in adults: A systematic review of screening tools and validation studies

Abstract and Figures

Recommended publications

Static and Dynamic Balance in Adults Undergoing Lumbar Spine Surgery: Screening and Prediction of Po...

Replication of validation study of the Chew HL screening tool in an adult Italian population

Systematic Review of Psychometric Characteristics of the SASSI–3

Establishment of reference range for prostate volume and prediction of future volume change with age...