ArticlePDF Available

Propensity score-based adjustment for covariate effects on classification accuracy of bio-marker using ROC curve

Authors:

Abstract and Figures

The potential performance of bio-marker in classifying diseased from healthy population may be affected by baseline covariates (X) that are associated with both the bio-marker (Y) and the disease status (D). Some existing approaches can be able to adjust for the effect of a single covariate at a time. However, several potential covariates can be available in practice for which simultaneous adjustment in the ROC curve is essential. This study proposed a propensity score (PS) based adjustment for the effects of several covariates in the ROC curve. The PS is first derived from a linear transformation of several covariates and the PS-adjusted (and PS-specific) ROC curve was then estimated using the existing non-parametric induced ROC regression framework. The method is illustrated for both continuous and binary bio-markers. The simulation study suggests that the PS-based adjustment performed well by providing a consistent estimate of the true ROC curve and showing robustness to the mis-specification of the propensity score model as well as to a non-linear function of covariates. Further, an application of the method is provided to evaluate the effectiveness of the body-mass-index in classifying patients with hypertension or diabetes after adjusting for the potential covariates such as age, sex, education, socio-economic status.
Content may be subject to copyright.
Full Terms & Conditions of access and use can be found at
https://www.tandfonline.com/action/journalInformation?journalCode=tbep20
Biostatistics & Epidemiology
ISSN: (Print) (Online) Journal homepage: https://www.tandfonline.com/loi/tbep20
Propensity score-based adjustment for covariate
effects on classification accuracy of bio-marker
using ROC curve
Muntaha Mushfiquee & M. Shafiqur Rahman
To cite this article: Muntaha Mushfiquee & M. Shafiqur Rahman (2022): Propensity score-
based adjustment for covariate effects on classification accuracy of bio-marker using ROC curve,
Biostatistics & Epidemiology, DOI: 10.1080/24709360.2022.2131994
To link to this article: https://doi.org/10.1080/24709360.2022.2131994
Published online: 24 Oct 2022.
Submit your article to this journal
View related articles
View Crossmark data
BIOSTATISTICS & EPIDEMIOLOGY
https://doi.org/10.1080/24709360.2022.2131994
Propensity score-based adjustment for covariate effects on
classification accuracy of bio-marker using ROC curve
Muntaha Mushfiquee and M. Shafiqur Rahman
Institute of Statistical Research and Training, University of Dhaka, Dhaka, Bangladesh
ABSTRACT
The potential performance of bio-marker in classifying diseased from
healthy population may be affected by baseline covariates (X) that
are associated with both the bio-marker (Y) and the disease status
(D). Some existing approaches can be able to adjust for the effect
of a single covariate at a time. However, several potential covariates
can be available in practice for which simultaneous adjustment in
the ROC curve is essential. This study proposed a propensity score
(PS) based adjustment for the effects of several covariates in the ROC
curve. The PS is first derived from a linear transformation of several
covariates and the PS-adjusted (and PS-specific) ROC curve was then
estimated using the existing non-parametric induced ROC regression
framework. The method is illustrated for both continuous and binary
bio-markers. The simulation study suggests that the PS-based adjust-
ment performed well by providing a consistent estimate of the true
ROC curve and showing robustness to the mis-specification of the
propensity score model as well as to a non-linear function of covari-
ates. Further, an application of the method is provided to evaluate
the effectiveness of the body-mass-index in classifying patients with
hypertension or diabetes after adjusting for the potential covariates
such as age, sex, education, socio-economic status.
ARTICLE HISTORY
Received 10 March 2022
Accepted 28 September 2022
KEYWORDS
Induced ROC regression;
non-parametric method;
body-mass-index;
classification
1. Introduction
In medicine, bio-marker plays a vital role in diagnosing the risk of developing a disease,
the absence or presence of a disease and disease progression [1,2]. For example, body
temperature is a well-known bio-marker for fever; cholesterol values are considered as a
bio-marker and risk indicator for coronary and vascular disease and blood pressure is used
to determine the risk of stroke [3,4]. Therefore, one of the recent advances in bio-medical
research is the discovery of novel bio-markers and quantication of risk oered by the
marker [57]. Furthermore, sometimes, the controlled clinical trials have been conducted
to determine if altering the concentrations of new bio-marker lead to the improvement
in patient prognosis. For example, the inammatory marker CRP has been extensively
studied in cardiovascular medicine to determine its potentiality to quantify the risk of
developing cardiovascular disease [810]. The usefulness of the new marker depends on
CONTACT M. Shafiqur Rahman shafiq@isrt.ac.bd Institute of Statistical Research and Training, University of
Dhaka, Dhaka 1000, Bangladesh
© 2022 International Biometric Society Chinese Region
2M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
how much they add extra predictive value in correctly classifying the diseased subjects
from the non-diseased subjects [5,6].
Given the importance of bio-marker in disease classication, it is essential to evaluate
their classication accuracy- the ability to provide the correct diagnosis given a subject’s
true disease status (discrimination). The receiver operating characteristic (ROC) curve, a
graphical plot of sensitivity (true positive rate) against the 1-specicity (false positive rate)
evaluated at dierent cuto values of the diagnostic marker, is commonly used to evaluate
the binary classication accuracy of the marker [1113]. The area under the curve (AUC)
is used as a numerical summary measure of the classication accuracy, which is interpreted
as the probability that for a randomly selected pair of subjects, the subject with disease has
a larger marker value than the subject without disease. It ranges between 0.5 and 1, with the
value of 0.5 indicating random classication by the marker like a coin toss and 1 indicating
perfect classication.
The potential performance of a bio-marker may be strongly inuenced by both patients
and disease characteristics or features of the specimen collection or test settings. As dis-
cussed by Janes and Pepe [14,15], covariate can be associated with both bio-marker and
thediseaseoutcomeandactedasaconfounderoreectmodier.Henceitcanaect
the classication accuracy in two dierent ways: (i) it can confound the performance of
the bio-marker in disease classication but not aect the discriminatory power (it is not
an eect modier) (ii) it can aect classication accuracy (discriminatory power) of the
marker at its dierent levels (it is an eect modier). For the rst case, the classication
accuracy (discriminatory power) of the marker is the same across dierent levels (groups)
ofthecovariatevaluesbutdierentfromthoseforthepooleddata,becausethedistri-
bution of bio-marker for cases and control vary across dierent levels of the covariates.
HencetheadjustmentforthecovariateeectontheROCcurveisnecessary,otherwise,
the traditional pooled ROC curve may provide misleading conclusion on the performance
of the marker. The adjusted ROC (aROC) is analogous to the adjusted odds-ratio in asso-
ciation study in epidemiology and statistics [15]. For the second case, the classication
accuracy diers for dierent levels of the covariates, for which conditional or covariate-
specic ROC curve (for a specic value of the covariate) can be considered. However, for
the continuous covariate and small sample situation covariate-specic ROC curve cannot
be estimated with precision. In this situation, to provide a summary of the classication
accuracy, covariate-adjusted ROC curve is usually estimated from weighted average of
covariate-specic ROC curve, with weights corresponding to the proportion of cases in
each covariate group [16,17].
To assess possible covariate eects on ROC curve, a considerable number of approaches
have been discussed in literature. Of them, an attempt was made by a few studies to include
all covariates in multivariable models and then produce a ROC curve for the linear predic-
tor ηi=
ˆ
βXiderived from the model [13,18]. However, the ROC curve derived from the
linearpredictorofamultivariablemodelisnottheROCcurveofbio-markerYadjusted
for covariate value X1and X2, but rather it captures the combined ability of both the bio-
marker and covariates to discriminate between case and control. This combination can
perform well even if Yis a poor classier and X’s discriminate well [14,16]. Further, the
covariate adjustment by comparing nested models (model with and without bio-marker
Y) may produce misleading conclusion if both bio-marker and covariate are correlated.
This approach provides how much accuracy improves with the addition of Yto X, i.e.
BIOSTATISTICS & EPIDEMIOLOGY 3
the incremental value, which is dierent from the covariate-adjusted performance of a
bio-marker [14].
To adjust for the covariate eects on the ROC curve, two dierent regression approaches
(‘induced’ and ‘direct’ methodology) have been suggested in literature. In the induced
regression [19,20], separate regression models for bio-marker (diagnostic results) for dis-
eased and healthy population are specied, and the covariate eects on the associated ROC
curve are then computed by deriving an induced form of the ROC curve. Whereas in
thedirectmethodology[21,22], a regression model for conditional ROC curve with the
eects of the covariates is directly specied, instead of considering the bio-marker val-
ues. Both these approaches are based on a generalized linear model (GLM) framework to
characterize the covariate eects on ROC curve. However, using the parametric model for
evaluatingthecovariateseects,particularlythecontinuouseectsofcontinuouscovari-
ates, on ROC curve may provide misleading conclusions if the eects are incorrectly
specied and/or the distribution assumption of the bio-marker is mis-specied. To address
these problems, Rodríguez–Alvarez et al. [17]proposedanon-parametricapproachforthe
induced-regression methodology where only an assumption is required that the eect of
continuous covariate follows a smooth function. However, the non-parametric approach
of Rodríguez–Alvarez et al. [17]suersfromnotbeingabletoaddressmorethanone
covariate at a time, because of the computational complexity of distribution function in
the non-parametric regression setting. However, in practice, a mixture of several continu-
ous and categorical covariates is commonly available, for which methodological extension
in ROC analysis is required to adjust for the eects of several covariates simultaneously.
The issues of covariate adjustment in ROC analysis are similar to the adjustment for
theeectsofbaselinecovariates(confounders)inestimatingthemarginaleectsofa
treatment/exposure, which is commonly addressed by performing propensity score (PS)
analysis [2325]. Some recent studies applied propensity score in ROC analysis [26,27].
Of them, the study by Han et al. [26] applied covariate balancing propensity score when
estimatingtheAUCforabio-markeraimingthatbalancinginbio-markerprevalencemay
provide greater AUC value, i.e. better discriminatory power of the bio-marker. In the study
by Galadima and McClish [27], they used PS in estimating conditional AUC after control-
ling for confounders, where the AUC is used as the measure of eect of a dichotomous
exposure when outcome is continuous and interpreted as the probability that a randomly
selected non-exposed subject has a better response than a randomly selected exposed sub-
ject, which is, however, not the same to the AUC metric generally estimated for evaluating
the classication accuracy of a bio-marker. Moreover, none of them serves the necessity
of covariate adjustment in ROC analysis when a set of covariates confound the perfor-
mance of bio-marker in disease classication without aecting the discriminatory power.
With this background, this paper proposes propensity (PS) score-based adjustment for the
covariate eects in ROC analysis, where a set of both categorical and continuous covari-
ates can be adjusted simultaneously [28]. This can be performed by calculating PS from a
linear transformation between the bio-marker and the set of covariates through a multi-
variable regression model and using it as a single covariate, instead of all covariates, in the
existing non-parametric ROC regression setup. The process is very straightforward and
simple and the resulting ROC estimators (both PS-specic and PS-adjusted) have shown
to have the intuitive interpretation and statistical properties of a good estimator. Details of
the proposed approach with motivation are discussed in Section 3.
4M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
This paper is organized as follows. Section 2describes the existing non-parametric
method of covariate adjustment in ROC curve, and Section 3describes the proposed PS
based adjustment. An extensive simulation study is discussed in Section 4and an applica-
tion of the methods is in Section 5. Section 6ends the paper with some discussions of the
main ndings and concluding remarks.
2. ROC curve and covariate adjustment
2.1. Overview on ROC curve
Let us consider Ybe a random variable representing a continuous bio-marker values. The
diagnosis according to any cuto value cis positive if Ycand negative if Y<c.Let
D1and D0denotes the diseased and non-diseased populations, respectively. Now the true
positive rate TPR(c)andthefalsepositiverateFPR(c)at any cuto value care dened as:
TPR(c)=P(Yc|D1)=1FD1(c)
FPR(c)=P(Yc|D0)=1FD0(c),
where FD1and FD0are distribution function YD1and YD2,respectively.Inatheoretical
ROC curve, the true positive rate (sensitivity) is plotted in function of the false positive
rate (1-specicity) for dierent cut-o points [29].EachpointontheROCcurverepresents
a sensitivity/specicity pair corresponding to a particular decision threshold. A test with
perfect discrimination (no overlap in the two distributions) has a ROC curve that passes
through the upper left corner (100% sensitivity, 100% specicity) [30]. Therefore, the closer
theROCcurveistotheupperleftcorner,thehighertheoverallaccuracyofthetest[17]. Let
us consider pistheallfalsepositiveratesaccordingtothevaryingvaluesofc(−∞,)[31].
The ROC curve can be dened such that:
ROC(p)=1FD1[F1
D0(1p)], p(0, 1).
The most commonly used value to summarize the accuracy is the area under the ROC
curve
AUC =1
0
ROC(p)dp=P(YD1>YD0).
This can be interpreted as the probability that in a randomly selected pair of healthy and
diseased individuals, the diagnostic marker value is higher for the diseased subject. Values
of AUC close to 1.0 indicate that the marker has high diagnostic accuracy [32,33].
2.2. Covariate-adjusted ROC curve for single covariate
Let us consider a continuous bio-marker Yand a continuous covariate X.Thenusingthe
similar notations in the paper by Janes and Pepe [15]theTPRandFPRforY|Xat any
cuto are dened such that,
TPRX(c)=1FD1(c|X)=Pr YD1>c|X,(1)
FPRX(c)=1FD0(c|X)=Pr YD0>c|X.(2)
BIOSTATISTICS & EPIDEMIOLOGY 5
Then the covariate-specic ROC curve can be dened as
ROCX(p)=1FD1(X)F1
D0(X)(1p).(3)
According to Janes and Pepe [16] the covariate-adjusted ROC curve can be dened such
that,
aROCX(p)=ROCX(p)dFXD1(X).(4)
Then the AUC can be obtained such that,
aAUCX=1
0
aROCX(p)dp.
2.3. Non-parametric estimation of coviariate-adjusted ROC curve
2.3.1. Continuous bio-marker
Assume that YD1and YD0are bio- marker value for diseased and healthy subjects, respec-
tively. As described by Rodríguez–Alvarez et al. [17], a non-parametric regression model
isassumedtotestbio-markervaluesalongwithacontinuouscovariateXfor diseased and
non-diseased population:
YD1=μ1(X)+σ1(X)1,(5)
YD0=μ0(X)+σ0(X)0,(6)
where μ1(X)=β01 +β11Xand μ0(X)=β00 +β01Xare the regression functions, and σ1
and σ0are the variance functions. The errors 1and 0areassumedtobeindependentof
the covariate X,withzeromean,unitvariance.BasedonEquations(3),5,and6andafter
some algebra discussed in Appendix A of the paper by Rodríguez–Alvarez et al. [17], the
covariate-specic ROC curve from this induced regression has the following form:
ROCX=x(p)=FD1(X)μ0(X)μ1(X)
σ1(X)+σ0(X)
σ1(X)F1
D0(X)(p).(7)
TheaboveROCcurvecanbeestimatedusingthefollowingprocedure.Let{(xD1
i,yD1
i)}nD1
i=1
and {(xD0
j,yD0
j)}nD0
j=1be two independent random samples drawn from the diseased and
healthy populations, respectively. Then μ1(X)and μ0(X)can be estimated as,
μ1(X)=
x,{(xD1
i,yD1
i)}nD1
i=1,h1,q1,
μ0(X)=
x,{(xD0
j,yD0
j)}nD0
j=1,h0,q0,
where is local polynomial kernel estimator, h1and h0are bandwidths or smoothing
parameters, q1and q0are order of polynomials for the diseased and healthy population,
respectively. The corresponding variance function can be estimated as
σ2
1(X)=
x,{(xD1
i,zD1
i)}nD1
i=1,g1,r1,
6M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
σ2
0(X)=
x,{(xD0
j,zD0
j)}nD0
j=1,g0,r0,
where zD1
i=[yD1
iμ1(xD1
i)]2and zD0
j=[yD0
jμ0(xD0
j)]2and is local polynomial
kernel estimator, g1and g0are bandwidths, r1and r0are order of polynomials for the
corresponding diseased and healthy population, respectively [34].
To implement the estimation procedure, one needs to choose the order of polynomials
and bandwidths. As suggested by Rodríguez–Alvarez et al. [17], we choose polynomial of
order 1 (q1=q0=1) for mean function and polynomial of order 0 (r0=r1=0) for vari-
ance function. For bandwidth selection, cross-validation was used for automatic selection.
To speed up the cross-validation process, binning-type acceleration process proposed by
Fan and Marron [35] was used to obtain binning-approximation of μ1,μ0,σ2
1,andσ2
0.
For more details on bandwidth selection, see Rodríguez–Alvarez et al. [17]. Now using the
aboveestimators,theTPRandFPRcanbeestimatedas
FD1(z)=1
nD1
nD1
i=1
IyD1
i−ˆμ1(XD1
i)
ˆσ1(XD1
i)z,
FD0(z)=1
nD0
nD0
j=1
IyD0
j−ˆμ0(XD0
j)
ˆσ0(XD0
j)z.
Finally, the estimated covariate-specic ROC curve can be obtained as
ROCX=x(p)=
FD1(X)μ0(X)μ1(X)
σ1(X)+σ0(X)
σ1(X)
F1
D0(X)(p),(8)
where
F1
D0(X)(p)=sup{z;
FD0(z)p}. Then the estimated covariate-adjusted ROC curve
dened in Equation (3) can estimated as,
a
ROCX(p)=ROCX(p)dFD1X(X)=1
nD1(X)
nD1(X)
i=1
IyD1
iμ1(XD1
i)
σ1(XD1
i)>
F1
D0(X)(p).
(9)
Then the estimated covariate-adjusted AUC can be obtained such that,
a
AUCX=1
0
aROC(p)dp. (10)
2.3.2. Binary bio-marker
Assume that the bio-marker, Y, is binary (positive or negative result based on a threshold
value) and it takes the value 1 if and only if an unobservable continuous measurement Y
exceed a threshold θ, otherwise it takes value 0. The random variable Yis termed as latent
response. We can write the probability of positive response as πi=Pr[Yi=1] =Pr[Y>
θ]. To identify the latent class model, the threshold θoften considered as zero and stan-
dardize Ytohavestandarddeviationone.Underthelatentvariablemodelingframework,
the non-parametric regression for the binary bio-marker along with continuous covariate
Xcan be written as
Y
D1=ν1(X)+τ1(X)U1, (11)
BIOSTATISTICS & EPIDEMIOLOGY 7
Y
D0=ν0(X)+τ0(X)U0, (12)
where ν1(X)=γ01 +γ11X,andν0(X)=γ00 +γ01Xare regression functions, and τ1(X)
and τ0(X)are both variance functions, and both U1and U0are error terms with normally
distributed random variable of mean 0 and variance 1. However, as the scale of the latent
variable is not identied, to interpret the regression coecient in unitof standard deviation
of latent variable, we take τ1(X)=τ0(X)=1. The latent variable models can be converted
into probit model as follows:
π1=Pr[Y
D1>0] =Pr[U1>ν1]=Pr[U1>ν1]=1(ν1)=(ν1),
π0=Pr[Y
D0>0] =Pr[U0>ν1]=Pr[U0>ν0]=1(ν0)=(ν0),
where (.)is the standard normal cumulative density function. The ν1and ν0can be esti-
mated using both parametrically following maximum likelihood estimation technique and
non-parametrically using kernel estimation [36]. Here, we estimated both ν1and ν0follow-
ing the non-parametric estimation described in Section 2.3.1.ThenfollowingEquation(8)
the ROC can be estimated for binary bio-marker as
ROCb
X=x(p)=
FD1(X)ν0(X)ν1(X)+
F1
D0(X)(p),
where the TPR and FPR can be estimated with the observed binary response yas
FD1(z)=1
nD1
nD1
i=1
IyD1
i−ˆν1(XD1
i)zi,
FD0(z)=1
nD0
nD0
j=1
IyD0
j−ˆν0(XD0
j)zj,
with zD1
i=[yD1
iν1(xD1
i)]2and zD0
j=[yD0
jν0(xD0
j)]2. Finally, the covariate-adjusted
ROC curve, denoted by a
ROC
X(p), for the binary bio-marker can be estimated as
a
ROCb
X(p)=ROCb
X(p)dFD1(X)(X)=1
nD1
nD1
i=1
IyD1
iν1(XD1
i)>
F1
D0(X)(p),
where
F1
D0(X)(p)=sup{z;
FD0(z)p}.
3. Propensity score-based adjustment for covariate effects in ROC analysis
The non-parametric induced regression discussed in the earlier section can manage to
adjustfortheeectofasinglecovariateatatime,however,multiplecovariatesoften
needed to be adjusted in ROC analysis. In this paper, an adjustment for several covariates
onestimatingROCcurvearemadeusingpropensityscore,whichisalinearcombina-
tion of several covariates. The propensity score-based adjustment for confounding eect
is very popular in epidemiological research. We motivated from the studies by Janes and
Pepe [14,16], who suggested that the adjustment for covariate eects in ROC analysis is
8M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
analogous to the adjustment for confounder in odds ratio in epidemiological studies. We,
therefore, apply this approach to the non-parametric ROC regression analysis to adjust
for several covariates. The procedure with mathematical justication is described in the
following sub-sections starting with introducing propensity score.
3.1. Propensity score
The propensity score (PS), a probability of being treated (Z=1) given a set of baseline
covariates (X), is being used for estimating casual eect of the treatment/exposure in epi-
demiological observational studies where random assignment is not feasible [23,37]. The
PSareusedinsuchawaythattheresultingtreatment(Z=1) and control groups (Z=0)
will have similar covariate values to those created through random assignment [38]. This
is called covariate balancing between treatment and control group, and statistically, it
termed as conditional independence of treatment and covariate given PS and denoted by
ZX|PS. Several PS methods are available in literature including ‘matching’, ‘stratica-
tion’, ‘weighting’ and covariate adjustment’. Of the methods, the ‘PS-covariate adjustment
is a popular one which uses PS as covariate in the outcome model to estimate the eect of
the treatment/exposure on outcome. The resulting estimate can be interpreted as the eect
of treatment on outcome comparing to untreated group, with the same PS (i.e. the same
covariate values) for both groups.
Generalized propensity score (GPS) is widely used for estimating casual eect of the
continuous treatment or exposure on the outcome [28,39,40]. Therefore, if the bio-marker
Yis continuous (here the notation for treatment Zis replaced by bio-marker Y), the GPS
canbeusedtoestimatePS.Letr(y,x)be the conditional density of ygiven the covariates
x,whichhasthefollowingform:
r(y,x)=fY|X(y|x)=N(βX,σ2).
Then the generalized propensity score (gPS)canbeobtainedas
gPS =r(y,x)=1
2πˆσ2exp 1
2ˆσ2(Yiˆ
βX)2, (13)
where ˆ
βand ˆσare the estimates (by maximum likelihood or ordinary least squares) of
the regression coecient and standard deviation of the random error, respectively in the
propensity score model.
If the bio-marker Yis binary, the PS can be estimated using logistic regression including
asetofcovariatesXas:
bPS =P(Y=1|X)=[1 +exp(βX)]1. (14)
The PS estimated from the above models can be considered an example of a single-index
in that the information of several covariates eects on the response is completely captured
through the the linear predictor, βX.
3.2. Using propensity score in ROC regression
The PS obtained from the above models (Equations (13)–(14)) are result of a linear com-
bination of several covariates and hence it can be considered as dimension reduction
BIOSTATISTICS & EPIDEMIOLOGY 9
technique for a covariate set [41]. In the dimension reduction regression [42,43], the main
goal is to reduce the dimension of Xwithout losing information on Y|X.Thisleadstoan
assumption that without loss of information the conditional distribution F(Y|X)can be
indexed by a (or a set of) linear combination of X. That is, distribution of Y|Xis the same
as that of Y|PS (equivalently Y|βX) for all values of Xand mathematically we can write
F(Y|X)=F(Y|PS). It implies that the covariate matrix of order p×ncan be replaced
by the predictor matrix of order 1 ×n. In other words, according to the conditional inde-
pendence theory given by Dawid [44], the above dimension reduction can be explained as
the conditional independence of Yand Xgiven PS (or βX). Based on this background, we
proposedtousePSinROCanalysisinsteadofusingseveralcovariatesX.NowusingPS
(PS is used here as a general notation for both gPS and bPS) instead of X,theTPRandFPR
for Y|PS can be dened as
TPRPS(c)=1FD1(PS)(c)=Pr YD1>c|PS, (15)
FPRPS(c)=1FD0(PS)(c)=Pr YD0>c|PS, (16)
Then the PS-specic ROC curve can be dened as
ROCPS(p)=1FD1(PS)F1
D0(PS)(1p). (17)
ThePS-adjustedROCcurvecanbedenedas,
aROCPS(p)=ROCPS(p)dFPSD1(PS). (18)
Then the AUC can be obtain such that,
aAUCPS =1
0
aROC1(p)dp.
One can use the linear coecient βXof the PS model instead of PS because a one-to-one
correspondingly holds between PS and βX.UsingPSorβXas a covariate conrm the
adjustment of several covariates (X), because the PS or βXserves as a representative of
several covariates by means of their linear combination. Now, given F(Y|X)=F(Y|PS)
for all values of X, we can state that the components in Equations (1)–(3) conditioning on
Xwill be equal to the Equations (15)–(17) conditioning on PS, respectively, i.e. TPRX(c)=
TPRPS(c),FPR
X(c)=FPRPS(c)and ROCX(p)=ROCPS (p).Theseresultsindicateequal-
ity of the components in Equation (4) and Equation 18, i.e. aROCPS(p)=aROCX(p).The
corresponding PS-adjusted ROC curve, aROCPS, and the area under the curve, aAUCPS,
have similar interpretations to the Xadjusted curve and area, aROCXand aAUCX,but
referring the adjustment for the eect of PS rather than X. In other words, as discussed by
Janes and Pepe [14], the adjustment of the covariate XinROCanalysisensuretheappro-
priate distribution of bio-marker between diseased and healthy subjects, which was not
possiblewithoutadjustmentofXbecauseitconfoundsthetruedistributionofthebio-
markerbetweenthegroups.Itcanbearguedthatthesameadjustmentinthedistributionof
diseased and healthy bio-marker will appear when Xis replaced by PS because of a one-to-
one correspondence holds between XandPS,accordingtothePS-modeldescribedabove.
10 M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
However, the main advantage of the PS-approach is that it can accommodate the adjust-
ment of several covariates by means of their linear combination connecting to PS, which
is dicult in the existing approach.
The PS-adjusted ROC curve dened in Equation (18) can be obtained by replacing the
single covariate Xby the estimated propensity score (PS) into the non-parametric induced
regression Equations 5–(6). That is, for the continuous bio-marker Y,thenon-parametric
regression Equations (5)–(6) take the following form:
YD1=μ1(gPS)+σ1(gPS)1,
YD0=μ0(gPS)+σ0(gPS)0.
For the binary bio-marker Y, the PS adjusted the above non-parametric regression equa-
tions become:
Y
D1=ν1(bPS)+τ1(bPS)U1,
Y
D0=ν0(bPS)+τ0(bPS)U0.
The PS-specic (Equation 17) and PS-adjusted ROC curve (Equation 18) for both binary
and continuous bio-marker can be estimated following the non-parametric induced regres-
sion procedure described in Sections 2.3.12.3.2 butreplacingthesinglecovariateXby the
PS.Therefore,theestimatedPS-adjustedROCcurveandAUChavethesimilarinterpre-
tation to those obtained by adjusting for single covariate. For example, a value of 0.75 for
the estimated PS-adjusted AUC (aAUCPS) indicates that, for a randomly selected pair of
subjects (disease vs healthy), there is a 75% chance of having greater bio-marker value for
the diseased subject than the healthier counterpart, given that both subjects have the same
PS value, i.e. the same value for each of the covariates.
Again, following Rodríguez–Alvarez et al. [17], we propose to use bootstrap variance
for constructing condence interval and testing the eect of PS on PS-specic ROC curve
with hypothesis H0:ROC
PS(p)=ROC(p). For more details about the bootstrap variance
estimation procedure in the non-parametric induced ROC regression, see Section 3of the
paper by Rodríguez–Alvarez et al. [17].
4. Simulation study
A simulation study was conducted to evaluate the performance of the PS-based covariate
adjustment in the ROC analysis. Separate simulation series were conducted for continu-
ous and binary bio-marker. In each simulation series, we rst generated two independent
covariates of which one is continuous (X1) and the other is dichotomous (X2), separately
for diseased (D1)andhealthy(D0) population, respectively, as
X1D1ND1,σ2
D1);X1D0ND0,σ2
D0),
X2D1Bin(n,πD1);X2D0Bin(n,πD0).
We consi d e r e d μD1=51.54, μD0=38.19, σ2
D1=224.13, σ2
D0=182.33, πD1=0.67,
πD0=0.42. Dierent values of the parameters associated with the covariate distribution
for diseased and healthy population suggest imbalance covariate distribution between the
BIOSTATISTICS & EPIDEMIOLOGY 11
population, for which adjustment for the covariate eects is necessary. Then the continuous
bio-marker values for diseased and healthy population were generated from the following
models:
YD1=β01 +β11X1D1+β21 X2D1+D1, (19)
YD0=β00 +β10X1D0+β20 X2D0+D0, (20)
where D1N(0, 1)and D0N(0, 1). The values of the parameters were set to β01 =
29.99, β11 =−0.013, β21 =2.06, β00 =21.50, β10 =0.121, β20 =−0.77. Dierent values
of the parameters of the model for diseased and healthy subjects suggest an association of
covariates X1and X2with both the bio-marker and the disease.
Similarly the binary bio-marker values were generated separately for diseased and
healthy population as
Yb
D1=1, if Y
D1>0,
=0, otherwise
Yb
D0=1, if Y
D0>0,
=0, otherwise
where the latent variables Y
D1and Y
D0can be generated from the following model:
Y
D1(x)=γ01 +γ11X1D1+γ21 X2D1+UD1,
Y
D0(x)=γ00 +γ10X1D0+γ20 X2D0+UD0.
where UD1N(0, 1)and UD0N(0, 1). We considered γ01 =−1.5, γ11 =0.123, γ21 =
0.772, γ00 =−2.46, γ10 =0.013, γ20 =2.06. As described earlier, dierent values of the
parameters of the two models suggest an association of covariates with both bio-marker
and disease outcome.
For both simulation series, we considered dierent scenarios by varying the sample size
as 150, 300, 500, 1000, with 30% for the subjects with disease and the rest of them for
healthy subjects. This implies that P(D)=0.30. Further simulation scenarios were cre-
ated by considering X1and X2correlated. For each simulation scenario, we estimated the
propensity score (using Equation (13) for continuous marker and Equation (14) for binary
marker) from a linear combination of the X1and X2and used it in the non-parametric
ROC regression described in Section 2.3 to estimate the PS-adjusted ROC curve. The aver-
age over the 500 estimated curves were compared with the true ROC curve (obtained by
integrating the covariate specic ROC curves over the covariate distribution among the
cases) to evaluate the performance of the PS-based adjustment in ROC curve. The R pack-
age ‘npROCRegression’ was used to estimate the non-parametric PS-based adjusted ROC
curve.
Figure 1showsthatthewholesamplingdistributionofthePS-adjustedROCcurvesfor
continuous bio-marker over 500 replications and their mean ROC curve with the super-
imposedtrueROCcurve.TheresultsrevealedthatthepropensityscoreadjustedROC
curve is reasonably correctly estimated (very close to true curve). The simulation standard
error (or mean squared error) decreased with increasing sample size indicating consistent
12 M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
Figure 1. Estimated ROC curve (dash line) from the average over 500 replicated curves with a superim-
posed true ROC curve (solid line) for a continuous bio-marker linearly related with covariates. FPR refers
to false positive rate and PS to propensity score, n.h to sample size in healthy population and n.d in
disease population. Maximum Monte Carlo Error=0.0295.
estimate of the curve. Similar results were observed for the binary bio-marker (Figure 2).
Simulation with correlated covariates also suggest similar results (results not shown).
We performed another simulation study to investigate if the mis-specication of the
propensity score model aects the PS-adjusted ROC results. For this, two more covari-
ates, X3D1N(0, 1),X3D0N(0, 1)and X4D1Ben(n, 0.7),X4D1Ben(n, 0.3)were
independently generated in addition to the covariate X1and X2generated in the ear-
lier simulation, for a total of 300 subjects (keeping 30% diseased). The corresponding
bio-marker values, YD1and YD0, were generated for the same subjects as generated pre-
viously but adding X3in the previous true models (Equations (19)–(20)). The values for
therespectivecoecients(β31,β30)associatedwithX3intheextendedtruemodelare0.5
and 0.2, respectively. We then tted four dierent models, of which one is correctly speci-
ed model: E(Y|X)=β0+β1X1+β2X2+β3X3, and the other three are dierent forms
of mis-specied models. They are Model 1: E(Y|X)=β0+β1X1+β2X2,whereX3is
missing, Model 2: E(Y|X)=β0+β1X1+β2X2+β3X4,whereX3is missing and X4was
treatedasnoise,andModel3:E(Y|X)=β0+β1X1+β2X2+β3X3+β4X4,whereX4
BIOSTATISTICS & EPIDEMIOLOGY 13
Figure 2. PS-adjusted ROC curve (dash line) from the average over 500 replicated curves with a super-
imposed true ROC curve (solid line) for a binary bio-marker. FPR refers to false positive rate and PS to
propensity score, n.h to sample size in healthy population and n.d in disease population. Maximum
Monte Carlo Error=0.0367.
was considered as noise. The results revealed that the eects mis-specication of the PS
model in estimating true ROC curve is negligible (Figure 3), which is true for all forms of
mis-specication.
We further performed a simulation study to investigate the robustness of the non-
parametric induced regression method to a non-linear function of covariates. That is, the
bio-marker in both healthy and diseased population is non-linearly related with covari-
ates. For this we extended the models in Equations (19)–(20) by adding another covariates
X3D1and X3D0with their quadratic terms. The true beta coecient values for both the lin-
ear and quadratic term of the new covariates were considered as 0.5, 0.2, 0.05, and 0.02,
respectively. The results in Figure 4show that the method appeared to be robust to a
non-linear relationship of biomarker and the covariates and provided consistent estimates
with the increasing sample size.
5. Application
Body mass index (BMI) is an inuential risk factor of both hypertension and diabetes
that lead to many cardiovascular diseases such as stroke, cardiac disease, etc. [45,46].
14 M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
Figure 3. PS-adjusted ROC curve (dash line) from the average over 500 replicated curves with a super-
imposed true ROC curve (solid line) for different mis-specified models. FPR refers to false positive rate
and PS to propensity score. Maximum Monte Carlo Error=0.0287.
Considering BMI as bio-marker, the methods are applied to evaluate BMI in classifying
the subjects with hypertension (and diabetes separately) from the healthy subjects with
ageover35yearsinthepresenceofseveralcovariatessuchasage,gender,education,
sociology-economic status, etc.
5.1. Data and summary measure
Data used to illustrate the method were extracted from the nationally representative
2011 Bangladesh Demographic and Health Survey (BDHS) that has been conducted
through a collaborative eort of the National Institute of Population Research and Training
(NIPORT), Mitra and Associates and ICF international under the program of worldwide
demographic and health survey. Details of the survey methodology can be found in their
report [47]. The bio-marker measurements for both blood pressure and fasting glucose
levelwerecollectedfromallmembersofahouseholdwithage35yearsandaboveinevery
third of the selected households in each primary sampling unit selected throughout the
country. For measuring blood pressure, the survey used the WHO’s recommended LIFE
BIOSTATISTICS & EPIDEMIOLOGY 15
Figure 4. Estimated ROC curve (dash line) with a superimposed true ROC curve (solid line) for a continu-
ous bio-marker non-linearly related with covariates. FPR refers to false positive rate and PS to propensity
score, n.h to sample size in healthy population and n.d in disease population. Maximum Monte Carlo
Error=0.0427.
SOURCEUA-76PlusBloodPressureMonitormodel.Bythetrainedtechnician,threemea-
surements of both systolic and diastolic blood pressure were taken at 10-minute intervals
between the measurements, and the average of the second and third measurements was
used to report respondents’ blood pressure. In addition to these measurements, respon-
dentswereaskedwhethertheyhavebeentoldbyadoctorornursethattheyhavehigh
blood pressure and hence are now taking prescribed medication for lowering it. A sub-
ject is classied as hypertensive (elevated blood pressure) if his/her systolic blood pressure
(SBP) value equal to or greater than 140 mmHg or a diastolic blood pressure (DBP) value
equal to or greater than 90 mmHg or he/she was taking medication for lowering the blood
pressure.
For measuring fasting plasma glucose level, the respondents were requested in advance
for fasting overnight until a capillary blood sample (whole blood obtained from the middle
or ring nger) is taken in the morning using the HemoCue 201+ blood glucose analyzer.
The respondents were further asked whether they have been told by doctor or nurse that
they had diabetes prior to the survey and are taking medication for recovering from it.
A subject with fasting plasma glucose level equal to higher than 7.0 mmol/L or who was
taking medication is classied as a patient with diabetes. For more details about the survey
andsuchmeasurements,seeelsewhere[4547]. Finally, bio-marker information related to
both the hypertension and diabetes were collected from a total of 7328 (male 3744, female
3584) and 7593 (male 3753, female 3840) subjects, respectively, who given their consent
for providing the bio-marker information.
16 M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
Tab le 1. Summary statistics for the selected variables.
Variables n%/Mean 95%CI
Hypertension
Hypertension 1865 25.52 [24.53, 26.54]
No hypertension 5442 74.47 [73.46, 75.46]
Diabetes
Diabetic 850 11.63 [10.92, 12.39]
Non-diabetic 6457 88.37 [87.61, 89.08]
Age 7307 51.05 [50.76, 51.34]
Gender
Male 3616 49.49 [48.34, 50.63]
Female 3691 50.52 [49.36, 51.66]
BMI
BMI <25 6213 85.03 [84.19, 85.82]
BMI 25 1094 14.97 [14.17, 15.80]
Type of place of resident
Urban 2402 32.87 [31.80, 33.95]
Rural 4905 67.13 [66.04,68.19]
Wealth index
Poorest 1292 17.68 []16.82,18.57]
Poorer 1313 17.97 [17.10,18.86]
Middle 1418 19.41 [18.51, 20.32]
Richer 1528 20.91 [19.99,21.85]
Richest 1756 24.03 [23.06, 25.02]
Educational level
No education, preschool 3248 44.94 [43.80, 46.09]
Primary 2025 27.71 [26.69, 28.75]
Secondary 1372 18.78 [17.89, 19.69]
Higher 626 8.57 [7.94, 9.23]
A set of individual-level covariates available including age (years), sex (male, female),
education (no education, primary, secondary, higher), area of residence (urban, rural), and
household’s socio-economic status (poorest, poorer, richer, richest). The socio-economic
status was determined by taking ve equal parts of wealth index calculated using principal
component analysis of assets owned by household, with the bottom 20% as the poorest
and the top 20% as the richest. The status of both hypertension (yes, no) and diabetes (yes,
no) were considered as two outcome variables and BMI (a composite index of height and
weight) as bio-marker. However, status of diabetes was considered as covariate when we
evaluated the performance of BMI in classifying hypertensive patient and vice-versa.
Table 1presents the summary statistics of all variables included in the analysis. It can be
observed that 25% and 12% of the total subjects are found to be hypertensive and diabetic,
respectively. The average age of all subjects is 51.05 with almost equal number of male and
female. Of the total sample, 1094 (14.97%) subjects have BMI over 25, 32.87% belong to
urban area and 67.13% to rural area and majority have no education or primary education
(approximately 44.94% and 27.71%).
5.2. Performance of BMI in classifying patients with diabetes and hypertension
Literatureandsomeexplanatoryanalyses(resultsnotshown)suggestapossibleassocia-
tion of all such covariates (age, gender, area of residence and socio-economic status) with
both the BMI and diseases: diabetes and hypertension [45,46,48]. Here we tried with the
adjustmentfordierentcombinationsofcovariatesintheROCcurvethroughPSadjust-
ment to evaluate the performance of the BMI in classifying the hypertensive or diabetes
BIOSTATISTICS & EPIDEMIOLOGY 17
patients. Separate analysis was performed for diabetes and hypertension. When the sta-
tus of diabetes was considered as outcome, the status of hypertension was considered as
covariate and vice-versa.
Both the Figures 5and 6show the results for diabetes and hypertension, respectively,
where in each gure, the part(a) depicts the covariate-adjusted ROC curves for dierent
combination of covariates, the part(b) depicts the 95% bootstrap-based condence inter-
valforthetrueROCcurveadjustingforallpossiblecovariates,andthepart(c)illustrates
the AUC of the corresponding PS-specic ROC curves. For both the disease status, a small
variation can be observed in the ROC curves when adjustment made for dierent com-
binations of covariates (Figures 5(a) and 6(a)). Of the dierent combination, the greatest
Figure 5. Estimate of the propensity score-based adjusted ROC curve for assessing BMI to classify dia-
betes patients. (a) Adjustment for different combination of covariates; (b) Adjustment for all possible
covariates; (c) PS-specific AUC when adjustment made for all covariates.
18 M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
Figure 6. Estimate of the propensity score-based adjusted ROC curve for assessing BMI to classify hyper-
tensive patients. (a) Adjustment for different combination of covariates; (b) Adjustment for all possible
covariates; (c) PS-specific AUC when adjustment made for all covariates.
classication accuracy for both hypertension and diabetes was observed when all possible
covariates were adjusted simultaneously. Moreover, a signicant (p<0.01) variation can
be observed for PS-specic (covariates-specics) ROC curves and their areas (AUC) when
adjustment made for all possible covariates (Figures 5(c) and 6(c)), which suggest that the
performance of BMI in classifying the subjects with disease from the healthier counterpart
is aected by the PS values i.e. covariates. For example, greater discriminating ability can be
observed for the lower PS values, i.e. for greater ˆ
βXreecting for the female subjects with
higher age and education and better economic condition. The predictive performance of
BMIisappearedtosimilarforbothdiseases(hypertensionanddiabetes).Moreover,simi-
lar results can be observed when BMI was considered as binary bio-marker with BMI <25
and 25 indicating normal and overweight, respectively (results not shown).
BIOSTATISTICS & EPIDEMIOLOGY 19
6. Discussion
The importance of new bio-marker in bio-medical research depends on how accurately the
marker classies the subjects with disease from the healthy subjects. The ROC curve is a
popular method for evaluating classication accuracy of bio-marker. However, the dier-
ence in patient characteristics (baseline covariates) between disease and healthy population
may make it dicult to evaluate classication accuracy of bio-marker and hence it is nec-
essary to adjust for the covariate eect in the ROC analysis. While the existing methods are
unabletoadjustformorethanonecovariatesatatime[17], this study provided a solution
by using propensity score to adjust for several covariates simultaneously. More specically,
the propensity score, derived from a linear combination of a set of covariates, was adapted
in the existing non-parametric induced regression estimator for the ROC curve instead
of using that set of covariates. The PS used here as data dimension reduction technique
to reduce dimension of several covariates by means of their linear combination, which is
similar to dimension reduction regression discussed in literature [42,43]. Hence, using PS
directly or the linear coecient ( ˆ
βx) from where PS was derived instead of several covari-
atesdoesnotchangetheinterpretationoftheROCcurve.Forexample,thePS-adjusted
ROCcurvecanbeinterpretedastheclassicationaccuracyofthebio-markeradjusting
for the eects of PS, i.e. the eects of several covariates. In addition, this approach can be
used to generate covariate-specic ROC curve from the PS-specic ROC curves because
there is one-to-one correspond holds between the PS and covariate.
The simulation study suggests that the PS-adjusted ROC cur ve appeared to be consistent
with increasing sample size. Even the eects of the mis-specication of the propensity score
model is negligible. Furthermore, the method was applied to evaluate the performance of
BMI in classifying the hypertensive and diabetic patient from their healthier counter parts,
in the presence of several patient specic covariates such as age, gender, education, socio-
economic status, area of residence. The results support the simulation ndings and have a
meaningful interpretation. The ndings of the study suggest that PS-based approach can
beusedtoadjustfortheeectofseveralbaselinecovariatesthatmakeconfoundingthe
discriminatory power of the bio-marker.
ThePS-adjustedROCcurve,aROC
PS, derived using non-parametric induced regres-
sion possesses the same properties as the standard ROC estimator, aROCX,fromthesame
methodology considering single covariates as described by Rodríguez–Alvarez et al. [17].
Themainandimportantadvantageoftheproposedapproachisthatitcanadjusttheeect
ofseveralcovariatesinROCanalysisatatimewithoutlossofgeneralitybymeansofasin-
gle score (PS) obtained from the linear transformation of the covariate set. In this paper,
we discussed regression approach as a dimension reduction technique through modeling
bio-markers with a set of covariates, which is truly based on the normality and linearity
assumptions. However, further research may be required to investigate the consequence of
the degrees/forms of mis-specication (non-normality, non-linearity, etc.) of the propen-
sity score model and extend the methods to overcome such situations. In this paper, we
used bootstrap variance for making inference, however, further research may be possible
to derive analytically variance with robust variance estimation technique to capture the
variation in the propensity score model into the ROC estimator. In addition, the proposed
methodhasafewlimitationsthatittakesahugetimetohavethenalestimateofROC
curve and its area, and therefore extensive simulation considering diverse scenarios such
20 M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
as correlated covariates, non-linear function of the propensity score model, etc. was not
possible.
Acknowledgments
TheauthorsacknowledgetheauthorityofmeasuresDHSformakingavailablethedatausedhereina
public domain. In addition, the authors acknowledge the proceeding of the 62nd ISI World Statistics
Congress, Kuala Lampur, Malaysia, 2019 published by the same authors, because some parts of this
manuscript are completely matched with this conference proceedings.
Disclosure statement
No potential conict of interest was reported by the author(s).
Ethics approval and consent to participate
As the dataset is freely available in a public domain and is permitted to use in research
publication, the ethics approval and consent statement has been approved by the authority
who made the data available for public use.
Data availability statement
The dataset used in this study can be downloaded freely from a public domain at
https://dhsprogram.com/data/ under the authority of the DHS program.
Notes on contributors
Muntaha Mushquee completedBShonoursandMSinAppliedStatisticsfromtheInstituteofSta-
tistical Research and Training, University of Dhaka, and did her 2nd MSc degree in Statistics at the
Memorial University of Newfoundland, Canada. She is now working as a Health Data Analyst at BC
PHSA, Canada.
M. Shaqur Rahman hasaPhDinMedicalStatistics,UniversityCollegeLondonandiscurrently
working as a Professor of Applied Statistics at the Institute of Statistical Research and Training, Uni-
versity of Dhaka. His main research areas include casual inference, developing and validating risk
prediction models, mixed eect models, focusing on development of new statistical methods for
analyzing data in public health and medical research. So far he published a number of research
papers in peer reviewed journals and scientic reports and guided a number of undergraduate and
postgraduate students for preparing their projects and thesis. In addition to teaching and research,
he is also involved with statistical consultancy services for various national and international orga-
nizations. He has been serving as an associate editor of a number of peer-reviewed journals. He is
a member of the Bangladesh Statistical Association, International Biometric Society, International
Statistical Institute, International Society of Clinical Biostatistics and an educational ambassador of
the American Statistical Association from Bangladesh.
ORCID
M. Shaqur Rahman http://orcid.org/0000-0001-5256-7453
References
[1] Mayeux R. Biomarkers: potential uses and limitations. NeuroRx. 1(2): 182–188.
BIOSTATISTICS & EPIDEMIOLOGY 21
[2] Strimbu K, Tavel JA. What are biomarkers?. Curr Opin HIV AIDS. 2010;5(6):463–466.
[3] Craig-Schapiro R, Fagan AM, Holtzman DM. Biomarkers of alzheimer’s disease. Neurobiol
Dis. 2009;35(2):128–140.
[4] Zheng Y, Cai T, Jin Y, et al. Evaluting prognostic accuracy of biomarkers under competing risk.
Biometrics. 2012;68(2):388–396.
[5] Chen W, Samuelson FW, Gallas BD, et al. On the assessment of the added value of new
predictive biomarkers. BMC Med Res Methodol. 2013;13:98.
[6] Moons KGM, de Groot JAH, Linnet K, et al. Quantifying the added value of a diagnostic test
or marker. Clin Chem. 2012;58(10):1408–1417.
[7] Linnet K, Bossuyt PMM, Moons KGM, et al. Quantifying the accuracy of a diagnostic test or
marker. Clin Chem. 2012;58(9):1292–1301.
[8] Ridker PM, Buring JE, Rifai N, et al. Development and validation of improved algorithms for
the assessment of global cardiovascular risk in women. J Am Med Assoc. 2007;297:611–619.
[9] Ridker PM, Paynter NP, Rifai N, et al. C-reactive protein and parental history improve
global cardiovascular risk prediction: the reynolds risk score for men. Circulation.
2008;118:2243–2251.
[10] Wilson PWF, Pencina M, Jacques P, et al. C-Reactive protein and reclassication of cardiovas-
cular risk in the framingham heart study. Circ Cardiovasc Qual Outcomes. 2008;1(2):92–97.
[11] Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating character-
istic (ROC) curve. Radiology. 1982;143:29–36.
[12] Birgit G. Analysis of biomarker data: logs, odds ratios, and receiver operating characteristic
curves. Currunet Opinion in HIV and AIDS. 2010;5(6):473–9.
[13] Pepe MS. An interpretation for the roc curve and inference using glm procedures. Biometrics.
2000;56:352–359.
[14] Janes H, Pepe MS. Adjusting for covariates in studies of diagnostic, screening, or prognostic
markers:anoldconceptinanewsetting.AmJEpidemiol.2008;168(1):89–97.
[15] Janes H, Pepe MS. Adjusting for covariate eects on classication accuracy using the covariate-
adjusted receiver operating characteristic curve. Biometrika. 2009;96(2):371–382.
[16] Janes H, Pepe MS. Adjusting for covariate eects on classication accuracy using the covariate-
adjusted ROC curve. UW Biostat Work Pap Ser. 2009;96(2):371–382.
[17] Rodríguez-Alvarez MX, Roca-Pardiñas J, Cadarso-Suárez C. Roc curve and covari-
ates: extending induced methodology to the non-parametric framework. Stat Comput.
2011;21(4):483–499.
[18] Harrell Jr FE, Cali RM, Pryor DB, et al. Evaluating the yield of medical tests. J Am Med Assoc.
1982;247:2543–46.
[19] Pepe MS. Three approaches to regression analysis of receiver operating characteristic curves
for continuous test results. Biometrics. 1998;54(1):124–135.
[20] Faraggi D. Adjusting receiver operating characteristic curves and related indices for covariates.
JRStatSocSerDStat.2003;52(2):179–192.
[21] Cai T, Pepe MS. Semiparametric receiver operating characteristic analysis to evaluate biomark-
ers for disease. J Am Stat Assoc. 2002;97(460):1099–1107.
[22] Alonzo TA, Pepe MS. Distribution-free roc analysis using binary regression techniques.
Biostatistics. 2002;3(3):421–432.
[23] Austin PC. An introduction to propensity score methods for reducing the eects of confound-
ing in observational studies. Multivariate Behav Res. 2011;46(3):399–424.
[24] D’Agostino RB. Propensity score methods for bias reduction in the comparison of a treatment
to a non-randomized control group. Stat Med. 1998;17:2265–2281.
[25] Weitzen S, Lapane KL, Toledano AY, et al. Principles for modeling propensity scores in medical
research: a systematic literature review. Pharmacoepidemiol Drug Saf. 2004;13:841–853.
[26] Han S, Andrei AC, Tsui KW, et al. Roc analysis using covariate balancing propensity scores
with an application to biochemical predictors for thyroid cancer. Communications in Statistics
Part B: Simulation and Computation. 2022;51(1):374–390.
22 M. MUSHFIQUEE AND M. SHAFIQUR RAHMAN
[27] Galadima HI, McClish DK. Controlling for confounding via propensity score methods
can result in biased estimation of the conditional auc: A simulation study. Pharm Stat.
2019;18(5):568–582.
[28]McCareyDF,GrinBA,AlmirallD,etal.Atutorialonpropensityscoreestimation
for mutiple treatments using generalized boosted model. Stat Med. 2013;30–32(19):3388–
3414.
[29] Coocicnough DJ, Rossmanrr K, Lusted LB. Radiographic applications of receiver operating
characteritic (roc) curves. Radiology. 2003;229(1):3–8.
[30] Goncalves L, Subtil A, Oliveira MR. Roc curve estimation: an overview. Revstat Stat J.
2014;12(1):1–20.
[31] Metz CE. Basic principles of roc analysis. Semin Nucl Med. 1978;8(4):283–298.
[32] Faraggi D, Reiser B. Estimation of the area under the roc curve. Stat Med. 2002;30(20):
3093–3106.
[33] Metz CE. Basic principles of ROC analysis. Semin Nucl Med. 1978;8:283–298.
[34] González-Manteiga W, Pardo-Fernández JC, Keilegom IV. Roc curves in non-parametric
location-scale regression models. Scand J Stat. 2011;38:169–184.
[35] Fan J, Marron JS. Fast implementations of nonparametric curve estimators. J Comput Graph
Stat. 1994;3(1):35–56.
[36] Müller H-G, Schmitt T. Kernel and probit estimates in quantal bioassay. J Am Stat Assoc.
1988;83(403):750–759.10.1080/01621459.1988.10478658
[37] Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for
causal eects. Biometrika. 1983 Apr;70(1):41–55.10.1093/biomet/70.1.41
[38] Olmos A, Govindasamy P. A practical guide for using propensity score weighting in R.
2015;20(13).
[39] Zhu Y, Coman DL, Ghosh D. A boosting algorithm for estimating generalized propensity
score with continuous treatment. J Causal Inference. 2015;3(1):25–40.
[40] Austin PC. Assessing covariate balance when using the generalized propensity score with
quantitative or continuous exposures. Stat Methods Med Res. 2019;28(5):1365–1377. PMID:
29415624.
[41] Ghosh D. Propensity score modelling in observational studies using dimension reduction
methods. Stat Probab Lett. 2011;81(7):813–820. ISSN 0167-7152. Statistics in Biological and
Medical Sciences.
[42] Cook RD, Lee H. Dimension reduction in binary response regression. J Am Stat Assoc.
1999;94(448):1187–1200.
[43] Li KC. Sliced inverse regression for dimension reduction. J Am Stat Assoc. 1991;86(414):
316–327.
[44] Dawid A. Conditional independence in statistical theory. J R Stat Soc Series B Stat Methodol.
1979;41(1):1–31.ISSN 00359246.
[45] AkterT,SarkerEB,RahmanMS.Atutorialongeewithapplicationstodiabetesandhyperten-
sion data from a complex survey. J Biomed Anal. 2018;1(1):37–50.
[46] Roy PK, Khan MHR, Akter T, et al. Exploring socio-demographic-and geographical-variations
in prevalence of diabetes and hypertension in bangladesh: Bayesian spatial analysis of national
health survey data. Spat Spatiotemporal Epidemiol. 2019;29:71–83.
[47] NIPORT, Mitra-Associates, and Macro International. Bangladesh Demographic and Health
Survey 2011. Technical report, National Institute of Population Research and Training
(NIPORT); Dhaka, Bangladesh, and Calverton, Maryland, USA, 2011.
[48] DuaS,BhukerM,SharmaP,etal.Bodymassindexrelatestobloodpressureamongadults.
North Am J Med Sci. 2014;6(2):89–95.
ResearchGate has not been able to resolve any citations for this publication.
Article
Full-text available
Correlated data frequently arise from cross-sectional studies with complex cluster design because individuals from the same cluster or region share some common characteristics. Analyzing correlated data using standard statistical methods, which are applicable for independent data, may produce misleading inference. This article reviews the GEE and its software implementations and provides some guidelines for using it in practice. To illustrate GEE, data from the 2011 Bangladesh Demographic and Health Survey, a two-stage complex cluster survey have been used to identify the risk factors for diabetes and hypertension. The results suggest that age, current working status, education, socioeconomic status, and body mass index are significantly associated with hypertension and diabetes. Further, we found significant positive correlation between the responses from the same cluster, justifying the use of GEE.
Article
Full-text available
Correlated data frequently arise from cross-sectional studies with complex cluster design because individuals from the same cluster or region share some common characteristics. Analyzing correlated data using standard statistical methods, which are applicable for independent data, may produce misleading inference. This article reviews the GEE and its software implementations and provides some guidelines for using it in practice. To illustrate GEE, data from the 2011 Bangladesh Demographic and Health Survey, a two-stage complex cluster survey have been used to identify the risk factors for diabetes and hypertension. The results suggest that age, current working status, education, socioeconomic status, and body mass index are significantly associated with hypertension and diabetes. Further, we found significant positive correlation between the responses from the same cluster, justifying the use of GEE.
Article
Full-text available
Propensity score methods are increasingly being used to estimate the effects of treatments and exposures when using observational data. The propensity score was initially developed for use with binary exposures (e.g., active treatment vs. control). The generalized propensity score is an extension of the propensity score for use with quantitative exposures (e.g., dose or quantity of medication, income, years of education). A crucial component of any propensity score analysis is that of balance assessment. This entails assessing the degree to which conditioning on the propensity score (via matching, weighting, or stratification) has balanced measured baseline covariates between exposure groups. Methods for balance assessment have been well described and are frequently implemented when using the propensity score with binary exposures. However, there is a paucity of information on how to assess baseline covariate balance when using the generalized propensity score. We describe how methods based on the standardized difference can be adapted for use with quantitative exposures when using the generalized propensity score. We also describe a method based on assessing the correlation between the quantitative exposure and each covariate in the sample when weighted using generalized propensity score -based weights. We conducted a series of Monte Carlo simulations to evaluate the performance of these methods. We also compared two different methods of estimating the generalized propensity score: ordinary least squared regression and the covariate balancing propensity score method. We illustrate the application of these methods using data on patients hospitalized with a heart attack with the quantitative exposure being creatinine level.
Article
Full-text available
Propensity score weighting is one of the techniques used in controlling for selection biases in non-experimental studies. Propensity scores can be used as weights to account for selection assignment differences between treatment and comparison groups. One of the advantages of this approach is that all the individuals in the study can be used for the outcomes evaluation. In this paper, we demonstrate how to conduct propensity score weighting using R. The purpose is to provide a step-by-step guide to propensity score weighting implementation for practitioners. In addition to strengths, some limitations of propensity score weighting are discussed. The use of propensity scores is becoming part of the evaluation landscape (Guo & Fraser, 2015). Rosenbaum and Rubin (1983) introduced the concept of propensity score analysis to address selection bias when random assignment is not feasible. As defined by Rosenbaum and Rubin, a propensity score is the conditional probability of assignment to a treatment condition given a set of observed covariates: e = p(z=i|X). When propensity scores are used, the resulting groups will have similar characteristics to those created through random assignment. Most of the applications related to propensity scores are in matching (Thoemmes & Kim, 2011). Recently, Randolph, Fable, Manuel, and Balloun (2014) in this journal, described in detail how to conduct propensity score matching using R.
Article
Full-text available
In this article, we study the causal inference problem with a continuous treatment variable using propensity score-based methods. For a continuous treatment, the generalized propensity score is defined as the conditional density of the treatment-level given covariates (confounders). The dose–response function is then estimated by inverse probability weighting, where the weights are calculated from the estimated propensity scores. When the dimension of the covariates is large, the traditional nonparametric density estimation suffers from the curse of dimensionality. Some researchers have suggested a two-step estimation procedure by first modeling the mean function. In this study, we suggest a boosting algorithm to estimate the mean function of the treatment given covariates. In boosting, an important tuning parameter is the number of trees to be generated, which essentially determines the trade-off between bias and variance of the causal estimator. We propose a criterion called average absolute correlation coefficient (AACC) to determine the optimal number of trees. Simulation results show that the proposed approach performs better than a simple linear approximation or L2 boosting. The proposed methodology is also illustrated through the Early Dieting in Girls study, which examines the influence of mothers’ overall weight concern on daughters’ dieting behavior.
Article
Biomarker evaluation is important for diagnosing clinical diseases. Covariate adjusted receiver operating characteristic (ROC) regression has been used to identify significant biomarker candidates. Here, we show that the statistical significance of a biomarker can be affected by its prevalence. We propose a novel method that incorporates covariate prevalence information in the ROC regression. This approach is based on covariate balancing propensity scores proposed by Imai and van Dyk. Our method produces higher AUC values, demonstrating improved discrimination ability compared to direct ROC regression or unadjusted ROC analysis; this method can be used to improve biomarker development and can be implemented by an artificial intelligence (AI) system. Extensive simulation studies and data from a thyroid cancer study illustrate the advantages of our approach.
Article
In the medical literature, there has been an increased interest in evaluating association between exposure and outcomes using nonrandomized observational studies. However, because assignments to exposure are not random in observational studies, comparisons of outcomes between exposed and nonexposed subjects must account for the effect of confounders. Propensity score methods have been widely used to control for confounding, when estimating exposure effect. Previous studies have shown that conditioning on the propensity score results in biased estimation of conditional odds ratio and hazard ratio. However, research is lacking on the performance of propensity score methods for covariate adjustment when estimating the area under the ROC curve (AUC). In this paper, AUC is proposed as measure of effect when outcomes are continuous. The AUC is interpreted as the probability that a randomly selected nonexposed subject has a better response than a randomly selected exposed subject. A series of simulations has been conducted to examine the performance of propensity score methods when association between exposure and outcomes is quantified by AUC; this includes determining the optimal choice of variables for the propensity score models. Additionally, the propensity score approach is compared with that of the conventional regression approach to adjust for covariates with the AUC. The choice of the best estimator depends on bias, relative bias, and root mean squared error. Finally, an example looking at the relationship of depression/anxiety and pain intensity in people with sickle cell disease is used to illustrate the estimation of the adjusted AUC using the proposed approaches.
Article
Bangladesh has been experiencing an epidemiological transition from communicable diseases to non-communicable disease (NCDs), with a rapid increase in the NCD related morbidity and mortality in the last decade. Hypertension and diabetes are two important risk factors of NCDs that significantly increase the burden of cardiovascular diseases and risk of death. While the prevalence of people with both hypertension and diabetes has been increasing dramatically over time, it is essential to identify relatively more prevalent socio-demographic groups and geographical regions (local administrative districts) to reduce the NCDs related deaths in an urgent basis. This study focused on examining the association of socio-demographic factors with both hypertension and diabetes and exploring the regional variations in their prevalence using nationally representative survey data on adult population of age over 35 years. Bayesian spatial analysis was performed for both hypertension and diabetes data separately by fitting a model, that accounts for spatial variations, using integrated nested laplace approximation. The area-specific prevalence was then estimated as weighted average of the corresponding individual level predicted probabilities of being diseased derived from the fitted model, with weight from the individual level sampling weight. Finally, the estimated area-specific prevalence estimates were sketched in country-map to explore regional variations and identify regions with relatively higher prevalence. The results revealed that people of older age, higher education, better socio-economic condition, higher BMI are at greater risk of having hypertension and diabetes. Significant regional variations were observed with prevalence for hypertension ranges between 10% and 35% and for diabetes between 6% and 19% while their national prevalence were reported as 24% and 11%, respectively. The western regions of the country including middle capital city were found to be relatively more prevalent for hypertension while the middle-east and south-east regions were observed to be more prevalent for diabetes. The capital Dhaka region was observed as the most prevalent for both diabetes and hypertension. Details explanations of the findings and evidence based policy implications were discussed.
Article
Modern advances in computing power have greatly widened scientists' scope in gathering and investigating information from many variables, information which might have been ignored in the past. Yet to effectively scan a large pool of variables is not an easy task, although our ability to interact with data has been much enhanced by recent innovations in dynamic graphics. In this article, we propose a novel data-analytic tool, sliced inverse regression (SIR), for reducing the dimension of the input variable x without going through any parametric or nonparametric model-fitting process. This method explores the simplicity of the inverse view of regression; that is, instead of regressing the univariate output variable y against the multivariate X, we regress x against y. Forward regression and inverse regression are connected by a theorem that motivates this method. The theoretical properties of SIR are investigated under a model of the form, y = f(beta-1x, ..., beta(K)x, epsilon), where the beta-k's are the unknown row vectors. This model looks like a nonlinear regression, except for the crucial difference that the functional form of f is completely unknown. For effectively reducing the dimension, we need only to estimate the space [effective dimension reduction (e.d.r.) space] generated by the beta-k's. This makes our goal different from the usual one in regression analysis, the estimation of all the regression coefficients. In fact, the beta-k's themselves are not identifiable without a specific structural form on f. Our main theorem shows that under a suitable condition, if the distribution of x has been standardized to have the zero mean and the identity covariance, the inverse regression curve, E(x \ y), will fall into the e.d.r. space. Hence a principal component analysis on the covariance matrix for the estimated inverse regression curve can be conducted to locate its main orientation, yielding our estimates for e.d.r. directions. Furthermore, we use a simple step function to estimate the inverse regression curve. No complicated smoothing is needed. SIR can be easily implemented on personal computers. By simulation, we demonstrate how SIR can effectively reduce the dimension of the input variable from, say, 10 to K = 2 for a data set with 400 observations. The spin-plot of y against the two projected variables obtained by SIR is found to mimic the spin-plot of y against the true directions very well. A chi-squared statistic is proposed to address the issue of whether or not a direction found by SIR is spurious.
Article
In observational studies, investigators have no control over the treatment assignment. The treated and non-treated (that is, control) groups may have large differences on their observed covariates, and these differences can lead to biased estimates of treatment effects. Even traditional covariance analysis adjustments may be inadequate to eliminate this bias. The propensity score, defined as the conditional probability of being treated given the covariates, can be used to balance the covariates in the two groups, and therefore reduce this bias. In order to estimate the propensity score, one must model the distribution of the treatment indicator variable given the observed covariates. Once estimated the propensity score can be used to reduce bias through matching, stratification (subclassification), regression adjustment, or some combination of all three. In this tutorial we discuss the uses of propensity score methods for bias reduction, give references to the literature and illustrate the uses through applied examples.