ArticlePDF Available

Comparing comorbidity measures for predicting mortality and hospitalization in three population-based cohorts

Authors:

Abstract and Figures

Multiple comorbidity measures have been developed for risk-adjustment in studies using administrative data, but it is unclear which measure is optimal for specific outcomes and if the measures are equally valid in different populations. This research examined the predictive performance of five comorbidity measures in three population-based cohorts. Administrative data from the province of Saskatchewan, Canada, were used to create the cohorts. The general population cohort included all Saskatchewan residents 20+ years, the diabetes cohort included individuals 20+ years with a diabetes diagnosis in hospital and/or physician data, and the osteoporosis cohort included individuals 50+ years with diagnosed or treated osteoporosis. Five comorbidity measures based on health services utilization, number of different diagnoses, and prescription drugs over one year were defined. Predictive performance was assessed for death and hospitalization outcomes using measures of discrimination (c-statistic) and calibration (Brier score) for multiple logistic regression models. The comorbidity measures with optimal performance were the same in the general population (n = 662,423), diabetes (n = 41,925), and osteoporosis (n = 28,068) cohorts. For mortality, the Elixhauser index resulted in the highest c-statistic and lowest Brier score, followed by the Charlson index. For hospitalization, the number of diagnoses had the best predictive performance. Consistent results were obtained when we restricted attention to the population 65+ years in each cohort. The optimal comorbidity measure depends on the health outcome and not on the disease characteristics of the study population.
Content may be subject to copyright.
RESEARCH ARTICLE Open Access
Comparing comorbidity measures for
predicting mortality and hospitalization
in three population-based cohorts
Jacqueline M Quail
1*
, Lisa M Lix
1,2
, Beliz Acan Osman
1
and Gary F Teare
1
Abstract
Background: Multiple comorbidity measures have been developed for risk-adjustment in studies using
administrative data, but it is unclear which measure is optimal for specific outcomes and if the measures are
equally valid in different populations. This research examined the predictive performance of five comorbidity
measures in three population-based cohorts.
Methods: Administrative data from the province of Saskatchewan, Canada, were used to create the cohorts. The
general population cohort included all Saskatchewan residents 20+ years, the diabetes cohort included individuals
20+ years with a diabetes diagnosis in hospital and/or physician data, and the osteoporosis cohort included
individuals 50+ years with diagnosed or treated osteoporosis. Five comorbidity measures based on health services
utilization, number of different diagnoses, and prescription drugs over one year were defined. Predictive
performance was assessed for death and hospitalization outcomes using measures of discrimination (c-statistic) and
calibration (Brier score) for multiple logistic regression models.
Results: The comorbidity measures with optimal performance were the same in the general population (n=
662,423), diabetes (n= 41,925), and osteoporosis (n= 28,068) cohorts. For mortality, the Elixhauser index resulted in
the highest c-statistic and lowest Brier score, followed by the Charlson index. For hospitalization, the number of
diagnoses had the best predictive performance. Consistent results were obtained when we restricted attention to
the population 65+ years in each cohort.
Conclusions: The optimal comorbidity measure depends on the health outcome and not on the disease
characteristics of the study population.
Background
Population-based administrative databases are com-
monly used in studies about health status and health
service utilization. These databases enable easy access to
demographic and health-related data on large study
populations but they are not without limitations. Studies
that use administrative data employ observational
designs, which can result in differences amongst study
groups, which may also be related to the outcome of
interest. This may lead to spurious results if these differ-
ences are not controlled using appropriate risk-adjust-
ment methodologies. One of the most important
predictors of health-related outcomes is the presence of
comorbidities, or pre-existing health conditions that
coexist with an index disease [1]. Therefore, for all
research related to health-related events and services, it
is essential to risk-adjust for comorbidity in order to get
unbiased estimates.
A number of comorbidity measures have been applied to
administrative data. Some are simple, such as counts of
the number of physician visits, diagnoses, or prescription
drug dispensations within a prescribed time frame [2].
Comorbidity indices based on specific sets of diagnoses for
chronic conditions or prescription drugs used to treat
chronic conditions have also been developed. The Chronic
Disease Score (CDS) is a weighted index of the burden of
comorbid conditions based upon pharmaceutical data
from administrative databases [3]. Diagnosis-related
* Correspondence: jquail@hqc.sk.ca
Contributed equally
1
Saskatchewan Health Quality Council, Saskatoon, Canada
Full list of author information is available at the end of the article
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
© 2011 Quail et al ; licensee BioMed Centra l Ltd. This is an Open Access article distributed und er the terms of the Creative Commons
Attribution License (http://creative commons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in
any medium, pro vided the original work is properly cited.
indices include the Charlson and Elixhauser indices, which
use International Classification of Disease (ICD) diagnosis
codes to identify major health problems, such as heart or
lung disease [4,5]. Both the Charlson and Elixhauser
indices were originally developed using in-hospital popula-
tions to predict mortality, although they have been applied
to outpatient populations [6-8]. All of these measures are
potentially useful for comparing different populations
because they are general measures of comorbidity.
Previous research has focused on the performance of
these comorbidity measures in either specific popula-
tions or for specific outcomes, but never in multiple
populations for multiple outcomes. The majority of
research has focused on in-patient populations [9-11]
and chronic disease populations, including osteoarthritis
[6], hypertension [12], and migraine [13]. Most of these
studies investigate death or health care costs as the out-
come and results have been inconsistent, likely as the
result of differences between studies in data sources and
how both comorbidity measures and outcomes are
defined. More limited research has been conducted in
general populations and the research that has been done
has focused on mortality as the primary health outcome
measure [7,14]. Fewer have investigated health service
utilization outcome measures [7,15]. Overall, the major-
ity of studies focus on a single population or outcome
and the consistency of the findings across populations
and outcomes bears further investigation. With this in
mind, we investigated the performance of five comorbid-
ity measures for predicting death and health services
utilization outcomes in three population-based cohorts;
a general population cohort and two chronic disease
cohorts composed of individuals diagnosed with diabetes
or osteoporosis.
Methods
Data sources
Administrative health data for this research were
obtained from the province of Saskatchewan, Canada
which has a population of approximately 1.1 million
[16]. Data on hospital contacts, physician contacts, and
outpatient prescriptions are collected and captured in
electronic databases that can be anonymously linked via
a unique personal health insurance number [17,18]. Like
all Canadian provinces, Saskatchewan has a provincial
health insurance plan that virtually all members of the
population are registered in except for a relatively small
proportion of the population (<1%) whose health care is
covered by the federal government (Royal Canadian
Mounted Police, veterans, and inmates in federal peni-
tentiaries). Additionally, First Nations people that have
treaty relationships with the federal government
(approximately 9% of the provincial population) also
receive some of their health benefits, including prescrip-
tion drug benefits, from the federal government. There-
fore these individuals are not included in the provincial
prescription drug benefit plan and related administrative
data files that were used in this study.
Hospital data is stored in the Discharge Abstract
Database. Diagnoses in hospital data are recorded using
the International Classification of Diseases, 9th Revision
(i.e., ICD-9) up to and including fiscal year 2001/02,
where a fiscal year extends from April 1 to March 31.
In fiscal year 2001/02, the International Classification of
Diseases, 10th revision, Canadian Version (i.e., ICD-10-
CA) was introduced and virtually all codes were
recorded in this format from fiscal year 2002/03 onward.
Between three and sixteen diagnoses are captured in
each record prior to the introduction of ICD-10-CA,
and up to 25 diagnoses are captured subsequently. The
type of diagnosis is also recorded, which identifies the
most responsible diagnosis for admission, comorbid
diagnoses that are not directly related to admission, and
diagnoses that developed after admission and which
represent the development of complications during the
hospitalization.
Data on physician services are contained in the
Medical Services Database. Physicians who are paid on
a fee-for-service basis submit billing claims to the pro-
vincial health ministry. A single diagnosis using three-
digit ICD-9 codes is recorded on each claim. Physicians
who are salaried are required to submit billing claims
for administrative purposes, a practice known as shadow
billing.
The Prescription Drug Database contains information
on all outpatient drugs dispensed to Saskatchewan resi-
dents who are eligible for coverage. Approximately 9%
of Saskatchewan residents - primarily Registered Indians
-arenoteligiblebecausetheyhavetheirprescription
costs paid for by another government agency [18,19].
The database includes information on active ingredients,
strength and dosage form, date and quantity dispensed,
as well as the pharmacologic-therapeutic classification of
a drug based on the American Hospital Formulary Sys-
tem (AHFS) [20].
The Population Registry captures demographic charac-
teristics, location of residence, and dates of coverage by
the provincial health insurance plan. The Vital Statistics
Registry contains information on all births and deaths in
the province.
The accuracy and completeness of Saskatchewans
administrative databases have made them popular data
sources for numerous studies of population health and
health services utilization [21-23]. Ethical approval for
this research was received from the University of
Saskatchewan Biomedical Research Ethics Board.
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 2 of 12
Cohort definitions
We used these administrative databases to define three
cohorts - a general population cohort and two chronic dis-
ease cohorts that are subsets of the general population
cohort - to investigate whether the performance of comor-
bidity measures varies among different study populations
and by outcome. In order to be included in one of these
cohorts, individuals must have had uninterrupted health
coverage in the year we assessed comorbidity (fiscal year
2001/02) and the year we assessed outcomes (fiscal year
2002/03). For the diabetes and osteoporosis cohorts, we
used data from April 1, 1996 to March 31, 2002 to identify
people with either of these conditions.
The general population cohort was composed of all
Saskatchewan residents aged 20 and older. We defined
the diabetes cohort using the National Diabetes Surveil-
lance System case definition [24], which has been devel-
oped using Canadian administrative data and validated
in previous research [25,26]. This case definition has
been shown to have excellent sensitivity (86%) and spe-
cificity (97%) [25]. It identifies all individuals who have a
diagnosis of diabetes (ICD-9: 250; ICD-10-CA: E10-E14)
in at least one hospital record or in at least two physi-
cian claims within a two-year period. The index date is
the earliest date of a diabetes diagnosis. We identified
all Saskatchewan residents 20 years of age and older
who met the case definition using data from April 1,
1996 to March 31, 2002.
We defined the osteoporosis cohort based upon the
results of a validation study that evaluated the sensitivity
and specificity of osteoporosis diagnosis codes in hospi-
tal and physician data, and outpatient prescription drug
records for an osteoprotective drug by comparing them
to bone mineral densitometry tests from a provincial
screening program [15]. The osteoporosis case definition
identifies all individuals aged 50 or older who have a
diagnosis of osteoporosis (ICD-9: 733; ICD-10-CA: M80,
M81) in at least one hospital record or at least one phy-
sician claim, or who have at least one outpatient drug
dispensation for an osteoprotective medication (i.e.,
alendronate, clodronate, etidronate, pamidronate, rise-
dronate, salmon calcitonin, raloxifene, teriparatide, zole-
dronic acid). The case definition has been shown to
have a sensitivity of 89.4% and a specificity of 91.5% in
women 50 years of age and older [27]. We assigned the
index date as the earliest date of diagnosis or prescrip-
tion drug dispensation. We identified all Saskatchewan
residents 50 years of age and older who met the case
definition using data from April 1, 1996 to March 31,
2002. Individuals were excluded if they had a diagnosis
for Pagets disease (ICD-9: 731.0; ICD-10-CA: M88.0,
M88.8, M88.9) in the study period because they may
have different comorbidity characteristics than those
without the disease.
Comorbidity measures
Five comorbidity measures were considered: number of
different diagnoses, Charlson index, Elixhauser index,
number of different dispensed drugs, and the Chronic
Disease Score (CDS). Each measure was created using
data for fiscal year 2001/02.
The number of different diagnoses recorded to the
third digit in ICD-9 and ICD-10-CA was determined
using both the hospital and physician billing databases.
Any diagnoses related to pregnancy, childbirth, or abor-
tion were excluded because these events are not disease-
related.
TheCharlsonindexisaweightedindexofthebur-
den of comorbidity used to predict one-year mortality
[4]. It is calculated using diagnoses for 17 diseases
abstracted from hospital data. When present, each
condition is assigned a score from one to six and the
scores are summed to give a single value ranging from
0 to 32, where a higher score indicates a greater bur-
den of comorbidity. It was originally created using
ICD-9 codes but has been verified using ICD-10-CA
codes [28,29] as well as physician data [30]. We used
diagnoses codes from both hospital and physician data
to calculate the Charlson index using Quan et als
(2006) version [31,32]. For the hospital data, diagnoses
that developed after hospital admission and which
represent complications of the hospitalization were
excluded.
The Elixhauser index identifies the presence of 31
diseases using administrative data [5]. Each condition is
coded as present or absent and is entered into a statisti-
cal model as its own variable. Similar to the Charlson
index, the Elixhauser index was originally created using
ICD-9 codes but has since been verified using ICD-10-
CA codes [29]. We used diagnoses codes from both hos-
pital and physician data to calculate the Elixhauser index
using Quan et als (2006) version [33,34]. For the
hospital data, diagnoses that developed after hospital
admission and which represent complications of the
hospitalization were excluded.
The number of dispensed drugs was calculated using
the American Hospital Formulary Service (AHFS) phar-
macologic-therapeutic classification system by summing
the number of different four-digit drug classifications
for each cohort member to a maximum of 125. The
Chronic Disease Score (CDS) is also calculated based
upon the AHFS classification system [3]. Drugs used to
treat 17 conditions are identified and assigned a score
from one to five. The scores are summed to give a sin-
gle value ranging from 0 to 35, where a high score indi-
cates a greater burden of comorbidity. The CDS
predicts both hospitalization and mortality and is posi-
tively correlated with physician ratings of disease sever-
ity (r = 0.57) [3].
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 3 of 12
Outcome variables
Study outcomes included death and hospitalization
between April 1, 2002 and March 31, 2003. Death was
determined using data from the population registry, vital
statistics, and hospitalization databases. Two measures
of hospital use were examined: at least one hospitaliza-
tion and two or more hospitalizations. Hospitalizations
related to pregnancy, childbirth, and abortion were
excluded from the analysis. We investigated multiple
hospitalizations because hospitalization was a relatively
frequent occurrence in the chronic disease cohorts.
Other study variables
Other variables used to describe the cohorts include age,
sex, region of residence, and income quintile. All
demographic variables were determined for fiscal year
2001/02. Age and sex were identified from the Popula-
tion Registry. Region of residence and income were both
determined using a residents postal code identified
from the Population Registry. Income is a well known
factor that influences health outcomes, such that lower
income individuals are more likely to become ill,
disabled, and die than their more affluent counterparts
[35]. Region of residence may also affect health and the
use of health services. Nearly half of Saskatchewan
residentsliveinaruralareaandmayfacebarriersto
accessing health care services that urban residents do
not. A resident was categorized as living in an urban
area if his/her postal code was in a Census Metropolitan
Area or Census Agglomeration with a population of
10,000 or more.
Income quintiles were calculated using a method
based on average household income from the 2001 Sta-
tistics Canada Census [36]. Each residentspostalcode
wasidentifiedfromthePopulation Registry and linked
to a dissemination area, the smallest geographic unit
used in Census data. Residents were identified as
belonging to an income quintile based upon the Income
Per Person Equivalent which takes the size of a house-
hold into consideration. Income quintiles were calcu-
lated so that the entire population of Saskatchewan was
divided into five equal groups. Some residents could not
be assigned to a quintile because income measures are
suppressed for DAs with a small population. Approxi-
mately 14% of the total Saskatchewan population had a
missing income quintile. Imputed values were assigned
using a method that creates a predictive model for the
missing quintiles using sociodemographic variables that
are not suppressed such as marital status, ethnicity, and
employment status. A multiple imputation approach
was then used to assign income quintile [37]. After
applying this methodology, income could still not be
assigned to some rural areas in which approximately
one percent of Saskatchewan residents lived in fiscal
year 2001/02.
Statistical analysis
In order to ensure comparability with the general popu-
lation, we restricted both chronic disease cohorts to
individuals who were alive on April 1, 2002, and who
had uninterrupted health coverage for fiscal years 2001/
02 and 2002/03. Two sets of analyses were conducted;
one including all members of each cohort and the other
including only those cohort members age 65 and older.
Frequencies, means, and standard deviations were
used to describe the characteristics of each cohort. The
chronic disease cohorts are subsets of the general popu-
lation cohort and so McNemars test was used to test
for differences between the cohorts on each of the three
outcome measures. Validation of each comorbidity mea-
sure was conducted using multiple logistic regression
analysis [15]. Specifically, to assess the predictive perfor-
mance of each comorbidity measure a series of models
were fit to the data for each outcome. The base model
was comprised of the following variables: age (in years),
a quadratic age effect, sex, region of residence, and
income quintile. Five full models were then fit to the
data. Each model contained all of the variables in the
base model in addition to one or more variables defin-
ing the comorbidity measure. All comorbidity measures
were entered into the full models as continuous vari-
ables with the exception of the Elixhauser index, which
was included as a series of dichotomous variables. Sensi-
tivity analyses revealed that redefining the continuous
comorbidity measures as categorical variables did not
result in any substantial change in model fit as judged
by the Hosmer-Lemeshow goodness-of-fit test.
Discriminative performance for the base and full mod-
els was assessed using the c-statistic, which is equivalent
to the area under the receiver operating characteristic
(ROC) curve for dichotomous outcomes [27,38]. The c-
statisticrangesfromzerotoone,withavalueofone
representing perfect prediction and a value of 0.5 repre-
senting chance prediction. A value between 0.7 and 0.8
is considered to demonstrate acceptable predictive per-
formance, while a value greater than 0.8 demonstrates
excellent discriminative performance. The 95% confi-
dence intervals (CIs) were computed. Differences in the
c-statistic (i.e., Δc) for the base and full models were
tested using the method of DeLong et al. [39]. The per-
centage change in the c-statistic was also computed.
Model calibration was assessed using the Brier score,
which ranges from zero to one [28]. A lower score indi-
cates less prediction error. Given that a score of 0.25
can be achieved by assigning an event probability of 0.5
to each individual [28], a value less than 0.25 was
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 4 of 12
considered to represent acceptable prediction error. The
standard deviation of the Brier score was also computed.
Analyses were conducted for each full cohort and then
for the age-restricted (65+) cohort using SAS software
[40].
Results
Table 1 contains summary data for the demographic
and socioeconomic variables, health outcomes, and
comorbidity measures for each cohort. We describe first
the findings for the full cohorts. We identified 662,423
individuals in the general population cohort, 41,925
individuals age in the diabetes cohort, and 28,028 indivi-
duals in the osteoporosis cohort. A total of 2,909 indivi-
duals were members of both the diabetes cohort (6.9%)
and osteoporosis cohort (10.4%). The general population
cohort was, on average, younger than the diabetes
cohort [mean age (SD) 47.9(17.9) versus 62.4(15.0)].
Females represented approximately 50% of both the
general population and diabetes cohort and the majority
of the osteoporosis cohort (86.1%). A greater proportion
of members of the diabetes cohort had incomes in the
lowest quintile (26.1%) than both the general population
and osteoporosis cohorts. The majority of individuals
lived in an urban setting, ranging from 52% in the dia-
betes cohort to 58% in both the general population and
osteoporosis cohort.
All three outcomes occurred less frequently in the
general population cohort than in either of the other
two cohorts. Each outcome was two to three times
more common in the diabetes and osteoporosis cohorts.
These differences were statistically significant for both
hospitalization outcomes but not for death.
For each comorbidity measure, the general population
cohort had the lowest mean score. The osteoporosis
cohort had the highest mean number of diagnoses while
the diabetes cohort had the highest mean number of
drugs and chronic disease score. Mean Charlson index
summary scores were identical in the osteoporosis and
diabetes cohorts. The Charlson value is less than one in
these disease-based cohorts because the cohorts were
created using diagnosis codes recorded over a 6-year
period, whereas the comorbidity measures were created
using diagnosis codes recorded within a single year.
Table 2 summarizes the distribution of Elixhauser cate-
gories. In each full cohort, uncomplicated hypertension
Table 1 Description of general population, diabetes, and osteoporosis full and age-restricted cohorts
General Population Diabetes Osteoporosis
Variable Full
(n= 662,423)
65+ years
(n= 137,700)
Full
(n= 41,925)
65+ years
(n= 20,025)
Full
(n= 28,068)
65+ years
(n= 20,090)
DEMOGRAPHICS
Age, mean (SD) 47.9 (17.9) 75.3 (7.3) 62.4 (15.0) 75.2 (6.9) 71.6 (10.8) 77.0 (7.4)
Female, % 51.3 56.8 47.8 49.7 86.1 87.1
Urban residence, % 58.2 51.1 51.8 49.5 58.0 56.8
Missing 0.2 0.1 0.1 0.1 0.1 0.1
Income quintile, %
Q1 (lowest) 21.6 22.2 26.1 24.7 22.2 23.3
Q2 22.0 23.6 23.0 23.6 22.3 22.7
Q3 18.2 18.7 17.0 18.3 18.7 19.2
Q4 16.6 15.8 14.8 15.0 17.0 16.5
Q5 (highest) 20.4 18.4 17.9 17.2 18.9 17.5
Missing 1.2 1.3 1.2 1.3 1.0 0.8
OUTCOMES
Death, % 1.3 5.1 4.3 7.5 4.7 6.1
One or more hospitalizations, % 17.4 31.8 31.9 39.9 33.9 37.2
Two or more hospitalizations, % 5.1 12.6 13.2 17.9 14.8 16.9
COMORBIDITY MEASURES
# Diagnoses,
mean (SD)
3.9 (4.4) 6.3 (5.6) 7.5 (6.4) 8.4 (6.7) 8.0 (6.2) 8.5 (6.4)
Charlson index
summary score, mean (SD)
0.3 (0.9) 0.7 (1.5) 0.8 (1.5) 1.1 (1.8) 0.8 (1.5) 0.9 (1.5)
# Drugs, mean (SD) 1.8 (1.0) 2.5 (1.2) 5.1 (3.9) 6.1 (3.9) 4.1 (2.4) 4.4 (2.4)
CDS, mean (SD) 1.4 (2.6) 3.3 (3.4) 4.8 (3.8) 5.6 (3.7) 3.6 (3.6) 4.0 (3.6)
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 5 of 12
was the most common chronic condition followed by
chronic pulmonary disease. Few members of the general
population had congestive heart failure (2.0%) although it
was the third and fourth most common chronic condi-
tion in the diabetes cohort (8.0%) and osteoporosis
cohort (8.2%), respectively. The prevalence of each
chronic condition was lowestinthegeneralpopulation
cohort for all Elixhauser categories. Compared to the
other cohorts, members of the diabetes cohort were
more likely to be identified as having complicated dia-
betes (14.7%) and renal failure (3.1%), while members of
the osteoporosis cohort were more likely to be identified
as having hypothyroidism (9.0%), depression (7.5%), and
rheumatic disease (6.9%).
When we restricted attention to only those cohort
members who were 65+ years, we found the general
population and diabetes cohorts were similar in terms of
age and urban residence, while the osteoporosis cohort
was slightly older and composed of more women (Table
1). While mean scores for all comorbidity measures
increased, the pattern across the age-restricted cohorts
was similar to that for the full cohorts.
Table 3 reports the results of the multivariable ana-
lyses for the full and age-restricted general population
Table 2 Elixhauser Index categories for the general population, diabetes, and osteoporosis full and age-restricted
cohorts, 2001/02
General Population Diabetes Osteoporosis
Variable Full
(%)
65+ years
(%)
Full
(%)
65+ years
(%)
Full
(%)
65+ years
(%)
Hypertension, uncomplicated 16.7 42.5 42.7 51.1 39.9 44.8
Chronic pulmonary disease 8.4 12.7 13.5 14.3 15.2 15.9
Depression 5.6 4.6 6.0 4.8 7.5 7.1
Hypothyroidism 3.4 6.6 4.7 5.6 9.0 9.4
Solid tumor 2.5 7.6 5.4 8.8 6.7 7.3
Congestive heart failure 2.0 8.1 8.0 13.4 8.2 10.7
Psychiatric disorder 1.3 4.1 2.7 4.4 4.3 5.6
Rheumatic disease 1.2 2.4 1.9 2.2 6.9 7.0
Diabetes, complicated 1.0 2.8 14.7 18.5 2.3 2.6
Valvular disease 1.0 2.6 2.2 3.1 2.8 3.3
Other neurological disorders 1.0 1.3 1.2 1.3 1.8 1.7
Cardiac arrhythmias 0.8 3.2 2.8 4.7 3.0 3.9
Fluid and electrolyte disorders 0.8 2.5 2.7 4.0 3.2 4.1
Coagulopathies 0.8 2.8 2.5 3.9 2.9 3.6
Metastatic cancer 0.8 2.4 1.7 2.8 2.4 2.6
Renal failure 0.6 1.8 3.1 4.2 1.9 2.3
Drug abuse 0.5 0.1 0.4 0.1 0.2 0.1
Peripheral vascular disease 0.4 1.3 1.5 2.2 1.2 1.5
Deficiency anemia 0.4 1.0 0.8 1.3 1.3 1.5
Hypertension, complicated 0.3 0.9 1.0 1.5 1.0 1.2
Pulmonary circulation disorders 0.3 1.0 0.8 1.2 1.2 1.4
Liver disease 0.3 0.3 0.7 0.6 0.5 0.4
Alcohol abuse 0.3 0.2 0.7 0.4 0.2 0.2
Diabetes, uncomplicated 0.2 0.5 3.0 3.3 0.4 0.4
Obesity 0.2 0.3 1.0 0.9 0.3 0.3
Paraplegia 0.2 0.3 0.4 0.6 0.3 0.3
Peptic ulcer disease 0.1 0.4 0.4 0.5 0.4 0.5
Lymphoma 0.1 0.2 0.2 0.3 0.3 0.3
Weight loss 0.1 0.2 0.2 0.2 0.3 0.3
Blood loss anemia <0.1 0.1 0.1 0.1 0.1 0.1
AIDS <0.1 <0.1 <0.1 0 0 0
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 6 of 12
cohorts. The base model for death in the full cohort
had a c-statistic of 0.880 (95% CI: 0.886, 0.892) and a
Brier score of 0.012, indicating excellent discrimination
and very low prediction error. The addition of a
comorbidity measure to the base model yielded a sta-
tistically significant improvement in the c-statistic for
all five measures, although the amount of change was
small (< 4%). The addition of the Elixhauser index to
the base model was associated with the largest c-
statistic (c = 0.913; 95% CI: 0.910, 0.916), followed by
the Charlson index (c = 0.905; 95% CI: 0.902, 0.908).
Hospitalization base models had markedly lower c-
statistics than the mortality base model. In fact, the
c-statistic for the base model for any hospitalization in
the full cohort failed to exceed the threshold of 0.70
and barely surpassed it for multiple hospitalizations
Table 3 Model comparisons for mortality and hospitalization in the general population cohort, full and age-restricted
Model Full cohort (n = 662,423) Age 65+ (n = 137,700)
c-statistic
(95% CI)
Brier
score (SD)
Δc(%) c-statistic
(95% CI)
Brier
score (SD)
Δc(%)
Death
Base model 0.880
(0.877, 0.884)
0.012 (0.098) 0.729
(0.723, 0.735)
0.046
(0.183)
+ # diagnoses 0.901
(0.898, 0.904)
0.012 (0.096) 0.021 (2.36) 0.769
(0.764, 0.775)
0.046
(0.179)
0.040 (5.53)
+ Charlson 0.905
(0.902, 0.908)
0.012 (0.095) 0.025 (2.81) 0.785
(0.779, 0.790)
0.045
(0.177)
0.056 (7.64)
+ Elixhauser 0.913
(0.910, 0.916)
0.012 (0.093) 0.033 (3.73) 0.805
(0.799, 0.810)
0.044
(0.173)
0.076 (10.36)
+ # drugs 0.894
(0.890, 0.897)
0.012 (0.096) 0.013 (1.53) 0.764
(0.759, 0.770)
0.045
(0.178)
0.035 (4.83)
+ CDS 0.889
(0.886, 0.892)
0.012 (0.097) 0.009 (1.01) 0.751
(0.745, 0.757)
0.046
(0.181)
0.022 (3.00)
One or more hospitalizations
Base model 0.652
(0.651, 0.654)
0.138 (0.238) 0.563
(0.559, 0.566)
0.215
(0.171)
+ # diagnoses 0.722
(0.720, 0.724)
0.130 (0.233) 0.070 (10.68) 0.668
(0.664, 0.671)
0.202
(0.189)
0.105 (18.60)
+ Charlson 0.671
(0.669, 0.672)
0.136 (0.238) 0.018 (2.81) 0.613
(0.610, 0.616)
0.210
(0.179)
0.050 (8.92)
+ Elixhauser 0.682
(0.680, 0.683)
0.134 (0.236) 0.029 (4.47) 0.630
(0.627, 0.633)
0.206
(0.184)
0.067 (11.92)
+ # drugs 0.688
(0.686, 0.690)
0.134 (0.235) 0.036 (5.46) 0.625
(0.622, 0.628)
0.207
(0.181)
0.063 (11.10)
+ CDS 0.672
(0.671, 0.674)
0.136 (0.236) 0.020 (3.05) 0.604
(0.601, 0.607)
0.210
(0.177)
0.041 (7.34)
Two or more hospitalizations
Base model 0.706
(0.704, 0.709)
0.047 (0.187) 0.571
(0.567, 0.576)
0.110
(0.246)
+ # diagnoses 0.782
(0.779, 0.785)
0.045 (0.179) 0.075 (10.67) 0.686
(0.682, 0.690)
0.105
(0.235)
0.115 (20.07)
+ Charlson 0.731
(0.728, 0.734)
0.047 (0.184) 0.024 (3.42) 0.633
(0.629, 0.638)
0.108
(0.241)
0.062 (10.92)
+ Elixhauser 0.748
(0.745, 0.751)
0.046 (0.181) 0.042 (5.91) 0.653
(0.649, 0.658)
0.106
(0.237)
0.082 (14.40)
+ # drugs 0.744
(0.742, 0.747)
0.046 (0.182) 0.038 (5.37) 0.638
(0.633, 0.642)
0.107
(0.238)
0.067 (11.66)
+ CDS 0.729
(0.726, 0.732)
0.047 (0.184) 0.023 (3.18) 0.619
(0.614, 0.623)
0.108
(0.241)
0.048 (8.32)
Base model includes age, age
2
, sex, income quintile, and geography
CDS = Chronic Disease Score
Δc= difference in the c-statistic between the base and full models; c-statistics in boldface font are significantly different from the c-statistic for the base model,
according to the method of DeLong et al. [39]
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 7 of 12
(c = 0.706; 95% CI: 0.704, 0.709). For each model, the
addition of every comorbidity measure was associated
with a statistically significant increase in the c-statistic.
The number of diagnoses was the best-performing
comorbidity measure for the models of one or more
hospitalizations (c = 0.722; 95% CI: 0.720, 0.724) and
two or more hospitalizations (c = 0.782; 95% CI: 0.779,
0.785).
Table 4 provides the multivariable models for the full
and age-restricted diabetes cohorts. Similar to the gen-
eral population cohort, the base model for death had
the largest c-statistic in the full cohort (c = 0.781; 95%
CI: 0.770, 0.791). The addition of each comorbidity mea-
sure to the base model resulted in a significant improve-
ment in the c-statistic. The greatest improvement was
observed for the Elixhauser index (c = 0.845; 95% CI:
Table 4 Model comparisons for mortality and hospitalization for diabetes cohort, full and age-restricted
Model Full cohort (n = 41,925) Age 65+ (n = 20,025)
c-statistic
(95% CI)
Brier
score (SD)
Δc(%) c-statistic
(95% CI)
Brier
score (SD)
Δc(%)
Death
Base model 0.781
(0.770, 0.791)
0.038
(0.166)
0.695
(0.680, 0.709)
0.067
(0.209)
+ # diagnoses 0.818
(0.809, 0.828)
0.037
(0.161)
0.037 (4.78) 0.749
(0.736, 0.762)
0.065
(0.202)
0.054 (7.81)
+ Charlson 0.830
(0.821, 0.839)
0.037
(0.161)
0.049 (6.33) 0.764
(0.752, 0.776)
0.065
(0.202)
0.070 (10.02)
+ Elixhauser 0.845
(0.836, 0.854)
0.036
(0.156)
0.064 (8.20) 0.788
(0.776, 0.800)
0.063
(0.196)
0.094 (13.50)
+ # drugs 0.799
(0.789, 0.810)
0.038
(0.163)
0.018 (2.36) 0.729
(0.715, 0.742)
0.065
(0.204)
0.034 (4.92)
+ CDS 0.789
(0.779, 0.800)
0.038
(0.165)
0.009 (1.11) 0.709
(0.695, 0.723)
0.066
(0.207)
0.014 (2.07)
One or more hospitalizations
Base model 0.610
(0.604, 0.616)
0.211
(0.174)
0.547
(0.539, 0.556)
0.238
(0.104)
+ # diagnoses 0.701
(0.695, 0.706)
0.195
(0.194)
0.091 (14.90) 0.655
(0.647, 0.663)
0.224
(0.148)
0.108 (19.63)
+ Charlson 0.666
(0.660, 0.672)
0.203
(0.187)
0.056 (9.21) 0.621
(0.613, 0.629)
0.231
(0.130)
0.074 (13.45)
+ Elixhauser 0.677
(0.671, 0.682)
0.198
(0.192)
0.067 (10.98) 0.634
(0.626, 0.642)
0.226
(0.143)
0.086 (15.76)
+ # drugs 0.641
(0.635, 0.647)
0.206
(0.181)
0.031 (5.11) 0.604
(0.596, 0.612)
0.232
(0.125)
0.056 (10.28)
+ CDS 0.624
(0.618, 0.630)
0.208
(0.177)
0.014 (2.31) 0.578
(0.570, 0.587)
0.235
(0.115)
0.031 (5.65)
Two or more hospitalizations
Base model 0.617
(0.609, 0.625)
0.113
(0.242)
0.546
(0.535, 0.556)
0.146
(0.245)
+ # diagnoses 0.726
(0.719, 0.733)
0.106
(0.230)
0.109 (17.61) 0.674
(0.665, 0.684)
0.139
(0.237)
0.129 (23.55)
+ Charlson 0.694
(0.687, 0.702)
0.110
(0.236)
0.077 (12.55) 0.643
(0.633, 0.653)
0.143
(0.240)
0.097 (17.76)
+ Elixhauser 0.709
(0.702, 0.717)
0.107
(0.232)
0.092 (14.96) 0.655
(0.645, 0.665)
0.139
(0.237)
0.109 (20.02)
+ # drugs 0.651
(0.643, 0.658)
0.111
(0.237)
0.034 (5.45) 0.603
(0.592, 0.614)
0.144
(0.241)
0.057 (10.49)
+ CDS 0.633
(0.625, 0.641)
0.112
(0.240)
0.016 (2.63) 0.580 (0.569, 0.590) 0.145
(0.242)
0.034 (6.22)
Base model includes age, age
2
, sex, income quintile, and geography
CDS = Chronic Disease Score
Δc= difference in the c-statistic between the base and full models; c-statistics in boldface font are significantly different from the c-statistic for the base model,
according to the method of DeLong et al. [39]
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 8 of 12
0.836, 0.854), followed by the Charlson index (c = 0.830;
95%CI:0.821,0.839).C-statistics for the base models
predicting hospitalization were below 0.70, and although
the addition of each comorbidity measure resulted in a
statistically significant improvement in the c-statistic,
few models exceeded the threshold of 0.70. The number
of diagnoses was associated with largest c-statistic for at
least one hospitalization (c- = 0.701; 95%CI: 0.695,
0.672) and two or more hospitalizations (c = 0.726; 95%
CI: 0.719, 0.733). Overall, there was better predictive
performanceinthegeneralcohortthaninthediabetes
cohort for all outcomes.
Table 5 reports the results of the multivariable models
for the full and age-restricted osteoporosis cohorts. The
base model for death had the largest c-statistic (c =
0.758; 95% CI: 0.745, 0.771). The addition of each
comorbidity measure to the base model was associated
with a statistically significant improvement in the
Table 5 Model comparisons for mortality and hospitalization for osteoporosis cohort, full and age-restricted
Model Full cohort (n = 28,068) Age 65+ (n = 20,090)
c-statistic
(95% CI)
Brier
score (SD)
Δc(%) c-statistic
(95% CI)
Brier
Score (SD)
Δc(%)
Death
Base model 0.758
(0.745, 0.771)
0.042
(0.175)
0.718
(0.703, 0.733)
0.055
(0.196)
+ # diagnoses 0.792
(0.780, 0.804)
0.042
(0.171)
0.034 (4.48) 0.756
(0.742, 0.769)
0.054
(0.191)
0.038 (5.29)
+ Charlson 0.811
(0.800, 0.822)
0.042
(0.170)
0.053 (6.95) 0.772
(0.759, 0.785)
0.054
(0.190)
0.054 (7.57)
+ Elixhauser 0.827
(0.817, 0.838)
0.040
(0.166)
0.069 (9.09) 0.793
(0.780, 0.805)
0.052
(0.185)
0.075 (10.42)
+ # drugs 0.782
(0.769, 0.794)
0.042
(0.172)
0.024 (3.10) 0.746
(0.732, 0.760)
0.054
(0.191)
0.028 (3.91)
+ CDS 0.778
(0.766, 0.791)
0.042
(0.172)
0.020 (2.62) 0.741
(0.727, 0.754)
0.054
(0.192)
0.023 (3.15)
One or more hospitalizations
Base model 0.579
(0.572, 0.586)
0.220
(0.157)
0.550
(0.542, 0.558)
0.232
(0.129)
+ # diagnoses 0.667
(0.660, 0.673)
0.208
(0.178)
0.088 (15.14) 0.647
(0.639, 0.654)
0.220
(0.157)
0.097 (17.54)
+ Charlson 0.619
(0.612, 0.626)
0.216
(0.166)
0.040 (6.98) 0.598
(0.590, 0.606)
0.227
(0.141)
0.048 (8.73)
+ Elixhauser 0.637
(0.630, 0.644)
0.212
(0.173)
0.058 (10.08) 0.620
(0.612, 0.628)
0.223
(0.151)
0.070 (12.65)
+ # drugs 0.631
(0.624, 0.638)
0.213
(0.168)
0.052 (9.02) 0.609
(0.601, 0.618)
0.225
(0.144)
0.059 (10.80)
+ CDS 0.618
(0.611, 0.625)
0.215
(0.166)
0.039 (6.69) 0.597
(0.589, 0.605)
0.227
(0.141)
0.047 (8.61)
Two or more hospitalizations
Base model 0.606
(0.597, 0.615)
0.124
(0.243)
0.570
(0.559, 0.580)
0.139
(0.245)
+ # diagnoses 0.697
(0.688, 0.705)
0.118
(0.235)
0.090 (14.92) 0.669
(0.659, 0.679)
0.133
(0.238)
0.099 (17.37)
+ Charlson 0.650
(0.641, 0.659)
0.122
(0.240)
0.044 (7.19) 0.619
(0.609, 0.630)
0.137
(0.242)
0.050 (8.73)
+ Elixhauser 0.672
(0.663, 0.681)
0.119
(0.237)
0.066 (10.86) 0.645
(0.634, 0.655)
0.135
(0.239)
0.075 (13.15)
+ # drugs 0.656
(0.647, 0.665)
0.121
(0.237)
0.050 (8.28) 0.625
(0.615, 0.636)
0.136
(0.240)
0.056 (9.80)
+ CDS 0.651
(0.642, 0.660)
0.121
(0.238)
0.045 (7.41) 0.620
(0.610, 0.631)
0.137
(0.240)
0.051 (8.89)
Base model includes age, age
2
, sex, income quintile, and geography
CDS = Chronic Disease Score
Δc= difference in the c-statistic between the base and full models; c-statistics in boldface font are significantly different from the c-statistic for the base model,
according to the method of DeLong et al. [39]
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 9 of 12
c-statistic, although the amount of change was largest
for the Elixhauser index (c = 0.827: 95% CI: 0.817,
0.838), followed by the Charlson index (c = 0.811; 95%
CI: 0.800, 0.822). For both hospitalization outcomes, the
c-statistic of all models failed to exceed 0.70. The num-
ber of diagnoses was associated with the greatest
improvement in the c-statistic for at least one hospitali-
zation (c-= 0.667; 95% CI: 0.660, 0.673) as well as for
two or more hospitalizations (c = 0.688; 95% CI: 0.688,
0.705). Overall, there was better predictive performance
in the general cohort than in the diabetes cohort for all
outcomes.
Results for the age-restricted cohorts were similar to
the results for each full cohort. C-statistics were lower
and Brier scores were higher in all models in the age-
restricted cohort compared to their equivalent models
in full cohort, but the same comorbidity measures were
identified as having the greatest improvement in predic-
tive performance. Additionally, the change in the c-sta-
tistic was greater for each model in the age-restricted
cohort model than for the counterpart model in the full
cohort.
Discussion
This research comparing discrimination and prediction
error associated with comorbidity measures in three
population-based cohorts has the following key findings.
First, the optimal comorbidity measure depends upon
the outcome of interest. However, for each of the out-
comes, the best performing comorbidity measures was
consistent across populations with different disease
characteristics. Finally, compared to the full general
population cohort, models in the chronic disease and
age-restricted cohorts had poorer discriminative perfor-
mance of a base set of demographic and socioeconomic
variables, but the improvement in discriminative perfor-
mance associated with a comorbidity measure was con-
sistently larger.
We found that the best performing comorbidity mea-
sure depends upon the outcome of interest. The number
of different diagnoses in both hospital and physician
databases was the best predictor of hospitalization,
whereas the best predictors of death were the Elixhauser
and Charlson indices. Our findings for health services
utilization are similar to those of Farley et al., who
foundthatnumberofdiagnosesandnumberofdrugs
performed better than the Elixhauser and Charlson
indices for predicting health care expenditures in a gen-
eral population [41]. However, Perkins et al. reported
that a simple count of medications was the best predic-
tor of health care costs in a population-based cohort of
community-dwelling older adults aged 60 and older [7].
Perkins et al. investigated many of the same comorbidity
measures that we did including count of diagnoses,
count of drugs, CDS, and the Charlson index using both
hospital and physician data. The discrepancy between
Perkinsand our findings may be the result of differ-
ences in how the number of drugs was ascertained. Per-
kins et al. had a pharmacist divide drugs into detailed
subclasses whereas we used the AHFS to the fourth
digit, which is not as specific. Another possible explana-
tion is that although Perkins et al. included a count of
diagnoses as one of their comorbidity measures, the
researchers restricted the count to ten common chronic
conditions, whereas we identified all conditions based
on diagnoses codes recorded in hospital and physician
databases.
Our finding that the Elixhauser and Charlson indices
are the optimal predictors of one-year mortality is also
in keeping with other studies of different populations
including in-patient populations, community-dwelling
older adults, and hypertensive adults [7,9,12,42].
Furthermore, studies that directly compare the Elixhau-
ser to Charlson indices found, as we did, that the Elix-
hauser index performed better than the Charlson index
[6,43,44].
Our findings show that, although the optimal comor-
bidity measure varies by outcome, results are remarkably
consistent across study populations with different dis-
ease profiles. This consistency in the performance of
comorbidity measures predicting one-year mortality was
also observed by Stukenborg et al. (2001) in five in-
patient populations (acute myocardial infarction, conges-
tive heart failure, chronic obstructive pulmonary disease,
hypertension with complications, and acute cerebrovas-
cular disease), although they limited their investigation
to only the Elixhauser and Charlson indices. Similarly,
in 2004 Schneeweiss et al. compared the discriminative
performance of four versions of the Charlson index and
two versions of the CDS for predicting one-year mortal-
ity. They found the rank order for the best performing
comorbidity measures was identical in four cohorts of
community-dwelling adults age 64 and older - two with
cardiovascular disease and two without restrictions. Our
results confirm these findings, but further expand upon
their work by investigating additional comorbidity mea-
sures, investigating health services utilization as an out-
come, and including adults of all ages in the study
cohorts.
Although the rank ordering of comorbidity measures
in terms of discriminant performance was consistent
across our study populations, we found that perfor-
mance was poorer in the chronic disease cohorts. Com-
pared to identical models in the general population
cohort, models in the diabetes and osteoporosis cohorts
had poorer discriminative performance (i.e., lower c-sta-
tistic values) and higher prediction error (i.e., higher
Brier scores). This may be the result of not including
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 10 of 12
important disease-specific variables in our models such
as the recency of diagnoses or exposure to certain medi-
cations. We did not include them in order to make the
models directly comparable between study cohorts. Par-
ticularly for health services outcomes, for which the c-
statistic rarely surpassed the cutoff of 0.70, adding other
variables might improve the performance of these
models.
We also observed diminished performance in the age-
restricted cohorts compared to the full cohorts although
the magnitudes of change in the c-statistics were consis-
tently larger. The increase in the magnitudes of change
in the c-statistics may be explained by the fact that
advancing age is associated with the presence of multi-
ple chronic illnesses [45]. The high prevalence of
chronic disease would account for a greater proportion
of the variation observed in the age-restricted cohorts
compared to the full cohort and which would corre-
spond to a proportionately greater increase in the c-
statistic.
The strengths of this study include the use of popula-
tion-based administrative databases and our comprehen-
sive investigation of multiple comorbidity measures with
multiple outcomes in a variety of study populations.
Furthermore, the cohorts are population-based which
ensures good generalizability of our findings.
There are some limitations to our research. Misclassi-
fication bias for specific comorbid conditions may occur
as a result of inaccurate diagnosis coding in the admin-
istrative databases. Specifically, physicians may code a
suspected diagnosis only to rule it out later and which
would falsely inflate scores for diagnosis-based comor-
bidity measures. As well, comorbidity measures were
constructed using a single year of administrative data so
individuals that have a disease but did not have any
health service records containing a diagnosis of that dis-
ease during the one year timeframe would not be cap-
tured in the comorbidity measure for that disease. For
example, of the diabetes cohort identified for this study
- based on diagnosis codes occurring during a 6-year
period - only 15% had a diabetes diagnosis code occur-
ring during the one-year period upon which the Elix-
hauser index was based. We could have reduced the
likelihood for misclassification of the comorbidity mea-
sures by expanding the time-frame for ascertainment of
comorbidity to more than one year, but a one-year time
frame has been adopted in other studies [7-9] and
enables direct comparison with our results. Further-
more, a study comparing varying time frames for the
assessment of comorbidity in the prediction of one-year
mortality found that a longer time frame did not
improve the c-statistic substantially [46]. Another limita-
tion is that our finding that the best performing comor-
bidity measure is consistent across cohorts may be
because the chronic disease cohorts are subsets of the
general population cohort. This is unlikely though, given
that people with diabetes and/or osteoporosis comprise
only about 5% of the full general population cohort and
15% of the age 65+ general population cohort.
Conclusions
Based on the results of this research, the optimal
comorbidity measure is primarily dependent upon the
outcome of interest and not on the study population.
Overall, the count of diagnoses had the best predictive
performance for hospital utilization while the Elixhauser
index, followed by the Charlson index, had the best pre-
dictive performance for mortality. These findings were
consistent in a general population cohort and two
chronic disease cohorts, even when analyses were
restricted to older adults.
Acknowledgements and Funding
This research was supported in part by a Canadian Institutes of Health
Research New Investigator Award and funding from the Centennial Chair
Program, University of Saskatchewan to the second author. This study is
based in part on de-identified data provided by the Saskatchewan Ministry
of Health. The interpretation and conclusions contained herein do not
necessarily represent those of the Government of Saskatchewan or the
Ministry of Health.
Author details
1
Saskatchewan Health Quality Council, Saskatoon, Canada.
2
School of Public
Health, University of Saskatchewan, Saskatoon, Canada.
Authorscontributions
JQ developed the study concept, analyzed the data, interpreted results, and
prepared the manuscript. LL developed the study concept and
methodology, and interpreted results. BA assisted with the literature review
and data analysis. GT developed the study concept. All authors read and
approved the final manuscript.
Competing interests
JQ: No disclosures
LL: Unrestricted research grant from Amgen Canada
GT: No disclosures
BA: No disclosures
Received: 7 December 2010 Accepted: 10 June 2011
Published: 10 June 2011
References
1. Valderas JM, Starfield B, Sibbald B, Salisbury C, Roland M: Defining
comorbidity: implications for understanding health and health services.
Ann Fam Med 2009, 7:357-363.
2. Schneeweiss S: Sensitivity analysis and external adjustment for
unmeasured confounders in epidemiologic database studies of
therapeutics. Pharmacoepidemiol Drug Saf 2006, 15:291-303.
3. von Korff M, Wagner EH, Saunders K: A chronic disease score from
automated pharmacy data. J Clin Epidemiol 1992, 45:197-203.
4. Charlson ME, Pompei P, Ales KL, MacKenzie CR: A new method of
classifying prognostic comorbidity in longitudinal studies: development
and validation. J Chronic Dis 1987, 40:373-383.
5. Elixhauser A, Steiner C, Harris DR, Coffey RM: Comorbidity measures for
use with administrative data. Med Care 1998, 36:8-27.
6. Dominick KL, Dudley TK, Coffman CJ, Bosworth HB: Comparison of three
comorbidity measures for predicting health service use in patients with
osteoarthritis. Arthritis Rheum 2005, 53:666-672.
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 11 of 12
7. Perkins AJ, Kroenke K, Unutzer J, Katon W, Williams JW, Hope C,
Callahan CM: Common comorbidity scales were similar in their ability to
predict health care costs and mortality. J Clin Epidemiol 2004,
57:1040-1048.
8. Schneeweiss S, Seeger JD, Maclure M, Wang PS, Avorn J, Glynn RJ:
Performance of comorbidity scores to control for confounding in
epidemiologic studies using claims data. Am J Epidemiol 2001,
154:854-864.
9. Chu YT, Ng YY, Wu SC: Comparison of different comorbidity measures for
use with administrative data in predicting short- and long-term
mortality. BMC Health Serv Res 2010, 10:140.
10. DHoore W, Bouckaert A, Tilquin C: Practical considerations on the use of
the Charlson comorbidity index with administrative data bases. J Clin
Epidemiol 1996, 49:1429-1433.
11. Grunau GL, Sheps S, Goldner EM, Ratner PA: Specific comorbidity risk
adjustment was a better predictor of 5-year acute myocardial infarction
mortality than general methods. J Clin Epidemiol 2006, 59:274-280.
12. Tang J, Wan JY, Bailey JE: Performance of comorbidity measures to
predict stroke and death in a community-dwelling, hypertensive
Medicaid population. Stroke 2008, 39:1938-1944.
13. Baser O, Palmer L, Stephenson J: The estimation power of alternative
comorbidity indices. Value Health 2008, 11:946-955.
14. Schneeweiss S, Wang PS, Avorn J, Glynn RJ: Improved comorbidity
adjustment for predicting mortality in Medicare populations. Health Serv
Res 2003, 38:1103-1120.
15. Schneeweiss S, Maclure M: Use of comorbidity scores for control of
confounding in studies using administrative databases. Int J Epidemiol
2000, 29:891-898.
16. 2006 Community Profiles. [http://www12.statcan.ca/census-recensement/
2006/dp-pd/prof/92-591/index.cfm?Lang=E].
17. Downey W, Beck P, McNutt M, Stang M, Osei W, Nichol J: Health
Databases in Saskatchewan. In Pharmacoepidemiology. Edited by: Strom B.
Chinchester: Wiley; 2000:325-345.
18. Health services databases: Information document. [http://www.health.
gov.sk.ca/Default.aspx?DN = 2103410e-ad99-4bf5-ba42-dc07b16f45a6].
19. Li B, Evans D, Faris P, Dean S, Quan H: Risk adjustment performance of
Charlson and Elixhauser comorbidities in ICD-9 and ICD-10
administrative databases. BMC Health Serv Res 2008, 8:12.
20. Home page of the American Society of Health-System Pharmacists.
[http://www.ashp.org/].
21. Edouard L, Rawson NS: Reliability of the recording of hysterectomy in the
Saskatchewan health care system. Br J Obstet Gynaecol 1996, 103:891-897.
22. Liu L, Reeder B, Shuaib A, Mazagri R: Validity of stroke diagnosis on
hospital discharge records in Saskatchewan, Canada: implications for
stroke surveillance. Cerebrovasc Dis 1999, 9:224-230.
23. Rawson NS, DArcy C: Assessing the validity of diagnostic information in
administrative health care utilization data: experience in Saskatchewan.
Pharmacoepidemiol Drug Saf 1998, 7:389-398.
24. National Diabetes Surveillance System: Responding to the challenge of
diabetes in Canada: First report of the National Diabetes Surveillance
System. 2003, Ottawa, ON, Health Canada. 8-31-2010.
25. Hux JE, Ivis F, Flintoft V, Bica A: Diabetes in Ontario: Determination of
prevalence and incidence using a validated adminstrative data
algorithm. Diabetes Care 2002, 25:512-516.
26. Blanchard JF, Ludwig S, Wajda A, Dean H, Anderson K, Kendall O: Incidence
and prevalence of diabetes in Manitoba, 1986-1991. Diabetes Care 1996,
19:807-811.
27. Ikeda M, Ishigaki T, Tamauchi K: Relationship between Brier score and
area under the binormal ROC curve. Computer Meth Prog Biomedicine
2002, 67:187-194.
28. Redelmeier DA, Bloch DA, Hickam DH: Assessing predictive accuracy: how
to compare Brier scores. J Clin Epidemiol 1991, 44:1141-1146.
29. Quan H, Sundararajan V, Halfon P, Fong A, Burnand B, Luthi JC,
Saunders LD, Beck CA, Feasby TE, Ghali WA: Coding algorithms for
defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med
Care 2005, 43:1130-1139.
30. Klabunde CN, Potosky AL, Legler JM, Warren JL: Development of a
comorbidity index using physician claims data. J Clin Epidemiol 2000,
53:1258-1267.
31. ICD9 Enhanced Charlson Index SAS code. [http://mchp-appserv.cpe.
umanitoba.ca/Upload/SAS/ICD9_E_Charlson_dxtype.sas.txt].
32. ICD10 Charlson Index SAS code. [http://mchp-appserv.cpe.umanitoba.ca/
Upload/SAS/ICD10_Charlson_dxtype.sas.txt].
33. ICD9-CM Elixhauser SAS code. [http://mchp-appserv.cpe.umanitoba.ca/
Upload/SAS/_ElixhauserICD9CM.sas.txt].
34. ICD10 Elixhauser SAS code. [http://mchp-appserv.cpe.umanitoba.ca/
Upload/SAS/_ElixhauserICD10.sas.txt].
35. Raphael DPoverty: Income Inequality, and Health in Canada.[http://www.
povertyandhumanrights.org/docs/incomeHealth.pdf].
36. Roos NP, Mustard CA: Variation in health and health care use by
socioeconomic status in Winnipeg, Canada: does the system work well?
Yes and no. Milbank Q 1997, 75:89-111.
37. Reiter JP, Raghunathan TE: The multiple adaptations of multiple
imputation. Journal of the American Statistical Associations 2007,
102:1462-1471.
38. Harrell FE, Lee KL, Mark DB: Multivariable prognostic models: issues in
developing models, evaluating assumptions and adequacy, and
measuring and reducing errors. Stat Med 1996, 15:361-387.
39. DeLong ER, DeLong DM, Clarke-Pearson DL: Comparing the areas under
two or more correlated receiver operating characteristic curves: a
nonparametric approach. Biometrics 1988, 44:837-845.
40. SAS Institute Inc: SAS. Cary, NC.: SAS Institute Inc.; 2007.
41. Farley JF, Harley CR, Devine JW: A comparison of comorbidity
measurements to predict healthcare expenditures. Am J Manag Care
2006, 12:110-119.
42. Schneeweiss S, Wang PS, Avorn J, Maclure M, Levin R, Glynn RJ:
Consistency of performance ranking of comorbidity adjustment scores
in Canadian and U.S. utilization data. J Gen Intern Med 2004, 19:444-450.
43. Stukenborg GJ, Wagner DP, Connors AF Jr: Comparison of the
performance of two comorbidity measures, with and without
information from prior hospitalizations. Med Care 2001, 39:727-739.
44. Southern DA, Quan H, Ghali WA: Comparison of the Elixhauser and
Charlson/Deyo methods of comorbidity measurement in administrative
data. Med Care 2004, 42:355-360.
45. Moore EG, Rosenberg MW, Fitzgibbon SH: Activity limitation and chronic
conditions in Canadas elderly, 1986-2011. Disabil Rehabil 1999,
21:196-210.
46. Radley DC, Gottlieb DJ, Fisher ES, Tosteson AN: Comorbidity risk-
adjustment strategies are comparable among persons with hip fracture.
J Clin Epidemiol 2008, 61:580-587.
Pre-publication history
The pre-publication history for this paper can be accessed here:
http://www.biomedcentral.com/1472-6963/11/146/prepub
doi:10.1186/1472-6963-11-146
Cite this article as: Quail et al.: Comparing comorbidity measures for
predicting mortality and hospitalization in three population-based
cohorts. BMC Health Services Research 2011 11:146.
Submit your next manuscript to BioMed Central
and take full advantage of:
Convenient online submission
Thorough peer review
No space constraints or color figure charges
Immediate publication on acceptance
Inclusion in PubMed, CAS, Scopus and Google Scholar
Research which is freely available for redistribution
Submit your manuscript at
www.biomedcentral.com/submit
Quail et al.BMC Health Services Research 2011, 11:146
http://www.biomedcentral.com/1472-6963/11/146
Page 12 of 12
... Counts and indices can be used as exposures in epidemiological analyses of the relationship with healthcare outcomes. Emergency hospital admission presents a significant burden both to individuals and the healthcare system, and the risk is known to be greater amongst those with higher scores on MLTC counts and indices assessed using routine healthcare data [9]. There is increasing interest in characterising MLTC in cohort studies, including the UK Biobank [10,11]. ...
... Our finding of moderate predictive accuracy for emergency hospital admission using both the count and index approaches is similar to that seen in previous studies using population-based cohorts [9,22,26]. Other work has highlighted how the use of variables beyond LTC can improve prediction of hospitalisation, for example, by the use of more detailed weightings and information on acute conditions in the adjusted morbidity groups tool [26], or blood test results and medications in the QAdmissions model [27]. ...
... Several studies have also incorporated different socioeconomic indicators based on a person's address, which, along with MLTC, are also predictors of hospital admission [26][27][28]. Finally, our finding of similar Harrell's Cstatistic for the prediction of emergency admission in younger and older age groups is also similar to that seen in previous studies [9,26]. ...
Article
Full-text available
Background: Numerous approaches are used to characterise multiple long-term conditions (MLTC), including counts and indices. Few studies have compared approaches within the same dataset. We aimed to characterise MLTC using simple approaches, and compare their prevalence estimates of MLTC and associations with emergency hospital admission in the UK Biobank. Methods: We used baseline data from 495,465 participants (age 38-73 years) to characterise MLTC using four approaches: Charlson index (CI), Byles index (BI), count of 43 conditions (CC) and count of body systems affected (BC). We defined MLTC as more than two conditions using CI, BI and CC, and more than two body systems using BC. We categorised scores (incorporating weightings for the indices) from each approach as 0, 1, 2 and 3+. We used linked hospital episode statistics and performed survival analyses to test associations with an endpoint of emergency hospital admission or death over 5 years. Results: The prevalence of MLTC was 44% (BC), 33% (CC), 6% (BI) and 2% (CI). Higher scores using all approaches were associated with greater outcome rates independent of sex and age group. For example, using CC, compared with score 0, score 2 had 1.95 (95% CI: 1.91, 1.99) and a score of 3+ had 3.12 (95% CI: 3.06, 3.18) times greater outcome rates. The discriminant value of all approaches was modest (C-statistics 0.60-0.63). Conclusions: The counts classified a greater proportion as having MLTC than the indices, highlighting that prevalence estimates of MLTC vary depending on the approach. All approaches had strong statistical associations with emergency hospital admission but a modest ability to identify individuals at risk.
... We used CCI scores to assess general health status; the score for each patient was calculated by summing the weighted scores of 31 comorbid conditions (Quan et al., 2005). Scholars conducting epidemiological studies often employ the CCI to control confounding variables (Quail et al., 2011). ...
... The nearest neighbour matching technique [34] was used to construct the control group of 2,024 individuals without dementia based on propensity scores. Persons with dementia were matched 1:1 with controls using covariates at the time of index date: age group (65-69, 70-74, 75-79, 80-84, and ≥ 85), sex, geographic region, urban versus rural residence, and Charlson Comorbidity Index score based on diagnoses for 17 diseases excluding dementia [35] during the 1-year period prior to index date. The index date used for matching controls was April 1, 2013. ...
Article
Full-text available
Background Rural-urban differences in health service use among persons with prevalent dementia are known. However, the extent of geographic differences in health service use over a long observation period, and prior to diagnosis, have not been sufficiently examined. The purpose of this study was to examine yearly rural-urban differences in the proportion of patients using health services, and the mean number of services, in the 5-year period before and 5-year period after a first diagnosis of dementia. Methods This population-based retrospective cohort study used linked administrative health data from the Canadian province of Saskatchewan to investigate the use of five health services [family physician (FP), specialist physician, hospital admission, all-type prescription drug dispensations, and short-term institutional care admission] each year from April 2008 to March 2019. Persons with dementia included 2,024 adults aged 65 years and older diagnosed from 1 April 2013 to 31 March 2014 (617 rural; 1,407 urban). Matching was performed 1:1 to persons without dementia on age group, sex, rural versus urban residence, geographic region, and comorbidity. Differences between rural and urban persons within the dementia and control cohorts were separately identified using the Z-score test for proportions (p < 0.05) and independent samples t-test for means (p < 0.05). Results Rural compared to urban persons with dementia had a lower average number of FP visits during 1-year and 2-year preindex and between 2-year and 4-year postindex (p < 0.05), a lower likelihood of at least one specialist visit and a lower average number of specialist visits during each year (p < 0.05), and a lower average number of all-type prescription drug dispensations for most of the 10-year study period (p < 0.05). Rural-urban differences were not observed in admission to hospital or short-term institutional care (p > 0.05 each year). Conclusions This study identified important geographic differences in physician services and all-type prescription drugs before and after dementia diagnosis. Health system planners and educators must determine how to use existing resources and technological advances to support care for rural persons living with dementia.
... We are not aware of any study comparing three different CCIs in a diverse surgical patient population, especially not in a pre-procedural anesthesia evaluation center. The subjects of the most comparable studies were population-based analyses [23,26], emergency patients [21], or patients with arthritis [25]. ...
Article
Full-text available
(1) Background: Patients’ comorbidities play an immanent role in perioperative risk assessment. It is unknown how Charlson Comorbidity Indices (CCIs) from different sources compare. (2) Methods: In this prospective observational study, we compared the CCIs of patients derived from patients’ self-reports and from physicians’ assessments with hospital administrative data. (3) Results: The data of 1007 patients was analyzed. Agreement between the CCI from patients’ self-report compared to administrative data was fair (kappa 0.24 [95%CI 0.2–0.28]). Agreement between physicians’ assessment and the administrative data was also fair (kappa 0.28 [95%CI 0.25–0.31]). Physicians’ assessment and patients’ self-report had the best agreement (kappa 0.33 [95%CI 0.30–0.37]). The CCI calculated from the administrative data showed the best predictability for in-hospital mortality (AUROC 0.86 [95%CI 0.68–0.91]), followed by equally good prediction from physicians’ assessment (AUROC 0.80 [95%CI 0.65–0.94]) and patients’ self-report (AUROC 0.80 [95%CI 0.75–0.97]). (4) Conclusions: CCIs derived from patients’ self-report, physicians’ assessments, and administrative data perform equally well in predicting postoperative in-hospital mortality.
... [24][25][26][27][28] Finally, using the R "comorbid" package, ECI scores were collected for all patients using 30 groups of diagnoses that have been shown to reflect comorbidity burden. 20,[29][30][31][32][33] Statistical Analysis All statistics were conducted using RStudio (Version 1.2.5042) with a level of significance of α = 0.05. Discharge weights, defined as weights that make the NRD data nationally representative, were used for all summary statistics. ...
Article
Background and Objective: As incidence of operative spinal pathology continues to grow, so do the rates of lumbar spinal fusion procedures. Comorbidity indices can be used preoperatively to predict potential complications. However, there is a paucity of research defining the optimal comorbidity indices in patients undergoing spinal fusion surgery. We aimed to utilize modeling strategies to evaluate the predictive validity of various comorbidity indices and combinations thereof. Methods: Nationwide Readmissions Database (NRD) from 2016-2019 was used to identify patients underwent spinal fusion. Receiver operating characteristic (ROC) curves were created for relevant complications, including mortality, non-routine discharge, cost in top quartile, length of stay (LOS) in top quartile, and 30-day readmission, using comorbidity indices as predictor values. Results: A total of 750,183 patients were included. Non-routine discharges occurred in 161,077 (21.5%) patients. The adjusted all-payer cost for the procedure was $37,616.97±$27,408.86 (Top Quartile: $45,409.20), and the length of stay was 4.1±4.4 days (Top Quartile: 8.1 days). Models using Frailty+ Elixhauser Comorbidity Index (ECI) as the primary predictor consistently outperformed other models with statistically significant p-values, as determined by comparing their ROCs, for most complications. However, for prediction of mortality, the model using Frailty+ECI was no better than the model using ECI alone (p=0.23), and for prediction of all-payer cost, the ECI model outperformed the models using frailty alone (p<0.0001) and the model using Frailty+ECI (p<0.0001). Conclusions: This investigation is the first to utilize big data and modeling strategies to delineate the relative predictive utility of the ECI and JHACG comorbidity indices for the prognostication of patients undergoing lumbar fusion surgery. With the knowledge gained from our models, spine surgeons, payers, and hospitals may be able to identify vulnerable patients more effectively within their practice who may require a higher degree of resource utilization.
... Recently, the application of advanced analytics to available digital healthcare data has highlighted the precision of models that predict suicidality using structured data (18,19) and unstructured data, applying natural language processing techniques (20). Further, numerous studies have used administrative healthcare data to predict hospitalization in general populations (21)(22)(23) and among individuals with comorbid depressive symptoms (24,25). ...
Article
Full-text available
Objective: To develop and validate algorithms to identify individuals with major depressive disorder (MDD) at elevated risk for suicidality or for an acute care event. Methods: We conducted a retrospective cohort analysis among adults with MDD diagnosed between January 1, 2018 and February 28, 2019. Generalized estimating equation models were developed to predict emergency department (ED) visit, inpatient hospitalization, acute care visit (ED or inpatient), partial-day hospitalization, and suicidality in the year following diagnosis. Outcomes (per 1000 patients per month, PkPPM) were categorized as all-cause, psychiatric, or MDD-specific and combined into composite measures. Predictors included demographics, medical and pharmacy utilization, social determinants of health, and comorbid diagnoses as well as features indicative of clinically relevant changes in psychiatric health. Models were trained on data from 1.7M individuals, with sensitivity, positive predictive value, and area-under-the-curve (AUC) derived from a validation dataset of 0.7M. Results: Event rates were 124.0 PkPPM (any outcome), 21.2 PkPPM (psychiatric utilization), and 7.6 PkPPM (suicidality). Among the composite models, the model predicting suicidality had the highest AUC (0.916) followed by any psychiatric acute care visit (0.891) and all-cause ED visit (0.790). Event-specific models all achieved an AUC >0.87, with the highest AUC noted for partial-day hospitalization (AUC = 0.938). Select predictors of all three outcomes included younger age, Medicaid insurance, past psychiatric ED visits, past suicidal ideation, and alcohol use disorder diagnoses, among others. Conclusions: Analytical models derived from clinically-relevant features identify individuals with MDD at risk for poor outcomes and can be a practical tool for health care organizations to divert high-risk populations into comprehensive care models.
... 1,[20][21][22][23] Lastly, Elixhauser Comorbidity Index (ECI) scores were collected for all patients using the R "comorbid" package, which uses 30 groups of diagnoses to measure their burden of comorbidity. [24][25][26][27][28][29] To quantify the influence of frailty within our patient population, nearest-neighbor propensity score matching for age, sex, ECI, insurance type, median income by zip code, and NRD discharge weighting was implemented between frail and nonfrail patients (Fig. 1). The MatchIt algorithm selects the best-fit parametric models based on the minimum "distance" parameter, which is determined through logistic regression models that minimize the propensity score with no replacement. ...
Article
Full-text available
OBJECTIVE Frailty embodies a state of increased medical vulnerability that is most often secondary to age-associated decline. Recent literature has highlighted the role of frailty and its association with significantly higher rates of morbidity and mortality in patients with CNS neoplasms. There is a paucity of research regarding the effects of frailty as it relates to neurocutaneous disorders, namely, neurofibromatosis type 1 (NF1). In this study, the authors evaluated the role of frailty in patients with NF1 and compared its predictive usefulness against the Elixhauser Comorbidity Index (ECI). METHODS Publicly available 2016–2017 data from the Nationwide Readmissions Database was used to identify patients with a diagnosis of NF1 who underwent neurosurgical resection of an intracranial tumor. Patient frailty was queried using the Johns Hopkins Adjusted Clinical Groups frailty-defining indicator. ECI scores were collected in patients for quantitative measurement of comorbidities. Propensity score matching was performed for age, sex, ECI, insurance type, and median income by zip code, which yielded 60 frail and 60 nonfrail patients. Receiver operating characteristic (ROC) curves were created for complications, including mortality, nonroutine discharge, financial costs, length of stay (LOS), and readmissions while using comorbidity indices as predictor values. The area under the curve (AUC) of each ROC served as a proxy for model performance. RESULTS After propensity matching of the groups, frail patients had an increased mean ± SD hospital cost ($85,441.67 ± $59,201.09) compared with nonfrail patients ($49,321.77 ± $50,705.80) (p = 0.010). Similar trends were also found in LOS between frail (23.1 ± 14.2 days) and nonfrail (10.7 ± 10.5 days) patients (p = 0.0020). For each complication of interest, ROC curves revealed that frailty scores, ECI scores, and a combination of frailty+ECI were similarly accurate predictors of variables (p > 0.05). Frailty+ECI (AUC 0.929) outperformed using only ECI for the variable of increased LOS (AUC 0.833) (p = 0.013). When considering 1-year readmission, frailty (AUC 0.642) was outperformed by both models using ECI (AUC 0.725, p = 0.039) and frailty+ECI (AUC 0.734, p = 0.038). CONCLUSIONS These findings suggest that frailty and ECI are useful in predicting key complications, including mortality, nonroutine discharge, readmission, LOS, and higher costs in NF1 patients undergoing intracranial tumor resection. Consideration of a patient’s frailty status is pertinent to guide appropriate inpatient management as well as resource allocation and discharge planning.
Article
Full-text available
Background: Comorbidity indices such as Charlson’s (CCI) and Elixhauser’s (ECI) are used to adjust the patient’s care, depending on the severity of their condition. However, no study has compared these indices’ ability to predict nursing-sensitive outcomes (NSOs). We compared the performance of CCI and ECI in predicting NSOs in gastric cancer patients’ gastrectomy. Methods: Gastric cancer patients with gastrectomy, aged 19 years or older and admitted between 2015 and 2016, were selected from the Korea Insurance Review and Assessment Service database. We examined the relationships between NSOs and CCI or ECI while adjusting patient and hospital characteristics with logistic regression. Results: The ECI item model was the best in view of the C-statistic and Akaike Information Criterion for total NSO, physiologic/metabolic derangement, and deep vein thrombosis, while the Charlson item model was the best for upper gastrointestinal tract bleeding. For the C-statistic, the ECI item model was the best for in-hospital mortality, CNS complications, shock/cardiac arrest, urinary tract infection, pulmonary failure, and wound infection, while the CCI item model was the best for hospital-acquired pneumonia and pressure ulcers. Conclusions: In predicting 8 of 11 NSOs, the ECI item model outperformed the others. For other NSOs, the best model varies between the ECI item and CCI item model.
Article
Full-text available
Objectives This study investigated patterns in health service usage among older adults with dementia and matched controls over a 10-year span from 5 years before until 5 years after diagnosis. Design Population-based retrospective matched case–control study. Setting Administrative health data of individuals in Saskatchewan, Canada from 1 April 2008 to 31 March 2019. Participants The study included 2024 adults aged 65 years and older living in the community at the time of dementia diagnosis from 1 April 2013 to 31 March 2014, matched 1:1 to individuals without a dementia diagnosis on age group, sex, rural versus urban residence, geographical region and comorbidity. Outcome measures For each 5-year period before and after diagnosis, we examined usage of health services each year including family physician (FP) visits, specialist visits, hospital admissions, all-type prescription drug dispensations and short-term care admissions. We used negative binomial regression to estimate the effect of dementia on yearly average health service utilisation adjusting for sex, age group, rural versus urban residence, geographical region, 1 year prior health service use and comorbidity. Results Adjusted findings demonstrated that 5 years before diagnosis, usage of all health services except hospitalisation was lower among persons with dementia than persons without dementia (all p<0.001). After this point, differences in higher health service usage among persons with dementia compared to without dementia were greatest in the year before and year after diagnosis. In the year before diagnosis, specialist visits were 59.7% higher (p<0.001) and hospitalisations 90.5% higher (p<0.001). In the year after diagnosis, FP visits were 70.0% higher (p<0.001) and all-type drug prescriptions 29.1% higher (p<0.001). Conclusions Findings suggest the year before and year after diagnosis offer multiple opportunities to implement quality supports. FPs are integral to dementia care and require effective resources to properly serve this population.
Article
Full-text available
SUMMARY Background Large health care utilization databases are frequently used to analyze unintended effects of prescription drugs and biologics. Confounders that require detailed information on clinical parameters, lifestyle, or over-the-counter medications are often not measured in such datasets, causing residual confounding bias. Objective This paper provides a systematic approach to sensitivity analyses to investigate the impact of residual confounding in pharmacoepidemiologic studies that use health care utilization databases. Methods Four basic approaches to sensitivity analysis were identified: (1) sensitivity analyses based on an array of informed assumptions; (2) analyses to identify the strength of residual confounding that would be necessary to explain an observed drug-outcome association; (3) external adjustment of a drug-outcome association given additional information on single binary confounders from survey data using algebraic solutions; (4) external adjustment considering the joint dis- tribution of multiple confounders of any distribution from external sources of information using propensity score calibration. Conclusion Sensitivity analyses and external adjustments can improve our understanding of the effects of drugs and biologics in epidemiologic database studies. With the availability of easy-to-apply techniques, sensitivity analyses should be used more frequently, substituting qualitative discussions of residual confounding. Copyright # 2006 John Wiley & Sons, Ltd.
Article
Full-text available
Background Large health care utilization databases are frequently used to analyze unintended effects of prescription drugs and biologics. Confounders that require detailed information on clinical parameters, lifestyle, or over-the-counter medications are often not measured in such datasets, causing residual confounding bias.Objective This paper provides a systematic approach to sensitivity analyses to investigate the impact of residual confounding in pharmacoepidemiologic studies that use health care utilization databases.Methods Four basic approaches to sensitivity analysis were identified: (1) sensitivity analyses based on an array of informed assumptions; (2) analyses to identify the strength of residual confounding that would be necessary to explain an observed drug-outcome association; (3) external adjustment of a drug-outcome association given additional information on single binary confounders from survey data using algebraic solutions; (4) external adjustment considering the joint distribution of multiple confounders of any distribution from external sources of information using propensity score calibration.Conclusion Sensitivity analyses and external adjustments can improve our understanding of the effects of drugs and biologics in epidemiologic database studies. With the availability of easy-to-apply techniques, sensitivity analyses should be used more frequently, substituting qualitative discussions of residual confounding. Copyright © 2006 John Wiley & Sons, Ltd.
Article
The objective of this study was to develop a prospectively applicable method for classifying comorbid conditions which might alter the risk of mortality for use in longitudinal studies. A weighted index that takes into account the number and the seriousness of comorbid disease was developed in a cohort of 559 medical patients. The 1-yr mortality rates for the different scores were: "0", 12% (181); "1-2", 26% (225); "3-4", 52% (71); and "greater than or equal to 5", 85% (82). The index was tested for its ability to predict risk of death from comorbid disease in the second cohort of 685 patients during a 10-yr follow-up. The percent of patients who died of comorbid disease for the different scores were: "0", 8% (588); "1", 25% (54); "2", 48% (25); "greater than or equal to 3", 59% (18). With each increased level of the comorbidity index, there were stepwise increases in the cumulative mortality attributable to comorbid disease (log rank chi 2 = 165; p less than 0.0001). In this longer follow-up, age was also a predictor of mortality (p less than 0.001). The new index performed similarly to a previous system devised by Kaplan and Feinstein. The method of classifying comorbidity provides a simple, readily applicable and valid method of estimating risk of death from comorbid disease for use in longitudinal studies. Further work in larger populations is still required to refine the approach because the number of patients with any given condition in this study was relatively small.
Article
Together with all other developed countries, Canada's population is experiencing a significant increase in the proportion that is elderly. This paper examines basic linkages between individual ageing, the prevalence of various chronic health conditions, functional limitation and the receipt of help in activities of daily living (ADL) and instrumental activities of daily living (IADL) for the Canadian population using recent data from the National Population Health Survey (NPHS) as well as the Health and Activity Limitation Surveys (HALS) and the two General Social Surveys (GSS) with health data. Presented are age- and sex-specific prevalence of chronic conditions and logistic regression is used to assess the impacts of different chronic conditions on the receipt of help for IADL and ADL. The importance of gender and living alone in influencing the receipt of help and also of use of formal agencies is presented using additional data from HALS. Findings from these analyses are also used to project changes i...
Article
Objectives: Implementation of the International Statistical Classification of Disease and Related Health Problems, 10th Revision (ICD-10) coding system presents challenges for using administrative data. Recognizing this, we conducted a multistep process to develop ICD-10 coding algorithms to define Charlson and Elixhauser comorbidities in administrative data and assess the performance of the resulting algorithms. Methods: ICD-10 coding algorithms were developed by "translation" of the ICD-9-CM codes constituting Deyo's (for Charlson comorbidities) and Elixhauser's coding algorithms and by physicians' assessment of the face-validity of selected ICD- 10, codes. The process of carefully developing ICD-10 algorithms also produced modified and enhanced ICD-9-CM coding algorithms for the Charlson and Elixhauser comorbidities. We then used data on in-patients aged 18 years and older in ICD-9-CM and ICD-10 administrative hospital discharge data from a Canadian health region to assess the comorbidity frequencies and mortality prediction achieved by the original ICD-9-CM algorithms, the enhanced ICD-9-CM algorithms, and the new ICD-10 coding algorithms. Results: Among 56,585 patients in the ICD-9-CM data and 58,805 patients in the ICD-10 data, frequencies of the 17 Charlson comorbidities and the 30 Elixhauser comorbidities remained generally similar across algorithms. The new ICD-10 and enhanced ICD9-CM coding algorithms either matched or outperformed the original Deyo and Elixhauser ICD-9-CM coding algorithms in predicting in-hospital mortality. The C-statistic was 0.842 for Deyo's ICD9-CM coding algorithm, 0.860 for the ICD-10 coding algorithm, and 0.859 for the enhanced ICD-9-CM coding algorithm, 0.868 for the original Elixhauser ICD-9-CM coding algorithm, 0.870 for the ICD-10 coding algorithm and 0.878 for the enhanced ICD-9-CM coding algorithm. Conclusions: These newly developed ICD-10 and ICD-9-CM comorbidity coding algorithms produce similar estimates of comorbidity prevalence in administrative data, and may outperform existing ICD-9-CM coding algorithms.
Chapter
IntroductionDescriptionStrengthsWeaknessesParticular applicationsThe futureAcknowledgments
Article
Multivariable regression models are powerful tools that are used frequently in studies of clinical outcomes. These models can use a mixture of categorical and continuous variables and can handle partially observed (censored) responses. However, uncritical application of modelling techniques can result in models that poorly fit the dataset at hand, or, even more likely, inaccurately predict outcomes on new subjects. One must know how to measure qualities of a model's fit in order to avoid poorly fitted or overfitted models. Measurement of predictive accuracy can be difficult for survival time data in the presence of censoring. We discuss an easily interpretable index of predictive discrimination as well as methods for assessing calibration of predicted survival probabilities. Both types of predictive accuracy should be unbiasedly validated using bootstrapping or cross-validation, before using predictions in a new data series. We discuss some of the hazards of poorly fitted and overfitted regression models and present one modelling strategy that avoids many of the problems discussed. The methods described are applicable to all regression models, but are particularly needed for binary, ordinal, and time-to-event outcomes. Methods are illustrated with a survival analysis in prostate cancer using Cox regression.