
Predicting Type 2 Diabetes Based on Polymorphisms From Genome-Wide Association Studies: A Population-Based Study

September 2008
Diabetes 57(11):3122-8


DOI:10.2337/db08-0425

Source
PubMed

License
CC BY-NC-ND 3.0

Authors:

Abbas Dehghan

Imperial College London


Prediction of type 2 diabetes based on genetic testing might improve identification of high-risk subjects. Genome-wide association (GWA) studies identified multiple new genetic variants that associate with type 2 diabetes. The predictive value of genetic testing for prediction of type 2 diabetes in the general population is unclear. We investigated 18 polymorphisms from recent GWA studies on type 2 diabetes in the Rotterdam Study, a prospective, population-based study among homogeneous Caucasian individuals of 55 years and older (genotyped subjects, n = 6,544; prevalent cases, n = 686; incident cases during follow-up, n = 601; mean follow-up 10.6 years). The predictive value of these polymorphisms was examined alone and in addition to clinical characteristics using logistic and Cox regression analyses. The discriminative accuracy of the prediction models was assessed by the area under the receiver operating characteristic curves (AUCs). Of the 18 polymorphisms, the ADAMTS9, CDKAL1, CDKN2A/B-rs1412829, FTO, IGF2BP2, JAZF1, SLC30A8, TCF7L2, and WFS1 variants were associated with type 2 diabetes risk in our population. The AUC was 0.60 (95% CI 0.57-0.63) for prediction based on the genetic polymorphisms; 0.66 (0.63-0.68) for age, sex, and BMI; and 0.68 (0.66-0.71) for the genetic polymorphisms and clinical characteristics combined. We showed that 9 of 18 well-established genetic risk variants were associated with type 2 diabetes in a population-based study. Combining genetic variants has low predictive value for future type 2 diabetes at a population-based level. The genetic polymorphisms only marginally improved the prediction of type 2 diabetes beyond clinical characteristics.


Predicting Type 2 Diabetes Based on Polymorphisms

From Genome-Wide Association Studies

A Population-Based Study

Mandy van Hoek,1,2 Abbas Dehghan, Jacqueline C.M. Witteman, Cornelia M. van Duijn,2,3 André G. Uitterlinden,1,2 Ben A. Oostra,2,3 Albert Hofman, Eric J.G. Sijbrands, and A. Cecile J.W. Janssens

OBJECTIVE—Prediction of type 2 diabetes based on genetic

testing might improve identification of high-risk subjects.

Genome-wide association (GWA) studies identified multiple new

genetic variants that associate with type 2 diabetes. The

predictive value of genetic testing for prediction of type 2 diabetes in

the general population is unclear.

RESEARCH DESIGN AND METHODS—We investigated 18

polymorphisms from recent GWA studies on type 2 diabetes in

the Rotterdam Study, a prospective, population-based study

among homogeneous Caucasian individuals of 55 years and older

(genotyped subjects, n = 6,544; prevalent cases, n = 686;

incident cases during follow-up, n = 601; mean follow-up 10.6

years). The predictive value of these polymorphisms was exam-

ined alone and in addition to clinical characteristics using logistic

and Cox regression analyses. The discriminative accuracy of the

prediction models was assessed by the area under the receiver

operating characteristic curves (AUCs).

RESULTS—Of the 18 polymorphisms, the ADAMTS9, CDKAL1,

CDKN2A/B-rs1412829, FTO, IGF2BP2, JAZF1, SLC30A8, TCF7L2,

and WFS1 variants were associated with type 2 diabetes risk in

our population. The AUC was 0.60 (95% CI 0.57–0.63) for

prediction based on the genetic polymorphisms; 0.66 (0.63–0.68)

for age, sex, and BMI; and 0.68 (0.66–0.71) for the genetic

polymorphisms and clinical characteristics combined.

CONCLUSIONS—We showed that 9 of 18 well-established

genetic risk variants were associated with type 2 diabetes in a

population-based study. Combining genetic variants has low

predictive value for future type 2 diabetes at a population-based

level. The genetic polymorphisms only marginally improved the

prediction of type 2 diabetes beyond clinical characteristics.

Diabetes 57:3122–3128, 2008

Type 2 diabetes is a multifactorial disease caused

by a complex interplay of multiple genetic vari-

ants and many environmental factors. With the

recent genome-wide association (GWA) studies,

the number of replicated common genetic variants asso-

ciated with type 2 diabetes has rapidly increased (1–7). A

total of 18 polymorphisms have been firmly replicated

(1–7). It is unclear whether and how the currently known

genetic variants can be used in practice, because the

combined effect of these variants has not been investi-

gated in a population-based study. Particularly, because

most GWA studies were enriched for patients with a

positive family history and early onset of the disease,

association of these variants to type 2 diabetes risk in the

general population, including elderly individuals, remains

to be determined.

Because complex diseases are caused by multiple ge-

netic variants, predictive testing based on a single genetic

marker will be of limited value (8,9). Simulation studies

suggest that the predictive value could be improved by

combining multiple common low-risk variants (10–13).

Several empirical studies on the predictive value of genetic

polymorphisms have been conducted before the recent

GWA data were available (14–16). In a case-control study,

Weedon et al. (16) showed that combining the information

of three polymorphisms improved disease prediction, al-

beit to a limited extent. Vaxillaire et al. (15) investigated 19

polymorphisms and found that the predictive value was

low compared with clinical characteristics.

Genetic variants associated with risk of type 2 diabetes

could potentially be useful for the prediction, prevention,

and early treatment of the disease. We investigated

whether combining the currently known and well-repli-

cated genetic variants predicts type 2 diabetes in the

Rotterdam Study, a prospective population-based fol-

low-up study. We investigated whether these genetic vari-

ants improve prediction beyond clinical characteristics.

RESEARCH DESIGN AND METHODS

The design and data collection of the Rotterdam Study have been described

previously (17). In short, the Rotterdam Study is a prospective, population-

based, cohort study among 7,983 inhabitants of a Rotterdam suburb, designed

to investigate determinants of chronic diseases. Participants were aged 55

years and older. Baseline examinations took place from 1990 until 1993.

Follow-up examinations were performed in 1993–1994, 1997–1999, and 2002–

2004. Between these exams, continuous surveillance on major disease out-

comes was conducted. Information on vital status was obtained from

municipal health authorities. The medical ethics committee of the Erasmus

Medical Center approved the study protocol, and all participants gave their

written informed consent.

From the Department of Internal Medicine, Erasmus University Medical Center, Rotterdam, the Netherlands; the Department of Epidemiology and Biostatistics, Erasmus University Medical Center, Rotterdam, the Netherlands; the Department of Clinical Genetics, Genetic Epidemiology Unit, Erasmus University Medical Center, Rotterdam, the Netherlands; and the Department of Public Health, Erasmus University Medical Center, Rotterdam, the Netherlands.

Corresponding author: Dr. Eric J.G. Sijbrands, e.sijbrands@erasmusmc.nl.

Received 27 March 2008 and accepted 1 August 2008.

Published ahead of print at http://diabetes.diabetesjournals.org on 11 August 2008. DOI: 10.2337/db08-0425.

long as the work is properly cited, the use is educational and not for profit, and the work is not altered. See http://creativecommons.org/licenses/by-nc-nd/3.0/ for details.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

ORIGINAL ARTICLE

3122 DIABETES, VOL. 57, NOVEMBER 2008

Data collection. At baseline, prevalent cases of diabetes were diagnosed by

a nonfasting or postload glucose level (after oral glucose tolerance testing)

≥11.1 mmol/l and/or treatment with antidiabetic medication (oral medication

or insulin) and the diagnosis of diabetes as registered by a general practitioner.

During follow-up, diabetes was diagnosed at fasting plasma glucose levels

≥7.0 mmol/l and/or a nonfasting plasma glucose level ≥11.0 mmol/l and/or

treatment with antidiabetic medication (oral medication or insulin) (18,19)

and the diagnosis of diabetes as registered by a general practitioner. Patients

registered in general practitioners’ records as type 1 diabetic were excluded

from the present analyses (n = 15).

The CDKAL1 rs7754840, CDKN2AB rs10811661, FTO rs8050136, HHEX

rs1111875, IGF2BP2 rs4402960, KCNJ11 rs5219, PPARG rs1801282, SLC30A8

rs13266634, and TCF7L2 rs7903146 polymorphisms were genotyped by means

of TaqMan allelic discrimination assays. DNA material was available for 6,544

of the 7,983 participants for the TaqMan analyses. The assays were designed

and optimized by Applied Biosystems (Foster City, CA; http://store.appliedbiosystems.com).

Genotypes were determined in 2-ng genomic DNA.

Reactions were performed on the TaqMan Prism 7900HT platform. The

analyses were performed as described previously (20). Assays were run on 90

blood bank samples to test for adequate cluster separation. A total of 325

samples were genotyped in duplicate. Success rates for TaqMan genotyping

ranged from 93.2 to 96.7%, with the exception of 86.1% for IGF2BP2 and 87.4%

for HHEX. TaqMan duplicate error rates for the HHEX and IGF2BP2

polymorphisms were 1.2 and 0.6%, respectively.

The ADAMTS9 rs4411878 (proxy for rs4607103, r² = 0.95), CDC123-CAMK1D rs11257622 (proxy for rs12779790, r² = 0.83), CDKN2A/B rs1412829 (proxy for rs564398, r² = 0.97), JAZF1 rs1635852 (proxy for rs864745, r² = 0.97), NOTCH2 rs1493694 (proxy for rs10923931, r² = 1.0), TCF2 rs4430796, THADA rs7578597, TSPAN8-LGR5 rs1353362 (proxy for rs7961581, r² = 0.96), and WFS1 rs10012946 (proxy for rs10010131, r² = 1.0) genotypes were derived

from the genotype data of the version 3 Illumina Infinium II HumanHap550

SNP chip array. From a total of 6,449 subjects, there was sufficient DNA

material for the array. Samples with a call rate <97.5% (n = 209), excess

autosomal heterozygosity >0.336 (approximate false discovery rate [FDR]

<0.1% [n = 21]), or mismatch between called and phenotypic sex (n = 36) or

with outliers identified by the identity-by-state (IBS) clustering analysis with

>3 SDs from population mean (n = 102) or IBS probabilities >97% (n = 129)

were excluded from the analysis; in total, 5,974 samples remained for

analyses.

The availability of Illumina 550K array data enabled us to compare

genotype calls between TaqMan and Illumina data for the FTO, HHEX,

IGF2BP2, SLC30A8, TCF7L2, and CDKAL1 polymorphisms as well. Concor-

dance rates ranged between 98.6 and 99.7%. To increase success rates, we

merged the data and deleted pairs that were not concordant. The success rates

for the polymorphisms increased to 98.4–99.4%.

Statistical analyses. Associations of individual polymorphisms were inves-

tigated using Cox proportional hazards models for the prediction of incident

type 2 diabetes and logistic regression analyses for the prediction of prevalent

and incident type 2 diabetes together. Analyses were performed crude and

adjusted for age, sex, and BMI. We also applied Cox proportional hazards

models and logistic regression analyses to investigate the combined predictive

value of 1) the 18 polymorphisms (all polymorphisms included as separate

independent categorical variables); 2) the risk allele score based on the 18

polymorphisms (assuming all effect sizes of equal weight); 3) age, sex, and

BMI; and 4) age, sex, and BMI and all polymorphisms on type 2 diabetes risk.

The risk allele score was calculated by summing up the number of risk alleles

for each participant with complete genotype information, with risk alleles

being the alleles associated with increased risk of type 2 diabetes (1–5). The

risk allele score assumes that all genetic variants have the same effect, i.e.,

minor differences in effect size are ignored. The association between the risk

allele score and the predicted probabilities was quantified by the Spearman

correlation coefficient.

The discriminative accuracy was evaluated by the area under the receiver

operating characteristic (ROC) curves (AUCs). The AUC can range from 0.5

(total lack of discrimination) to 1.0 (perfect discrimination). AUCs were

calculated for the predicted risks of the logistic regression model, the risk

allele score, and the linear predictor values of the Cox proportional hazards

models. AUCs were compared with Analyze-it version 2.11 (www.analyze-

it.com), which uses the method of Hanley and McNeil for ROC curve analyses

(21,22).
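As a rough illustration of the AUC and its uncertainty, the sketch below computes the AUC as the probability that a randomly chosen case has a higher predicted risk than a randomly chosen noncase, and the standard error from the closed-form approximation published by Hanley and McNeil in 1982. It is not a reproduction of the Analyze-it software, and it does not implement the comparison of two correlated AUCs, which additionally requires the correlation between the two sets of predictions. Function names are assumptions of this sketch.

```python
import math
from typing import Sequence


def auc(case_scores: Sequence[float], control_scores: Sequence[float]) -> float:
    """Probability that a case scores higher than a control (ties count 1/2)."""
    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


def hanley_mcneil_se(a: float, n_cases: int, n_controls: int) -> float:
    """Standard error of an AUC (Hanley and McNeil 1982 approximation)."""
    q1 = a / (2.0 - a)            # two cases both outrank one control
    q2 = 2.0 * a * a / (1.0 + a)  # one case outranks two controls
    variance = (a * (1.0 - a)
                + (n_cases - 1) * (q1 - a * a)
                + (n_controls - 1) * (q2 - a * a)) / (n_cases * n_controls)
    return math.sqrt(variance)


def confidence_interval(a: float, se: float, z: float = 1.96):
    """Normal-approximation 95% CI, truncated to the [0, 1] range."""
    return max(0.0, a - z * se), min(1.0, a + z * se)


# Toy example with made-up predicted risks for 3 cases and 4 noncases.
a = auc([0.30, 0.22, 0.15], [0.10, 0.22, 0.08, 0.12])
se = hanley_mcneil_se(a, 3, 4)
print(a, se, confidence_interval(a, se))
```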

The analyses were repeated for subgroups for age (cutoff 70 years of age)

and BMI (cutoff 26 kg/m²). All analyses were performed with SPSS software

version 12.0.1.

Simulation analyses. A simulation analysis was performed to quantify the

expected AUC for prediction of incident type 2 diabetes based on the odds

ratios (ORs) of the investigated polymorphisms in literature (OR 1.09 for

ADAMTS9, 1.11 for CDC123-CAMK1D, 1.12 for CDKAL1, 1.12 for CDKN2A/B

rs1412829, 1.20 for CDKN2A/B rs10811661, 1.17 for FTO, 1.13 for HHEX,

1.14 for IGF2BP2, 1.10 for JAZF1, 1.14 for KCNJ11, 1.13 for NOTCH2, 1.14 for

PPARG, 1.12 for SLC30A8, 1.10 for TCF2, 1.37 for TCF7L2, 1.15 for THADA,

1.09 for TSPAN8, and 1.11 for WFS1) (1,2,4,6,7,23). The method of simulation

has been described in detail previously (10). In brief, we simulated genetic

profiles and type 2 diabetes status for 100,000 individuals, of whom 10.3% were

supposed to have incident type 2 diabetes, as observed in our population.

Genetic profiles were constructed from the polymorphisms based on

observed allele frequencies. Under the assumption that each polymorphism

has two alleles and that allele proportions were in Hardy-Weinberg

equilibrium, genotype frequencies for the single polymorphisms were

calculated. Assuming that the polymorphisms segregate independently, for

each individual, a genotype was randomly assigned. Disease risks associated

with the genetic profiles were modeled using Bayes’ theorem. The

likelihood ratio of the genetic profile was calculated by multiplying the

likelihood ratios of the single genotypes. The ORs of the heterozygous

genotypes compared with the homozygous nonrisk genotypes were derived

from the three large GWA studies (1,2,4). Finally, disease status was

modeled by a procedure that compares disease risk of each subject to a

randomly drawn value between 0 and 1 from a uniform distribution. This

procedure ensures that for each genomic profile, the percentage of people

who will develop the disease equals the disease risk associated with that

profile, when the subgroup of individuals with that profile is sufficiently

large. The simulation was repeated 10 times to obtain a robust estimate of

the AUC. The AUC was obtained as the c-statistic by the function somers2,

which is available in the Hmisc library of R software (version 2.5.1;

www.R-project.org, accessed December 2007).
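The simulation steps above can be sketched as follows. This is a simplified illustration, not the authors' R code. Several choices are assumptions of the sketch: a multiplicative per-allele OR (so the homozygous risk genotype has OR squared), likelihood ratios approximated by dividing each genotype's OR by the frequency-weighted mean OR, a smaller default sample than the 100,000 individuals used in the study, and all function names. Allele frequencies are not listed in this text, so the caller must supply them.

```python
import random
from typing import List, Sequence, Tuple


def genotype_freqs(p: float) -> List[float]:
    """Hardy-Weinberg frequencies for 0, 1, and 2 copies of the risk allele."""
    q = 1.0 - p
    return [q * q, 2.0 * p * q, p * p]


def c_statistic(scores: Sequence[float], status: Sequence[bool]) -> float:
    """AUC from the rank sum of the cases, using mean ranks for ties."""
    n1 = sum(1 for s in status if s)
    n0 = len(status) - n1
    if n1 == 0 or n0 == 0:
        raise ValueError("need at least one case and one noncase")
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    rank_sum_cases = 0.0
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        rank_sum_cases += mean_rank * sum(1 for k in range(i, j + 1) if status[order[k]])
        i = j + 1
    return (rank_sum_cases - n1 * (n1 + 1) / 2) / (n1 * n0)


def simulate_auc(variants: Sequence[Tuple[float, float]],
                 prevalence: float = 0.103,
                 n: int = 20000,
                 seed: int = 1) -> float:
    """variants: (risk allele frequency, per-allele OR) for each polymorphism."""
    rng = random.Random(seed)
    prior_odds = prevalence / (1.0 - prevalence)
    tables = []
    for p, odds_ratio in variants:
        freqs = genotype_freqs(p)
        rel = [odds_ratio ** k for k in range(3)]      # assumed multiplicative model
        mean_rel = sum(f * r for f, r in zip(freqs, rel))
        tables.append((freqs, [r / mean_rel for r in rel]))  # approximate LRs
    risks, status = [], []
    for _ in range(n):
        odds = prior_odds
        for freqs, lrs in tables:
            # Independent segregation: draw each genotype separately.
            g = rng.choices((0, 1, 2), weights=freqs)[0]
            odds *= lrs[g]                             # Bayes: posterior odds
        risk = odds / (1.0 + odds)
        risks.append(risk)
        status.append(rng.random() < risk)             # uniform draw decides status
    return c_statistic(risks, status)


# Hypothetical input: one variant with OR 1.37 and an assumed allele frequency of 0.30.
print(simulate_auc([(0.30, 1.37)]))
```

Repeating the call with different seeds and averaging the results mirrors the repetition used in the study to stabilize the estimate.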

RESULTS

Baseline characteristics. A total of 6,544 participants

were successfully genotyped for at least one polymor-

phism. Complete genotype information on all polymor-

phisms was present in 5,297 subjects (of whom 490 were

incident cases and 545 were prevalent cases). Age (P =

0.11), sex (P = 0.22), BMI (P = 0.30), and presence of type

2 diabetes (P = 0.20) were not significantly different

between successfully genotyped individuals and individuals

with one or more missing genotypes. General characteris-

tics of the population are shown in Table 1. Individuals

with type 2 diabetes had higher BMI, higher waist circum-

ference, more often hypertension, and lower HDL choles-

terol compared with individuals without type 2 diabetes.

All polymorphisms were in Hardy-Weinberg equilibrium in

the total population and in individuals without type 2

diabetes (highest χ² = 3.58, 2 d.f., P = 0.06 for PPARG

rs1801282).
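A Hardy-Weinberg check of this kind can be sketched as a goodness-of-fit χ² test on the three genotype counts. The sketch below is illustrative and is not the authors' procedure: the function names are assumptions, and the P value helper uses 1 degree of freedom, the conventional choice for a biallelic marker when the allele frequency is estimated from the same data, whereas the text above reports 2 d.f.

```python
import math


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> float:
    """Chi-square statistic comparing observed genotype counts with HWE expectations."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele a
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def p_value_1df(chi_square: float) -> float:
    """Upper-tail probability of a chi-square distribution with 1 d.f."""
    return math.erfc(math.sqrt(chi_square / 2.0))


# Genotype counts for PPARG rs1801282 in all genotyped participants (Table 2).
stat = hwe_chi_square(4888, 1322, 108)
print(stat, p_value_1df(stat))
```

Table 2 gives counts for all genotyped participants, so the printed statistic need not match the value quoted above if that value refers to individuals without type 2 diabetes.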

Individual effects of clinical characteristics and poly-

morphisms on type 2 diabetes risk. Table 2 shows the

effect of each polymorphism on type 2 diabetes risk

(prevalent and incident type 2 diabetes). The minor alleles

of the CDKAL1, FTO, IGF2BP2, and TCF7L2 variants

were associated with increased risk of type 2 diabetes. The

ADAMTS9, CDKN2A/B rs1412829, JAZF1, SLC30A8, and

WFS1 minor alleles decreased type 2 diabetes risk. Cox

regression analyses restricted to incident type 2 diabetes

gave similar results. Adjustment for age, sex, and

BMI did not materially change the results, except for the

FTO polymorphism, for which the effect on type 2 diabetes

risk disappeared after adjustment for BMI.

In a Cox regression analysis, age (hazard ratio [HR] 1.02

[95% CI 1.01–1.03]), sex (0.67 [0.57– 0.79]), and BMI (1.14

[1.12–1.16]) affected prospective type 2 diabetes risk.

Risk allele score and risk of type 2 diabetes. Figure 1

shows the ORs associated with increasing risk allele

scores compared with the reference group (0–12 risk

alleles) in a logistic regression model. Individuals carrying

21 risk alleles or more (14.4% of the population) had

M. VAN HOEK AND ASSOCIATES


significantly higher type 2 diabetes risk (7.2% of the

population carried 21 alleles, OR 1.90 [1.07–3.40]; 4.4% had

22 alleles, 2.11 [1.15–3.86]; 2.0% had 23 alleles, 2.11

[1.07–4.18]; and 1.8% had 24–32 alleles, 2.10 [1.04–4.22])

compared with the reference group of 0–12 alleles (n = 109;

2.0% of the population). In a Cox regression analysis on

incident cases of diabetes, this figure was similar (data not

shown). The per-allele HR was 1.04 (95% CI 1.02–1.07)

(P = 0.001).

Risk allele score. Figure 2 shows the predicted type 2

diabetes risks from the logistic regression model that

included all 18 genetic polymorphisms in relation to the

risk allele score. The Spearman correlation coefficient was

0.60, indicating a wide range of predicted risks for each

value of the risk allele score. When analyzing only incident

type 2 diabetes, this figure was similar (Spearman rho

0.59; figure not shown).

Analyses of discriminative accuracy. Figure 3 shows

the ROC curves for the prediction of incident type 2

diabetes based on the genetic polymorphisms, clinical

characteristics, and both. The AUC was 0.60 (95% CI

0.57–0.63) for prediction based on the genetic polymorphisms;

0.66 (0.63–0.68) for age, sex, and BMI; and 0.68

(0.66–0.71) for the genetic polymorphisms and clinical

characteristics combined. The difference between the

AUCs for clinical characteristics with and without the

genetic polymorphisms was significant (P < 0.0001).

The AUC of the risk allele score was 0.56 (0.53–0.59).

When including only the significantly associated genetic

variants of the current study (ADAMTS9, CDKAL1,

CDKN2A/B rs1412829, FTO, IGF2BP2, JAZF1, TCF7L2, SLC30A8,

and WFS1), the AUC was 0.58 (0.56–0.61). Combining

the KCNJ11, PPARG, and TCF7L2 variants resulted in

an AUC of 0.53 (0.50–0.55). Based on the simulation

study, the expected AUC for all genetic polymorphisms

using the effect sizes described in literature was 0.57.

When combining incident and prevalent type 2 diabetes

cases, the AUC of all polymorphisms was 0.60 (0.58–0.62).

In subgroup analyses, the AUC of all polymorphisms

was 0.62 (95% CI 0.58–0.67) in the low BMI group and 0.59

(0.56–0.62) in the high BMI subgroup. The AUC was 0.61

(0.59–0.65) in the low age-group and 0.63 (0.58–0.67) in

the high age-group.

DISCUSSION

We investigated the predictive value of 18 type 2 diabetes

risk polymorphisms from the recent GWA studies for the

prediction of type 2 diabetes in a large, prospective,

population-based study of elderly individuals. Our study

shows that combining information of these 18 well-repli-

cated variants has relatively low discriminative accuracy

for the prediction of type 2 diabetes in a general population

(AUC 0.60). The 18 genetic variants identified to date

did not substantially improve the discriminative accuracy

of disease prediction based on clinical characteristics.

In line with the results of the GWA studies (1–5,24),

ADAMTS9, CDKAL1, CDKN2A/B, FTO, IGF2BP2, JAZF1,

SLC30A8, TCF7L2, and WFS1 were associated with type 2

diabetes risk in our population. Some of these effects were

slightly stronger than the effects described previously. In

contrast to most previous studies, the KCNJ11 polymor-

phism was not associated. The ORs of the other polymor-

phisms were similar to previously published results but

not statistically significant, which may be explained by the

smaller number of type 2 diabetes cases and therefore

smaller power to reach significance in our prospective

study.

A risk allele score, obtained by counting the number of

risk alleles, can be used as a simple proxy of the combined

effect of multiple polymorphisms. Risk allele scores ignore

the effect sizes of the individual genetic variants, but a

previous simulation study has shown that this has limited

impact on the discriminative accuracy (11). In contrast, we

found a wide range of predicted risks for each value of the

risk allele score, suggesting that differences in the variants

carried result in substantial differences in actual disease

risks. The risk allele score was associated with modest

increases in disease risk, and the AUC for prediction was

0.56. When taking effect size differences between polymor-

phisms into account, the AUC was 0.60, showing that the

risk allele score predicted less accurately. Other empirical

studies have not investigated the differences in diagnostic

accuracy between simple risk allele scores and predicted

risks obtained from regression models (14–16,25).

The discriminative accuracy of predictive genetic test-

ing in complex diseases depends on the number of genes

involved, the risk allele frequencies, and the size of the

associated risks (10). The maximum discriminative accuracy

is determined by the heritability of the disease (10).

TABLE 1

General characteristics of genotyped participants at study baseline by type 2 diabetes status

                                  All participants  Subjects without  Incident case subjects  Prevalent cases with
                                                    type 2 diabetes   with type 2 diabetes    type 2 diabetes
n                                 6,544             5,221             601                     686
Age (years)                       69.5 ± 0.11       69.0 ± 0.13       68.2 ± 0.32*            73.6 ± 0.35†
Men (%)                           40.7              40.4              44.3                    39.8
BMI (kg/m²)                       26.3 ± 0.05       26.0 ± 0.05       28.0 ± 0.15†            26.8 ± 0.15†
Waist circumference (cm)          90.5 ± 0.14       89.6 ± 0.16       94.7 ± 0.45†            93.8 ± 0.46†
Systolic blood pressure (mmHg)    139.3 ± 0.3       137.9 ± 0.31      143.5 ± 0.85†           146.8 ± 0.93†
Diastolic blood pressure (mmHg)   73.7 ± 0.1        73.6 ± 0.2        75.5 ± 0.5†             73.1 ± 0.5
Hypertension (%)                  33.4              30.5              46.9†                   52.9†
Total cholesterol (mmol/l)        6.6 ± 0.02        6.6 ± 0.02        6.6 ± 0.05              6.5 ± 0.05
HDL cholesterol (mmol/l)          1.34 ± 0.005      1.37 ± 0.005      1.25 ± 0.01†            1.25 ± 0.01†
Current smoking (%)               22.1              22.5              25.5                    22.1
Former smoking (%)                40.7              42.0              42.8                    39.0

Continuous variables are expressed as means ± SE. Type 2 diabetes status was missing for 36 individuals. *P < 0.05, †P < 0.001 for comparison with subjects without type 2 diabetes. Comparisons between groups were performed using ANOVA for continuous variables and χ² test for categorical variables.

GENETIC PREDICTION OF TYPE 2 DIABETES

Based on previously published effect sizes for the 18

polymorphisms, we predicted that the AUC would be 0.57;

and based on our empirical data, we found that it was 0.60.

This difference is explained by the fact that some polymor-

phisms had a slightly larger effect in our population than

TABLE 2

Individual effects of nine polymorphisms on type 2 diabetes risk with and without adjustment for covariates

Columns, in order: gene variant; risk allele; genotype (n); percent case subjects, prevalent; percent case subjects, incident; percent control subjects; all case subjects, OR (95% CI); incident type 2 diabetes, crude HR (95% CI); incident type 2 diabetes, adjusted HR (95% CI)*

ADAMTS9 C CC (3,450) 61.9 61.1 57.4 1.0 1.0 1.0

rs4411878† CT (2,159) 32.5 34.7 37.1 0.84 (0.74–0.97) 0.88 (0.73–1.05) 0.87 (0.73–1.04)

TT (321) 5.6 4.2 5.5 0.84 (0.62–1.13) 0.68 (0.45–1.04) 0.68 (0.45–1.04)

CDC123/ C TT (3,952) 64.4 67.4 67.9 1.0 1.0 1.0

CAMK1D TC (1,758) 31.5 28.8 29.6 1.04 (0.90–1.20) 0.98 (0.82–1.19) 0.98 (0.81–1.18)

rs11257622‡ CC (213) 4.2 3.8 3.5 1.18 (0.84–1.64) 1.07 (0.69–1.67) 1.05 (0.68–1.63)

CDKAL1 C GG (3,097) 47.1 45.4 48.7 1.0 1.0 1.0

rs7754840 GC (2,692) 41.1 43.0 41.9 1.05 (0.93–1.20) 1.09 (0.92–1.29) 1.10 (0.93–1.31)

CC (633) 11.8 11.6 9.4 1.31 (1.07–1.61) 1.31 (1.001–1.72) 1.38 (1.06–1.80)

CDKN2A/B A AA (1,915) 31.3 35.2 32.1 1.0 1.0 1.0

rs1412829§ AG (2,910) 47.7 50.4 49.2 0.97 (0.84–1.12) 0.94 (0.78–1.13) 0.89 (0.74–1.08)

GG (1,098) 21.0 14.5 18.7 0.93 (0.77–1.12) 0.72 (0.56–0.94) 0.70 (0.54–0.92)

CDKN2A/B T TT (4,131) 72.0 63.1 66.0 1.0 1.0 1.0

rs10811661 TC (1,865) 24.7 34.5 30.1 0.95 (0.83–1.09) 1.17 (0.99–1.39) 1.13 (0.95–1.34)

CC (232) 3.2 2.4 3.9 0.70 (0.49–1.02) 0.64 (0.37–1.08) 0.68 (0.40–1.17)

FTO A CC (2,526) 38.3 38.4 39.8 1.0 1.0 1.0

rs8050136 CA (2,944) 44.0 46.3 46.3 1.01 (0.88–1.16) 1.03 (0.86–1.23) 1.00 (0.84–1.20)

AA (927) 17.7 15.3 14.0 1.23 (1.02–1.47) 1.12 (0.88–1.43) 1.06 (0.83–1.36)

HHEX C CC (2,235) 36.1 36.3 34.8 1.0 1.0 1.0

rs1111875 CT (3,097) 50.1 46.1 48.4 0.96 (0.84–1.10) 0.91 (0.76–1.09) 0.93 (0.78–1.12)

TT (1,052) 13.7 17.6 16.8 0.89 (0.74–1.07) 1.01 (0.80–1.27) 1.01 (0.79–1.28)

IGF2BP2 T GG (3,101) 46.9 47.7 49.4 1.0 1.0 1.0

rs4402960 GT (2,650) 41.8 41.5 42.0 1.04 (0.91–1.18) 1.01 (0.85–1.19) 1.01 (0.85–1.20)

TT (575) 11.3 10.8 8.6 1.35 (1.09–1.66) 1.24 (0.95–1.63) 1.23 (0.94–1.62)

JAZF1 T TT (1,646) 31.0 29.8 27.1 1.0 1.0 1.0

rs1635852¶ TC (2,927) 46.5 48.6 49.8 0.85 (0.73–0.98) 0.88 (0.73–1.07) 0.85 (0.70–1.04)

CC (1,357) 22.5 21.6 23.1 0.85 (0.71–1.02) 0.86 (0.68–1.09) 0.84 (0.66–1.06)

KCNJ11 G AA (2,394) 37.7 39.9 39.2 1.0 1.0 1.0

rs5219 AG (2,925) 48.3 46.6 47.8 1.00 (0.88–1.15) 0.97 (0.81–1.16) 0.97 (0.81–1.16)

GG (807) 14.0 13.5 13.0 1.07 (0.88–1.30) 1.02 (0.79–1.32) 1.02 (0.79–1.32)

NOTCH2 T CC (4,670) 77.8 79.0 78.8 1.0 1.0 1.0

rs1493692‖ CT (1,168) 20.7 19.7 19.6 1.04 (0.89–1.22) 1.03 (0.84–1.27) 1.07 (0.86–1.32)

TT (92) 1.4 1.3 1.6 0.86 (0.50–1.49) 0.75 (0.35–1.57) 0.74 (0.35–1.56)

PPARG C CC (4,888) 79.7 76.9 77.1 1.0 1.0 1.0

rs1801282 CG (1,322) 19.1 21.7 21.1 0.95 (0.81–1.10) 1.03 (0.84–1.25) 0.99 (0.81–1.21)

GG (108) 1.2 1.4 1.8 0.70 (0.41–1.20) 0.76 (0.38–1.54) 0.66 (0.33–1.34)

SLC30A8 C CC (3,176) 49.8 54.1 48.8 1.0 1.0 1.0

rs13266634 CT (2,709) 43.7 38.2 42.4 0.91 (0.80–1.04) 0.82 (0.69–0.97) 0.84 (0.70–0.99)

TT (546) 6.6 7.7 8.8 0.75 (0.59–0.96) 0.80 (0.59–1.09) 0.84 (0.62–1.14)

TCF2 G AA (1,560) 25.9 24.1 26.8 1.0 1.0 1.0

rs4430796 AG (2,924) 48.1 50.7 49.6 1.06 (0.91–1.24) 1.11 (0.90–1.36) 1.14 (0.92–1.40)

GG (1,417) 26.1 25.2 23.6 1.16 (0.97–1.39) 1.16 (0.91–1.47) 1.16 (0.91–1.48)

TCF7L2 T CC (3,292) 43.7 47.5 52.6 1.0 1.0 1.0

rs7903146 CT (2,587) 42.4 42.3 39.7 1.23 (1.08–1.41) 1.19 (1.01–1.41) 1.20 (1.01–1.42)

TT (554) 13.9 10.2 7.7 1.82 (1.48–2.24) 1.48 (1.12–1.95) 1.62 (1.22–2.14)

THADA T TT (4,608) 79.9 78.8 77.3 1.0 1.0 1.0

rs7578597 TC (1,227) 19.0 19.0 21.1 0.88 (0.75–1.03) 0.91 (0.73–1.12) 0.90 (0.72–1.12)

CC (94) 1.1 2.2 1.6 1.01 (0.60–1.67) 1.32 (0.74–2.34) 1.51 (0.85–2.67)

TSPAN8/ C TT (3,018) 50.4 48.3 51.4 1.0 1.0 1.0

LGR5 TC (2,409) 40.3 41.8 40.6 1.05 (0.92–1.20) 1.10 (0.92–1.31) 1.08 (0.90–1.29)

rs1353362** CC (493) 9.3 9.9 8.0 1.25 (0.99–1.57) 1.29 (0.96–1.73) 1.21 (0.90–1.62)

WFS1 C CC (2,179) 40.7 40.4 35.8 1.0 1.0 1.0

rs10012946†† CT (2,801) 44.6 45.7 47.8 0.83 (0.73–0.96) 0.86 (0.72–1.04) 0.87 (0.73–1.05)

TT (949) 14.7 13.9 16.4 0.77 (0.63–0.93) 0.77 (0.59–1.00) 0.76 (0.58–0.99)

*HR adjusted for age, sex, and BMI; †proxy for rs4607103, r² = 0.95; ‡proxy for rs12779790, r² = 0.83; §proxy for rs564398, r² = 0.97; ¶proxy for rs864745, r² = 0.97; ‖proxy for rs10923931, r² = 1.0; **proxy for rs7961581, r² = 0.96; ††proxy for rs10010131, r² = 1.0.


described in the literature (1–4,6,7,16,23). Nonetheless,

the discriminative accuracy of all known replicated type 2

diabetes susceptibility variants to date was rather low.

However, our analysis was based on 18 common variants

with relatively small effects. The heritability of type 2

diabetes is estimated to range from 30 to 70% depending

on the population investigated (26), and many common

variants with small effects or fewer rare variants with

stronger effects are still to be discovered. These may

further improve the discriminative accuracy of predictive

genetic testing for type 2 diabetes.

Several previous studies have investigated the predic-

tive value of multiple genetic variants in type 2 diabetes,

either alone or in addition to clinical characteristics. An

overview of the studies performed so far in Caucasian

populations and the number of genes investigated is

provided in Table 3. Weedon et al. (16) investigated three

variants that were also in our study and reported an AUC

of 0.58, whereas we found an AUC of 0.53 for the same

polymorphisms. The population of Weedon et al. consisted

in large part of patients who had early onset of type 2

diabetes or a positive family history of type 2 diabetes,

whereas our population included elderly subjects from the

general population. The percentage of variance of the

disease explained by genetic factors is expected to be

higher in populations with a positive family history for the

[FIG. 1: plot of OR (95% CI), y-axis 0.0 to 5.0, against number of risk alleles. Categories (% of population per category): 0-12 (2.0%), 13 (2.6%), 14 (4.9%), 15 (8.3%), 16 (11.5%), 17 (14.4%), 18 (15.9%), 19 (13.9%), 20 (10.9%), 21 (7.2%), 22 (4.4%), 23 (2.0%), 24-36 (1.8%).]

FIG. 1. Odds ratios for type 2 diabetes according to the number of risk alleles carried.

[FIG. 2: plot of predicted risk (%) against number of risk alleles.]

FIG. 2. Correlation of predicted type 2 diabetes risks with the risk allele score. Predicted risks of type 2 diabetes were obtained from the logistic regression model with 18 genetic polymorphisms as independent categorical variables.


disease, and this may lead to a higher diagnostic accuracy

for genetic variants. Recently, Cauchi et al. (25) investi-

gated 15 genetic variants that were associated in GWA

analyses in a French case-control study. The AUC of the

genetic variants together with age, sex, and BMI was 0.86.

Unfortunately, the AUC was not calculated for genes and

clinical characteristics separately to assess the additive

value of genetic information. Some of the included variants

were specifically identified in this case-control study

and had relatively large effects on type 2 diabetes risk

compared with effects found in meta-analyzed GWA studies

(1,2,4,6). It is therefore difficult to generalize these

findings to other populations, and we expect that our

population-based prospective study yielded more realistic

estimates. Two other studies investigated the improvement

of the discriminative accuracy by adding genetic test

results to clinical characteristics (14,15). Even though we

included the 18 firmly replicated polymorphisms to date

and they predominantly tested other genetic variants, our

findings were similar, demonstrating no substantial added

value of genetic information beyond clinical characteris-

tics (14,15,27).
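The AUCs compared in this discussion can be read as the probability that a randomly chosen case receives a higher predicted risk than a randomly chosen noncase, with ties counted as one half (22). A minimal Python sketch of that definition follows; the function name `auc` and the toy risk allele counts are illustrative assumptions, not data from any of the studies cited.

```python
from typing import Sequence


def auc(case_scores: Sequence[float], control_scores: Sequence[float]) -> float:
    """AUC as P(case score > control score) + 0.5 * P(tie), over all case-control pairs."""
    if not case_scores or not control_scores:
        raise ValueError("need at least one case and one control")
    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0   # case ranked above control
            elif c == k:
                wins += 0.5   # tie counts as half
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical risk allele counts (illustration only, not study data).
toy_cases = [19, 21, 18, 22]
toy_controls = [17, 19, 16, 20]
print(auc(toy_cases, toy_controls))  # 12.5 of 16 pairs favor the case: 0.78125
```

An AUC of 0.5 corresponds to no discrimination and 1.0 to perfect separation, which is why values of 0.60 to 0.68 indicate limited discriminative accuracy.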

We can only speculate on the reasons why these genetic variants have little added value beyond clinical characteristics. First, the effects of the genetic variants on type 2 diabetes risk could be exerted through clinical characteristics such as BMI, which implies that including both genes and intermediate factors in the regression model will reduce the effect of the gene. However, adjustment for clinical characteristics did not substantially change the effect of the genetic variants on type 2 diabetes risk (Table 2). Second, the effects of age, sex, and BMI on type 2 diabetes risk in our population may outweigh the contribution of the genetic variants. Such an effect was illustrated in an earlier study, which showed that a genetic predisposition became apparent in subjects with fewer other risk factors (28). In our elderly population, one may expect nongenetic risk factors to be more prevalent than in younger populations. However, the AUC for the genetic variants was higher than expected from the simulation analyses. This makes an underestimation of the contribution of the genetic variants in our population unlikely.
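To give a feel for what an AUC "expected from simulation" can look like, the sketch below simulates genotypes and disease status and computes the resulting AUC. It is not the procedure used in this study. All inputs are assumptions chosen for illustration: 18 hypothetical variants with the same odds ratio and risk allele frequency, Hardy-Weinberg proportions, a log-additive model without interactions, and an arbitrary risk for carriers of zero risk alleles. The function names are likewise hypothetical.

```python
import math
import random


def rank_auc(scores, outcomes):
    """AUC from the rank-sum (Mann-Whitney) statistic, using average ranks for ties."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1  # 1-based average rank of the tied block
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    n_cases = sum(outcomes)
    n_noncases = len(outcomes) - n_cases
    rank_sum = sum(r for r, y in zip(ranks, outcomes) if y)
    return (rank_sum - n_cases * (n_cases + 1) / 2) / (n_cases * n_noncases)


def simulate_auc(odds_ratios, allele_freqs, risk_without_risk_alleles=0.05,
                 n=20000, seed=1):
    """Simulate a log-additive genetic model and return the AUC of the true linear score."""
    rng = random.Random(seed)
    intercept = math.log(risk_without_risk_alleles / (1 - risk_without_risk_alleles))
    betas = [math.log(o) for o in odds_ratios]
    scores, outcomes = [], []
    for _ in range(n):
        linear_predictor = intercept
        for beta, p in zip(betas, allele_freqs):
            # Two independent allele draws per variant (Hardy-Weinberg assumption).
            n_risk_alleles = (rng.random() < p) + (rng.random() < p)
            linear_predictor += beta * n_risk_alleles
        risk = 1 / (1 + math.exp(-linear_predictor))
        scores.append(linear_predictor)
        outcomes.append(rng.random() < risk)
    return rank_auc(scores, outcomes)


# Assumed inputs: 18 variants, each with OR 1.15 and risk allele frequency 0.30.
print(round(simulate_auc([1.15] * 18, [0.30] * 18), 3))
```

With modest per-allele odds ratios such as these, the simulated AUC stays far from 1.0, which is in line with the limited discriminative accuracy discussed above.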

Obvious strengths of our study are the large size of the study population, the population-based design, and the long period of follow-up. Despite these advantages, the number of incident type 2 diabetes cases was still relatively low for demonstrating statistically significant effects of low-risk susceptibility genes. Furthermore, we had insufficient statistical power to formally investigate gene-gene and gene-environment interactions. In age and BMI subgroup analyses, the estimates for prediction based on the genetic variants were similar and showed overlapping CIs. However, because of the smaller numbers of cases in the subgroups, these results should be interpreted with caution. Cauchi et al. (25) reported gene-gene interactions of recently discovered loci but did not report on the effects of these interactions on the AUC. Earlier studies found no evidence for strong gene-gene interactions (15,16). Taking such interactions into account may further improve the discriminative accuracy.
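How strongly the number of cases affects the precision of an AUC can be illustrated with the standard error approximation of Hanley and McNeil (22). The sketch below assumes that approximation and a normal-theory 95% CI; it is not necessarily how the CIs in this study were obtained. The counts are illustrative: 601 cases as in our incident series, a hypothetical 5,000 noncases, and a hypothetical subgroup of 150 cases and 1,250 noncases.

```python
import math


def hanley_mcneil_se(auc: float, n_cases: int, n_noncases: int) -> float:
    """Approximate standard error of an AUC (Hanley and McNeil, 1982)."""
    q1 = auc / (2 - auc)
    q2 = 2 * auc ** 2 / (1 + auc)
    variance = (
        auc * (1 - auc)
        + (n_cases - 1) * (q1 - auc ** 2)
        + (n_noncases - 1) * (q2 - auc ** 2)
    ) / (n_cases * n_noncases)
    return math.sqrt(variance)


def auc_ci(auc: float, n_cases: int, n_noncases: int, z: float = 1.96):
    """Normal-approximation confidence interval, clipped to [0, 1]."""
    se = hanley_mcneil_se(auc, n_cases, n_noncases)
    return max(0.0, auc - z * se), min(1.0, auc + z * se)


# Illustrative counts only; the numbers of noncases and the subgroup sizes are assumptions.
print(auc_ci(0.60, 601, 5000))   # whole cohort
print(auc_ci(0.60, 150, 1250))   # hypothetical subgroup, wider interval
```

With the same AUC, the interval for the smaller subgroup is roughly twice as wide, which is why overlapping subgroup CIs provide only weak evidence that the estimates are equal.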

In conclusion, we showed that 9 of 18 currently well-established genetic risk variants were associated with type 2 diabetes in a population-based study. The currently known and replicated genetic variants found in GWA studies contributed modestly to the prediction of type 2 diabetes in a population-based setting and marginally improved the risk prediction when added to clinical characteristics. Future research should aim at identifying and replicating new genetic susceptibility variants and gene-gene and gene-environment interactions to approach levels of discriminative accuracy that enable the identification of at-risk groups.

ACKNOWLEDGMENTS

The Rotterdam Study is supported by the Erasmus Medical Center and Erasmus University Rotterdam; the Netherlands Organization for Scientific Research; the Netherlands Organization for Health Research and Development (ZonMw); the Research Institute for Diseases in the Elderly; the Ministry of Education, Culture and Science; the Ministry of Health, Welfare and Sports; the European Commission; and the Municipality of Rotterdam. This study was further supported by the Centre for Medical Systems Biology in the framework of the Netherlands Genomics Initiative. The genome-wide association database of the Rotterdam Study was made possible through funding from the Dutch Research Organisation (nr. 175.010.2005.011).

FIG. 3. ROC curves for prediction of incident type 2 diabetes based on 18 genetic polymorphisms, clinical characteristics (age, sex, and BMI), and both. [Axes: sensitivity against 1-specificity; curves: genetic polymorphisms, clinical characteristics (age, sex, BMI), and both.]

TABLE 3
Overview of diagnostic accuracies obtained from earlier empirical studies on genetic risk variants and type 2 diabetes

| Study | Genetic variants (n) | AUC genetic variants | AUC clinical characteristics | AUC combined |
| --- | --- | --- | --- | --- |
| Weedon et al. (16) | 3 | 0.58 | NR | NR |
| Lyssenko et al. (14,27) | 2 | NR | 0.68* | 0.68 |
| Vaxillaire et al. (15) | 3 | 0.56 | 0.82† | 0.84 |
| Cauchi et al. (25) | 15 | NR | NR† | 0.86 |

NR, not reported. *Clinical characteristics: fasting plasma glucose and BMI. †Clinical characteristics: age, sex, and BMI.


We thank Dr. Fernando Rivadeneira for making the Illumina 550K data from the Rotterdam Study available.

REFERENCES

1. Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PI, Chen H, Roix JJ,

Kathiresan S, Hirschhorn JN, Daly MJ, Hughes TE, Groop L, Altshuler D,

Almgren P, Florez JC, Meyer J, Ardlie K, Bengtsson BK, Isomaa B, Lettre

G, Lindblad U, Lyon HN, Melander O, Newton-Cheh C, Nilsson P, Orho-

Melander M, Rastam L, Speliotes EK, Taskinen MR, Tuomi T, Guiducci C,

Berglund A, Carlson J, Gianniny L, Hackett R, Hall L, Holmkvist J, Laurila

E, Sjogren M, Sterner M, Surti A, Svensson M, Svensson M, Tewhey R,

Blumenstiel B, Parkin M, Defelice M, Barry R, Brodeur W, Camarata J, Chia

N, Fava M, Gibbons J, Handsaker B, Healy C, Nguyen K, Gates C, Sougnez

C, Gage D, Nizzari M, Gabriel SB, Chirn GW, Ma Q, Parikh H, Richardson

D, Ricke D, Purcell S: Genome-wide association analysis identifies loci for

type 2 diabetes and triglyceride levels. Science 316:1331–1336, 2007

2. Scott LJ, Mohlke KL, Bonnycastle LL, Willer CJ, Li Y, Duren WL, Erdos MR,

Stringham HM, Chines PS, Jackson AU, Prokunina-Olsson L, Ding CJ, Swift

AJ, Narisu N, Hu T, Pruim R, Xiao R, Li XY, Conneely KN, Riebow NL,

Sprau AG, Tong M, White PP, Hetrick KN, Barnhart MW, Bark CW,

Goldstein JL, Watkins L, Xiang F, Saramies J, Buchanan TA, Watanabe RM,

Valle TT, Kinnunen L, Abecasis GR, Pugh EW, Doheny KF, Bergman RN,

Tuomilehto J, Collins FS, Boehnke M: A genome-wide association study of

type 2 diabetes in Finns detects multiple susceptibility variants. Science

316:1341–1345, 2007

3. Sladek R, Rocheleau G, Rung J, Dina C, Shen L, Serre D, Boutin P, Vincent

D, Belisle A, Hadjadj S, Balkau B, Heude B, Charpentier G, Hudson TJ,

Montpetit A, Pshezhetsky AV, Prentki M, Posner BI, Balding DJ, Meyre D,

Polychronakos C, Froguel P: A genome-wide association study identifies

novel risk loci for type 2 diabetes. Nature 445:881–885, 2007

4. Zeggini E, Weedon MN, Lindgren CM, Frayling TM, Elliott KS, Lango H,

Timpson NJ, Perry JR, Rayner NW, Freathy RM, Barrett JC, Shields B,

Morris AP, Ellard S, Groves CJ, Harries LW, Marchini JL, Owen KR, Knight

B, Cardon LR, Walker M, Hitman GA, Morris AD, Doney AS, McCarthy MI,

Hattersley AT: Replication of genome-wide association signals in UK

samples reveals risk loci for type 2 diabetes. Science 316:1336–1341, 2007

5. Wellcome Trust Case Control Consortium: Genome-wide association study

of 14,000 cases of seven common diseases and 3,000 shared controls.

Nature 447:661–678, 2007

6. Zeggini E, Scott LJ, Saxena R, Voight BF, Marchini JL, Hu T, de Bakker PI,

Abecasis GR, Almgren P, Andersen G, Ardlie K, Bostrom KB, Bergman RN,

Bonnycastle LL, Borch-Johnsen K, Burtt NP, Chen H, Chines PS, Daly MJ,

Deodhar P, Ding CJ, Doney AS, Duren WL, Elliott KS, Erdos MR, Frayling

TM, Freathy RM, Gianniny L, Grallert H, Grarup N, Groves CJ, Guiducci C,

Hansen T, Herder C, Hitman GA, Hughes TE, Isomaa B, Jackson AU,

Jorgensen T, Kong A, Kubalanza K, Kuruvilla FG, Kuusisto J, Langenberg

C, Lango H, Lauritzen T, Li Y, Lindgren CM, Lyssenko V, Marvelle AF,

Meisinger C, Midthjell K, Mohlke KL, Morken MA, Morris AD, Narisu N,

Nilsson P, Owen KR, Palmer CN, Payne F, Perry JR, Pettersen E, Platou C,

Prokopenko I, Qi L, Qin L, Rayner NW, Rees M, Roix JJ, Sandbaek A,

Shields B, Sjogren M, Steinthorsdottir V, Stringham HM, Swift AJ,

Thorleifsson G, Thorsteinsdottir U, Timpson NJ, Tuomi T, Tuomilehto J, Walker

M, Watanabe RM, Weedon MN, Willer CJ, Illig T, Hveem K, Hu FB, Laakso

M, Stefansson K, Pedersen O, Wareham NJ, Barroso I, Hattersley AT,

Collins FS, Groop L, McCarthy MI, Boehnke M, Altshuler D: Meta-analysis

of genome-wide association data and large-scale replication identifies

additional susceptibility loci for type 2 diabetes. Nat Genet 40:638–645, 2008

7. Gudmundsson J, Sulem P, Steinthorsdottir V, Bergthorsson JT, Thorleifsson

G, Manolescu A, Rafnar T, Gudbjartsson D, Agnarsson BA, Baker A,

Sigurdsson A, Benediktsdottir KR, Jakobsdottir M, Blondal T, Stacey SN,

Helgason A, Gunnarsdottir S, Olafsdottir A, Kristinsson KT, Birgisdottir B,

Ghosh S, Thorlacius S, Magnusdottir D, Stefansdottir G, Kristjansson K,

Bagger Y, Wilensky RL, Reilly MP, Morris AD, Kimber CH, Adeyemo A,

Chen Y, Zhou J, So WY, Tong PC, Ng MC, Hansen T, Andersen G,

Borch-Johnsen K, Jorgensen T, Tres A, Fuertes F, Ruiz-Echarri M, Asin L,

Saez B, van BE, Klaver S, Swinkels DW, Aben KK, Graif T, Cashy J, Suarez

BK, van Vierssen TO, Frigge ML, Ober C, Hofker MH, Wijmenga C,

Christiansen C, Rader DJ, Palmer CN, Rotimi C, Chan JC, Pedersen O,

Sigurdsson G, Benediktsson R, Jonsson E, Einarsson GV, Mayordomo JI,

Catalona WJ, Kiemeney LA, Barkardottir RB, Gulcher JR, Thorsteinsdottir

U, Kong A, Stefansson K: Two variants on chromosome 17 confer prostate

cancer risk, and the one in TCF2 protects against type 2 diabetes. Nat

Genet 39:977–983, 2007

8. Holtzman NA, Marteau TM: Will genetics revolutionize medicine? N Engl

J Med 343:141–144, 2000

9. Janssens AC, Gwinn M, Valdez R, Narayan KM, Khoury MJ: Predictive

genetic testing for type 2 diabetes. Br Med J 333:509–510, 2006

10. Janssens AC, Aulchenko YS, Elefante S, Borsboom GJ, Steyerberg EW, van

Duijn CM: Predictive testing for complex diseases using multiple genes:

fact or fiction? Genet Med 8:395–400, 2006

11. Janssens AC, Moonesinghe R, Yang Q, Steyerberg EW, van Duijn CM,

Khoury MJ: The impact of genotype frequencies on the clinical validity of

genomic profiling for predicting common chronic diseases. Genet Med

9:528–535, 2007

12. Yang Q, Khoury MJ, Botto L, Friedman JM, Flanders WD: Improving the

prediction of complex diseases by testing for multiple disease-susceptibility

genes. Am J Hum Genet 72:636–649, 2003

13. Wray NR, Goddard ME, Visscher PM: Prediction of individual genetic risk

to disease from genome-wide association studies. Genome Res 17:1520–1528, 2007

14. Lyssenko V, Almgren P, Anevski D, Orho-Melander M, Sjogren M, Saloranta

C, Tuomi T, Groop L: Genetic prediction of future type 2 diabetes. PLoS

Med 2:e345, 2005

15. Vaxillaire M, Veslot J, Dina C, Proenca C, Cauchi S, Charpentier G, Tichet

J, Fumeron F, Marre M, Meyre D, Balkau B, Froguel P: Impact of common

type 2 diabetes risk polymorphisms in the D.E.S.I.R. Prospective Study.

Diabetes 57:244–254, 2008

16. Weedon MN, McCarthy MI, Hitman G, Walker M, Groves CJ, Zeggini E,

Rayner NW, Shields B, Owen KR, Hattersley AT, Frayling TM: Combining

information from common type 2 diabetes risk polymorphisms improves

disease prediction. PLoS Med 3:e374, 2006

17. Hofman A, Breteler MM, van Duijn CM, Krestin GP, Pols HA, Stricker BH,

Tiemeier H, Uitterlinden AG, Vingerling JR, Witteman JC: The Rotterdam

Study: objectives and design update. Eur J Epidemiol 22:819–829, 2007

18. Expert Committee on the Diagnosis and Classification of Diabetes Mellitus:

Report of the Expert Committee on the Diagnosis and Classification of

Diabetes Mellitus. Diabetes Care 20:1183–1197, 1997

19. Alberti KG, Zimmet PZ: Definition, diagnosis and classification of diabetes

mellitus and its complications. Part 1: diagnosis and classification of

diabetes mellitus provisional report of a WHO consultation. Diabet Med

15:539–553, 1998

20. Fang Y, van Meurs JB, d’Alesio A, Jhamai M, Zhao H, Rivadeneira F,

Hofman A, van Leeuwen JP, Jehan F, Pols HA, Uitterlinden AG: Promoter

and 3′-untranslated-region haplotypes in the vitamin D receptor gene

predispose to osteoporotic fracture: the Rotterdam Study. Am J Hum

Genet 77:807–823, 2005

21. Hanley JA, McNeil BJ: A method of comparing the areas under receiver

operating characteristic curves derived from the same cases. Radiology

148:839–843, 1983

22. Hanley JA, McNeil BJ: The meaning and use of the area under a receiver

operating characteristic (ROC) curve. Radiology 143:29–36, 1982

23. Sandhu MS, Weedon MN, Fawcett KA, Wasson J, Debenham SL, Daly A,

Lango H, Frayling TM, Neumann RJ, Sherva R, Blech I, Pharoah PD,

Palmer CN, Kimber C, Tavendale R, Morris AD, McCarthy MI, Walker M,

Hitman G, Glaser B, Permutt MA, Hattersley AT, Wareham NJ, Barroso I:

Common variants in WFS1 confer risk of type 2 diabetes. Nat Genet

39:951–953, 2007

24. Frayling TM: A new era in finding type 2 diabetes genes: the unusual

suspects. Diabet Med 24:696–701, 2007

25. Cauchi S, Meyre D, Durand E, Proenca C, Marre M, Hadjadj S, Choquet H,

De GF, Gaget S, Allegaert F, Delplanque J, Permutt MA, Wasson J, Blech I,

Charpentier G, Balkau B, Vergnaud AC, Czernichow S, Patsch W, Chikri M,

Glaser B, Sladek R, Froguel P: Post genome-wide association studies of

novel genes associated with type 2 diabetes show gene-gene interaction

and high predictive value. PLoS ONE 3:e2031, 2008

26. Stumvoll M, Goldstein BJ, van Haeften TW: Type 2 diabetes: principles of

pathogenesis and therapy. Lancet 365:1333–1346, 2005

27. Janssens AC, Gwinn M, Khoury MJ, Subramonia-Iyer S: Does genetic

testing really improve the prediction of future type 2 diabetes? PLoS Med

3:e114, 2006

28. Pardo Silva MC, Janssens AC, Hofman A, Witteman JC, van Duijn CM:

Apolipoprotein E gene is related to mortality only in normal weight

individuals: the Rotterdam Study. Eur J Epidemiol 23:135–142, 2008

GENETIC PREDICTION OF TYPE 2 DIABETES

3128 DIABETES, VOL. 57, NOVEMBER 2008

The role of single-nucleotide polymorphisms of some candidate genes of carbohydrate and fat metabolism in predicting the risk of type 2 diabetes mellitus

Article

Full-text available

Mar 2023

Over the past decade, some progress has been made in identifying and characterizing variants of DNA polymorphisms of genes associated with predisposition to type 2 diabetes mellitus (T2DM). The analysis of gene polymorphisms in combination with socio-demographic, clinical and metabolic parameters can be considered as a promising approach to identify high-risk groups for the development of T2DM. The review includes foreign and domestic studies of predictive models for the risk of developing T2DM comprising single-nucleotide polymorphisms, published in the period from 2006 to 2021. The search for the literature sources was carried out on the PubMed platform. The predictive accuracy of polygenic risk scores was assessed by comparing the area under the curve (AUC). The most commonly used clinical predictors of T2DM risk are sex, age, BMI, family history of diabetes, presence of arterial hypertension, waist circumference, waist-to-hip ratio. All genetic risk models for T2DM had lower AUC values than phenotypic (clinical) risk models. The addition of genetic factors has, in turn, improved AUC compared to purely clinical risk models in many studies, which may be a useful tool for primary prevention of T2DM. However, only those polymorphisms that strongly confirm their association with the risk of developing T2DM in different populations studies should be added to predictive risk scales.

GRP94 is an IGF-1R chaperone and regulates beta cell death in diabetes

Article

Full-text available

May 2024

High workload-induced cellular stress can cause pancreatic islet β cell death and dysfunction, or β cell failure, a hallmark of type 2 diabetes mellitus. Thus, activation of molecular chaperones and other stress-response genes prevents β cell failure. To this end, we have shown that deletion of the glucose-regulated protein 94 (GRP94) in Pdx1 ⁺ pancreatic progenitor cells led to pancreas hypoplasia and reduced β cell mass during pancreas development in mice. Here, we show that GRP94 was involved in β cell adaption and compensation (or failure) in islets from leptin receptor-deficient ( db/db) mice in an age-dependent manner. GRP94-deficient cells were more susceptible to cell death induced by various diabetogenic stress conditions. We also identified a new client of GRP94, insulin-like growth factor-1 receptor (IGF-1R), a critical factor for β cell survival and function that may mediate the effect of GRP94 in the pathogenesis of diabetes. This study has identified essential functions of GRP94 in β cell failure related to diabetes.

AI-enhanced integration of genetic and medical imaging data for risk assessment of Type 2 diabetes

Article

Full-text available

May 2024

Type 2 diabetes (T2D) presents a formidable global health challenge, highlighted by its escalating prevalence, underscoring the critical need for precision health strategies and early detection initiatives. Leveraging artificial intelligence, particularly eXtreme Gradient Boosting (XGBoost), we devise robust risk assessment models for T2D. Drawing upon comprehensive genetic and medical imaging datasets from 68,911 individuals in the Taiwan Biobank, our models integrate Polygenic Risk Scores (PRS), Multi-image Risk Scores (MRS), and demographic variables, such as age, sex, and T2D family history. Here, we show that our model achieves an Area Under the Receiver Operating Curve (AUC) of 0.94, effectively identifying high-risk T2D subgroups. A streamlined model featuring eight key variables also maintains a high AUC of 0.939. This high accuracy for T2D risk assessment promises to catalyze early detection and preventive strategies. Moreover, we introduce an accessible online risk assessment tool for T2D, facilitating broader applicability and dissemination of our findings.

Comprehensive overview of disease models for Wolfram syndrome: toward effective treatments

Article

Full-text available

Feb 2024
MAMM GENOME

Wolfram syndrome (OMIM 222300) is a rare autosomal recessive disease with a devastating array of symptoms, including diabetes mellitus, optic nerve atrophy, diabetes insipidus, hearing loss, and neurological dysfunction. The discovery of the causative gene, WFS1, has propelled research on this disease. However, a comprehensive understanding of the function of WFS1 remains unknown, making the development of effective treatment a pressing challenge. To bridge these knowledge gaps, disease models for Wolfram syndrome are indispensable, and understanding the characteristics of each model is critical. This review will provide a summary of the current knowledge regarding WFS1 function and offer a comprehensive overview of established disease models for Wolfram syndrome, covering animal models such as mice, rats, flies, and zebrafish, along with induced pluripotent stem cell (iPSC)-derived human cellular models. These models replicate key aspects of Wolfram syndrome, contributing to a deeper understanding of its pathogenesis and providing a platform for discovering potential therapeutic approaches.

JAZF1: A metabolic actor subunit of the NuA4/TIP60 chromatin modifying complex

Article

Full-text available

Apr 2023

The multisubunit NuA4/TIP60 complex is a lysine acetyltransferase, chromatin modifying factor and gene co-activator involved in diverse biological processes. The past decade has seen a growing appreciation for its role as a metabolic effector and modulator. However, molecular insights are scarce and often contradictory, underscoring the need for further mechanistic investigation. A particularly exciting route emerged with the recent identification of a novel subunit, JAZF1, which has been extensively linked to metabolic homeostasis. This review summarizes the major findings implicating NuA4/TIP60 in metabolism, especially in light of JAZF1 as part of the complex.

Association between cardiovascular disease risk and incident type 2 diabetes mellitus in individuals with prediabetes: A retrospective cohort study

Article

Feb 2024

Type 2 diabetes linked FTO gene variant rs8050136 is significantly associated with gravidity in gestational diabetes in a sample of Bangladeshi women: Meta-analysis and case-control study

Article

Full-text available

Nov 2023
PLOS ONE

Objective Gestational diabetes mellitus (GDM) is a growing public health concern that has not been extensively studied. Numerous studies have indicated that a variant (rs8050136) of the fat mass-associated gene, FTO, is associated with both GDM and Type 2 diabetes mellitus(T2DM). We conducted a meta-analysis on the association between the FTO single nucleotide polymorphism (SNP) rs8050136 and T2DM, followed by a case-control study on the association of the said SNP and GDM in a sample of Bangladeshi women. Method A total of 25 studies were selected after exploring various databases and search engines, which were assessed using the Newcastle-Ottawa Scale (NOS). The MetaGenyo web tool was used to conduct this meta-analysis. A case-control study was performed on 218 GDM patients and 284 controls to observe any association between FTO rs8050136 and GDM. Genotyping was performed using the tetra-primer amplification refractory mutation system-polymerase chain reaction (T-ARMS) method, and statistical analyses were performed using various statistical softwares. Results In the meta-analysis 26231 cases and 43839 controls were examined. Pooled association analyses revealed a statistically significant relationship between the FTO rs8050136 polymorphism and an elevated risk of T2DM under all genetic models (P<0.05). In the case-control study, synergistic analyses of the SNP and gravida with GDM revealed a significant (P<0.01) association with an increase in odds by 1.6 to 2.4 folds in multigravida and decrease in odds by 2 folds in primigravida. A positive family history of diabetes and the minor allele of this SNP collectively increased the risk of developing GDM by many-fold (1.8 to 2.7 folds). However, after accounting for family history of diabetes and gravidity, analyses showed no significant association with GDM. 
Conclusion Our meta-analysis revealed a significant association between SNP rs8050136 of FTO with T2DM, and this variant was substantially associated with an increased risk of GDM in a sample of Bangladeshi multigravida women.

An early prediction model for type 2 diabetes mellitus based on genetic variants and nongenetic risk factors in a Han Chinese cohort

Article

Full-text available

Oct 2023

Aims We aimed to construct a prediction model of type 2 diabetes mellitus (T2DM) in a Han Chinese cohort using a genetic risk score (GRS) and a nongenetic risk score (NGRS). Methods A total of 297 Han Chinese subjects who were free from type 2 diabetes mellitus were selected from the Tianjin Medical University Chronic Disease Cohort for a prospective cohort study. Clinical characteristics were collected at baseline and subsequently tracked for a duration of 9 years. Genome-wide association studies (GWASs) were performed for T2DM-related phenotypes. The GRS was constructed using 13 T2DM-related quantitative trait single nucleotide polymorphisms (SNPs) loci derived from GWASs, and NGRS was calculated from 4 biochemical indicators of independent risk that screened by multifactorial Cox regressions. Results We found that HOMA-IR, uric acid, and low HDL were independent risk factors for T2DM (HR >1; P<0.05), and the NGRS model was created using these three nongenetic risk factors, with an area under the ROC curve (AUC) of 0.678; high fasting glucose (FPG >5 mmol/L) was a key risk factor for T2DM (HR = 7.174, P< 0.001), and its addition to the NGRS model caused a significant improvement in AUC (from 0.678 to 0.764). By adding 13 SNPs associated with T2DM to the GRS prediction model, the AUC increased to 0.892. The final combined prediction model was created by taking the arithmetic sum of the two models, which had an AUC of 0.908, a sensitivity of 0.845, and a specificity of 0.839. Conclusions We constructed a comprehensive prediction model for type 2 diabetes out of a Han Chinese cohort. Along with independent risk factors, GRS is a crucial element to predicting the risk of type 2 diabetes mellitus.

Disease Risk Models

Chapter

Aug 2019

Gene–environment interactions in the associations of PFAS exposure with insulin sensitivity and beta-cell function in a Faroese cohort followed from birth to adulthood

Article

Mar 2023
ENVIRON RES

Background: Exposure to perfluoroalkyl substances (PFAS) has been associated with changes in insulin sensitivity and pancreatic beta-cell function in humans. Genetic predisposition to diabetes may modify these associations; however, this hypothesis has not been yet studied. Objectives: To evaluate genetic heterogeneity as a modifier in the PFAS association with insulin sensitivity and pancreatic beta-cell function, using a targeted gene-environment (GxE) approach. Methods: We studied 85 single-nucleotide polymorphisms (SNPs) associated with type 2 diabetes, in 665 Faroese adults born in 1986-1987. Perfluorooctane sulfonate (PFOS) and perfluorooctanoic acid (PFOA) were measured in cord whole blood at birth and in participants' serum from age 28 years. We calculated the Matsuda-insulin sensitivity index (ISI) and the insulinogenic index (IGI) based on a 2 h-oral glucose tolerance test performed at age 28. Effect modification was evaluated in linear regression models adjusted for cross-product terms (PFAS*SNP) and important covariates. Results: Prenatal and adult PFOS exposures were significantly associated with decreased insulin sensitivity and increased beta-cell function. PFOA associations were in the same direction but attenuated compared to PFOS. A total of 58 SNPs were associated with at least one PFAS exposure variable and/or Matsuda-ISI or IGI in the Faroese population and were subsequently tested as modifiers in the PFAS-clinical outcome associations. Eighteen SNPs showed interaction p-values (PGxE) < 0.05 in at least one PFAS-clinical outcome association, five of which passed False Discovery Rate (FDR) correction (PGxE-FDR<0.20). SNPs for which we found stronger evidence for GxE interactions included ABCA1 rs3890182, FTO rs9939609, FTO rs3751812, PPARG rs170036314 and SLC12A3 rs2289116 and were more clearly shown to modify the PFAS associations with insulin sensitivity, rather than with beta-cell function. 
Discussion: Findings from this study suggest that PFAS-associated changes in insulin sensitivity could vary between individuals as a result of genetic predisposition and warrant replication in independent larger populations.

A genome-wide association study identifies novel risk loci for type 2 diabetes

Article

Full-text available

Feb 2007

Type 2 diabetes mellitus results from the interaction of environmental factors with a combination of genetic variants, most of which were hitherto unknown. A systematic search for these variants was recently made possible by the development of high-density arrays that permit the genotyping of hundreds of thousands of polymorphisms. We tested 392,935 single-nucleotide polymorphisms in a French case-control cohort. Markers with the most significant difference in genotype frequencies between cases of type 2 diabetes and controls were fast-tracked for testing in a second cohort. This identified four loci containing variants that confer type 2 diabetes risk, in addition to confirming the known association with the TCF7L2 gene. These loci include a non-synonymous polymorphism in the zinc transporter SLC30A8, which is expressed exclusively in insulin-producing beta-cells, and two linkage disequilibrium blocks that contain genes potentially involved in beta-cell development or function (IDE-KIF11-HHEX and EXT2-ALX4). These associations explain a substantial portion of disease risk and constitute proof of principle for the genome-wide approach to the elucidation of complex genetic traits.

The Wellcome Trust Case Control Consortium (WTCCC) Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, 661-678

Article

Full-text available

Jun 2007

There is increasing evidence that genome-wide association (GWA) studies represent a powerful approach to the identification of genes involved in common human diseases. We describe a joint GWA study (using the Affymetrix GeneChip 500K Mapping Array Set) undertaken in the British population, which has examined 2,000 individuals for each of 7 major diseases and a shared set of 3,000 controls. Case-control comparisons identified 24 independent association signals at P < 5 10-7: 1 in bipolar disorder, 1 in coronary artery disease, 9 in Crohn's disease, 3 in rheumatoid arthritis, 7 in type 1 diabetes and 3 in type 2 diabetes. On the basis of prior findings and replication studies thus-far completed, almost all of these signals reflect genuine susceptibility effects. We observed association at many previously identified loci, and found compelling evidence that some loci confer risk for more than one of the diseases studied. Across all diseases, we identified a large number of further signals (including 58 loci with single-point P values between 10-5 and 5 10-7) likely to yield additional susceptibility loci. The importance of appropriately large samples was confirmed by the modest effect sizes observed at most loci identified. This study thus represents a thorough validation of the GWA approach. It has also demonstrated that careful use of a shared control group represents a safe and effective approach to GWA analyses of multiple disease phenotypes; has generated a genome-wide genotype database for future studies of common diseases in the British population; and shown that, provided individuals with non-European ancestry are excluded, the extent of population stratification in the British population is generally modest. Our findings offer new avenues for exploring the pathophysiology of these important disorders. 
We anticipate that our data, results and software, which will be widely available to other investigators, will provide a powerful resource for human genetics research.

Response to the Expert Committee on the Diagnosis and Classification of Diabetes Mellitus

Article

Full-text available

Mar 1998
DIABETES CARE

A Method of Comparing the Areas Under Receiver Operating Characteristic Curves Derived from the Same Cases

Article

Full-text available

Oct 1983

Receiver operating characteristic (ROC) curves are used to describe and compare the performance of diagnostic technology and diagnostic algorithms. This paper refines the statistical comparison of the areas under two ROC curves derived from the same set of patients by taking into account the correlation between the areas that is induced by the paired nature of the data. The correspondence between the area under an ROC curve and the Wilcoxon statistic is used and underlying Gaussian distributions (binormal) are assumed to provide a table that converts the observed correlations in paired ratings of images into a correlation between the two ROC areas. This between-area correlation can be used to reduce the standard error (uncertainty) about the observed difference in areas. This correction for pairing, analogous to that used in the paired t-test, can produce a considerable increase in the statistical sensitivity (power) of the comparison. For studies involving multiple readers, this method provides a measure of a component of the sampling variation that is otherwise difficult to obtain.

Report of the ADA Expert Committee on the Diagnosis and Classification of Diabetes Mellitus

Article

Jan 2002
DIABETES CARE

Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels

Article

Jan 2007

Two variants on chromosome 17 confer prostate cancer risk, and the one in TCF2 protects against type 2 diabetes

Article

Nov 2006

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. WTCCC. Nature 2007;447:661-678 (7 June 2007). PMID 17554300

Article

Nov 2006
NATURE

Wellcome Trust Case Control Consortium

This paper was published as Nature, 2007, 447 (7145), pp. 661-678. It is available from http://www.nature.com/nature/journal/v447/n7145/abs/nature05911.html. DOI: 10.1038/nature05911

Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. The Wellcome Trust Case Control Consortium. Nature 2007;447(7145):661-678

Article

Jan 2007

A bootstrap resampling procedure for model building: Application to the Cox regression model

Article

Dec 1992

A common problem in the statistical analysis of clinical studies is the selection of those variables in the framework of a regression model which might influence the outcome variable. Stepwise methods have been available for a long time, but as with many other possible strategies, there is a lot of criticism of their use. Investigations of the stability of a selected model are often called for, but usually are not carried out in a systematic way. Since analytical approaches are extremely difficult, data-dependent methods might be a useful alternative. Based on a bootstrap resampling procedure, Chen and George investigated the stability of a stepwise selection procedure in the framework of the Cox proportional hazards regression model. We extend their proposal and develop a bootstrap-model selection procedure, combining the bootstrap method with existing selection techniques such as stepwise methods. We illustrate the proposed strategy in the process of model building by using data from two cancer clinical trials featuring two different situations commonly arising in clinical research. In a brain tumour study, the adjustment for covariates in an overall treatment comparison is of primary interest, calling for the selection of even 'mild' effects. In a prostate cancer study, we concentrate on the analysis of treatment-covariate interactions, demanding that only 'strong' effects should be selected. Both variants of the strategy will be demonstrated by analysing the clinical trials with a Cox model, but they can be applied in other types of regression with obvious and straightforward modifications.
