On the Pitfalls of using Arbiter-PUFs as Building
Blocks
Georg T. Becker
Horst Görtz Institute for IT Security, Ruhr University Bochum, Germany
Georg.Becker@rub.de
Abstract—Physical Unclonable Functions (PUFs)
have emerged as a promising solution for securing
resource-constrained embedded devices such as RFID
tokens. PUFs use the inherent physical differences of
every chip to either securely authenticate the chip or
generate cryptographic keys without the need of non-
volatile memory. However, PUFs have shown to be
vulnerable to model building attacks if the attacker
has access to challenge and response pairs. In these
model building attacks, machine learning is used to
determine the internal parameters of the PUF to build
an accurate software model. Nevertheless, PUFs are
still a promising building block and several protocols
and designs have been proposed that are believed to
be resistant against machine learning attacks. In this
paper we take a closer look at two such protocols,
one based on reverse fuzzy extractors [10] and one
based on pattern matching [15], [17]. We show that
it is possible to attack these protocols using machine
learning despite the fact that an attacker does not have
access to direct challenge and response pairs. The intro-
duced attacks demonstrate that even highly obfuscated
responses can be used to attack PUF protocols. Hence,
our work shows that even protocols in which it would
be computationally infeasible to compute enough chal-
lenge and response pairs for a direct machine learning
attack can be attacked using machine learning.
Index Terms—Physical Unclonable Functions, Ma-
chine Learning, Reverse Fuzzy Extractor, Evolution
Strategies
I. Introduction
Physical Unclonable Functions (PUFs) have gained
wide-spread attention in the research community as a
new cryptographic primitive for hardware security ap-
plications. PUFs make use of the fact that two manu-
factured computer chips are never completely identical
due to process variations. A PUF exploits these process
variations to build a unique identity for every chip. There
are many applications for which PUFs can be used. Two
prominent examples are their use in challenge-and-response
protocols to authenticate devices and in secure key
generation and storage. Securely storing a cryptographic
key in an embedded device in a way that it is resistant to
physical attacks such as probing, reverse-engineering and
side-channel attacks is extremely difficult. By using PUFs,
no key needs to be stored in non-volatile memory since the
secret is instead derived from internal physical character-
istics which are hard to measure from the outside. This
makes PUFs a very promising technology for embedded
security applications.
A PUF usually gets a challenge and answers with a
response that depends on its process variation. PUFs can
be classified into two categories: weak PUFs and strong
PUFs. In a weak PUF, the number of challenges the
PUF can accept is very limited so that an attacker can
try all possible challenges and store their corresponding
responses. This way an attacker could easily forge the PUF
by replacing it with a simple memory look-up. A
strong PUF on the other hand has a challenge space that
is large enough so that it is computationally infeasible
to try and store all possible challenges. Strong PUFs can
be used in challenge-and-response protocols as well as for
secure key generation. A weak PUF cannot be used for
challenge-and-response protocols, but can still be used for
secure key generation. Note that the terminology strong
PUF and weak PUF might falsely give the impression
that a strong PUF is “better” than a weak PUF. However,
this terminology only defines the challenge space without
judging the PUF’s reliability, uniqueness or other security
properties.
Current strong PUF designs face two big problems that
are related: they suffer from unreliability [12] and are
prone to machine learning attacks [18], [19]. In an ideal
case, a PUF always generates the same response for a
given challenge. However, due to environmental effects and
thermal noise, the response to the same challenge can vary.
In practice, PUF protocols therefore either need to allow
for a few false response bits or need error correction codes
to correct the faulty responses. The second problem is that
most strong PUFs can be modeled in software and the
needed parameters to model a specific PUF instance can
be determined using machine learning techniques if chal-
lenge and response pairs are known to the attacker [18].
To overcome this problem, new protocols and designs
have been proposed that are believed to be resistant
against machine learning attacks. Furthermore, some of
these protocols actually make use of the fact that model
building attacks on delay based PUFs are possible so that
the verifier can build software models of the PUF. During
a set-up phase, challenge and response pairs are revealed
and an accurate software model of the PUF is constructed
using machine learning techniques. After the set-up phase
direct access to the PUF is permanently disabled and
an authentication protocol is used that does not directly
reveal the challenge and response pairs. Two prominent
examples of PUF based authentication protocols are the
reverse fuzzy extractor based protocol by van Herrewege et
al. [10] and a pattern matching based protocol by Ma-
jzoobi et al. [15], [17]. These PUF protocols can be
implemented very efficiently in terms of area and power.
Hence, they are very promising alternatives to traditional
cryptography for constrained devices such as RFID tokens
or medical implants.
A. Our Contribution and Outline
The main contribution of this paper is to show how
powerful machine learning attacks can be and that for
a security analysis of a PUF protocol it is not enough
to show that an attacker does not have access to direct
challenges and response pairs. We show this by attacking
two PUF protocols as case studies, the reverse fuzzy
extractor based protocol by van Herrewege et al. [10] and
the Slender PUF protocol based on pattern matching [15],
[17].
Empirical tests have shown that, given a certain number
of challenge and response pairs, a PUF can be modeled
with a certain accuracy. However, a false conclusion is
commonly drawn from these empirical results: it is assumed
that a certain number of direct challenge and response
pairs is needed to attack a PUF. While such tests tell us
which model accuracy can be achieved with a given number
of direct challenge and response pairs, they do not imply
that direct challenge and response pairs are needed for
machine learning attacks. In
this paper we demonstrate, by attacking the Slender PUF
protocol and the reverse fuzzy extractor protocol, that
other information can be used instead. In both cases only
obfuscated responses in the form of a padded substring or
helper data of an error correction code are used to perform
successful machine learning attacks. The attack on the
reverse fuzzy extractor protocol also shows that not only
information about the value of response bits can be used
for attacking a protocol, but also the information about
the reliability of response bits. Since this information is
often provided by the helper data of error correction codes,
this attack is of importance for many different protocols
and systems.
In the next Section an introduction to Arbiter PUFs is
given and the ES-based machine learning algorithm which
is used to attack the PUF protocols is introduced. In Sec-
tion III two machine learning attacks on the reverse fuzzy
extractor protocol are described: One machine learning
attack that directly uses eavesdropped helper data and
one attack that uses the reliability information provided
by the helper data when the same challenges are used more
than once. Section IV shows that both the conference as
well as the journal version of the Slender PUF protocol
can be attacked using ES-based machine learning attacks.
The implications of these attacks are summarized in the
conclusion.
II. Background
The Arbiter PUF is the most popular electrical strong
PUF in the literature and most PUF protocols are based
on Arbiter PUFs or similar structures.
A. Arbiter PUF
The basic idea of the Arbiter PUF [13] is to apply a
race signal to two identical delay paths and determine
which of the two paths is faster. The two paths have an
identical layout so that the delay difference ∆D between
the two signals mainly depends on process variations.
This dependency on process variations ensures that each
chip has a unique delay behavior. The Arbiter PUF gets
a challenge as its input which defines the exact paths
the race signals take. Figure 1 shows the schematic of
an Arbiter PUF. It consists of a top and bottom signal
that is fed through delay stages. Each individual delay
stage consists of two 2-bit multiplexers (MUXes) that
have identical layouts and that both get the bottom and
top signals as inputs. If the challenge bit for the current
stage is ’1’, the multiplexers switch the top and bottom
signals, otherwise the two signals are not switched. Each
transistor in the multiplexers has a slightly different delay
characteristic due to process variations and hence the
delay difference between the top and bottom signal is
different for a ’1’ and a ’0’. This way, the race signal
can take many different paths: an n-stage Arbiter PUF
has 2^n different paths the race signals can take. However,
challenges that only differ in a few bits have a very similar
behavior, so that an Arbiter PUF does not necessarily have
2^n unique challenges. An Arbiter at the end of the PUF
determines which of the two signals is faster. The arbiter
consists of two cross-coupled AND gates which form a
latch and has an output of ’1’ if the top signal arrives
first and ’0’ if the bottom signal is the first to arrive. The
arbiter can have a slight bias so that the PUF result might
be slightly biased towards ’0’ or ’1’.
Fig. 1. Schematic of an n-bit Arbiter PUF.
To increase the resistance of Arbiter PUFs against
machine learning attacks it is proposed to add a non-linear
element to the PUF design. One of the most common
methods to add non-linearity to a PUF design is the XOR
Arbiter PUF. In a k-XOR Arbiter PUF, k PUF instances
are placed on the chip. Each of the PUF instances gets
the same challenge and the responses of the k PUFs
are XORed to build the final response bits. While the
machine learning resistance increases by XORing more
PUFs, adding additional PUF instances also increases the
area overhead of the design. Furthermore, the XOR PUFs
become increasingly unreliable the more PUFs are XORed.
3
Hence, in practice only a small number of PUFs can be
used to build an XOR Arbiter PUF.
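As a small illustration, a k-XOR Arbiter PUF can be simulated with the additive delay model introduced in the next subsection. This is only a sketch: the delay vectors are drawn from a Gaussian here, whereas a real PUF derives them from process variations, and the function names (`phi`, `xor_puf`) are our own:

```python
import numpy as np

def phi(c):
    # Feature vector of a challenge: suffix products of (-1)^c_l plus a bias term.
    p = np.ones(len(c) + 1)
    for i in range(len(c) - 1, -1, -1):
        p[i] = p[i + 1] * (-1) ** c[i]
    return p

def xor_puf(ws, c):
    # XOR the response bits of k simulated Arbiter PUFs fed the same challenge.
    bits = [(w @ phi(c)) > 0 for w in ws]
    return int(np.bitwise_xor.reduce(np.array(bits)))

rng = np.random.default_rng(7)
ws = rng.normal(0, 1, (4, 65))   # a 4-XOR PUF built from four 64-stage instances
c = rng.integers(0, 2, 64)       # one 64-bit challenge
print(xor_puf(ws, c))            # a single response bit
```

The XOR makes the response a non-linear function of the individual delay differences, which is exactly what raises the machine learning resistance at the cost of reliability.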
B. Modeling an Arbiter PUF
The response of an n-stage Arbiter PUF is determined
by the delay difference ∆D between the top and bottom
signal. This delay difference is simply the sum of the delay
differences of the individual stages. The delay difference of
each stage depends on the corresponding challenge bit c_i.
Hence, there are two delay differences per stage, δ_{0,i} for
challenge bit c_i = 0 and δ_{1,i} for c_i = 1. This way the PUF can
be modeled using 2·n parameters. However, there exists
a more efficient way of modeling an n-stage Arbiter PUF
using only n + 1 parameters [18]. A PUF instance can be
described by the delay vector w = (w_1, .., w_{n+1}) with:

w_1 = δ_{0,1} − δ_{1,1}   (1a)
w_i = δ_{0,i−1} + δ_{1,i−1} + δ_{0,i} − δ_{1,i}  for 2 ≤ i ≤ n   (1b)
w_{n+1} = δ_{0,n} + δ_{1,n}   (1c)

The delay difference ∆D at the end of the arbiter is the
result of the scalar product of the transposed delay
vector w with a feature vector Φ that is derived from the
challenge c:

∆D = w^T Φ   (2)

The feature vector Φ is derived from the challenge vector
c as follows:

Φ_i = ∏_{l=i}^{n} (−1)^{c_l}  for 1 ≤ i ≤ n,   Φ_{n+1} = 1   (3)
Modeling a PUF in this way can significantly decrease
the simulation time and also reduces the number of
parameters that need to be known to n + 1. It was shown in
the past how these parameters can be computed (approximated)
easily using different machine learning techniques. In practice,
only a few hundred challenge and response pairs are
needed to model an Arbiter PUF with a prediction rate
very close to the reliability of the attacked PUF [11].
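A minimal simulation of this model, with Equations (1)–(3) folded into two small helper functions (the names `feature_vector` and `puf_response` are ours), might look as follows:

```python
import numpy as np

def feature_vector(challenge):
    # Phi_i = product over l = i..n of (-1)^c_l, and Phi_{n+1} = 1 (Equation 3).
    n = len(challenge)
    phi = np.ones(n + 1)
    for i in range(n - 1, -1, -1):      # build the suffix products backwards
        phi[i] = phi[i + 1] * (-1) ** challenge[i]
    return phi

def puf_response(w, challenge):
    # Delta_D = w^T Phi (Equation 2); the arbiter outputs the sign of Delta_D.
    delta_d = np.dot(w, feature_vector(challenge))
    return 1 if delta_d > 0 else 0

rng = np.random.default_rng(1)
w = rng.normal(0, 1, 65)        # delay vector of a simulated 64-stage PUF
c = rng.integers(0, 2, 64)      # a random 64-bit challenge
print(puf_response(w, c))
```

The response is a linear threshold function of the feature vector, which is why a few hundred challenge and response pairs suffice for modeling.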
C. Evolution Strategies
Evolution Strategies (ES) is a widely used machine
learning technique inspired by the theory of evolution.
In evolution, a species can adapt itself to
environmental changes by means of natural selection, also
called survival of the fittest. In every generation, only
the fittest specimens survive and reproduce, while the
weak specimens die and hence do not reproduce. Since the
specimens of the next generation inherit the genes of the
fittest specimens of the previous generation, the species
continuously improves.
In ES-based machine learning attacks on PUFs, the
same principle of survival of the fittest is used. As discussed
in the previous Section, a PUF instance can be
described by its delay vector w. The goal of a machine
learning attack on an Arbiter PUF is to find a delay vector
w that most precisely resembles the real PUF instance.
The main idea of an ES machine learning attack is to
generate random PUF instances and check which of these
PUF instances are the fittest, i.e., which PUF instances
resemble the real PUF model the best. These fittest PUF
instances are kept as parents for the next generation
while the other PUF instances are discarded. In the next
generation, children are generated using the parent’s delay
vectors together with some random mutations, i.e., some
random modifications of the delay vectors. From these
child instances the fittest instances are determined again
and kept for the next generation as parent instances. This
process is repeated for many generations in which the PUF
instances gradually improve and resemble the real PUF
behavior more and more.
To be able to perform an ES machine learning attack,
two requirements need to be met: 1) it needs to be possible
to describe a PUF instance with a vector w, and 2) a
fitness test is needed that, given the delay vectors w, can
determine which instances are the fittest.
Since arbiter based PUFs can be modeled using the
delay vector w, whether or not an ES machine learning
attack is feasible depends on requirement 2), i.e., whether or
not a fitness test for these PUF models exists. Typically,
the used fitness test for an Arbiter PUF is the model
accuracy A between the l measured responses R from
the physical PUF and the computed responses R′ of the
PUF instance under test. The model accuracy can be
computed from the average Hamming distance
HD() between the two response strings:

A = 1 − HD(R′, R)/l   (4)

The PUF instances with the highest model accuracies
are considered the fittest. This fitness test can be used
whenever the attacker has access to challenge and response
pairs.
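As a sketch, such an accuracy-based fitness test amounts to counting matching bits (the helper name is ours, and we express the accuracy as the fraction of correctly predicted response bits):

```python
import numpy as np

def model_accuracy(r_model, r_measured):
    # Fraction of the l response bits the candidate model predicts correctly,
    # i.e., one minus the average Hamming distance HD(R', R) / l.
    r_model = np.asarray(r_model)
    r_measured = np.asarray(r_measured)
    return 1.0 - np.mean(r_model != r_measured)

# A model that matches 3 of 4 measured response bits has accuracy 0.75.
print(model_accuracy([0, 1, 1, 0], [0, 1, 0, 0]))
```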
There exist many variants of ES machine learning algorithms
that mainly differ in how many parents are kept
in each generation, how the children are derived from the
parents and how the random mutation is controlled. Typically,
the mutation is done by adding a random Gaussian
variable N(0, σ) to each parameter. Different approaches
exist for how the mutation parameter σ is controlled. The
closer the PUF instances are to the optimal solution, the
smaller σ should be. One approach to control σ is to
deterministically decrease σ in every generation. In self-adaption
on the other hand, the mutation parameters
adapt themselves depending on how the machine learning
algorithm is currently performing. In this paper we use
Covariance Matrix Adaptation (CMA-ES) with the default
parameters provided in [9]. CMA-ES uses recombination,
i.e., one child instance depends on several parent instances.
It also uses self-adaption, i.e., the mutation strength is not
controlled deterministically but adapts itself depending on
how the ES algorithm is performing. In our experiments,
CMA-ES outperformed the (µ/γ)-ES used in [18] for
attacking XOR Arbiter PUFs, and the self-adaption makes
the algorithm perform very well for different noise levels.
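To make the procedure concrete, the toy sketch below fits a noise-free 32-stage Arbiter PUF model with a plain (µ, λ)-ES and a deterministically shrinking mutation strength σ. It is deliberately simpler than the CMA-ES used in this paper, and all parameter choices (µ = 4, λ = 24, 60 generations) are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 32                                   # PUF stages (kept small for a quick demo)

def features(C):
    # Feature vectors for a batch of challenges: suffix products of (-1)^c_l
    # plus a constant bias term, as in Equation (3).
    signs = 1 - 2 * C
    phi = np.ones((C.shape[0], n + 1))
    phi[:, :n] = np.cumprod(signs[:, ::-1], axis=1)[:, ::-1]
    return phi

def responses(w, phi):
    return (phi @ w > 0).astype(int)

# Target PUF instance and a training set of challenge and response pairs.
w_true = rng.normal(0, 1, n + 1)
C = rng.integers(0, 2, (2000, n))
phi = features(C)
r = responses(w_true, phi)

# Plain (mu, lambda)-ES: keep the mu fittest of lambda children per generation,
# with a deterministically decreasing mutation strength sigma.
mu, lam, sigma = 4, 24, 0.5
parents = rng.normal(0, 1, (mu, n + 1))
for generation in range(60):
    children = parents[rng.integers(0, mu, lam)] + rng.normal(0, sigma, (lam, n + 1))
    fitness = np.array([np.mean(responses(ch, phi) == r) for ch in children])
    parents = children[np.argsort(fitness)[-mu:]]  # survival of the fittest
    sigma *= 0.97

best = parents[-1]
accuracy = np.mean(responses(best, phi) == r)
print(round(accuracy, 3))
```

With a few thousand challenge and response pairs this toy ES typically converges to a model that predicts the training responses with high accuracy, mirroring the behavior described above.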
III. Attacking the Reverse Fuzzy Extractor PUF
Protocol
Van Herrewege et al. proposed a PUF-based mutual
authentication protocol built on a so-called reverse fuzzy
extractor [10]. The protocol has many similarities to
a controlled PUF [7]. The main idea is to not directly
reveal the PUF responses during the authentication phase.
Instead, only the helper data of an error correction code
of the PUF response is transmitted to the verifier. By
only revealing the helper data, the protocol is supposed to
be resistant against machine learning attacks. However,
we will show that it is possible to use this helper data
to attack the PUF protocol. In the following, we first
introduce the reverse fuzzy extractor protocol and then
discuss possible attacks on the design.
A. Reverse Fuzzy Extractor Protocol
The protocol’s security is based on building a secure
sketch using a fuzzy extractor. Since PUF responses can
be unreliable, fuzzy extractors and secure sketches have
been proposed for secure and reliable PUF-based key
generation [6]. Van Herrewege et al. extended this idea
to build a reverse fuzzy extractor that can be used in a
very lightweight challenge and response protocol. Fuzzy
extractors are built on error correction codes. An error
correction code typically consists of two functions, a generation
function h = Gen(r) that generates the helper
data h for a PUF response r, and a reproduction function
r = Rep(h, r′) that recovers the response r given the
helper data h and a noisy response r′. In a controlled
PUF, both Gen as well as Rep are executed on the PUF
token. However, Rep is usually computationally expensive.
The key idea of the reverse fuzzy extractor is to avoid
the need for the computationally expensive Rep function
on the token side and outsource it to the verifier. This
makes the protocol considerably more lightweight and a
very promising solution for constrained devices such as an
RFID tag.
The protocol is a mutual authentication protocol, i.e.,
the two participating parties authenticate each other.
The two parties in this protocol are the token with the
embedded PUF and the verifier. The protocol is based on
a generation function Gen and reproduction function Rep.
Given a PUF response string r, the token computes the
helper data h = Gen(r). This helper data can be used
by the verifier with a noisy response r′ to recover r using
the reproduction function Rep with r = Rep(h, r′), as long
as the Hamming distance between r and r′ is below the
error correction threshold t. If the response r′ is too noisy,
i.e., HD(r, r′) > t, the reproduction phase can fail and
r ≠ Rep(h, r′).
In [10], the syndrome construction of the BCH(n, k, t)
error correction code with n = 255, k = 21, and t = 55 is used
for the generation phase. BCH is a very common error
correction code that has been proposed for various PUF
applications before, e.g., in [21], [8], [1], [14]. The syndrome
construction consists of a matrix multiplication of an n-bit
PUF response r with the transpose of the (n − k) × n
parity check matrix H of the used BCH error correcting
code:

h = Gen(r) = r × H^T   (5)

The BCH(255, 21, 55) error correcting code can correct
up to 55 erroneous bits of an n = 255 bit PUF response
using n − k = 234 bits of helper data h. In the reproduction
function, an error vector e is computed by decoding the
syndrome s = h − r′ × H^T using the decoding algorithm
of the used BCH code. This error vector e is subtracted
from r′ to recover r:

s = h − r′ × H^T   (6a)
e = Dec(s)   (6b)
r = r′ − e   (6c)
Due to the special form of the parity check matrix
H of the BCH code, the matrix multiplication r × H^T
can be computed very efficiently using a single LFSR.
This makes the generation function extremely lightweight
in hardware. In contrast, the decoding of the syndrome
s is computationally much more complex. However, the
decoding is only needed for the reproduction function and
is outsourced from the computationally restricted token
to the verifier. This is the key feature that makes the reverse
fuzzy extractor more lightweight than, e.g., a controlled
PUF that uses BCH for error correction.
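The algebra of the syndrome construction can be illustrated with a toy code. We use the small Hamming(7,4) code here purely for readability (the protocol itself uses BCH(255,21,55)); the mechanics of Equation (5) are identical:

```python
import numpy as np

# Toy stand-in for the BCH syndrome construction: the Hamming(7,4) code with
# its 3 x 7 parity-check matrix H (columns are the binary numbers 1..7).
H = np.array([[1, 0, 1, 0, 1, 0, 1],
              [0, 1, 1, 0, 0, 1, 1],
              [0, 0, 0, 1, 1, 1, 1]])

def gen(r):
    # Generation function: helper data h = r x H^T mod 2 (Equation 5).
    return (np.asarray(r) @ H.T) % 2

r  = np.array([1, 0, 1, 1, 0, 1, 0])    # enrollment-time PUF response
r2 = r.copy(); r2[4] ^= 1               # noisy re-measurement, one bit flipped
s = (gen(r) + gen(r2)) % 2              # syndrome of the error vector e = r XOR r2
print(s)                                 # prints [1 0 1], i.e., column 4 of H
```

Subtracting (XORing) the two helper-data vectors yields the syndrome of the error vector alone, which is exactly what the verifier decodes in the reproduction function of Equation (6).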
The protocol consists of two phases, an initialization
phase that is used once to set up the protocol and an
authentication phase. In the initialization phase the verifier
generates random challenges c_i and sends these to the
token. The token computes the responses r_i = PUF(c_i)
and directly sends these responses to the verifier, who
stores them in a database. Afterwards, the initialization
phase is permanently disabled so that the token never
again directly reveals challenge and response pairs.
The authentication protocol is depicted in Table I. The
authentication process is started by the verifier with an
authentication request to the token. The token replies with
an ID and the verifier then chooses a random nonce N
and selects q challenge and response pairs (c_i, r_i) from the
database. The verifier sends the q challenges c_i together
with the nonce N to the token. The token computes the
responses r′_i using its PUF, i.e., r′_i = PUF(c_i). In the next
step the token computes the syndromes h_i = Gen(r′_i)
using the generation function Gen. The token computes the
hash a = Hash(ID, N, r′_1, .., r′_q, h_1, .., h_q) and transmits
a and h_1, .., h_q to the verifier. The verifier computes r′_i
using the helper data h_i and the stored responses r_i with
r′_i = Rep(r_i, h_i). If a ≠ Hash(ID, N, r′_1, .., r′_q, h_1, .., h_q),
the verifier rejects the authentication and aborts. Otherwise
the verifier computes b = Hash(a, r′_1, .., r′_q) and sends
b to the prover. The prover accepts the authentication if
b = Hash(a, r′_1, .., r′_q).
TABLE I
Reverse Fuzzy Extractor Protocol

Token                                        Verifier
ID, physical PUF                             ID′, (c_1, r_1), .., (c_q, r_q)

                ←−−−− auth −−−−−
                −−−−− ID −−−−−→              if ID′ ≠ ID reject and abort
                                             N ∈_R {0,1}^l
                ←− c_1, .., c_q, N −−
r′_i ← PUF(c_i)
h_i ← Gen(r′_i)
a ← Hash(ID, N, r′_1, .., r′_q, h_1, .., h_q)
                −− h_1, .., h_q, a −→
                                             r′_i ← Rep(r_i, h_i)
                                             a′ ← Hash(ID, N, r′_1, .., r′_q, h_1, .., h_q)
                                             if a′ ≠ a reject and abort
                                             b ← Hash(a, r′_1, .., r′_q)
                ←−−−−− b −−−−−−
if Hash(a, r′_1, .., r′_q) ≠ b reject and abort
B. Discussion
The security proof provided in [10] relies on the fact that
the syndrome construction is a secure sketch as defined
in [3]. For every syndrome h there are 2^k possible responses
r with h = Gen(r). Each of these responses is equally likely.
Therefore, an attacker cannot recover direct challenge-
and-response pairs with a probability higher than 2^{−k} for
a given h. This is also true if the attacker has access
to multiple different syndromes for noisy responses of
the same challenge. The reason for this is that multiple
syndromes do not carry information about the value of the
response bits, but only the positions of the bit errors. This
means that given multiple helper data for noisy responses,
the attacker only learns which bits have flipped, but not
the value of the flipped bits. Therefore, an attacker cannot
determine the response for a challenge even if he has access
to multiple different helper data for the same challenge.
Since it is impossible to recover the correct response
from helper data for the same challenge, the protocol is
assumed to be secure against model building attacks [10].
However, we will show that the protocol can still be
attacked. The main reason is that the starting assumption
that for a model building attack an attacker needs to know
challenge and response pairs turns out to be wrong. The
helper data leaks enough information to attack the PUF
using an ES-based machine learning attack. Furthermore,
while multiple different helper data for the same challenge
do not reveal the response, they indicate which response
bits are unstable. Using the unreliability information to
attack Arbiter PUFs has recently been proposed as a fault
attack in [2]. Hence, the unreliability information provided
by the helper data can be used to model arbiter PUFs
using machine learning.
Furthermore, the security analysis only considered how
much information is leaked by helper data from a
single challenge and did not consider related challenges.
In the reverse fuzzy extractor protocol [10] an LFSR is
used to generate the individual subchallenges from the
master challenge. However, this approach to generating
subchallenges is very problematic, as recently pointed out
by Delvaux et al. in [4]. An attacker can send related
challenges to the token to be able to recover the response
bits. A single challenge c_1 actually consists of 255 64-bit
subchallenges c_{1,1}, c_{1,2}, .., c_{1,255}. These subchallenges are
computed using a 64-bit LFSR, with c_{1,1} being equal to the
initial state of the LFSR. For each subchallenge, the LFSR
is clocked 64 times. Assume the attacker has sent challenge
c_{1,1} as a master challenge to the token. The token will
then use c_{1,1}, c_{1,2}, .., c_{1,255} as the subchallenges to compute
r_1 = r_{1,1}, r_{1,2}, .., r_{1,255} and the corresponding helper data
h_1 = r_1 × H^T.
In the next step the attacker sends challenge c_{1,2}
as the master challenge to the token. The token will
now use the challenges c_{1,2}, c_{1,3}, .., c_{1,256} to get response
r_{1,2}, r_{1,3}, .., r_{1,256} and the corresponding helper data h_2.
This gives the attacker a system of linear equations with
256 unknowns and 255 × 2 = 510 equations:

h_1 = (r_{1,1}, r_{1,2}, .., r_{1,255}) × H^T   (7a)
h_2 = (r_{1,2}, r_{1,3}, .., r_{1,256}) × H^T   (7b)

The attacker can simply solve this over-defined system
of linear equations to recover the response bits
r_{1,1}, .., r_{1,256}. Hence, due to the challenge generator, an
attacker can very easily compute the challenges and responses
by sending a second related challenge to the token. In
practice, PUF responses might be unstable so that a few
errors might occur. However, this makes the attack only
slightly more difficult and an attacker can average over
multiple responses to reduce or eliminate these errors.
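The linear-algebra step of the attack can be sketched with a toy-sized system over GF(2). The helper `solve_gf2` is our own; in the real attack the matrix A would stack the BCH parity-check rows behind h_1 and h_2 and have 256 unknowns:

```python
import numpy as np

def solve_gf2(A, b):
    # Gaussian elimination over GF(2) for a (possibly over-defined) system A x = b.
    A = A.copy() % 2
    b = b.copy() % 2
    rows, cols = A.shape
    r = 0
    for c in range(cols):
        piv = next((i for i in range(r, rows) if A[i, c]), None)
        if piv is None:
            continue                       # no pivot in this column
        A[[r, piv]] = A[[piv, r]]          # swap pivot row into place
        b[[r, piv]] = b[[piv, r]]
        for i in range(rows):              # eliminate the column everywhere else
            if i != r and A[i, c]:
                A[i] ^= A[r]
                b[i] ^= b[r]
        r += 1
    x = np.zeros(cols, dtype=int)
    for i in range(r):
        x[np.argmax(A[i])] = b[i]          # read off pivot variables (free vars = 0)
    return x

# Toy version of the related-challenge attack: stacking the parity-check
# equations from two overlapping response windows pins down the response bits.
rng = np.random.default_rng(3)
r_true = rng.integers(0, 2, 6)            # the unknown response bits
A = rng.integers(0, 2, (10, 6))           # stacked parity-check rows (h_1 and h_2)
b = (A @ r_true) % 2
print(solve_gf2(A, b))
```

With high probability the stacked system has full rank, in which case the computed solution is exactly the unknown response string.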
We would like to note that this problem is due to the
specific implementation of the challenge generator. In par-
ticular, this problem occurred because the authors did not
consider chosen challenge and related challenge attacks.
LFSRs are very popular for challenge generation due to
their lightweight nature. However, this attack illustrates
how dangerous it can be if the challenge generator is
only chosen for performance reasons. A good challenge
generator should make it computationally infeasible for an
attacker to apply related challenges to the PUF. Since the
reverse fuzzy extractor protocol already uses a hash function,
one possible fix could be to use this hash function with
secure padding to generate the subchallenges. This way
the related challenge attack from [4] can be defeated.
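Such a hash-based subchallenge generator could be sketched as follows. This is only one possible fix, not the construction from [10]; SHA-256 and the counter-based input encoding are our own illustrative choices:

```python
import hashlib

def subchallenges(master: bytes, count: int, bits: int = 64):
    # c_i = Hash(master || i): unlike an LFSR, shifting the master challenge
    # does not shift the subchallenge sequence, so related-challenge queries
    # no longer produce overlapping response windows.
    subs = []
    for i in range(count):
        digest = hashlib.sha256(master + i.to_bytes(4, "big")).digest()
        subs.append(int.from_bytes(digest[: bits // 8], "big"))
    return subs

subs = subchallenges(b"master-challenge", 255)
print(len(subs))   # 255 64-bit subchallenges
```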
C. Direct ES machine learning attacks on helper data
In this section we will discuss how to attack the protocol
using an ES-based machine learning attack with the helper
data as input. Recall that in an ES machine learning attack
we need a fitness test that can determine which of a given
set of PUF models resembles the correct PUF model the
best. Typically, known challenge and response pairs are
used to compute the model accuracy which then serves
as a fitness metric. However, the direct responses are not
available in the reverse fuzzy extractor protocol and we
therefore need a different fitness test based on the helper
data.
Assume that the PUF models we test in our ES machine
learning algorithm have an accuracy large enough so that
the Hamming distance between the modeled response r′
and the correct response r is HD(r′, r) < t. Then an
attacker can compute the syndrome h′ = Gen(r′) and
compute the error vector e = Dec(h − h′). This error vector
e directly reveals the Hamming distance between r and r′
since e = r − r′. Hence, once the PUF models are accurate
enough that the Hamming distance between the modeled
PUF responses and the measured PUF responses is below
t, it is easy to determine the fitness of the PUF models.
The question is whether we can also find a fitness test for PUF
models with Hamming distances larger than t. Our solution
is rather simple. Assume that we have l challenges c_i and
their corresponding helper data h_i. For every helper data
h_i we compute all 2^k = 2^21 responses r_{i,j} for which h_i =
Gen(r_{i,j}) = r_{i,j} × H^T holds true. Then we compute the
modeled responses r′_i using the delay vector w of the PUF
model PUF′ under test with r′_i = PUF′(c_i). In the next
step we compute the minimum Hamming distance between
all possible r_{i,j} and the modeled response r′_i:

f_i = min_{j=1,..,2^k} { HD(r_{i,j}, r′_i) }   (8)

The fitness f of a PUF model is then simply given by the
sum f = Σ f_i. The smaller f, the fitter the PUF model.
When the PUF model accuracy is very low, it is likely that
for the computation of f_i the wrong response candidate r_{i,j}
is used, i.e., r_i ≠ r_{i,j}. In this case the fitness value f_i is
misleading. However, the higher the model accuracy and
the more inputs are used, the more likely it becomes that
the correct PUF model is chosen as the fittest.
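The fitness test of Equation (8) can be prototyped with a small code; we again use Hamming(7,4) as a stand-in, since enumerating the 2^21 candidates of BCH(255,21,55) is impractical in a toy script (function names are ours):

```python
import numpy as np
from itertools import product

# Toy coset-based fitness test with Hamming(7,4) standing in for BCH(255,21,55),
# where the real attack enumerates 2^21 response candidates per syndrome.
H = np.array([[1, 0, 1, 0, 1, 0, 1],
              [0, 1, 1, 0, 0, 1, 1],
              [0, 0, 0, 1, 1, 1, 1]])

def gen(r):
    return (np.asarray(r) @ H.T) % 2

def coset(h):
    # All 2^k responses r with gen(r) == h (brute force for the toy code).
    return [np.array(r) for r in product((0, 1), repeat=7)
            if np.array_equal(gen(r), h)]

def fitness_term(h, r_model):
    # f_i: minimum Hamming distance between the modeled response and any
    # response candidate consistent with the observed helper data h.
    return min(int(np.sum(r != r_model)) for r in coset(h))

r_real  = np.array([1, 0, 1, 1, 0, 1, 0])   # what the token measured
r_model = np.array([1, 0, 1, 0, 0, 1, 0])   # candidate model, one bit off
print(fitness_term(gen(r_real), r_model))    # prints 1
```

A model that is close to the real PUF produces a small fitness term even though the attacker never sees the response itself, only its helper data.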
We have simulated the attack using a 64-bit Arbiter
PUF and 7 inputs, each consisting of 234-bit helper data
corresponding to a 255-bit response string. Note
that q = 7 is the default value of the reverse fuzzy extractor
protocol and hence a single execution of the authentication
protocol reveals 7 inputs. To measure the resulting model accuracy
we used 10k challenges and responses as a reference set.
The results of the attack are shown in Figure 2. ES
machine learning attacks are non-deterministic. Given the
same input, the algorithm can lead to different results.
From 100 runs with the same inputs, 24 runs were suc-
cessful and achieved a model accuracy of at least 96%.
In a second experiment, to test the impact of noise on our
attack, we applied Gaussian noise to the delay values so
that 5% of the responses flipped. The number of successful
tries decreased only slightly, from 24 runs without noise to
19 runs with 5% noise. A single run with 7 inputs took
around 23 minutes using around 16 cores on a cluster.
From Figures 3 and 4 one can see that once a PUF
model accuracy of more than approximately 60% is achieved,
the attack is successful and the model accuracy quickly
increases to 98%. The reason is that for small
model accuracies the fitness value f is not very meaningful:
for model accuracies lower than 60% it is more likely
that r_{i,j} ≠ r_i, i.e., that the min() function chooses the
wrong index and hence adds noise to the fitness function.
The higher the model accuracy, the less likely it is that
a wrong index is chosen and hence the noise in the fitness
function decreases. This can be observed in Figure 4:
once a run achieves a certain fitness value, the run
converges to a near optimal solution. This is the reason
why a run either did not converge and stayed below 60%
accuracy, or it did converge and resulted in model
accuracies of around 98%. We also tested
the attack against 128-bit Arbiter PUFs. While the attack
was not successful for small numbers of inputs, we were
able to model a 128-bit Arbiter PUF with 200 inputs.
From 15 runs, two achieved a model accuracy of more
than 98% after 500 generations while the other runs did
not converge. A single run with 500 generations took
about 210 minutes. Hence, larger Arbiter PUFs increase
the attack complexity but cannot prevent the machine
learning attack. Nevertheless, attacking the reverse fuzzy
extractor protocol is considerably harder than attacking
plain Arbiter PUFs. Furthermore, the attack complexity is
directly related to k: increasing k also increases the attack
complexity, and hence for large values of k (e.g., k > 80) the
attack would become computationally infeasible. To show
the impact of the parameter choice of the BCH code, we
performed some experiments with different k and n values
for a 64-stage Arbiter PUF. The results are summarized
in Table II. The experiments show that when k and n are
chosen so that the resulting error correction rate t/n is
similar, both the convergence rate as well as the achieved
model accuracy are similar. However, especially for larger
values of k, the computation time and hence the attack
complexity increases significantly. Hence, by either using
a more secure PUF as a building block or by choosing
a much larger BCH code the attack could be prevented.
However, in the next section we will introduce another
attack on the reverse fuzzy extractor protocol that is
independent of the BCH parameters.
D. ES machine learning attack using noisy responses
The information for which challenges the PUF is unreliable, i.e., for which challenges the response might flip,
TABLE II
Results of CMA-ES on the helper data of the reverse fuzzy extractor when used with a 64-stage Arbiter PUF for different BCH(k, n, t) codes. The parameter k denotes the data bits, n the response length, and t the number of errors the code can correct. The presented results are averaged over attacks on 20 or more different PUF instances.

 k  |  n  | t  | used inputs | used responses | needed runs | average accuracy | average attack time
 21 | 255 | 55 |      7      |      1785      |     3.6     |      97.5%       |       88.5 m
 15 | 127 | 27 |     14      |      1778      |     3.3     |      97.7%       |       10.6 m
 10 |  63 | 13 |     28      |      1764      |     3.3     |      97.6%       |        7.8 m
Fig. 2. Result of CMA-ES on the reverse fuzzy extractor using 7 inputs, i.e., 7 syndromes, on simulated responses from a 64-bit Arbiter PUF. The model accuracy was computed using 10k reference challenges and responses. 100 runs with the same challenges and PUF instance were performed, of which 24 runs achieved a model accuracy of more than 96%.
Fig. 3. Progression of 100 runs of the CMA-ES on the reverse fuzzy
extractor with 7 inputs and a 64-bit Arbiter PUF. The Y-axis shows
the achieved model accuracy after each generation.
Fig. 4. Progression of 100 runs of the CMA-ES on the reverse fuzzy
extractor with 7 inputs and a 64-bit Arbiter PUF. The Y-axis shows
the computed fitness value f after each generation.
contains valuable information from an attacker's perspective. Becker et al. [2] proposed a fault attack on controlled PUFs that uses the information which challenge bits are unstable under voltage variations to build an accurate PUF model. For this attack, the attacker only needs to know which challenges are unstable, i.e., for which challenges the response might flip due to environmental or thermal noise. The response bits themselves, on the other hand, are not needed for this attack to work. Delvaux et al. [5] used the amount of bit flips under thermal noise in addition to the response bits for a model building attack on an Arbiter PUF. The main observation in both attacks is that the closer the delay difference for a given challenge is to zero, the more likely it is that the response bit flips. Conversely, the larger the delay difference, the less likely it is that a response bit flips. Figure 5 [2] shows the circuit-level simulated delay differences for different challenges of a 128-bit Arbiter PUF. The delay differences are approximately Gaussian and lie between roughly -100 ps and +100 ps. When the supply voltage is changed from the default 1.1 V to 1 V and 1.2 V, some of the responses flip, i.e., the sign of the delay difference changes. These challenges are highlighted in black. All flipped responses had a delay difference between -13 ps and +13 ps. Hence, every flipped bit gives us an important piece of information: the absolute delay difference for this challenge is very likely smaller than a threshold τ, where τ depends on the specific PUF instance (13 ps in this example).
Fig. 5. The delay difference in picoseconds of a 128-bit Arbiter PUF for 49k different traces. Colored in blue are the delay differences of all traces; in black are the delay differences for the traces whose output flipped when the supply voltage was altered from 1.1 V to 1 V and 1.2 V.
It was demonstrated in [2] that an ES-based machine learning algorithm can be used to build an accurate model of an Arbiter PUF knowing only which responses are unstable. The question is, how do we determine the bits that are unreliable in the reverse fuzzy extractor protocol? As it turns out, the BCH code hands us this information
on a silver platter: for a response r and a noisy response r̃ with HD(r, r̃) < t, and their corresponding public helper data h and h̃, one simply has to compute the error vector e via the reproduction function:

s = h − h̃          (9a)
e = Decode(s)       (9b)

Since e = r − r̃, the error vector tells us which bits have flipped and which bits were stable. This is exactly the information that is needed to perform a machine learning based fault attack.
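As a toy illustration of how the error vector falls out of the public helper data, the following sketch substitutes a Hamming(7,4) code for the paper's BCH codes; the helper data is the syndrome h = H·r, and all names are illustrative:

```python
import numpy as np

# Parity-check matrix of the Hamming(7,4) code: column i holds the
# 3-bit binary representation of i+1, so a single-bit error at
# position i yields the syndrome bin(i+1).
H = np.array([[int(b) for b in format(i, "03b")] for i in range(1, 8)]).T

def syndrome(x):
    """Public helper data of a response x: its syndrome H*x mod 2."""
    return H.dot(x) % 2

def decode(s):
    """Return the unique weight-<=1 error vector with syndrome s."""
    e = np.zeros(7, dtype=int)
    pos = int("".join(map(str, s)), 2)  # syndrome read as a binary position
    if pos:
        e[pos - 1] = 1
    return e

r = np.array([1, 0, 1, 1, 0, 0, 1])        # enrolled PUF response
r_noisy = r.copy(); r_noisy[4] ^= 1        # one bit flips under noise
s = (syndrome(r) + syndrome(r_noisy)) % 2  # s = h - h~ = H * (r xor r~)
e = decode(s)                              # computed from public data only
assert np.array_equal(e, r ^ r_noisy)      # reveals exactly which bit flipped
```

An eavesdropper who observes the helper data of two protocol runs for the same challenge can thus compute e from public values alone, without ever learning r.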
The used ES machine learning attack is very similar to the attack described in the previous section. The main difference is the computation of the fitness test of the PUF models. The input of the fitness test is not the helper data h_i, but the error vector e_i that was computed from multiple helper data for the same challenge under different environmental conditions¹. To evaluate the fitness of a PUF model, a modeled error vector e′ is computed for every challenge: the delay difference for every challenge is computed, and if the absolute delay difference is below a threshold τ, a bit flip is expected. The measured error vectors e_i are then correlated with these modeled error vectors e′_i, and the corresponding correlation coefficient is used as a fitness indicator. Please note that the error vector e_i does not exactly match the modeled error vectors e′_i, since not all challenges whose delay value is below τ necessarily flipped during the measurement (see Figure 5). However, if the correct PUF model was used, the two error vectors should be similar. In our experiments the correlation coefficient worked very well to test this similarity.
One open question is how to set the threshold value τ, since this value depends on the PUF instance as well as the environmental conditions. A good solution is to simply add τ to the parameters that are to be determined by the ES machine learning algorithm. By making it part of the machine learning parameters, the optimal value is determined by the algorithm on the fly and does not need to be chosen by the attacker.
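A minimal sketch of such a reliability-based fitness test, assuming the standard additive delay model of the Arbiter PUF (function names and the correlation-based scoring follow the description above; the details are illustrative):

```python
import numpy as np

def features(challenges):
    """Standard additive delay model transform for an n-stage Arbiter PUF:
    phi_i = prod_{j>=i} (1 - 2*c_j), plus a constant bias term."""
    phi = np.fliplr(np.cumprod(np.fliplr(1 - 2 * challenges), axis=1))
    return np.hstack([phi, np.ones((challenges.shape[0], 1))])

def reliability_fitness(params, challenges, observed_flips):
    """Fitness of one CMA-ES candidate: correlate the predicted
    instability (|delay difference| below the candidate's own
    threshold tau) with the observed flip pattern."""
    w, tau = params[:-1], abs(params[-1])   # tau is learned on the fly
    delta = features(challenges).dot(w)     # modeled delay differences
    predicted = (np.abs(delta) < tau).astype(float)
    return np.corrcoef(predicted, observed_flips)[0, 1]
```

Note that τ is simply appended to the CMA-ES parameter vector, so the algorithm finds a suitable threshold on its own; no response bit ever enters the fitness computation.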
We implemented this attack by adding Gaussian noise to the delay difference of each challenge, scaled so that 5% of the responses flipped, i.e., for 5% of the challenges the sign of the delay difference changed. The results of the attack for a 64-stage Arbiter PUF are depicted in Figure 6 and for a 128-stage Arbiter PUF in Figure 7. While the number of needed traces is slightly higher (14 input blocks instead of 7) compared to a direct ES machine learning attack, the needed number of inputs is still extremely small and the attack time is orders of magnitude faster. The biggest advantage of the attack, however, is that it is independent of the used BCH parameters as long as all errors are corrected.
This is a significant difference to the direct ES machine
learning attack on the helper data. In the previous attack,
¹ It is also possible to challenge the PUF under the same environmental conditions, since PUFs in practice also show some unreliability without changing environmental conditions.
Fig. 6. CMA-ES attack on a 64-stage Arbiter PUF based on the
information which bits have flipped. The responses were generated
by adding Gaussian noise to simulated delay values so that 5% of
the responses flipped. On the left Y-axis, the highest achieved model
accuracy from 100 runs with 800 generations each is shown. On the
right Y-axis the number of runs that converged, i.e., that achieved a
model accuracy of at least 90% is shown. The X-axis depicts the used
number of input blocks, each input block consisting of 255 response
bits.
Fig. 7. CMA-ES attack on a 128-stage Arbiter PUF based on the information of which bits have flipped. The responses were generated by adding Gaussian noise to simulated delay values so that 5% of the responses flipped. On the left Y-axis, the highest achieved accuracy from 100 runs with 800 generations each is shown. On the right Y-axis, the number of runs that converged, i.e., that achieved a model accuracy of at least 90%, is shown. The X-axis depicts the used number of input blocks, each input block consisting of 255 response bits.
changing the parameters of the used BCH code and using
a different PUF as a building block might be able to
prevent the attack, since the attack complexity directly
depends on the used BCH code. However, changing the
parameters of the BCH code does not affect this attack.
The information of which bits are unstable will always be
leaked by the helper data of the reverse fuzzy extractor
protocol if the attacker can send the same challenge twice.
Please note that this is not a flaw in the used BCH codes
but is a fundamental flaw in the reverse fuzzy extractor
concept. The ability to determine the error vector e given two syndromes for the same challenge is needed for the protocol execution.
While the attack from Section III-C can be prevented by using codes with a larger k or PUF architectures that are more resistant against model building attacks, this attack exploits a fundamental weakness of the reverse fuzzy extractor that cannot be easily fixed. The results presented in this section also have consequences beyond the reverse fuzzy extractor. Every protocol that uses BCH codes or similar error correction codes that reveal information about which bits are unreliable needs to carefully consider these fault attacks. For example, the presented results have a direct impact on controlled PUFs, since these systems use the same error correction codes as the reverse fuzzy extractor. A summary of the different attacks on the reverse fuzzy extractor protocol can be found in Table III.
IV. Attacking the Slender PUF Protocol
The Slender PUF protocol was first introduced in [15],
which we will refer to as the conference version and
has also been published with small modifications in [17],
which we will refer to as the journal version. The Slender
PUF protocol is based on pattern matching, which has
previously been proposed for correcting errors in PUF
based key generation [16].
The protocol enables a token (called prover in [15]) with physical access to the PUF to authenticate itself to a verifier. To do so, the verifier first builds an accurate model of the PUF during an initialization phase. During this initialization phase, the verifier receives direct challenge and response pairs from the token, which can be used to build an accurate PUF model using machine learning. After the verifier has built an accurate software model of the PUF, the initialization phase is permanently disabled and the token will never again directly reveal any challenge and response pairs. Instead, the token only reveals a permuted substring of the response bits. The protocol of the conference version is shown in Table IV. The protocol is started by the verifier by sending a nonce nonce_v to the token. The token responds with a randomly generated nonce nonce_t. The two nonces are used as a seed for a pseudo random number generator (PRNG). This PRNG is then used in step 4 to generate L challenges C. The token computes the corresponding responses R using its physical PUF instance, i.e., R = PUF(C). In step 6 the token randomly chooses an index ind, with 1 ≤ ind ≤ L, that points to a position in the response string R. The index is used to generate a substring W of predefined length L_sub from the response string, with W = R_ind, ..., R_{ind+L_sub−1}. The response string R is used in a circular manner, so that if ind + L_sub > L, then W = R_ind, ..., R_L, R_1, ..., R_{ind+L_sub−L}.
This substring W is then sent to the verifier. The verifier computes its own response string R′ using its software model of the PUF, R′ = PUF_model(C). In the last step the verifier uses the computed response string R′ to search for the index ind′ which points to the substring W′ ⊂ R′ that has the smallest Hamming distance to W:

ind′ = argmin_{j=1,...,L} (HD(W′_j, W))    (10a)

W′_j = R′_j, ..., R′_{j+L_sub−1}                       if j ≤ L − L_sub
W′_j = R′_j, ..., R′_L, R′_1, ..., R′_{j+L_sub−L}      if j > L − L_sub    (10b)
If the authentication is successful, the index ind′ computed by the verifier should be equal to the token's index ind. Note that ideally R = R′, and hence the Hamming distance between the transmitted substring W and the verifier's substring W′_{ind′} is 0. However, in practice R and R′ will differ slightly due to inaccuracies in the verifier's PUF model as well as noise in the physical PUF on the token side. Therefore the verifier accepts a few false response bits in the substring W. If the Hamming distance between W′_{ind′} and W is below an error threshold e, HD(W, W′_{ind′}) < e, then the authentication is successful. Otherwise the authentication fails and the protocol needs to be restarted.
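The selection and matching steps above can be sketched as follows (toy sizes; the paper's parameters are L = 1024 and L_sub = 256, and the function names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(42)
L, L_sub, e = 16, 8, 2                  # toy parameters

R = rng.integers(0, 2, L)               # token: responses for the L challenges
ind = int(rng.integers(0, L))           # token: secret start index
W = np.roll(R, -ind)[:L_sub]            # circular substring, as in Eq. (10b)

def match(R_model, W, e):
    """Verifier: find the circular substring of the modeled response
    string closest to W and accept if the distance is below e."""
    dists = [int(np.sum(np.roll(R_model, -j)[:len(W)] != W))
             for j in range(len(R_model))]
    ind_v = int(np.argmin(dists))
    return ind_v, dists[ind_v] < e

ind_v, accepted = match(R, W, e)        # perfect model: R' == R
assert accepted and np.array_equal(np.roll(R, -ind_v)[:L_sub], W)
```

With a perfect model the minimum distance is zero at the token's index; with a noisy PUF or an imperfect model, the threshold e absorbs the few mismatching bits.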
TABLE IV
Conference version of the Slender PUF protocol [15]

Token                                    Verifier
physical PUF                             PUF_model
               ←──── nonce_v ────
               ──── nonce_t ────→
C = G(nonce_v, nonce_t)                  C = G(nonce_v, nonce_t)
R = PUF(C)                               R′ = PUF_model(C)
W = SEL(ind, L_sub, R)
               ──── W ────→
                                         T = match(R′, W, e)
                                         T = true?
TABLE V
Journal version of the Slender PUF protocol [17]

Token                                    Verifier
physical PUF                             PUF_model
               ←──── nonce_v ────
               ──── nonce_t ────→
C = G(nonce_v, nonce_t)                  C = G(nonce_v, nonce_t)
R = PUF(C)                               R′ = PUF_model(C)
W = SEL(ind, L_sub, R)
PW = PAD(ind2, W)
               ──── PW ────→
                                         T = match(R′, PW, e)
                                         T = true?
For the journal version [17] the protocol was modified slightly, as can be seen in Table V. In the journal version of the protocol, the substring W is not directly revealed. Instead, a padding is applied to W and only the padded substring PW is revealed. In the first step of the padding process, L_pw random padding bits are generated. Then a second random index ind2 is chosen with 1 ≤ ind2 ≤ L_pw, and the string W is inserted into the padding bits at position ind2. This process is illustrated in Figure 8. The padded substring PW is transmitted to the verifier.
TABLE III
Summary of the machine learning attacks on the reverse fuzzy extractor protocol.

Attack type               | PUF stages | maximum accuracy | successful runs | used inputs | execution time per run
direct CMA-ES             | 64-bit     | 97%              | 24/100          | 7           | ≈ 23 minutes
reliability-based CMA-ES  | 64-bit     | 97%              | 10/100          | 21          | < 1 minute
direct CMA-ES             | 128-bit    | 99%              | 6/75            | 200         | ≈ 210 minutes
reliability-based CMA-ES  | 128-bit    | 97%              | 6/100           | 56          | < 1 minute
Similarly to the conference version, the verifier uses the simulated PUF output sequence R′ to find the substrings W′ and W. But due to the padding, the verifier does not know W and hence has to test all L_pw possible substrings W_k in addition to the L possible substrings W′_j to find the correct W and W′:

(ind1, ind2) = argmin_{j=1,...,L; k=1,...,L_pw} (HD(W′_j, W_k))

If the Hamming distance between the simulated substring W′_{ind1} and the received substring W_{ind2} is below a certain threshold, the authentication was successful.
Fig. 8. Generation of the substrings W and PW for the case ind1 < L − L_sub and for the case ind1 > L − L_sub.
The journal version has the disadvantages that the transmitted string PW is longer than W and that the verifier has to do more computations to find the substring W. In particular, the verifier needs to perform L·L_pw Hamming distance computations compared to L computations in the conference version. One advantage of the journal version is that two secret indices are used, and [17] suggests using these indices to extend the authentication protocol to a session key exchange protocol.
A. Security of the Slender PUF protocol
The security argument of the Slender PUF protocol
provided in [15], [17] relies on the fact that an attacker
does not have access to direct challenge and response pairs.
The Slender PUF Protocol uses XOR Lightweight Arbiter
PUFs in which the responses of multiple Arbiter PUFs
are XORed to generate a single output bit [20]. Unlike
in a classic XOR Arbiter PUF, in the XOR Lightweight
Arbiter PUF used in [15], [17] not all PUFs get the same
challenge. Instead, every uneven numbered PUF gets the
challenge applied in the reverse order, i.e, the challenge
bit for the first stage becomes the challenge bit for the
last stage and second challenge bit becomes the second to
last and so on. Otherwise the XOR Lightweight Arbiter
PUF is equal to the XOR Arbiter PUF.
XOR Arbiter PUFs are known to be much more resistant against machine learning attacks than simple Arbiter PUFs [18]. However, the number of XORs that can be used is limited, since the reliability decreases with each added XOR. In [17], a worst-case unreliability of 24.7%, 34.6%, and 43.2% was measured for the FPGA prototype of a 2-input, 3-input, and 4-input XOR Lightweight Arbiter PUF with 64 stages, respectively. But as also pointed out by Majzoobi et al., much smaller error rates have been reported for ASIC implementations [12]. While modeling attacks on XOR Lightweight PUFs are more difficult than on single Arbiter PUFs, model building attacks are still possible if enough challenge and response pairs are known. For their reference FPGA implementation in [17], empirical results show that for the used 3 XOR Lightweight PUF, 64k challenges and responses are needed for a successful machine learning attack. However, please note that this result is only valid for their PUF measurements with the very high error rate and only for their ES machine learning algorithms. If the used responses are more reliable, much fewer challenges are needed to accurately model a 3 XOR Lightweight Arbiter PUF. In the journal version it was therefore assumed that N_min = 64k response bits are needed to model a 3 XOR Lightweight PUF that has an unreliability of around 18%. In the conference version it is assumed that at least N_min = 5k responses are needed to model a noise-free 4 XOR Lightweight PUF. Note that in the literature it is reported that 12k challenges are needed [18], and hence this assumption is chosen very conservatively.
The main security argument why the Slender PUF protocol is resistant to machine learning attacks is that it would be computationally infeasible to compute enough direct challenge and response pairs to model the used XOR Lightweight Arbiter PUF [15], [17]. To get N_min challenges and responses, an attacker needs N_min/L_sub correct substrings W. To guess a substring W, an attacker has to guess the indices ind1 and ind2 correctly, for which there are L·L_PW possibilities. Therefore, for the used values of L = 1300, L_sub = 1250 and L_PW = 512, on average

(1/2)·(L·L_PW)^(N_min/L_sub) = (1/2)·(1300·512)^⌈64000/1250⌉ ≈ 2^1004    (11)

guesses are needed to correctly guess ⌈64000/1250⌉ = 52 substrings W. Note that for each of these guesses the attacker would need to perform a machine learning attack. Hence, according to this analysis from [17], it would be computationally infeasible to attack the protocol using a machine learning attack. Using the same logic, the number of needed machine learning attacks against the conference version of the protocol with L = 1024, L_sub = 256 and
N_min = 5k would on average be:

(1/2)·L^(N_min/L_sub) = (1/2)·1024^⌈5000/256⌉ ≈ 2^199    (12)
However, we will show that ES-based machine learning attacks on the protocols with these parameters are indeed feasible. This is due to the fact that in an ES machine learning attack the attacker does not need to guess the correct indices ind1 and ind2. Instead of trying to guess the indices to get more than N_min challenge and response pairs, the attacker can directly use the strings PW or W as inputs to a CMA-ES based machine learning attack. Hence, their first assumption, that for a successful machine learning attack the attacker needs a certain amount of direct challenges and responses, is wrong.
B. ES-Machine Learning Attack on the Slender PUF protocol
As discussed in Section II-C, for a successful ES machine learning attack on a PUF design, an attacker needs to be able to model the underlying PUF and needs a fitness test that can determine which PUF instances from a given set of PUF instances are the fittest, i.e., which instances model the PUF best. In the Slender PUF protocol, the challenges and responses are never directly revealed. Hence, an attacker cannot use the model accuracy as the fitness test. Let us first take a look at the conference version of the Slender PUF protocol. In the conference version, the substring W_i for challenge i of length L_sub is transmitted without applying any padding. To find the correct index ind, the verifier performs a maximum sequence alignment of his string R′_i and W_i. That is, the verifier computes all substrings W′_{i,j} for all possible indices ind and computes the Hamming distance HD(W′_{i,j}, W_i). The authentication passes if min_{j=1,...,L}(HD(W′_{i,j}, W_i)) < t.
One possibility for an attacker would be to use the same method as the verifier for the fitness test. Assume the attacker has eavesdropped (or initiated) n executions of the protocol and collected n substrings W_1, ..., W_n and their corresponding challenges C_1, ..., C_n. To test the fitness of a PUF instance generated during the ES machine learning attack, the attacker computes responses R′_1, ..., R′_n with R′_i = PUF_model(C_i). In the next step the attacker performs a maximum sequence alignment of the computed responses R′_i with the eavesdropped substrings W_i to find the minimum Hamming distance for every substring. These minimum Hamming distances are summed up and used as a fitness metric f:
W′_{i,j} = R′_{i,j}, ..., R′_{i,j+L_sub−1}                          if j ≤ L − L_sub
W′_{i,j} = R′_{i,j}, ..., R′_{i,L}, R′_{i,1}, ..., R′_{i,j+L_sub−L}   if j > L − L_sub    (13a)

f = Σ_{i=1}^{n} min_{j=1,...,L} (HD(W′_{i,j}, W_i))    (13b)
If the model accuracy of a PUF instance is high, it is very likely that the minimal Hamming distance corresponds to the correct index, and hence f is a good fitness metric for this case. However, if the PUF model accuracy is very low, the correct index might have a higher Hamming distance than a false index. In this case the wrong index is chosen and the Hamming distance is misleading. In our experiments this method worked for attacking the protocol in conjunction with a 2 XOR Lightweight PUF but was unsuccessful when used in conjunction with a 3 XOR Lightweight PUF. Hence, this fitness function works for small PUF instances such as the 2 XOR Lightweight Arbiter PUF, but for larger PUF instances a better fitness function is needed.
The main disadvantage of the proposed fitness function is that as long as the model accuracy of the PUF is low, it is very likely that the wrong index is chosen during the computation of f, and hence the fitness value is misleading. It is therefore advisable to find a fitness function that is independent of the index ind and does not need a certain minimum accuracy before becoming meaningful. This can be achieved by using a fitness function based on the Hamming weight HW() of the strings W_1, ..., W_n. In the first step the attacker computes the Hamming weights h_{W_i} of the strings W_i, i.e., the number of ones in W_i. Since W_i is a substring of R_i, the Hamming weight of R_i can be written as:

h_{R_i} = HW(R_i) = HW(W_i) + HW(R_i/W_i)

The Hamming weight of n PUF response bits follows (ideally) a binomial distribution B(n, p), with p being the probability that a response bit is one and n being the number of response bits. In an unbiased PUF an output of one and zero are equally likely, and hence p = 0.5. Therefore h_{R_i} can be seen as h_{W_i} plus a binomially distributed random variable h_noise ∼ B(L − L_sub, 0.5):

h_{R_i} = HW(W_i) + HW(R_i/W_i) = h_{W_i} + h_noise
Correspondingly, for the Hamming weight of the computed response string R′_i:

h_{R′_i} = HW(W′_i) + HW(R′_i/W′_i) = h_{W′_i} + h′_noise

To test the fitness of a PUF model, the attacker computes the response strings R′_1, ..., R′_n and the Hamming weight vectors h_W = (h_{W_1}, ..., h_{W_n}) and h_{R′} = (h_{R′_1}, ..., h_{R′_n}). The output of the fitness function f is the Pearson correlation coefficient corr() between h_W and h_{R′}:

f(W, R′) = corr(HW(W), HW(R′)) = corr(h_W, h_{R′}) = corr(h_W, h_{W′} + h_noise)    (14)
The more accurate a tested PUF model is, the smaller the difference between R and R′, and hence also the difference between W and W′. Hence, the correlation coefficient corr(h_W, h_{R′}) increases with increasing model accuracy. Therefore, this fitness function based on Hamming weights and the correlation coefficient can be used as an efficient and reliable fitness test for a CMA-ES attack.
We performed a CMA-ES machine learning attack based on this fitness function with the default parameters of L = 1024 and L_sub = 256 and a noise-free 4 XOR
Lightweight PUF as suggested in [15]. We were able to attack the protocol using 600k inputs. That is, 600k strings W_i of size L_sub = 256 were used, and for each W_i the attacker needs to compute L = 1024 response bits for R′_i. We also tested the attack on a very noisy 3 XOR Lightweight PUF by adding a Gaussian random variable to each simulated delay difference ∆D, which is the same approach as used in Section III-D. We added enough noise so that on average 23% of the response bits of the 3 XOR Lightweight PUF flipped. This is more noise than observed for the FPGA implementation of the journal version with a stable power supply, and much more than can be expected from an ASIC implementation. We were able to attack this very noisy PUF using 60k inputs. If a noise-free 3 XOR Lightweight PUF is used, the same attack works with around 30k inputs. Hence, while noise increases the attack complexity and decreases the achieved model accuracy, the impact is relatively small compared to the large amount of noise that was added. This is because a lot of noise is already present due to h_noise; the added noise from the unreliability of the PUF therefore has less impact, since the fitness function was specifically chosen to tolerate noise. Hence, the attack in general is still feasible in the presence of substantial noise. An overview of the attack for different parameters can be found in Table VI. In these experiments an independent set of 10k challenges and responses without any noise was used to determine the resulting model accuracy of the attack.
To speed up the computation it is advisable to stop unsuccessful CMA-ES runs early. We used the global mutation parameter σ of the CMA-ES in conjunction with the correlation coefficient as a stop criterion. In our experiments, a run which had not reached a certain correlation coefficient by the time the global mutation parameter σ fell below 0.7 was unlikely to converge to an acceptable model accuracy. We therefore aborted such runs early, which significantly reduces the computation time. Successful runs were stopped at a global mutation parameter of 0.3, although running for more generations would have resulted in slightly better model accuracies. Whether or not a run was successful can be determined based on the achieved correlation coefficient, since successful runs have a significantly higher correlation coefficient than unsuccessful runs. The results in Table VI are the average results of attacks on different PUF instances. The simulations were performed on an AMD Opteron 6276 cluster in which one node had 64 cores and 16 cores were used for each run. Most of the time all 64 cores of the node were used by different simulations.
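The resulting stopping rule can be summarized as a small decision function; the σ thresholds 0.7 and 0.3 come from the experiments above, while the correlation cut-off is illustrative:

```python
def run_status(sigma, corr, corr_min=0.1):
    """Early-stopping heuristic for a CMA-ES run: abort runs whose
    correlation is still low once the global mutation parameter sigma
    has shrunk below 0.7; stop successful runs once sigma is below 0.3."""
    if sigma < 0.7 and corr < corr_min:
        return "abort"      # unlikely to ever converge
    if sigma < 0.3:
        return "done"       # converged; more generations gain little
    return "continue"

assert run_status(0.9, 0.0) == "continue"
assert run_status(0.5, 0.05) == "abort"
assert run_status(0.2, 0.8) == "done"
```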
C. Attacking the journal version
In the journal version of the Slender PUF protocol a padded string PW is transmitted instead of W. In [17] the proposed parameters for a 64-stage 3 XOR Lightweight PUF with a best-case unreliability of 18% and a worst-case unreliability of 34.6% are L = 1300, L_sub = 1250 and L_PW = 512. Please note that these parameters were chosen due to the particularly noisy PUF used in [17]. In principle, the same attack as described for the conference version of the protocol can be used to attack the journal version. The only difference is that the attacker does not know W_i, as only the padded string PW_i is revealed during the protocol execution. Nevertheless, the same fitness function as in the conference version can be used by simply computing the Hamming weight of PW_i instead of W_i.
The Hamming weight of PW_i is

h_{PW_i} = HW(W_i) + HW(P_i) = h_{W_i} + h_{P_i}

with P_i being the padding bits. The fitness function can therefore be written as:

f(PW, R′) = corr(h_{PW}, h_{R′}) = corr(h_W + h_P, h_{W′} + h_noise)    (15)
Since the padding bits P_i are identically distributed random bits, h_{P_i} is a random variable with h_{P_i} ∼ B(L_PW, 0.5), which, just like h_noise, adds noise to the fitness function. Due to the parameter selection of L = 1300, L_sub = 1250 and L_PW = 512, the noise in the fitness function of the journal version is actually much smaller than in the conference version: the noise terms are h_{P_i} ∼ B(512, 0.5) and h_noise ∼ B(50, 0.5), while the signal term is h_{W_i} ∼ B(1250, 0.5). In comparison, in the conference version the noise term is h_noise ∼ B(768, 0.5) while the signal term is only h_{W_i} ∼ B(256, 0.5). Hence, the signal-to-noise ratio is actually higher in the journal version.
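Since Var(B(n, 0.5)) = n/4 and the factor 1/4 cancels, this signal-to-noise comparison reduces to a ratio of bit counts:

```python
def snr(n_signal, n_noise):
    """Variance ratio of the binomial signal and noise terms; the
    common factor 1/4 cancels."""
    return n_signal / n_noise

snr_journal = snr(1250, 512 + 50)      # h_W ~ B(1250); noise B(512) + B(50)
snr_conference = snr(256, 768)         # h_W ~ B(256);  noise B(768)
assert snr_journal > 1 > snr_conference
```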
Therefore, fewer inputs are needed for attacking the journal version. Attacking the journal version in conjunction with a noise-free 4 XOR Lightweight PUF was possible with 300k inputs, while only 20k inputs were needed to attack the protocol when a 3 XOR Lightweight PUF with an unreliability of 23% was used. An overview of the attack for different parameters can be found in Table VII.
The results show that the Slender PUF protocol can be attacked using machine learning despite the fact that the security argument provided in [15], [17] suggests that it is computationally infeasible to attack the protocol. While the attack complexity increases significantly compared to an attack on a plain XOR Lightweight PUF, it is far from computationally infeasible. Hence, the Slender PUF protocol only increases the machine learning resistance of a PUF but does not prevent the attacks for the proposed parameters. The main pitfall in the security argument of the Slender PUF protocol is the assumption that an attacker needs direct challenges and responses for a successful machine learning attack. However, as our attacks show, direct challenges and responses are not needed; other metrics such as the Hamming weight can also be used. It is therefore not enough to show that an attacker does not have access to direct challenges and responses to establish resistance against machine learning attacks.
TABLE VI
Results of the CMA-ES attack on the conference version. The numbers are averaged over multiple independent attacks on different PUF instances.

XORs | stages | unreliability | used inputs | model accuracy | needed runs | attack time
3    | 64     | 0             | 30·10³      | 95.8%          | 8.6         | 7.1 h
3    | 64     | 0             | 40·10³      | 96.6%          | 2.9         | 4.2 h
3    | 64     | 23%           | 60·10³      | 87.7%          | 2.4         | 7.0 h
4    | 64     | 0             | 600·10³     | 97.2%          | 3.9         | 92.7 h
4    | 64     | 29%           | 1.2·10⁶     | 92.3%          | 2.8         | 155 h
TABLE VII
Results of the CMA-ES attack on the journal version. The numbers are averaged over multiple independent attacks on different PUF instances.

XORs | stages | unreliability | used inputs | model accuracy | needed runs | attack time
3    | 64     | 0             | 10·10³      | 97.0%          | 12.5        | 9.3 h
3    | 64     | 0             | 20·10³      | 97.5%          | 2.2         | 3.6 h
3    | 64     | 23%           | 20·10³      | 88.0%          | 4.3         | 6.5 h
3    | 64     | 23%           | 30·10³      | 89.8%          | 1.6         | 4.3 h
4    | 64     | 0%            | 300·10³     | 96.9%          | 1.6         | 30.3 h
4    | 64     | 29%           | 400·10³     | 92.5%          | 2           | 48.0 h
The presented results show that the security gain of the Slender PUF protocol compared to a plain PUF is not as great as expected. The Slender PUF protocol should therefore only be used in conjunction with PUFs that are already very resistant to machine learning attacks. If, on the other hand, the used PUF is not very resistant against machine learning attacks, such as a 4 XOR Lightweight PUF, the Slender PUF protocol can be attacked with reasonable resources and numbers of inputs. However, this requirement is somewhat contradictory: for the protocol to work, the verifier needs to be able to build an accurate software model of the PUF, while at the same time the PUF should be very resistant against machine learning attacks. This seems hard to achieve in practice.
V. Conclusion
In this paper we have demonstrated, by attacking the
reverse fuzzy extractor and the Slender PUF protocol, how
powerful machine learning attacks can be. The main lesson
learned is that machine learning attacks are possible even
if no direct challenges and responses are available to an
attacker. Access to highly obfuscated responses, such as
the substrings in the Slender PUF protocol or the helper
data of error correction codes, can be enough to perform an
ES-based machine learning attack. A common pitfall when
evaluating the security of PUFs is to assume that a certain
number of direct challenge and response pairs is needed for
a successful machine learning attack. As demonstrated in
this paper, however, highly obfuscated responses can still
be used to accurately model a PUF. Evaluating the security
of a PUF-based protocol is therefore significantly more
difficult than determining the number of direct challenge
and response pairs an attacker has access to.
Furthermore, our attacks also demonstrate how useful
ES-based machine learning algorithms are for attacking
PUF protocols. On plain XOR Arbiter PUFs, Logistic
Regression based machine learning attacks outperform ES
algorithms [18]. However, since ES is a black-box
optimizer, the attacker has great flexibility in choosing
the fitness function, which does not need to be of any
particular form. Hence, ES machine learning attacks are
especially well suited for cases in which the attacker does
not have direct access to challenge and response pairs.
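To make this flexibility concrete, here is a minimal (1+1)-ES sketch in Python. It is deliberately much simpler than the CMA-ES of [9] used in our attacks and is meant only as an illustration: the optimizer treats the fitness function as an opaque black box, so any scalar score derived from obfuscated leakage can be plugged in without changing the algorithm.

```python
import numpy as np

def one_plus_one_es(fitness, dim, iterations=2000, seed=0):
    """Minimal (1+1)-ES with a 1/5th-success-rule style step-size
    adaptation. `fitness` is an arbitrary black box mapping a parameter
    vector to a scalar cost -- it can score Hamming weights, helper
    data, or any other obfuscated leakage; no gradient or special
    functional form is required."""
    rng = np.random.default_rng(seed)
    parent = rng.normal(size=dim)
    parent_cost = fitness(parent)
    sigma = 1.0
    for _ in range(iterations):
        child = parent + sigma * rng.normal(size=dim)
        child_cost = fitness(child)
        if child_cost <= parent_cost:   # accept if not worse
            parent, parent_cost = child, child_cost
            sigma *= 1.22               # widen the search on success
        else:
            sigma *= 0.95               # shrink it on failure
    return parent, parent_cost
```

Swapping in a different leakage model only means swapping the `fitness` callable; a gradient-based learner such as Logistic Regression would instead require the objective to be differentiable in the model parameters.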
The attack on the reverse fuzzy extractor also
demonstrates how much valuable information the helper data
of error correction codes can contain. Importantly, an
error correction code does not need to leak individual
response bits to be useful for an attacker: the information
which challenges are more reliable than others can be
exploited by a machine learning algorithm as well. This is
especially problematic for error correction codes, such as
linear codes, that directly reveal which bits have flipped
if the same challenge is applied twice. Hence, when
choosing error correction codes for delay-based PUFs,
machine learning attacks need to be carefully considered.
In particular, our results have a direct impact on the
security of controlled PUFs, since in controlled PUFs the
helper data of error correction codes is leaked similarly
to the reverse fuzzy extractor.
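As a toy illustration of this leakage (not the code construction of [10]; a syndrome-style repetition code and all concrete values are assumed purely for exposition), the following Python snippet shows how the XOR of two public helper-data strings generated for the same challenge reveals exactly which response bits flipped:

```python
import numpy as np

def repetition_syndrome(response):
    """Syndrome-construction helper data for an n-bit repetition code:
    helper bit i is response[0] XOR response[i+1]."""
    return response[1:] ^ response[0]

# Hypothetical noisy re-evaluations of the SAME challenge on the device:
rng = np.random.default_rng(1)
r1 = rng.integers(0, 2, size=8)
flips = np.array([0, 0, 1, 0, 0, 1, 0, 0])   # response bits 2 and 5 flip
r2 = r1 ^ flips

h1, h2 = repetition_syndrome(r1), repetition_syndrome(r2)
# The XOR of the two public helper strings reveals which response bits
# flipped (relative to bit 0) -- exactly the reliability information an
# ES-based attack can exploit.
print(h1 ^ h2)   # -> [0 1 0 0 1 0 0]
```

Note that neither helper string on its own discloses a response bit; it is the combination of helper data across repeated protocol runs that leaks the per-bit reliability.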
References
[1] F. Armknecht, R. Maes, A. Sadeghi, F.-X. Standaert, and
C. Wachsmann. A formalization of the security features of
physical functions. In Security and Privacy (SP), 2011 IEEE
Symposium on, pages 397–412, May 2011.
[2] G. T. Becker and R. Kumar. Active and passive side-channel
attacks on delay based PUF designs. IACR Cryptology ePrint
Archive, 2014:287, 2014.
[3] X. Boyen. Reusable cryptographic fuzzy extractors. In Proceed-
ings of the 11th ACM conference on Computer and communi-
cations security, pages 82–91. ACM, 2004.
[4] J. Delvaux, D. Gu, D. Schellekens, and I. Verbauwhede. Secure
lightweight entity authentication with strong PUFs: Mission im-
possible? In Cryptographic Hardware and Embedded Systems
(CHES 2014), volume 8731 of LNCS, pages 451–475. Springer,
2014.
[5] J. Delvaux and I. Verbauwhede. Side channel modeling attacks
on 65nm arbiter PUFs exploiting CMOS device noise. In 6th IEEE
International Symposium on Hardware-Oriented Security and
Trust (HOST 2013), June 2013.
[6] Y. Dodis, L. Reyzin, and A. Smith. Fuzzy extractors: How
to generate strong keys from biometrics and other noisy data.
In Advances in cryptology-Eurocrypt 2004, pages 523–540.
Springer, 2004.
[7] B. Gassend, D. Clarke, M. van Dijk, and S. Devadas. Controlled
physical random functions. In Computer Security Applications
Conference, 2002. Proceedings. 18th Annual, pages 149–160,
2002.
[8] J. Guajardo, S. Kumar, G.-J. Schrijen, and P. Tuyls. FPGA
intrinsic PUFs and their use for IP protection. In Cryptographic
Hardware and Embedded Systems - CHES 2007, volume 4727 of
LNCS, pages 63–80. Springer, 2007.
[9] N. Hansen. The CMA evolution strategy: A comparing review. In
Towards a New Evolutionary Computation, volume 192 of Stud-
ies in Fuzziness and Soft Computing, pages 75–102. Springer,
2006.
[10] A. Herrewege, S. Katzenbeisser, R. Maes, R. Peeters, A.-R.
Sadeghi, I. Verbauwhede, and C. Wachsmann. Reverse fuzzy
extractors: Enabling lightweight mutual authentication for puf-
enabled rfids. In Financial Cryptography and Data Security,
volume 7397 of LNCS, pages 374–389. Springer, 2012.
[11] G. Hospodar, R. Maes, and I. Verbauwhede. Machine learning
attacks on 65nm arbiter PUFs: Accurate modeling poses strict
bounds on usability. In IEEE International Workshop on In-
formation Forensics and Security (WIFS), pages 37–42. IEEE,
2012.
[12] S. Katzenbeisser, Ü. Koçabas, V. Rozic, A.-R. Sadeghi, I. Ver-
bauwhede, and C. Wachsmann. PUFs: Myth, fact or busted? A
security evaluation of physically unclonable functions (PUFs) cast
in silicon. In Cryptographic Hardware and Embedded Systems
- CHES 2012, volume 7428 of LNCS, pages 283–301. Springer,
2012.
[13] J. W. Lee, D. Lim, B. Gassend, G. E. Suh, M. Van Dijk, and
S. Devadas. A technique to build a secret key in integrated cir-
cuits for identification and authentication applications. In VLSI
Circuits, 2004. Digest of Technical Papers. 2004 Symposium on,
pages 176–179. IEEE, 2004.
[14] R. Maes, A. Van Herrewege, and I. Verbauwhede. PUFKY:
A fully functional PUF-based cryptographic key generator. In
Cryptographic Hardware and Embedded Systems - CHES 2012,
volume 7428 of LNCS, pages 302–319. Springer, 2012.
[15] M. Majzoobi, M. Rostami, F. Koushanfar, D. Wallach, and
S. Devadas. Slender PUF protocol: A lightweight, robust, and
secure authentication by substring matching. In Security and
Privacy Workshops (SPW), 2012 IEEE Symposium on, pages
33–44, May 2012.
[16] Z. Paral and S. Devadas. Reliable and efficient PUF-based key
generation using pattern matching. In Hardware-Oriented Se-
curity and Trust (HOST), 2011 IEEE International Symposium
on, pages 128–133. IEEE, 2011.
[17] M. Rostami, M. Majzoobi, F. Koushanfar, D. Wallach, and
S. Devadas. Robust and reverse-engineering resilient PUF au-
thentication and key-exchange by substring matching. Emerging
Topics in Computing, IEEE Transactions on, PP(99):1–1, 2014.
[18] U. Rührmair, F. Sehnke, J. Sölter, G. Dror, S. Devadas, and
J. Schmidhuber. Modeling attacks on physical unclonable func-
tions. In Proceedings of the 17th ACM conference on Computer
and communications security, CCS ’10, pages 237–249, New
York, NY, USA, 2010. ACM.
[19] U. Rührmair, J. Sölter, F. Sehnke, X. Xu, A. Mahmoud, V. Stoy-
anova, G. Dror, J. Schmidhuber, W. Burleson, and S. Devadas.
PUF modeling attacks on simulated and silicon data. Information
Forensics and Security, IEEE Transactions on, 8(11):1876–
1891, Nov 2013.
[20] G. Suh and S. Devadas. Physical unclonable functions for device
authentication and secret key generation. In Design Automation
Conference, 2007. DAC ’07. 44th ACM/IEEE, pages 9–14, June
2007.
[21] G. E. Suh, C. W. O’Donnell, I. Sachdev, and S. Devadas. Design
and implementation of the aegis single-chip secure processor
using physical random functions. In ACM SIGARCH Computer
Architecture News, volume 33, pages 25–36. IEEE Computer
Society, 2005.
Georg T. Becker received his PhD in Electri-
cal and Computer Engineering at the Univer-
sity of Massachusetts Amherst in 2014. He is
currently working in the Embedded Security
Group at the Horst Görtz Institute for IT
Security at the Ruhr-Universität Bochum. His
primary research interest is hardware secu-
rity with a special focus on hardware Tro-
jans, Physical Unclonable Functions and side-
channel analysis. He received a B.Sc. degree in
Applied Computer Science and a M.Sc. degree
in IT-Security from the Ruhr-Universität Bochum in 2007 and 2009
respectively.