Conference PaperPDF Available

Secure Sketch for Biometric Templates

December 2006

December 2006
4284:99-113

DOI:10.1007/11935230_7

Source
DBLP

Conference: Advances in Cryptology - ASIACRYPT 2006, 12th International Conference on the Theory and Application of Cryptology and Information Security, Shanghai, China, December 3-7, 2006, Proceedings

Authors:

Nasir D. Memon

New York University

There have been active discussions on how to derive a consistent cryptographic key from noisy data such as biometric templates, with the help of some extra information called a sketch. It is desirable that the sketch reveals little information about the biometric templates even in the worst case (i.e., the entropy loss should be low). The main difficulty is that many biometric templates are represented as points in continuous domains with unknown dis- tributions, whereas known results either work only in discrete domains, or lack rigorous analysis on the entropy loss. A general approach to handle points in continuous domains is to quantize (discretize) the points and apply a known sketch scheme in the discrete domain. However, it can be difficult to analyze the entropy loss due to quantization and to find the "optimal" quantizer. In this paper, instead of trying to solve these problems directly, we propose to ex- amine the relative entropy loss of any given scheme, which bounds the number of additional bits we could have extracted if we used the optimal parameters. We give a general scheme and show that the relative entropy loss due to sub- optimal discretization is at most (nlog 3), where n is the number of points, and the bound is tight. We further illustrate how our scheme can be applied to real biometric data by giving a concrete scheme for face biometrics.

Content uploaded by Nasir D. Memon

Content may be subject to copyright.

Secure Sketch for Biometric Templates

Qiming Li1, Yagiz Sutcu2,andNasirMemon

1Department of Computer and Information Science

2Department of Electrical and Computer Engineering

3Department of Computer and Information Science

Polytechnic University

6 Metrotech Center, Brooklyn, NY 11201

qiming.li@ieee.org, ygzstc@yahoo.com, memon@poly.edu

Abstract. There have been active discussions on how to derive a con-

sistent cryptographic key from noisy data such as biometric templates,

with the help of some extra information called a sketch. It is desirable

that the sketch reveals little information about the biometric templates

even in the worst case (i.e., the entropy loss should be low). The main

diﬃculty is that many biometric templates are represented as points in

continuous domains with unknown distributions, whereas known results

either work only in discrete domains, or lack rigorous analysis on the

entropy loss. A general approach to handle points in continuous domains

is to quantize (discretize) the points and apply a known sketch scheme in

the discrete domain. However, it can be diﬃcult to analyze the entropy

loss due to quantization and to ﬁnd the “optimal” quantizer. In this

paper, instead of trying to solve these problems directly, we propose to

examine the relative entropy loss of any given scheme, which bounds the

number of additional bits we could have extracted if we used the optimal

parameters. We give a general scheme and show that the relative entropy

loss due to suboptimal discretization is at most (nlog 3), where nis the

number of points, and the bound is tight. We further illustrate how our

scheme can be applied to real biometric data by giving a concrete scheme

for face biometrics.

Keywords: Secure sketch, biometric template, continuous domain.

1 Introduction

The main challenge in using biometric data in cryptography is that they cannot

be reproduced exactly. Some noise will be inevitably introduced into biometric

samples during acquisition and processing. There have been active discussions

on how to extract a reliable cryptographic key from such noisy data. Some

recent techniques attempt to correct the noise in the data by using some public

information Pderived from the original biometric template X. These techniques

include fuzzy commitment [12], fuzzy vault [11], helper data [19], and secure

sketch [7]. In this paper, we follow Dodis et al. [7] and call such public information

Pasketch.

X. Lai and K. Chen (Eds.): ASIACRYPT 2006, LNCS 4284, pp. 99–113, 2006.

International Association for Cryptologic Research 2006

100 Q. Li, Y. Sutcu, and N. Memon

Typically, there are two main components in a secure sketch scheme. The ﬁrst is

the sketch generation algorithm, which we will refer to as the encoder.Ittakesthe

original biometric template Xas the input, and outputs a sketch P. The second al-

gorithm is the biometric template reconstruction algorithm, or the decoder,which

takes another biometric template Yand the sketch Pas the input and outputs X.

If Yand Xare suﬃciently similar according to some similarity measure, we will

have X=X. An important requirement for such a scheme is that the sketch P

should not reveal too much information about the biometric template X.Dodis

et al. [7] gives a notion of entropy loss, which (informally speaking) measures the

advantage that Pgives to any adversary in guessing X,whenXis discrete in na-

ture (Section 3 provides the details). It is worth to note that the entropy loss is a

worst case bound for all distributions of X.

There are several diﬃculties in applying many known secure sketch tech-

niques to known types of biometric templates directly. Firstly, many biometric

templates are represented by sequences of npoints in a continuous domain (say,

R), or equivalently, points in an n-dimensional space (say, Rn). In this case,

since the entropy of the original data can be very large, and the length of the

extracted key is typically quite limited, the “entropy loss” as deﬁned in [7] can

be very high for any possible scheme. For example, Xis often a discrete approx-

imation of some points in a continuous domain (e.g., decimal fractions obtained

by rounding real numbers). As the precision of Xgets higher, both the entropy

of Xand the entropy loss from Pbecome larger, but the extracted key can

become stronger. Hence, this notion of entropy loss alone is insuﬃcient, and the

seemingly high entropy loss for this type of biometric data would be misleading.

We will discuss this issue in detail in Section 4, and give a complimentary deﬁni-

tion of relative entropy loss for noisy data in the continuous domain. Informally

speaking, the relative entropy loss of a sketch measures the imperfectness of the

rounding, which is the maximum amount of additional entropy we can obtain

by the “optimal” rounding. At the same time, the entropy loss from Pserves as

a measure of the security of the sketch in the discrete domain.

Secondly, even if the biometric templates are represented in discrete form,

there are practical problems when the entropy of the original template is high.

For example, the iris pattern of an eye can be represented by a 2048 bit binary

string called iris code, and up to 20% of the bits could be changed under noise

[9]. The fuzzy commitment scheme based on binary error-correcting codes [12]

seems to be applicable at the ﬁrst glance. However, it would be impractical to

apply a binary error-correcting code on such a long string with such a large

error-correcting capability. A two-level error-correcting technique is proposed in

[9], which essentially changes the similarity measure. As a result, the space is no

longer a metric space.

Thirdly, the similarity measures for many known biometric templates can

be quite diﬀerent from those considered in many theoretical works (such as

Hamming distance, set diﬀerence and edit distance in [7]). This can happen as

a result of technical considerations (e.g., in the case of iris codes). However,

in many cases this is due to the nature of biometric templates. For instance,

Secure Sketch for Biometric Templates 101

a ﬁngerprint template usually consists of a set of minutiae (feature points in

2-D space), and two templates are considered as similar if more than a certain

number of minutiae in one template are near distinct minutiae in the other. In

this case, the similarity measure has to consider both Euclidean distance and set

diﬀerence at the same time.

The secure sketch for point sets [5] is perhaps the ﬁrst rigorous approach to

similarity measures that do not deﬁne a metric space. A generic scheme is pro-

posed in [5] for point sets in bounded discrete d-dimensional space for any d,

where the underlying similarity measure is motivated by the similarity measure

of ﬁngerprint templates. While such a scheme is potentially applicable to ﬁn-

gerprints represented as minutiae, other types of biometrics are diﬀerent both

in representations and similarity measures, thus require diﬀerent considerations

and diﬀerent schemes.

In this paper, we study how to design secure sketch for biometric templates,

where the worst case bound can be proved. We observe that many biometric

templates can be represented in a general form: The original Xcan be considered

as a list of npoints, where each point xof Xis in a bounded continuous domain.

Under noise, each point can be perturbed by a distance less than δ,andontop

of that, at most tpoints can be replaced. Similar to [5], we will refer to the

ﬁrst noise as the white noise, and the second replacement noise.Wenotethat

this similarity measure can be applied to handwritten online signatures [8], iris

patterns [9], voice features [15], and face biometrics [17]. This formulation is

diﬀerent from that in [5] in two ways: (1) The points are in a continuous domain,

and (2) the points are always ordered.

To handle points in continuous domain, a general two step approach is to

(1) quantize (i.e., discretize) the points in Xto a discrete domain with a scalar

quantizer Qλ,whereλis the step size, and (2) apply secure sketch techniques on

the quantized points 

X=Qλ(X) in the quantized domain, which is discrete. For

example, if points in Xare real numbers between 0 and 1, assume that we have

a scalar quantizer Qλwith step size λ=0.01, such that Qλ(x)=xif and only

if xλ ≤x<(x+1)λ,theneverypointinXwould be mapped to an integer in

[0,99]. After that, we can apply a secure sketch for discrete points in the domain

[0,99]nto achieve error-tolerance.

However, there are two diﬃculties when this approach is applied. Firstly, if we

follow the notion of secure sketch and entropy loss as in [7], the quantization error

X−

Xin the ﬁrst step has to be kept in the sketch, since exact reconstruction of

Xis required by deﬁnition. However, it can be diﬃcult to give an upper bound on

the entropy loss from the quantization errors. Even if we can, it can be very large.

Furthermore, as the quantization step λbecomes very small, the bound on the

entropy loss in the quantized domain during the second step canbeveryhigh.For

instance, for x∈[0,1) and δ=0.01, when λ=0.01, the entropy loss in Step

(2) will be log 3, and the bound is tight. When λ=0.001, the entropy loss will

be log 21. However, the big diﬀerence in entropy loss in the quantized domain can

be misleading. We will revisit this example in Section 5, and will show that the

second case actually results in a stronger key if Xis uniformly distributed.

102 Q. Li, Y. Sutcu, and N. Memon

To address the above problems, we consider the following strategy. Instead of

trying to answer the question of how much entropy is lost during quantization,

we study how diﬀerent quantizers aﬀect the strength of the key that we can

ﬁnally extract from the noisy data. In particular, given a secure sketch scheme

in the discrete domain and a quantizer Q1with step size λ1,weconsiderany

quantizer Q2with step size λ2. Assuming that m1and m2are the strengths of

the keys under these two quantizers respectively, we found that it is possible to

give an upper bound on the diﬀerence between m1and m2, for any distribution

of X, and any choices of λ2(hence Q2) within a certain range. This bound can

be expressed as a function of λ1. In other words, although we do not know what

is the exact entropy loss due to the quantizer Q1, we do know that at most how

far away Q1can be from the “optimal” one. Based on this, we give a notion

of relative entropy loss for data in continuous domain. Furthermore, we show

that if Xis uniformly distributed, the relative entropy loss can be bounded by

a constant for any choice of λ1.

To illustrate how our general approach can be applied to practical biometric

templates, we give a scheme based on the authentication scheme for face biomet-

rics in [17]. We will also discuss some practical issues in designing secure sketch

schemes for biometric templates.

We note that our proposed schemes and analysis can be applied for two parties

to extract secret keys given correlated random variables (e.g., [14]), where the

random variables take values in a continuous domain (e.g. R). The entropy loss

in the quantized domain measures how much information can be leaked to an

eavesdropper, while the relative entropy loss measures how many additional bits

that we might be able to extract.

We will give a review of related works in Section 2, followed by some pre-

liminary formal deﬁnitions in Section 3. Our deﬁnition of secure sketch and its

security will be presented in Section 4. We give a general similarity measure and

our proposed schemes in Section 5, together with a security analysis and some

discussions on choosing the parameters. A concrete secure sketch scheme for face

biometrics will be given in 6.

2 Related Works

It is not surprising that the construction of the sketch largely depends on the

representation of the biometric templates and the underlying distance function

that measures the similarity. Most of the known techniques assume that the

noisy data under consideration are represented as points in some metric space.

The fuzzy commitment scheme [12], which is based on binary error-correcting

codes, considers binary strings where the similarity is measured by Hamming

distance. The fuzzy vault scheme [11] considers sets of elements in a ﬁnite ﬁeld

with set diﬀerence as the distance function, and corrects errors by polynomial

interpolation. Dodis et al. [7] further gives the notion of fuzzy extractors,wherea

“strong extractor” (such as pair-wise independent hash functions) is applied after

the original Xis reconstructed to obtain an almost uniform key. Constructions

Secure Sketch for Biometric Templates 103

and rigorous analysis of secure sketch are given in [7] for three metrics: Hamming

distance, set diﬀerence and edit distance. Secure sketch schemes for point sets in

[5] are motivated by the typical similarity measure used for ﬁngerprints, where

each template consists of a set of points in 2-D space, and the similarity measure

does not deﬁne a metric space.

On the other hand, there have been a number of works on how to extract

consistent keys from real biometric templates, which have quite diﬀerent rep-

resentations and similarity measures from the above theoretical works. Such

biometric templates include handwritten online signatures [8], ﬁngerprints [20],

iris patterns [9], voice features [15], and face biometrics [17]. These works, how-

ever, do not have suﬃciently rigorous treatment of the security, compared to

well-established cryptographic techniques. Some of the works give analysis on

the entropy of the biometrics, and approximated amount of eﬀorts required by

a brute-force attacker.

Boyen [2] shows that a sketch scheme that is provably secure may be insecure

when multiple sketches of the same biometric data are obtained. Boyen et al.

further study the security of secure sketch schemes under more general attacker

models in [1], and techniques to achieve mutual authentication are proposed.

Linnartz and Tuyls [13] consider a similar problem for biometric authentica-

tion applications. They consider zero mean i.i.d. jointly Gaussian random vectors

as biometric templates, and use mutual information as the measure of security

against dishonest veriﬁers. Tuyls and Goseling [19] consider a similar notion of

security, and develop some general results when the distribution of the original

is known and the veriﬁer can be trusted. Some practical results along this line

also appear in [18].

3 Preliminaries

3.1 Entropy and Entropy Loss in Discrete Domain

In the case where Xis discrete, we follow the deﬁnitions by Dodis et al. [7]. They

consider a variant of the average min-entropy of Xgiven P, which is essentially

the minimum strength of the key that can be consistently extracted from X

when Pis made public.

In particular, the min-entropy H∞(A) of a discrete random variable Ais

deﬁned as H∞(A)=−log(maxaPr[A=a]). For two discrete random variables

Aand B, the average min-entropy of Agiven Bis deﬁned as 

H∞(A|B)=

−log(Eb←B[2−H∞(A|B=b)]).

For discrete X, the entropy loss of the sketch Pis deﬁned as L=H∞(X)−



H∞(X|P). This deﬁnition is useful in the analysis, since for any -bit string B,

we have 

H∞(A|B)≥H∞(A)−. For any secure sketch scheme for discrete X,

let Rbe the randomness invested in constructing the sketch, it is not diﬃcult to

show that when Rcan be computed from Xand P,wehave

L=H∞(X)−

H∞(X|P)≤|P|−H∞(R).(1)

104 Q. Li, Y. Sutcu, and N. Memon

In other words, the entropy loss can be bounded from above by the diﬀerence

between the size of Pand the amount of randomness we invested in computing

P. This allows us to conveniently ﬁnd an upper bound of Lfor any distribution

of X, since it is independent of X.

3.2 Secure Sketch in Discrete Domain

Our deﬁnitions of secure sketch and entropy loss in the discrete domain follow

that in [7]. Let Mbe a ﬁnite set of points with a similarity relation S⊆M×M.

When (X, Y )∈S,wesaytheYis similar to X, or the pair (X, Y ) is similar.

Deﬁnition 1. A sketch scheme in discrete domain is a tuple (M,S,Enc,Dec),

where Enc :M→{0,1}∗is an encoder and Dec :M×{0,1}∗→Mis a decoder

such that for all X, Y ∈M,Dec(Y, Enc(X)) = Xif (X, Y )∈S.Thestring

P=Enc(X)is the sketch, and is to be made public. We say that the scheme is

L-secure if for all random variables Xover M, the entropy loss of the sketch P

is at most L. That is, H∞(X)−

H∞(X|Enc(X)) ≤L.

We call 

H∞(X|P)theleft-over entropy, which in essence measures the “strength”

of the key that can be extracted from Xgiven that Pis made public. Note that

in most cases, the ultimate goal is to maximize the left-over entropy for some par-

ticular distribution of X. However, in the discrete case, the min-entropy of Xis

ﬁxed but can be diﬃcult to analyze. Hence, entropy loss becomes an equivalent

measure which is easier to quantify.

4 Secure Sketch in Continuous Domain

In this section we propose a general approach to handle noisy data in a contin-

uous domain. We consider points in a universe U,whichisasetthatmaybe

uncountable. Let Sbe a similarity relation on U, i.e., S⊆U×U.LetMbe a

set of ﬁnite points, and let Q:U→Mbe a function that maps points in Uto

points in M. We will refer to such a function Qas a quantizer.

Deﬁnition 2. A quantization-based sketch scheme is a tuple (U,S,Q,M,Enc,Dec),

where Enc :M→{0,1}∗is an encoder and Dec :M×{0,1}∗→Mis an decoder

such that for all X, Y ∈U,Dec(Q(Y),Enc(Q(X))) = Q(X)if (X, Y )∈S.The

string P=Enc(Q(X)) is the sketch. We say that the scheme is L-secure in the

quantized domain if for all random variable Xover U, the entropy loss of Pis at

most L, i.e., H∞(Q(X)) −

H∞(Q(X)|Enc(Q(X))) ≤L.

In other words, a quantization is applied to transform the points in the con-

tinuous domain to a discrete domain, and a sketch scheme for discrete domain

is applied to obtain the sketch P. During reconstruction, we require the exact

reconstruction of the quantization Q(X) instead of the original Xin the contin-

uous domain. When required, a strong extractor can be further applied to Q(X)

to extract a key (as the fuzzy extractor in [7]). That is, we treat Q(X)asthe

“discrete original”. Similarly, we call 

H∞(Q(X)|P) the left-over entropy.

Secure Sketch for Biometric Templates 105

When Qis ﬁxed, we can use the entropy loss on Q(X) to analyze the security

of the scheme, and bound the entropy loss of P. However, using this entropy loss

alone may be misleading, since there are many ways to quantize X, and diﬀerent

quantizer would make a diﬀerence in both the min-entropy of Q(X)andthe

entropy loss. Since our ultimate goal is to maximize the left-over entropy (i.e.,

the average min-entropy 

H∞(Q(X)|P)), the entropy loss alone is not suﬃcient

to compare diﬀerent quantization strategies.

To illustrate the subtleties, we consider the following example. Let xbe a point

uniformly distributed in the interval [0,1), and under noise, it can be shifted but

still within the range [x−0.01,x+0.01). We can use a scalar quantizer Q1with

step size 0.01, such that all points in the interval [0,1) are mapped to integers

[0,99]. In this case, the min-entropy H∞(Q1(x)) = log 100. As we can see later,

there is an easy way to construct a secure sketch for such Q1(x) with entropy

loss of log 3. Hence, the left-over entropy is log(100/3) ≈5.06. Now we consider

another scalar quantizer Q2with step size 0.001, such that the range of Q2(x)is

[0,999]. A similar scheme on Q2(x) would give entropy loss of log 21, which seems

much larger than the previous log 3. However, the min-entropy of Q2(x)isalso

increased to log 1000, and the left-over entropy would be log(1000/21) ≈5.57,

which is slightly higher than the case where Q1is used.

Intuitively, for a given class of methods of handling noisy data in the quantized

domain, it is important to examine how diﬀerent precisions of the quantization

process aﬀect the strength of the extracted key. For this purpose, we propose

to consider not just one, but a family of quantizers Q, where each quantizer Q

drawn from Qdeﬁnes a mapping from Uto a ﬁnite set MQ.LetMbe the set

of such MQfor all Q∈Q. We also deﬁne a family of encoders Eand decoders

D, such that for each Qand MQ, there exist uniquely deﬁned EncQ∈Eand

DecQ∈Dthat can handle Q(X)inMQ.

Deﬁnition 3. A quantization-based sketch family is a tuple (U,S,Q,M,E,D),

such that for each quantizer Q∈Q,thereexistM∈M,Enc ∈Eand Dec ∈D,

and (U,S,Q,M,Enc,Dec)is a quantization-based sketch scheme. We say that

such a scheme is a member of the family, and is identiﬁed by Q.

Deﬁnition 4. A quantization-based sketch family (U,S,Q,M,E,D)is (L,R)-

secure for functions L,R:Q→Rif for any member identiﬁed by Q1(with

encoder Enc1) it holds that

1. This member is L(Q1)-secure in the quantized domain; and

2. For any random variable X, and any member identiﬁed by Q2(with encoder

Enc2), we have



H∞(Q2(X)|Enc2(Q2(X))) −

H∞(Q1(X)|Enc1(Q1(X))) ≤R(Q1).

In other words, to measure the security of the family of schemes, we examine two

aspects of the family. Firstly, we consider the entropy loss in the quantized do-

main for each member of the family. This is represented by the function L,which

serves as a measure of security when the quantizer is ﬁxed. Secondly, given any

106 Q. Li, Y. Sutcu, and N. Memon

quantizer in the family, we consider the question: If we use another quantizer, how

many more bits can be extracted? We call this the relative entropy loss,whichis

represented by the function R.

We observe that for some sketch families, the relative entropy loss for any given

member can be conveniently bounded by the size of of the sketch generated by

that member. We say that such sketch families are well-formed. More precisely,

we have

Deﬁnition 5. A quantization-based sketch family (U,S,Q,M,E,D)is well-formed

if for any two members (U,S,Q1,M1,Enc1,Dec1)and (U,S,Q2,M2,Enc2,Dec2),it

holds for any random variable Xthat



H∞(Q1(X)|P1,P

2)= 

H∞(Q2(X)|P1,P

2)(2)

where P1=Enc1(Q1(X)) and P2=Enc2(Q2(X)).

Theorem 1. For any well-formed quantization-based sketch family, given any

two members (U,S,Q1,M1,Enc1,Dec1)and (U,S,Q2,M2,Enc2,Dec2), it holds

for any random variable Xthat



H∞(Q2(X)|P2)−

H∞(Q1(X)|P1)≤|P1|

where P1=Enc1(Q1(X)) and P2=Enc2(Q2(X)).

Proof: First, it is not diﬃcult to show that for any random variables A, B and

C,wehave



H∞(A|B)−|C|≤ 

H∞(A|B, C)≤

H∞(A|B).(3)

Let 

X1=Q1(X)and 

X2=Q2(X). Since the sketch family is well-formed,



H∞

X1|P1,P

2=

H∞

X2|P1,P

2.(4)

Substituting Bby P1,Cby P2,andAby 

X1and 

X2respectively in (3), we have



H∞

X2|P2−|P1|≤ 

H∞

X2|P1,P

2

=

H∞

X1|P1,P

2≤

H∞

X1|P1.

(5)

5 A General Scheme for Biometric Templates

We observe that many biometric templates can be represented as a sequence of

points in some bounded continuous domain. There are two types of noise that

can occur. The ﬁrst noise, white noise, perturbs each points by a small distance,

and the second noise, replacement noise, replaces some points by diﬀerent points.

Secure Sketch for Biometric Templates 107

Without loss of generality, we assume that each biometric template Xcan be

written as a sequence X=x1,x

2,··· ,x

n,whereeachxi∈Rand 0 ≤xi<1.

In other words, X∈U=[0,1)n. For each pair of biometric templates Xand

Y,wesaythat(X, Y )∈Sif there exists a subset Cof {1,··· ,n}, such that

|C|≥n−tfor some threshold t,andforeveryi∈C, it holds that |xi−yi|<δ,

for some threshold δ.

Similar to the two-part approach in [5], we construct the sketch in two parts.

The ﬁrst part, the white noise sketch, handles the white noise in the noisy data,

and the second part, the replacement noise sketch, corrects the replacement noise.

We will concentrate on the white noise sketch in this paper, and the replacement

noise sketch can be implemented using a known secure sketch scheme for set

diﬀerence (e.g., that in [7,3]).

5.1 Proposed Quantization-Based Sketch Family

Each member of the family is parameterized by a λsuch that λ∈Rand 0 <

λ≤δ.

Quantizer Qλ.Each quantizer Qλin Qis a scalar quantizer with step size

λ∈R.Foreachx∈U,Qλ(x)=xif and only if λx≤x<λ(x+1), and

the quantization of Xis deﬁned as 

X=Qλ(X)Qλ(x1),··· ,Qλ(xn).The

corresponding quantized domain is thus Mλ=[0,1

λ]n. The encoders and the

decoders work only on the quantized domain. The white noise appeared in the

quantized domain is of level 

δλ=δ/λ. In other words, under white noise, a

point xin the quantized domain can be shifted by a distance of at most 

δλ.Let

us denote Δλ2

δλ+1.

Codebook Cλ.Furthermore, for each quantized domain Mλwe consider a code-

book Cλ, where every codeword c∈C

λhas the form c=kΔλfor some

non-negative integer k.WeuseCλ(·) to denote the function such that given

a quantized point x, it returns a value c=Cλ(x) such that |x−c|≤

δλ.Thatis,

the functions ﬁnds the unique codeword cthat is nearest to xin the codebook.

Encoder Encλ.Given a quantized 

X∈M

λ, the encoder Encλdoes the following.

1. For each xi∈

X, compute ci=Cλ(xi);

2. Output P=Encλ(

X)=d1,··· ,d

n,wheredi=xi−cifor 1 ≤i≤n.

In other words, for every xi, the encoder outputs the distance of xifrom its

nearest codeword in the codebook Cλ.

Decoder Decλ.For a corrupted template Y, it is ﬁrst quantized by 

Y=Qλ(Y).

Given P=d1,··· ,d

nand 

Y=y1,··· ,yn, and the decoder Decλdoes the

following.

1. For each yi∈

Y, compute ci=Cλ(yi−di);

2. Output 

X=Decλ(

Y)=c1+d1,··· ,c

n+dn.

In other words, the decoder shifts every yiby di, maps it to the nearest codeword

in Cλ, and shifts it back by the same distance.

108 Q. Li, Y. Sutcu, and N. Memon

5.2 Security Analysis

For each member of the sketch family with parameter λ, the diﬀerence dibe-

tween xiand piranges from −

δλto 

δλ.Intuitively,logΔλbits are suﬃcient

and necessary to describe the white noise in the quantized domain (recall that

Δλ=2



δλ+1=2δ

λ+ 1). Hence, we have

Lemma 2. The quantization-based sketch scheme (U,S,Qλ,Mλ,Encλ,Decλ)is

(nlogΔλ

)-secure in the quantized domain.

Proof: Note that the size of each digenerated in the second step of the encoder

is log Δλ. Hence the total size of the sketch is nlog Δλ. Therefore, the entropy

loss of the sketch Pis at most nlog Δλby Equation (1).

It is not diﬃcult to see that the above bound is tight. For example, when each

xis uniformly distributed in the quantized domain, the min-entropy of each x

after quantization would be log1

λ, and the average min-entropy of xgiven P

would be at most log |Cλ|=log1

λ−log Δλ.

Now we consider the relative entropy loss. First of all, we observe that the

proposed sketch family is well-formed according to Deﬁnition 5.

Lemma 3. The quantization-based sketch family deﬁned in Section 5.1 is well-

formed.

Proof: We consider any two members in the sketch family. The ﬁrst is identiﬁed

by Qλ1with step size λ1, and the second is identiﬁed by Qλ2with step size λ2.

For any p o i nt x∈X,letx1=Qλ1(x). Recall that during encoding, a code-

word is computed as c1=Cλ1(x1), and the diﬀerence d1=x1−c1is put into

the sketch. Similarly, let x2=Qλ2(x), c2=Cλ2(x2)andd2=x2−c2.

Since λ1≤δand λ2≤δ, it is easy to see that if d1,d

2and x1is known, we

can compute x2deterministically. Similarly, given d1,d

2and x2,x1can also be

determined. Thus, we have



H∞(x1|d1,d

2)= 

H∞(x1,x2|d1,d

2)= 

H∞(x2|d1,d

2).(6)

ThesameargumentscanbeappliedtoallthepointsinX. Hence, let P1=

Encλ1(X)andP2=Encλ2(X), we have



H∞

X1|P1,P

2=

H∞

X1,

X2|P1,P

2=

H∞

X2|P1,P

2.(7)

That is, the proposed sketch family is well-formed.

By combining Theorem 1 and Lemma 3, and considering that for the member

of the sketch family identiﬁed by Qλ1with step size λ1, the size of the sketch

|P1|=n(log Δλ1), we have the following lemma.

Lemma 4. For the quantization-based sketch family deﬁned in Section 5.1, given

any member identiﬁed by Qλ1with step size λ1and encoder Encλ1it holds that, for

Secure Sketch for Biometric Templates 109

every random variable X∈Uand any member identiﬁed by Qλ2with step size λ2

and encoder Encλ2, we have



H∞(Qλ2(X)|Encλ2(Qλ2(X))) −

H∞(Qλ1(X)|Encλ1(Qλ1(X))) ≤n(log Δλ1).

In other words, the relative entropy loss is at most n(log Δλ1)for Qλ1.

Not only the above is a worst case bound, we can show that the worst case can

indeed happen.

Lemma 5. The relative entropy loss in Lemma 4 is tight for suﬃciently small δ.

Proof: Fo r a ny g ive n λ1, we ﬁnd a λ2such that it is possible to ﬁnd Δλ1



(2δ/λ1+1) points W={w0,··· ,w

Δλ1−1}such that Qλ1(wi)−Cλ1(Qλ1(w1)) =

i−δ/λ1,andCλ2(wi)=cifor some codeword ci∈C

λ2. In other words, we

want to ﬁnd points such that each of them would generate a diﬀerent diin the

ﬁnal sketch with Qλ1, but would generate exactly the same number (i.e., 0) in

the sketch when Qλ2is used. Note that when δis suﬃciently small, there would

be suﬃciently many codewords in Cλ1, and it is always possible to ﬁnd such λ2

(e.g., λ2=λ1/2).

When each x∈Xis uniformly distributed over W,wecanseethatthesketch

from the scheme identiﬁed by Qλ1would reveal all information about X, but in

the case of Qλ2, the left-over entropy would be exactly logΔλ1.

Therefore, combining lemmas 2, 4 and 5 we have

Theorem 6. The quantization-based sketch family deﬁned in Section 5.1 is (L,R)-

secure where for each member in the family identiﬁed by Qλwith step size λ,where

L(Qλ)=R(Qλ)=nlog Δλ. Furthermore, the bounds are tight.

For exa m p l e , if λ=δ,wewouldhaveL(Qλ)=R(Qλ)=n(log 3). Note that

although decreasing λmight give a larger left-over entropy, this is not guaranteed.

In fact, if we use a λ<λ, by applying the above theorem on Qλ,wecansee

that it may result in a smaller left-over entropy than using Qλ(e.g., consider

the example in the proof of Lemma 5).

5.3 A Special Case

We further study a special case when each point x∈Xis independently and

uniformly distributed over [0,1). We further assume that 1/δ is an integer, and

the family of schemes only consists of members with step size λsuch that 1/λ is

an integer that is a multiple of Δλ. This additional assumption is only for the

convenience of the analysis, and would not make too much diﬀerence in practice.

In this case, the entropy loss in the quantized domain for the member identiﬁed

by Qλwith step size λwould be exactly n(log Δλ), which shows that Lemma 2

is tight. Moreover, it is interesting that the relative entropy loss in this case can

be bounded by a constant.

110 Q. Li, Y. Sutcu, and N. Memon

Corollary 7. When each x∈Xis independently and uniformly distributed, the

quantization-based sketch family deﬁned in Section 5.1 is (L,R)-secure where

for each member in the family identiﬁed by Qλwith step size λ,whereL(Qλ)=

n(log Δλ),andR(Qλ)=nlog(1 + λ

2δ)≤nlog(3/2).

Proof: The claim L(Qλ)=n(log Δλ) follows directly from Lemma 2, so we

only focus on R. Consider two members of the family identiﬁed by Qλ1and

Qλ2respectively. Without loss of generality, we assume λ1>λ

2.Considerany

x∈X,letx1=Qλ1(x), c1=Cλ1(x1). Similarly we deﬁne x2=Qλ2(x)andc2=

Cλ2(x2). Hence, the min-entropy in the quantized domain would be log(1/λ1)

and log(1/λ2) respectively.

Clearly, c1and c2are also uniformly distributed over Cλ1and Cλ2respectively,

and do not depend on d1and d2. Hence, the left-over entropy for these two

members would be log(|Cλ1|)=log 1

λ1+2δand log(|Cλ2|)=log 1

λ2+2δrespectively.

Furthermore, recall that 0 <λ

2<λ

1≤δ, and the diﬀerence between these two

quantities can be bounded as

log(|Cλ2|)−log(|Cλ1|)=logλ1+2δ

λ2+2δ<log(1 + λ1

2δ)≤log 3

Therefore, the relative entropy loss is bounded by nlog(3/2) as claimed.

5.4 Remarks

Choosing the step size λ.We can view the step size λas a measure of the precision

of 

X. Since the white noise in the continuous domain is ﬁxed at δ,whenλbecomes

smaller, the corresponding white noise in the quantized domain would increase,

and vice versa. That is intuitively why it is not possible to obtain much more left-

over entropy by simply having Xrepresented in a higher precision. In fact, it is

not diﬃcult to show that there are certain distributions of Xsuch that a smaller

step size would reveal more information. Furthermore, the scheme can be more

eﬃcient if we use a relatively larger step size, since we would need fewer bits to

represent both Xand the white noise in the quantized domain. If we use the same

quantizer for both encoding and decoding, the simplest form of white noise in the

quantized domain can be achieved when λ=δ, where a quantized xcan be either

left unchanged, or shifted by 1. In this case, from Theorem 6, we can get at most

nlog 3 additional bits if we choose other λ<δ.IfXis uniformly distributed, the

increment is at most nlog(3/2) by Corollary 7.

When λ>δ, the form of white noise in the quantized domain would remain

unchanged, but we may lose too much information about Xdue to the large

quantization step, which may result in a much lower left-over entropy. There-

fore, it is not desirable to have a step size larger than δin general. If diﬀerent

quantizers are used during encoding and decoding, with large step size (e.g., 2δ),

it is possible to reduce the white noise in the quantized domain to a special 0-1

noise, under which an xis either left unchanged or shifted to x+1, as observed

in [4]. Nevertheless, this strategy may give lower left-over entropy.

Secure Sketch for Biometric Templates 111

Handling replacement noise. After the white noise has been corrected, an exist-

ing scheme for set diﬀerence can be applied in the quantized domain to correct

the replacement noise. There are known schemes that can achieve entropy loss

of O(tlog1

λ) with small leading constant, such as those in [7,3]. Although the

replacement noise is not considered for the face biometrics that we study in

Section 6, it may need to be addressed for other biometric templates (e.g., iris

patterns [9]).

Extension to higher dimensions. It is straightforward to extend our scheme to

higher dimensions, where each x∈Xis a point in some d-dimensional space. For

example, we can apply a scalar quantizer on each coordinate of every point, and

let the distance of two points in d-dimensional space be measured by max-norm

(i.e., the maximum distance in all dimensions). The entropy loss of the resulting

scheme would be dtimes that in the current construction for 1-D points. If there

is no replacement noise, we could also expand the npoints in d-dimensional

space into nd points in 1-D and apply the proposed scheme.

The choice of the sketch family. It is important to note that evenif a quantization-

based sketch family is well-formed, it does not guarantee the existence of a “good”

quantizer in that family. Nevertheless, it does allow us to evaluate any given mem-

ber in the family with respect to the “optimal” member in the family. We consider

it a challenging open problem to ﬁnd a general algorithm to ﬁnd the optimal quan-

tizer among all possible quantizers, given certain practical constraints (e.g., the

smallest possible quantization step and the distribution of X).

6 A Concrete Construction for Face Biometrics

Face images, especially those taken from a controlled environment, can be used

as the basis of identity veriﬁcation, Here we follow the techniques employed in

[17] and make use of the singular value decomposition (SVD) of the face images

for veriﬁcation, which is a well-known strategy in the face recognition literature

(such as [10,6]). Given a face image Aof size M×N, we can always ﬁnd matrices

U,Σand Vsuch that A=UΣV T,whereΣis an M×Nmatrix with min(M, N )

non-zero elements ordered according to their signiﬁcance.Asnotedin[17],some

(say, n)mostsigniﬁcantcoeﬃcientsofΣcontain signiﬁcant identity information

of the individual. Typically nis chosen such that the sum of these ncoeﬃcients

is more than, say, 98% of the sum of all the coeﬃcients.

In [17], the biometric template of an individual is obtained as follows. First,

we take a few face images, compute the SVD, and obtain the minimum mini

and maximum maxiof the i-th signiﬁcant coeﬃcient, for 1 ≤i≤n,wheren

is chosen to be 20. The mean value ai=(maxi+mini)/2 is then taken as

a point in the template. When a new face image is presented for veriﬁcation,

its SVD is computed, and if for 1 ≤i≤n,thei-th signiﬁcant coeﬃcient is

suﬃciently close to ai, it is considered as authenticated. The scheme in [17] is

applied to face images from the Essex Faces94 Database [16], which contains

152 faces with 20 images for each face (24bit color JPEG). Twelve images per

112 Q. Li, Y. Sutcu, and N. Memon

face are randomly chosen to compute the templates, and the rest 8 are used for

testing. The experiments show that when the false accept rate is 0.005, the false

reject rate is less than 0.045.

To apply our sketch scheme, for each coeﬃcient, we further compute the min-

imum min and the maximum max of all the templates in the database (assuming

that the number of templates is large). Hence, we can compute our biometric

template Xas a sequence of npoints, where the i-th point xi=ai−min

max−min .We

set the noise level δi=k(maxi−ai)

max−min for some constant k≥1. In this way, each

point xiwill be between 0 and 1 so that our scheme can be applied. There is a

diﬀerence, however, that we have a diﬀerent δifor each point, which we have to

put as part of the sketch. Nevertheless, our analysis on the entropy loss can be

easily adapted to this case, and the diﬀerence here will not aﬀect the security of

the scheme. Here we choose λi=δifor all 1 ≤i≤n.

In this way, the sketch produced by our proposed scheme, would be the tuple

P=(min,max,λ

1,··· ,λ

n,x1−C

λ1(x1),··· ,xn−C

λn(xn))

where xi=Qλi(xi)for1≤i≤n. By applying the arguments in Theorem 6 and

Corollary 7 to each point in X,wehave

Corollary 8. The entropy loss in the quantized domain for the aforementioned

scheme is at most nlog 3.Letmbe the left-over entropy. When λi<δ

ifor any

i,1≤i≤n, let the left-over entropy be m. We have m−m≤nlog 3.Ifall

points are uniformly distributed, we have m−m≤nlog(3/2).

When n= 20, the above bounds are approximately 31.7and11.7 respectively.

References

1. X. Boyen, Y. Dodis, J. Katz, R. Ostrovsky, and A. Smith. Secure remote authen-

tication using biometric data. In Eurocrypt, 2005.

2. Xavier Boyen. Reusable cryptographic fuzzy extractors. In ACM CCS, pages

82–91, Washington DC, USA, 2004. ACM Press.

3. Ee-Chien Chang, Vadym Fedyukovych, and Qiming Li. Secure sketch for multi-set

diﬀerence. Cryptology ePrint Archive, Report 2006/090, 2006. http://eprint.

iacr.org/.

4. Ee-Chien Chang and Qiming Li. Small secure sketch for point-set diﬀerence. Cryp-

tology ePrint Archive, Report 2005/145, 2005. http://eprint.iacr.org/ .

5. Ee-Chien Chang and Qiming Li. Hiding secret points amidst chaﬀ. In Eurocrypt,

volume 4004 of LNCS, pages 59–72, 2006.

6. Yong-Qing Cheng. Human face recognition method based on the statistical model of

small sample size. In SPIE Proc. Intell. Robot and Compu. Vision, pages 85–95, 1991.

7. Yevgeniy Dodis, Leonid Reyzin, and Adam Smith. Fuzzy extractors: How to gen-

erate strong keys from biometrics and other noisy data. In Eurocrypt, volume 3027

of LNCS, pages 523–540. Springer-Verlag, 2004.

8. F. Hao and C.W. Chan. Private key generation from on-line handwritten signa-

tures. Information Management and Computer Security, 10(2), 2002.

Secure Sketch for Biometric Templates 113

9. Feng Hao, Ross Anderson, and John Daugman. Combining cryptography with bio-

metrics eﬀectively. Technical Report UCAM-CL-TR-640, University of Cambridge,

2005.

10. Z. Hong. Algebraic feature extraction of image for recognition. Pattern Recognition,

24:211–219, 1991.

11. Ari Juels and Madhu Sudan. A fuzzy vault scheme. In IEEE Intl. Symp. on

Information Theory, 2002.

12. Ari Juels and Martin Wattenberg. A fuzzy commitment scheme. In ACM CCS,

pages 28–36, 1999.

13. J.-P. Linnartz and P. Tuyls. New shielding functions to enhance privacy and pre-

vent misuse of biometric templates. In AVB PA , pages 393–402, 2003.

14. Ueli Maurer and Stefan Wolf. Information-theoretic key agreement: From weak to

strong secrecy for free. In Eurocrypt, 2000.

15. F. Monrose, M.K. Reiter, Q. Li, and S. Wetzel. Cryptographic key generation from

voice. In IEEE Symp. on Security and Privacy, 2001.

16. Libor Spacek. The essex faces94 database. http://cswww.essex.ac.uk/mv/allfaces/.

17. Y. Sutcu, T. Sencar, and N. Memon. A secure biometric authentication scheme

based on robust hashing. In ACM MM-SEC Workshop, 2005.

18. P. Tuyls, A.H.M. Akkermans, T.A.M. Kevenaar, G.J. Schrijen, A.M. Bazen, and

R.N.J. Veldhuis. Practical biometric authentication with template protection. In

AVB PA , pages 436–446, 2005.

19. P. Tuyls and J. Goseling. Capacity and examples of template-protecting biometric

authentication systems. In ECCV Workshop BioAW, pages 158–170, 2004.

20. Shenglin Yang and Ingrid Verbauwhede. Automatic secure ﬁngerprint veriﬁcation

system based on fuzzy vault scheme. In IEEE Intl. Conf. on Acoustics, Speech,

and Signal Processing (ICASSP), pages 609–612, 2005.

Hybrid Multimodal Biometric Template Protection

Article

Full-text available

Jan 2021

Biometric template disclosure starts gaining an important concern in deploying practical biometric authentication systems, where an assailant compromises the database for illegitimate access. To protect biometric templates from disclosure attacks, biometric authentication systems should meet these four requirements: security, diversity, revocability, and performance. Different methods have been suggested in the literature such as feature transformation techniques and biometric cryptosystems. However, no single method could satisfy the four requirements, giving rise to the deployment of hybrid mechanisms. In this context, the current paper proposes a hybrid system for multimodal biometric template protection to provide robustness against template database attacks. Herein, a secure sketch method is first applied to secure the fingerprint modality. Subsequently, a Dual-Tree Complex Wavelet Transform Discrete Cosine Transform (DTCWT-DCT) based watermarking is employed to entrench the fingerprint sketch into the face image. However, a 3D chaotic-map-based encryption method is employed to protect the watermarked facial image in order to offer an added security level. The experimentation performed using the ORL face database and three Fingerprint Verification Competition (FVC) fingerprint databases showed the approach’s efficiency in withstanding standard digital image watermarking attacks, brute force attacks, and information leakage. Moreover, the results revealed that the approach achieves high performance, and satisfies diversity and revocability requirements.

Information-theoretic and statistical approaches to the problem of authentication using graphical codes

Thesis

Dec 2014

Phan Ho

In recent years, authentication theory has attracted a lot of attention in different applications. However, the theoretical analysis of authentication for printed graphical codes remains an open issue. In this thesis, the problem of authentication is investigated from an information theoretic security point of view. An authentication model is analyzed using two settings, namely non-channel coding based authentication and channel coding based authentication.In the former, a reliable performance measurements of an authentication system relying on a Neyman Pearson hypothesis test is provided. Specifically, an asymptotic expression using Sanov's theorem is rst proposed to compute the probabilities of false alarm and non-detection, then a practical method based on MC simulations using importance sampling is given to estimate these very small probabilities. Thanks to these accurate computation of twos error probabilities, it is demonstrated that it is entirely possible to optimize the authentication performance when the model of the print and scan channel is known.In the latter, the setup in which the authentication message is coded using the deterministic channel codes is studied. It is showed that using channel coding is possible to enhance the authentication performance. More precisely, it is demonstrated that finding codes making the probability of false alarm and non-detection arbitrarily small at the same time is possible. Such codes have rates between the capacity of main channel and the capacity of the opponent channel. It should be noted that the legitimate receiver does not know whether the observed message comes from the legitimate or from the opponent. Therefore it is the objective of the legitimate receiver to use a decoding rule matching with the distribution law of the main channel but mismatching with the opponent channel. Then the probability of non detection is concerned with mismatched decoding. Finally, a practical scheme using parallel concatenated codes with turbo decoding is proposed. The analysis of the EXIT chart is discussed to choose channel parameters so that the authentication performance is optimized.

Secure identification for the Internet of Things

Thesis

Full-text available

Jul 2021

Marzieh Gheisari

This thesis addresses the problem of authentication of low-power devices in the Internet of Things by introducing new functionalities: group membership verification and identification. The procedure verifies if a given IoT device is a member of a group without revealing the identity of that member. Similarly, group membership identification states which group the device belongs to without knowing the identity. We propose a protocol through the joint use of two mechanisms: quantizing templates into discrete embeddings, making reconstruction difficult, and aggregating several templates into one group representation, impeding identification. First, we consider two independent procedures, one for embedding, the other for aggregating. Then, we replace those deterministic functions with functions whose parameters are learned through optimization. Finally, rather than considering group assignments that are predetermined, group assignments are also learned together with representations of the groups. Our experiments show that learning yields an excellent trade-off between security/privacy and verification/identification performances. We also investigate the impact of the sparsity level of the features representing group members on both security and verification performances. It shows it is possible to trade compactness and sparsity for better security or better verification performance.

Physical Unclonable Functions for Authenticating and Preventing Reverse Engineering of Integrated Circuits and Electronics Hardware

Article

Full-text available

May 2020

Mdshahed Enamulquadir

Electronics hardware is subject to a number of potential threats such as reverse engineering and counterfeiting. As a result, hardware authentication mechanisms and anti-reverse engineering techniques such as obfuscation and tamper-resistance are essential. In this thesis, we will present methods to approach these problems, and the primary research contributions of this thesis are a Low pass filter PUF for the authentication of PCBs and ICs; Key generation for hardware obfuscation using strong PUFs; and Session key generation using strong PUF modeling. Physical Unclonable Functions (PUFs) are probabilistic circuit primitives that extract randomness from the physical characteristics of a device. In this work, we propose a novel PUF design based on resistor and capacitor variations for low pass filters (LoPUF). We extract the process variations present at the output of the filter with the use of an inverter to digitize the output and a counter to measure output pulse widths. We have created a process to select RC pairs that can be used to reliably generate authentication IDs. The LoPUF has been evaluated in the context of both printed circuit boards and integrated circuits. As a result of the increased use of contract foundries, IP theft, excess production and reverse engineering are major concerns for the electronics and defense industries. Hardware obfuscation and IP locking can be used to make a design secure by replacing a part of the circuit with a key-locked module. In order to ensure each chip has unique keys, we propose a strong PUF-based hardware obfuscation scheme to uniquely lock each chip that is less area intensive than previous work. Communication with embedded systems can be problematic because they are limited in their capability to implement public key encryption and client-side authentication. In this work, we introduce a session key generation mechanism using PUFs. We propose a novel dynamic key generation method that depends on the ability to model certain PUF circuits using machine learning algorithms. Our proposed method also mitigates tampering attacks as no information is stored between subsequent keys. We have shown the effectiveness of our method with error-correcting capability to keep the outputs of the PUF from noise.

WGAN-E: A Generative Adversarial Networks for Facial Feature Security

Article

Full-text available

Mar 2020

Artificial intelligence technology plays an increasingly important role in human life. For example, distinguishing different people is an essential capability of many intelligent systems. To achieve this, one possible technical means is to perceive and recognize people by optical imaging of faces, so-called face recognition technology. After decades of research and development, especially the emergence of deep learning technology in recent years, face recognition has made great progress with more and more applications in the fields of security, finance, education, social security, etc. The field of computer vision has become one of the most successful branch areas. With the wide application of biometrics technology, bio-encryption technology came into being. Aiming at the problems of classical hash algorithm and face hashing algorithm based on Multiscale Block Local Binary Pattern (MB-LBP) feature improvement, this paper proposes a method based on Generative Adversarial Networks (GAN) to encrypt face features. This work uses Wasserstein Generative Adversarial Networks Encryption (WGAN-E) to encrypt facial features. Because the encryption process is an irreversible one-way process, it protects facial features well. Compared with the traditional face hashing algorithm, the experimental results show that the face feature encryption algorithm has better confidentiality.

General Constructions of Fuzzy Extractors for Continuous Sources

Chapter

Feb 2024

Fuzzy extractors are cryptographic primitives designed to generate cryptographic keys from noisy sources. While most existing fuzzy extractors are designed for discrete sources, our work focuses on fuzzy extractors for continuous sources. To evaluate the feasibility of key extraction from continuous sources, we introduce the notion of max-divergence, as classical entropy definitions are not directly applicable in this context. Building upon the concept of max-divergence, we extend the definition of fuzzy extractors to accommodate continuous sources. In addition, we introduce the notion of continuous-source fuzzy conductors, which generates strings with sufficient entropy from continuous noisy sources, and present a general approach for constructing continuous-source fuzzy extractors from continuous-source fuzzy conductors. Furthermore, we provide two constructions using lattice codes for error correction in the Euclidean space, and analyze the security of our constructions. Finally, we discuss the practical implementation of our proposed constructions.

Privacy Preserving Biometric Authentication on the blockchain for smart healthcare

Article

Aug 2022

Neyire Deniz Sarier

Privacy Preserving Biometric Authentication (PPBA) schemes are designed for anonymous authentication of patients to protect patient’s privacy in accessing healthcare services. Recently, blockchain technology in healthcare has emerged as a new research area to provide tamper-resistance and non-repudiation in e-health systems. One aspect of this research could lead to blockchain-based secure biometric identification for smart healthcare, which may face the paradox of anonymous biometric authentication on public blockchains. In this paper, we describe an efficient, fully anonymous and GDPR-compliant PPBA protocol built into the blockchain of any privacy coin such as Monero. The new protocol provides encrypted offline storage and processing in the encrypted domain. The infrastructure necessary for the online authentication is outsourced to the public blockchain that provides integrity of its data. In addition to auditing capabilities for misbehaving entities, the new system reduces the number of transactions necessary for authentication and enables revocation of biometric identities. We provide new PPBA schemes both for set difference/overlap and Euclidean distance metrics without using bilinear pairings, where the former leads to an efficient solution to the compatibility for organ transplant. We limit the generation of encrypted templates for public testing even if biometric/health data is of low min-entropy. Due to the anonymity of the cryptocurrency, we break the link between the stealth address of an authenticating user and its biometrics. We describe the user and identity privacy notions independent of the underlying privacy coin and guarantee the security of our proposal in the framework of those generic notions. Finally, we simulate the new proposal on Monero blockchain and analyze the transaction fees required for hill climbing attacks. The results show that our design leads to a natural hindrance against these attacks that could be successful even if the templates are stored as encrypted. To the best of our knowledge, this is the first efficient blockchain-based PPBA scheme that exhibits a punishment against hill climbing attacks through transaction fees.

Anthentification transparente dans un environnement numérique ubiquitaire

Thesis

Jul 2021

Takoua Guiga

L'authentification des individus est une tâche indispensable pour contribuer à la sécurité efficace des systèmes informatiques. Les solutions innovantes foisonnent et la recherche est très active, mais peu d’acteurs s’intéressent à l’expérience utilisateur pris dans la globalité de ses interactions numériques. Beaucoup promettent de remédier au « cauchemar des mots de passe » à l’aide de moyens matériels pour renforcer la sécurité, pourtant la promesse d’un moyen d’authentification universel, sécurisé et simple est rarement tenue. Dans ce contexte de multiplicité des facteurs d’authentification, la biométrie physiologique est souvent évoquée comme alternative. Ces technologies sont toutefois très controversées, en raison notamment du risque d’atteinte à la vie privée et de la non-révocabilité de ces données. La biométrie comportementale, moins intrusive et plus simple d’emploi, peut constituer une alternative intéressante, à même de concilier les exigences apparemment contradictoires de sécurité, d’usabilité et de respect de la vie privée.Cependant, les systèmes d’authentification comportementale actuels s’appuient principalement sur un seul objet connecté, notamment le smartphone, ce qui semble naturel puisque d’une part celui-ci est devenu le terminal de référence des utilisateurs, et que d’autre part, la multitude de capteurs dont il dispose couplée à ses possibilités de calcul, de stockage et de connectivité en font un outil de choix pour récupérer et traiter les données nécessaires à l’authentification comportementale en continu. Or, le parcours numérique d’un individu ne se limite pas aux interactions avec son smartphone. De nombreux utilisateurs disposent d’autres terminaux tels que les ordinateurs personnels et tablettes dans un cadre professionnel ou privé. De plus, grâce à l’essor des objets connectés, dont beaucoup disposent de capteurs susceptibles de récupérer des informations comportementales fortement authentifiantes, l’utilisateur se trouve immergé dans un environnement numérique ubiquitaire dans lequel la fonction d’authentification devrait venir s’intégrer naturellement.Dans le cadre de cette thèse, nous proposons l'utilisation de la biométrie pour lier l'utilisateur avec ses objets connectés implicitement avec une solution multidevice, basée sur un cercle de confiance partagé entre les différents objets connectés permettant une authentification sécurisée de l'utilisateur et ses devices, qu'on appelle Aura d'authentification.Nous avons réalisé une étude de l'état de l'art sur les systèmes d'authentification et leurs exigences sécuritaires, les algorithmes de protection des données biométriques et sur les Objets connectés et l'Internet des objets IoT. Nous avons défini une méthode d'authentification transparente via un unique objet connecté, limitant les actions de l'utilisateur tout en protégeant sa vie privée.Nous avons validé cette approche sur des bases de données conséquentes en prenant en particulier le smartphone et l'ordinateur portable comme exemple d'objets intelligents. Nous proposons une approche originale d'authentification transparente via plusieurs objets connectés dans un environnement numérique ubiquitaire, qu'on appelle dans la suite Aura d'authentification. Cette approche, basée sur le transfert de confiance entre les objets connectés, s'appuie sur de la biométrie et assure la facilité d'usage et la sécurisation de l'accès de l'utilisateur à ses terminaux et services dans le respect de sa vie privée.

Comments on Biometric-based Non-transferable Credentials and their Application in Blockchain-based Identity Management

Article

Feb 2021
COMPUT SECUR

Neyire Deniz Sarier

In IT-ecosystems, access to unauthorized parties is prevented with credential-based access control techniques (locks, RFID cards, biometrics, etc.). Some of these methods are ineffective against malicious users who lend their credentials to other users. To obtain non-transferability, Adams proposed a combination of biometrics encapsulated in Pedersen commitment with Brands digital credential. However, Adams’ work does not consider the Zero Knowledge Proof-of Knowledge (ZKPoK) system for Double Discrete Logarithm Representation of the credential. Besides, biometrics is used directly, without employing any biometric cryptosystem to guarantee biometric privacy, thus Adams’ work cannot be GDPR-compliant. In this paper, we construct the missing ZKPoK protocol for Adam’s work and show its inefficiency. To overcome this limitation, we present a new biometric-based non-transferable credential scheme that maintains the efficiency of the underlying Brands credential. Secondly, we show the insecurity of the first biometric-based anonymous credential scheme designed by Blanton et al.. In this context, we present a brute-force attack against Blanton’s biometric key generation algorithm implemented for fuzzy vault. Next, we integrate an Oblivious PRF (OPRF) protocol to solve the open problem in Blanton’s work and improve its efficiency by replacing the underlying signature scheme with PS-signatures. Finally, we evaluate application scenarios for non-transferable digital/anonymous credentials in the context of Blockchain-based Identity Management (BBIM). We show that our modified constructions preserve biometric privacy and efficiency, and can easily be integrated into current BBIM systems built upon efficient Brands and PS-credentials.

Privacy Preserving Multimodal Biometric Authentication in the Cloud

Conference Paper

Jan 2017

Neyire Deniz Sarier

Combining cryptography with biometrics effectively

Article

Full-text available

Jan 2006

Abstract We propose the first practical and secure way to integrate the iris biometric into cryptographic applications. A repeatable binary string, which we call a biometric key, is generated reliably from genuine iris codes. A well-known difficulty has been how to cope with the 10 to 20% of error bits within an iris code and derive an errorfree key. To solve this problem, we carefully studied the error patterns within iris codes, and devised a two-layer error correction technique that combines Hadamard and Reed-Solomon codes. The key is generated from a subject’s iris image with the aid of auxiliary error-correction data, which do not reveal the key, and can be saved in a tamper-resistant token such as a smart card. The reproduction of the key depends on two factors: the iris biometric and the token. The attacker has to procure both of them to compromise the key. We evaluated our technique using iris samples from 70 different eyes, with 10 samples from each eye. We found that

New Shielding Functions to Enhance Privacy and Prevent Misuse of Biometric Templates

Conference Paper

Full-text available

Jun 2003
Lect Notes Comput Sci

In biometrics, a human being needs to be identified based on some characteristic physiological parameters. Often this recognition is part of some security system. Secure storage of reference data (i.e., user templates) of individuals is a key concern. It is undesirable that a dishonest verifier can misuse parameters that he obtains before or during a recognition process. We propose a method that allows a verifier to check the authenticity of the prover in a way that the verifier does not learn any information about the biometrics of the prover, unless the prover willingly releases these parameters. To this end, we introduce the concept of a delta-contracting and epsilon-revealing function which executes preprocessing in the biometric authentication scheme. It is believed that this concept can become a building block of a public infrastructure for biometric authentication that nonetheless preserves privacy of the participants.

A Fuzzy Vault Commitment Scheme

Article

Jan 2002

Cryptographic Key Generation from Voice (Extended Abstract)

Article

Jan 2001

We propose a technique to reliably generate a crypto- graphic key from a user's voice while speaking a password. The key resists cryptanalysis even against an attacker who captures all system information related to generating or verifying the cryptographic key. Moreover, the technique is sufficiently robust to enable the user to reliably regener- ate the key by uttering her password again. We describe an empirical evaluation of this technique using utterances recorded from users.

Human face recognition method based on the statistical model of small sample size

Article

Feb 1992
Proceedings of SPIE

Automatic recognition of human faces is a frontier topic in computer vision. In this paper, a novel recognition approach to human faces is proposed, which is based on the statistical model in the optimal discriminant space. Singular value vector has been proposed to represent algebraic features of images. This kind of feature vector has some important properties of algebraic and geometric invariance, and insensitiveness to noise. Because singular value vector is usually of high dimensionality, and recognition model based on these feature vectors belongs to the problem of small sample size, which has not been solved completely, dimensionality compression of singular value vector is very necessary. In our method, an optimal discriminant transformation is constructed to transform an original space of singular value vector into a new space in which its dimensionality is significantly lower than that in the original space. Finally, a recognition model is established in the new space. Experimental results show that our method has very good recognition performance, and recognition accuracies of 100 percent are obtained for all 64 facial images of 8 classes of human faces.

Information-Theoretic Key Agreement: From Weak to Strong Secrecy for Free

Conference Paper

May 2000

One of the basic problems in cryptography is the generation of a common secret key between two parties, for instance in order to communicate privately. In this paper we consider information-theoretically secure key agreement. Wyner and subsequently Csiszár and Körner described and analyzed settings for secret-key agreement based on noisy communication channels. Maurer as well as Ahlswede and Csiszár generalized these models to a scenario based on correlated randomness and public discussion. In all these settings, the secrecy capacity and the secret-key rate, respectively, have been defined as the maximal achievable rates at which a highly-secret key can be generated by the legitimate partners. However, the privacy requirements were too weak in all these definitions, requiring only the ratio between the adversary’s information and the length of the key to be negligible, but hence tolerating her to obtain a possibly substantial amount of information about the resulting key in an absolute sense. We give natural stronger definitions of secrecy capacity and secret-key rate, requiring that the adversary obtains virtually no information about the entire key. We show that not only secret-key agreement satisfying the strong secrecy condition is possible, but even that the achievable key-generation rates are equal to the previous weak notions of secrecy capacity and secret-key rate. Hence the unsatisfactory old definitions can be completely replaced by the new ones. We prove these results by a generic reduction of strong to weak key agreement. The reduction makes use of extractors, which allow to keep the required amount of communication negligible as compared to the length of the resulting key.

A Fuzzy Vault scheme

Article

Feb 2006

We describe a simple and novel cryptographic construction that we refer to as a fuzzy vault. A player Alice may place a secret value κ in a fuzzy vault and “lock” it using a set A of elements from some public universe U. If Bob tries to “unlock” the vault using a set B of similar length, he obtains κ only if B is close to A, i.e., only if A and B overlap substantially. In constrast to previous constructions of this flavor, ours possesses the useful feature of order invariance, meaning that the ordering of A and B is immaterial to the functioning of the vault. As we show, our scheme enjoys provable security against a computationally unbounded attacker. Fuzzy vaults have potential application to the problem of protecting data in a number of real-world, error-prone environments. These include systems in which personal information serves to authenticate users for, e.g., the purposes of password recovery, and also to biometric authentication systems, in which readings are inherently noisy as a result of the refractory nature of image capture and processing.

Automatic Secure Fingerprint Verification System Based on Fuzzy Vault Scheme

Conference Paper

Apr 2005
Acoust Speech Signal Process

We construct an automatic secure fingerprint verification system based on the fuzzy vault scheme to address a major security hole currently existing in most biometric authentication systems. The construction of the fuzzy vault during the enrollment phase is automated by aligning the most reliable reference points between different templates, based on which the converted features are used to form the lock set. The size of the fuzzy vault, the degree of the underlying polynomial, as well as the number of templates needed for reaching the reliable reference point are investigated. This results in a high unlocking complexity for attackers with an acceptable unlocking accuracy for legal users.

Hong, Z.: Algebraic Feature Extraction of Image for Recognition. Pattern Recognition 24, 211-219

Article

Dec 1991
PATTERN RECOGN

Zi-Quan Hong

The extraction of image features is one of the fundamental tasks in image recognition. Up until now, there have been several kinds of features to be used for the purpose of image recognition as follows: (1) visual features; (2) statistical features of pixel; (3) transform coefficient features. In addition, there is another kind of feature which the author believes is very useful, i.e. (4) algebraic features which represent intrinsic attributions of an image. Singular Values (SV) of image are this kind of feature. In this paper, we prove that SV feature vector has some important properties of algebraic and geometric invariance, and insensitiveness to noise. These properties are very useful for the description and recognition of images. As an example, SV feature vector is used for the problem of recognizing human facial images. In this paper, using SV feature vector samples of facial images, a normal pattern Bayes classification model based on Sammon's optimal descriminant plane is constructed. The experimental result shows that SV feature vector has good performance of class separation.

Reusable cryptographic fuzzy extractors

Conference Paper

Oct 2004

Xavier Boyen

We show that a number of recent definitions and constructions of fuzzy extractors are not adequate for multiple uses of the same fuzzy secret---a major shortcoming in the case of biometric applications. We propose two particularly stringent security models that specifically address the case of fuzzy secret reuse, respectively from an outsider and an insider perspective, in what we call a chosen perturbation attack. We characterize the conditions that fuzzy extractors need to satisfy to be secure, and present generic constructions from ordinary building blocks. As an illustration, we demonstrate how to use a biometric secret in a remote fuzzy authentication protocol that does not require any storage on the client's side.

Secure Sketch for Biometric Templates

Abstract

Recommended publications

Improving Reliability of Biometric Hash Generation through the Selection of Dynamic Handwriting Feat...

Rethinking Secure Precoding via Interference Exploitation: A Smart Eavesdropper Perspective

Binary Discriminant Analysis for Face Template Protection

Efficient Methods for the Addressing/Decoding of a Lattice-based