Gait Recognition System Tailored for Arab Costume of The Gulf Region
Tamer Shanableh, Department of Computer Science and Engineering, American University of Sharjah, tshanableh@aus.edu
Khaled Assaleh, Department of Electrical Engineering, American University of Sharjah, kassaleh@aus.edu
Layla Al-Hajjaj, Computer Science Program, American University of Sharjah, g00021907@aus.edu
AbdulWahab Kabani, Computer Science Program, American University of Sharjah, b00020950@aus.edu
Abstract –
Existing work on gait recognition is focused on casual (western) costumes and is hence not suitable for the Gulf region, where long gowns are worn by both men and women. This paper proposes a gait recognition solution that is suitable for both Gulf and casual costumes. The solution is based on computing an adaptive image prediction between consecutive images. The resultant predictions are then accumulated into one image and transformed using either the Discrete Cosine Transform (DCT) or the Radon transform. The feature vectors of the gait are computed from such transformed images. Feature modeling based on polynomial networks follows. The proposed solution is tested on a dataset of around 100 participants with mixed genders and mixed costumes. The proposed system yields impressive classification rates approaching 100% accuracy.
Keywords –
Human identification; computer vision; motion analysis;
gait biometric
1. INTRODUCTION
In biometrics, people are identified based on their characteristics such as voice, iris, fingerprint, hand geometry and face. It has been reported that such identification can also be based on the way that a human walks [1]. Such a biometric is referred to as gait. Basically, video cameras are used to acquire video sequences of individuals, who are then recognized based on the way they walk. Gait recognition has a number of attractive characteristics when compared to existing biometrics [2]. For example, it does not require physical contact as fingerprint or hand geometry recognition does. It also does not require high image resolution or special image acquisition conditions, as face recognition does for instance. Lastly, it is non-intrusive and can recognize people at a distance without their knowledge or direct involvement.
In 2005, a research group from the University of South Florida issued a human gait recognition challenge [3]. The group compiled a dataset of video sequences with different covariates such as camera viewing angle, walking surface type, carrying conditions (where a person can be carrying a briefcase, for example), shoe type (walking in heels, for instance, will affect the gait) and the video capturing time. For the latter, most video sequences were acquired in a second round six months after the first shooting. The dataset contains data for experiments of increasing difficulty levels. A total of 1,870 video sequences were acquired from 122 participants. The dataset is made available and is used for benchmarking new solutions in gait recognition.
However, this dataset and subsequently all the previously
proposed solutions are based on western style of dress code.
Such solutions are likely to fail when applied to Gulf style dress
code including white/black robes, head gears/scarves and veils.
This is so because of the nature of the feature extraction methods, which exploit the gait cycle that depends on the movements of the legs.
This paper proposes an efficient feature extraction and classification scheme for gait recognition for the Arab costume in the Gulf region. The proposed scheme is shown to work for western costumes as well.
In general, gait recognition based on video sequences is divided into a number of steps:
1. Segmentation: this step entails identifying the pixel locations
belonging to the subject to be identified. The segmented images
are binarized resulting in what is known as “silhouette frames”.
One approach to this segmentation is through background modeling and separation. For instance, in [3] it was proposed to extract bounding boxes of the subjects and then compute the mean vector and covariance matrix of the background pixels. The pixels of the bounding box containing the subject are then classified into either foreground or background using Mahalanobis distances from the background model. The distances are then classified into foreground or background based on their likelihoods, which are estimated using an Expectation Maximization (EM) procedure.
Variants of this segmentation algorithm are also reported in the
literature. For instance, in [5] the pixels’ Mahalanobis distances
from the background model were thresholded into either
foreground or background without the need for computing
likelihoods through the EM algorithm. Other approaches
include extracting principal components of silhouette boundary
vector variations [6] or Fourier descriptors [7].
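For concreteness, the thresholded variant of this background model [5] can be sketched as below. This is a minimal numpy sketch, not the authors' code; the per-pixel background mean, the shared colour covariance, and the threshold value are illustrative assumptions.

```python
import numpy as np

def silhouette_mask(frame, bg_mean, bg_cov_inv, threshold=3.0):
    """Label a pixel as foreground (True) when its Mahalanobis distance
    from the background colour model exceeds the threshold."""
    diff = frame.astype(np.float64) - bg_mean                  # (H, W, 3)
    d2 = np.einsum('hwi,ij,hwj->hw', diff, bg_cov_inv, diff)   # squared distances
    return np.sqrt(d2) > threshold

# Background model estimated from a stack of subject-free frames (toy data)
rng = np.random.default_rng(0)
bg_frames = rng.normal(0.5, 0.01, size=(20, 8, 8, 3))
bg_mean = bg_frames.mean(axis=0)                               # per-pixel mean
cov = np.cov(bg_frames.reshape(-1, 3).T)                       # shared 3x3 colour covariance
bg_cov_inv = np.linalg.inv(cov + 1e-9 * np.eye(3))

frame = bg_mean.copy()
frame[2:6, 2:6] += 0.5                                         # synthetic "subject" patch
mask = silhouette_mask(frame, bg_mean, bg_cov_inv)
```

Binarizing such a mask per frame yields the silhouette frames discussed above.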
2. Feature extraction: This step can be preceded by what is known as gait cycle estimation, where a cycle is the set of images starting from the right heel touching the ground all the way to where it touches the ground again. This information can be used to segment the sequences into cycles and then align them using various techniques such as Population Hidden Markov
Models (PHMMs) [4]. Features are then based on averaged
subsequences [8]. Other feature extraction techniques are
reported for characterizing gait dynamics such as stride, stride
speed, length and cadence [9]. Others used static body
information such as ratios of various body parts [9]. Feature
vectors composed of the amplitude of the spectrum of key
silhouette frames are also reported in [2]. More recently, dynamic features from averaged silhouette cycles have been extracted by Gabor-based discriminative common vectors (DCV) analysis [10]. Likewise, [11] proposed the use of Kernel-based Principal Component Analysis (KPCA) to extract gait features. In other
approaches, the human body components are studied separately
and feature vectors are extracted accordingly [12].
3. Feature modeling and similarity measures: Here the extracted
features are compared against the stored entries in the dataset.
Reported measures include computing Euclidean distances in the Linear Discriminant Analysis (LDA) space [4, 13], symmetric group theoretic distances [14], and the normalized Euclidean distance between the projection centroids of two gait sequences [15]. The dynamics of the gait sequences can also be modeled by hidden Markov models (HMMs), as reported in [16].
The existing work on gait recognition is, however, based on identifying people in western or casual costumes, namely pants and shorts. Such solutions are not suitable for identifying individuals in the Gulf region of the Middle East. The local dress code in the Gulf region for males includes robes and head gear. Likewise, the dress code for females includes robes and head scarves or face veils. Examples of such costumes are shown in Figure 1. It is clear that the gap between the legs of the individuals is concealed; hence all the techniques based on gait cycles do not apply. Note that this problem can also be present if the individuals are dressed in long skirts; hence the problem is not specific to the Gulf costume. We propose a solution that applies to both casual and Gulf costumes based on accumulating the adaptive prediction errors of consecutive images, as shall be explained in Section 3.
The rest of this paper is organized as follows. The compiled
dataset and data acquisition procedure is described in Section 2.
Feature extraction and motion representation for casual and Gulf costumes are presented in Section 3. The classification problem is then formalized using polynomial networks in Section 4. Experimental results are presented and discussed in Section 5 prior to the conclusion in Section 6.
2. DATASET DESCRIPTION
Although the purpose of this research is to devise a method for gait recognition for individuals in Gulf costumes, we nonetheless need to verify that the proposed solution is also applicable to recognizing individuals in casual costumes (mainly with pants or shorts). As such, the same system can be deployed for identifying individuals with mixed costumes.
Similar to the setup reported in [3], the camera was positioned 10 meters away from the walking subjects. However, a single digital camera with one view only was used. The video capturing took place in the rotunda of one of our lecture buildings. Examples of participants with different costumes are shown in Figure 1.
(a)
(b)
(c)
(d)
Fig. 1. Example participants with different costumes. (a) Female
with Gulf costume (b) Male with Gulf costume (c) Female with
casual costume (d) Male with casual costume.
A total of 103 subjects participated in the data collection. All participants are undergraduate students of the same age group, between 18 and 22 years old. Out of the 103 subjects, 53 participated in Gulf costumes (33 females and 20 males). The other 50 subjects participated in casual costumes (11 females and 39 males).
Each participant was asked to walk naturally across the rotunda back and forth a total of 8 times, out of which 4 instances were captured with a walk from right to left and 4 in the other direction.
3. FEATURE EXTRACTION
The existing literature on gait recognition depends heavily on the extraction of gait cycles. Such a cycle can be defined as the sequence of images from when the right heel of an individual touches the ground until it touches the ground again. The extraction of gait cycles depends on observing the
gap between the legs of an individual, which starts at a certain position and ends after a full cycle with the legs back in the same position. Unfortunately, with the Gulf costume, the gap between the legs is not at all apparent; hence a different approach to feature extraction shall be sought.
We propose to extract the motion of an individual and accumulate it into one or two images. Feature vectors can then be extracted from such images that describe the motion. Note that in the dataset description it was mentioned that the subjects are walking in front of a static background that contains various stationary objects. Hence the preprocessing step of subject extraction and segmentation is not needed in this case. On the other hand, in the absence of a stationary background, preprocessing shall entail identifying the pixel locations belonging to the subject. The segmented images are usually binarized, resulting in what is known as "silhouette frames". One approach to this segmentation is through background modeling and separation as described above [3].
In this paper we base our feature extraction on the techniques used in digital video coding, where we compute the forward prediction error between successive images. That is, each image is subtracted from its immediate previous image. The resultant prediction error can be thresholded to filter out image differences that did not result from the motion of the individual. The threshold can be set to the 50th or the 75th percentile of the non-zero pixels of the prediction error image. The thresholded prediction error images can then be accumulated into one image, which we refer to as the Accumulated Predictions (AP) image. For a better representation of the individual's motion, the prediction error between two consecutive images can be represented using two images: one for the positive differences and the other for the negative differences. Each prediction error image is then thresholded separately. In this case we end up with two AP images, which can be referred to as the positive AP and negative AP images.
Note that the covered and uncovered background will appear as relative motion to the individual and thus be represented as such in the AP images. To minimize the appearance of relative motion, we propose to use either the previous image or the future image in computing the prediction error for a given image. The prediction in this case is referred to as forward or backward prediction, respectively. The decision between forward and backward prediction can be based on computing the Sum of Absolute Differences (SAD) of the prediction errors; the prediction source that minimizes the SAD is selected. The result of implementing this technique is shown in Figure 2. The figure shows that the appearance of the background objects is now minimized as desired.
(a)
(b)
Fig. 2. AP images of a motion sequence with adaptive forward/backward prediction. (a) Negative AP image (b) Positive AP image.
Once the AP images are computed, the next step is to extract spatial domain features. Following the authors' work on sign language recognition as reported in [17], these features can be based on either the two dimensional Discrete Cosine Transform (DCT) coefficients or the Radon transform coefficients. An important property of the DCT is its energy compaction: most of the image energy is concentrated in the top left corner of the transformed image. This fact is utilized in image and video coding, where such low frequency coefficients are quantized with a finer quantization step size. Therefore, in terms of spatial feature extraction, we propose to represent our AP images using the top left DCT coefficients rather than the high frequency content. These coefficients can be selected in a zig-zag scanning manner, starting from the top left corner and progressing inwards towards the bottom right corner. The scanning process can select a predefined number of coefficients; this number is known as the DCT cutoff, which can be selected empirically.
The process of DCT transformation followed by zig-zag scanning is also known as zonal coding. Note that zonal coding is applied to both the negative and positive AP images. The resultant vectors of the zonal coding are then interleaved to generate the final feature vector.
As mentioned previously, the second approach to feature extraction is based on the Radon transformation. Essentially, the AP images are projected at a given angle; the result is a one dimensional curve that reflects the integral of pixel lines across the direction of the projection angle. Typically the projection is done on either the horizontal or the vertical image axis. To smooth the projection and reduce its size, a one dimensional DCT transformation can be used followed by ideal low pass filtering with a given frequency cutoff. Hence the projection can be represented using a few low frequency coefficients.
4. CLASSIFICATION
A polynomial network provides a parameterized nonlinear map which nonlinearly expands a sequence of feature vectors to a higher dimensionality and maps them to a target output sequence. Training of a polynomial network consists of two main stages. The first stage involves expanding the training feature vectors via polynomial expansion with the aim of improving the separation of the different classes in the expanded feature vector space. The second stage entails computing the weights of the polynomial network applied to the expanded feature vectors. Polynomial networks have been used successfully in biomedical signal separation [18].
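Before moving to the classification details, the accumulated-prediction feature extraction of Section 3 can be sketched as follows. This is a hypothetical numpy illustration, not the authors' implementation: accumulating the thresholded error magnitudes (rather than binary maps) and the exact percentile thresholding are assumptions of this sketch.

```python
import numpy as np

def ap_images(frames, percentile=75):
    """Accumulate thresholded inter-frame prediction errors into positive and
    negative Accumulated Prediction (AP) images.  For each frame, the
    reference (previous or next frame) whose prediction error has the lower
    sum of absolute differences (SAD) is chosen, suppressing background motion."""
    pos_ap = np.zeros_like(frames[0], dtype=np.float64)
    neg_ap = np.zeros_like(frames[0], dtype=np.float64)
    for t in range(1, len(frames) - 1):
        fwd = frames[t].astype(np.float64) - frames[t - 1]   # forward prediction error
        bwd = frames[t].astype(np.float64) - frames[t + 1]   # backward prediction error
        err = fwd if np.abs(fwd).sum() <= np.abs(bwd).sum() else bwd
        for comp, acc in ((np.maximum(err, 0), pos_ap),       # positive differences
                          (np.maximum(-err, 0), neg_ap)):     # negative differences
            nonzero = comp[comp > 0]
            if nonzero.size:                                  # percentile of non-zero pixels
                acc += np.where(comp >= np.percentile(nonzero, percentile), comp, 0)
    return pos_ap, neg_ap

frames = np.zeros((6, 16, 16))
for t in range(6):
    frames[t, 4:8, 2 + t:6 + t] = 1.0     # toy subject: a block moving right
pos_ap, neg_ap = ap_images(frames)
```

The two returned images correspond to the positive and negative AP images that are subsequently transformed and turned into feature vectors.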
In a polynomial networks setting, the gait recognition problem can be formulated as follows. The response variables, which represent the M individuals of the training dataset (where each individual can be referred to as a class in this case), are denoted by M binary q-vectors, i.e. Q = {q_m | m = 1, 2, ..., M}. For a given class of feature vectors, say class i, the corresponding q-vector will contain '1s' indicating individuals belonging to class i and '0s' for the rest of the individuals or participants.
The feature vector at repetition j of class m is composed of l feature variables and is denoted by x_{m,j} = [x_{m,j}(0), x_{m,j}(1), ..., x_{m,j}(l)]. Consequently, the feature vectors in the training set are denoted by the matrix X where:
X = | x_{1,1}(0)  x_{1,1}(1)  ...  x_{1,1}(l) |
    | x_{1,2}(0)  x_{1,2}(1)  ...  x_{1,2}(l) |
    |    ...          ...     ...      ...    |
    | x_{M,J}(0)  x_{M,J}(1)  ...  x_{M,J}(l) |    (1)
We wish to perform a nonlinear mapping between the feature vector matrix X and the response variables Q = {q_m | m = 1, 2, ..., M}. In polynomial networks, the dimensionality of the feature vectors in matrix X is first expanded into an rth order. The dimensionality expansion can be achieved by a reduced multivariate polynomial expansion as proposed in [16]. The expansion of X into the rth order is denoted by the matrix P in R^{n x k}, where k is the dimensionality of the expanded feature vector, which is defined as [16]:
k = 1 + r + l(2r - 1)    (2)
The mapping between P and Q is then achieved by using a least-squared error objective criterion:
W* = arg min_W ||PW - Q||^2    (3)
where ||.|| denotes the l2 norm. Minimizing the objective function results in:
W* = (P^T P)^{-1} P^T Q    (4)
Note that the model weights are computed using a non-iterative least squares method, which is a clear advantage when it comes to computational complexity.
Consequently, the training process results in a set of weights {w_m | m = 1, 2, ..., M}. To classify a feature vector representing the walk of an individual, we compute the inner product of its expanded feature vector with each of the weight vectors. This results in a score sequence s_m, m = 1, 2, ..., M. The class label of the feature vector is then determined by arg max_m(s_m).
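The training and scoring procedure above can be sketched as follows. This is an illustrative sketch, not the authors' code: a plain second-order expansion stands in for the reduced multivariate expansion of [16] (so the expanded dimensionality differs from Eq. (2)), and `np.linalg.pinv` replaces the explicit inverse of Eq. (4) for numerical safety.

```python
import numpy as np

def expand(X):
    """Second-order polynomial expansion: bias, linear and squared terms.
    (A simple stand-in for the reduced multivariate expansion of [16].)"""
    return np.hstack([np.ones((X.shape[0], 1)), X, X**2])

def train(X, labels, n_classes):
    """Least-squares weights, W = (P^T P)^-1 P^T Q, for one-vs-rest targets Q."""
    P = expand(X)
    Q = np.eye(n_classes)[labels]      # binary q-vectors, one column per class
    return np.linalg.pinv(P) @ Q       # pseudo-inverse for numerical stability

def classify(x, W):
    """Score s_m is the inner product with each weight vector; pick arg max."""
    return int(np.argmax(expand(x[None, :]) @ W))

# Toy data: three well-separated classes of 4-dimensional feature vectors
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(m, 0.1, size=(10, 4)) for m in (0.0, 1.0, 2.0)])
y = np.repeat([0, 1, 2], 10)
W = train(X, y, 3)
```

As the paper notes, the weights come from a single non-iterative least-squares solve, which keeps training inexpensive.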
5. EXPERIMENTAL RESULTS
In the following results we validate the proposed feature
extraction schemes on Gulf costumes and compare the results
against those obtained on casual costumes which are similar to
what is reported in the literature. Common to all of the results
to follow, we report the gait classification rate obtained from
training and testing the system with feature vectors of different
lengths according to the DCT cutoff as explained in Section 3
above. Unless otherwise stated, the classification results are
obtained using a least-squares classifier without polynomial
expansion.
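The zonal coding that produces these variable-length feature vectors can be sketched as below. This is a numpy-only illustration; the orthonormal DCT normalization and the interleaving order of positive/negative AP coefficients are assumptions of the sketch.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal 1-D DCT-II basis as an n x n matrix."""
    i = np.arange(n)
    M = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i[None, :] + 1) * i[:, None] / (2 * n))
    M[0] /= np.sqrt(2.0)
    return M

def dct2(img):
    """2-D DCT-II: transform the rows, then the columns."""
    return dct_matrix(img.shape[0]) @ img @ dct_matrix(img.shape[1]).T

def zigzag_cutoff(coeffs, cutoff):
    """Zonal coding: keep the first `cutoff` coefficients along the zig-zag
    scan that starts at the top-left (low-frequency) corner."""
    h, w = coeffs.shape
    order = sorted(((i, j) for i in range(h) for j in range(w)),
                   key=lambda ij: (ij[0] + ij[1],
                                   ij[0] if (ij[0] + ij[1]) % 2 else -ij[0]))
    return np.array([coeffs[i, j] for i, j in order[:cutoff]])

# Interleave the zonal-coded positive and negative AP images into one vector
rng = np.random.default_rng(0)
pos_ap_img, neg_ap_img = rng.random((2, 32, 32))    # stand-in AP images
cutoff = 30
pos_c = zigzag_cutoff(dct2(pos_ap_img), cutoff)
neg_c = zigzag_cutoff(dct2(neg_ap_img), cutoff)
feature = np.empty(2 * cutoff)
feature[0::2], feature[1::2] = pos_c, neg_c          # interleaved final vector
```

Varying `cutoff` reproduces the different feature vector lengths reported on the x-axes of Figures 3 and 4.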
We start the experimental results section by comparing different approaches to spatial domain feature extraction. In Figure 3, the female and male datasets are mixed; this also includes mixing different directions of walking (i.e. from left to right and vice versa). For each individual, 75% of the walking samples are used for training and 25% for testing; hence the testing data is unseen by the training model. The figure shows that the Radon transformation with horizontal projections of the AP images yields the highest classification rates. Intuitively this makes sense because the horizontal projection represents the shape of the accumulated motion from both the front and the rear of the body of an individual. At a DCT cutoff of 60 coefficients, the classification results are very close to 100%. This should not come as a surprise, as similar results have been reported in [3]. On the other hand, the figure shows that the Radon transformation with vertical projections of the AP images results in very poor classification. This is so because such projections can only describe the height of the individual and the sinusoidal-like motion of the head during the walk. Clearly such features are not enough for identifying an individual. Interleaving the feature vectors of both aforementioned projections, though, yields an acceptable result as shown in the figure. Feature extraction using zonal coding resulted in moderate classification rates. This can be justified by the fact that the AP images contain plenty of high frequencies; hence describing such images whilst discarding most of the high frequency content through zonal coding does not result in accurate and precise feature vectors. Lastly, it is worth mentioning that the above discussion applies equally to both Gulf and casual costumes. However, in the latter scenario, the classification scores resulting from the horizontal projections are a bit more accurate. This is not a surprise, as the Gulf costume conceals some details of the body motion.
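The Radon-projection features compared above can be sketched as follows. The choice of summation axis for the "horizontal" projection and the unnormalized 1-D DCT are assumptions of this illustration.

```python
import numpy as np

def projection_features(ap_image, cutoff=60, axis=1):
    """Project an AP image onto one axis (the Radon transform at 0 or 90
    degrees), then keep the first `cutoff` coefficients of an unnormalized
    1-D DCT-II of the projection as a low-pass description of its shape."""
    profile = ap_image.sum(axis=axis)                 # 1-D projection curve
    n = profile.size
    k = np.arange(n)[:, None]
    basis = np.cos(np.pi * (2 * np.arange(n)[None, :] + 1) * k / (2 * n))
    coeffs = basis @ profile                          # 1-D DCT of the curve
    return coeffs[:cutoff]                            # ideal low-pass cutoff

ap = np.ones((80, 40))                                # stand-in AP image
horizontal = projection_features(ap, cutoff=60, axis=1)
vertical = projection_features(ap, cutoff=40, axis=0)
```

The "interleaved projection" curve in Figure 3 corresponds to combining coefficients from both projections into a single feature vector.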
(a)
(b)
[Plots of classification rate versus length of feature vector (10 to 60) for the horizontal projection, vertical projection, interleaved projection and zonal coding.]
Fig. 3. Comparison between various spatial feature extraction
approaches. (a) Gulf costume (b) Casual costume.
In Figure 4 we split the two directions of walking between system training and testing. That is, we train the system on one direction of walking and test it on the other direction. Clearly this experiment is carried out in a cross-validation manner and the average classification result is reported. In this experiment, 50% of the feature vectors belong to each direction of walking; hence the training to testing ratio is set as such. The spatial feature extraction approach is the Radon transformation with horizontal projections. Clearly, the classification rates based on different directions of walking are less accurate, as shown in the figure; nonetheless, at a DCT cutoff of 60 a classification rate of around 90% is achieved. The figure also shows that higher classification results are obtained with training and testing based on a second order reduced model polynomial expansion. At low DCT cutoffs the enhancement is quite evident. However, at higher dimensionality, and due to the low number of training samples per individual (4 in this experiment), the expanded feature vector matrix becomes ill-conditioned, thus affecting the matrix inverse operation in the computation of the model weights. Again, the same discussion applies to both the Gulf and the casual costume.
(a)
(b)
Fig. 4.Classification rates using different training approaches
based on the direction of walking. (a) Gulf costume (b) Casual
costume.
6. CONCLUSION
This paper proposed a solution for gait recognition with non-western costumes. In particular, the work was concerned with Gulf costumes for both genders. The proposed solution was also tested on casual costumes and was shown to work as well. As such, the same system can be deployed for identifying individuals with mixed costumes without the need for a customized solution for a particular costume. The paper proposed to accumulate the prediction errors of consecutive video images using an adaptive forward/backward prediction scheme. This was needed to counteract the relative motion of the background objects. Once the motion is accumulated into one or two images, spatial feature extraction is applied. It was shown that the Radon transformation with horizontal image projections results in precise and concise feature vectors that are linearly separable. The experimental results revealed that the proposed solution yields accurate classification rates and works equally well for both of the aforementioned costumes.
REFERENCES
[1] S.V. Stevenage, M.S. Nixon, and K. Vince, “Visual analysis
of gait as a cue to identity,” Applied Cognitive Psychology, vol.
13, pp. 513-526, Dec. 1999.
[2] G. Zhao, R. Chen, G. Liu, and H. Li, “Amplitude spectrum-
based gait recognition,” Proc. Int’l Conf. Automatic Face and
Gesture Recognition, pp. 23-28, 2004.
[3] S. Sarkar, P. Jonathon Phillips, Z. Liu, I. Robledo, P.
Grother, K. W. Bowyer, “The human id gait challenge problem:
data sets, performance, and analysis,” IEEE Transactions on
Pattern Analysis and Machine Intelligence, 27(2), pp. 162 – 177,
Feb. 2005
[4] Z. Liu and S. Sarkar, "Improved gait recognition by gait dynamics normalization," IEEE Transactions on Pattern Analysis and Machine Intelligence, 28(6), June 2006.
[5] P.J. Phillips, S. Sarkar, I. Robledo, P. Grother, and K.
Bowyer, “The gait identification challenge problem: data sets
and baseline algorithm,” Proc. Int’l Conf. Pattern Recognition,
pp. 385-388, 2002.
[6] L. Wang, T. Tan, H. Ning, and W. Hu, “Silhouette analysis-
based gait recognition for human identification,” IEEE Trans.
Pattern Analysis and Machine Intelligence, 25(12), pp. 1505-
1518, Dec. 2003.
[7] S.D. Mowbry and M.S. Nixon, “Automatic Gait Recognition
via Fourier Descriptors of Deformable Objects,” Proc. Conf.
Audio Visual Biometric Person Authentication, pp. 566-573,
2003.
[8] Z. Liu and S. Sarkar, “Simplest representation yet for gait
recognition: averaged silhouette,” Proc. Int’l Conf. Pattern
Recognition, vol. 4, pp. 211-214, 2004.
[9] A. Johnson and A. Bobick, “A multi-view method for gait
recognition using static body parameters,” Proc. Int’l Conf.
Audio and Video-Based Biometric Person Authentication, pp.
301-311, 2001.
[10] X. Yang, Y. Zhou, T. Zhang, G. Shu, J. Yang, “Gait
recognition based on dynamic region analysis,” Signal
Processing, 88(9), pp. 2350-2356, September 2008.
[11] J. Wu, J. Wang, L. Liu, "Feature extraction via KPCA for classification of gait patterns," Human Movement Science, 26(3), pp. 393-411, June 2007.
[12] N. Boulgouris, Z. Chi, "Human gait recognition based on matching of body components," Pattern Recognition, 40(6), pp. 1763-1770, June 2007.
[13] J. Han and B. Bhanu, “Statistical feature fusion for gait-
based human recognition,” Proc. IEEE Conf. Computer Vision
and Pattern Recognition, vol. 2, pp. 842-847, June 2004.
[14] Y. Liu, R. Collins, and Y. Tsin, “Gait sequence analysis
using frieze patterns,” Proc. European Conf. Computer Vision,
pp. 657-671, May 2002.
[15] L. Wang, T. Tan, H. Ning, and W. Hu, “Silhouette analysis-
based gait recognition for human identification,” IEEE
Transactions on pattern analysis and machine intelligence,
25(12), December 2003.
[16] M.-H. Cheng, M.-F. Ho, C.-L. Huang, “Gait analysis for
human identification through manifold learning and HMM,”
Pattern Recognition, 41(8), pp. 2541-2553, August 2008.
[17] T. Shanableh and K. Assaleh, “Telescopic vector
composition and polar accumulated motion residuals for
feature extraction in Arabic Sign Language recognition,”
EURASIP Journal on Image and Video Processing, vol. 2007,
Article ID 87929, 10 pages, 2007. doi:10.1155/2007/87929.
[18] K. Assaleh and H. Al-Nashash, "A Novel Technique for the Extraction of Fetal ECG Using Polynomial Networks," IEEE Transactions on Biomedical Engineering, 52(6), pp. 1148-1152, June 2005.
... In [373], the authors compute gait moment image for gait representation which encodes the probability of an image at each keyframe. The authors in [374] propose a gait recognition system for gulf clothing using an accumulated prediction image. The technique proposed in [375] computes the binomial distribution of each pixel to obtain a gait probability image. ...
Article
Full-text available
Visual surveillance has exponentially increased the growth of security devices and systems in the digital era. Gait-based person identification is an emerging biometric modality for automatic visual surveillance and monitoring as the walking patterns highly correlate to the subject’s identity. The scientific research on person identification using gait has grown dramatically over the past two decades due to its several benefits. It does not require active collaboration from users and can be performed without their cooperation. It is difficult to be impersonated and identification can be validated from low-resolution videos and with simple instrumentation. This paper presents a comprehensive overview of the exiting techniques, their key stages, and recent developments in vision-based person identification using gait. We reviewed the historical research on gait locomotion and explain that how it is used to recognize the identity. The article summarizes the different types of features that have been proposed to encode the biomechanics of gait and also groups them into different categories and subcategories based upon the similarity in their implementation. We also present the impact of different covariate factors that affect the performance of gait recognition systems and also discuss the recent works to cope with these challenges. Furthermore, a comparison of the recognition accuracies reported by the existing algorithms to assess their performance under verification and identification mode is also presented. A detailed summary of publicly available vision-based gait databases is also provided. Finally, it offers insight into the challenges and open problems for future perspectives in the field of gait recognition that can help to set the directions for future research in this field.
... (Each entry lists: authors [ref], year: gait representation; dimensionality-reduction step, or "none"; classifier.)
• Boulgouris and Chen [25], 2007: Radon Energy Image (REI); LDA; nearest-neighbor
• Liu and Zheng [115], 2007: Motion History Image (MHI); LDA; nearest-neighbor
• Ma et al. [133], 2007: Gait Moment Image (GMI); none; nearest-neighbor
• Yang et al. [193], 2008: Enhanced GEI (EGEI); discriminative common vectors; nearest-neighbor
• Chen et al. [31], 2009: Frame Difference Energy Image (FDEI); none; HMM
• Bashir et al. [15], 2009: Gait Flow Fields (GFF); PCA+LDA; nearest-neighbor
• Shanableh et al. [164], 2009: Accumulated Prediction Image (API); none; polynomial networks
• Bashir et al. [14], 2010: Gait Entropy Image (GEnI); PCA+LDA; nearest-neighbor
• Wang et al. [179,180], 2010: Chrono Gait Image (CGI); PCA+LDA; nearest-neighbor
• Zhang et al. [202], 2010: Active Energy Image (AEI); 2DLPP; nearest-neighbor
• Mu et al. [142], 2010: C1Gait; discriminative locality alignment; nearest-neighbor
• Lam et al. [98], 2011: Gait Flow Image (GFI); LDA; nearest-neighbor
• Roy et al. [160], 2012: Pose Energy Image (PEI); PCA+LDA; nearest-neighbor
• Hofmann and Rigoll [62], 2012: Gradient Histogram Energy Image (GHEI); PCA+LDA; nearest-neighbor
• Huang et al. [72], 2012: Shifted Energy Image (SEI) + Gait Structural Profile (GSP); LDA; nearest-neighbor
• Liu et al. [119], 2012: Multiple HOG; PCA+LDA; nearest-neighbor
• Jeevan et al. [81], 2013: Gait Pal and Pal Entropy (GPPE); PCA; SVM
• Hu et al. [67], 2013: LBP Flow; none; HMM
• Lee et al. [101], 2014: Gait Probability Image (GPI); none; minimum Kullback-Leibler (KL) divergence
• Kusakunniran [89], 2014: Histogram of Optical Flow (HOF) + Histogram of Oriented Gradient (HOG); none; nearest-neighbor
• Kusakunniran [88], 2014: Histogram of Optical Flow (HOF) + Histogram of Oriented Gradient (HOG); none; SVM
• Lee et al. [102], 2014: Time-sliced Averaged Motion History Image (TAMHI); none; majority voting
• Chen and Liu [33], 2014: Gait Differential Image (AGDI); 2DPCA; nearest-neighbor
• Arora and Srivastava [9], 2015: Gait Gaussian Image (GGI); none; nearest-neighbor
• Arora et al. [10], 2015: Gradient Histogram Gaussian Image (GHGI); none; nearest-neighbor
• Lee et al. [103], 2015: Transient Binary Patterns (TBP); none; majority voting
• Luo et al. [130], 2015: Accumulated Frame Difference Energy Image (AFDEI); none; nearest-neighbor
• Arora et al. [8], 2015: Gait Information Image with Energy Feature/Sigmoid Feature (GII-EF/SF); none; nearest-neighbor
• Choudhury and Tjahjadi [40], 2016: Averaged Gait Key-phase Image (AGKI); PCA; rotation forest ensemble
• Chhatrala and Jandhav [35], 2016: Gabor Cosine Features (GCF); MLDA; LGSR
• Medikonda et al. [141], 2016: Generalized New Entropy (GNE); none; SVM
• Al et al. [3], 2017: Accumulated Flow Image (AFI) + Edge-Masked Active Energy Image (EMAEI); MPCA+LDA; nearest-neighbor
• Atta et al. [11], 2017: 5/3 Gait Image (5/3GI); PCA; nearest-neighbor
• Chaurasia et al. [30], 2017: P RW D F GEI; PCA + generalized LDA; nearest-neighbor
• Verlekar et al. [177], 2017: ...
It should also be noted that gait was combined with face in order to achieve higher accuracy [4,63,163]. ...
Article
Full-text available
Gait recognition has emerged as an attractive biometric technology for identifying people by analysing the way they walk. However, one of the technology's main challenges is to address the inherent intra-class variations caused by covariate factors such as clothing, carrying conditions, and view angle, which adversely affect recognition performance. The main aim of this survey is to provide a comprehensive overview of existing robust gait recognition methods. It is intended to provide researchers with state-of-the-art approaches in order to help advance the research topic through an understanding of basic taxonomies, comparisons, and summaries of state-of-the-art performance on several widely used gait recognition datasets.
Article
Full-text available
Gait recognition is one of the latest and most attractive biometric techniques, owing to its potential for identifying individuals at a distance, unobtrusively, and even from low-resolution images. In this paper we focus on single lateral-view gait recognition under various carrying and clothing conditions. Such a system is needed in access-control applications whereby a single view is imposed by the system setup. The gait data is first processed using three gait representation methods as feature sources: Accumulated Prediction Image (API) and two new gait representations, namely Accumulated Flow Image (AFI) and Edge-Masked Active Energy Image (EMAEI). Secondly, each of these methods is tested with three matching classification schemes: image projection with Linear Discriminant Functions (LDF), Multilinear Principal Component Analysis (MPCA) with a K-Nearest Neighbor (KNN) classifier, and MPCA plus Linear Discriminant Analysis (MPCA + LDA) with a KNN classifier. Gait samples are fed into the MPCA and MPCA + LDA algorithms using a novel tensor-based form of the gait images. This arrangement results in nine recognition sub-systems. Decisions from the nine classifiers are fused using a decision-level (majority voting) scheme. A comparison between unweighted and weighted voting schemes is also presented. The methods are evaluated on the CASIA B dataset using four different experimental setups, and on the OU-ISIR dataset B using two setups. The experimental results show that the classification accuracy of the proposed methods is encouraging and outperforms several state-of-the-art gait recognition approaches reported in the literature.
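The decision-level fusion described above can be sketched as plain (weighted) majority voting. The nine votes and the weights below are hypothetical values for illustration, not figures from the paper:

```python
def fuse_decisions(decisions, weights=None):
    """Fuse per-classifier identity decisions by (weighted) majority voting."""
    if weights is None:
        weights = [1.0] * len(decisions)   # unweighted voting
    tally = {}
    for label, w in zip(decisions, weights):
        tally[label] = tally.get(label, 0.0) + w
    # The identity with the largest accumulated vote mass wins.
    return max(tally, key=tally.get)

# Hypothetical decisions from the nine sub-systems for one probe sequence.
votes = ["id_07", "id_07", "id_12", "id_07", "id_03",
         "id_07", "id_12", "id_07", "id_07"]
unweighted = fuse_decisions(votes)                            # -> "id_07"
weighted = fuse_decisions(votes, weights=[0.5, 0.5, 2.0, 0.5, 0.5,
                                          0.5, 2.0, 0.5, 0.5])  # -> "id_12"
```

Weighting lets more reliable sub-systems (here, the two assigned weight 2.0) override a simple head count, which is why the two fused decisions differ.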
Conference Paper
Full-text available
We describe a new method for Automatic Gait Recognition based around the use of Fourier descriptors that model the periodic deformation of human gait. Fourier descriptors have been used successfully in the past to model the boundary of static or moving, rigid-bodied objects, but many objects actually deform in some way as they move. Here we use Fourier descriptors to model not only the object’s boundary, but also the spatio-temporal deformations under which the object’s boundary is subjected. We applied this new method to the Large Gait Database, compiled at the University of Southampton, and found that the Fourier descriptors obtained for each person appear to be unique and can be used for recognition. Successful recognition rates of over 85% were obtained from the Large Gait Database using only a small set of descriptors.
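The idea of describing a closed boundary with Fourier descriptors can be sketched as follows. The function name, the number of coefficients kept, and the circle sanity check are illustrative assumptions, not the authors' exact formulation:

```python
import numpy as np

def fourier_descriptors(boundary_xy, n_coeffs=8):
    """Fourier descriptors of a closed boundary given as an ordered (N, 2) array.

    Translation invariance: the DC coefficient is dropped.
    Scale invariance: magnitudes are divided by the first harmonic's magnitude.
    Rotation/start-point invariance: only magnitudes are kept.
    """
    z = boundary_xy[:, 0] + 1j * boundary_xy[:, 1]   # contour as a complex signal
    c = np.fft.fft(z)
    mags = np.abs(c[1:n_coeffs + 1])
    return mags / mags[0]

# Sanity check on a sampled circle: all energy sits in the first harmonic.
t = np.linspace(0, 2 * np.pi, 64, endpoint=False)
circle = np.stack([np.cos(t), np.sin(t)], axis=1)
fd = fourier_descriptors(circle)   # fd[0] == 1, remaining descriptors ~ 0
```

For a deforming walker, the same descriptors would be computed per frame, so the sequence of descriptor vectors captures the spatio-temporal boundary deformation described above.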
Article
Full-text available
Few researchers have investigated whether the timing of post-event information affects the accuracy of children's reports of events they have experienced. In this study, four-year-olds dressed up in costumes and had their photographs taken. An unfamiliar adult spoke to the children about the event either a day (immediate condition) or a month (delayed condition) later, providing both accurate and misleading information about the staged event. When questioned five weeks after the event, children in a control group who had not received the review were less accurate in answering focused questions than children who had been reminded of the event. A review given some time after the event but shortly before the interview increased the amount of detail recalled, and this was not at the expense of accuracy. Misinformation was seldom reported spontaneously, although children in all groups acquiesced to leading questions in line with the misleading suggestions.
Conference Paper
Full-text available
A multi-view gait recognition method using recovered static body parameters of subjects is presented; we refer to these parameters as activity-specific biometrics. Our data consists of 18 subjects walking at both an angled and a frontal-parallel view with respect to the camera. When only considering data from a single view, subjects are easily discriminated; however, discrimination decreases when data across views are considered. To compare between views, we use ground-truth motion-capture data of a reference subject to find scale factors that can transform data from different views into a common frame ("walking-space"). Instead of reporting percent correct from a limited database, we report our results using an expected confusion metric that allows us to predict how our static body parameters filter identity in a large population: lower confusion yields higher expected discrimination power. We show that by using motion-capture data to adjust vision data of different views to a common reference frame, we can achieve expected confusion rates on the order of 6%.
Article
Full-text available
For humans, the ability to discriminate between, and to identify, others is paramount. The most obvious way this is accomplished is by means of face recognition. However, this is not the only method available. The present article reports on two experiments designed to see whether gait can be used as a reliable cue to identity. Experiment One showed that the human visual system was sophisticated enough to learn to identify six individuals on the basis of their gait signature under conditions of simulated daylight, simulated dusk and point-light displays. It thus appeared that gait-related judgements could be made, and furthermore, that these judgements were possible without reliance on shape information. Experiment Two suggested that even under adverse viewing conditions involving a single brief exposure, humans could identify a target from a ‘walking identity parade’ at greater than chance levels. These results emerged regardless of the lighting conditions, and were largely independent of the gender of the target walker. As such, the present results suggest that gait could be used as a reliable means of discriminating between individuals, and the importance of such an identity cue in conditions in which the face is obscured is discussed.
Article
Full-text available
This work introduces two novel approaches for feature extraction applied to video-based Arabic sign language recognition, namely, motion representation through motion estimation and motion representation through motion residuals. In the former, motion estimation is used to compute the motion vectors of a video-based deaf sign or gesture. In the preprocessing stage for feature extraction, the horizontal and vertical components of such vectors are rearranged into intensity images and transformed into the frequency domain. In the second approach, motion is represented through motion residuals. The residuals are then thresholded and transformed into the frequency domain. Since in both approaches the temporal dimension of the video-based gesture needs to be preserved, hidden Markov models are used for the classification tasks. Additionally, this paper proposes to project the motion information in the time domain through either telescopic motion vector composition or polar accumulated differences of motion residuals. The feature vectors are then extracted from the projected motion information, after which model parameters can be estimated using simple classifiers such as Fisher's linear discriminant. The paper reports on the classification accuracy of the proposed solutions. Comparisons with existing work reveal that up to 39% of the misclassifications have been corrected.
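As a rough illustration of the motion-residual pathway (threshold the frame difference, then keep low-frequency transform coefficients as features), here is a sketch. The hand-rolled DCT, the threshold value, and the frame sizes are assumptions made for the example, not the paper's implementation:

```python
import numpy as np

def dct2(block):
    """Orthonormal 2-D DCT-II of a square block, built from an explicit basis."""
    n = block.shape[0]
    k = np.arange(n)
    basis = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * k[None, :] + 1) * k[:, None] / (2 * n))
    basis[0, :] = np.sqrt(1.0 / n)                  # DC row has its own scaling
    return basis @ block @ basis.T

def residual_features(prev_frame, curr_frame, thresh=25, n_coeffs=10):
    """Threshold the motion residual and keep low-frequency DCT coefficients."""
    residual = np.abs(curr_frame.astype(float) - prev_frame.astype(float))
    binary = (residual > thresh).astype(float)      # thresholded residual
    coeffs = dct2(binary)
    # Keep the top-left n_coeffs x n_coeffs block as a compact feature vector.
    return coeffs[:n_coeffs, :n_coeffs].ravel()

rng = np.random.default_rng(0)
f0 = rng.integers(0, 256, (32, 32))
f1 = f0.copy()
f1[8:16, 8:16] = 255              # a bright patch "moves" into this region
feat = residual_features(f0, f1)  # 100-dimensional feature vector
```

Because the binary residual map concentrates its energy in low spatial frequencies, the truncated DCT block retains most of the motion information in a fixed-length vector suitable for the classifiers mentioned above.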
Article
Gait Energy Image (GEI) has proven to be an effective identity signature for gait recognition. However, previous approaches treat this 2D image representation as a holistic feature and neglect the intrinsic dynamic characteristics of gait patterns. In this paper, we use variation analysis to obtain the dynamic region in the GEI, which reflects the walking manner of an individual. Based on this analysis, a dynamics weight mask is constructed to enhance the dynamic region and suppress noise in the unimportant regions. The resulting gait representation, called enhanced GEI (EGEI), is then represented in a low-dimensional subspace by Gabor-based discriminative common vectors analysis. We test the proposed approach on the USF HumanID Gait Database. Experimental results prove its effectiveness in terms of recognition rate.
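The variation-analysis idea behind EGEI can be sketched as below. The toy silhouettes and the max-normalised mask are illustrative assumptions, not the paper's exact formulation:

```python
import numpy as np

def enhanced_gei(silhouettes):
    """GEI weighted by a per-pixel dynamics mask.

    silhouettes : (T, H, W) array of aligned binary silhouettes over one cycle.
    Pixels that vary across frames (swinging limbs) get high weight;
    static pixels (torso, background) are suppressed.
    """
    sils = silhouettes.astype(float)
    gei = sils.mean(axis=0)                       # classic Gait Energy Image
    variance = sils.var(axis=0)                   # variation analysis per pixel
    mask = variance / (variance.max() + 1e-12)    # normalised dynamics weights
    return mask * gei

# Toy cycle: a static "torso" block plus one pixel that toggles every frame.
sils = np.zeros((4, 8, 8))
sils[:, 2:6, 3:5] = 1          # static region: present in every frame
sils[::2, 6, 4] = 1            # dynamic pixel: on in half the frames
egei = enhanced_gei(sils)      # static region -> 0, dynamic pixel -> ~0.5
```

The mask zeroes out perfectly static pixels and preserves the energy of pixels whose value changes across the cycle, which is the enhancement the abstract describes.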
Article
This paper presents a novel approach for gait recognition based on the matching of body components. The human body components are studied separately and are shown to have unequal discrimination power. Several approaches are presented for the combination of the results obtained from different body components into a common distance metric for the evaluation of similarity between gait sequences. A method is also proposed for the determination of the weighting of the various body components based on their contribution to recognition performance. Using the best performing of the proposed methods, improved recognition performance is achieved.
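The component-combination step can be sketched as a weighted sum of per-part distances. The part names, distances, and weights below are hypothetical:

```python
def combined_distance(component_distances, weights):
    """Combine per-component gait distances into one probe-gallery distance.

    component_distances : body part -> distance for that part.
    weights             : body part -> weight reflecting its discrimination power.
    """
    total_w = sum(weights.values())
    return sum(weights[p] * d for p, d in component_distances.items()) / total_w

# Hypothetical values: legs carry more discriminative power than the head.
dists = {"legs": 0.20, "torso": 0.50, "head": 0.90}
weights = {"legs": 3.0, "torso": 1.5, "head": 0.5}
score = combined_distance(dists, weights)   # weighted average distance
```

Normalising by the total weight keeps the combined metric on the same scale as the per-component distances, so weights only redistribute influence among body parts.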
Conference Paper
With the increasing demands of visual surveillance systems, human identification at a distance has recently gained more attention from researchers. Gait analysis can be used as an unobtrusive biometric measure to identify people at a distance without requiring the subjects' attention. We propose a novel and effective method for both automatic viewpoint and person identification using only the silhouette sequence of the gait. The gait silhouettes are nonlinearly transformed into a low-dimensional embedding by a Gaussian process latent variable model (GPLVM), and the temporal dynamics of the gait sequences are modeled by hidden Markov models (HMMs). The experimental results show that our method achieves a higher recognition rate than the other methods.
Article
Gait recognition is the process of identifying individuals by the way they walk. Gait is often used as an unobtrusive biometric, offering the possibility of identifying people at a distance without any interaction or co-operation from the subject. This paper presents a novel method for both automatic viewpoint and person identification using only the silhouette sequence of gait. The gait silhouettes are nonlinearly transformed into a low-dimensional embedding, and the dynamics in the time-series images are modeled by an HMM in the corresponding embedding space. The experimental results demonstrate that the proposed algorithm represents encouraging progress in gait analysis research.