Evaluation of image pre-processing techniques for
eigenface based face recognition
Thomas Heseltine1, Nick Pears and Jim Austin
Advanced Computer Architecture Group
Department of Computer Science
The University of York
ABSTRACT
We present a range of image processing techniques as potential pre-processing steps that attempt
to improve the performance of the eigenface method of face recognition. Verification tests are
carried out by applying thresholds to gather false acceptance rate (FAR) and false rejection rate
(FRR) results from a data set comprised of images that present typical difficulties for
recognition, such as strong variations in lighting direction and intensity, partially covered faces and
changes in facial expression. Results are compared using the equal error rate (EER), the error
rate at which FAR equals FRR. We determine the most successful methods of image
processing for use with eigenface-based face recognition in application areas such as security,
surveillance, data compression and archive searching.
Keywords: Face Recognition, Eigenface, Image Pre-processing, Lighting.
Copyright 2002 Society of Photo-Optical Instrumentation Engineers.
This paper was published in The Proceedings of the Second International Conference on Image and Graphics, SPIE vol. 4875, pp. 677-685 (2002) and
is made available as an electronic reprint with permission of SPIE. One print or electronic copy may be made for personal use only. Systematic or
multiple reproduction, distribution to multiple locations via electronic or other means, duplication of any material in this paper for a fee or for
commercial purposes, or modification of the content of the paper are prohibited.
1 tom.heseltine@cs.york.ac.uk, www.cs.york.ac.uk/~tomh
1 INTRODUCTION
The eigenface technique for face recognition is an information theory approach based on principal component analysis,
as proposed by Turk and Pentland [1, 2]. We take a set of M training images and compute the eigenvectors of their
covariance matrix, then select the M′ eigenvectors (eigenfaces) with the highest eigenvalues to define an image
subspace (face space). By projecting a face image into face space we obtain a ‘face-key’ vector of M′ dimensions. We
define the ‘likeness’ of any two face images as the Euclidean distance between their respective face-key vectors.
Using this method, we perform many comparisons between different images of the same face and images of different
faces. By applying a range of threshold values to the distance measurements of these comparisons, we obtain false
acceptance rates (FAR) and false rejection rates (FRR). The equal error rate (EER), attained at a specific threshold
value, is used as a single measure of the effectiveness of the system.
A problem inherent to information theory approaches is that there are often features within the training data, identified
as principal components, that discriminate not between faces but between the circumstances under which the images
were captured. These may include such factors as lighting direction, intensity and colour, head orientation, image
quality and facial expression. For example, suppose some images in the training data were taken with bright sunlight
shining on one side of the face. The feature of having one side of the face lighter than the other may be identified as a
principal component and hence used to distinguish between different people. This should clearly not be the case.
We introduce a pre-processing step as an addition to the above system, in an attempt to reduce the presence of these
unwanted features in both the training and test data. This pre-processing takes place prior to any principal component
analysis of the training data and before projection of any test images into face space. The result is that the eigenfaces
produced are a more accurate representation of the differences between people's faces, rather than of the differences in
environmental factors during image capture. The image processing techniques include a range of standard image
filtering algorithms as found in many graphics applications, some image normalisation methods (including those
discussed by Finlayson et al. [3, 4]) and some of our own image processing algorithms.
2 RELATED WORK
The problems caused by changes in lighting conditions have been well researched. Adini, Moses and Ullman suggest
that the differences between images of one face under different illumination conditions are greater than the differences
between images of different faces under the same illumination conditions [5]. Various face recognition systems have
attempted, with some success, to identify and compensate for the effect of lighting conditions. Zhao and Chellappa
use a generic 3D surface of a face, together with a varying albedo reflectance model and a Lambertian physical
reflectance model, to compensate for both lighting and head orientation [6], before applying a recognition system based
on linear discriminant analysis. Much research has also been carried out to improve eigenface recognition systems.
Cutler has shown that the eigenface approach can successfully be applied to infrared images [7], resulting in a much
decreased error rate. This suggests that an artificial infrared light source could be used to reduce the effect of external
light sources, producing an accurate system for use in security applications such as site access. However, the use of
such a light source is not always practical, particularly if the camera is far from the subject.
Pentland, Moghaddam and Starner extended their eigenface system to include multiple viewing angles of a person's
face [8], improving the system's performance when applied to faces of various orientations. They also incorporate a
modular eigenspace system [8], which significantly improves the overall performance, although it does not tackle the
lighting problem directly. Belhumeur, Hespanha and Kriegman use Fisher's linear discriminant to capture the
similarities between multiple images in each class (per person) [9], hoping to discount the variations due to lighting from
the defined subspace. Their results show a significant improvement over the standard eigenface approach (from 20% to
less than 2% error rate). However, as the eigenface and Fisherface methods are both information theory
approaches, it is likely that additional image processing could improve both. Although Adini, Moses and
Ullman point out that there is no image representation that can be completely invariant to lighting conditions, they do
show that different representations of images, on which lighting has less of an effect, can significantly reduce the
difference between two images of the same face [5].
3 THE FACE DATABASE
We conduct experiments using a database of 960 bitmap images of 120 individuals (60 male, 60 female), extracted from
the AR Face Database provided by Martinez and Benavente [10]. We separate the database into two disjoint sets: i) the
training set, containing 60 images of different people of various gender, race and age, taken under natural lighting
conditions with neutral expression; ii) the test set, containing 900 images (15 images each of 60 people of various
gender, race and age). Each of the 15 images was taken under the conditions described in Table 1 (examples in Figure 1).
Table 1. Image capture conditions.

Expression \ lighting | Natural        | From the left  | From the right | From left & right
Neutral expression    | Day 1, Day 2   | Day 1, Day 2   | Day 1, Day 2   | Day 1, Day 2
Happy expression      | Day 1, Day 2   | -              | -              | -
Angry expression      | Day 1, Day 2   | -              | -              | -
Mouth covered         | Day 1          | Day 1          | Day 1          | -
All the images are stored as bitmaps (converted into vectors for PCA). After initial investigations to determine an
appropriate resolution, we find no significant change in the EER for resolutions ranging from 100 pixels between the
eyes down to just 15. As a compromise between test execution time and the possibility that some pre-processing
techniques work better at higher resolutions, we select a distance of 25 pixels between the eyes for all further
experiments. All images are scaled and rotated such that the centres of the eyes are aligned 25 pixels apart, using our
eye detection algorithm (not described here). Each image is cropped to a width and height of 75 and 112 pixels respectively.
Figure 1. Example test set images for a single person (first 6 repeated on two days).
4 DEFINING FACE SPACE
In this section we review the eigenface method of face recognition for completeness. Consider our training set of
images of 75 by 112 pixels. These images can be represented as two-dimensional (75 by 112) arrays of pixel
intensity data or, equivalently, as vectors of 8400 (75x112) dimensions. Interpreting these vectors as points within an
8400-dimensional space means that every possible image (of 75 by 112 pixels) occupies a single point within this
image space. Moreover, similar images (for example, images of faces) should occupy points within a fairly localised
region of this image space. Taking this idea a step further, we assume that different images of the same face map to
nearby points in image space and images of different faces map to points far apart. Ideally, we wish to extract the
region of image space that contains faces, reduce the dimensionality to a practical value, yet maximise the spread of
different faces within the image subspace. Here we apply principal component analysis to define a space with these
properties.
We take the set of M training images (in our case M = 60), {Γ1, Γ2, Γ3, …, ΓM}, and compute the average image

\Psi = \frac{1}{M} \sum_{n=1}^{M} \Gamma_n ,

followed by the difference of each image from the average image, \Phi_n = \Gamma_n - \Psi. Thus we construct the covariance matrix as

C = \frac{1}{M} \sum_{n=1}^{M} \Phi_n \Phi_n^T = AA^T, \quad \text{where } A = [\Phi_1\ \Phi_2\ \Phi_3 \ldots \Phi_M] \qquad (1)
The eigenvectors and eigenvalues of this covariance matrix are calculated using standard linear methods. These
eigenvectors describe a set of axes within the image space, along which there is the most variance in the face images,
and the corresponding eigenvalues represent the degree of variance along these axes. The M eigenvectors are sorted in
order of descending eigenvalues and the M′ greatest eigenvectors (in our system M′ = 30) are chosen to represent face
space. The effect is that we have reduced the dimensionality of the space to M′, yet maintained a high level of variance
between face images throughout the image subspace.
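For concreteness, the following NumPy sketch computes a face space in the manner described (the function and variable names are ours; the paper does not specify an implementation). It uses the well-known trick from the eigenface literature of diagonalising the small M x M matrix AᵀA rather than the full 8400 x 8400 covariance matrix:

```python
import numpy as np

def build_face_space(images, n_components=30):
    """Compute the mean face and the n_components largest eigenfaces.

    images: (M, P) array of M training images flattened to P pixels.
    Eigenvectors of the small M x M matrix A^T A are mapped back to
    eigenvectors of C = A A^T / M.
    """
    mean_face = images.mean(axis=0)                   # Psi
    A = (images - mean_face).T                        # columns are Phi_n, shape (P, M)
    eigvals, eigvecs = np.linalg.eigh(A.T @ A)        # small M x M problem, ascending
    order = np.argsort(eigvals)[::-1][:n_components]  # keep the M' largest
    eigenfaces = A @ eigvecs[:, order]                # back to image space, (P, M')
    eigenfaces /= np.linalg.norm(eigenfaces, axis=0)  # unit-length axes
    return mean_face, eigenfaces
```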
Each eigenvector contains 8400 elements (the number of pixels in the original bitmap) and can be displayed as an
image, as shown in Figure 2.
Figure 2. Average face image and the first 5 eigenfaces defining a face space with no image pre-processing.
Owing to their likeness to faces, Turk and Pentland refer to these vectors as eigenfaces, and to the space they define as
face space.
5 VERIFICATION OF FACE IMAGES
Once face space has been defined, we can project any image into face space by a simple matrix multiplication:
\omega_k = u_k^T (\Gamma - \Psi), \quad \text{for } k = 1, \ldots, M' \qquad (2)

where u_k is the kth eigenvector and ω_k is the kth weight in the vector Ω^T = [ω1, ω2, ω3, …, ωM′]. The M′ weights
represent the contribution of each respective eigenface; by multiplying each eigenface by its weight and summing,
we can view the face image as mapped into face space (shown in Figure 3).
Figure 3. Test images and their face space projections.
The vector Ω is taken as the ‘face-key’ for a person's image projected into face space. We compare any two face-keys
by a simple Euclidean distance measure, \varepsilon = \| \Omega_a - \Omega_b \|. An acceptance (the two face images
match) or rejection (the two images do not match) is determined by applying a threshold: any comparison producing a
distance below the threshold is a match.
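Continuing the sketch from Section 4, projection (equation 2) and verification reduce to a few lines; `build_face_space` and the names below are our own:

```python
import numpy as np

def project(image, mean_face, eigenfaces):
    """Equation (2): face-key with components omega_k = u_k^T (Gamma - Psi)."""
    return eigenfaces.T @ (image - mean_face)         # length M' vector Omega

def is_match(image_a, image_b, mean_face, eigenfaces, threshold):
    """Accept the pair iff the Euclidean distance between the two
    face-keys falls below the verification threshold."""
    distance = np.linalg.norm(project(image_a, mean_face, eigenfaces)
                              - project(image_b, mean_face, eigenfaces))
    return distance < threshold
```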
To gather results for the false rejection rate, each of the 15 images of a single person is compared with every other
image of that person's face. No image is compared with itself and each pair is compared only once (the relationship is
symmetric), giving

n_c = \tfrac{1}{2} n_p (n_i^2 - n_i) = 6300

comparisons to test false rejection, where n_p is the number of people and n_i is the number of face images per person.

False acceptance results are gathered using only images of the type "Day 1, neutral expression, natural lighting" and
"Day 2, neutral expression, natural lighting." Other comparisons are unlikely to produce false acceptances (owing to the
combination of different lighting, expression and obscuring), resulting in an initially low EER; hence any effect of image
processing on the more problematic comparisons would appear diminished. Using these images, every person is
compared with every other person. This gives 4 comparisons per pair of people, with no person compared with himself
and each pair compared only once. Thus, a total of

n_c = 2(n_p^2 - n_p) = 7080

comparisons are made between different people to test false acceptance, where n_p is the number of people (60).
Hence, each equal error rate is based on 13380 verification attempts, using a set of comparisons under a range of
conditions, such that the FRR and the FAR are maximised. For each threshold value we produce a FAR and a FRR. By
applying a range of threshold values we produce a range of FAR and FRR pairs that are plotted on a graph. The results
for our benchmark system (no pre-processing) can be seen below. The equal error rate can be seen as the point where
FAR equals FRR (Figure 4).
Figure 4. Error rates from various thresholds using an eigenface system with no image pre-processing.
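As an illustration of how the EER can be located, a sketch (our own formulation, not the paper's) that sweeps the threshold over all observed distances and returns the operating point where FAR and FRR are closest:

```python
import numpy as np

def equal_error_rate(genuine_distances, impostor_distances):
    """genuine_distances: same-face comparison distances (6300 here);
    impostor_distances: different-face comparison distances (7080 here).
    Returns the error rate at the threshold where FAR is closest to FRR."""
    thresholds = np.sort(np.concatenate([genuine_distances, impostor_distances]))
    gap, eer = np.inf, None
    for t in thresholds:
        frr = np.mean(genuine_distances >= t)   # genuine pairs wrongly rejected
        far = np.mean(impostor_distances < t)   # impostor pairs wrongly accepted
        if abs(far - frr) < gap:
            gap, eer = abs(far - frr), (far + frr) / 2
    return eer
```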
6 IMAGE PRE-PROCESSING
We are now in a position to test the effect of image pre-processing on the system performance. A summary of our
procedure is given in Figure 5. We present a range of pre-processing techniques, which may affect the EER of the
eigenface system, when applied to face images prior to recognition. The image processing methods fall into four main
categories: colour normalisation methods, statistical methods, convolution methods and combinations of these methods.
The methods are used to produce a single scalar value for each pixel. Examples of these pre-processing methods can be
seen in Figure 7.
Figure 5. Chart showing the test procedure
6.1 COLOUR NORMALISATION METHODS
6.1.1 INTENSITY: INTENSITY NORMALISATION
We use a well-known image intensity normalisation method, in which we assume that, as the intensity of the lighting
source increases by a factor, each RGB component of each pixel in the image is scaled by the same factor. We remove
the effect of this intensity factor by dividing by the sum of the three colour components.
\left( r_{norm},\ g_{norm},\ b_{norm} \right) = \left( \frac{r}{r+g+b},\ \frac{g}{r+g+b},\ \frac{b}{r+g+b} \right) \qquad (3)
Since the pixels of the resulting image have equal intensity, summing the three colour channels would result in a blank
image. Therefore, to create an image with single scalar values for each pixel (as required by our eigenface system) we
can either take a single colour channel, or sum just the red and green components (the chromaticities).
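A sketch of this step for an H x W x 3 image array (the guard against zero-sum pixels is our addition):

```python
import numpy as np

def intensity_normalise(rgb):
    """Equation (3): divide each pixel by the sum of its colour components."""
    s = rgb.sum(axis=-1, keepdims=True).astype(float)
    s[s == 0] = 1.0                                   # avoid division by zero
    return rgb / s

def chromaticities(rgb):
    """Single scalar per pixel: the summed normalised R and G channels."""
    norm = intensity_normalise(rgb)
    return norm[..., 0] + norm[..., 1]
```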
6.1.2 GREY WORLD: GREY WORLD NORMALISATION
Here we take a similar approach to the above normalisation, but compensate for the effect of variations in the colour
of the light source. Different colours of light cause the RGB colour components of an image to scale apart, by factors α,
β and γ respectively: (r_new, g_new, b_new) = (αr, βg, γb). This is normalised using the equation below, where N is the
number of pixels in the image and r_i, g_i, b_i are the components of the ith pixel:

\left( r_{norm},\ g_{norm},\ b_{norm} \right) = \left( \frac{Nr}{r_1 + r_2 + \ldots + r_N},\ \frac{Ng}{g_1 + g_2 + \ldots + g_N},\ \frac{Nb}{b_1 + b_2 + \ldots + b_N} \right) \qquad (4)
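Equivalently, each channel is divided by its image-wide mean; a minimal sketch:

```python
import numpy as np

def grey_world_normalise(rgb):
    """Equation (4): scale each colour channel by N / (sum over all pixels),
    i.e. divide by the channel mean, cancelling the illuminant factors
    alpha, beta and gamma."""
    channel_means = rgb.reshape(-1, 3).mean(axis=0).astype(float)
    return rgb / channel_means
```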
6.1.3 COMPREHENSIVE: COMPREHENSIVE COLOUR IMAGE NORMALISATION
We use an algorithm proposed by Finlayson et al. [3], which normalises an image for variations in both lighting
geometry and illumination colour. The method involves the repetition of intensity normalisation followed by grey world
normalisation (as described above), until the resulting image reaches a stable state (i.e. the change in pixel values from
one cycle to another is sufficiently small).
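Using the intensity_normalise and grey_world_normalise sketches above, the iteration might look like this (the tolerance and iteration cap are our illustrative assumptions):

```python
import numpy as np

def comprehensive_normalise(rgb, tol=1e-6, max_iterations=50):
    """Alternate intensity and grey world normalisation until the image
    stops changing, i.e. reaches the fixed point of the iteration."""
    current = rgb.astype(float)
    for _ in range(max_iterations):
        updated = grey_world_normalise(intensity_normalise(current))
        if np.abs(updated - current).max() < tol:
            break
        current = updated
    return current
```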
6.1.4 HSV HUE: STANDARD DEFINITION OF HUE
The hue of an image is calculated using the standard hue definition, such that each pixel is represented by a single
scalar value H:

H = \cos^{-1}\left( \frac{\tfrac{1}{2}\left[ (r-g) + (r-b) \right]}{\sqrt{(r-g)^2 + (r-b)(g-b)}} \right) \qquad (5)
6.1.5 BGI HUE: HUE THAT IS INVARIANT TO BRIGHTNESS AND GAMMA
Finlayson and Schaefer introduce a definition of hue that is invariant to brightness (the scaling of each colour channel
by a constant factor) and gamma (raising the colour channels to a constant power) [4], both of which are often caused by
variations in scene environment and capture equipment:

H = \tan^{-1}\left( \frac{\log(r) - \log(g)}{\log(r) + \log(g) - 2\log(b)} \right) \qquad (6)
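A sketch of this computation; the epsilon guard and the use of arctan2 (which also copes with a zero denominator) are our choices:

```python
import numpy as np

def bgi_hue(rgb, eps=1e-6):
    """Equation (6): hue from ratios of log-channel differences, which
    cancels a common channel scale (brightness) and exponent (gamma)."""
    log_rgb = np.log(rgb.astype(float) + eps)     # eps guards log(0)
    r, g, b = log_rgb[..., 0], log_rgb[..., 1], log_rgb[..., 2]
    return np.arctan2(r - g, r + g - 2.0 * b)
```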
In summary, the colour normalisation methods used to process the images are shown in Table 2.
Intensity Colour intensity normalisation.
Chromaticities Summation of the R, G components of colour intensity normalisation.
Grey world Grey world normalisation.
Comprehensive Comprehensive colour normalisation.
Comprehensive chromes Summation of the R, G components of comprehensive normalisation.
Hsv hue Standard hue definition.
Bgi hue Brightness and gamma invariant hue.
Table 2. Colour normalisation methods.
6.2 STATISTICAL METHODS
We introduce some statistical methods that apply transformations to the image intensity values in order to make the
brightness and contrast constant for all images. The effect is that every image appears equally bright (as a
whole) and spans an equal range of brightness values.
These statistical methods can be applied in a number of ways, mainly by varying the areas of the image from which the
statistics are gathered. Lighting conditions will not necessarily be the same at all points on the face, as
the face itself can cast shadows. Therefore, in order to compensate for the variations in lighting conditions across a
single face, we can apply these methods to individual regions of the face. This means that we are not only
compensating for a difference in lighting conditions from one image to another, but also for different lighting conditions
from one area of the face to another. In summary, the methods we evaluated are shown in Table 3 (a sketch of the
moment normalisation follows the table).
Brightness Global transformation of brightness, such that intensity moments are normalised.
Horizontal brightness Application of brightness method to individual rows of pixels.
Vertical brightness Application of brightness method to individual columns of pixels.
Local brightness Application of brightness method to individual local regions of an image.
Local brightness mean Transformation of brightness, such that the mean becomes a constant specified
value within local regions of the image.
Table 3. Statistical methods used.
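As referenced above, a sketch of the moment normalisation and its local variant (the block size and target moments are illustrative assumptions, not the paper's values):

```python
import numpy as np

def normalise_moments(gray, target_mean=128.0, target_std=40.0):
    """Shift and scale intensities so the image's mean and standard
    deviation (its first two moments) take fixed target values."""
    std = gray.std()
    std = std if std > 0 else 1.0      # flat regions stay flat
    return (gray - gray.mean()) / std * target_std + target_mean

def local_brightness(gray, block=16, **kwargs):
    """Apply the moment normalisation to each block x block region
    independently, compensating for lighting that varies across the face."""
    out = np.empty(gray.shape, dtype=float)
    for i in range(0, gray.shape[0], block):
        for j in range(0, gray.shape[1], block):
            region = gray[i:i + block, j:j + block]
            out[i:i + block, j:j + block] = normalise_moments(region, **kwargs)
    return out
```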
6.3 CONVOLUTION METHODS
Convolution methods involve the application of a small template to a window moved, step by step, over the original
image. These templates can be configured to enhance or suppress features, reduce noise and extract edges. The
templates evaluated are described in Table 4 (a sketch follows the table).
Smooth (σ= 0.788) Standard low-pass filtering using a 3x3 pixel template.
Smooth more (σ= 1.028) Similar to the above, only with a larger 5x5 pixel neighbourhood.
Blur An extreme blurring effect.
Edge Enhances the edges of an image.
Edge more A stronger version of the above.
Find edges Segmentation of an image to include only those pixels that lie on edges.
Contour Similar to Find edges, only more sensitive to changes in contrast.
Detail Enhance areas of high contrast.
Sharpen Reduces the blur in the image.
Emboss A stylised filter that enhances edges with a shadow-casting effect.
Table 4. Convolution methods used.
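A sketch of template convolution; the paper does not give its exact kernels, so the two templates below are common illustrative choices only:

```python
import numpy as np
from scipy.signal import convolve2d

SMOOTH = np.ones((3, 3)) / 9.0                    # simple 3x3 low-pass template
SHARPEN = np.array([[ 0., -1.,  0.],
                    [-1.,  5., -1.],
                    [ 0., -1.,  0.]])             # basic edge-enhancing template

def apply_template(gray, template):
    """Slide the template over the image (2-D convolution), keeping
    the output the same size as the input."""
    return convolve2d(gray, template, mode="same", boundary="symm")
```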
6.4 METHOD COMBINATIONS
In an attempt to capture the advantages of multiple image processing methods, we combine some of the methods that
produce the best improvement in EER, as shown in Table 5 (a composition sketch follows the table).
Contour -> Smooth Contour filtering followed by smoothing.
Smooth -> Contour Smoothing followed by contour filtering.
Local brightness -> Smooth Local brightness transformation followed by smoothing.
Local brightness -> Contour Local brightness transformation followed by contour filtering.
Contour + Local brightness The summation of the resulting images from the Contour filter and the Local
Brightness transformation.
C -> S + LB Contour filtering followed by smoothing, summed with the Local Brightness
transformation.
S -> LB -> C Smoothing followed by the Local Brightness transformation, followed by
Contour filtering.
Table 5. Method combinations used.
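The sequential combinations are simple function compositions; a sketch, where `contour` stands in for whatever contour filter implementation is used and is hypothetical (the summed variants instead add the two filtered images pixel-wise):

```python
def combine(*stages):
    """Compose pre-processing stages left to right, so that
    combine(local_brightness, contour) realises 'Local brightness -> Contour'."""
    def pipeline(image):
        for stage in stages:
            image = stage(image)
        return image
    return pipeline
```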
7 RESULTS
We present the results produced by the various image processing methods as a bar chart of EERs (Figure 6). The
baseline eigenface system (no image processing) is displayed in the chart as a dark red bar. It can be seen that the
majority of image processing methods did produce some improvement to the eigenface system. What is surprising,
however, is the large increase in error rate produced by some of the colour normalisation methods,
most notably the brightness and gamma invariant hue introduced by Finlayson and Schaefer [4]. We believe that the
reduction in effectiveness of such methods is due to the loss of information during these procedures: edges
become less defined and some of the shading due to geometrical structure is lost (see Figure 7). An increase in error
rate is also witnessed with the blurring filters. It is therefore not surprising that the edge-enhancing methods had a
positive impact on the EERs (the find edges and contour filters were particularly effective), as did the statistical
methods, which normalise intensity moments (increasing the shading gradient in many areas).
Having identified the most successful image processing method of those evaluated (normalising intensity moments
within local regions of the image, then applying a convolution contour filter), we improve the system further by
testing different croppings of the images to find the optimum crop for this method, reaching an EER of 22.4%
(Figure 8).
8 CONCLUSION
We have shown that the eigenface-based method of face recognition can be significantly improved by means of simple
image pre-processing techniques. Without any alterations to the eigenface technique itself, an EER of 22.4%
can be achieved (a reduction of 11.6 percentage points) using a data set containing a majority of extremely difficult
images (20% of the images are partially obscured and 40% of the images have extreme lighting conditions).
There are some factors that may account for the remaining 22.4% error, which were not compensated for by the
image pre-processing techniques. Firstly, the eye detection algorithm was by no means perfect and, although an attempt
was made to manually correct any misaligned images, it is clear (from browsing the database) that some images are not
well aligned. It would be relatively simple to implement a system in which several small translations and scalings of the
original image were projected into face space for each recognition attempt, hence compensating for any inaccuracies in
the alignment procedure.
Comparable improvements have been witnessed in similar PCA methods of face recognition, such as Pentland et al.'s
modular eigenspace system [8] and Belhumeur et al.'s comparison with Fisherfaces [9]. It is likely that the image
pre-processing methods described could be of similar benefit to these algorithms, resulting in a greatly improved face
recognition system.
REFERENCES
1. M. Turk, A. Pentland. "Eigenfaces for Recognition", Journal of Cognitive Neuroscience, Vol. 3, pp. 72-86, 1991.
2. M. Turk, A. Pentland. "Face Recognition Using Eigenfaces", In Proc. IEEE Conf. on Computer Vision and Pattern
Recognition, pp. 586-591, 1991.
3. G.D. Finlayson, B. Schiele, J.L. Crowley. "Comprehensive Colour Image Normalisation", Proc. ECCV '98, LNCS
1406, Springer, pp. 475-490, 1998.
4. G. Finlayson, G. Schaefer. "Hue that is Invariant to Brightness and Gamma", BMVC 2001, Session 3: Colour &
Systems, 2001.
5. Y. Adini, Y. Moses, S. Ullman. "Face Recognition: the Problem of Compensating for Changes in Illumination
Direction", IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 19, No. 7, pp. 721-732, 1997.
6. W. Zhao, R. Chellappa. "3D Model Enhanced Face Recognition", In Proc. Int. Conf. on Image Processing, Vancouver,
2000.
7. R. Cutler. "Face Recognition Using Infrared Images and Eigenfaces", citeseer.nj.nec.com/456378.html, 1996.
8. A. Pentland, B. Moghaddam, T. Starner. "View-Based and Modular Eigenspaces for Face Recognition", Proc. of IEEE
Conf. on Computer Vision and Pattern Recognition (CVPR '94), 1994.
9. P. Belhumeur, J. Hespanha, D. Kriegman. "Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear
Projection", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, No. 7, pp. 711-720, 1997.
10. A.M. Martinez, R. Benavente. "The AR Face Database", CVC Technical Report #24, 1998.
Figure 6. Equal Error Rate Results for various image processing methods.
Figure 7. Examples of pre-processed images
Figure 8. Error rates for base-line system, most successful pre-processing and image crop