Sparse analysis based fault deviations modeling and its application to fault diagnosis

1 INTRODUCTION
With the rapid development of chemical processes, their large-scale and complex characteristics have driven increasing interest in fault detection and diagnosis over the last few decades [1-4]. Multivariate statistical analysis methods, such as partial least squares (PLS) [5], principal component analysis (PCA) [6], and Fisher discriminant analysis (FDA) [7], have been widely applied in statistical process monitoring (SPM). They all share the characteristics of dimensionality reduction and the ability to handle highly correlated variables. In PCA monitoring, two subspaces, the principal component subspace (PCS) and the residual subspace (RS), are monitored by the $T^2$ and SPE statistics respectively. When the values of the monitoring statistics run out of their desired regions, it can be concluded that some abnormal or faulty behavior has occurred. After fault detection, it is hoped that the type of abnormal condition can be quickly confirmed and the necessary corrective actions taken to remove it, that is, to bring the out-of-control monitoring statistics back to normal. Dunia and Qin [8] defined a fault reconstruction concept in the context of PCA models, which consists of finding the reconstruction direction and bringing the data back to the normal region along the fault subspace.
FDA [9-10] is a popular method for fault diagnosis, in which fault diagnosis is treated as a classification problem. However, it has been reported to have several drawbacks and limitations: the singularity problem caused by the within-class scatter matrix, the limit on the number of discriminant components imposed by the between-class scatter matrix, and the non-orthogonality of the discriminant components. Several improved methods [11-13] have been proposed to solve the singularity problem of the within-class scatter matrix. Zhao et al. [14] proposed a nested-loop Fisher discriminant analysis (NeLFDA) algorithm, which performs an inner-loop and outer-loop calculation to address the three problems of the conventional FDA algorithm. However, it cannot comprehensively extract fault features, as the variations of variance from normal data to fault data are not considered. By comprehensively considering the fault types in the process, Zhao et al. [15] proposed a fault degradation oriented Fisher discriminant analysis (FDFDA) method, which brings the variance variations between fault data and normal data into the objective function of FDA.

This work is supported by the National Natural Science Foundation of China (Nos. 61422306 and 61433005).
Nevertheless, these methods treat the whole set of measurement variables as a single subject and do not isolate the specific faulty variables. The contribution plots method [16-17] has been widely used for isolating faulty variables by comparing the contributions of different variables to the out-of-control monitoring statistics. However, it may lead to confusing results, as the contribution of faulty variables may disseminate to other variables. Instead of working in the context of PCA, Qin et al. [18] identified the major contributing variables based on FDA directions; this approach does not probe into the relationships among specific variables and is still limited by the drawbacks of the traditional FDA method. Zhao et al. [19] proposed a faulty variable isolation method that identifies faulty variables along the NeLFDA direction, which overcomes the limitations of traditional FDA. But in that method variables are evaluated and isolated one by one, which is inefficient, and the relationships among specific variables are still not probed.
Sparse analysis based fault deviations modeling and its application to fault diagnosis

Yue Wang¹, Chunhui Zhao¹, Youxian Sun¹
1. State Key Laboratory of Industrial Control Technology, College of Control Science and Engineering, Zhejiang University, Hangzhou, 310027, China
E-mail: chhzhao@zju.edu.cn

Abstract: In a fault process, some specific variables are disturbed significantly and cover much of the fault information, while some irresponsible variables keep relations similar to those of the normal condition. Therefore, this paper proposes a sparse relative discriminant fault deviations (SRDFD) modeling algorithm which can extract fault directions and isolate faulty variables simultaneously to improve fault diagnosis performance. In the proposed algorithm, a sparse objective function is formulated by bringing an L1 penalization into the objective function of the fault degradation oriented FDA (FDFDA) algorithm, which improved the traditional FDA algorithm by further considering the relative variations of variance between fault data and normal data. The proposed objective function is not concave, so the minorization-maximization approach is used to optimize it efficiently. Then the soft threshold operator is applied to obtain analytic solutions. The extracted sparse directions and the corresponding loadings are used as reconstruction models to eliminate fault deviations. Online fault diagnosis is then conducted by finding the correct reconstruction models which can best eliminate the out-of-control monitoring statistics. The performance is verified on the pre-programmed faults of the Tennessee Eastman (TE) benchmark process.

Key Words: faulty variable isolation, relative variations of variance, FDA, minorization-maximization

978-1-5090-4657-7/17/$31.00 © 2017 IEEE
In general, for each fault case, some specific variables are disturbed significantly and cover much of the fault information. It is important to isolate these faulty variables so as to improve the diagnosis performance. Therefore, this paper proposes a sparse relative discriminant fault deviations (SRDFD) modeling algorithm for comprehensively extracting fault features and isolating faulty variables at one time. In this algorithm, we bring an L1 penalization into the objective function in the work of Zhao et al. [15], which integrates dispersion and the variations of variance between normal data and fault data, so that the resulting coefficients of irresponsible variables are equal to zero in the extracted directions. A minorization-maximization approach [20] and the soft threshold operator are employed to solve the proposed objective function. The extracted sparse directions and the corresponding loadings are used for reconstruction. Online fault diagnosis is then conducted by finding the correct reconstruction models which can best eliminate the out-of-control monitoring statistics.

The remainder of the paper is arranged as follows. First, the motivation of the proposed algorithm is presented. Then, the proposed algorithm is mathematically formulated and verified on pre-programmed faults from the TE process. At last, conclusions are drawn on the basis of the results of this study.
2 METHODOLOGY

2.1 Motivation
FDA is a widely used fault diagnosis method, by which data from different classes are well separated along the extracted directions so that fault diagnosis can be performed. However, traditional FDA may not extract comprehensive fault features because it does not take the variations of variance between fault data and normal data into consideration. Therefore, Zhao et al. [15] proposed a fault degradation oriented Fisher discriminant analysis (FDFDA) method which integrates dispersion and the variations of variance between normal data and fault data. The objective function is shown below:
$$J = \max_{\mathbf{w}} \left( \frac{\mathbf{w}^{\mathrm{T}}\mathbf{S}_b\mathbf{w}}{\mathbf{w}^{\mathrm{T}}\mathbf{S}_n\mathbf{w}} + \beta\,\frac{\mathbf{w}^{\mathrm{T}}\mathbf{S}_f\mathbf{w}}{\mathbf{w}^{\mathrm{T}}\mathbf{S}_n\mathbf{w}} \right) \quad (1)$$
where $\mathbf{S}_b$ is the between-class scatter matrix; $\mathbf{S}_n$ and $\mathbf{S}_f$ are the within-class scatter matrices for normal data and fault data respectively; $\beta$ is a weighting factor.
The three matrices are calculated by the following expressions:
$$\mathbf{S}_b = N_f(\bar{\mathbf{x}}_f - \bar{\mathbf{x}})(\bar{\mathbf{x}}_f - \bar{\mathbf{x}})^{\mathrm{T}} \quad (2)$$
$$\mathbf{S}_f = \sum_{i=1}^{N_f}(\mathbf{x}_{f,i} - \bar{\mathbf{x}}_f)(\mathbf{x}_{f,i} - \bar{\mathbf{x}}_f)^{\mathrm{T}} \quad (3)$$
$$\mathbf{S}_n = \sum_{i=1}^{N_n}(\mathbf{x}_{n,i} - \bar{\mathbf{x}}_n)(\mathbf{x}_{n,i} - \bar{\mathbf{x}}_n)^{\mathrm{T}} \quad (4)$$
where $\bar{\mathbf{x}}_f$ and $\bar{\mathbf{x}}_n$ are the mean vectors of fault data and normal data respectively; $\bar{\mathbf{x}}$ is the mean vector of the total samples; $\mathbf{x}_{f,i}$ and $\mathbf{x}_{n,i}$ denote the $i$th sample of fault data and normal data respectively.
The weighting factor $\beta$ is calculated on the basis of $\mathbf{S}_f$ and $\mathbf{S}_b$:
$$\beta = \frac{tr(\mathbf{S}_b)}{tr(\mathbf{S}_f)} \quad (5)$$
where $tr(\cdot)$ denotes the trace, i.e., the sum of eigenvalues of the matrix.
By assuming that $\mathbf{w}^{\mathrm{T}}\mathbf{S}_n\mathbf{w} = 1$ and using a Lagrange multiplier, a conventional eigenvalue problem is obtained:
$$\mathbf{S}_n^{-1}(\mathbf{S}_b + \beta\mathbf{S}_f)\mathbf{w} = \lambda\mathbf{w} \quad (6)$$
Then a set of fault directions can be obtained at one time by performing singular value decomposition (SVD) on Eq. (6). Nevertheless, the work of Zhao et al. did not consider isolating the specific faulty variables to accurately describe the fault characteristics. Moreover, in the traditional SVD-based solution, the extracted components are usually linear combinations of all the original variables.
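To make the construction above concrete, the following is a minimal numpy sketch of the FDFDA eigenproblem in Eqs. (2)-(6). The helper name `fdfda_directions` is hypothetical, and the data matrices are assumed to hold samples in rows and to be already normalized; this is a sketch, not the authors' implementation.

```python
import numpy as np

def fdfda_directions(Xn, Xf, n_dirs=2):
    """Sketch of Eqs. (2)-(6): solve S_n^{-1} (S_b + beta * S_f) w = lambda * w.

    Xn, Xf: normal / fault data with samples in rows (assumed normalized).
    Returns the leading n_dirs eigenvectors and the weighting factor beta.
    """
    xbar_n = Xn.mean(axis=0)
    xbar_f = Xf.mean(axis=0)
    xbar = np.vstack([Xn, Xf]).mean(axis=0)           # mean of the total samples
    Nf = Xf.shape[0]
    Sb = Nf * np.outer(xbar_f - xbar, xbar_f - xbar)  # Eq. (2)
    Sf = (Xf - xbar_f).T @ (Xf - xbar_f)              # Eq. (3)
    Sn = (Xn - xbar_n).T @ (Xn - xbar_n)              # Eq. (4)
    beta = np.trace(Sb) / np.trace(Sf)                # Eq. (5)
    M = np.linalg.solve(Sn, Sb + beta * Sf)           # S_n^{-1}(S_b + beta * S_f)
    vals, vecs = np.linalg.eig(M)
    order = np.argsort(-vals.real)                    # sort by decreasing eigenvalue
    return vecs[:, order[:n_dirs]].real, beta
```

The eigenvectors associated with the largest eigenvalues then serve as the fault directions.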
2.2 The proposed algorithm
To further improve the power of feature extraction for fault diagnosis, a sparse relative discriminant fault deviations (SRDFD) modeling algorithm is presented by rebuilding the objective function in the work of Zhao et al. In this algorithm, extracting fault directions and isolating faulty variables are implemented at one time by bringing in a sparse constraint, the L1 penalization. The function is solved by the minorization-maximization approach, and the soft threshold operator is used to obtain analytic solutions. At last, the extracted directions and loadings are used as reconstruction models for fault diagnosis. The specifics are described below.
Two data sets are prepared: a normal data set $\mathbf{X}_n(N_n \times P)$, where the subscript $n$ denotes normal data, and one fault data set $\mathbf{X}_{f,m}(N_{f,m} \times P)$, where the subscripts $f$ and $m$ denote fault data and the fault class index respectively. The normal data are normalized and, for simplicity, still denoted by $\mathbf{X}_n$. The fault data, normalized with the same preprocessing information, are denoted by $\mathbf{X}_{f,m}$. Then the within-class scatter matrix for normal data ($\mathbf{S}_n$), the within-class scatter matrix for fault data ($\mathbf{S}_f$), the between-class scatter matrix ($\mathbf{S}_b$) and the weighting factor ($\beta$) are calculated by Eqs. (2)-(5).
(1) The proposed objective function
To extract fault directions and isolate faulty variables at one time, the proposed objective function is obtained by rebuilding the objective function of the FDFDA algorithm:
2017 29th Chinese Control And Decision Conference (CCDC)
$$J = \max_{\mathbf{w}} \left( \mathbf{w}^{\mathrm{T}}\mathbf{S}_b\mathbf{w} + \beta\,\mathbf{w}^{\mathrm{T}}\mathbf{S}_f\mathbf{w} - \lambda\sum_{j=1}^{P}\hat{\sigma}_j|w_j| \right), \quad \text{s.t. } \mathbf{w}^{\mathrm{T}}\hat{\mathbf{S}}_n\mathbf{w} = 1 \quad (7)$$
where $\mathbf{w}$ is the extracted direction; $P$ denotes the number of variables; $|\cdot|$ denotes the absolute value, so the penalty term is a weighted one-norm of $\mathbf{w}$; $\hat{\mathbf{S}}_n$ is the diagonal estimate of the within-class scatter matrix for normal data, in which $\hat{\sigma}_j^2$ is the $j$th diagonal element and $\hat{\sigma}_j$ is the within-class standard deviation of variable $j$ for normal data. Including $\hat{\sigma}_j$ in the penalty has the effect that variables which vary more within each class undergo greater penalization. When $\lambda$ is large, some elements of the solution $\mathbf{w}$ will be exactly zero, so that the resulting direction is sparse. In particular, the tuning parameter $\lambda$ is calculated as follows:
$$\lambda = \lambda^{(0)}\left\|\hat{\mathbf{S}}_n^{-1/2}(\mathbf{S}_b + \beta\mathbf{S}_f)\hat{\mathbf{S}}_n^{-1/2}\right\| \quad (8)$$
where $\|\cdot\|$ denotes the largest eigenvalue of the matrix; the value of the penalization factor $\lambda^{(0)}$ is determined by cross validation.
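Because $\hat{\mathbf{S}}_n$ is diagonal, the scaling in Eq. (8) reduces to an elementwise rescaling followed by a symmetric eigenvalue computation. The sketch below illustrates this under that assumption; `tuning_lambda` is a hypothetical helper name.

```python
import numpy as np

def tuning_lambda(Sb, Sf, beta, sigma_hat, lam0):
    """Eq. (8): lambda = lam0 * largest eigenvalue of
    S_n_hat^{-1/2} (S_b + beta * S_f) S_n_hat^{-1/2},
    where S_n_hat = diag(sigma_hat**2), so S_n_hat^{-1/2} = diag(1/sigma_hat)."""
    D = 1.0 / np.asarray(sigma_hat, dtype=float)
    M = np.outer(D, D) * (Sb + beta * Sf)    # entry (i, j) scaled by D_i * D_j
    return lam0 * np.linalg.eigvalsh((M + M.T) / 2.0).max()
```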
(2) The minorization-maximization approach
The proposed objective function is non-concave; therefore, the minorization-maximization approach is used to solve it, whose procedure is briefly introduced here. The minorization-maximization method finds a function $g(\mathbf{w}\,|\,\mathbf{w}^{(m)})$ that minorizes the function $f(\mathbf{w})$ at the point $\mathbf{w}^{(m)}$:
$$f(\mathbf{w}) \ge g(\mathbf{w}\,|\,\mathbf{w}^{(m)})\ \ \forall\,\mathbf{w}, \qquad f(\mathbf{w}^{(m)}) = g(\mathbf{w}^{(m)}\,|\,\mathbf{w}^{(m)}) \quad (9)$$
Then $\mathbf{w}$ can be calculated by iterating the following update until convergence:
$$\mathbf{w}^{(m+1)} = \arg\max_{\mathbf{w}}\ g(\mathbf{w}\,|\,\mathbf{w}^{(m)}) \quad (10)$$
For the objective function in Eq. (7), first define the matrix $\mathbf{S}_{sum} = \mathbf{S}_b + \beta\mathbf{S}_f$ for simplicity, which is obviously positive semi-definite, and define $f(\mathbf{w}) = \mathbf{w}^{\mathrm{T}}\mathbf{S}_{sum}\mathbf{w}$. Since $f$ is convex, it is minorized by its tangent at $\mathbf{w}^{(m)}$:
$$f(\mathbf{w}) \ge f(\mathbf{w}^{(m)}) + 2(\mathbf{w} - \mathbf{w}^{(m)})^{\mathrm{T}}\mathbf{S}_{sum}\mathbf{w}^{(m)} = 2\mathbf{w}^{\mathrm{T}}\mathbf{S}_{sum}\mathbf{w}^{(m)} - \mathbf{w}^{(m)\mathrm{T}}\mathbf{S}_{sum}\mathbf{w}^{(m)} \quad (11)$$
Therefore, the objective function in Eq. (7) can be transferred to the following form:
$$J = \max_{\mathbf{w}} \left( 2\mathbf{w}^{\mathrm{T}}\mathbf{S}_{sum}\mathbf{w}^{(m)} - \lambda\sum_{j=1}^{P}\hat{\sigma}_j|w_j| \right) \quad (12)$$
where $\mathbf{w}^{(m)}$ is a fixed value.
Thus the solution $\mathbf{w}$ is calculated by the following iteration procedure:
(a) Let $\mathbf{w}^{(0)}$ be the leading eigenvector of $\hat{\mathbf{S}}_n^{-1}\mathbf{S}_{sum}$.
(b) For $m = 0, 1, 2, \ldots$ until convergence, let $\mathbf{w}^{(m+1)}$ be the solution to
$$\mathbf{w}^{(m+1)} = \arg\max_{\mathbf{w}}\left\{ 2\mathbf{w}^{\mathrm{T}}\mathbf{S}_{sum}\mathbf{w}^{(m)} - \lambda\sum_{j=1}^{P}\hat{\sigma}_j|w_j| \right\} \quad (13)$$
$\mathbf{w}$ denotes the solution at convergence.
(3) The soft threshold operator
To solve the problem in Eq. (13), first consider the following problem:
$$\min_{\mathbf{d}} \left( \mathbf{d}^{\mathrm{T}}\hat{\mathbf{S}}_n\mathbf{d} - 2\mathbf{d}^{\mathrm{T}}\mathbf{S}_{sum}\mathbf{w}^{(m)} + \lambda\sum_{j=1}^{P}\hat{\sigma}_j|d_j| \right) \quad (14)$$
If $\mathbf{d} = \mathbf{0}$, then $\mathbf{w} = \mathbf{0}$. Otherwise, $\mathbf{w} = \mathbf{d}/\sqrt{\mathbf{d}^{\mathrm{T}}\hat{\mathbf{S}}_n\mathbf{d}}$.
As $\hat{\mathbf{S}}_n$ is the diagonal estimate of $\mathbf{S}_n$, problem (14) separates over coordinates and its solution is
$$d_j = \frac{1}{\hat{\sigma}_j^2}\,S\!\left( (\mathbf{S}_{sum}\mathbf{w}^{(m)})_j,\ \frac{\lambda\hat{\sigma}_j}{2} \right) \quad (15)$$
where $S$ is the soft threshold operator, defined as
$$S(x, a) = \operatorname{sgn}(x)\,(|x| - a)_+ \quad (16)$$
where $|\cdot|$ denotes the absolute value and $(\cdot)_+$ outputs the larger of the value in the bracket and zero.
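The operator of Eq. (16) is a one-liner in numpy; this small sketch makes its behavior explicit.

```python
import numpy as np

def soft_threshold(x, a):
    """Eq. (16): S(x, a) = sgn(x) * (|x| - a)_+ , applied elementwise."""
    x = np.asarray(x, dtype=float)
    return np.sign(x) * np.maximum(np.abs(x) - a, 0.0)
```

Values whose magnitude is below the threshold `a` are set exactly to zero, which is the mechanism that produces sparse directions.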
For the calculated direction $\mathbf{w}$, the extracted component is calculated for fault data and normal data as follows:
$$\mathbf{t}_{f,m} = \mathbf{X}_{f,m}\mathbf{w}, \qquad \mathbf{t}_n = \mathbf{X}_n\mathbf{w} \quad (17)$$
where $\mathbf{t}_{f,m}$ and $\mathbf{t}_n$ denote the components for fault data and normal data respectively.
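Putting Eqs. (13)-(16) together, one sparse direction can be extracted by the iteration sketched below. This is a minimal sketch under the assumption $\hat{\mathbf{S}}_n = \mathrm{diag}(\hat{\sigma}_j^2)$; `sparse_direction` is a hypothetical name, with no claim to match the authors' exact implementation.

```python
import numpy as np

def sparse_direction(S_sum, sigma_hat, lam, n_iter=200, tol=1e-8):
    """MM iteration for Eq. (13): each step solves Eq. (14) via Eq. (15),
    then rescales so that w^T S_n_hat w = 1, with S_n_hat = diag(sigma_hat**2)."""
    sigma_hat = np.asarray(sigma_hat, dtype=float)
    # (a) initialize with the leading eigenvector of S_n_hat^{-1} S_sum
    vals, vecs = np.linalg.eig(S_sum / sigma_hat[:, None] ** 2)
    w = vecs[:, np.argmax(vals.real)].real
    for _ in range(n_iter):
        g = S_sum @ w                                    # (S_sum w^(m))_j
        d = np.sign(g) * np.maximum(np.abs(g) - lam * sigma_hat / 2.0, 0.0)
        d /= sigma_hat ** 2                              # Eq. (15)
        if not d.any():                                  # d = 0  =>  w = 0
            return np.zeros_like(w)
        w_new = d / np.sqrt(d @ (sigma_hat ** 2 * d))    # w = d / sqrt(d^T S_n_hat d)
        if np.linalg.norm(w_new - w) < tol:
            return w_new
        w = w_new
    return w
```

With `lam = 0` the iteration reduces to a power-method-like computation of the leading FDFDA direction; increasing `lam` drives more coefficients exactly to zero.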
The specific procedure of the proposed algorithm is described below.
Step A. Fault direction extraction
For the rebuilt objective function in Eq. (7), one sparse fault direction $\mathbf{w}$ is obtained by performing the minorization-maximization approach and the soft threshold operator in Eq. (14) and Eq. (15) respectively.
Step B. Data deflation
To guarantee that the extracted components are orthogonal to each other, data deflation is necessary for fault data and normal data, removing the information of the extracted component:
$$\mathbf{p}_{f,m}^{\mathrm{T}} = (\mathbf{t}_{f,m}^{\mathrm{T}}\mathbf{t}_{f,m})^{-1}\mathbf{t}_{f,m}^{\mathrm{T}}\mathbf{X}_{f,m}, \qquad \mathbf{p}_n^{\mathrm{T}} = (\mathbf{t}_n^{\mathrm{T}}\mathbf{t}_n)^{-1}\mathbf{t}_n^{\mathrm{T}}\mathbf{X}_n$$
$$\mathbf{E}_{f,m} = \mathbf{X}_{f,m} - \mathbf{t}_{f,m}\mathbf{p}_{f,m}^{\mathrm{T}}, \qquad \mathbf{E}_n = \mathbf{X}_n - \mathbf{t}_n\mathbf{p}_n^{\mathrm{T}} \quad (18)$$
where $\mathbf{p}_{f,m}$ and $\mathbf{p}_n$ are the loading vectors for fault data and normal data respectively; $\mathbf{E}_{f,m}$ and $\mathbf{E}_n$ are the residuals of fault data and normal data with the information of $\mathbf{t}_{f,m}$ and $\mathbf{t}_n$ removed.
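A one-component deflation step of Eq. (18) can be sketched as follows; `deflate` is a hypothetical helper name.

```python
import numpy as np

def deflate(X, t):
    """Eq. (18): regress X on the extracted component t = X w and subtract."""
    p = (X.T @ t) / (t @ t)          # loading vector, p^T = (t^T t)^{-1} t^T X
    E = X - np.outer(t, p)           # residual, E = X - t p^T
    return E, p
```

The residual `E` is orthogonal to `t` by construction, which is what guarantees that the components extracted in successive iterations are orthogonal.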
Step C. Data updating
Use $\mathbf{E}_{f,m}$ and $\mathbf{E}_n$ in place of the fault data and normal data to update the within-class scatter matrix for fault data ($\mathbf{S}_f$) and the between-class scatter matrix ($\mathbf{S}_b$) by Eqs. (2) and (3).
Step D. Iterative implementation
Repeat Steps A to C to extract the next directions, loadings and components until all the needed results are gained.
The outputs are a set of extracted directions $\mathbf{W}(P \times R)$, a set of loadings $\mathbf{P}_{f,m}(P \times R)$ and a set of extracted components $\mathbf{T}_{f,m}(N_f \times R)$, which are composed of the vectors $\mathbf{w}(P \times 1)$, $\mathbf{p}_{f,m}(P \times 1)$ and $\mathbf{t}_{f,m}(N_f \times 1)$ respectively, and are used as reconstruction models to reconstruct the fault deviations of fault class $m$. $R$ denotes the retained number of extracted directions, which can be determined by the diagnosis performance in practical application.
2.3 Fault diagnosis performance evaluation
Two types of fault diagnosis analyses, same-fault analysis and cross-fault analysis, can be performed to evaluate the performance of fault reconstruction. In same-fault analysis, the reconstruction models corresponding to one fault class are applied to data of the same fault class, and in-control monitoring statistics indicate good reconstruction performance. In cross-fault analysis, the reconstruction models corresponding to one fault class are applied to other fault classes, and out-of-control monitoring statistics indicate correct reconstruction performance. Correspondingly, two evaluation indexes are defined for fault reconstruction [22]: the missing reconstruction ratio (MRR) and the false reconstruction ratio (FRR).
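The exact definitions are given in [22]; the sketch below encodes one hedged reading consistent with the analyses above: MRR as the share of same-class reconstructed samples that still alarm, and FRR as the share of cross-class reconstructed samples falsely brought back inside the control limit. Both helper names are hypothetical.

```python
import numpy as np

def mrr(stats_same, ctrl):
    """Missing reconstruction ratio: same-class reconstructed samples
    whose monitoring statistic still exceeds the control limit."""
    return float(np.mean(np.asarray(stats_same) > ctrl))

def frr(stats_cross, ctrl):
    """False reconstruction ratio: cross-class reconstructed samples
    whose monitoring statistic falls back inside the control limit."""
    return float(np.mean(np.asarray(stats_cross) <= ctrl))
```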
2.4 Online fault diagnosis based on reconstruction strategy
First, the variations of the normal data are extracted by PCA and the corresponding control limits are established, so as to evaluate the fault samples corrected by the fault deviations of each fault class.
PCA is performed on the normal data $\mathbf{X}_n$ to obtain the monitoring system:
$$\mathbf{T}_{n,o} = \mathbf{X}_n\mathbf{P}_o$$
$$\hat{\mathbf{X}}_{n,o} = \mathbf{X}_n\mathbf{P}_o\mathbf{P}_o^{\mathrm{T}}$$
$$\mathbf{E}_{n,o} = \mathbf{X}_n(\mathbf{I} - \mathbf{P}_o\mathbf{P}_o^{\mathrm{T}})$$
$$\mathbf{X}_n = \hat{\mathbf{X}}_{n,o} + \mathbf{E}_{n,o} \quad (19)$$
where $\mathbf{T}_{n,o}(N_n \times L)$ are the principal components and $\mathbf{E}_{n,o}(N_n \times P)$ are the residuals; $L$ denotes the number of principal components retained; $\hat{\mathbf{X}}_{n,o}$ are the data reconstructed by the principal loadings $\mathbf{P}_o(P \times L)$. The subscript $o$ denotes that these results are used to establish the original monitoring system.
Then the $T^2$ and SPE statistics are calculated as follows:
$$T^2_{n,o} = (\mathbf{t}_{n,o} - \bar{\mathbf{t}}_{n,o})^{\mathrm{T}}\boldsymbol{\Sigma}_o^{-1}(\mathbf{t}_{n,o} - \bar{\mathbf{t}}_{n,o})$$
$$SPE_{n,o} = \mathbf{e}_{n,o}^{\mathrm{T}}\mathbf{e}_{n,o} \quad (20)$$
where $\mathbf{t}_{n,o}$ is a row of $\mathbf{T}_{n,o}$ (written as a column vector); $\bar{\mathbf{t}}_{n,o}$ denotes the mean vector of $\mathbf{T}_{n,o}$; $\boldsymbol{\Sigma}_o$ is the variance-covariance matrix of the principal components from the normal data; $\mathbf{e}_{n,o}$ is the corresponding row of $\mathbf{E}_{n,o}$. The control limits $Ctr_{T^2}$ and $Ctr_{SPE}$ are calculated by an F-distribution and a weighted Chi-squared distribution respectively for $T^2$ and SPE.
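A compact sketch of building the monitoring system of Eqs. (19)-(20) from normal data follows; `pca_monitor` is a hypothetical name, and the control-limit computation itself is omitted.

```python
import numpy as np

def pca_monitor(Xn, L):
    """Eqs. (19)-(20): PCA decomposition of normal data plus T2 and SPE."""
    _, _, Vt = np.linalg.svd(Xn, full_matrices=False)
    P_o = Vt[:L].T                          # principal loadings (P x L)
    T_no = Xn @ P_o                         # principal components
    E_no = Xn - T_no @ P_o.T                # residuals, X (I - P_o P_o^T)
    t_bar = T_no.mean(axis=0)
    Sigma_o = np.cov(T_no, rowvar=False)    # covariance of the components
    dev = T_no - t_bar
    T2 = np.einsum('ij,jk,ik->i', dev, np.linalg.inv(Sigma_o), dev)
    SPE = np.sum(E_no ** 2, axis=1)
    return P_o, t_bar, Sigma_o, T2, SPE
```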
Whenever a new fault observation $\mathbf{x}_{new}(P \times 1)$ is available, it is first corrected by the fault deviations of fault class $m$:
$$\mathbf{x}_{new}^{*\mathrm{T}} = \mathbf{x}_{new}^{\mathrm{T}} - \mathbf{x}_{new}^{\mathrm{T}}\mathbf{W}\mathbf{P}_{f,m}^{\mathrm{T}} \quad (21)$$
where $\mathbf{x}_{new}^{*}$ denotes the corrected fault sample, with the fault deviations of fault class $m$ removed.
Then the original monitoring system is applied to the corrected fault sample $\mathbf{x}_{new}^{*}$ to obtain the corrected monitoring statistics:
$$\mathbf{t}_{new}^{*\mathrm{T}} = \mathbf{x}_{new}^{*\mathrm{T}}\mathbf{P}_o$$
$$\mathbf{e}_{new}^{*\mathrm{T}} = \mathbf{x}_{new}^{*\mathrm{T}}(\mathbf{I} - \mathbf{P}_o\mathbf{P}_o^{\mathrm{T}})$$
$$T^2_{new} = (\mathbf{t}_{new}^{*} - \bar{\mathbf{t}}_{n,o})^{\mathrm{T}}\boldsymbol{\Sigma}_o^{-1}(\mathbf{t}_{new}^{*} - \bar{\mathbf{t}}_{n,o})$$
$$SPE_{new} = \mathbf{e}_{new}^{*\mathrm{T}}\mathbf{e}_{new}^{*} \quad (22)$$
If the corrected monitoring statistics $T^2_{new}$ and $SPE_{new}$ are both within their control limits, the fault deviations in the new fault sample are well corrected by the models of fault class $m$, which indicates that the fault sample belongs to class $m$.
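The online correction and re-evaluation of Eqs. (21)-(22) can be sketched as below. The helper name `corrected_stats` is hypothetical; `W` and `P_fm` are the extracted directions and loadings of fault class m, while `P_o`, `t_bar` and `Sigma_o` come from the normal-data PCA model.

```python
import numpy as np

def corrected_stats(x_new, W, P_fm, P_o, t_bar, Sigma_o):
    """Eqs. (21)-(22): remove the class-m fault deviations from x_new,
    then compute the corrected T2 and SPE in the original PCA system."""
    x_star = x_new - x_new @ W @ P_fm.T        # Eq. (21), x_new as a row vector
    t_star = x_star @ P_o
    e_star = x_star - t_star @ P_o.T           # x* (I - P_o P_o^T)
    dev = t_star - t_bar
    T2 = dev @ np.linalg.solve(Sigma_o, dev)
    SPE = e_star @ e_star
    return T2, SPE
```

The new sample is then assigned to the fault class whose reconstruction models bring both statistics back under their control limits.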
3 ILLUSTRATION RESULTS
In this section, the performance of the proposed algorithm is illustrated on the Tennessee Eastman benchmark process, which contains 41 measured variables and 11 manipulated variables. A specific description can be found in the work of Downs and Vogel [21]. Four sets of fault data (#2, #4, #7, #8) and one set of normal data are used here, each involving 480 training samples and 100 testing samples. For these four fault classes, sparse reconstruction models are established by the proposed algorithm.
Fig 1. The values of coefficients in the first fault direction for fault #2 (horizontal axis: variable No.; vertical axis: values of coefficients)
Fig 2. Reconstruction results for fault #4 by models developed from (a) fault #2, (b) fault #4, (c) fault #7 and (d) fault #8 (the dotted line represents the control limit)
Taking fault #2 as an example, the coefficient of each variable in the first fault direction is displayed in Figure 1, which shows that the coefficients of some variables are zero, such as the fifth and sixth variables. This indicates that variables which do not influence the process under fault #2 are not contained in the models, which agrees with the actual situation. Therefore, the proposed algorithm can extract fault directions and isolate faulty variables at the same time.
The results of fault diagnosis are shown in Figure 2, which are obtained by applying the data from fault #4 to the different reconstruction models. It can be seen that the fault is correctly diagnosed, since only the models developed from fault #4 can bring the alarming monitoring statistics back to normal, while for the reconstruction models developed from the other fault classes, at least one monitoring statistic remains out of control.
To evaluate the performance of the reconstruction models, the missing reconstruction ratio (MRR) in same-fault analysis and the false reconstruction ratio (FRR) in cross-fault analysis are used in this case. The penalization factor $\lambda^{(0)}$ and the number of extracted directions in the models for each fault case are determined by making the MRR for the training data just smaller than 5%. The results of the proposed algorithm on the testing data are displayed in Table 1. For same-fault analysis, the MRR results for faults #2, #4 and #8 are larger than 5%, which means that the fault characteristics of these three faults have changed more or less. For cross-fault analysis, the FRR results for $T^2$ are much larger but those for SPE are small, which indicates that the diagnosis power of $T^2$ is weaker than that of SPE. What's more, fault #2 cannot be correctly diagnosed, as the FRR for the fault data from fault #2 corrected by the models developed from fault #8 is extremely large, which may be because the characteristics of fault #2 have become close to those of fault #8.
Table 1. Fault reconstruction results for $T^2$ and SPE evaluated by MRR in same-fault analysis and FRR in cross-fault analysis using the proposed algorithm (values on the fault #/model # diagonal are same-class MRR%; off-diagonal values are cross-class FRR%)

Statistic  Fault #   Model #2   Model #4   Model #7   Model #8
$T^2$      2         0          0          0          85
           4         80         0          82         77
           7         0          0          0          31
           8         33         1          55         0
SPE        2         7.5        0          0          44
           4         0          13.5       6.5        0
           7         0          0          4          3.5
           8         2.5        1          19.5       24
Then a comparative analysis is conducted between the proposed algorithm and the FDFDA method. For fault #8, the results of the comparative analysis are displayed in Figure 3. When the fault deviations are corrected along the directions extracted by the FDFDA method, the reconstructed fault data still show alarming signals, which means that the fault information cannot be comprehensively removed by the FDFDA method. For the proposed algorithm, both alarming monitoring statistics are brought back to the normal region, which shows its effectiveness.
Fig 3. Reconstruction results for fault #8 by models developed from the same fault data using (a) the proposed algorithm and (b) the FDFDA algorithm (the dotted line represents the control limit)
4 CONCLUSION
In the present work, a sparse relative discriminant fault deviations (SRDFD) modeling algorithm is proposed. The proposed algorithm rebuilds the objective function of the FDFDA algorithm, which added the relative variations of variance between fault data and normal data to FDA, by bringing in an L1 penalization. Therefore, the extraction of comprehensive fault deviations and the isolation of specific variables can be accomplished at the same time. To solve the proposed objective function, which is non-concave, the minorization-maximization approach is adopted and the soft threshold operator is applied for analytic solutions. Then the reconstruction models are established based on the extracted fault deviations. The application to fault diagnosis is illustrated and shows superiority over the FDFDA method. The feasibility and performance of the proposed method have been verified on pre-programmed faults of the TE process.
REFERENCES
[1] C.H. Zhao, F.R. Gao, Fault-relevant Principal Component
Analysis (FPCA) Method for Multivariate Statistical
Modeling and Process Monitoring. Chemom. Intell. Lab.
Syst, 133, 1-16, 2014.
[2] S.M. Zhang, C.H. Zhao, S. Wang, F.L. Wang, Pseudo
time-slice construction using variable moving window-k
nearest neighbor (VMW-kNN) rule for sequential uneven
phase division and batch process monitoring. Ind. Eng. Chem.
Res, Vol.56, No.3, 728-740, 2017.
[3] C.C. Hsu, C.T. Su, An adaptive forecast-based chart for non-Gaussian processes monitoring: with application to equipment malfunctions detection in a thermal power plant, IEEE. T. Contr. Syst. T, Vol.19, 1245-1250, 2011
[4] Q.C. Jiang, X.F. Yan, W.X. Zhao, Fault detection and
diagnosis in Chemical Process Using Sensitive Principal
Component Analysis, Ind. Eng. Chem. Res, Vol.52,
1635-1644, 2013
[5] S. de Jong, SIMPLS: an alternative approach to partial least squares regression, Chemom. Intell. Lab. Syst, Vol.18, No.3, 251-263, 1993
[6] S. Wold, K. Esbensen, P. Geladi, Principal component
analysis, Chemom. Intell. Lab. Syst, Vol.2, 37-52, 1987
[7] L.H. Chiang, M.E. Kotanchek, A.K. Kordon, Fault diagnosis
based on Fisher discriminant analysis and support vector
machines, Computers & chemical engineering, Vol.28,
1389-1401, 2004
[8] R. Dunia, S.J. Qin. Subspace approach to multidimensional
fault identification and reconstruction. AIChE J, Vol.44,
1813-1831,1998.
[9] Z. Du, X. Jin, Multiple faults diagnosis for sensors in air handling unit using Fisher discriminant analysis, Energ. Convers. Manage, Vol.49, 3654-3665, 2008
[10] L.H. Chiang, E.L. Russell, R.D Braatz, Fault diagnosis in
chemical processes using Fisher discriminant analysis,
discriminant partial least squares, and principal component
analysis, Chemometr. Intell. Lab, Vol.50, 243-252, 2000
[11] P.N. Belhumeur, J.P. Hespanha, D.J. Kriegman, Eigenfaces vs. Fisherfaces: recognition using class specific linear projection, IEEE Trans. Pattern Anal. Mach. Intell, Vol.19, 711-720, 1997
[12] J.H. Friedman, Regularized discriminant analysis, J. Am.
Stat. Assoc, Vol.84, 165-175, 1989
[13] J.P. Ye, Q. Li, A two-stage linear discriminant analysis via
QR-decomposition, IEEE Trans. Pattern Anal. Mach. Intell,
Vol. 27, 929-941, 2005
[14] C.H. Zhao, F.R. Gao, A nested-loop Fisher discriminant
analysis algorithm, Chemom. Intell. Lab. Syst, Vol.146,
396-406, 2015
[15] C.H. Zhao, F.R. Gao, Critical-to-Fault-Degradation Variable
Analysis and Direction Extraction for Online Fault
Prognostic, IEEE. T. Contr. Syst. T,
10.1109/TCST.2016.2576018.
[16] J.A. Westerhuis, S.P. Gurden, A.K. Smilde, Generalized
contribution plots in multivariate statistical process
monitoring, Chemom. Intell. Lab. Syst, Vol.51, 95-114,
2000
[17] J.L. Liu, D.S. Chen, Fault isolation using modified
contribution plots, Comput. Chem. Eng, vol. 61, 9-19, 2014.
[18] Q.P. He, S.J. Qin, J. Wang, A new fault diagnosis method
using fault directions in Fisher discriminant analysis, AIChE
J, Vol.51, 555–571, 2005
[19] W. Wang, C.H. Zhao, Y.X. Sun, Locating faulty variables by
evaluating ratio of variable contribution based on
discriminant analysis for online fault diagnosis, Control
Conference (CCC), 2015 34th Chinese, IEEE, 6366-6371,
2015
[20] K. Lange, D.R. Hunter, I. Yang, Optimization transfer using
surrogate objective functions, J. Comput. Graph. Stat, Vol. 9,
1-20, 2000
[21] J.J. Downs, E.F. Vogel, A plant-wide industrial process
control problem, Comput. Chem. Eng, Vol.17, 245-255,1993
[22] C.H. Zhao, Y.X. Sun, F.R. Gao, A multi-time-region(MTR)-
based fault space decomposition and reconstruction
modeling strategy for online fault diagnosis, Ind. Eng. Chem.
Res, Vol. 34, 11207-11217, 2012.
... Nevertheless, it borrows the basic idea of FDA that does not consider the fault case resulting from variance changes. Meanwhile, Wang, Zhao, and Sun (2017) introduced 1 penalization to the conventional FDFDA algorithm to gain sparse fault directions as reconstruction models for fault diagnosis. Its performance was verified based on simulated data in benchmark case study, however, it did not locate the specific faulty variables in the concerned faults and analyze their influence on the fault. ...
Article
In a fault process, the variables may be influenced differently. In order to improve the diagnosis performance, it is an important issue to isolate those significant faulty variables that cover informative fault effects. However, those variables are selected one by one and their correlations are not considered in the previous work. As sparse-relevant methods can automatically and efficiently isolate significant correlated variables, it is natural to consider applying the criteria of sparsity to separate the significantly influenced faulty variables and analyze them by specific methods. First, the sparse version of the fault degradation oriented Fisher discriminant analysis (FDFDA) algorithm is proposed to produce informative discriminant directions with sparse loadings. Subsequently, a faulty variable selection strategy is proposed based on the sparse FDFDA algorithm to select significantly influenced faulty variables. By iteratively isolating correlated variables along each sparse fault direction, all the faulty variables can be automatically selected until the left fault data and normal data share the similar characteristics. Therefore, the whole measurement variables can be divided into faulty variable set and normal variable set. Then different fault diagnosis models can be developed according to their different characteristics for each fault class. For online application, a probabilistic fault diagnosis strategy is proposed to determine the fault cause of the new sample by the largest synthetic probability that integrates the diagnosis results of two variable sets. The performance of the proposed fault diagnosis method is illustrated using the data from the cut-made process of cigarette.
Article
Full-text available
We develop a face recognition algorithm which is insensitive to large variation in lighting direction and facial expression. Taking a pattern classification approach, we consider each pixel in an image as a coordinate in a high-dimensional space. We take advantage of the observation that the images of a particular face, under varying illumination but fixed pose, lie in a 3D linear subspace of the high dimensional image space-if the face is a Lambertian surface without shadowing. However, since faces are not truly Lambertian surfaces and do indeed produce self-shadowing, images will deviate from this linear subspace. Rather than explicitly modeling this deviation, we linearly project the image into a subspace in a manner which discounts those regions of the face with large deviation. Our projection method is based on Fisher's linear discriminant and produces well separated classes in a low-dimensional subspace, even under severe variation in lighting and facial expressions. The eigenface technique, another method based on linearly projecting the image space to a low dimensional subspace, has similar computational requirements. Yet, extensive experimental results demonstrate that the proposed “Fisherface” method has error rates that are lower than those of the eigenface technique for tests on the Harvard and Yale face databases
Article
Multiphase characteristics and uneven-length batch duration have been two critical issues to be addressed for batch process monitoring. To handle these issues, a variable moving window-k nearest neighbor (VMW-kNN) based local modeling, irregular phase division, and monitoring strategy is proposed for uneven batch processes in the present paper. First, a pseudo time-slice is constructed for each sample by searching samples that are closely similar to the concerned sample in which the variable moving window (VMW) strategy is adopted to vary the searching range and the k nearest neighbor (kNN) rule is used to find the similar samples. Second, a novel automatic sequential phase division procedure is proposed by similarity evaluation for local models derived from pseudo time-slices to get different irregular phases and ensure their time sequence. Third, the affiliation of each new sample is real-time judged to determine the proper phase model and fault status can be distinguished from phase shift event. The pr...
Article
Fault prognosis determines whether a failure is impending and estimates how soon an incident will occur; it is nowadays recognized as a key feature of maintenance strategies. For a slowly time-varying, autocorrelated fault process, the fault degradation process can be revealed for prognosis. Based on this assumption, a fault degradation modeling and online fault prognostic strategy is developed in this paper. A stability factor (SF) is defined to evaluate the changing characteristics of process status, and an SF-based non-steady faulty variable identification method is developed to find critical-to-fault-degradation variables. A fault degradation-oriented Fisher discriminant analysis is proposed on the selected variables to model the fault evolution process. Uninformative fault effects that do not present degradation are excluded, so that the critical fault degradation information can be focused on. The proposed method is verified by three cases, including a numerical case, the cut-made process of cigarette production, and the well-known Tennessee Eastman benchmark chemical process.
Article
Fisher discriminant analysis (FDA), as a very important method for feature extraction, has been widely used in different applications. However, some drawbacks of the conventional FDA algorithm have limited its success and applications. In order to improve the discriminant power, a new discriminant analysis algorithm is proposed based on Fisher's linear discriminant objective by developing a nested-loop algorithm, called nested-loop Fisher discriminant analysis (NeLFDA). The basic idea of the proposed NeLFDA is to overcome three important problems of the conventional Fisher discriminant analysis algorithm: (1) the within-class scatter matrix may be singular for eigenvalue decomposition, (2) the number of extracted discriminant components is limited by rank deficiency of the between-class scatter matrix, and (3) the discriminant components are correlated with each other for each class. These problems are addressed in a nested-loop iterative process comprising inner-loop and outer-loop calculations. The application of the proposed algorithm to classification and fault diagnosis is evaluated on two examples. The results show that the proposed algorithm can better separate different classes with improved discriminant power and provide more promising fault diagnosis performance.
Article
Sensitive principal component analysis (SPCA) is proposed to improve principal component analysis (PCA) based chemical process monitoring performance by solving the information loss problem and reducing the nondetection rate of the T2 statistic. Generally, the selection of principal components (PCs) in PCA-based process monitoring is subjective, which can lead to information loss and poor monitoring performance. The SPCA method first builds a conventional PCA model from normal samples, then indexes the PCs that reflect the dominant variation of the abnormal observations, and uses these sensitive PCs (SPCs) to monitor the process. Moreover, a novel fault diagnosis approach based on SPCA is also proposed, owing to the SPCs' ability to represent the main characteristics of the fault. Case studies on the Tennessee Eastman process demonstrate the effect of SPCA on online monitoring, showing that its performance is significantly better than that of classical PCA methods.
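The monitoring scheme reduces to computing T² on a chosen subset of PCs; the sketch below ranks PCs by how strongly the fault data excite them, a simplified stand-in for the paper's SPC selection criterion (all names illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)
Xn = rng.normal(size=(500, 6))                      # normal operating data
fault = rng.normal(size=(80, 6)) + np.array([0, 0, 3.0, 0, 0, 0])

# PCA model from normal data (mean-centred; unit-variance scaling omitted)
mu = Xn.mean(0)
_, s, Vt = np.linalg.svd(Xn - mu, full_matrices=False)
lam = s**2 / (len(Xn) - 1)                          # variance of each PC

def t2(X, idx):
    """Hotelling T^2 restricted to the PC subset idx."""
    T = (X - mu) @ Vt.T                             # scores of the samples
    return np.sum(T[:, idx] ** 2 / lam[idx], axis=1)

# "Sensitive" PCs: the two whose mean scores shift most on the fault data
scores_f = (fault - mu) @ Vt.T
idx = np.argsort(scores_f.mean(0) ** 2 / lam)[-2:]
t2_fault, t2_normal = t2(fault, idx), t2(Xn, idx)
```

Restricting T² to the excited PCs concentrates the statistic on the fault-relevant variation instead of diluting it over all retained components.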
Article
The well-known EM algorithm is an optimization transfer algorithm that depends on the notion of incomplete or missing data. By invoking convexity arguments, one can construct a variety of other optimization transfer algorithms that do not involve missing data. These algorithms all rely on a majorizing or minorizing function that serves as a surrogate for the objective function. Optimizing the surrogate function drives the objective function in the correct direction. This article illustrates this general principle by a number of specific examples drawn from the statistical literature. Because optimization transfer algorithms often exhibit the slow convergence of EM algorithms, two methods of accelerating optimization transfer are discussed and evaluated in the context of specific problems.
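As a concrete instance of optimization transfer (not one drawn from the article itself), minimizing the absolute loss sum |x_i - m| with the quadratic majorizer |u| <= u^2 / (2|u_t|) + |u_t|/2 gives an iteratively reweighted least-squares update, a one-dimensional Weiszfeld-type scheme that converges to the sample median:

```python
import numpy as np

def mm_median(x, iters=100, eps=1e-9):
    """Majorize-minimize for sum(|x_i - m|): at each step the quadratic
    surrogate touching the objective at the current iterate is minimized,
    which drives the objective monotonically downhill."""
    m = x.mean()                                   # any starting point works
    for _ in range(iters):
        w = 1.0 / np.maximum(np.abs(x - m), eps)   # surrogate weights
        m = np.sum(w * x) / np.sum(w)              # minimize the surrogate
    return m

x = np.array([1.0, 2.0, 3.0, 10.0, 100.0])
m_hat = mm_median(x)                               # converges to the median
```

The `eps` guard keeps the surrogate defined when an iterate lands on a data point, one of the practical issues such transfer algorithms must handle.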
Article
For industrial processes, there are always some specific faults which are difficult to detect with the conventional PCA algorithm, since the monitoring models are defined from the general distribution information of normal data, which may not highlight the abnormal changes. For these specific faults, if fault data are available and used for model development, more meaningful directions may be extracted for monitoring, which can improve fault detection sensitivity. In the present work, a fault-relevant principal component analysis (FPCA) algorithm is proposed for statistical modeling and process monitoring using both normal and fault data. The key is how to extract and supervise the fault-influential data distribution directions. By analyzing the relative changes from normal to fault with the available fault data, the new model structure further decomposes the original PCA systematic subspace and residual subspace into two parts each. The part that presents larger variation relative to the normal case under the disturbance of the fault is regarded as more informative for fault detection (called the fault-relevant part here). It is then separated from the fault-irrelevant part and highlighted for online monitoring, which is deemed more effective for fault detection. The proposed method provides a detailed insight into the decomposition of the original normal process information from the fault-relevant perspective. Its sensitivity to fault detection is illustrated by data from a numerical example and the Tennessee Eastman process. Index Terms: multivariate statistical analysis, principal component analysis (PCA), fault-relevant principal component analysis (FPCA), fault detection, subspace decomposition.
Article
Investigating the root causes of abnormal events is a crucial task for an industrial process. When process faults are detected, isolating the faulty variables provides additional information for investigating the root causes of the faults. Numerous data-driven approaches for isolating faulty variables require datasets of known faults, which may not exist for some industrial processes. The contribution plot is a popular tool for isolating faulty variables without a priori knowledge. However, it is well known that this approach suffers from the smearing effect, which may misleadingly implicate non-faulty variables in the detected faults. In the presented work, a contribution plot without the smearing effect on non-faulty variables was derived. A continuous stirred tank reactor (CSTR) example and an industrial application are provided to demonstrate that the proposed approach is not only capable of locating different faulty variables when the fault is propagated by the controllers, but also capable of identifying the variables responsible for multiple sensor faults.
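The conventional contribution plot, and the smearing effect it suffers from, can be seen in a few lines of NumPy; this sketch shows only the classic SPE contributions (the smearing-free variant of the paper reweights these residuals, and all names here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)
# Two latent factors drive four measured variables (pairs are correlated)
t1, t2 = rng.normal(size=(300, 1)), rng.normal(size=(300, 1))
Xn = np.hstack([t1, t1, t2, t2]) + 0.1 * rng.normal(size=(300, 4))

mu = Xn.mean(0)
_, _, Vt = np.linalg.svd(Xn - mu, full_matrices=False)
P = Vt[:2].T                                   # 2-PC model of normal data

def spe_contributions(x):
    """Classic per-variable SPE contributions: squared entries of the
    residual after projecting x onto the PCA model."""
    r = (x - mu) - P @ (P.T @ (x - mu))
    return r**2

x_fault = Xn[0].copy()
x_fault[1] += 3.0                              # bias fault on variable 1
contrib = spe_contributions(x_fault)
# contrib peaks on variables 0 AND 1: the fault on variable 1 "smears"
# onto its correlated partner, exactly the effect the paper removes
```

Running this shows the correlated, non-faulty variable 0 receiving a contribution comparable to the truly faulty variable 1, which is why a smearing-free formulation matters for root-cause isolation.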
Article
Time-varying fault characteristics have not yet been addressed by conventional fault-reconstruction-based modeling methods, which could affect fault diagnosis performance. In the present work, the multiple-time-region (MTR) nature, that is, the multiplicity of fault characteristics as the process evolves, is proposed and efficiently analyzed for fault diagnosis. First, an automatic time-region-division algorithm is developed that can partition the whole fault process into different local regions according to the changes in fault characteristics. Different local fault characteristics are thus analyzed by building different representative fault feature models in multiple time regions. Following the changing relationships between the fault and normal operation statuses, different fault reconstruction actions are finally taken in different time regions. By a proper time-region division, the proposed method can better model the time-varying fault behaviors and capture the different fault-to-normal reconstruction relationships for fault diagnosis. The feasibility and performance of the proposed method are illustrated with the Tennessee Eastman process, revealing enhanced fault understanding and improved fault diagnosis performance.