ArticlePDF Available

Fault isolation based on transfer-function models using an MPC algorithm

January 2022
Computers & Chemical Engineering 159(12):107668

January 2022
159(12):107668

DOI:10.1016/j.compchemeng.2022.107668

Authors:

Zhejiang University

This work studies model-based fault isolation. A method of fault isolation filter design is developed. The fault to output transfer function model is obtained using system identification. The basic idea is to formulate the isolation problem as a dual decoupling control problem then use a MPC controller as the decoupler. In this way an efficient algorithm can be developed to obtain the fault isolation filter; time delays and right-half-plane (RHP) zeros can be handled. Model errors that occur during system identification are considered in designing the isolation filter. Optimal fault isolation filters that suppress disturbances are also developed. The method is demonstrated using a simulated 600MW supercritical power generation unit and the Tennessee Eastman process (TEP).

IMC structure.

…

Block diagram of the proposed FDI method.

…

Step responses of real plant and of the identified model.

…

Step responses of G y f (q). Subfigures marked as red corresponds to G y f i j containing RHP zeros.

…

Input and output variables of the numerical example.

…

Figures - uploaded by Jinming Zhou

Content may be subject to copyright.

Content uploaded by Jinming Zhou

Content may be subject to copyright.

Fault isolation based on transfer-function models using an MPC algorithm

Jinming Zhoua, Yucai Zhua,∗

aState Key Laboratory of Industrial Control Technology, College of Control Science and Engineering, Zhejiang University, Hangzhou 310027, China

Abstract

This work studies model-based fault isolation. A method of fault isolation ﬁlter design is developed. The fault to output transfer

function model is obtained using system identiﬁcation. The basic idea is to formulate the isolation problem as a dual decoupling

control problem then use a MPC controller as the decoupler. In this way an eﬃcient algorithm can be developed to obtain the fault

isolation ﬁlter; time delays and right-half-plane (RHP) zeros can be handled. Model errors that occur during system identiﬁcation

are considered in designing the isolation ﬁlter. Optimal fault isolation ﬁlters that suppress disturbances are also developed. The

method is demonstrated using a simulated 600MW supercritical power generation unit and the Tennessee Eastman process (TEP).

Keywords: Fault isolation, Model predictive control, System identiﬁcation, Filter design, Tennessee Eastman process

1. Introduction1

Recent decades witness an increased focus on fault diagno-2

sis in modern industrial processes [1, 2, 3, 4, 5]. Fault refers to3

an unpermitted deviation of at least one characteristic or vari-4

able in a system [6]. In a broader context, faults can also re-5

fer to the slight changes in signals [7], which can be deemed6

as precursors before faults occur. A well-designed fault di-7

agnosis system can not only provide early detection of un-8

expected malfunctions or failures in process, give guidelines9

for equipment overhaul and maintenance, but improves prod-10

uct quality and brings economic beneﬁts. Among the various11

branches of fault diagnosis methods, model-based fault diagno-12

sis is the most promising one and has been widely investigated13

through decades. Generally speaking, model-based methods14

contain two steps [2]: residual generation and residual eval-15

uation. Fig. 1 presents a typical model-based fault diagnosis16

scheme. In residual generation step, the consistency between17

process output and model output of is checked, according to18

which faults will be detected. The residual evaluation step can19

be viewed as a signal processing or ﬁltering procedure, in which20

useful information about the faults will be extracted. This paper21

focuses on fault isolation (FI) problem, which aims at determin-22

ing the locations of the detected faults.23

As discussed in [2], the existing schemes for FI can be di-24

vided into three categories: using unknown input decoupling25

strategy; solving FI by means of a bank of residual generators;26

formulating FI as a dual problem of designing a decoupling27

controller. Solving FI by means of a bank of residual generators28

mainly refers to the dedicated observer scheme (DOS) [8] and29

generalized observer scheme (GOS) [9]. In DOS, each residual30

?This work is supported by the National Natural Science Foundation of

China, Grant Numbers: U1809207, 61673343.

∗Corresponding author

Email addresses: zhoujinming@zju.edu.cn (Jinming Zhou),

zhuyucai@zju.edu.cn (Yucai Zhu)

is only related to one output, which has very clear structure and 31

working principle, but its application is limited to sensor fault 32

isolation; in GOS, each residual is sensitive to all but one faults, 33

and the decision logic is complementary to DOS. 34

Unknown input decoupling technique aims at making the 35

generated signals independent of the disturbance (unknown in- 36

put) of no interest but preserving the sensitivity to the parts 37

to be detected. In fault detection (FD) problem, unknown in- 38

put decoupling technique is widely used to reduce the pro- 39

cess disturbance eﬀect and improve the detection accuracy 40

[10, 11, 12, 13, 14]. FI problem can also be formulated as a 41

number of unknown input decoupling problems where faults of 42

no interest are handled as unknown input [2, 15, 16], then faults 43

can be clustered into some groups and FI is achieved according 44

to some decision logic. 45

Many model-based fault diagnosis design problems have 46

close relationships to control theory. For instance, both in con- 47

trol and fault diagnosis applications, model mismatch or model 48

uncertainty is a challenging problem. From around 1990, in- 49

spired by the development of H∞and robust control theory 50

which provide systematic analysis tools to handle model un- 51

certainties, many FD design methods based on H∞and linear 52

matrix inequality (LMI) have been developed [17, 18, 19, 20]. 53

FI is a problem that is closely related to decoupling control. 54

Decoupling is an important concept in multi-input multi-output 55

(MIMO) control systems. After the decoupling procedure each 56

output are only aﬀected by one input thus the MIMO system is 57

reduced to several single-input single-output (SISO) systems. 58

Similarly, in FI, after processing of the original residuals, each 59

residual should only be aﬀected by one or a group of faults. 60

Based on the similarities, [21] proposed an FI method utiliz- 61

ing the duality between FI and the state feedback decoupling; 62

in [2], this method was extended to handle more general cases. 63

However, several problems may occur when developing decou- 64

pling algorithms. In [22], it is pointed out that the realizability, 65

stability and robustness of the decouplers are three frequently 66

Preprint submitted to Computers &Chemical Engineering April 4, 2022

process

Process input Process output

process

model residual

processing Decision

logic

-residual

Model-based fault diagnosis system

information

of faults

residual generation

fault detection residual evaluation

fault isolation, identification, analysis, ...

process

Process input Process output

process

model residual

processing Decision

logic

-residual

Model-based fault diagnosis system

information

of faults

residual generation

fault detection residual evaluation

fault isolation, identification, analysis, ...

Figure 1: A diagram of model-based fault diagnosis system.

encountered problems in decoupling control. The reason lies in67

that for ideal decoupling [23], the inversion of the plant transfer68

function matrix is required, which can become improper, or un-69

stable when time delays and RHP zeros occur. In fact, these are70

common problems when plant model inversion is involved, see71

also [24]. In order to handle the problems of time delays and72

RHP zeros more complex algorithms are required [25, 26, 22],73

the result of which may become unreliable due to numerical74

issues when system dimension increases.75

Industrial processes are essentially dynamic and there are ca-76

sual relationships between various variables. These two fea-77

tures can be captured by state-space models or by transfer-78

function models. Model-based method is superior to data-79

driven and signal-based methods because models can reveal80

deeper information about processes than signals. Bottleneck81

of the model-based methods is the diﬃculty in model building,82

especially in process industries. In [27], we have studied the83

identiﬁcation based residual generation and FD problem. Fol-84

lowing the same line, this work aims at developing an eﬃcient85

FI algorithm based on the identiﬁed model. Model predictive86

control (MPC) is nowadays the most successful MIMO control87

strategy in process control [28, 29], which oﬀers a powerful tool88

to handle large-scale interacting loops with constraints. Unlike89

traditional decoupling control that uses parametric controllers,90

MPC algorithms can handle time delays and RHP zeros inher-91

ently without special treatment. Based on the fact that FI can92

be solved by a dual problem of decoupling control, this work93

tries to solve FI problem by combining system identiﬁcation94

and MPC technique.95

Contributions and outline96

The core of FI is to decouple the interactions between faults97

and residuals. In control applications, it turns out that MPC can98

inherently handle the interactions between manipulated vari-99

ables and controlled variables of large-scale plants, implying100

that the MPC controller also serves as a decoupler. Inspired by101

this nice feature of the MPC algorithm, this paper reveals that102

with some transformation and modiﬁcations, the unconstrained103

MPC controller can be used to address the couplings between104

faults, then to solve FI problem.105

The paper is divided into two main parts. In the ﬁrst part,106

the relation between the unconstrained MPC controller and the107

isolation ﬁlter is analyzed. Then an algorithm is developed to108

obtain a parametric MPC controller and simultaneously an iso-109

lation ﬁlter. Considering that the algorithm is based on a model110

with errors, a method to validate the isolation ﬁlter is devel- 111

oped that considers errors in system identiﬁcation. The pro- 112

posed method can handle large-scale systems containing time 113

delays and RHP zeros and is numerically reliable. The sec- 114

ond part deals with the eﬀect of the nuisance factors containing 115

model error and unmeasured disturbance which degrade FI per- 116

formance. Based on a performance index and the related post- 117

ﬁlters, the FI performance can be enhanced. Further, an identi- 118

ﬁcation based fault detection and isolation (FDI) framework is 119

established combining the fault detection scheme in [27]. 120

The rest of the paper is organized as follows: Section 2 con- 121

tains preliminaries about the underlying system and identiﬁca- 122

tion method, and formulates the FI problem to be addressed; 123

Section 3 presents details of the FI approach; Section 4 deals 124

with the FI performance and proposes the FDI framework; in 125

Section 5 the proposed method is validated in an example of a 126

600MW power generation unit; Section 6 presents the applica- 127

tion of the method in TEP. Summary and conclusion are given 128

in Section 7. 129

Notations 130

Vectors and matrices are in bold-face. Rndenotes the real n131

dimensional space, Rn×mdenotes the set of n×mreal matrices. 132

ATdenotes transpose of A,Ai j denotes ith raw jth column el- 133

ement of A. Kronecker product is denoted as ⊗. The identity 134

matrix of size n×nis denoted In.0n×pis the n×pmatrix full of 135

zero. diag{· · · } represents a diagonal or block-diagonal matrix. 136

For a vector x,xidenotes its ith element. kxkdenotes Euclidean 137

norm of x, and kxkQis the weighted norm. E[·], var[·], cov[·]138

denote mathematical expectation, variance and covariance op- 139

erators. |z|is the modulus of a complex number z.q−1is the 140

time-shift operator: q−1u(t)=u(t−1). Gaussian distribution and 141

chi-square distribution are denoted as N(·,·) and χ2(·). Φξ(ω)142

denotes the power spectrum of a signal sequence {ξ(t)}, when 143

this notation is used it is implied that the spectrum exists. 144

denotes the logical exclusive nor operator. 145

2. Preliminaries and problem statement 146

2.1. System description 147

This work considers MIMO linear time-invariant (LTI) sys- 148

tems. The true system without faults can be written as: 149

S:y(t)=G(q)u(t)+v(t),v(t)=H(q)e(t) (1) 150

where G(q) is a stable transfer matrix of dimension p×m,H(q) a 151

transfer matrix of dimension p×psatisfying: H(q) and H−1(q)152

are stable, H(q=∞)=Ip.u(t)∈Rmand y(t)∈Rpare 153

input and output signals, v(t)∈Rpdenotes the disturbance. 154

e(t)∈Rpis zero mean white noise with covariance matrix Λ,155

i.e. E{e(t)eT(t)}=Λ.156

Several ways exist to describe the true system S, for instance 157

state-space model [30, 31], input-output model [32, 33]. In this 158

work, we focus on input-output model. A parameterized model 159

structure Mis introduced as: 160

M:y(t)=G(q,θ)u(t)+v(t),v(t)=H(q,θ)e(t) (2) 161

where θ∈ Dθ∈Rdis a parameter vector, Dθrestricts θto162

the values that G(q,θ) is stable, H(q,θ) is stable and inversely163

stable.164

Prediction error method (PEM) is a standard method to165

identify θ, in which a quadratic cost function of the pre-166

diction error (PE) is minimized based on test data ZN=167

{u(1),y(1),· · · ,u(N),y(N)}:168

θN=arg min

t=1

(y(t)−ˆ

y(t,θ))TΛ−1(y(t)−ˆ

y(t,θ)),(3)169

where the predictor ˆ

y(t,θ) is given by the stable ﬁlter:170

y(t,θ)=H−1(q,θ)G(q,θ)u(t)+[I−H−1(q,θ)]y(t).(4)171

For details, see [33].172

An important property of PEM is that, when the model struc-173

ture Mis ﬂexible enough to capture the true system, i.e. there174

exists a θothat G(q)=G(q,θo), H(q)=H(q,θo) and assume175

that the white noise e(t) is Gaussian, the estimate ˆ

θNasymp-176

totically (N→ ∞) subjects to Gaussian distribution around the177

true parameter θo:178

θN∼ N(θo,1

NP(θo)) (5)179

where180

P−1(θo)=E{Ψ(t,θo)Λ−1ΨT(t,θo)},(6)181

Ψ(t,θ) is deﬁned as: d

dθˆ

y(t,θ). We refer to [34] for more explicit182

type of (5). This conclusion allows one to determine uncertainty183

regions for ˆ

θN, or in other words, provide an eﬃcient way to184

quantify estimate error, which has been proved to be useful in185

robust controller design based on identiﬁed model [35, 36, 37],186

model quality validation [32] and input design [38, 34]. Sim-187

ilarly, this conclusion will serve as a tool to deal with model188

error in our subsequent study. Notice that in practice the true189

parameter θowill be replaced by the estimate ˆ

θN.190

Remark 1. This work focuses on input-output model of the191

system based on which FI and relevant issues will be investi-192

gated. Besides using PEM to build the model, one can also per-193

form subspace identiﬁcation [30, 31] or ﬁrst-principle modeling194

to get a state-space model then transform it to transfer-function195

type. Remember that, while PEM can provide model uncer-196

tainty description mentioned above, how to describe model197

errors in subspace identiﬁcation method and in ﬁrst-principle198

modeling is still an open question.199

2.2. Fault detection and fault isolation200

When faults occur, (1) becomes:201

y(t)=G(q)u(t)+v(t)+Gy f (q)f(t),(7)202

f(t)∈Rl,Gy f (q) is a p×ltransfer matrix. According to [2],203

both additive and multiplicative faults can be modelled in this204

form. For additive faults representative of oﬀsets or drifts in205

actuators and sensors, if fkis a sensor (output) fault measuring206

y¯

i,207

Gy f

ik =1,Gy f

ik =0 (i,¯

i); (8)208

if fkis an actuator (input) fault corresponding to uj,209

Gy f

ik (q)=Gi j(q).(9) 210

For multiplicative faults representative of process faults, we 211

have 212

Gy f (q)=δG(q),f(t)=u(t) (10) 213

where δG(q) denotes the change caused by process faults, and 214

typically this term is unknown. Moreover, it is augured that 215

multiplicative faults can aﬀect the system stability [39]. In this 216

work we conﬁne ourselves to isolation of additive faults. For 217

such faults, an estimated Gy f (q) can be obtained using system 218

model (through e.g. identiﬁcation). Subsequently, the term ad- 219

ditive will be omitted when there is no ambiguity. 220

In practice, the faults to be considered should be chosen re- 221

lated to the process variables that is important for safety and 222

product quality. For additive faults, sensors and actuators in 223

the loops that is key for safe operations and control require- 224

ments should be taken into account. Prior knowledge obtained 225

by analysing data during the fault events and maintenance peri- 226

ods, or through insights of the devices can also be useful. 227

Based on the system model, the consistency between the 228

measured outputs and the model outputs can be checked, which 229

forms the basic idea of FD. In [27], it has been proved that out- 230

put error (OE) is more suitable for FD than PE. Denote the esti- 231

mated model of Sas ˆ

G(q) :=G(q,ˆ

θN), then OE can be written 232

as: 233

r(t)=y(t)−ˆ

G(q)u(t),(11) 234

which will be used as the basic residual in this work. From 235

(7), one can see that faults will cause deviations, or changes 236

in r(t). By combining components of r(t) into some statistics 237

J(r(t)) and choosing proper threshold Jth , faults can be detected 238

according to the following decision logic: 239











J(r(t)) >Jth ⇒alarm

J(r(t)) ≤Jth ⇒no alarm .(12) 240

Denote ∆G(q) :=G(q)−ˆ

G(q), r(t) can further be written as: 241

r(t)= ∆G(q)u(t)+v(t)

| {z }

v(t)

+Gy f (q)f(t) (13) 242

where ¯

v(t) is a general disturbance term containing the nuisance 243

factors that can degrade the detection ability. In order to eval- 244

uate the detection ability, the following performance index is 245

introduced: 246

Deﬁnition 1. Let ε(t) be a scalar residual, if we denote εnas 247

the residual at the normal condition, εfas that at the faulty con- 248

dition when f(t) occurs, then the fault detection performance 249

index is deﬁned as 250

Enεf(t)o2

E{εn(t)}2.(14) 251

Let ε(t)=ri(t), and consider one fault fk, the detection per- 252

formance of rito fkcan be achieved, Jik :=E{rfk

i(t)}2

E{rn

i(t)}2. A larger 253

Jimplies a better performance of εto detect fin a statistical254

sense.255

To illustrate the FI problem, assume that ∆G=0,v=0, then256

FI problem can be formulated as:257

Problem 1. Find a stable ﬁlter L(q) of dimension l×psuch258

that259

L(q)r(t)=L(q)Gy f (q)f(t)=Γ(q)f(t) (15)260

where Γ(q)=diag{γ1(q),· · · , γl(q)}.261

In (15), each element in the new residual only responses to one262

fault, this is called perfect fault isolation (PFI) [2] and we call263

L(q) an isolation ﬁlter. From a pragmatic viewpoint, if the non-264

diagonal elements of Γ(q) is nonzero but tends to zero quickly265

with small dynamic ﬂuctuations, it can be deemed that PFI is266

solved approximately with dynamic interactions. In this work,267

we extend the deﬁnition of PFI to encompass such approxima-268

tions.269

Concerning the solvability of Problem 1, the following result270

is important:271

Lemma 1. For additive fault f(t)∈Rland system (7), Problem272

1 is solvable if and only if273

rank(Gy f (q)) =l.(16)274

The proof of Lemma 1 can be found in [40, 2]. Before de-275

signing the fault diagnosis system, according to the considered276

faults, lcan be determined. Lemma 1 implies for a poutputs277

MIMO system, one can at most isolate pfaults. Hence if l≤p,278

isolation ﬁlter aiming at solving PFI can be developed; if l>p,279

more sensors are required for more available information, one280

can also consider isolating faults into groups as an alternative,281

see [15, 16].282

In this paper, we only focus on the occasions that l≤pwhere283

a PFI can be obtained. Subsequent sections aim at addressing284

several problems: How to develop eﬃcient PFI algorithm using285

MPC controller? How to validate the designed isolation ﬁlter286

when model error exists? How to deal with disturbance and287

enhance isolation performance?288

3. Fault isolation using MPC controller289

In this section, it will be demonstrated that the transpose of290

an unconstraint MPC controller can serve as the isolation ﬁlter.291

According to the plant model, the fault to output model can292

be built, based on which an eﬃcient algorithm to obtain the293

isolation ﬁlter is developed. Finally, fault isolation with model294

errors is considered.295

3.1. MPC controller and isolation ﬁlter296

Some brief review on MPC will be given ﬁrst. Take dynamic297

matrix control (DMC) for instance, at each control interval, the298

following performance index is minimized:299

Jc=kw(t)−ˆ

yP(t)k2

Q+k∆uM(t)k2

R(17)300

where wis the reference trajectory, ˆ

yPdenotes predicted output301

over the prediction horizon P(for explicit expression, see e.g.302

controller plant

model

Figure 2: IMC structure.

[41]), and ∆uMdenotes the future control move over control 303

horizon M, which is optimized in (17). Qand Rare weighting 304

matrices. The ﬁrst input in the optimal sequence is then sent 305

into the plant, and the entire calculation is repeated at subse- 306

quent control intervals [42]. For unconstraint MPC, the solution 307

of (17) is explicit. The control move at time tcan be expressed 308

as: 309

∆u(t)=L(ATQA +R)−1ATQ[w(t)−ˆ

yP(t)] (18) 310

where Acontains internal model parameters of DMC which are 311

formed by the coeﬃcients of model step responses, 312

L=

1 0 0 0

...

0 1 0 0

.(19) 313

(18) can be viewed as a nonparametric model between w(t) and 314

u(t). 315

The unconstraint linear MPC has close connections to some 316

other control strategies. For instance, it can be transformed to 317

a IMC structure [43, 44]; when Pand Mapproach inﬁnity, un- 318

constraint MPC becomes a standard linear quadratic regulator 319

(LQR) problem [45]. The role of MPC controller as a decou- 320

pler can hardly be seen from Jcand ∆u(t). Using the equiv- 321

alence between unconstraint MPC and IMC, and investigating 322

IMC structure, the decoupling function will emerge. The struc- 323

ture of IMC is shown in Fig. 2, the closed-loop transfer function 324

from wto yis: 325

T(q)=G(q)[Im+K(q)∆G(q)]−1K(q).(20) 326

A desirable controller design must satisfy T(1) =Ipto avoid 327

steady-state oﬀset, which implies a decoupling with dynamic 328

interactions [25]. Notice that it diﬀers from static decoupling 329

[22] where the static decoupler is designed simply based on 330

steady-state gains. A complete decoupling can be realized when 331

∆G=0[25]: factorize ˆ

G(q) as 332

G(q)=ˆ

G+(q)ˆ

G−(q),ˆ

G+(1) =Ip(21) 333

where ˆ

G+(q) contains time delays and RHP zeros of ˆ

G(q) and 334

G−(q) has a stable and realizable inverse. Then use 335

K(q)=ˆ

G−1

−(q)F(q) (22) 336

Algorithm 1: Designing isolation ﬁlter by identifying the MPC controller

Input: Plant test data ZN={u(1),y(1),· · · ,u(N),y(N)}, desired response Γ(q)

Output: Fault isolation ﬁlter L(q)

/* Step 1-3: tune MPC parameters to get the desired response. */

1Use system identiﬁcation to build plant model based on ZNand get d

Gy f (q);

2Set d

Gy f (q)T

as the plant and internal model in MPC;

3Tune the MPC parameters Q,R,P,Mto get the desired closed-loop response;

/* Step 4-16: use system identification to get a parametric MPC contorller as well as the

isolation filter. */

4(Identiﬁcation settings): set the length of identiﬁcation experiment NK, controller order nKand design test signals

η1,· · · , ηl;

5for j←1to ldo

6Generate NK-sample test signal nηj(1),· · · , η j(NK)o, set {W(1),· · · ,W(NK)}according to (26);

7Run MPC simulation for NKsample time;

8for i←1to pdo

9Set the data set ZNK

i j ={W j(1),Ui(1),· · · ,Wj(NK),Ui(NK)};

10 Get ˆ

Ki j(q) using LS method;

11 Set Lji(q)=ˆ

Ki j(q);

12 end

13 end

14 if L(q)d

Gy f (q),Γ(q)then

15 Go back to Step 4 to adjust the identiﬁcation settings;

16 end

as a controller delivers337

T(q)=ˆ

G+(q)F(q).(23)338

If ˆ

G+(q) and F(q) are chosen to be diagonal, T(q) will also have339

a diagonal structure.340

Now we are ready to demonstrate that the transpose of the341

MPC or IMC controller can be used as the isolation ﬁlter. If342

G(q)=ˆ

G(q)=hGy f (q)iT,343

T(q)=hGy f (q)iTK(q).(24)344

Transpose both sides of (24) yields345

KT(q)Gy f (q)=TT(q).(25)346

Factorize hGy f (q)iTaccording to (21), set a controller accord-347

ing to (22), and choose diagonal hGy f (q)iT

+and F(q), TT(q) be-348

comes diagonal, implying that KT(q) solves PFI. Analogously,349

when K(q) is a MPC or IMC controller that leads to a T(q) such350

that T(1) =Ip, PFI can be realized with some dynamic interac-351

tions.352

To obtain such model-based controllers, ﬁrst the fault trans-353

fer matrix should be estimated based on the identiﬁed system354

model. From above discussion, one can see that the transpose355

of the ideal IMC controller can exactly solve PFI, whereas one356

must perform transfer matrix factorization and inversion as in357

(21) and (22), which can be numerically diﬃcult when system358

Test signals

identification

input data output data

MPC

controller

Figure 3: Illustration of Problem 2.

dimension is large [25, 46]. For unconstrained MPC, after the 359

tuneable parameters are determined the control move can be 360

explicitly given as in (18), hence one possible alternative is to 361

design an unconstrained MPC with a very diagonal T(q), and 362

use the transpose of the MPC controller to realize FI. 363

3.2. Isolation ﬁlter identiﬁcation 364

For FI purpose, the expression (18) is of little help and a 365

parametric (ﬁlter-type) model of the MPC controller is required. 366

For unconstrained linear MPC, the equivalent controller K(q) is 367

a linear ﬁlter which can be obtained by solving the following 368

black-box identiﬁcation problem: 369

Problem 2. Design identiﬁcation test using test signal vec- 370

tor W, use it as the reference trajectory of the de- 371

signed MPC, identify the transfer function matrix of the 372

MPC controller K(q) from simulation data set ZNK=373

{W(1),U(1),· · · ,W(NK),U(NK)}, where Wis used as in-374

put and Uis used as output.375

Problem 2 is illustrated by Fig. 3. Here Wand Udenote con-376

troller input and output, NKdenotes data length for identifying377

controller K(q). Problem 2 is an identiﬁcation problem with the378

following features: 1) it is an open-loop identiﬁcation problem;379

2) it is noise-free; 3) the order of K(q) to be identiﬁed is typi-380

cally high, because the ideal controller of IMC, or MPC is the381

(approximate) inverse of the plant model that is non-parametric382

[24, 46, 44]. Bearing the above points in mind, we recommend383

to perform lidentiﬁcation tests (simulation runs), at each time384

only add a test signal to one component of W. Denote the test385

signal in jth test as ηj, we have:386

W(t)=[01×j−1, η j(t),01×l−j]T,at jth experiment. (26)387

ηjmust satisfy the persistent excitation condition [33], for in-

stance, it could be generalized binary noise (GBN) signal [47].

In each experiment, by using data set

ZNK

i j ={W j(1),Ui(1),· · · ,Wj(NK),Ui(NK)},1≤i≤p

to identify Ki j respectively, one can get the jth column of K(q).388

The dynamic relation between Wj(t) and Ui(t) can be ex-389

pressed as:390

Ui(t)=ϕT

i j(t)ϑi j (27)391

where392

ϕT

i j(t)=[−Ui(t−1),· · · ,−Ui(t−na),Wj(t),· · · ,Wj(t−nb)],

(28)

393

ϑT

i j =[a1

i j,· · · ,ana

i j ,b0

i j,· · · ,bnb

i j ].(29)394

395

ϑi j contains the parameters in Ki j, i.e.396

Ki j(q)=Bi j (q)

Ai j(q)=

i j +b1

i jq−1+· · · +bnb

i j q−nb

1+a1

i jq−1+· · · +ana

i j q−na

.(30)397

In practice it is common to set na=nb=nK. Notice that (27) is398

a linear regression such that ϑi j can be obtained using the least-399

square (LS) method. After doing lidentiﬁcation experiments400

and identifying l·pmodels, a MIMO model of K(q) whose401

elements are independently parameterized is obtained.402

Test signals may be simultaneously added in a single test and403

a MIMO model is identiﬁed using some MIMO identiﬁcation404

methods. For our purpose MIMO test is not really necessary be-405

cause the “tests” here are simulations which do not have much406

economic cost. Moreover, using MIMO identiﬁcation approach407

here will need to use very high orders, which can cause numer-408

ical problems.409

The isolation ﬁlter L(q) can be obtained by transposing the410

identiﬁed MPC controller. For checking whether it has decou-411

pled d

Gy f (q), one can investigate the step or frequency responses412

of L(q)d

Gy f (q). The above identiﬁcation method are summa-413

rized in Algorithm 1 where the identiﬁcation of plant model is414

also included.415

Some comments for the algorithm are given below:416

1) Data set ZNfor plant identiﬁcation: Because Gyf (q)417

shares some same elements with G(q), it is necessary to 418

carry out identiﬁcation tests using test signals (excitation) 419

and generate informative data, in order to guarantee a 420

high-quality fault to output model. 421

2) Γ(q)and related Q,R: To better approximate PFI and to 422

ensure a desirable detection speed, a high Qto Rratio set- 423

ting is suggested. The diagonal elements of Γ(q) should 424

be tuned to have fast dynamics while the nondiagonal ele- 425

ments should only pose some small ﬂuctuations then tend 426

to zero quickly. 427

3) Test signals ηiand simulation time NK:NKmust be chosen 428

suﬃcient large while the design of the persistent excitation 429

signal ηican be found in [32]. 430

4) Controller order nK: Before identiﬁcation, the order of 431

controller can be roughly estimated by investigating the or- 432

der of the inverse of d

Gy f (q). When the actual order is very 433

high, the above identiﬁcation procedure actually plays the 434

role of model reduction. Note that despite the noise-free 435

feature of this identiﬁcation problem (no variance error), 436

the bias error induced by model reduction must be care- 437

fully treated. The ﬁnal determination of nKmay become 438

a trail and error procedure. It is suggested that check the 439

gain matrix, or step responses of the delivered Γto see 440

whether the identiﬁed controller model serves for the FI 441

purpose, some illustrations are given in Section 5.2. 442

Though in Section 3.1 we begin the discussion with DMC, 443

in practice other forms of MPC can also be used. Moreover, 444

the method to identify the MPC controller developed in Algo- 445

rithm 1 can also be used in other occasions where one needs to 446

decouple some variables, or requires to invert a system model. 447

The proposed algorithm has following advantages that makes it 448

very applicable: 449

1) it is numerically reliable because all the calculations are 450

explicit; 451

2) it requires no special treatment for time delays and RHP 452

zeros; 453

3) it can handle large-scale systems, which will be demon- 454

strated using the TE process. 455

3.3. Fault isolation with model errors 456

In Algorithm 1, the fault isolation ﬁlter L(q) is derived based 457

on estimated d

Gy f (q) without considering model errors. In order 458

to make the method more applicable, model errors are incorpo- 459

rated here. 460

Denote L(q)r(t) as rFI , when ∆G(q),0, ∆Gy f (q) :=Gy f (q)−461

Gy f (q),0, the transfer matrix from fto rFI becomes: 462

Γo(q)=L(q)Gy f (q)

=L(q)d

Gy f (q)+ ∆Gy f (q)=Γ(q)+L(q)∆Gy f (q)(31) 463

One can see that the diagonal structure of Γ(q) may be damaged464

by the additional term L(q)∆Gy f (q). Denote this term as ∆Γ(q).465

Obviously, when ∆Gy f (q) becomes signiﬁcant, FI will fail.466

In system identiﬁcation for model-based control, uncertainty467

regions of the estimated model are often developed for test-468

ing whether the achieved model is qualiﬁed for control purpose469

[32]. Due to that ∆Gy f (q) is closely related to plant model (as470

mentioned in Section 2.2), follow an analogous line as in iden-471

tiﬁcation for control, uncertainty regions of ∆Gy f (q) can be de-472

veloped and used to validate whether L(q) is suﬃciently accu-473

rate for the FI. One way is to introduce uncertainty into the step474

responses of Γo(q). By investigating step responses one can475

evaluate the severity of interactions. As Γ(q) is already known476

hence the remaining task is to derive the uncertainty region of477

∆Γ(q)f(t).478

When PEM is used to build the model, parameter uncertainty479

description in (5) can be used for the subsequent study. Follow-480

ing assumption is needed to establish (5):481

Assumption 1. Assume that S∈Msuch that there exists a θo

482

that G(q)=G(q,θo), H(q)=H(q,θo), the white noise e(t) is483

Gaussian; the identiﬁcation are based on an informative data set484

ZN[33] of suﬃcient large N, and correct system order is used.485

With no loss of generality, denote the fault vector as fT(t)=486

[( fs(t))T,(fa(t))T], where fs(t)∈Rnsdenotes output faults and487

fa(t)∈Rnadenotes input faults, na+ns=l. Denote Ga(q,θo)488

as the real transfer matrix from fa(t) to y(t). Notice that489

∆Gy f (q)=h0p×ns∆Ga(q)i(32)490

where491

∆Ga(q) :=Ga(q,θo)−Ga(q,ˆ

θN) (33)492

contains elements of ∆G(q). According to (32)493

L(q)∆Gy f (q)f(t)=L(q)∆Ga(q)fa(t).(34)494

Denote ∆θ:=ˆ

θN−θo, then using ﬁrst-order Taylor expansion495

∆Ga(q)fa(t)=

Pna

i=1∆Ga

1ifa

Pna

i=1∆Ga

pi fa



=

Pna

i=1

dGa

dθT∆θfa

Pna

i=1

dGa

dθT∆θfa



=[Ga(q,θo)]0fa(t)⊗Id∆θ

(35)

496

where [Ga(q,θo)]0is a block matrix whose (i,j) block equals497

dθTGa

i j(q,θo). Now (34) writes:498

L(q)∆Gy f (q)f(t)=L(q)[Ga(q,θo)]0fa(t)⊗Id

| {z }

Υ(t)

∆θ(36)499

where Υ(t)∈Rl×d. Under Assumption 1, (5) holds such500

that ∆θsubjects to Gaussian distribution, cov(∆θ)=1

NP(θo).501

Hence L(q)∆Gy f (q)f(t) also subjects to Gaussian distribution502

and Gauss’ approximation formula [33] directly gives:503

cov hL(q)∆Gy f (q)f(t)i=1

NΥ(t)P(θo)ΥT(t).(37)504

To summarize: 505

∆Γ(q)f(t)∼ N(0,1

NΥ(t)P(θo)ΥT(t)).(38) 506

Moreover, denote P(t) as a vector containing the diagonal ele- 507

ments of 1

NΥ(t)P(θo)ΥT(t), we have 508

∆Γ(q)f(t)i∼ N(0,Pi(t)).(39) 509

(39) implies that: 510

∆Γ(q)f(t)i≤ U β

2pPi(t),w.p.1−β(40) 511

where −U β

,Uβ

2speciﬁes a conﬁdence interval of the stan- 512

dard Gaussian distribution with conﬁdence level 1−β,βis some 513

probability. Using (40), the uncertainty region of each compo- 514

nents of ∆Γ(q)f(t) can be determined. 515

Now by setting f(t) properly we can derive the step responses 516

of Γ(q) with error bounds which will be called step response 517

bands. Under Assumption 1, Γo(q) will fall into this step re- 518

sponse bands. Deﬁne a Boolean matrix Vas 519

Vi j =









1step response bands of Γi j intersects with 1

at steady state

0 else

(41) 520

Notice that Vii =1 and according to (31), when ∆Gy f is not 521

signiﬁcant, the step response band will be narrow and Vi j =0522

while for large ∆Gy f , some Vi j may becomes 1. 523

Assume that faults do not occur simultaneously, using Vit 524

can be inferred how Γo(q) becomes when model error exists. 525

Several situations may occur: 526

1) Vis diagonal: It means that in the presence of model un- 527

certainty, the fault still excites most its corresponding com- 528

ponent in rFI . Hence one can simply decide which fault 529

occurs according to that which component of the residual 530

exceeds the threshold; 531

2) Vis nondiagonal but rank(V)=l: Due to model uncer- 532

tainty, the fault excite several components of rFI , to a in- 533

distinguishable extent. However, rank(V)=limplies that 534

diﬀerent combinations of components will be excited (dif- 535

ferent patterns) and by matching the pattern FI can be real- 536

ized; for example, consider a Gy f with dimension 2 ×2. If 537

the step response bands of Γ21 intersects with 1 at steady 538

state while Γ12 does not, then 539

V="1 0

1 1#(42) 540

is of full rank. In this case f1excites rFI

1and rFI

2whereas f2541

only excites rFI

2, by investigating diﬀerent patterns caused 542

by f1and f2, fault isolation can be realized; 543

3) rank(V)<l: This implies that many step response bands 544

of Γ(q) are wide and the fault model is poor. In this sit- 545

uation, l−rank(V) faults cannot be isolated. Model re- 546

identiﬁcation with improved plant tests are recommended 547

here. 548

Remark 2. Typically, the model error contains the noise in-549

duced variance error and structure defect induced bias error.550

Notice that (5) only considers variance error. However, when551

the true system satisﬁes (1), it is arguable that the total error in552

any identiﬁed model is dominated by variance error [48].553

In situations 1) and 2) above, the matrix Vcan be used in fault554

isolation as follows. Deﬁne a Boolean vector Ξwith dimension555

l. If rFI

ialarms, set Ξi=1 else set Ξi=0. In the ideal case,556

a single column in Vequals Ξ, implying that the patterns are557

perfectly matched; then the fault is at this column. Otherwise,558

the fault number k∗can be determined by559

k∗=arg max

i=1

Vik Ξi,(43)560

where denotes the logical exclusive nor operator. This im-561

plies that the fault number is determined when two patterns are562

most closely matched. It should be reminded that due to incon-563

sistency between real systems and our assumptions, occasions564

may occur that (43) delivers nonunique k∗. We suggest that then565

choose the component that alarms, i.e. exceeds the threshold566

most intensively as the most possible candidate.567

4. Optimal FI ﬁltering568

After addressing model errors in FI, the issue of disturbance569

will be studied. Unmeasured disturbances (referring ¯

v(t) in570

(13)) aﬀect the isolation performance. That is to say, despite571

that in rFI (t) faults can be well structured, it may not cause572

alarms in rFI (t) if the detection performance of rFI (t) is poor,573

which means that the FI will fail. A situation might be encoun-574

tered is that a fault which could originally be detected by the575

basic residual rmay not be detected, causing miss alarms in576

the new residual rFI. In model-based fault diagnosis methods,577

one typical way to improve the detection ability of a certain578

residual is to use post-ﬁlters that maximize some performance579

indices [2]. For our purpose to enhance FI performance, the580

ﬁlter related to rFI

k(t) and fkis chosen such that:581

QFI

opt,k(q)=arg max

EnQ(q)rFI,fk

k(t)o2

EnQ(q)rFI,n

k(t)o2.(44)582

We call QFI

opt,k(q) optimal ﬁlter due to that it maximizes a spe-583

ciﬁc performance index. According to [27], QFI

opt,k(q) can be584

obtained by investigating spectrum information, which is a fre-585

quency selection ﬁlter selecting the frequency586

ωopt,k=arg max

ΦrFI,fk

(ω)

ΦrFI,n

k(ω).(45)587

In practice, considering realizability and phase delay, this opti-588

mal ﬁlter can be approximated by bandpass or lowpass ﬁlters.589

After a series of such optimal ﬁlters are designed, set590

QFI

opt(q)=diag nQFI

opt,1,· · · ,QFI

opt,ko(46)591

the ﬁltered rFI becomes 592

rFI

opt(t)=QFI

opt(q)L(q)r(t).(47) 593

Notice that after being ﬁltered by the diagonal QFI

opt(q), the de- 594

coupled structure is maintained in the residual. 595

Now combining the FD scheme developed in [27], we pro- 596

pose a FDI scheme in Fig. 4. In the FD block, rFD

opt which typi- 597

cally has the highest fault detection ability, is used; when faults 598

are detected, the isolation ﬁlter is used to make the faults struc- 599

tured and another optimal ﬁlter QFI

opt is used to improve the de- 600

tection ability of rFI . There are three ﬁlters in the FDI scheme 601

playing important roles, one isolation ﬁlter and two optimal ﬁl- 602

ters aiming at enhancing abilities to detect faults. The two op- 603

timal ﬁlters are generally diﬀerent and should be designed ac- 604

cording to diﬀerent residuals. However for special cases such 605

as step and drift faults, they are both low-pass ﬁlters. 606

5. A 600MW power generation unit example 607

In this section we study fault isolation of a 600 MW super- 608

critical power generating unit. A coordinate control system 609

(CSS) is required to maintain the stable operation of the unit 610

while meeting the power demand from the power grid. In CCS, 611

three main controlled variables are steam ﬂow (proportional to 612

output power at steady working points), main steam pressure 613

and main steam temperature; three main manipulated variables 614

are coal feed rate, main steam valve (steam turbine valve) and 615

feed water ﬂow. The variables of the unit are coupled. This 616

section will use a 3 ×3 unit model identiﬁed from real data 617

(with asymptotic method [32], and at 100% load) to simulate 618

the dynamics of the unit. This model has been proved to be 619

high-quality, and used in a MPC system. 620

The continuous-time transfer matrix of the model is given 621

in (48). It can be veriﬁed that the system has time delays and 622

RHP zeros. The input-output variable names are given in Ta- 623

ble 1, in which boiler control command directly aﬀect coal 624

feed rate; middle point temperature is an alternative of the main 625

steam temperature which can indicate the temperature changes 626

more sensitively; desuperheating water valve regulates the mid- 627

dle point temperature below some upper limit. We introduce 628

output disturbance to each output with 10% noise-to-signal ra- 629

tio, as given in (49) its discrete form and use three signals with 630

low-pass spectrums to simulate the input variations at the steady 631

working point: 632

v1(t)=1−0.1q−1

1−0.92q−1¯e1(t),

v2(t)=1−0.2q−1

1−0.89q−1¯e2(t),

v3(t)=1−0.29q−1

1−0.994q−1¯e3(t),

(50) 633

where ¯e1(t),¯e2(t),¯e3(t) are uncorrelated Gaussian white noise 634

with zero mean and unit variance. In simulation, the sampling 635

interval is chosen as 10s. The faults in the three inputs will be 636

process

Process input

process

model -

Process output No

Decision log ic

(for FI)

Improve FD

performace

Decision log ic

(for FD)

Decision log ic

(for FD)

Which faul t(s) occurs?

Yes

Improve FI

performace

Low FI perfomance

Yes

Alarm

FD block FI block

Continue monitoring Yes No

Redesign filter

Figure 4: Block diagram of the proposed FDI method.

G(s)=

0.0183(s+0.0609)(s+0.0118)e−40 s

(s+0.2586)(s+0.00541)(s+0.00406)

0.00489(s+0.175)(s+0.000910)e−40 s

(s+0.00267)(s2+0.0309s+0.00645)

0.000260(s+0.350)(s+0.00310)e−40 s

(s+0.0799)(s+0.00570)(s+0.00357)

0.000263(s2+0.0275s+0.00117)e−20 s

(s+0.00110)(s2+0.0876s+0.00280)

0.00137(s−0.0351)(s+0.0147)e−20 s

(s+0.302)(s+0.0385)(s+0.00648)

1.526×10−5(s+0.277)(s+0.00476)e−20s

(s+0.03283)(s+0.0148)(s+0.00339)

−0.00190(s+0.0134)(s−0.00270)e−130 s

(s+0.00111)(s2+0.0655s+0.000175)

−0.00157(s+0.0102)(s+0.00276)e−130 s

(s+0.00116)(s2+0.0133s+0.000264)

−0.000164(s2+0.0231s+0.000366)e−130 s

(s+0.00105)(s2+0.00857s+0.000164)



(48)

H(q)=diag (1−0.7413q−1+0.01897q−2

1−1.905q−1+0.9063q−2,1−0.42q−1

1−0.98q−1,1−0.6q−1

1−0.92q−1)(49)

considered, which, from above discussion, will have signiﬁcant637

eﬀect on CCS of the unit.638

Table 1: Input and output variables of the numerical example.

Input variables Output variables

u1Boiler main control command y1Output power

u2Main steam valve y2Steam pressure

u3Desuperheating water valve y3Middle point temperature

5.1. Plant identiﬁcation639

To illustrate the proposed method, ﬁrst a model is re-640

identiﬁed with model error quantiﬁed. Three uncorrelated GBN641

signal are added to the inputs, the experiment time N=8000642

samples. Three MISO BJ models are identiﬁed where cor-643

rect orders and delays are used, the result is shown in Fig. 5,644

from which one could see that the identiﬁed model can capture645

the dynamic characteristics of the real system and model error646

exists due to the disturbance (49). Notice that when decom-647

posing the system to pMISO subsystems, the corresponding648

estimated parameter vector ˆ

θ1,· · · ,ˆ

θpis uncorrelated, hence649

P(ˆ

θ)=diag{P(ˆ

θ1),· · · ,P(ˆ

θp)}.650

5.2. Isolation ﬁlter design and validation651

Based on the identiﬁed model, an MPC controller for iso-652

lation ﬁlter is designed and identiﬁed using Algorithm 1,653

the parameters used are P=400, M=200, Q=654

diag{7000,6500,2500},R=diag{1,1,1}. The simulation time 655

NK=8000 samples. As an illustration of determining the iden- 656

tiﬁed controller order nK, diﬀerent orders are chosen, the rela- 657

tive error (RE) in identiﬁcation of each element of Kand the 658

ﬁnal gain of L(q)ˆ

G(q) is shown in Table 2. RE is deﬁned as: 659

RE =100 ×var(y−ˆy)

var(y),(51) 660

ydenotes real output and ˆydenotes simulation output. RE is a 661

commonly used criteria to evaluate the ﬁtness to real data of the 662

identiﬁed model. 663

From Table 2, it can be seen that for controller identiﬁca- 664

tion, REs are typically small compared to plant identiﬁcation 665

problem due to its noise-free feature. When nK=15, REs are 666

all small except for the one of K12, whereas the gain matrix is 667

very unsatisfactory because gains of Γ21,Γ23,Γ31 are large es- 668

pecially for Γ21. Compared nK=15 and nK=20, REs are 669

slightly reduced while the gains of the nondiagonal elements of 670

Γare reduced considerably. When nKincreases to 30, the REs 671

are almost unchanged compared to nK=20 however the gain 672

matrix is much improved, which will lead to a better FI. No sig- 673

niﬁcant improvement occurs when increasing the order further, 674

so nK=30 will be used in this section. It can be concluded that 675

for noise-free identiﬁcation, RE is not sensitive to the bias error 676

caused by model reduction. To check whether nKis proper, one 677

should pay attention to the ﬁnal gain of Γ.678

Then, based on Section 3.3, the step response bands of Γ(q)679

0 2000 4000 6000

Time [s]

0 2000 4000 6000

Time [s]

0.1

0.2

0 2000 4000 6000

Time [s]

0.1

0.2

0 2000 4000 6000

Time [s]

0.05

0.1

0 2000 4000 6000

Time [s]

-10

-5

10-3

0 2000 4000 6000

Time [s]

0.005

0.01

0 2000 4000 6000

Time [s]

0.2

0.4

0 2000 4000 6000

Time [s]

-0.15

-0.1

-0.05

0 2000 4000 6000

Time [s]

-0.3

-0.2

-0.1

Boiler main control command Main steam valve Desuperheating water valve

2XWSXW

SRZHU

6WHDP

SUHVVXUH

0LGGOH

SRLQW

WHPSHUature

Figure 5: Step responses of real plant and of the identiﬁed model.

Table 2: REs and gain matrices corresponding to the identiﬁcation results using

diﬀerent orders.

Model order RE(%) Gain

15 

0.64 5.47 1.07

0.68 0.63 0.97

0.70 0.80 0.89



0.9637 0.0014 −0.0029

−1.8907 0.9376 −0.1558

0.2634 −0.0019 1.0171



20 

0.62 0.61 0.87

0.67 0.57 0.85

0.70 0.80 0.89



0.9907 0.0004 −0.0003

0.2298 0.9881 0.0264

−0.1834 0.0261 0.9812



30 

0.62 0.61 0.87

0.67 0.56 0.84

0.70 0.80 0.89



1.0012 −0.0001 0.0001

−0.0655 1.0045 0.0072

−0.0004 −0.042 0.9867



can be calculated and plotted, as shown in Fig. 6. From this ﬁg-680

ure, we can see that step responses of Γ0(q) falls into the step681

response bands of Γ(q), and the derived isolation ﬁlter can be682

used for FI because Vis diagonal and has full rank, satisfying683

situation 1) in Section 3.3. By comparing Fig. 5 and Fig. 6, it684

can also be found that the dynamic characteristics of the faults685

to residuals transfer function are accelerated because in design-686

ing procedure of L(q), large Qto Rratio is used.687

5.3. Fault isolation implementation688

Faults are assumed to be step-type and three diﬀerent simu-689

lations are carried out and each time only one fault occurs at690

7000s. The amplitude of the faults are set small compared to691

normal variations of the input signals. The results are shown692

in Fig. 7, where Hotelling’s T2statistic of each component of693

the residual is used and conﬁdence level of the threshold is set694

to F, for details about this statistic and threshold setting see e.g.695

[27, 49]. For the reason that the three faults are all minor faults696

the detection performance of rFI is low, in Fig. 7 one can ﬁnd697

that rFI cannot detect the faults. Then, by using optimal ﬁlters698

to components of rFI we obtain rFI

opt which can detect the faults699

and the three diﬀerent faults can be isolated. The optimal ﬁl-700

ters used are low-pass Butterworth ﬁlter with 0.05rad/s cutoﬀ701

frequency.702

Table 3: Model inputs and outputs.

Block name Variable name Variable number

D feed ﬂow XMV(1)

E feed ﬂow XMV(2)

A feed ﬂow XMV(3)

A and C feed ﬂow XMV(4)

Compressor recycle valve XMV(5)

Model input Purge valve XMV(6)

Separator pot liquid ﬂow XMV(7)

Stripper liquid product ﬂow XMV(8)

Stripper steam valve XMV(9)

Reactor cooling water ﬂow XMV(10)

Condenser Reactor cooling water ﬂow XMV(11)

Reactor feed rate XMEAS(6)

Reactor pressure XMEAS(7)

Reactor temperature XMEAS(9)

Model output Separator pressure XMEAS(13)

Stripper pressure XMEAS(16)

Compressor work XMEAS(20)

Reactor cooling water outlet temperature XMEAS(21)

Component A XMEAS(23)

6. Application to Tennessee Eastman process 703

This section is dedicated to show the ability of the proposed 704

method to handle large-scale system with unknown model 705

structure. The well-known TEP benchmark is used for the pur- 706

pose. TEP was developed to provide a realistic simulation of 707

an industrial process for the evaluation of monitoring methods 708

[51]. Fig. 8 shows the ﬂow diagram of TEP with 5 major units: 709

Table 4: Eight newly deﬁned faults in TEP.

Variable number Process variable Type Amplitude

FAULT(1) Reactor feed rate sensor Step 0.2

FAULT(2) Separator pressure sensor Step 0.5

FAULT(3) Stripper pressure sensor Step 0.5

FAULT(4) D feed loss Step 0.3

FAULT(6) Separator pot liquid leakage Step 2

FAULT(5) A feed loss Step 3

FAULT(7) Reactor cooling water valve Step 0.5

FAULT(8) Condenser Reactor cooling water valve Step 4

Fault 1 Fault 2Fault 3

rFI-component 1

rFI-component 2

rFI-component 3

Figure 6: Step responses of Γo(q) and step response bands of Γ(q), the conﬁdence level is 95%.

0 5000 10000 15000

T2 of component 1

0 5000 10000 15000

T2 of component 2

0 5000 10000 15000

Time [s]

T2 of component 3

FI FI

(a) u1fault

0 5000 10000 15000

T2 of component 1

0 5000 10000 15000

T2 of component 2

0 5000 10000 15000

Time [s]

T2 of component 3

), ),

(b) u2fault

0 5000 10000 15000

T2 of component 1

0 5000 10000 15000

15 T2 of component 2

0 5000 10000 15000

Time [s]

100

T2 of component 3

), ),

Figure 7: FI of three actuator faults, diﬀerent components of rFI and rFI

opt are shown for comparison. The red dashed line denotes the 99.9% threshold.

Figure 8: A process ﬂowsheet of Tennessee Eastman process given in [50].

reactor, condenser, compressor, separator and stripper. The pro-710

cess has two products from four reactants. Additionally, an inert711

and a by-product are also present making a total of 8 compo-712

nents denoted as A, B, C, D, E, F, G and H [49]. The process713

allows total 52 measurements out of which 41 are of process714

variables and 11 are manipulated variables; and the complete715

description of these variables can be found in [49]. A brief ver-716

sion covering the variables which will be used in this section is717

shown in Table 3.718

There are two alternatives to do fault diagnosis research with719

TEP, one is the data set given in [52], another is the Simulink720

code provided by Ricker [53] which is available to simulate the721

plant’s closed-loop behavior. In [52], 21 documented faults are722

deﬁned for fault diagnosis study, but the they are mainly mul-723

tiplicative (process) faults. In order to validate the proposed724

method, we use the simulators oﬀered by [53] and deﬁne eight725

additive faults as listed in Table 4. The fault to output transfer726

functions can be obtained using system identiﬁcation. Among727

these faults, FAULT(1-3) are output faults and FAULT(4-8) are728

input faults. The sampling interval in this section is chosen as729

3 min.730

6.1. TEP identiﬁcation731

Similarly to Section 5, a system identiﬁcation is needed be-732

fore we perform FI. Based on the deﬁned faults, we choose733

MV(1-11) as inputs and XMEAS(6,7,9,13,16,20,21,23) as out-734

puts to build a model for residual generation. TEP is operated735

in closed-loop so 11 uncorrelated GBN signals are added to736

the initial setpoints corresponding to the chosen outputs, then737

20000 samples are generated, among which 17000 for parame- 738

ter estimation and 3000 for model validation. Because 8 faults 739

are used in the study, it is suﬃcient to choose 8 outputs. The 740

comparison between real output and model simulation output 741

of the validation data is shown in Fig. 9. All the models we 742

used are 2nd-order ARMAX model and for each MISO system, 743

the orders of transfer functions from diﬀerent inputs are set the 744

same. The identiﬁed model contains no time delays but have 745

RHP zeros. Based on the identiﬁed model, eight OE residuals 746

can be built and combined into the basic residual r.747

6.2. Isolation ﬁlter design 748

To get the isolation ﬁlter using Algorithm 1, ﬁrst the fault 749

signal vector and fault transfer function should be determined. 750

Let 751

f(t)=[FAULT(1),FAULT(2),· · · ,FAULT(8)]T,(52) 752

then 753

Gy f (q)=



100G11 G13 G17 G1(10) G1(11)

000G21 G23 G27 G2(10) G2(11)

0 0

1 0

.0 1 .

.0.

000G81 G83 G87 G8(10) G8(11)



(53) 754

0 1000 2000 3000

Sample

-1

XMEAS(6): RE= 17.58%

0 1000 2000 3000

Sample

-20

XMEAS(7): RE= 23.81%

0 1000 2000 3000

Sample

-1

-0.5

0.5

XMEAS(9): RE= 24.66%

0 1000 2000 3000

Sample

-20

XMEAS(13): RE= 26.62 %

0 1000 2000 3000

Sample

-20

XMEAS(16): RE= 23.65 %

0 1000 2000 3000

Sample

-10

XMEAS(20): RE= 2.326 %

0 1000 2000 3000

Sample

-1

XMEAS(21): RE= 21.84 %

0 1000 2000 3000

Sample

-2

XMEAS(23): RE= 9.202 %

Real output Simulation output

Figure 9: Identiﬁcation result of TEP.

in which the left block corresponds to output faults FAULT(1-755

3), the right block corresponds to input faults FAULT(4-7). The756

step responses of d

Gy f (q) is shown in Fig. 10. The ﬁnal parame-757

ters we used are Q=250·[1 1 1 1 2 1 1 1], R=[1 1 1 1 1 1 1 1],758

P=120, M=60, n=60. The step response bands of the759

achieved Γ(q) is shown in Fig. 11, from which one can see that760

the identiﬁed MPC controller eliminates the couplings in d

Gy f (q)761

to a great extent. According to Section 3.3762



1 00000

1 00010

100000

10000

01000

05×300100

00010

00011



,(54)763

which has full rank and is in accordance with situation 2), hence764

the designed isolation ﬁlter can be used for FI purpose despite765

the model error.766

6.3. FI implementation767

As has mentioned and suggested in Section 4, to perform768

fault diagnosis, we recommend do FD based on rand rFD

opt and769

perform FI using the isolation ﬁlter. In [27], we have discussed770

the FD problem in detail and by using OEs and optimal ﬁlters,771

all the 21 documented faults in [52] can be well detected, based772

on the benchmark data set. Here, based on the simulator the773

8 newly deﬁned faults can also be well detected following the 774

same method, and in order to avoid repetition FD step will be 775

omitted, only the FI step will be presented. 776

Similar to Section 5, each time only one fault occurs and 777

thus the change should occur only in one component of rFI .778

Faults occur at 200 sample time. The conﬁdence interval of 779

the T2statistics are all set to 99.9%. In order to improve the FI 780

performance, QFI

opt(q) will be used as an optimal ﬁlter, which has 781

a diagonal structure and each diagonal element is a Butterworth 782

low-pass ﬁlter. The results are shown in Fig. 12, we take (d)(g) 783

for instance: for (d), Ξ=[00010000]T. According to (43), 784

k∗=4 hence one could conclude that FAULT(4) occurs; for (g), 785

k∗is not unique however due to the fact that only component 786

7 alarms, we conclude that FAULT(7) occurs. Similarly, the 787

8 faults can all be isolated. If faults occur sequentially, one 788

new fault will cause alarm in one or several new components 789

of rFI

opt, then decision should be made based on these new alarm 790

information. 791

Though TEP has become a benchmark for fault diagnosis 792

study, few model-based FD and FI methods has been validated 793

for TEP. One reason is that model-based methods are nearly all 794

based on ﬁrst-principle state-space model and the exact model 795

of TEP is unknown. Recently, some identiﬁcation based meth- 796

ods are proposed and validated in TEP, see e.g. [2, 49, 54, 27], 797

however they only focus on FD and the validations of FI meth- 798

ods are rare. In this paper, based on an identiﬁed model, the 799

proposed method is validated using TEP and the results can be 800

considered promising. 801

0.5

0 1

-0.02

0.02

0 1

-0.02

0.02

0 100 200

-0.4

-0.2

0 100 200

0.04

0.08

0 100 200

-0.2

-0.1

0 100 200

0.5

0 200

-0.02

0.02

0 1

-0.02

0.02

0 1

-0.02

0.02

0 1

-0.02

0.02

0 20

-3

-2

-1

0 20

0.04

0.08

0 20

-0.1

-0.05

0 20

0.5

0 20

-0.08

-0.04

0 1

-0.02

0.02

0 1

-0.02

0.02

0 1

-0.02

0.02

0 50

0.2

0.4

0 50 100

0.04

0.08

0 50

-0.08

-0.04

0 20 40

-1

-0.5

0 50

-0.15

-0.1

-0.05

0 1

-0.02

0.02

10 20

0.5

0 1

-0.02

0.02

0 20

-4

-2

0 20

0.04

0.08

0 20 40

0.05

0.1

0 20

-0.2

-0.1

0 1

-0.02

0.02

0 1

-0.02

0.02

10 20

0.5

0 20

-2

-1

0 20

0.05

0.1

0 20

0.05

0.1

0 20

-0.2

0.2

0.4

0.6

0 20

-0.02

0.02

0 1

-0.02

0.02

0 1

-0.02

0.02

0 1

-0.02

0.02

0 500

-10

-5

0 500

-6

-4

-2

0 500

0.2

0.4

0.6

0.8

0 1

-0.02

0.02

0 1

-0.02

0.02

0 1

-0.02

0.02

0 50

0.1

0.2

0 50 100

-0.02

0.02

0.06

0 50 100

-0.06

-0.04

-0.02

0 50

-1

-0.5

0 50 100

-0.1

-0.05

0 1 Sample

-0.02

0.02

0 1

-0.02

0.02

0 1

-0.02

0.02

0 1900

-3

-2

-1

0 1900

0.5

1.5

0 1900

-1

-0.5

0 1900

0.5

0 1900

0.2

0.4

0.6

0.8

FAULT(1)

XMEAS(6) FAULT(2)

XMEAS(13) FAULT(3)

XMEAS(16) FAULT(4)

XMV(1) FAULT(6)

XMV(7)

FAULT(5)

XMV(3) FAULT(7)

XMV(10) FAULT(8)

XMV(11)

100

Figure 10: Step responses of d

Gy f (q). Subﬁgures marked as red corresponds to d

Gy f

i j containing RHP zeros.

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

0.5

0 50

-0.5

0.5

0 50

-1

0 50

-1

0 50

-0.5

0.5

1.5

0 50

-10

0 50

-5

0 50

-0.5

0.5

0 50

-2

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-1

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-2

0 50

-0.5

0.5

1.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-1

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-4

-2

0 50

-1

0 50

-0.5

0.5

0 50

-2

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-1

0 50

-0.5

0.5

0 50

-0.5

0.5

0 50

-0.5

0.5

Figure 11: Step response bands of Γ(q). In columns 4-8, the red dashed lines denote the unity lines.

0 200 400 600 800 1000

100

T2 of component 1

0 200 400 600 800 1000

T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

T2 of component 6

0 200 400 600 800 1000

T2 of component 7

0 200 400 600 800 1000

Samples

T2 of component 8

(a) FAULT(1)

0 200 400 600 800 1000

T2 of component 1

0 200 400 600 800 1000

100 T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

T2 of component 6

0 200 400 600 800 1000

T2 of component 7

0 200 400 600 800 1000

Samples

T2 of component 8

(b) FAULT(2)

0 200 400 600 800 1000

T2 of component 1

0 200 400 600 800 1000

T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

T2 of component 6

0 200 400 600 800 1000

T2 of component 7

0 200 400 600 800 1000

Samples

T2 of component 8

0 200 400 600 800 1000

T2 of component 1

0 200 400 600 800 1000

T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

2104T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

T2 of component 6

0 200 400 600 800 1000

T2 of component 7

0 200 400 600 800 1000

Samples

T2 of component 8

(d) FAULT(4)

0 200 400 600 800 1000

T2 of component 1

0 200 400 600 800 1000

T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

T2 of component 6

0 200 400 600 800 1000

T2 of component 7

0 200 400 600 800 1000

Samples

T2 of component 8

(e) FAULT(5)

0 200 400 600 800 1000

T2 of component 1

0 200 400 600 800 1000

T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

100

200

T2 of component 6

0 200 400 600 800 1000

T2 of component 7

0 200 400 600 800 1000

Samples

T2 of component 8

(f) FAULT(6)

0 200 400 600 800 1000

T2 of component 1

0 200 400 600 800 1000

T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

T2 of component 6

0 200 400 600 800 1000

500

T2 of component 7

0 200 400 600 800 1000

Samples

T2 of component 8

(g) FAULT(7)

0 200 400 600 800 1000

T2 of component 1

0 200 400 600 800 1000

T2 of component 2

0 200 400 600 800 1000

T2 of component 3

0 200 400 600 800 1000

T2 of component 4

0 200 400 600 800 1000

T2 of component 5

0 200 400 600 800 1000

T2 of component 6

0 200 400 600 800 1000

T2 of component 7

0 200 400 600 800 1000

Samples

100

T2 of component 8

(h) FAULT(8)

Figure 12: FI of 8 diﬀerent faults in TEP. In the ﬁgure, the blue line is component of rFI

opt while the red dashed line denotes the 99.9% threshold.

7. Conclusion802

A model based fault isolation method is developed. Using803

the identiﬁed plant transfer function model, the output errors804

are used as basic residuals. Then based on the faults to out-805

puts transfer function matrix, an isolation ﬁlter is designed by806

combining an MPC algorithm and an identiﬁcation algorithm.807

A method to validate the isolation ﬁlter under model errors is808

developed; and an optimal isolation ﬁlter is proposed that sup-809

presses the disturbances. The method is straightforward that810

can handle large-scale processes, time delays and RHP zeros.811

The eﬀectiveness of the method is veriﬁed using two case stud-812

ies including the well-known TE process. The ill-conditioning813

of the plant may pose a problem for fault isolation ﬁlter de-814

sign because decoupler is involved, which will be investigated815

in future work. Further research concerning identiﬁcation based816

FDI can be detection and isolation of multiplicative faults and817

extending the developed methods to nonlinear systems.818

References

[1] Z. Ge, Z. Song, F. Gao, Review of recent research on data-based process

monitoring, Industrial & Engineering Chemistry Research 52 (10) (2013)

3543–3562.

[2] S. X. Ding, Model-based fault diagnosis techniques: design schemes, al-

gorithms, and tools, Springer Science & Business Media, 2008.

[3] H. Hong, C. Jiang, X. Peng, W. Zhong, Concurrent monitoring strategy

for static and dynamic deviations based on selective ensemble learning

using slow feature analysis, Industrial & Engineering Chemistry Research

59 (10) (2020) 4620–4635.

[4] S. J. Qin, Y. Dong, Q. Zhu, J. Wang, Q. Liu, Bridging systems theory and

data science: A unifying review of dynamic latent variable analytics and

process monitoring, Annual Reviews in Control (2020).

[5] J. MacGregor, A. Cinar, Monitoring, fault diagnosis, fault-tolerant con-

trol and optimization: Data driven methods, Computers & Chemical En-

gineering 47 (2012) 111–120.

[6] R. Isermann, P. Balle, Trends in the application of model-based fault de-

tection and diagnosis of technical processes, Control Engineering Practice

5 (5) (1997) 709–719.

[7] Q. Zhang, M. Basseville, A. Benveniste, Early warning of slight changes

in systems, Automatica 30 (1) (1994) 95–113.

[8] R. N. Clark, The dedicated observer approach to instrument failure detec-

tion, in: 1979 18th IEEE Conference on Decision and Control including

the Symposium on Adaptive Processes, Vol. 2, IEEE, 1979, pp. 237–241.

[9] P. M. Frank, Fault diagnosis in dynamic systems using analytical and

knowledge-based redundancy: A survey and some new results, Automat-

ica 26 (3) (1990) 459–474.

[10] J. W¨

unnenberg, P. Frank, Sensor fault detection via robust observers,

in: System fault diagnostics, reliability and related knowledge-based ap-

proaches, Springer, 1987, pp. 147–160.

[11] W. Ge, C.-Z. FANG, Detection of faulty components via robust observa-

tion, International Journal of Control 47 (2) (1988) 581–599.

[12] M. Hou, P. Muller, Disturbance decoupled observer design: A uniﬁed

viewpoint, IEEE Transactions on Automatic Control 39 (6) (1994) 1338–

1341.

[13] J. Zarei, E. Shokri, Robust sensor fault detection based on nonlinear un-

known input observer, Measurement 48 (2014) 355–367.

[14] F. Xu, J. Tan, X. Wang, V. Puig, B. Liang, B. Yuan, Mixed active/passive

robust fault detection and isolation using set-theoretic unknown input

observers, IEEE Transactions on Automation Science and Engineering

15 (2) (2017) 863–871.

[15] R. J. Patton, P. M. Frank, R. N. Clark, Issues of fault diagnosis for dy-

namic systems, Springer Science & Business Media, 2013.

[16] J. Gertler, Fault detection and diagnosis in engineering systems, CRC

press, 1998.

[17] P. M. Frank, X. Ding, Frequency domain approach to optimally robust

residual generation and evaluation for model-based fault diagnosis, Auto-

matica 30 (5) (1994) 789–804.

[18] P. M. Frank, X. Ding, Survey of robust residual generation and evalua-

tion methods in observer-based fault detection systems, Journal of Pro-

cess Control 7 (6) (1997) 403–424.

[19] S. X. Ding, T. Jeinsch, P. M. Frank, E. L. Ding, A uniﬁed approach to the

optimization of fault detection systems, International Journal of Adaptive

Control and Signal Processing 14 (7) (2000) 725–745.

[20] M. Zhong, S. X. Ding, J. Lam, H. Wang, An lmi approach to design robust

fault detection ﬁlter for uncertain lti systems, Automatica 39 (3) (2003)

543–550.

[21] B. Liu, J. Si, Fault isolation ﬁlter design for linear time-invariant systems,

IEEE Transactions on Automatic Control 42 (5) (1997) 704–707.

[22] L. Liu, S. Tian, D. Xue, T. Zhang, Y. Chen, S. Zhang, A review of indus-

trial mimo decoupling control, International Journal of Control, Automa-

tion and Systems 17 (5) (2019) 1246–1254.

[23] W. L. Luyben, Distillation decoupling, AIChE Journal 16 (2) (1970) 198–

203.

[24] C. E. Garcia, M. Morari, Internal model control. a unifying review and

some new results, Industrial & Engineering Chemistry Process Design

and Development 21 (2) (1982) 308–323.

[25] C. E. Garcia, M. Morari, Internal model control. 2. design procedure for

multivariable systems, Industrial & Engineering Chemistry Process De-

sign and Development 24 (2) (1985) 472–484.

[26] M. Morari, E. Zaﬁriou, Robust process control, Prentice Hall, 1989.

[27] J. Zhou, Y. Zhu, Identiﬁcation based fault detection: Residual selection

and optimal ﬁlter, Journal of Process Control 105 (2021) 1–14.

[28] J. H. Lee, Model predictive control: Review of the three decades of devel-

opment, International Journal of Control, Automation and Systems 9 (3)

(2011) 415–424.

[29] Y. Zhu, R. Patwardhan, S. B. Wagner, J. Zhao, Toward a low cost and

high performance mpc: The role of system identiﬁcation, Computers &

Chemical Engineering 51 (2013) 124–135.

[30] S. J. Qin, An overview of subspace identiﬁcation, Computers & Chemical

Engineering 30 (10-12) (2006) 1502–1513.

[31] M. Verhaegen, V. Verdult, Filtering and system identiﬁcation: a least

squares approach, Cambridge university press, 2007.

[32] Y. Zhu, Multivariable system identiﬁcation for process control, Elsevier,

2001.

[33] L. Ljung, System identiﬁcation, Wiley encyclopedia of electrical and

electronics engineering (1999) 1–19.

[34] M. Barenthin, X. Bombois, H. Hjalmarsson, G. Scorletti, Identiﬁcation

for control of multivariable systems: Controller validation and experiment

design via lmis, Automatica 44 (12) (2008) 3070–3078.

[35] M. Gevers, X. Bombois, B. Codrons, G. Scorletti, B. D. Anderson, Model

validation for control and controller validation in a prediction error iden-

tiﬁcation framework—part i: theory, Automatica 39 (3) (2003) 403–415.

[36] M. Gevers, Identiﬁcation for control: From the early achievements to the

revival of experiment design, European Journal of Control 11 (4-5) (2005)

335–352.

[37] B. Ninness, G. C. Goodwin, Estimation of model quality, Automatica

31 (12) (1995) 1771–1797.

[38] G. C. Goodwin, G. GC, P. RL, Dynamic System Identiﬁcation: Experi-

ment Design and Data Analysis, Volume 136 of Mathematics in Science

and Engineering, New York: Academic, 1977.

[39] S. X. Ding, Advanced methods for fault diagnosis and fault-tolerant con-

trol, Springer, 2020.

[40] X. Ding, P. M. Frank, Fault detection via factorization approach, Systems

& Control Letters 14 (5) (1990) 431–436.

[41] J. M. Maciejowski, Predictive control: with constraints, Pearson educa-

tion, 2002.

[42] S. J. Qin, T. A. Badgwell, A survey of industrial model predictive control

technology, Control Engineering Practice 11 (7) (2003) 733–764.

[43] J. H. Lee, M. Morari, C. E. Garcia, State-space interpretation of model

predictive control, Automatica 30 (4) (1994) 707–717.

[44] Y. Xi, Predictive control, Wiley Online Library, 2019.

[45] M. Morari, J. H. Lee, Model predictive control: past, present and future,

Computers & Chemical Engineering 23 (4-5) (1999) 667–682.

[46] C. E. Garcia, M. Morari, Internal model control. 3. multivariable control

law computation and tuning guidelines, Industrial & Engineering Chem-

istry Process Design and Development 24 (2) (1985) 484–494.

[47] H. J. Tulleken, Generalized binary noise test-signal concept for improved

identiﬁcation-experiment design, Automatica 26 (1) (1990) 37–49.

[48] L. Ljung, L. Guo, The role of model validation for assessing the size of

the unmodeled dynamics, IEEE Transactions on Automatic Control 42 (9)

(1997) 1230–1239.

[49] S. Yin, S. X. Ding, A. Haghani, H. Hao, P. Zhang, A comparison study of

basic data-driven fault diagnosis and process monitoring methods on the

benchmark tennessee eastman process, Journal of Process Control 22 (9)

(2012) 1567–1581.

[50] P. R. Lyman, Plant-wide control structures for the tennessee eastman pro-

cess, Ph.D. thesis, Lehigh University (1992).

[51] J. J. Downs, E. F. Vogel, A plant-wide industrial process control problem,

Computers & Chemical Engineering 17 (3) (1993) 245–255.

[52] L. H. Chiang, E. L. Russell, R. D. Braatz, Fault detection and diagnosis

in industrial systems, Springer Science & Business Media, 2000.

[53] N. L. Ricker, Decentralized control of the tennessee eastman challenge

process, Journal of Process Control 6 (4) (1996) 205–221.

[54] Z. Chen, K. Zhang, S. X. Ding, Y. A. Shardt, Z. Hu, Improved canoni-

cal correlation analysis-based fault detection methods for industrial pro-

cesses, Journal of Process Control 41 (2016) 26–34.

A review of research on diagnosability of control systems

Article

Jun 2024

A combined passive-active method for diagnosing multiplicative fault

Article

Aug 2023
PROCESS SAF ENVIRON

Active Fault Isolation for Multimode Fault Systems Based on a Set Separation Indicator

Article

Full-text available

May 2023
Entropy

This paper considers the active fault isolation problem for a class of uncertain multimode fault systems with a high-dimensional state-space model. It has been observed that the existing approaches in the literature based on a steady-state active fault isolation method are often accompanied by a large delay in making the correct isolation decision. To reduce such fault isolation latency significantly, this paper proposes a fast online active fault isolation method based on the construction of residual transient-state reachable set and transient-state separating hyperplane. The novelty and benefit of this strategy lies in the embedding of a new component called the set separation indicator, which is designed offline to distinguish the residual transient-state reachable sets of different system configurations at any given moment. Based on the results delivered by the set separation indicator, one can determine the specific moments at which the deterministic isolation is to be implemented during online diagnostics. Meanwhile, some alternative constant inputs can also be evaluated for isolation effects to determine better auxiliary excitation signals with smaller amplitudes and more differentiated separating hyperplanes. The validity of these results is verified by both a numerical comparison and an FPGA-in-loop experiment.

Identification-based sensor and actuator fault diagnosis for industrial control systems and its application to HTR-PM

Article

Feb 2023
CONTROL ENG PRACT

Automatic determination of optimal fault detection filter

Article

Full-text available

Oct 2022
J PROCESS CONTR

Optimal detection filters can greatly enhance fault detection performance, but designing these filters requires fault data which is difficult to obtain in practice. This paper proposes a scheme that automatically determines the optimal detection filter from a filter bank online without using fault data. The method can improve fault detection rate and accelerate detection speed. In order to reduce the false alarm rate, a method of threshold setting is introduced based on kernel density estimation. Implementation issues concerning filter bank design and online decision rule are also discussed. The method is validated in a numerical example and Tennessee Eastman process, and its performance is compared to those of other state-of-the-art methods.

Advanced methods for fault diagnosis and fault-tolerant control

Book

Full-text available

Dec 2020

Steven X. Ding

After the first two books have been dedicated to model-based and data-driven fault diagnosis respectively, this book addresses topics in both model-based and data-driven thematic fields with considerable focuses on fault-tolerant control issues and application of machine learning methods. The major objective of the book is to study basic fault diagnosis and fault-tolerant control problems and to build a framework for long-term research efforts in the fault diagnosis and fault-tolerant control domain. In this framework, possibly unified solutions and methods can be developed for general classes of systems. The book is composed of six parts. Besides Part I serving as a common basis for the subsequent studies, Parts II - VI are dedicated to five different thematic areas, including model-based fault diagnosis methods for linear time-varying systems, nonlinear systems and systems with model uncertainties, statistical and data-driven fault diagnosis methods, assessment of fault diagnosis systems, as well as fault-tolerant control with a strong focus on performance degradation monitoring and recovering. These parts are self-contained and so structured that they can also be used for self-study on the concerned topics. The content Basic requirements on fault detection and estimation – Basic methods for fault detection and estimation in static and dynamic processes – Feedback control, observer, and residual generation – Fault detection and estimation for linear time-varying systems – Detection and isolation of multiplicative faults in uncertain systems – Analysis, parameterisation and optimal design of nonlinear observer-based fault detection systems – Data-driven fault detection methods for large-scale and distributed systems – Alternative test statistics and data-driven fault detection methods – Application of randomised algorithms to assessment and design of fault diagnosis systems – Performance-based fault-tolerant control – Performance degradation monitoring and recovering – Data-driven fault-tolerant control schemes The target groups This book would be valuable for graduate and PhD students as well as for researchers and engineers in the field. The author Prof. Dr.-Ing. Steven X. Ding is a professor and the head of the Institute for Automatic Control and Complex Systems (AKS), University of Duisburg-Essen, Germany. His research interests are model-based and data-driven fault diagnosis, control and fault-tolerant systems as well as their applications in industry with a focus on automotive systems, chemical processes and renewable energy systems.

Mixed Active/Passive Robust Fault Detection and Isolation Using Set-Theoretic Unknown Input Observers

Article

Full-text available

Nov 2017

This paper proposes a robust fault detection and isolation (FDI) approach that combines active and passive robust FDI approaches. Standard active FDI approaches obtain robustness by using the unknown input observer (UIO) to decouple unknown inputs from residuals. Differently, standard passive FDI approaches achieve robustness by using the set theory to bound the effect of uncertain factors (disturbances and noises). In this paper, we combine the UIO-based and the set-based approaches to produce a mixed robust FDI, which can mitigate the disadvantages and exert the advantages of the two robust FDI approaches. In order to emphasize the role of set theory, the UIO design based on the set theory is named as the set-theoretic UIO (SUIO). A quadrotor subsystem is used to illustrate the effectiveness of the proposed FDI approach.

Improved canonical correlation analysis-based fault detection methods for industrial processes

Article

Full-text available

May 2016
J PROCESS CONTR

Recent research has emphasized the successful application of canonical correlation analysis (CCA) to perform fault detection (FD) in both static and dynamic processes with additive faults. However, dealing with multiplicative faults has not been as successful. Thus, this paper considers the application of CCA to deal with the detection of incipient multiplicative faults in industrial processes. The new approaches incorporate the CCA-based FD with the statistical local approach. It is shown that the methods are effective in detecting incipient multiplicative faults. Experiments using a continuous stirred tank heater and simulations on the Tennessee Eastman process are provided to validate the proposed methods.

Identification based fault detection: Residual selection and optimal filter

Article

Sep 2021
J PROCESS CONTR

In this work, an identification based fault detection method is proposed. The idea is to identify a dynamic process model from test data and to generate residuals using the identified model for fault detection. The method intends to improve fault detection performance while taking disturbance and model error into account. To this end, a fault detection performance index is introduced in a statistical framework. Then it is shown that the output error residual is more suitable for fault detection than the prediction error residual. Further an optimal detection filter maximizing the performance index is developed. Practical issues for implementing the detection filter are also addressed. Finally, the proposed method is illustrated through a numerical example and Tennessee Eastman process.

Bridging systems theory and data science: A unifying review of dynamic latent variable analytics and process monitoring

Article

Oct 2020
ANNU REV CONTROL

This paper is concerned with data science and analytics as applied to data from dynamic systems for the purpose of monitoring, prediction, and inference. Collinearity is inevitable in industrial operation data. Therefore, we focus on latent variable methods that achieve dimension reduction and collinearity removal. We present a new dimension reduction expression of state space framework to unify dynamic latent variable analytics for process data, dynamic factor models for econometrics, subspace identification of multivariate dynamic systems, and machine learning algorithms for dynamic feature analysis. We unify or differentiate them in terms of model structure, objectives with constraints, and parsimony of parameterization. The Kalman filter theory in the latent space is used to give a system theory foundation to some empirical treatments in data analytics. We provide a unifying review of the connections among the dynamic latent variable methods, dynamic factor models, subspace identification methods, dynamic feature extractions, and their uses for prediction and process monitoring. Both unsupervised dynamic latent variable analytics and the supervised counterparts are reviewed. Illustrative examples are presented to show the similarities and differences among the analytics in extracting features for prediction and monitoring.

Concurrent Monitoring Strategy for Static and Dynamic Deviations Based on Selective Ensemble Learning Using Slow Feature Analysis

Article

Feb 2020

Slow feature analysis (SFA) has been extensively adopted for process monitoring. Since the prominent ability of exploring dynamic information of industrial process, SFA could monitor the process static and dynamic deviations concurrently. However, for the complex and large-scale process, a single SFA model is hard to monitor the whole process well because of the complex relationship within massive volumes of variables. To address this issue and get a better monitoring performance, a novel ensemble process monitoring method based on slow feature analysis models is proposed as ensemble SFA (ESFA) in this paper. The proposed method develops a set of SFA models based on different combinations of variables and divisive hierarchical clustering algorithm (DHCA) is performed to pick out some models with the great diversity as the base learners. Then the fault detection results of base models would be combined into a comprehensive indicator through Bayesian inference. Furthermore, ESFA method also provides an statistic for monitoring process dynamics in order to differentiate the deviations of normal operating condition changes from dynamic anomalies incurred by real faults. Finally, compared with basic SFA and several PCA-based methods, the proposed method demonstrates the validity through the case studies of TE benchmark process and BSM1 process.

A Review of Industrial MIMO Decoupling Control

Article

Apr 2019

In recent decades, MIMO (Multi-Input-Multi-Output) systems become more and more widely used in industrial applications. A variety of decoupling control algorithms have been studied in the literature. Therefore, a review of the most extensively applied coupling interaction analysis and decoupler design methods for industrial processes is necessary to be carried out. In this paper, in order to benefit researchers and engineers with different academic backgrounds, the scattered coupling interaction analysis and decoupling algorithms are collected and divided into different categories with their characteristics, application domains and informative comments for selection. Moveover, some frequently concerned problems of decoupling control are also discussed.

Model-based Fault Diagnosis Techniques: Design Schemes, Algorithms, and Tools

Book

Jan 2008

Steven X. Ding

A most critical and important issue surrounding the design of automatic control systems with the successively increasing complexity is guaranteeing a high system performance over a wide operating range and meeting the requirements on system reliability and dependability. As one of the key technologies for the problem solutions, advanced fault detection and identification (FDI) technology is receiving considerable attention. The objective of this book is to introduce basic model-based FDI schemes, advanced analysis and design algorithms and the needed mathematical and control theory tools at a level for graduate students and researchers as well as for engineers.

Sensor Fault Detection via Robust Observers

Chapter

Jan 1987

This paper describes the design and the application of so called “Unknown Input Observers” or “Robust Observers” for failure detection. The fist part of the paper shows a design procedure for a discrete time observer which considers the concept of the “invariant subspace” and of the “almost invariant subspace”. Design tool is the Kronecker canonical form which also gives a complete answer to the question of state estimation of discrete time systems acting under unknown inputs. The resulting observer is described by equations that allow the estimation of a linear combination of the state with a delay of a finite number of samplings. This robust state estimation though hardly usable for control purposes can successfully be considered for failure detection. The second part of the paper shows as an example the application to instrument failure detection (IFD). A new IFD scheme is presented, and the applicability is demonstrated with the aid of simulation examples.

From the early achievement to the revival of experiment design

Article

Jan 2005
EUR J CONTROL

Michel Gevers

Fault isolation based on transfer-function models using an MPC algorithm

Abstract and Figures

Recommended publications

Automatic determination of optimal fault detection filter

On the optimality of Kalman Filter for Fault Detection

Identification based fault detection: Residual selection and optimal filter

A critical look at deep neural network for dynamic system modeling