Conference PaperPDF Available

Stochastic predictive control of autonomous vehicles in uncertain environments

September 2014

September 2014

Conference: 12th International Symposium on Advanced Vehicle Control

Authors:

Ashwin Carvalho

University of California, Berkeley

Yiqi Gao

University of California, Berkeley

Stephanie Lefevre

Voyage

Autonomous vehicles operating in dynamic urban environments must account for the uncertainty arising from the behavior of other objects in the environment. For this purpose, we develop an integrated environment modeling and stochastic Model Predictive Control (MPC) framework. The trade–off between risk and conservativeness is managed by a risk factor which is a parameter in the control design process. The environment model consists of an Interacting Multiple Model Kalman Filter to estimate and predict the positions of target vehicles. The uncertain predictions are used to formulate a chance–constrained MPC problem. The overall goal is to develop a framework for safe autonomous navigation in the presence of uncertainty and study the effect of the risk parameter on controller performance. Simulations of an autonomous vehicle driving in the presence of moving vehicles show the effectiveness of the proposed framework.

Block diagram of overall system.

…

Predicted trajectories for the target vehicle lateral motion models. The models represent typical maneuvers such as lane keeping, left lane change and right lane change. Variations within a maneuver come from different tunings of the gains K 1 and K 2 .

…

Vehicle model notation

…

Signed distance between convex shapes A and B

…

Figures - uploaded by Stephanie Lefevre

Content may be subject to copyright.

Content uploaded by Stephanie Lefevre

Content may be subject to copyright.

AVEC ’14

Stochastic Predictive Control of Autonomous Vehicles in

Uncertain Environments

Ashwin Carvalhoa, Yiqi Gaoa, St´

ephanie Lef`

evrea, Francesco Borrellia

aUniversity of California at Berkeley, USA

Berkeley, CA 94720, USA

Phone: +1-510-643-2044

E-mail: ashwinc@berkeley.edu

Autonomous vehicles operating in dynamic urban environments must account for the uncertainty

arising from the behavior of other objects in the environment. For this purpose, we develop an

integrated environment modeling and stochastic Model Predictive Control (MPC) framework. The

trade–off between risk and conservativeness is managed by a risk factor which is a parameter in the

control design process. The environment model consists of an Interacting Multiple Model Kalman

Filter to estimate and predict the positions of target vehicles. The uncertain predictions are used to

formulate a chance–constrained MPC problem. The overall goal is to develop a framework for safe

autonomous navigation in the presence of uncertainty and study the effect of the risk parameter on

controller performance. Simulations of an autonomous vehicle driving in the presence of moving

vehicles show the effectiveness of the proposed framework.

Topics / Autonomous driving and collision avoidance, Environment modeling

1. INTRODUCTION

Recent advances in sensing, computing and control

technologies make autonomous passenger vehicles a

viable solution to minimize the number of road acci-

dents and resulting fatalities. Autonomous vehicles must

navigate in dynamic urban environments in the presence

of other static or moving objects such as vehicles,

bicycles and pedestrians. One of the main challenges

is to systematically account for the uncertainties due to

other objects in the control design process.

Nominal control approaches that do not account for

system uncertainties may result in unsafe performance.

On the other hand, robust control approaches that deal

with worst–case disturbances may be too conservative

and computationally expensive. We propose a stochastic

MPC framework for controlling autonomous vehicles

where the risk of constraint violations is a tunable design

parameter. MPC has been shown to be an effective strat-

egy for the automotive control problem as it can handle

system constraints, nonlinearities and uncertainties in an

uniﬁed manner [1]–[3].

Chance–constrained optimization has been studied in

the context of autonomous vehicles. The work in [4]

addresses the tactical planning problem for autonomous

vehicles in uncertain environments. The relative motion

between controlled and uncontrolled vehicles is modeled

as a jump Markov linear system. Sampling is used to ap-

proximate the multimodal distribution of the future sys-

tem state, and the collision avoidance chance–constraints

are approximated by convex bounding functions.

Stochastic MPC was applied to a driver assistance

system in [1] using the approach presented in [5] for

linear time–invariant (LTI) systems with stochastic dis-

turbances. The time–invariance allows the uncertainty

propagation to be performed ofﬂine for disturbances

with any known distribution. However, vehicle models

are nonlinear in general due to the complex tire–road

interaction. Hence, we propose modeling the vehicle

dynamics as a linear time–varying (LTV) system where

the linearization is performed online. Disturbances in the

system are modeled as Gaussian to allow for the easy

propagation of uncertainties and handling of probabilis-

tic constraints.

In addition to a vehicle dynamics model, a predictive

controller requires a model of the dynamic environment

in which the vehicle is operating. This is used to

incorporate predictions of the behavior of other objects

in the decision making process. Previous work by the

authors assumes that the future positions of all objects in

the environment are known over the prediction horizon

[1]–[3]. We aim to relax that assumption by developing

an environment modeling framework to forecast the

positions of nearby objects. In this paper, we focus on

the problem of predicting the positions of surrounding

vehicles (denoted as target vehicles).

The work in [6] classiﬁes trafﬁc motion prediction

approaches into three categories, (1) physics–based, (2)

maneuver–based, and (3) interaction–aware. Maneuver–

based and interaction–aware models generally involve a

higher degree of abstraction, and are suited for relatively

long–term predictions. This is accompanied by higher

computational costs. Since MPC applies inputs in a re-

ceding horizon manner, we choose the class of physics–

based approaches. Speciﬁcally, an Interacting Multiple

Model Kalman Filter (IMM–KF) is used to estimate and

predict the positions of target vehicles. The IMM–KF is

a hybrid system state estimator popular in target tracking

literature [7]. A multiple model algorithm such as the

IMM–KF is a natural choice for our application due to

the multimodal behavior of target vehicles.

AVEC ’14

Fig. 1. Block diagram of overall system.

In summary, the overall goals of this work are to

(1) develop an integrated environment modeling and

probabilistic control framework, and (2) study the trade–

off between conservatism and risk for autonomous ve-

hicles through a tunable risk parameter in the chance–

constrained MPC. Results from simulations in urban

driving scenarios are presented to demonstrate the ef-

fectiveness of the proposed framework.

The paper is organized as follows. Section 2 describes

the overall architecture of the system. In Section 3, we

introduce the environment model to make predictions

of target vehicle positions. Section 6 details the for-

mulation of the chance–constrained MPC problem and

the stochastic MPC framework. We present simulation

results in Section 7 followed by concluding remarks in

Section 8.

2. SYSTEM DESCRIPTION

The proposed system architecture is shown in Figure

1. The “Sensors” block is assumed to provide processed

information about the vehicle states, road geometry, and

relative positions and velocities of target vehicles in

real–time. This information is used by the “Environment

Model” block to make probabilistic predictions of the

positions of target vehicles in the near future. The

“MPC” block uses the sensory information along with

the forecast evolution of the environment to formulate a

chance–constrained receding horizon control problem to

optimize the steering and braking inputs to the vehicle.

In this work, we focus on developing the “Environment

Model” (Section 3) and “MPC” (Section 6) blocks.

3. ENVIRONMENT MODEL

3.1 Target Vehicle Models

The dynamics of the jth target vehicle are described

by a Markov jump afﬁne system,

ξe

jk+1 =F(i)

kξe

jk+G(i)

kw(i)

k+E(i)

k(1a)

jk=H(i)

kξe

jk+v(i)

k,(1b)

where ξe

jkand ye

jkdenote the target vehicle state and

measurement, respectively, at time k. The process noise

w(i)

kand measurement noise v(i)

kare assumed to be

i.i.d. as N(0, Q(i)

k)and N(0, R(i)

k), respectively. The

superscript (i)in (1) refers to the model m(i)in the

model set M={m(1), m(2) , . . . , m(M)}, and transitions

between modes have ﬁxed probabilities given by a

matrix π∈RM×M, such that

πij =P(mk+1 =m(j)|mk=m(i)),(2)

where mkdenotes the mode at time k. For the predic-

tion problem, we decouple the longitudinal and lateral

motion of the target vehicle. The models used for each

are described below. The subscript jand superscript e

are dropped for visual clarity.

3.1.1 Longitudinal motion

We choose ξ= [s, ˙s, ¨s]Tand y= [s, ˙s]Tfor the

longitudinal state and measurement, respectively. Two

modes are used to model the longitudinal motion,

1) Constant velocity:

F(1)

k=



1Ts

0 1 Ts

0 0 0



G(1)

k=





,(3)

where Tsis the sampling time.

2) Constant acceleration:

F(2)

k=



1Ts

0 1 Ts

0 0 1



G(2)

k=





.(4)

3.1.2 Lateral motion

The target vehicle lateral state and measurement are

chosen to be ξ= [ey,˙ey]Tand y=ey, respectively.

We use state–feedback models to represent the lateral

dynamics in typical lane keeping and lane changing

maneuvers. The lateral acceleration of the target vehicle

with respect to the lane is assumed to evolve as

¨eyk≈˙eyk+1 −˙eyk

=−K2(eyk−ey,ref )−K1˙eyk.

(5)

Thus, eyis assumed to have a second–order response

where [K1, K2]affect the settling time and overshoot,

and ey→ey,ref . The system matrices are given by

F(i)

k=1Ts

−TsK(i)

21−TsK(i)

1(6a)

G(i)

k=T2

s/2

TsE(i)

k=0

TsK(i)

2e(i)

y,ref ,(6b)

where the superscript (i)denotes the mode. We use

a set of nine models (i.e. M= 9) corresponding to

three different tunings of K1and K2for each of the

lane keeping, lane change left and lane change right

maneuvers. Figure 2 shows the trajectories for the 9

modes with initial condition ξ0= [2,0]T, and a lane

width of 3m.

Remark 1: The parameters K1and K2in (6) are es-

timated heuristically in this paper. Tools from statistical

learning theory may be used to systematically estimate

parameters using data.

3.2 IMM–KF Algorithm

At each time step k, we estimate the following quan-

tities based on the current measurement and values at

the previous time step:

1) Model probability:

µ(i)

k=P(mk=i|y0, . . . , yk)

(i= 1, . . . , M )

AVEC ’14

0 2 4 6 8 10

−5

Lef t La ne

Center Lane

Right Lane

t[s]

ey[m]

Lane B oundar ies

Fig. 2. Predicted trajectories for the target vehicle lateral motion

models. The models represent typical maneuvers such as lane keeping,

left lane change and right lane change. Variations within a maneuver

come from different tunings of the gains K1and K2.

2) Model–conditioned posterior means and covari-

ances (i.e. assuming the mode is time–invariant and

equal to i):

ξ(i)

k|k=E[ξ|y0, . . . , yk, mk−1=i, mk=i]

Σ(i)

k|k=Cov[ξ|y0, . . . , yk, mk−1=i, mk=i]

(i= 1, . . . , M )

3) Fused posterior mean and covariance:

ξk|k=E[ξk|y0, . . . , yk]

Σk|k=Cov[ξk|y0, . . . , yk]

The algorithm to compute the above quantities is sum-

marized in [8], and stated below:

Step 1: Evaluate mixing probabilities:

µj|i

k−1=πij µ(j)

k−1

µ(i)

k|k−1

(7)

where µ(i)

k|k−1=

j=1

πji µ(j)

k−1

Mixing estimates:

ξ(i)

k−1|k−1=

j=1

ξ(j)

k−1|k−1µj|i

k−1(8)

Mixing covariances:

Σ(i)

k−1|k−1=

j=1

[Σ(j)

k−1|k−1+ (¯

ξ(i)

k−1|k−1−ˆ

ξ(i)

k−1|k−1)·

(9)

(¯

ξ(i)

k−1|k−1−ˆ

ξ(i)

k−1|k−1)T]µj|i

k−1

Step 2: For each mode M(i), run Kalman ﬁlter with

inputs (¯

ξ(i)

k−1|k−1,¯

Σ(i)

k−1|k−1)using Algorithm 1. We

obtain the following model–conditioned estimates,

(ˆ

ξ(i)

k|k,ˆ

Σ(i)

k|k,˜y(i)

k, S(i)

Step 3: Update the probability of each model,

µ(i)

k=µ(i)

k|k−1N(˜y(i)

k; 0, S(i)

j=1

µ(j)

k|k−1N(˜y(j)

k; 0, S(j)

(10)

where N(x;µ, Σ) is the probability density function

of a multivariate normal distribution with mean µand

covariance Σ, evaluated at x.

Step 4: Compute the fused estimates,

ξk|k=

j=1

ξ(i)

k|kµ(i)

k(11)

Σk|k=

i=1

[Σ(i)

k|k+ (ˆ

ξk|k−ˆ

ξ(i)

k|k)(ˆ

ξk|k−ˆ

ξ(i)

k|k)T]µ(i)

(12)

Algorithm 1 Kalman ﬁlter equations for mode (i)

Inputs: (¯

ξ(i)

k−1|k−1,¯

Σ(i)

k−1|k−1)

Prediction step:

ξ(i)

k|k−1=F(i)

k−1¯

ξ(i)

k−1|k−1+E(i)

k−1

Σ(i)

k|k−1=F(i)

k−1¯

Σ(i)

k−1|k−1F(i)T

k−1+G(i)

k−1Q(i)

k−1G(i)T

k−1

Update step:

˜y(i)

k=yk−H(i)

kˆ

ξ(i)

k|k−1

S(i)

k=H(i)

kˆ

Σ(i)

k|k−1H(i)T

k+R(i)

K(i)

k=ˆ

Σ(i)

k|k−1H(i)T

kS(i)−1

ξ(i)

k|k=ˆ

ξ(i)

k|k−1+K(i)

k˜y(i)

Σ(i)

k|k=ˆ

Σ(i)

k|k−1−K(i)

kS(i)

kK(i)T

Outputs: (ˆ

ξ(i)

k|k,ˆ

Σ(i)

k|k,˜y(i)

k, S(i)

3.3 Target State Prediction

Recall that one of the main goals of the environment

model is to forecast the state of the target vehicle over

the prediction horizon of the controller. At time k, we

ﬁrst estimate the most likely mode mkas the mode (i)

with the highest probability µ(i)

k. Given the mode, the

state is assumed to evolve according to the dynamics

(1a). Note that the longitudinal and lateral state pre-

dictions are performed separately and later combined

for the control design. The predicted positions of target

vehicles are used to formulate constraints on the ego

vehicle state. This is described later in Section 5.1.

4. VEHICLE MODEL

Starting from a nonlinear model in Section 4.1, we

derive a linear parameter–varying (LPV) model of the

vehicle dynamics in Section 4.2.

4.1 Nonlinear Bicycle Model

The notation used in the vehicle model is depicted

in Figure 3. The following set of nonlinear differential

equations are used to describe the motion of the vehicle

in the road–aligned coordinate frame,

¨x= ˙y˙

ψ+ax(13a)

¨y=−˙x˙

ψ+1

m(2Fy,f + 2Fy,r )(13b)

ψ=1

(2aFy,f −2bFy,r )(13c)

˙eψ=˙

ψ−κ˙s(13d)

˙ey= ˙xsin(eψ) + ˙ycos(eψ)(13e)

AVEC ’14

Fig. 3. Vehicle model notation

˙s=1

1−κey

( ˙xcos(eψ)−˙ysin(eψ),(13f)

where mand Izdenote the vehicle mass and yaw inertia,

respectively, aand bdenote the distances from the

vehicle’s center of gravity to the front and rear axles,

respectively. ˙xand ˙ydenote the longitudinal and lateral

speeds in the body frame, respectively, ˙

ψdenotes the

yaw rate, and axdenotes the commanded longitudinal

acceleration. eyand eψdenote the lateral position error

and angular error with respect to the road centerline,

respectively, κdenotes the curvature of the road, and s

denotes the longitudinal position of the vehicle along

the road. The lateral force in the body frame Fy,?

(?∈ {f, r}) is given by,

Fy,? =−C?α?,(14)

where C?denotes the tire cornering stiffness, and α?

denotes the lateral tire slip angle which is computed

using small angle approximations as,

αf=−δf+˙y+a˙

˙x, αr=˙y−b˙

˙x,(15)

where δfis the commanded front steering angle. The

nonlinear bicycle model can be compactly written as,

ξ(t) = f(ξ(t), u(t), p(t)),(16)

where ξ= [ ˙x, ˙y, ˙

ψ, eψ, ey, s]T,u= [δf, ax]T, and p=κ

are the state, input and parameter vectors, respectively.

4.2 Linear Parameter-Varying (LPV) Model

For the control design, we linearize the bicycle model

described above around the current longitudinal speed

to obtain an LPV model of the vehicle dynamics. The

following assumptions are introduced,

Assumption 1: The angular error eψis small, hence

sin(eψ)≈eψ,cos(eψ)≈1.

Assumption 2: The lateral velocity ˙yand yaw rate ˙

are small, hence ˙y˙

ψ≈0,˙yeψ≈0.

Assumption 3: The road curvature κis assumed to be

known as a function of the longitudinal position salong

the road.

Under these assumptions, the system dynamics (13)

can be expressed as,

ξ(t) = Ac(ρ(t))ξ(t) + Bcu(t),(17)

Fig. 4. Signed distance between convex shapes Aand B

where ρ= [ ˙x, κ]Tis the parameter vector about which

the linearization is performed, and

Ac(ρ) =







0 0 0 0 0 0

0−2(Cf+Cr)

m˙x

−2(aCf−bCr)

m˙x0 0 0

0−2(aCf−bCr)

Iz˙x

−2(a2Cf+b2Cr)

Iz˙x0 0 0

−κ0 1 0 0 0

0 1 0 ˙x0 0

1 0 0 0 0 0







Bc=







0 1

2Cf

2aCf

Iz0

0 0







In the following sections, we drop the dependence of Ac

on ρfor visual clarity.

To account for uncertainties due to model mis-

match and linearization errors, we introduce an additive

stochastic disturbance in (17). The modiﬁed LPV model

can be written in discrete–time as,

ξk+1 =Akξk+Bkuk+Dkwk,(18)

where wk∼ N (0,Σw). Note that (18) is obtained by a

forward Euler discretization of (17).

Remark 2: The data from [3] is used to estimate the

disturbance covariance Σw

Remark 3: The LPV model was derived using a linear

tire model approximation. The MPC approach presented

in Section 6 can be directly applied to LTV models

which use nonlinear tire models for the linearization [2].

5. SAFETY CONSTRAINTS

5.1 Collision avoidance

We employ an approach similar to that in [9] for for-

mulating collision avoidance constraints. This is based

on the notion of signed distance in collision detection

literature, which is deﬁned below.

Deﬁnition 1: Let Aand Bbe convex shapes. The

signed distance sd(A,B)is deﬁned as the length of the

smallest translation that puts the two shapes Aand B

(1) in contact, if they are currently separated or (2) out

of contact, if they are currently in contact.

This is illustrated in Figure 4. Details about the

signed distance and its computation are presented in

[9]. Without loss of generality, we consider the case of

AVEC ’14

one moving target vehicle. Recall that the “Environment

Model” block provides us with an estimate of the

current and future positions of the target vehicle. At any

time instant t, let the predicted position of the target

vehicle at time t+kbe denoted as ¯

ξe

t+k,t, and that

of the ego vehicle be denoted as ¯

ξt+k,t. Let the target

vehicle be represented by a convex shape Tt+k,t , and the

ego vehicle by Et+k,t. Note that Tt+k,t and Et+k,t are

functions of ¯

ξe

t+k,t and ¯

ξt+k,t, respectively. The collision

avoidance constraint can be expressed as,

sd(Tt+k,t,Et+k ,t)≥0.(19)

The above constraint is nonlinear and non–convex in

general. We linearize (19) around the predicted ego

vehicle state ¯

ξt+k,t to obtain

dk+∇dk·∆ξk≥0,(20)

(dk=sd(Tt+k,t,Et+k ,t))

where ∆ξkis the deviation of the ego vehicle state from

ξt+k,t. (20) can be expressed in the standard form of a

linear inequality as

kξt+k,t ≤hk.(21)

Note that the RHS of (21) is uncertain due to the

uncertainty in the predicted target vehicle position. This

is dealt with by suitably increasing the dimensions of

Tt+k,t such that the target vehicle will be contained in

Tt+k,t with a high probability. Moreover, the LHS is also

uncertain due to the uncertainty in ξt+k,t arising from

the additive disturbance in (18). We account for this by

formulating (21) as a chance–constraint to be satisﬁed

with a speciﬁed probability p. That is,

P r(gT

kξt+k,t ≤hk)≥p, 0.5≤p≤1,(22)

where pdenotes the tunable risk parameter. Naturally,

a large value of pwould lead to conservative behavior,

possibly making the chance–constrained optimal control

problem (25) infeasible. On the other hand, a low value

of pmight lead to a higher risk of collision. The effect

of varying pof the driving performance is studied in

more detail in Section 7.

5.2 Lane boundaries

The requirement that the vehicle stays within the lane

can be expressed as,

elane

y,min ≤ey≤elane

y,max,(23)

where the bounds elane

y,min and elane

y,max account for the

width of the vehicle.

5.3 Input constraints

Physical limitations on the actuators impose bounds

on the control inputs and input rates,

umin ≤u(t)≤umax,˙umin ≤˙u(t)≤˙umax .(24)

6. CONTROL DESIGN

6.1 Chance–constrained MPC

The vehicle model and safety constraints described

above are used to formulate a chance–constrained re-

ceding horizon control problem. At each sampling time,

a constrained ﬁnite–time optimal control problem is

solved to determine a sequence of control inputs which

minimizes a given cost function. Only the ﬁrst input

of the sequence is applied to the system. The process

is repeated at the next sampling instant using new

measurements. The optimization problem to be solved

at each time step is given by,

min

Hp−1

k=0

E(kξt+k+1,t −ξref,k+1 k2

Q+kut+k,tk2

(25a)

s.t. ξt+k+1,t =Akξt+k,t +Bkut+k,t +Dkwk(25b)

P r(gT

k+1ξt+k+1,t ≤hk+1 )≥p(25c)

(k= 0, . . . , Hp−1)

Guut∈ U (25d)

ξt,t =ξ(t),(25e)

where tdenotes the current time instant, Hpdenotes the

prediction horizon and ξt+k,t denotes the predicted ego

vehicle state at time t+kobtained by applying the con-

trol sequence ut={ut,t, ut+1,t , . . . , ut+Hp−1,t}to the

vehicle model (25b) with initial condition ξt,t =ξ(t).

(25d) is a compact representation of the input constraints

in (24). The reference state sequence {ξref,k }Hp

k=1 in

(25a) encodes control objectives such as tracking the

lane centerline and maintaining a desired speed.

The optimization problem (25) cannot be solved di-

rectly because of the presence of the stochastic distur-

bance win (25b), and the chance–constraints (25c). The

original constraints must be tightened to account for the

state uncertainty at each time step in the horizon. This

is presented below. The variables ξt+k,t and ut+k,t are

denoted as ξkand uk, respectively, for visual clarity.

6.2 Closed–loop approach

We use the closed–loop paradigm introduced in [5] to

decompose the state and input as,

ξk=zk+ek(26a)

uk=Kkξk+ck,(26b)

where Kkis a stabilizing feedback gain for the pair

(Ak, Bk),zkdenotes the deterministic component of ξk,

and ekdenotes the stochastic component. ckrepresents

perturbations on a given feedback control law. Substi-

tuting for ξkand ukin (25b) gives,

zk+1 = Φkzk+Bkck(27a)

ek+1 = Φkek+Dkwk,(27b)

where Φk=Ak+BkKk. We introduce the following

assumptions,

Assumption 4: The system has perfect state feedback,

i.e., ξ0=z0. This implies e0= 0 with probability 1.

Assumption 5: The pair (Ak, Bk)is controllable.

Hence, there exists Kksuch that Φkis Hurwitz.

Remark 4: Kkis chosen to be the inﬁnite horizon

LQR gain for (Ak, Bk).

AVEC ’14

6.3 Uncertainty Propagation

The separation of the deterministic and stochastic

components of ξkin (26) allows us to determine the

distributions of ek(k= 1, . . . , Hp)knowing the dis-

tributions of e0and wk. Let ek∼ N (0,Σk). Then,

ek+1 ∼ N (0,Σk+1), where

Σk+1 = ΦkΣkΦT

k+DkΣwDT

k.(28)

The initial condition is Σ0= 0 by Assumption 4.

6.4 Constraint Tightening

Based on (26), the probabilistic constraint (25c) be-

comes

P r(gT

kzk+gT

kek≤hk)≥p. (29)

(k= 1, . . . , Hp)

This is satisﬁed if [5],

kzk≤hk−γk(30a)

P r(gT

kek≤γk) = p(30b)

Since we know that gT

kek∼ N (0, gT

kΣkgk), we can

explicitly compute γkfrom the quantile function of a

univariate normal distribution as

γk=q2gT

kΣkgkerf −1(2p−1) (31)

where erf −1(·)is the inverse error function.

6.5 Modiﬁed Optimization Problem

By replacing (25c) with the tightened constraints

(30a) and using the feedback law (27b), we obtain the

following optimization problem to be solved at each time

step,

min

Hp−1

k=0

kzk+1 −ξref,k+1 k2

Q+kKkzk+ckk2

(32a)

s.t. zk=Akzk+Bkck(32b)

kzk≤hk−γk(32c)

(k= 0, . . . , Hp−1)

Gu(Kz +c)∈ U (32d)

z0=ξ(t),(32e)

where,

K=blkdiag(K0, . . . , KHp−1)

z= [z0, . . . , zHp−1]T

c= [c0, . . . , cHp−1]T.

(32) is a quadratic program (QP) which can be solved

efﬁciently in real–time. Note that (32c) is implemented

as a soft constraint to prevent the QP from becoming

infeasible.

7. RESULTS

7.1 Simulation setup

Simulations are performed in MATLAB, and the

constrained optimization problem (32) is solved using

Gurobi [10]. The controller is connected in closed–

loop with a higher ﬁdelity four–wheel nonlinear model

(which uses a Pacejka tire model) to simulate model

mismatch.

7.2 Test scenario

The scenario consists of two target vehicles (denoted

as T1and T2) on a straight road. At the start of

the simulation, T1is in the right lane moving at a

speed of 6m/s. It performs a lane change maneuver

at approximately 3seconds, and moves into the left

lane. T2starts in the right lane at a speed of 8m/s,

and stays in the right lane for the entire duration of the

simulation. The ego vehicle (denoted as E) starts off in

the left lane, just behind T1and T2, at a speed of 10

m/s. It’s goal is to track the left lane, while maintaining a

speed of 15 m/s. T1and T2are assumed to move with a

constant velocity of 6and 8m/s, respectively, perturbed

by the sum of a low–frequency sinusoidal term and a

Gaussian disturbance. The lane change trajectory for T1

is generated using a sigmoid function.

7.3 Results

To illustrate the effectiveness of the controller, we

show snapshots of the simulation at various time instants

in Figure 5. In each sub–ﬁgure, the red boxes depict the

current positions of the ego vehicle Eand target vehicles

T1and T2, while the gray boxes depict the predicted

positions. The predicted positions of Eare obtained

from the MPC, and those of T1and T2are given by the

IMM–KF. The red dashed lines for each target depict

the ground truth positions. As seen in Figure 5(a), the

IMM–KF predicts a lane change maneuver for T1. This

results in the Eslowing down (Figure 5(b)). In Figure

5(c), Etries to plan a path around T1, but is prevented

from doing so by the presence of T2. Finally, in Figure

5(e), we see that Eis able to plan a path in between T1

and T2, and increase its speed.

In addition, we demonstrate the ability of the envi-

ronment model to predict the lane change intention of

T1by plotting the lateral mode probabilities in Figure

6. For visual clarity, the lateral modes are classiﬁed

into two categories, “Right Lane” and “Left Lane”. The

probabilities for each category are shown in Figure 6 for

part of the simulation. It is observed that as T1starts

moving towards the left, the probability of the “Left

Lane” mode increases while that of the “Right Lane”

mode decreases.

7.4 Risk vs. conservativeness

An important element of the stochastic MPC problem

is the risk parameter p, deﬁned as the probability of

violating the chance–constraints (22). The nominal MPC

problem where the disturbances are assumed to be zero

is given by p= 0.5. We would like to study the effect

of varying pon the performance of the controller.

For the scenario described in Section 7.2, the distance

traveled by the ego vehicle Ein 25 seconds is used as

AVEC ’14

20 30 40 50

−2

X[m]

Y[m]

(a) t= 2 s

40 45 50 55 60 65 70

−2

ET1

X[m]

Y[m]

(b) t= 5 s

80 85 90 95 100

−2

X[m]

Y[m]

110 115 120 125 130 135 140

−2

X[m]

Y[m]

(d) t= 15 s

145 150 155 160 165 170 175 180

−2

X[m]

Y[m]

(e) t= 20 s

170 180 190 200 210

−2

X[m]

Y[m]

(f) t= 23.5s

Fig. 5. Snapshots of simulation with current and predicted positions of ego vehicle Eand target vehicles T1and T2. The red boxes

depict the current positions, while the gray boxes depict the future positions. For each target vehicle, the ground truth is shown by

the dashed red line.

0 5 10 15 20 25 30

0.2

0.4

0.6

0.8

t[s]

M ode P r obabi lity

Right Lane

Lef t Lan e

Fig. 6. Mode probabilities for target vehicle T1which changes lanes

in front of the ego vehicle

the comparison metric. It is seen that Etravels about

214 m with p= 0.998, and about 220 m with p= 0.5.

Although the difference is small, it must be noted that

the scenario presented is simple. For more complex

scenarios encountered in day–to–day commutes, the

difference in travel distances (and hence, travel times)

can be much larger.

Another simple scenario gives insight on the effect

of varying pon the conservatism of the controller. The

scenario consists of a slow target vehicle T1moving in

the left lane at a speed of 5m/s, and a faster vehicle T2

moving in the right lane at a constant speed of 9m/s.

The ego vehicle Estarts in the left lane with a speed of

10 m/s.

The initial positions of E,T1and T2are chosen

such that the nominal MPC with no constraint tightening

(p= 0.5) is able to plan a path around T1while avoiding

a collision with T2, as shown in Figure 7(a). On the

other hand, the chance–constrained MPC with p= 0.998

results in the ego vehicle slowing down signiﬁcantly to

wait for T2to pass before going around T1. A snapshot

of the simulation with the chance–constrained MPC is

shown in Figure 7(b). The variation in the distance

traveled by the ego vehicle with pis more signiﬁcant

in this case. For the nominal MPC with p= 0.5, the

ego vehicle travels about 278 m in 20 seconds, while in

AVEC ’14

25 30 35 40 45 50 55

−2

X[m]

Y[m]

(a) p= 0.5(Nominal MPC)

25 30 35 40 45 50 55

−2

X[m]

Y[m]

(b) p= 0.998 (Stochastic MPC)

Fig. 7. Simulation showing the effect of varying the risk

parameter pon the conservatism of the controller. The notation

is the same as in Figure 5

case of the chance–constrained MPC with p= 0.998, it

covers only 193 m.

In the context of urban driving, scenarios like the

one described above are expected to occur often. The

use of conservative worst–case approaches may result

in the ego vehicle stopping or slowing down unneces-

sarily, leading to longer commute times. The framework

proposed in this work is expected to be less restrictive,

while retaining the ability to systematically account for

system uncertainties.

8. CONCLUSIONS

In this paper, we presented the integration of an envi-

ronment model with a stochastic predictive control ap-

proach for autonomous vehicles. A multimodal approach

is used to estimate and predict the state of surrounding

vehicles. The predictions are embedded in a chance–

constrained MPC which optimizes control inputs to

achieve certain objectives while avoiding collisions with

other vehicles. The ability of the ego vehicle to safely

traverse a typical urban driving scenario is demonstrated

through simulation. In addition, we studied the effect of

the tunable risk parameter in the stochastic MPC on the

performance of the autonomous vehicle. The proposed

framework allows us to vary the conservatism of the

controller as a function of risk. Future work aims at

exploring this relationship in greater detail.

9. ACKNOWLEDGMENTS

This material is based upon work supported by the

National Science Foundation under Grant No. 1239323.

Any opinions, ﬁndings, and conclusions or recommen-

dations expressed in this material are those of the authors

and do not necessarily reﬂect the views of the National

Science Foundation.

REFERENCES

[1] A. Gray, Y. Gao, J. K. Hedrick, and F. Borrelli,

“Stochastic predictive control for semi-autonomous

vehicles with an uncertain driver model,” in Intel-

ligent Transportation Systems (ITS), IEEE Confer-

ence on, 2013.

[2] A. Carvalho, Y. Gao, A. Gray, H. E. Tseng, and

F. Borrelli, “Predictive control of an autonomous

ground vehicle using an iterative linearization

approach,” in Intelligent Transportation Systems

(ITS), IEEE Conference on, 2013.

[3] Y. Gao, A. Gray, H. E. Tseng, and F. Borrelli,

“A tube-based robust nonlinear predictive control

approach to semiautonomous ground vehicles,” Ve-

hicle System Dynamics, 2014.

[4] M. P. Vitus and C. J. Tomlin, “A probabilistic

approach to planning and control in autonomous

urban driving,” in Proceedings of the 52nd IEEE

Conference on Decision and Control, Florence,

Italy, December 2013.

[5] B. Kouvaritakis, M. Cannon, S. V. Rakovi´

c, and

Q. Cheng, “Explicit use of probabilistic distri-

butions in linear predictive control,” Automatica,

vol. 46, no. 10, pp. 1719–1724, 2010.

[6] S. Lef`

evre, D. Vasquez, and C. Laugier, “A sur-

vey on motion prediction and risk assessment for

intelligent vehicles,” Robomech Journal, to appear.

[7] E. Mazor, A. Averbuch, Y. Bar-Shalom, and

J. Dayan, “Interacting multiple model methods in

target tracking: a survey,” Aerospace and Elec-

tronic Systems, IEEE Transactions on, vol. 34,

no. 1, pp. 103–123, 1998.

[8] R. R. Pitre, V. P. Jilkov, and X. R. Li, “A compar-

ative study of multiple-model algorithms for ma-

neuvering target tracking,” in Defense and Security.

International Society for Optics and Photonics,

2005, pp. 549–560.

[9] J. Schulman, J. Ho, A. Lee, I. Awwal, H. Bradlow,

and P. Abbeel, “Finding locally optimal, collision-

free trajectories with sequential convex optimiza-

tion,” in Robotics: Science and Systems (RSS).

Citeseer, 2013.

[10] Gurobi Optimizer Reference Manual, Gurobi

Optimization, Inc., 2014. [Online]. Available:

http://www.gurobi.com

Safe and Non-Conservative Trajectory Planning for Autonomous Driving Handling Unanticipated Behaviors of Traffic Participants

Preprint

Full-text available

Jun 2024

Trajectory planning for autonomous driving is challenging because the unknown future motion of traffic participants must be accounted for, yielding large uncertainty. Stochastic Model Predictive Control (SMPC)-based planners provide non-conservative planning, but do not rule out a (small) probability of collision. We propose a control scheme that yields an efficient trajectory based on SMPC when the traffic scenario allows, still avoiding that the vehicle causes collisions with traffic participants if the latter move according to the prediction assumptions. If some traffic participant does not behave as anticipated, no safety guarantee can be given. Then, our approach yields a trajectory which minimizes the probability of collision, using Constraint Violation Probability Minimization techniques. Our algorithm can also be adapted to minimize the anticipated harm caused by a collision. We provide a thorough discussion of the benefits of our novel control scheme and compare it to a previous approach through numerical simulations from the CommonRoad database.

Adaptive Splitting of Reusable Temporal Monitors for Rare Traffic Violations

Preprint

Mar 2024

Autonomous Vehicles (AVs) are often tested in simulation to estimate the probability they will violate safety specifications. Two common issues arise when using existing techniques to produce this estimation: If violations occur rarely, simple Monte-Carlo sampling techniques can fail to produce efficient estimates; if simulation horizons are too long, importance sampling techniques (which learn proposal distributions from past simulations) can fail to converge. This paper addresses both issues by interleaving rare-event sampling techniques with online specification monitoring algorithms. We use adaptive multi-level splitting to decompose simulations into partial trajectories, then calculate the distance of those partial trajectories to failure by leveraging robustness metrics from Signal Temporal Logic (STL). By caching those partial robustness metric values, we can efficiently re-use computations across multiple sampling stages. Our experiments on an interstate lane-change scenario show our method is viable for testing simulated AV-pipelines, efficiently estimating failure probabilities for STL specifications based on real traffic rules. We produce better estimates than Monte-Carlo and importance sampling in fewer simulations.

A Condition-Aware Stochastic Dynamic Control Strategy for Safe Automated Driving

Article

Full-text available

Jun 2024

Condition-awareness regarding electrical and electronic components is not only significant for predictive maintenance of automotive vehicles but also plays a crucial role in ensuring the operational safety by supporting the detection of anomalies, faults, and degradations over lifetime. In this paper, we present a novel control strategy that combines stochastic dynamic control method with condition-awareness for safe automated driving. In particular, the effectiveness of condition-awareness is supported by two distinct condition-monitoring functions. The first function involves the monitoring of a vehicle's internal health condition using model-based approaches. The second function involves the monitoring of a vehicle's external surrounding conditions, using machine learning and artificial intelligence approaches. For the quantification of current conditions, the results from these monitoring functions are used to create system health indices, which are then utilized by a safety control function for dynamic behavior regulation. The design of this safety control function is based on a chance-constrained model predictive control model, combined with a control barrier function for ensuring safe operation. The novelty of the proposed method lies in a systematic integration of monitored external and internal conditions, estimated component degradation, and remaining useful life, with the controller's dynamic responsiveness. The efficacy of the proposed strategy is evaluated with adaptive cruise control in the presence of various sensory uncertainties.

A Gossip Learning Approach to Urban Trajectory Nowcasting for Anticipatory RAN Management

Article

Full-text available

Jan 2023

In future radio access networks, machine learning (ML) based strategies for short-term forecasting of vehicular trajectories will be key for anticipatory resource allocation and management at the mobile edge. However, training ML models in a centralized fashion, over data collected from a massive heterogeneous and dynamic set of devices, poses significant scalability, reliability, and efficiency challenges, which are still open to date. In this paper, we look at the specific issue of scalable and resource-efficient training of ML models in a vehicular environment. To address such a challenge, we propose a new Gossip Learning scheme, i.e., a fully distributed, collaborative training approach based on direct, opportunistic model exchanges via wireless device-to-device (D2D) communications with no centralized support. Our approach is based on constantly improving each node's own model instance through knowledge transfer among nodes, and on different strategies for estimating the potential contribution of neighboring nodes to the training process at a node. Extensive numerical assessments on a variety of measurement-based dynamic urban scenarios suggest that our schemes are able to converge rapidly and provide sufficiently accurate forecasts of vehicle position for time horizons which are typical of future 5G/6G dynamic resource allocation algorithms.

Improved Attention Mechanism for Human-like Intelligent Vehicle Trajectory Prediction

Article

Full-text available

Sep 2023

In order to overcome the low long-term predictive accuracy associated with mainstream prediction models and the limited consideration of driver characteristics, this study presents an enhanced attention mechanism for human-like trajectory prediction, which is based on Long Short-Term Memory (LSTM). A novel database structure is proposed that incorporates data about driving style and driving intent, pertaining to human factors. By utilizing the convolution computation of Convolutional Social-Long Short-Term Memory (CS-LSTM) for surrounding vehicles, spatial feature extraction around the target vehicle is achieved. Simultaneously, we introduce a dynamic driving style recognition model and a human-like driving intent recognition model to fulfill the output of the human-like module. From a temporal perspective, we employ a decoder attention mechanism to reinforce the emphasis on key historical information, while refining the attention mechanism based on driving style for human-like weight assignment. Comparative analysis with other models indicates that the proposed Driving Style-based Attention-enhanced Convolutional Social-Long Short-Term Memory (DACS-LSTM) model exhibits notable advantages in predicting human-like trajectories for long-term tasks. Visualizing the predicted trajectories of both the Attention-enhanced Convolutional Social-Long Short-Term Memory (ACS-LSTM) and our proposed model, and analyzing the impact of the human-like module on the predicted trajectory, shows that our model’s predicted trajectory aligns more closely with the actual one. By comparing the weight distribution of the conventional attention mechanism and the enhanced attention mechanism proposed here, and analyzing the trajectory changes in conjunction with the driving styles, it becomes evident that our proposed model offers a marked improvement.

VSFL: Trajectory Prediction Framework Based on Validity-aware Semi-asynchronous Federated Learning in Internet of Vehicles

Article

Jun 2024
COMPUT COMMUN

Identifying Reaction-Aware Driving Styles of Stochastic Model Predictive Controlled Vehicles by Inverse Reinforcement Learning

Conference Paper

Sep 2023

A Novel Stochastic Model Predictive Control Considering Predictable Disturbance With Application to Personalized Adaptive Cruise Control

Article

Feb 2024

A novel stochastic model predictive control (SMPC) scheme is proposed for automotive scenes based on high-performance and practical motion state prediction method. The significant properties of the proposed scheme are that: 1) it can accurately predict disturbances within the prediction horizon, and 2) the prediction results can be considered into the optimizing process to obtain a more efficient and accurate controller. As a result, the proposed adaptive cruise control (ACC) system can ensure driving safety and improve tracking accuracy and comfort performance while satisfying different driving styles. In detail, a large amount of naturalistic driving data is collected based on a real vehicle test platform at first. Then an adaptive optimization Gaussian process regression (AOGPR) is developed and trained with real measurements to predict the motion states of the preceding vehicle. The prediction module is embedded in SMPC to bind the collision conditions, tighten the states and finally construct a novel controller, i.e., AOGPR-SMPC controller. A bidirectional LSTM (BiLSTM) network is trained and tested for online recognizing driving styles to satisfy personalized car-following needs. The simulation and field tests verify and evaluate the proposed controller. The results demonstrate that the ACC system could realize personalized car-following according to the driver’s driving style, and the proposed controller can obtain better tracking accuracy and comfort performance compared with the GPR-SMPC controller and MPC controller.

Stochastic Model Predictive Control for Relative-Velocity Based Multimodal Risk Avoidance under a Vehicle Coming Scenarios

Article

Jan 2023

Robust Vehicle Lane Keeping Control with Networked Proactive Adaptation

Article

Sep 2023
ARTIF INTELL

Predictive control of an autonomous ground vehicle using an iterative linearization approach

Conference Paper

Full-text available

Oct 2013

This paper presents the design of a controller for an autonomous ground vehicle. The goal is to track the lane centerline while avoiding collisions with obstacles. A nonlinear model predictive control (MPC) framework is used where the control inputs are the front steering angle and the braking torques at the four wheels. The focus of this work is on the development of a tailored algorithm for solving the nonlinear MPC problem. Hardware-in-the-loop simulations with the proposed algorithm show a reduction in the computational time as compared to general purpose nonlinear solvers. Experimental tests on a passenger vehicle at high speeds on low friction road surfaces show the effectiveness of the proposed algorithm.

A survey on motion prediction and risk assessment for intelligent vehicles

Article

Full-text available

Jul 2014

With the objective to improve road safety, the automotive industry is moving toward more “intelligent” vehicles. One of the major challenges is to detect dangerous situations and react accordingly in order to avoid or mitigate accidents. This requires predicting the likely evolution of the current traffic situation, and assessing how dangerous that future situation might be. This paper is a survey of existing methods for motion prediction and risk assessment for intelligent vehicles. The proposed classification is based on the semantics used to define motion and risk. We point out the tradeoff between model completeness and real-time constraints, and the fact that the choice of a risk assessment method is influenced by the selected motion model.

Interacting multiple model methods in target tracking: A survey

Article

Full-text available

Feb 1998

The Interacting Multiple Model (IMM) estimator is a suboptimal hybrid filter that has been shown to be one of the most cost-effective hybrid state estimation schemes. The main feature of this algorithm is its ability to estimate the state of a dynamic system with several behavior modes which can “switch” from one to another. In particular, the IMM estimator can be a self-adjusting variable-bandwidth filter, which makes it natural for tracking maneuvering targets. The importance of this approach is that it is the best compromise available currently-between complexity and performance: its computational requirements are nearly linear in the size of the problem (number of models) while its performance is almost the same as that of an algorithm with quadratic complexity. The objective of this work is to survey and put in perspective the existing IMM methods for target tracking problems. Special attention is given to the assumptions underlying each algorithm and its applicability to various situations

Finding Locally Optimal, Collision-Free Trajectories with Sequential Convex Optimization

Conference Paper

Jun 2013

A probabilistic approach to planning and control in autonomous urban driving

Conference Paper

Dec 2013

This paper considers the problem of decision making and control for autonomous urban vehicles operating among other non-cooperating, possibly human controlled, vehicles. The difficulty in this problem stems from the fact that the behavior of the other vehicles is uncertain, and in many circumstances a collision cannot be prevented even under restrictive assumptions about the other drivers' actions. Traditional approaches that consider the worst-case actions of the other vehicles typically are inapplicable because the solutions are either too conservative or infeasible. This work proposes a new method for solving this problem using a chance constrained framework, which requires that the violation probability of all the constraints is guaranteed to be below a threshold. This formulation has the benefit of not only achieving traditional objectives such as minimization of fuel or travel time, but also hedges the nominal planned maneuver against potential crashes leading from the uncertainty in the other drivers' actions. The proposed framework is demonstrated on a passing maneuver example which exhibits a region where the passing vehicle cannot be guaranteed to prevent a collision in all circumstances.

A tube-based robust nonlinear predictive control approach to semiautonomous ground vehicles

Article

Apr 2014

This paper proposes a robust control framework for lane-keeping and obstacle avoidance of semiautonomous ground vehicles. It presents a systematic way of enforcing robustness during the MPC design stage. A robust nonlinear model predictive controller (RNMPC) is used to help the driver navigating the vehicle in order to avoid obstacles and track the road centre line. A force-input nonlinear bicycle vehicle model is developed and used in the RNMPC control design. A robust invariant set is used in the RNMPC design to guarantee that state and input constraints are satisfied in the presence of disturbances and model error. Simulations and experiments on a vehicle show the effectiveness of the proposed framework.

Robust Predictive Control for semi-autonomous vehicles with an uncertain driver model

Conference Paper

Oct 2013

A robust control design is proposed for the lane-keeping and obstacle avoidance of semiautonomous ground vehicles. A robust Model Predictive Controller (MPC) is used in order to enforce safety constraints with minimal control intervention. An uncertain driver model is used to obtain sets of predicted vehicle trajectories in closed-loop with the predicted driver's behavior. The robust MPC computes the smallest corrective steering action needed to keep the driver safe for all predicted trajectories in the set. Simulations of a driver approaching multiple obstacles, with uncertainty obtained from measured data, show the effect of the proposed framework.

A comparative study of multiple-model algorithms for maneuvering target tracking

Article

May 2005
Proceedings of SPIE

Many multiple-model (MM) algorithms for tracking maneuvering targets are available, but there are few comparative studies of their performance. This work compares seven MM algorithms for maneuvering target tracking in terms of tracking performance and computational complexity. Six of them are well known and widely used. They are the autonomous multiple-model algorithm, generalized pseudo-Bayesian algorithm of first order (GPB1), and of second order (GPB2), interacting multiple-model (IMM) algorithm, B-best based MM algorithm, and Viterbi-based MM algorithm. Also considered is the reweighted interacting multiple-model algorithm, which was developed recently. The algorithms were compared using three scenarios. The first scenario consists of two segments of tangential acceleration while the second scenario consists of two segments of normal acceleration. Both of these scenarios have maneuvers that are represented by one of the models in the model set. The third scenario, however, has a single maneuver that consists of a tangential and a normal acceleration. This type of maneuver is not covered by the model set and is used to see how the algorithms react to a maneuver outside of the model set. Based on the study, there is no clear-cut best algorithm but the IMM algorithm has the best computational complexity among the algorithms that have acceptable tracking errors. It also showed a remarkable robustness to model mismatching, and appears to be the top choice if the computational cost is of concern.

Explicit use of probabilistic distributions in linear predictive control

Article

Oct 2010
AUTOMATICA

The guarantee of feasibility given feasibility at initial time is an issue that has been overlooked by many of the recent papers on stochastic model predictive control. Effective solutions have recently been proposed, but these carry considerable online computational load and a degree of conservativism. For the case that the elements of the random additive disturbance vector are independent, the current paper ensures that probabilistic constraints are met and that a quadratic stability condition is satisfied. A numerical example illustrates the efficacy of the proposed algorithm, which achieves tight satisfaction of constraints and thereby attains near-optimal performance.

Stochastic predictive control for semi-autonomous vehicles with an uncertain driver model

Jan 2013

A Gray
Y Gao
J K Hedrick
F Borrelli

A. Gray, Y. Gao, J. K. Hedrick, and F. Borrelli, "Stochastic predictive control for semi-autonomous vehicles with an uncertain driver model," in Intelligent Transportation Systems (ITS), IEEE Conference on, 2013.

Stochastic predictive control of autonomous vehicles in uncertain environments

Abstract and Figures

Recommended publications

Tracking for moving object using adaptive Rao-Blackwellized particle filter and fuzzy control system

Nonlinear optimal control for autonomous navigation of a truck and trailer system

Cooperative control of multiple surface vessels in the presence of ocean currents and parametric mod...

Predictive control for autonomous driving, with experimental evaluation on a heavy-duty construction...