Conference PaperPDF Available

Deriving Spatial Policies for Overtaking Maneuvers with Autonomous Vehicles

January 2022

January 2022

DOI:10.1109/COMSNETS53615.2022.9668548

Conference: 2022 14th International Conference on COMmunication Systems & NETworkS (COMSNETS)

Authors:

Jayanth Bhargav

Purdue University

Johannes Betz

Technische Universität München

Rahul Mangharam

University of Pennsylvania

Example racetrack for offline experiments: Budapest Circuit, Hungary Track portions are defined by T = {τ |1 ≤ τ ≤ 11, τ ∈ N} and are categorized in the four track segment types: • Straight: 11 • Sweeper Curve: 3, 7, 8 • Hairpin Curve: 1, 2, 4, 9, 10 • Chicane: 5, 6

…

Ego vehicle (red car) starting behind the opponent vehicle (blue car) on the track. Apart from the velocity, lateral and longitudinal positions of the ego vehicle are varied as shown in the figure. The opponent vehicle follows a pre-computed race line and is non-interactive. The overtaking maneuver is examined in the overtaking detection zone (marked in yellow).

…

Modes of SMPCC

…

Figures - uploaded by Jayanth Bhargav

Content may be subject to copyright.

Content uploaded by Jayanth Bhargav

Content may be subject to copyright.

Deriving Spatial Policies for Overtaking Maneuvers

with Autonomous Vehicles

Jayanth Bhargav

Electrical & Systems Engineering

University of Pennsylvania

Philadelphia, USA

jayanthb@seas.upenn.edu

Johannes Betz

Electrical & Systems Engineering

University of Pennsylvania

Philadelphia, USA

joebetz@seas.upenn.edu

Hongrui Zehng

Electrical & Systems Engineering

University of Pennsylvania

Philadelphia, USA

hongruiz@seas.upenn.edu

Rahul Mangharam

Electrical & Systems Engineering

University of Pennsylvania

Philadelphia, USA

rahulm@seas.upenn.edu

Abstract—Planning an accurate and safe trajectory is a crucial

element in autonomous driving. To execute complex driving ma-

neuvers like overtaking, motion planning requires an enhanced

decision-making algorithm that decides the when, where and how

of the overtaking maneuver. This paper proposes an algorithm

that increases the likelihood of a safe overtaking maneuver by

learning spatial information. Here, spatial information refers to

the track portion/curve and the position of the ego vehicle with

reference to that. The technique is applied to an autonomous

racing setup where vehicles have to detect and operate at the

limits of dynamic handling. To learn the spatial information,

ofﬂine experiments of a 2-player race are conducted to generate

probability distributions of overtaking maneuvers conditioned on

speed and relative-position of the ego vehicle with respect to the

opponent. Furthermore, a Switched Model Predictive Contouring

Controller (SMPCC) is proposed for incorporating the policy

learning algorithm into the path planning and control setup.

Extensive simulations show that the proposed algorithm is able

to achieve an increased number of overtakes at different track

portions on known and unknown race tracks.

Index Terms—autonomous systems, automobiles, intelligent

vehicles, optimal control, path planning

I. INTRODUCTION

A. Autonomous Racing

Autonomous racing has become popular over the recent

years and competitions like Roborace [1] or the Indy Au-

tonomous Challenge as well as small-scale competitions like

F1Tenth [2] provide platforms for evaluating autonomous

driving algorithms and software. The overall goal of all these

competitions is that researchers and engineers can develop

algorithms that operate vehicles at the edge: high speeds,

high accelerations, high computation power, adversarial en-

vironments. The algorithms that were developed in the ﬁeld

of autonomous racing so far are mostly focusing on single

vehicle only that try to achieve a human-like lap time. The

ﬁeld of high dynamic overtaking maneuver with dynamic

opponents are less displayed. In addition, achieving a human-

like behavior (e.g. like a Formula 1 race driver) that makes

the decision about an overtaking maneuver and executes a

secure and reliable maneuver at high speeds is still an unsolved

problem.

B. Contributions

In this paper, an approach to learn spatial information for

overtaking maneuvers in autonomous vehicles is presented.

This work has three primary contributions:

1) Design of Experiments (DoE) for ofﬂine policy learning.

2) An application of autonomous driving to learn effec-

tive overtaking maneuvers for autonomous race cars.

Discretization of selected f1 tracks into a category of

turns/curves and simulations of 2-player race to derive

overtaking policies for different track portions.

3) A Switched Model Predictive Contouring Controller

(SMPCC) setup based on [3], which combines a re-

ceding horizon control algorithm and speciﬁc driving

behaviours.

II. RELATED WORK

Dixit et al. [4] provide a state of the art review of trajectory

planning and control for autonomous overtaking maneuvers.

The authors state ﬁnally in their review, that two important

aspects of trajectory planning for high-speed overtaking need

to be addressed: (i) inclusion of vehicle dynamics and en-

vironmental constraints and (ii) accurate knowledge of the

environment and surrounding obstacles.

Although the state of the art displays a plethora of al-

gorithms for path and behavioral planning of autonomous

vehicles, explicit algorithm development for autonomous race

cars is relatively lesser. As part of the Roborace competition

[5], [6] [7] presented a planning and control system for real life

autonomous racing cars. Both approaches focused on a holistic

software architecture that is capable of dynamic overtaking.

Nevertheless none of them realized a head to head race with

the vehicles. As a part of the same competition, [8] presented

a nonlinear model predictive control (NMPC) for racing. The

overtaking strategy was implemented as a term in the objective

COMSNETS 2022: Intelligent Transportation Systems Workshop

Authorized licensed use limited to: University of Pennsylvania. Downloaded on January 17,2022 at 17:35:26 UTC from IEEE Xplore. Restrictions apply.

function. The NMPC has the freedom to choose the side

for an overtake and was mainly relying on the obstacles

velocity to perform the overtaking maneuver. In [9] a simple

Q-Learning algorithm is applied to learn the behavior of an

virtual opponent car to apply an effective overtaking strategy

on either the straight or right before tight bend.

In [10] a method to plan overtaking maneuvers in autonomous

racing based on gaussian processes is presented. This machine

learning method is able to learn the behavior of the opponent

vehicle. Based on the outputs of this process, a stochastic MPC

plans optimistic trajectories that lead to a controlled overtaking

maneuver of the lead vehicle.

In multi vehicle racing, [11] presented a non-cooperative game

theory approach where autonomous racing, formulated as rac-

ing decisions is a non-cooperative nonzero-sum game. Liniger

et al. [11] displayed that different games can be modelled that

achieve successfully different racing behaviors and generate

interesting racing situations e.g. blocking and overtaking.

Notomista et al. [12] considered a two-player racing game

where the ego vehicle is based on a Sensitivity-ENhanced

NAsh equilibrium seeking (SENNA) method, which uses an

iterated best response algorithm in order to optimize for a

trajectory in a two-car racing game. Jung et al. [13] present a

game-theoretic MPC approach for head-to-head autonomous

racing that consists of a (1) game-based opponents’ trajec-

tory predictor, (2) high-level race strategy planner, and (3)

MPC-based low-level controller. Based on the results of the

prediction, the high-level race strategy planner plans several

behaviors to respond to various race circumstances.

The state of the art displays that the autonomous rac-

ing community is focusing on integrating effective learning

techniques and strategies into dynamic path and behavioral

planning to make the car faster, more reliable and more in-

teractive [14] [15] [16]. Improvements in planning/control for

overtaking maneuvers have not yet been explored extensively

and learning from spatial information (track portions and

position of the vehicle on the track) has not been examined

before.

III. DESIGN OF EXPERIMENTS

We propose an ofﬂine experiment setup which will create

emphasis on speciﬁc track portions and examine the overtaking

maneuvers. With these ofﬂine experiments, it is possible to

create track-based policies that can be used in a high level

decision maker, behavior or motion planner.

A. Track Portions

Based on our racetrack application, in the ﬁrst step we

deﬁne the track portions that we will examine. We will use

four high level deﬁnitions of track portions that are the most

common kinds of curves/turns found on racetracks: Straight,

Sweeper Curve,Hairpin Curve and Chicane

For example, consider the racetrack in Budapast, Hungary.

In ﬁgure 1 we display 11 different track portions on the track

that are marked with labels 1 to 11.

Fig. 1. Example racetrack for ofﬂine experiments: Budapest Circuit, Hungary

Track portions are deﬁned by T={τ|1≤τ≤11, τ ∈N}

and are categorized in the four track segment types:

•Straight: 11

•Sweeper Curve: 3, 7, 8

•Hairpin Curve: 1, 2, 4, 9, 10

•Chicane: 5, 6

B. Sampling based trajectory rollouts

To examine these deﬁned track portions we setup an ofﬂine

simulation that varies different parameters visualized in ﬁgure

Lateral Variation

Longitudinal

Variation Opponent car

On optimal raceline

Overtaking

Detection Zone

Fig. 2. Ego vehicle (red car) starting behind the opponent vehicle (blue car)

on the track. Apart from the velocity, lateral and longitudinal positions of the

ego vehicle are varied as shown in the ﬁgure. The opponent vehicle follows

a pre-computed race line and is non-interactive. The overtaking maneuver is

examined in the overtaking detection zone (marked in yellow).

The opponent vehicle follows a curvature optimal pre-

computed race line based on [17] and is non-interactive.

For every track portion τ∈T, a uniformly sampled set

of positions P:XXY ⊂ R2are chosen as the starting

position for the ego. The obstacle vehicle speed is varied as

vobs =vbaseline ∗(1+ s)where s∈ {−0.2,0,+0.2},vbaseline

being the speed of the obstacle from the pre-computed optimal

race line.

The agents are initialised with these positions and set to

start the simulation. The ego vehicle synthesises dynamic tra-

jectories based on the MPCC planner with obstacle avoidance.

A fully observable model is used for the ego vehicle i.e. the

ego vehicle will have the information of the track portion τ

which it is driving in and the current state of the obstacle

Xobs = [xobs, yobs , ϕobs]

In this setup, we conduct simulations based on the following

parameter variations:

COMSNETS 2022: Intelligent Transportation Systems Workshop

860

Authorized licensed use limited to: University of Pennsylvania. Downloaded on January 17,2022 at 17:35:26 UTC from IEEE Xplore. Restrictions apply.

•Lateral offset: The position of the ego vehicle is varied

lateral across the track with an offset from the centerline.

•Longitudinal offset: The position of the ego vehicle is

varied longitudinal along the centerline of the track.

•Opponent speed change: The opponent speed is varied

with ±20% from baseline

With a high expectation, the ego vehicle will succeed in an

overtaking maneuver when the obstacle speed is 20% lower

than its baseline. This veriﬁes the fact that speed advantage

always helps in overtaking (e.g. DRS zones in F1). The next

set of parameters that inﬂuence the overtaking maneuver is

the position. In convoluted race tracks, we can display that

starting off at a speciﬁc position gives us a higher chance of

an overtaking maneuver. For each track portion, we deﬁne four

regions of interest: R1,R2,R3and R4. Starting positions of

the ego vehicle are uniformly sampled in all the four regions

to generate experimental data.

IV. PLANNING AND CONTROL SETUP

Continuous time system dynamics is used to develop a

constrained optimal controller to steer the vehicle in the

track. The optimal planner plans the path for a horizon of N

steps ahead, steers the vehicle with the ﬁrst step, and again

repeats the process for the speciﬁed amount of time. This is

a modiﬁed form of the Model Predictive Controller (MPC).

A. Model Predictive Contouring Control

The MPCC problem deﬁned in [3] is re-formulated into a

ﬁnite-continuous time optimal control problem as follows:

min ZT

0ϵlin

c(t)ϵlin

l(t)Qc 0

0Ql ϵlin

c(t)

ϵlin

l(t)

−Qθ˙

θ(t) + uT(t)Ru(t)dt

s.t. ˙x=f(x, u, Φ)

blower ⪯x(t)⪯bupper

llower ⪯u(t)⪯lupper

h(x, Φ) ≤0

given the system dynamics fand the arclength parametriza-

tion of the contour (the track) Φ. A single-track bicycle model

is used. Here x(t)denotes the system state, u(t)the inputs

to the system, bthe box constraints on the state, lthe box

constraints on the input and hcaptures the track boundary

constraints. The state of the system is augmented with the

advancing parameter θand the virtual input ˙

θis appended to

the inputs from the original system dynamics.

Qc,Ql,Qθand Rare the cost-function parameters of the

MPC controller.

The track boundary constraint is realized as a convex disk

constraint.

h(x, Φ) = x−xlin

t(θ)2+y−ylin

t(θ)2−rΦ(ˆ

θ)2

Here rΦ(ˆ

θ)is the half-width of the track at the last predicted

arc length.

The contouring error ϵlin

cand lag error ϵlin

ldescribed in [3]

are modiﬁed by linearizing them around the previous solution

θ.

The MPCC is optimizing to move the position of a virtual

point θ(t)along the track to achieve as much progress as

possible while steering the model of the vehicle to keep

contouring and lag errors as small as possible.

The center-line of the track is given in way-points (X-and Y-

position). To implement MPCC an arc-length parametrization

Φis required. This is realized by interpolating the way-points

using cubic splines with a cyclic boundary condition, and

creating a dense lookup table with the track location and the

linearization parameters.

B. Switched Model Predictive Contouring Control (SMPCC)

To achieve more control over the path planning of the ego

vehicle, the proposed SMPCC setup is displayed in ﬁgure 3.

Agent with SMPCC Planner

Normal Mode Drive Right Drive Left

−t<ec< t −ϵ<ec< t t<ec< ϵ

Fig. 3. Modes of SMPCC

In this, the agent switches between different modes deﬁned

by different solver formulations. The constraints for the modes

as shown in ﬁg. 3 are added to the problem formulated

in section IV-A. These constraints are tuned with the slack

variable (ϵ) to ensure that the planner does not get stuck into

an in-feasibility loop leading to a crash. The ego can therefore

overtake on both left and right side of the opponent vehicle.

The MPCC control problem is solved by efﬁcient interior point

solvers in FORCES [18].

V. RESULTS AND DISCUSSION

A. Ofﬂine Spatial Policy Learning

In the following section, we present the results from the

ofﬂine experiments. Algorithm 1 elucidates the ofﬂine exper-

iment based policy learning developed in this paper.

X,Yare the set of x and y coordinate offsets (expressed

as percentage of track width) and S={−0.2,0,+0.2}is

the speed offset expressed as a percentage change from the

baseline obstacle speed.

The obstacle update model is g(·), which is a pre-computed

curvature optimal race line of the race track under consider-

ation. Deﬁne Ψ:{Silverstone circuit (England), Hungaroring

circuit (Budapest), Catalunya circuit (Spain) and N¨

urburgring

circuit (Germany)}, the set of race circuits on which the race

COMSNETS 2022: Intelligent Transportation Systems Workshop

861

Authorized licensed use limited to: University of Pennsylvania. Downloaded on January 17,2022 at 17:35:26 UTC from IEEE Xplore. Restrictions apply.

is conducted for learning the policies. In total we simulate

576 experiments based on the 16 lateral, 12 longitudinal and

3 velocity variations for each track portion in each of the

four racetracks. The algorithm 1 populates the policy map Πk

with the track regions for each of the curves, having highest

probability of overtakes for all k∈Ψ.

Algorithm 1: Ofﬂine Spatial Policy Learning

Function MPCC Planner(Xobs):

solve MPCC problem deﬁned in Section IV

return u∗;

Function Check Overtake(Xobs,Xego ):

project Xobs,Xego as s1, s2on track

return bool(s1> s2);

for k∈Ψdo

initialize: Πk={}

for τ∈Tdo

initialize: p = {}, overtakes = {}, total = {}

for x, y, s ∈ X XYXSdo

initialize: Xego, Xobs

for t= 0 to Tsim do

u∗=MPCC Planner (Xobs)

steer ego: X+

ego =f(Xego , u∗)

update obstacle pos: X+

obs =g(Xobs)

Xobs, Xego =X+

obs, X +

ego

identify track region i∈ {0,1,2,3}

if Check Overtake(Xobs, Xego )then

overtakes[Ri]++;

total[Ri]++ ;

end

compute p[Ri] = overtakes[Ri]/total[Ri],∀i

Πk[τ] = argmax(p)

end

Figure 4 describes an example of the four track regions

and the overtaking probabilities that are evaluated based on

algorithm 1. The overtaking probabilities for two of the

racetracks with their respective track portions are displayed

in tables I and II.

Track Region R1

Front Left

p(R1)= 0.43

Rear Left

Track Region R4

p(R4) = 0.23

Overtaking Area R2

Front Right

p(R2) =0.38

Track Region R3

Rear Right

p(R3) =0.28

Fig. 4. Predeﬁned track regions of interest for overtaking maneuver at a

speciﬁc turn with overtaking success probabilities.

Results from the experiments conducted on the four race-

tracks are summarized in ﬁgures 5, 6 and 7. They display the

overtaking probability distribution for each overtaking zone

TABLE I

OVERTAKING PROBABI LITIE S FOR ALL T RACK PORTIONS - RACET RAC K 1

( SILVERS TONE, ENGLAND)

Track

Portion

(τ)

Track

Portion

Type

p(R1)p(R2)p(R3)p(R4)

1 Sweeper 0.99 0.93 0.52 0.59

2 Hairpin 0.63 0.52 0.31 0.33

3 Hairpin 0.41 0.38 0.25 0.21

4 Sweeper 0.65 0.67 0.57 0.59

5 Chicane 0.25 0.21 0.14 0.21

6 Straight 0.99 1.0 0.95 0.99

7 Sweeper 0.47 0.52 0.33 0.31

8 Hairpin 0.40 0.36 0.37 0.38

TABLE II

OVERTAKING PROBABI LITIE S FOR ALL T RACK PORTIONS - RACET RAC K 2

(BUDAPEST, HUN GARY)

Track

Portion

(τ)

Track

Portion

Type

p(R1)p(R2)p(R3)p(R4)

1 Hairpin 0.52 0.49 0.34 0.29

2 Hairpin 0.31 0.43 0.27 0.21

3 Sweeper 0.71 0.62 0.54 0.59

4 Hairpin 0.67 0.53 0.39 0.41

5 Chicane 0.41 0.43 0.22 0.25

6 Chicane 0.27 0.21 0.19 0.14

7 Sweeper 0.67 0.64 0.35 0.37

8 Sweeper 0.59 0.48 0.40 0.36

9 Hairpin 0.44 0.58 0.35 0.29

10 Hairpin 0.65 0.60 0.44 0.38

11 Straight 0.97 0.99 0.95 0.94

(R1-R4) at chicanes, sweeper curves and hairpins respectively

across all the race-tracks considered for policy learning. From

statistical analysis on the ofﬂine experiments for four different

racetracks and their 39 track portions, we have the following

observations:

•An overtaking maneuver will be more successful if we

are closer to the opponent vehicle. This can be seen in

both the raw numbers from table I and II as well in the

higher median and maximum in the boxplots for R1&

R2.

•On the straight, it does not really matter which side we

are on the track when trying to overtake, we just need

stay closer to the opponent.

•The sweeper curve generally has a high overtaking prob-

ability due to the high speeds of the car. We only get

an advantage here if we are close enough to the car and

therefore we need to be in region R1or R2.

•We can see that in each hairpin we have the highest

overtaking probability in either in R1or R2depending

on the nature of the hairpin turn: right or left. This is

due to the fact that being on the inside of the curve near

a hairpin, the car is able to achieve a better trajectory

through the hairpin.

•Achicane generally has the lowest overtaking probability

due to the fact that it is a complex region for the car to

maneuver and with lesser space for overtaking. Since the

chicane is a highly convoluted turn, curvature direction

does not indicate any better start regions.

COMSNETS 2022: Intelligent Transportation Systems Workshop

862

Authorized licensed use limited to: University of Pennsylvania. Downloaded on January 17,2022 at 17:35:26 UTC from IEEE Xplore. Restrictions apply.

Front Left Front Right Back Right Back Left

0.4

0.6

0.8

1.0

Overtaking Probability

Fig. 5. Overtaking probability distribution for sweeper curves.

Front Left Front Right Back Right Back Left

0.2

0.3

0.4

0.5

0.6

Overtaking Probability

Fig. 6. Overtaking probability distribution for hairpins.

B. Online Policy Execution

Algorithm 1 furnishes a policy map for all the different

curves/track portions from different race tracks considered

during the policy learning phase. We will now compute the

best policy Π(s)as a function of the track portion sand

integrate it into the SMPCC setup on the ego vehicle. We then

compare the number of overtakes with and without the spatial

policy based MPCC controller to verify the effectiveness of

our algorithm.

Algorithm 2: Online Evaluation with SMPCC

Function SMPCC(Xobs, mode):

if mode = ‘normal‘ then

u∗=solve MPCC problem in Section IV

elsemodify MPCC to integrate policy (see ﬁg. 3)

u∗=solve modiﬁed MPCC problem

end

return u∗;

initialize Xego,Xobs mode = ‘normal‘

for t= 0 to Tsim do

u∗=SMPCC (Xobs,mode)

steer the ego: X+

ego =f(Xego , u∗)

update obstacle position: X+

obs =g(Xobs)

Xobs, Xego =X+

obs, X +

ego

identify track portion τwhere ego is present

policy lookup: mode = Π[τ]

end

The results display that the ofﬂine policy learning approach

has been successful by showing an increased number of

Front Left Front Right Back Right Back Left

0.2

0.3

0.4

0.5

0.6

Overtaking Probability

Fig. 7. Overtaking probability distribution for chicanes.

TABLE III

RESULTS: NUMBER OF OVERTAKES WITH AND WITHOUT POLICY ON

RACETRACK 1 (SILVE RSTONE, EN GLAND )

Track

Portion (τ)Track

Portion

Type

Number of

Overtakes

Policy OFF

Number of

Overtakes

Policy ON

1 Sweeper 436 452

2 Hairpin 256 337

3 Hairpin 308 426

4 Sweeper 342 357

5 Chicane 117 302

6 Straight 565 566

7 Sweeper 237 283

8 Hairpin 218 394

TABLE IV

RESULTS: NUMBER OF OVERTAKES WITH AND WITHOUT POLICY ON

RACETRACK 2 (BUDAPEST, HUNGA RY)

Track

Portion (τ)Track

Portion

Type

Number of

Overtakes

Policy OFF

Number of

Overtakes

Policy ON

1 Hairpin 286 375

2 Hairpin 297 404

3 Sweeper 410 446

4 Hairpin 337 398

5 Chicane 270 361

6 Chicane 180 329

7 Sweeper 372 438

8 Sweeper 239 297

9 Hairpin 288 326

10 Hairpin 301 374

11 Straight 558 563

overtaking maneuvers at all racetracks and track portions.

We can observe that overtaking on the straights is usually

easy (even without policy). Since sweeper curves are usually

wide track portions that allow high speeds and do not involve

complicated maneuvers, both with and without switching

policy, we achieve higher overtaking maneuvers. Although

we see that the switching policy leads to more overtaking

maneuvers because having the right position for the overtaking

maneuver is crucial here, too. We see the highest impact of our

algorithm at hairpins and chicanes. This is mainly due to the

fact that overtaking at these curves is usually complicated and

needs a good strategy beforehand. We see that our algorithm

can nearly double the amount of overtaking maneuvers in the

chicane (track portion 10 of Catalunya) which substantiates the

fact that having the right starting position for an overtaking

maneuver is indispensable.

COMSNETS 2022: Intelligent Transportation Systems Workshop

863

Authorized licensed use limited to: University of Pennsylvania. Downloaded on January 17,2022 at 17:35:26 UTC from IEEE Xplore. Restrictions apply.

C. Evaluation on an Unknown Track

As an ultimate test, we now apply this online policy

algorithm to the agent racing in an unknown racetrack (Sakhir

Circuit, Bahrain). The results from Sakhir Circuit show an

increase in the number of overtakes with the policy at all track

portions as displayed in table V.

TABLE V

RESULTS: NUM BER OF OVE RTAKE S WITH AN D WITHO UT POLICY ON

UNKNOWN RAC ETRACK 5 (SAKHIR, BAHRAIN)

Track

Portion

(τ)

Track

Portion

Type

Turn Di-

rection

(Left/

Right)

Policy

Region Number

of Over-

takes

Policy

OFF

Number

of Over-

takes

Policy

1 Chicane Right R2254 298

2 Sweeper Right R1278 367

3 Chicane Left R1214 391

4 Hairpin Right R1319 421

5 Hairpin Left R2224 327

6 Straight Left R2551 560

7 Sweeper Right R1348 418

8 Sweeper Right R1297 368

9 Straight Left R2559 549

10 Sweeper Right R1311 396

11 Straight Right R1552 561

Additional ofﬂine policy evaluations have shown that a

generalization from turn directions and overtaking zones is

only partially useful. The generalised policies did not lead to

successful overtakes always and were not feasible in some

cases. A possible reﬁnement could be the focus on using a

parametric curvature of the turn rather than the high level

deﬁnition of left or right.

VI. CONCLUSION AND FUTURE WORK

In this paper, an algorithm for spatial policy learning from

ofﬂine experiments is proposed to learn effective overtaking

strategies based on position advantage. Extensive simulations

on real world racetrack layouts show that the proposed al-

gorithm is able to learn regions of high probabilities on a

racetrack for successful and safe overtaking maneuvers. The

(SMPCC) setup, that has the driving policies integrated into

the motion planning and control stack of the vehicle resulted in

an increase in the number of overtakes. Speciﬁcally, the policy

based algorithm was found to be highly effective for convo-

luted track portions like chicanes, where a positional advantage

plays a major role in a successful overtaking maneuver. In

summary, with the setup deﬁned in this paper, one can create

more realistic and better overtaking maneuvers for autonomous

vehicles. This brute-force technique of learning spatial infor-

mation serves as a fundamental result and ground truth for

future work. Extensions to this work will include learning-

based algorithms based on reinforcement learning techniques

to identify the overtaking probability based on the curvature

information of upcoming turns and can therefore applied to

behavioral planners for passenger autonomous vehicles. One

can consider non-reactive, reactive and aggressive opponents

which are defensive and sophisticated to overtake and therefore

deriving a holistic strategy for overtaking on e.g. highways.

With this setup, one can learn and integrate complex human-

like overtaking maneuvers for autonomous vehicles in a safe

and reliable manner.

REFERENCES

[1] J. Betz, A. Wischnewski, A. Heilmeier, F. Nobis, T. Stahl, L. Her-

mansdorfer, B. Lohmann, and M. Lienkamp, “What can we learn from

autonomous level-5 motorsport?” in Proceedings. Springer Fachmedien

Wiesbaden, Sep. 2018, pp. 123–146.

[2] M. OKelly, H. Zheng, D. Karthik, and R. Mangharam, “F1tenth: An

open-source evaluation environment for continuous control and rein-

forcement learning,” in Proceedings of the NeurIPS 2019 Competition

and Demonstration Track, ser. Proceedings of Machine Learning Re-

search, vol. 123. PMLR, 2020, pp. 77–89.

[3] A. Liniger, A. Domahidi, and M. Morari, “Optimization-based au-

tonomous racing of 1: 43 scale rc cars,” Optimal Control Applications

and Methods, vol. 36, no. 5, pp. 628–647, 2015.

[4] S. Dixit, S. Fallah, U. Montanaro, M. Dianati, A. Stevens, F. Mc-

cullough, and A. Mouzakitis, “Trajectory planning and tracking for

autonomous overtaking: State-of-the-art and future prospects,” Annual

Reviews in Control, vol. 45, pp. 76–86, 2018.

[5] J. Betz, A. Wischnewski, A. Heilmeier, F. Nobis, L. Hermansdorfer,

T. Stahl, T. Herrmann, and M. Lienkamp, “A software architecture

for the dynamic path planning of an autonomous racecar at the limits

of handling,” in 2019 IEEE International Conference on Connected

Vehicles and Expo (ICCVE). IEEE, Nov. 2019.

[6] T. Stahl, A. Wischnewski, J. Betz, and M. Lienkamp, “Multilayer

graph-based trajectory planning for race vehicles in dynamic scenarios,”

in 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

IEEE, Oct. 2019.

[7] D. Caporale, L. Venturini, A. Fagiolini, L. Pallottino, A. Settimi,

A. Biondo, F. Amerotti, F. Massa, S. D. Caro, and A. Corti, “A planning

and control system for self-driving racing vehicles,” in 2018 IEEE

4th International Forum on Research and Technology for Society and

Industry (RTSI). IEEE, Sep. 2018.

[8] A. Buyval, A. Gabdulin, R. Mustaﬁn, and I. Shimchik, “Deriving

overtaking strategy from nonlinear model predictive control for a race

car,” in 2017 IEEE/RSJ International Conference on Intelligent Robots

and Systems (IROS). IEEE, Sep. 2017.

[9] D. Loiacono, A. Prete, P. L. Lanzi, and L. Cardamone, “Learning

to overtake in TORCS using simple reinforcement learning,” in IEEE

Congress on Evolutionary Computation. IEEE, Jul. 2010.

[10] T. Br¨

udigam, A. Capone, S. Hirche, D. Wollherr, and M. Leibold, “Gaus-

sian process-based stochastic model predictive control for overtaking in

autonomous racing,” 2021.

[11] A. Liniger and J. Lygeros, “A noncooperative game approach to au-

tonomous racing,” IEEE Transactions on Control Systems Technology,

vol. 28, no. 3, pp. 884–897, May 2020.

[12] G. Notomista, M. Wang, M. Schwager, and M. Egerstedt, “Enhancing

game-theoretic autonomous car racing using control barrier functions,”

in 2020 IEEE International Conference on Robotics and Automation

(ICRA). IEEE, May 2020.

[13] C. Jung, S. Lee, H. Seong, A. Finazzi, and D. H. Shim, “Game-theoretic

model predictive control with data-driven identiﬁcation of vehicle model

for head-to-head autonomous racing,” 2021.

[14] U. Rosolia and F. Borrelli, “Learning how to autonomously race a car:

A predictive control approach,” IEEE Transactions on Control Systems

Technology, vol. 28, no. 6, pp. 2713–2719, Nov. 2020.

[15] J. Kabzan, L. Hewing, A. Liniger, and M. N. Zeilinger, “Learning-based

model predictive control for autonomous racing,” IEEE Robotics and

Automation Letters, vol. 4, no. 4, pp. 3363–3370, Oct. 2019.

[16] N. R. Kapania and J. C. Gerdes, “Learning at the racetrack: Data-

driven methods to improve racing performance over multiple laps,” IEEE

Transactions on Vehicular Technology, vol. 69, no. 8, pp. 8232–8242,

Aug. 2020.

[17] A. Heilmeier, A. Wischnewski, L. Hermansdorfer, J. Betz, M. Lienkamp,

and B. Lohmann, “Minimum curvature trajectory planning and control

for an autonomous race car,” Vehicle System Dynamics, vol. 58, no. 10,

pp. 1497–1527, Jun. 2019.

[18] A. Zanelli, A. Domahidi, J. Jerez, and M. Morari, “Forces nlp: an efﬁ-

cient implementation of interior-point methods for multistage nonlinear

nonconvex programs,” International Journal of Control, vol. 93, no. 1,

pp. 13–29, 2020.

COMSNETS 2022: Intelligent Transportation Systems Workshop

864

Authorized licensed use limited to: University of Pennsylvania. Downloaded on January 17,2022 at 17:35:26 UTC from IEEE Xplore. Restrictions apply.

Cone Slalom With Automated Sports Car–Trajectory Planning Algorithm

Article

Full-text available

Jan 2023

In this paper, we present the system architecture and algorithms of an automated vehicle to perform a slalom. We demonstrate a novel trajectory planning algorithm based on optimization techniques using logic-based Benders de-composition, where an external loop optimizes the position of the nearest waypoint and an internal loop generates the optimal trajectory. The positions of the cones in this use case are unknown, but a mono camera and LiDAR detect them. They can be in a line or dispersed, have equal or unequal spacing, and the U-turns can be symmetric or asymmetric. A bicycle model is used to formulate a non-linear quadratic optimization problem aimed at optimal trajectory generation considering vehicle kinematics. Finally, the trajectory tracking control keeps the vehicle on the planned slalom trajectory while driving. The control system is interfaced with the vehicle via CAN and FlexRay buses. Much of the work was devoted to experiments with a real vehicle and fine-tuning the system parameters. During the validation of the system, interesting observations were made regarding the components' precision, frequency, and sensitivity.

Combining Event-Based Maneuver Selection and MPC Based Trajectory Generation in Autonomous Driving

Article

Full-text available

May 2022

Maneuver planning, which plays a key role in selecting desired lanes and speeds, is an essential element of autonomous driving. Generally, for a vehicle driving on a multilane road, there are several potential maneuvers in both longitudinal and lateral directions. Selecting the best maneuver from the various options represents a significant challenge. In this paper, we propose a maneuver selection algorithm and combine it with a trajectory generation algorithm, which is based on model predictive control (MPC). The maneuver selection method is a higher-level planner, which selects only one maneuver from all possible maneuvers based on the current situation and delivers it to a lower-level MPC-based trajectory tracking controller. The effectiveness of the proposed algorithm is validated by simulating an overtaking scenario on a multilane highway.

Autonomous vehicular overtaking maneuver: A survey and taxonomy

Article

May 2023

A Real-Time NMPC Controller for Autonomous Vehicle Racing

Conference Paper

Sep 2022

A Software Architecture for the Dynamic Path Planning of an Autonomous Racecar at the Limits of Handling

Conference Paper

Full-text available

Nov 2019

Learning How to Autonomously Race a Car: A Predictive Control Approach

Article

Full-text available

Nov 2019

We present a learning model predictive controller (LMPC) for autonomous racing. We model the autonomous racing problem as a minimum time iterative control task, where an iteration corresponds to a lap. The system trajectory and input sequence of each lap are stored and used to systematically update the controller for the next lap. In the proposed approach, the race time does not increase at each iteration. The first contribution is to propose a local LMPC which reduces the computational burden associated with existing LMPC strategies. In particular, we show how to construct a local safe set and approximation to the value function, using a subset of the stored data. The second contribution is to present a system identification strategy for the autonomous racing iterative control task. We use data from previous iterations and the vehicle's kinematic equations of motion to build an affine time-varying prediction model. The effectiveness of the proposed strategy is demonstrated by experimental results on the Berkeley Autonomous Race Car (BARC) platform.

What can we learn from autonomous level-5 motorsport?: chassis.tech plus

Chapter

Full-text available

Jan 2019

Whether BMW, VW or Google: Almost all leading automobile and technology companies are researching and developing the multi-stage autonomy of vehicles, which enables a completely self-driven vehicle without a driver in autonomy level 5. Based on his assessment and experience, the driver had previously carried out environmental detection, localization and vehicle control. The elimination of the driver creates numerous challenges in the development of level 5 vehicles. However, in order to achieve an efficient and safe driving style, the driving strategy of the vehicle must be adapted to the current environmental conditions. With the beginning of the third Formula E series an additional support series called Roborace will take place on the tracks currently used by the Formula E [2]. The goal of Roborace is to provide the first racing series for autonomous vehicles. The teams that take part at this competition will develop only the software for the provided autonomous cars (Robocars) [1]. The following paper is showing an overview of the Roborace project and the different software parts that have to be developed for a car that has to drive autonomously. Afterwards an evaluation of about what we can learn from autonomous level 5 motorsport is done

Enhancing Game-Theoretic Autonomous Car Racing Using Control Barrier Functions

Conference Paper

May 2020

Learning at the Racetrack: Data-Driven Methods to Improve Racing Performance Over Multiple Laps

Article

May 2020

Autonomous vehicles will generate tremendous data from the variety of sensors they employ to track the surrounding environment. This data is inherently valuable, as it gives algorithm designers the potential to leverage prior experience in order to improve driving performance over time. This paper uses the lens of autonomous racing to provide an example of how data from previous iterations of driving can be used to improve quantitative metrics of performance. Two complementary algorithms are demonstrated in this paper. The first algorithm uses iterative learning control (ILC) to simultaneously improve lateral and longitudinal tracking of the desired racing trajectory over multiple laps, while the second algorithm is focused on altering the trajectory itself using a search method. When driven experimentally at the limits of handling, the result is a reduction in lap time of nearly 1.4 seconds, a major improvement.

Multilayer Graph-Based Trajectory Planning for Race Vehicles in Dynamic Scenarios

Conference Paper

Oct 2019

Trajectory planning at high velocities and at the handling limits is a challenging task. In order to cope with the requirements of a race scenario, we propose a far-sighted two step, multi-layered graph-based trajectory planner, capable to run with speeds up to 212~km/h. The planner is designed to generate an action set of multiple drivable trajectories, allowing an adjacent behavior planner to pick the most appropriate action for the global state in the scene. This method serves objectives such as race line tracking, following, stopping, overtaking and a velocity profile which enables a handling of the vehicle at the limit of friction. Thereby, it provides a high update rate, a far planning horizon and solutions to non-convex scenarios. The capabilities of the proposed method are demonstrated in simulation and on a real race vehicle.

Learning-Based Model Predictive Control for Autonomous Racing

Article

Jul 2019

In this letter, we present a learning-based control approach for autonomous racing with an application to the AMZ Driverless race car gotthard . One major issue in autonomous racing is that accurate vehicle models that cover the entire performance envelope of a race car are highly nonlinear, complex, and complicated to identify, rendering them impractical for control. To address this issue, we employ a relatively simple nominal vehicle model, which is improved based on measurement data and tools from machine learning.The resulting formulation is an online learning data-driven model predictive controller, which uses Gaussian processes regression to take residual model uncertainty into account and achieve safe driving behavior. To improve the vehicle model online, we select from a constant in-flow of data points according to a criterion reflecting the information gain, and maintain a small dictionary of 300 data points. The framework is tested on the full-size AMZ Driverless race car, where it is able to improve the vehicle model and reduce lap times by $ {\mathbf{10}{\%}}$ while maintaining safety of the vehicle.

Minimum curvature trajectory planning and control for an autonomous race car

Article

Jun 2019

This paper shows a software stack capable of planning a minimum curvature trajectory for an autonomous race car on the basis of an occupancy grid map and introduces a controller design that allows to follow the trajectory at the handling limits. The minimum curvature path is generated using a quadratic optimisation problem (QP) formulation. The key contributions of this paper are the extension of the QP for an improved accuracy of the curvature approximation, the introduction of curvature constraints and the iterative invocation of the QP to significantly reduce linearisation errors in corners. On the basis of the resulting raceline, a velocity profile is calculated using a forward-backward-solver that considers the velocity dependent longitudinal and lateral acceleration limits of the car. The advantages and disadvantages of the proposed trajectory planning approach are discussed critically with respect to practical experience from various racetracks. The software stack showed to be robust in a real world environment as it ran successfully on the Roborace DevBot during the Berlin Formula E event in May 2018. The lap time achieved was within a tenth of a second of a human driver and the car reached about 150km/h and 80% of its acceleration limits.

A Planning and Control System for Self-Driving Racing Vehicles

Conference Paper

Sep 2018

Trajectory planning and tracking for autonomous overtaking: State-of-the-art and future prospects

Article

Mar 2018
ANNU REV CONTROL

Trajectory planning and trajectory tracking constitute two important functions of an autonomous overtaking system and a variety of strategies have been proposed in the literature for both functionalities. However, uncertainties in environment perception using the current generation of sensors has resulted in most proposed methods being applicable only during low-speed overtaking. In this paper, trajectory planning and trajectory tracking approaches for autonomous overtaking systems are reviewed. The trajectory planning techniques are compared based on aspects such as real-time implementation, computational requirements, and feasibility in real-world scenarios. This review shows that two important aspects of trajectory planning for high-speed overtaking are: (i) inclusion of vehicle dynamics and environmental constraints and (ii) accurate knowledge of the environment and surrounding obstacles. The review of trajectory tracking controllers for high-speed driving is based on different categories of control algorithms where their respective advantages and disadvantages are analysed. This study shows that while advanced control methods improve tracking performance, in most cases the results are valid only within well-regulated conditions. Therefore, existing autonomous overtaking solutions assume precise knowledge of surrounding environment which is not representative of real-world driving. The paper also discusses how in a connected driving environment, vehicles can access additional information that can expand their perception. Hence, the potential of cooperative information sharing for aiding autonomous high-speed overtaking manoeuvre is identified as a possible solution.

Deriving Spatial Policies for Overtaking Maneuvers with Autonomous Vehicles

Figures

Recommended publications

Track based Offline Policy Learning for Overtaking Maneuvers with Autonomous Racecars

Indy Autonomous Challenge -- Autonomous Race Cars at the Handling Limits

Indy Autonomous Challenge - Autonomous Race Cars at the Handling Limits

Game-theoretic Objective Space Planning