Conference PaperPDF Available

A GPU Implementation of a Look-Ahead Optimal Controller for Eco-Driving Based on Dynamic Programming

June 2021

June 2021

DOI:10.23919/ECC54610.2021.9655197

Conference: 2021 European Control Conference (ECC)

Authors:

Zhaoxuan Zhu

The Ohio State University

Shobhit Gupta

General Motors Company

Shreshta Rajakumar Deshpande

The Ohio State University

Show all 5 authorsHide

Possible Cases of Traffic Light Phase Encountered by the Vehicle.

…

Route 15 and 19 on OpenStreetMap

…

Comparison of VD&PT State Trajectories and Cumulative Fuel Consumption on Urban Driving (Route 15)

…

Comparison of VD&PT State Trajectories and Cumulative Fuel Consumption on Mixed Driving (Route 19)

…

Figures - uploaded by Zhaoxuan Zhu

Content may be subject to copyright.

Content uploaded by Zhaoxuan Zhu

Content may be subject to copyright.

A GPU Implementation of a Look-Ahead Optimal Controller for

Eco-Driving Based on Dynamic Programming

Zhaoxuan Zhu∗, Shobhit Gupta, Nicola Pivaro, Shreshta Rajakumar Deshpande and Marcello Canova

Abstract— Predictive energy management of Connected and

Automated Vehicles (CAVs), in particular those with multiple

power sources, has the potential to signiﬁcantly improve energy

savings in real-world driving conditions. In particular, the eco-

driving problem seeks to design optimal speed and power usage

proﬁles based upon available information from connectivity and

advanced mapping features to minimize the fuel consumption

between two designated locations.

In this work, the eco-driving problem is formulated as a

three-state receding horizon optimal control problem and solved

via Dynamic Programming (DP). The optimal solution, in terms

of vehicle speed and battery State of Charge (SoC) trajectories,

allows a connected and automated hybrid electric vehicle to

intelligently pass the signalized intersections and minimize

fuel consumption over a prescribed route. To enable real-time

implementation, a parallel architecture of DP is proposed for

an NVIDIA GPU with CUDA programming. Simulation results

indicate that the proposed optimal controller delivers more

than 15% fuel economy beneﬁts compared to a baseline control

strategy and that the solver time can be reduced by more than

90% by the parallel implementation when compared to a serial

implementation.

I. INTRODUCTION

Connected and Automated Vehicles (CAVs) have the po-

tential to increase safety, driving comfort as well as fuel

economy, by exploiting features such as advanced map-

ping, GPS location, and information available via vehicle-

to-vehicle (V2V) and vehicle-to-infrastructure (V2I) com-

munication [1], [2]. Meanwhile, Hybrid Electric Vehicles

(HEVs) increase the overall powertrain efﬁciency by includ-

ing battery pack(s) and electric motor(s) as alternative energy

storage and power generation devices [3]. Combining the two

technologies could further improve the fuel economy, how-

ever, poses a greater challenge from the planning&control

perspective.

In this context, ﬁnding the optimal powertrain control

strategy that minimizes the total fuel consumption and the

travel time between origin and destination is known in

literature as the eco-driving problem [4]. The contributions

made in this ﬁeld distinguish among two aspects, namely

powertrain conﬁgurations and trafﬁc scenarios. Regarding

powertrain conﬁguration, the difference is in whether the

∗Corresponding author. Email: zhu.1083@osu.edu

The authors are with the Center for Automotive Research, The Ohio State

University, Columbus, OH 43212, USA

This version has been accepted for publication in Proc. European Control

Conference (ECC), 2021. Personal use of this material is permitted. Per-

mission from EUCA must be obtained for all other uses, in any current or

future media, including reprinting/republishing this material for advertising

or promotional purposes, creating new collective works, for resale or

redistribution to servers or lists, or reuse of any copyrighted component

of this work in other works.

powertrain is equipped with a single power source [5], [6],

[7], [8] or features a hybrid architecture [9], [10], [11].

The latter requires a more complex control algorithm, as

the battery State-of-Charge (SoC) needs to be regulated and

utilized efﬁciently.

The difference in trafﬁc scenarios lies in whether the

controller is capable of processing the real-time Signal Phase

and Timing (SPaT) information at signalized intersections.

Ozatay et al. [5] proposed a framework providing advisory

speed proﬁle using online optimization conducted on a cloud-

based server without considering the real-time trafﬁc light

variability. Olin et al. [10] used Dynamic Programming

(DP) to solve the eco-driving problem. As trafﬁc lights are

not explicitly considered in these studies, the control logic

requires assistance from other decision-making agents, such

as human drivers or adaptive cruise control (ACC) systems.

Other studies have explicitly modeled and considered SPaT,

for example Jin et al. [6] formulated the problem as a Mixed

Integer Linear Programming (MILP) for a conventional ve-

hicle. Asadi et al. [12] used trafﬁc simulation models and

proposed to solve the problem considering probabilistic SPaT

with DP. Sun et al. [8] formulated the eco-driving problem as

a distributionally robust stochastic optimization problem with

collected real-world data and solved it with DP. Guo et al.

[9] proposed a hierarchical control framework with a hybrid

vehicle. Bae [11] extended the work in [8] by including a

heuristic HEV supervisory controller.

Many of the aforementioned studies used DP [13] as the

technique to either solve the optimization within the receding

horizon of a MPC or the entire problem. Unfortunately, the

existing use of DP either focus on enabling the vehicle with

single power source to pass the signalized intersections [12],

[8] or on optimally splitting the power demand between

engine and electrical motor [10] without considering SPaT

information. In [8], two states, namely vehicle speed and

travel time, are included in their DP formulation, whereas in

[10], velocity and SoC are considered as the DP states. Since

the number of DP states is subject to curse of dimensionality

[13], including one extra state often results in intractable

computational requirements, preventing from online imple-

mentation in rapid prototyping control systems.

In this study, the DP formulation proposed in [8] and [10]

are merged to develop an eco-driving control strategy for

HEVs that is also able to optimally cross signalized inter-

sections, given real-time SPaT information. To mitigate the

computational burden from the additional state, an efﬁcient

parallel implementation is developed for a NVIDIA GPU

with CUDA programming.

arXiv:2104.01284v1 [eess.SY] 3 Apr 2021

𝑔𝑒𝑎𝑟

Vehicle

Dynamics

Transmission

(Gear Box)

Engine

BSG

Battery

Transmission Control

Module

Torque

Converter

𝑇

!"#

𝜔$%&

𝑇

$%& 𝑣

𝜔!"#

𝑇'(#

𝑇

𝑇'(#

*!( 𝜔'(#

𝑇

!"#

*!(

𝑃'(#

𝑇

𝜔&

Obta ined from the

model (feedback)

𝑙𝑜𝑐𝑘

𝑆𝑂𝐶

Plant Mod el

Fig. 1: Block Diagram of 48V P0 Mild-Hybrid Drivetrain.

II. MOD EL DEVELOPMENT AND VALIDATION

In this work, a forward-looking dynamic powertrain model

is utilized for performance and fuel economy prediction

over real-world routes [10]. The model considers a P0 mild-

HEV with a 48V Belted Starter Generator (BSG) performing

torque assist, regenerative braking and start-stop functions.

As shown in Fig. 1, the inputs to the vehicle dynamics and

powertrain (VD&PT) model are obtained from a simpliﬁed

model of the Engine Control Module (ECM), which contains

the essential functions to convert the driver’s input (pedal

position) to torque commands. The outputs of the ECM, the

desired BSG torque (Tdes

bsg ) and desired engine torque (Tdes

eng )

are obtained from a production torque split strategy, which

is used as the baseline for fuel economy evaluation.

The vehicle simulator contains low-frequency dynamic

models of the powertrain and longitudinal vehicle dynamics.

The battery is modeled as a zero-th order equivalent circuit,

from which the battery State-of-Charge (SoC) is calculated;

the vehicle longitudinal dynamics are described by the road

load equation [3]. Low-frequency, quasi-static models are

developed for the engine (fuel and friction maps), BSG,

torque converter and transmission (efﬁciency maps).

The vehicle model was validated based on chassis dy-

namometer data. Fig. 2 shows a veriﬁcation over the drive

cycle FTP-75, where the vehicle velocity, battery SoC (ξ)

and fuel consumption are compared against test data. Small

mismatches in the battery SoC proﬁles are attributed to sim-

pliﬁcation of the 12V electrical system, where the auxiliary

loads are modeled using a constant current bias. The fuel

consumption over FTP-75 is well predicted, with cumulative

error less than 4%.

III. PROB LEM FO RMU LATIO N AND SOLUTION ME THO DS

The objective of the eco-driving problem, formulated in

the spatial domain, is to minimize the fuel consumed by the

vehicle over an entire route consisting of N steps:

min

Teng,s,Tbsg,s

s=N

s=1

(γ·˙mf,s + (1 −γ)) ·∆ts,(1)

where sis the discrete index for the vehicle longitudinal

position, γis the weighing factor between the fuel consump-

tion and the travel time, and ˙mf,s is the instantaneous fuel

consumption. ∆tsis the travel time per step computed as

-10

-5

t [%]

Velocity [m/s]

Model

Experimental Data

0 200 400 600 800 1000 1200 1400

Time [s]

200

400

600

800

Fuel Consumed [g]

Final Error = 3.746 %

Fig. 2: Validation of Vehicle Velocity, SoC and Fuel Con-

sumed over FTP-75.

follows:

∆ts=∆d

¯vs

,(2)

where ∆dis the distance step, i.e. ∆d=ds+1 −ds,

calculated from dsthe distance traveled along the route, and

¯vs=vs+vs+1

2is the average velocity. The state and action

space are subject to the constraints as described in [10].

A primary beneﬁt of the spatial formulation [14] is that

it inherently aids in the incorporation of distance-based

route features, such as speed limits, grade, trafﬁc light and

stop sign locations. However, the spatial-domain formulation

would make it difﬁcult to incorporate time-based informa-

tion such as SPaT received from V2I communication. This

requires one to augment the state space with time as an

additional state, which will be elaborated later in this section.

Consider a case where a CAV is approaching a signal-

ized intersection and would receive a trafﬁc light phase

(red/green) through V2I communication. To avoid boundary

cases and ensure feasibility, the yellow phase is assumed to

be a part of the green phase if the distance to the trafﬁc

light is close to the critical braking distance as formulated in

[15]. The time remaining in the current phase or transitioning

to the next phase would change as the vehicle approaches

the signalized intersection. In this section, tGR,s and tRG,s

will represent time for a green-red and red-green transition

respectively at position s, as shown in the Fig. 3. In this

work it is assumed that the positions of all the trafﬁc lights

in the route are known a priori from a navigation system,

and contained in the set DTL.

Assume that the discretized state dynamics for the eco-

driving problem has the following form:

xs+1 =fs(xs, us), s = 1,· · · , N −1.(3)

where xs∈ Xs⊆Rpand us∈ Us⊆Rqare the state and

control action, respectively.

Fig. 3: Possible Cases of Trafﬁc Light Phase Encountered

by the Vehicle.

In this work, the state variables are the vehicle velocity

(vs), battery SoC (ξs)and travel time (ts). The control

actions are the engine torque (Teng,s)and BSG torque

(Tbsg,s):

xs= [vs, ξs, ts]T(4a)

us= [Teng,s, Tbsg,s ]T(4b)

The equations describing the state dynamics fs(xs, us)are:

vs+1 =sv2

s+ 2∆d·Ftr,s −Froad,s(vs)

M(5a)

ξs+1 =ξs−∆d

¯vs

·¯

Ibat,s

Cnom

(5b)

ts+1 =(ts+tRG,s , s ∈ DTL and ¯vs= 0

ts+∆d

¯vs, s /∈ DTL

(5c)

where Ftr,s is the tractive force produced by the powertrain

[10]. Froad,s is the road load resistive force, Mis the total

vehicle mass, ¯

Ibat,s is the average current evaluated over a

distance step, Cnom is the nominal battery capacity, tsis

the travel time at a position s. Intuitively, Eqn. (5c) shows

the time is teleported to the end of the red phase when the

vehicle stops at the trafﬁc light.

The eco-driving optimization problem is formulated as

a receding horizon optimal control problem where the full

route of Nsteps is solved over a reduced horizon NH(<<

N). At any given position s= 1, . . . , N −NH, the optimiza-

tion problem is formulated as:

J∗(xs) = min

{µk}s+NH−1

k=s

s+NH−1

k=s

c(xk, µk(xk)) + cT(xs+NH),

c(xk, µk(xk)) = (γ·˙mf,k(xk, µk(xk)) + (1 −γ)) ·∆tk

(6)

where µk:X → U is the admissible control policy at the

step kin the prediction horizon, c:X × U → Ris the

stage cost function deﬁned as the weighted average of the

fuel consumption and the travel time, cT:X → Ris the

terminal cost function.

The state space and action space are subject to following

Fig. 4: Travel Time Constraint at the Signalized Intersection

in a NHHorizon.

constraints: ∀s= 1, . . . , N −NH,∀k=s, . . . , s +NH:

vk∈[vmin

k, vmax

k],(7a)

ξk∈[ξmin

k, ξmax

k],(7b)

tk∈ TG,k,(7c)

ak∈[amin, amax ],(7d)

Teng,k ∈[Tmin

eng (vk), T max

eng (vk)],(7e)

Tbsg,k ∈[Tmin

bsg (vk), T max

bsg (vk)],(7f)

where vmin

s, vmax

sare the minimum and maximum route

speed limits respectively, ξmin

s, ξmax

sare the static limits

applied on battery SoC, TG,s represents the feasible set

on the travel time at the signalized intersection, amin, amax

represent the limits imposed on the acceleration for comfort,

Tmin

eng (vs), T max

eng (vs)are the minimum and maximum engine

torque limits, and Tmin

bsg (vs), T max

bsg (vs)are the minimum

and maximum BSG torque limits, respectively. To ensure

SoC neutrality over the entire itinerary, a terminal constraint

ξ1=ξNis applied to the battery.

Consider a case where a NHhorizon (k=s, . . . , s +

NH−1) comprises of a signalized intersection located at

kTL as shown in Fig. 4. The travel time at the beginning

of the horizon is given by tsand a maximum travel time tf

is imposed at the end of the horizon. tfis chosen such that

the ego vehicle does not impede the trafﬁc ﬂow. This can

be computed as an expectation from multiple historical trips

along the same route [16]. In this work, tfis assumed to

be 80s for a 200m horizon to ensure feasibility. The status

of the trafﬁc light is then sampled into an indicator vector

IG,k of size tf/∆twith each element representing no trafﬁc

light or green phase with 1 and red phase with 0. Note that

∆there is the constant time discretization for sampling. The

feasible set TG,k is then deﬁned as follows:

TG,k =t:ts+ ∆t·z|(IG,k)z= 1, z = 1, ..., tf/∆t,

(8)

where (IG,k)zrepresents the zth element of the vector. The

size of the sampled trafﬁc light status vector is chosen to

be the same as the grid size of the DP solver in the time

dimension, which will be explained in the next section.

Powertrain Model

Battery Model

Fig. 5: Serial DP Architecture

Knowing the driving conditions within the receding hori-

zon, the optimization problem can be solved ofﬂine via DP.

Following the nomenclature in [17], the optimal policy µ∗

along with the optimal cost-to-go function Jk:X → R,

∀k=s, . . . , s +N−1can be calculated through backward

recursion as follows:

Js+N(x) = cT(x) + φT(x),(9a)

Fk(x, u) = ck(x, u) + φk(x) + Jk+1(fk(x, u)),(9b)

µ∗

k= argmin

µk

Fk(x, µk(x)),(9c)

Jk(x) = Fk(x, µ∗

k(x)),(9d)

where ckand cTare the discretized stage and terminal

cost function respectively; φkand φTare penalty functions

introduced to ensure that the trajectory stays feasible. In

this paper, an approximate ofﬂine solution of a full-route

optimization under partial information is initially obtained

via DP and used as a terminal cost approximation. Other

techniques, such as manual calibration or reinforcement

learning techniques [18], [19] may also be employed. The

beneﬁt of using this method is that it provides the closed-

loop optimal policy that inherently adds robustness against

approximated plant dynamics or other modeling errors [7].

Further, DP can solve highly nonlinear and hybrid problems

while ensuring constraint satisfaction.

IV. ARCHITECTURES FOR DP I MPLEMENTATION

For the eco-driving optimization problem, a serial DP

architecture can be constructed, where each state-action

combination is evaluated in a 5-layer nested loop, as shown

in Fig. 5. The powertrain model takes the current vehicle

speed, the engine torque and the BSG torque as the inputs

and calculates the change in vehicle speed ∆vs, the change

in time ∆ts, BSG power demand Pbat (input to the battery

model) and the stage cost c(xs, µ(xs)). The battery model

takes Pbat and calculates the change in battery SoC ∆ξfor

the nodes in the SoC grid. If an infeasibility is encountered

for a state-action combination, the cost-to-go is set to a value

that is orders of magnitude larger than the stage cost.

To maintain a reasonable accuracy, the grid size is set

to be (nx1, nx2, nx3, nu1, nu2) = (35,26,40,23,30). As

a result, the number of execution of the innermost loop

is on the order of 107for each DP backward recursion

step. Meanwhile, with ∆d= 10m, 20 recursion steps

are required per distance step. Without any parallelization,

GPU(Device)

Kernel 1:Powertrain Model

DP backward recursion:

CPU(Host)

Interact with other modules;

Defineconstraints within prediction horizon;

Kernel 2:Battery Model and Cost-to-go Update

Total number of threads:

Constraints

Fig. 6: Parallel DP Architecture

the computational requirement becomes intractable for most

automotive applications.

To mitigate the issue, a parallel architecture is proposed in

this work, and implemented on a NVIDIA GPU with CUDA

C++ programming language. Fig. 6 shows the architecture of

the parallel implementation. With comparable expense and

power consumption, a CPU is faster at performing complex

and long computing sequences thanks to the more resources

dedicated to memory caching and control ﬂow, whereas a

GPU saves more resources on processors that can be run in

parallel [20].

In CUDA programming, CPU and GPU are referred to as

host and device, respectively, while the Kernel is the function

that is launched by the host and executed in parallel by the

device. Each thread represents a sub-process running on GPU

with unique inputs. To have an efﬁcient implementation, the

data transfer frequency between host and device should be

minimized and, in the current framework, such data transfer

occurs only once per DP optimization call.

In the ﬁrst Kernel, each thread represents a combina-

tion of (vs, Teng,s, Tbsg,s )on the 3D grid and calculates

the corresponding ∆vs,∆ts,Pbat,s and c(xs, us)for the

second Kernel. Each thread launched by the second Kernel

represents a combination of the three states (vs, ξs, ts), and

it loops through the control inputs, determines ∆ξbased on

Pbsg from the ﬁrst Kernel, and ﬁnally determines whether to

update the cost-to-go matrix and the optimal control matrix

following [17]. As Eqn. (9c) requires minimization to ﬁnd the

optimal control action, the nested loops for control actions

are kept to avoid race conditions.

Route 19

Route 15

Fig. 7: Route 15 and 19 on OpenStreetMap

V. SIMULATION AND EVAL UATI O N OF RE SULTS

In this paper, two real-world routes representing urban

(Route 15) and mixed-urban (Route 19) driving conditions

have been selected for the simulation and analysis. Both

routes are located in Columbus, OH, USA as shown in Fig. 7

[10]. The mixed-urban route is 7km in length and comprises

5 trafﬁc lights and 3 stop signs, while the urban route is

7.5km in length and includes 22 trafﬁc lights and 3 stop

signs.

For the comparison of results, a baseline controller was

considered for the VD&PT model. This controller consists

of a heuristic Enhanced Driver Model (EDM) [15] calibrated

to control the vehicle velocity in such a way that the ego

vehicle is able to follow a lead vehicle (when detected

within a ﬁxed line of sight) and to stop at trafﬁc lights. The

EDM parameters can represent different driver styles and can

be tuned to match a real-world driver [2]. The production

(rule-based) mHEV energy management controller was also

considered in the VD&PT model for the baseline case.

The 3-state DP-based MPC was then integrated into the

vehicle simulator. As indicated above, a terminal cost approx-

imation was obtained by computing a full route optimization

ofﬂine via DP and storing the value function [10]. Fig. 8

and Fig. 9 show the comparisons of the trajectories from

the baseline and DP-based MPC controller on urban and

highway driving, respectively. With comparable travel time,

the optimized VD&PT controller is able to reduce the fuel

consumption by 15% and 23%, respectively, by optimizing

the energy stored in battery and planning the speed trajectory

in a way that avoids unnecessary braking events. Note that

the serial and parallel implementation are also compared in

Fig. 8 and Fig. 9. As expected, no differences in the results

are present in the two different implementations.

Table I shows the mean, variance and the maximum values

of the solver time obtained by the serial and the parallel

implementations. To perform a comparison of the computa-

Fig. 8: Comparison of VD&PT State Trajectories and Cu-

mulative Fuel Consumption on Urban Driving (Route 15)

TABLE I: Solver Time Comparison

Route 15 Route 19

Serial Parallel Serial Parallel

Mean (ms) 1600 98 1638 101

Variance (ms2)112 11 287 13

Maximum (ms) 2210 118 2312 123

tion throughput, both implementations were compiled in C++

code. The serial and the parallel implementations were then

executed on a desktop PC with 2.9 GHz Intel Core i7 CPU

and a NVIDIA RTX 2080 GPU, respectively. Besides the

more than 90% reduction in averaged solver time, the parallel

implementation has a lower variance and maximum of the

solver time as well. This is because the serial implementation

has multiple break conditions in the nested loop and these

conditions are dependent on the speciﬁc driving scenario

and SPaT sequence encountered during the route. In the

meantime, the parallel implementation goes through all the

possible combinations by parallel threads, and thus is less

subject to the driving conditions. Although both the CPU

and GPU implementations beneﬁt of the improved compu-

tational capabilities than typical rapid prototyping units, the

beneﬁts of utilizing GPU-based parallel computing are very

appealing, especially considering that onboard GPUs are an

indispensable development tool for the control of self-driving

features in autonomous vehicles [21].

Fig. 9: Comparison of VD&PT State Trajectories and Cu-

mulative Fuel Consumption on Mixed Driving (Route 19)

VI. CONCLUSION

In this work, an eco-driving optimization problem was

formulated for a mHEV with V2I communication and lon-

gitudinal automation. The problem was cast as a receding

horizon optimal control problem with three states, namely

vehicle velocity, battery SoC and travel time. A MPC imple-

mentation was then developed and solved using DP. Com-

pared to previous studies where the predictive optimization is

applied to control the vehicle velocity or to perform energy

management of the hybrid powertrain, the work described in

this paper integrates all elements of the eco-driving problem

into a single controller.

To accommodate the increased computational requirement

from inclusion of additional states, a parallel implementation

of DP is developed with CUDA programming on a NVIDIA

GPU. The implementation reduces the computation time by

more than 90% (average, variance, and maximum of the

solver time), compared to the serial counterpart. Simulation

results indicate that the proposed controller decreases the fuel

consumption by more than 15% compared to the baseline

controller, while keeping comparable travel time.

Future work includes the implementation of the algorithm

on rapid-prototyping systems and the study on how to

optimally obtain the terminal cost function.

ACKNOWLEDGMENT

The authors acknowledge the support from the United

States Department of Energy, Advanced Research Projects

Agency – Energy (ARPA-E) NEXTCAR project (Award

Number DE-AR0000794).

REFERENCES

[1] J. Guanetti, Y. Kim, and F. Borrelli, “Control of connected and

automated vehicles: State of the art and future challenges,” Annual

Reviews in Control, vol. 45, pp. 18–40, 2018.

[2] S. Gupta, S. R. Deshpande, D. Tufano, M. Canova, G. Rizzoni,

K. Aggoune, P. Olin, and J. Kirwan, “Estimation of fuel economy on

real-world routes for next-generation connected and automated hybrid

powertrains,” SAE Technical Paper, no. 2020-01-0593, 2020.

[3] L. Guzzella, A. Sciarretta et al.,Vehicle propulsion systems. Springer,

2007, vol. 1.

[4] A. Sciarretta, G. De Nunzio, and L. L. Ojeda, “Optimal ecodriving

control: Energy-efﬁcient driving of road vehicles as an optimal control

problem,” IEEE Control Systems Magazine, vol. 35, no. 5, pp. 71–90,

2015.

[5] E. Ozatay, S. Onori, J. Wollaeger, U. Ozguner, G. Rizzoni, D. Filev,

J. Michelini, and S. Di Cairano, “Cloud-based velocity proﬁle op-

timization for everyday driving: A dynamic-programming-based so-

lution,” IEEE Transactions on Intelligent Transportation Systems,

vol. 15, no. 6, pp. 2491–2505, 2014.

[6] Q. Jin, G. Wu, K. Boriboonsomsin, and M. J. Barth, “Power-based

optimal longitudinal control for a connected eco-driving system,” IEEE

Transactions on Intelligent Transportation Systems, vol. 17, no. 10, pp.

2900–2910, 2016.

[7] J. Han, A. Vahidi, and A. Sciarretta, “Fundamentals of energy efﬁcient

driving for combustion engine and electric vehicles: An optimal

control perspective,” Automatica, vol. 103, pp. 558–572, 2019.

[8] C. Sun, J. Guanetti, F. Borrelli, and S. Moura, “Optimal eco-driving

control of connected and autonomous vehicles through signalized

intersections,” IEEE Internet of Things Journal, 2020.

[9] L. Guo, B. Gao, Y. Gao, and H. Chen, “Optimal energy manage-

ment for hevs in eco-driving applications using bi-level mpc,” IEEE

Transactions on Intelligent Transportation Systems, vol. 18, no. 8, pp.

2153–2162, 2016.

[10] P. Olin, K. Aggoune, L. Tang, K. Confer, J. Kirwan, S. R. Deshpande,

S. Gupta, P. Tulpule, M. Canova, and G. Rizzoni, “Reducing fuel con-

sumption by using information from connected and automated vehicle

modules to optimize propulsion system control,” SAE Technical Paper,

no. 2019-01-1213, 2019.

[11] S. Bae, Y. Choi, Y. Kim, J. Guanetti, F. Borrelli, and S. Moura, “Real-

time ecological velocity planning for plug-in hybrid vehicles with

partial communication to trafﬁc lights,” in 2019 IEEE 58th Conference

on Decision and Control (CDC). IEEE, 2019, pp. 1279–1285.

[12] B. Asadi and A. Vahidi, “Predictive cruise control: Utilizing upcoming

trafﬁc signal information for improving fuel economy and reducing

trip time,” IEEE Transactions on Control Systems Technology, vol. 19,

no. 3, pp. 707–714, 2011.

[13] D. P. Bertsekas, Dynamic programming and optimal control. Athena

scientiﬁc Belmont, MA, 2005, vol. 1, no. 3.

[14] B. Saerens, “Optimal control based eco-driving-theoretical approach

and practical applications,” Ph. D. Thesis, Katholieke Universiteit

Leuven, 2012.

[15] S. Gupta, S. R. Deshpande, P. Tulpule, M. Canova, and G. Rizzoni,

“An enhanced driver model for evaluating fuel economy on real-world

routes,” IFAC-PapersOnLine, vol. 52, no. 5, pp. 574–579, 2019.

[16] Y. Jiang and X. Li, “Travel time prediction based on historical

trajectory data,” Annals of GIS, vol. 19, no. 1, pp. 27–35, 2013.

[17] O. Sundstrom and L. Guzzella, “A generic dynamic programming mat-

lab function,” in 2009 IEEE control applications,(CCA) & intelligent

control,(ISIC). IEEE, 2009, pp. 1625–1630.

[18] R. S. Sutton and A. G. Barto, Reinforcement learning: An introduction.

MIT press, 2018.

[19] Z. Zhu, Y. Liu, and M. Canova, “Energy management of hybrid eletric

vehicles via deep q networks,” in 2020 American Control Conference

(ACC). IEEE, 2020.

[20] “CUDA C++ programming guide,” https://docs.nvidia.com/cuda/

cuda-c- programming-guide/index.html, Aug. 2020.

[21] W. Perez, A. Ruhela, and P. Tulpule, “Benchmarking computational

time of dynamic programming for autonomous vehicle powertrain

control,” SAE Technical Paper, no. 2020-01-0968, 2020.

On-Road Trajectory Planning of Connected and Automated Vehicles in Complex Traffic Settings: A Hierarchical Framework of Trajectory Refinement

Article

Full-text available

Jan 2024

This paper presents a hierarchical framework for on-road trajectory planning in complex traffic environments. Firstly, the processing of sparse coarse trajectories involves the utilization of DP (Dynamic Programming) generation and interpolation techniques. Then, for the waypoints with collision risk in the smoothed trajectory, the spiral search method is used to find some safe alternate waypoints. The alternate waypoints and the previous ones without collision risk form the amended trajectory. Concurrently, safety tunnels are constructed along the amended trajectory for the ego vehicle. Furthermore, with the constraint conditions of vehicle kinematics model and safety tunnels, nonlinear program (NLP) optimization is carried out for the amended trajectory of ego vehicle to obtain smooth and safe trajectories. For typical cases, simulation experiments demonstrate that the ego vehicle can ensure collision safety in dynamic traffic scenarios, while maintaining smooth vehicle velocity and small jitter of the front wheel angle. The proposed trajectory planning framework provides a novel decision-making method for connected and automated vehicles (CAVs).

Eco-Driving Control of Connected and Automated Vehicles Using Neural Network–Based Rollout

Article

Full-text available

Nov 2023

Connected and autonomous vehicles have the potential to minimize energy consumption by optimizing the vehicle velocity and powertrain dynamics with Vehicle-to-Everything info en route. Existing deterministic and stochastic methods created to solve the eco-driving problem generally suffer from high computational and memory requirements, which makes online implementation challenging. This work proposes a hierarchical multi-horizon optimization framework implemented via a neural network. The neural network learns a full-route value function to account for the variability in route information and is then used to approximate the terminal cost in a receding horizon optimization. Simulations over real-world routes demonstrate that the proposed approach achieves comparable performance to a stochastic optimization solution obtained via reinforcement learning, while requiring no sophisticated training paradigm and negligible on-board memory.

Improving Computational Efficiency for Energy Management Systems in Plug-in Hybrid Electric Vehicles Using Dynamic Programming based Controllers

Conference Paper

Full-text available

Aug 2023

Reducing computational time has become a critical issue in recent years, particularly in the transportation field, where the complexity of scenarios demands lightweight controllers to run large simulations and gather results to study different behaviors. This study proposes two novel formulations of the Optimal Control Problem (OCP) for the Energy Management System of a Plug-in Hybrid Electric Vehicle (PHEV) and compares their performance with a benchmark found in the literature. Dynamic Programming was chosen as the optimization algorithm to solve the OCP in a Matlab environment, using the DynaProg toolbox. The objective is to address the optimality of the fuel economy solution and computational time. In order to improve the computational efficiency of the algorithm, an existing formulation from the literature was modified, which originally utilized three control inputs. The approach involves leveraging the unique equations that describe the Input-Split Hybrid powertrain, resulting in a reduction of control inputs firstly to two and finally to one in the proposed solutions. The aforementioned formulations are referred to as 2-Controls and a 1-Control. Virtual tests were conducted to evaluate the performance of the two formulations. The simulations were carried out in various scenarios, including urban and highway driving, to ensure the versatility of the controllers. The results demonstrate that both proposed formulations achieve a reduction in computational time compared to the benchmark. The 2-Controls formulation achieved a reduction in computational time of approximately 40 times, while the 1-Control formulation achieved a remarkable reduction of approximately 850 times. These reductions in computational time were achieved while obtaining a maximum difference in fuel economy of approximately 1.5% for the 1-Control formulation with respect to the benchmark solution. Overall, this study provides valuable insights into the development of efficient and optimal controllers for PHEVs, which can be applied to various transportation scenarios. The proposed formulations reduce computational time without sacrificing the optimality of the fuel economy solution, making them a promising approach for future research in this area.

Maximizing Efficiency in Smart Adjustable DC Link Powertrains with IGBTs and SiC MOSFETs via Optimized DC-Link Voltage Control

Article

Full-text available

May 2023

In recent years, the push towards electrifying transportation has gained significant traction, with battery-electric vehicles (BEVs) emerging as a viable alternative. However, the widespread adoption of BEVs faces multiple challenges, such as limited driving range, making powertrain efficiency improvements crucial. One approach to improve powertrain energy efficiency is to adjust the DC-link voltage using a DC-DC converter between the battery and inverter. Here, it is necessary to address the losses introduced by the DC-DC converter. This paper presents a dynamic programming approach to optimize the DC-link voltage, taking into account the battery terminal voltage variation and its impact on the overall powertrain losses. We also examine the energy efficiency gains of IGBT-based and silicon carbide (SiC) MOSFET-based adjustable DC-link voltage powertrains during WLTC driving cycles through PLECS and Matlab/Simulink simulations. The findings indicate that both IGBT and MOSFET-based adjustable DC-link voltage powertrains can enhance the WLTC drive-cycle efficiency up to 2.51% and 3.25% compared to conventional IGBT and MOSFET-based powertrains, respectively.

Autonomous Electric Vehicle Route Optimization Considering Regenerative Braking Dynamic Low-Speed Boundary

Article

Full-text available

May 2023

Finding the optimal speed profile of an autonomous electric vehicle (AEV) for a given route (eco-driving) can lead to a reduction in energy consumption. This energy reduction is even more noticeable when the regenerative braking (RB) capability of AEVs is carefully considered in obtaining the speed profile. In this paper, a new approach for calculating the optimum eco-driving profile of an AEV is formulated using mixed-integer linear programming (MILP) while carefully integrating the RB capability and its limitations in the process of obtaining a driving profile with minimum energy consumption. One of the most important limitations of RB which has been neglected in previous studies is operation below the low-speed boundary (LSB) of electric motors, which impairs the energy extraction capability of RB. The novelty of this work is finding the optimal speed profile given this limitation, leading to a much more realistic eco-driving profile. Python is used to code the MILP problem, and CPLEX is employed as the solver. To verify the results, the eco-driving problem is applied to two scenarios to show the significance of considering a dynamic LSB. It is shown that for the route under study, up to 27% more energy can be harvested by employing the proposed approach.

Predicting Lead Vehicle Velocity for Eco-Driving in the Absence of V2V Information

Conference Paper

Full-text available

Apr 2023

div class="section abstract"> Accurately predicting the future behavior of the surrounding traffic, especially the velocity of the lead vehicle is important for optimizing the energy consumption and improve the safety of Connected and Automated Vehicles (CAVs). Several studies report methods to predict short-to-mid-length lead vehicle velocity using stochastic models or other data-driven techniques, which require availability of extensive data and/or Vehicle-to-Vehicle (V2V) communication. In the absence of connectivity, or in data-restricted cases, the prediction must rely only on the measured position and relative velocity of the lead vehicle at the current time. This paper proposes two velocity predictors to predict short-to-mid-length lead vehicle velocity. The first predictor is based on a Constant Acceleration (CA) with an augmented stop mode. The second one is based on a modified Enhanced Driver Model (EDM-LOS) with line-of-sight feature. Both predictors rely only on information on the present values of lead vehicle position and velocity to compute a future velocity estimate. An analysis is done to compare the prediction accuracy of the proposed predictors with different experimental driving data recorded using an OBD2 scanner plugged into a passenger vehicle. Finally, the predicted lead vehicle velocity is utilized to formulate time-gap constraints for the eco-driving optimal control problem, solved via Model Predictive Control (MPC). The energy savings of the considered velocity predictors are evaluated by performing a large-scale simulation study. The proposed velocity predictor provides closest energy savings to a wait-and-see solution for a CAV in absence of V2V communication. </div

Scalable FPGA Implementation of Dynamic Programming for Optimal Control of Hybrid Electrical Vehicles

Chapter

Jun 2024

Control of a Hybrid Electric Vehicle using Control Vector Parameterization and Reinforcement Learning

Conference Paper

Dec 2023

A Deep Reinforcement Learning Framework for Eco-Driving in Connected and Automated Hybrid Electric Vehicles

Article

Jan 2023

Connected and Automated Vehicles (CAVs), in particular those with multiple power sources, have the potential to significantly reduce fuel consumption and travel time in real world driving conditions. In particular, the eco-driving problem seeks to design optimal speed and power usage profiles based upon look-ahead information from connectivity and advanced mapping features, to minimize the fuel consumption over a given itinerary. In this work, the eco-driving problem is formulated as a Partially Observable Markov Decision Process (POMDP), which is then solved with a state-of-art Deep Reinforcement Learning (DRL) Actor Critic algorithm, Proximal Policy Optimization. An eco-driving simulation environment is developed for training and evaluation purposes. To benchmark the performance of the DRL controller, a baseline controller representing the human driver, a trajectory optimization algorithm and the wait-and-see deterministic optimal solution are presented. With a minimal onboard computational requirement and a comparable travel time, the DRL controller reduces the fuel consumption by more than 17% compared against the baseline controller by modulating the vehicle velocity over the route and performing energy efficient approach and departure at signalized intersections, over-performing the more computationally demanding trajectory optimization method.

A Bi-Level Optimization Approach for Eco-Driving of Heavy-Duty Vehicles

Conference Paper

Aug 2023

div class="section abstract"> With the increase of heavy-duty transportation, more fuel efficient technologies and services have become of great importance due to their environmental and economical impacts for the fleet managers. In this paper, we first develop a new analytical model of the heavy-truck for its dynamics and its fuel consumption, and valid the model with experimental measurements. Then, we propose a bi-level optimization approach to reduce the fuel consumption, thus the CO <sub>2</sub> emissions, while ensuring several safety constraints in real-time. Numerical results show that important reduction of the fuel consumption can be achieved, while satisfying imposed safety constraints. </div

Energy Management of Hybrid Electric Vehicles via Deep Q-Networks

Conference Paper

Full-text available

Jul 2020

Reducing Fuel Consumption by Using Information from Connected and Automated Vehicle Modules to Optimize Propulsion System Control

Conference Paper

Full-text available

Apr 2019

Optimal Energy Management for HEVs in Eco-Driving Applications Using Bi-Level MPC

Article

Full-text available

Dec 2016

Wide usage of vehicle's onboard navigation system offers vehicles better terms to improve energy efficiency. In this paper, a computationally effective energy management strategy using model predictive control (MPC) is proposed to find the energy optimal torque split, gear shift, and velocity control of a parallel hybrid electric vehicle (HEV). We consider the vehicles in urban driving, where the vehicle trajectory is constrained by the infrastructure (road signs) and other vehicles (traffic). Restricted by the discrete gear ratio, nonlinear dynamics of the vehicles, and especially different time scales between velocity trajectory and torque split optimization, finding these control variables in one optimal problem is quite challenging. Thus, this paper uses bi-level methodology to reduce computational time and simplify the hybrid optimal problem by decoupling its components into two subproblems. In the outer loop, the optimal velocity trajectory is obtained by solving a nonlinear time-varying optimal problem using a Krylov subspace method to improve computational efficiency. In the second subproblem, we provide an explicit solution of the optimal torque split ratio and gear shift schedule by combining Pontryagin's minimum principle and numerical methods in the framework of MPC. Simulation results on an AMESim model of an HEV with seven-speed automated manual transmission over multiple driving cycles are presented. The results indicate that both energy efficiency and computational speed are improved.

Benchmarking Computational Time of Dynamic Programming for Autonomous Vehicle Powertrain Control

Conference Paper

Apr 2020

Estimation of Fuel Economy on Real-World Routes for Next-Generation Connected and Automated Hybrid Powertrains

Conference Paper

Apr 2020

Real-time Ecological Velocity Planning for Plug-in Hybrid Vehicles with Partial Communication to Traffic Lights

Conference Paper

Dec 2019

Optimal Eco-Driving Control of Connected and Autonomous Vehicles Through Signalized Intersections

Article

Jan 2020

This paper focuses on the speed planning problem for connected and automated vehicles (CAVs) communicating to traffic lights. The uncertainty of traffic signal timing for signalized intersections on the road is considered. The eco-driving problem is formulated as a data-driven chance constrained robust optimization problem. Effective red light duration (ERD) is defined as a random variable, and describes the feasible passing time through the signalized intersections. Usually, the true probability distribution for ERD is unknown. Consequently, a data-driven approach is adopted to formulate chance constraints based on empirical sample data. This incorporates robustness into the eco-driving control problem with respect to uncertain signal timing. Dynamic programming (DP) is employed to solve the optimization problem. Simulation results demonstrate that the proposed method can generate optimal speed reference trajectories with 40% less vehicle fuel consumption, while maintaining the arrival time at a similar level compared to a modified intelligent driver model (IDM). The proposed control approach significantly improves the controller’s robustness in the face of uncertain signal timing, without requiring to know the distribution of the random variable a priori.

An Enhanced Driver Model for Evaluating Fuel Economy on Real-World Routes

Article

Jan 2019

Assessing vehicle fuel economy in real-world driving conditions is a critical requirement to establish a reliable baseline when evaluating driver assistance systems or autonomous vehicles, where the speed profile can be optimized based on route information. Since the benchmarking is traditionally done by collecting and analyzing large amounts of data over on-road testing, virtual driver models have been developed to conduct simulation studies that allow one to understand the impact of specific driver behaviors on the vehicle speed profile. This paper presents an enhanced driver model that predicts a longitudinal vehicle speed profile based on route data, which can be calibrated with simple tests. The model extends the Intelligent Driver Model to more accurately characterize the response to stop signs, traffic lights, and other conditions typical of urban driving. The enhanced driver model can be calibrated to match the behavior of specific drivers and determine statistically-relevant distributions of model parameters.

Fundamentals of energy efficient driving for combustion engine and electric vehicles: An optimal control perspective

Article

May 2019
AUTOMATICA

This paper formulates energy efficient driving of gasoline and electric powered vehicles as optimal control problems of various complexity. We show minimizing aerodynamic drag can maximize utilization of energy available at the wheel and requires low and constant speeds. By employing optimal control theory we show periods of maximal acceleration, maximal braking, and coasting often accompany constant speed cruising to satisfy boundary conditions on the states (bang-singular-bang optimal control). In the case of gasoline engine vehicles, analytical optimal control derivations show that pulse and glide operation of the engine while cruising can further reduce fuel use (chattering optimal control). For electric vehicles (EV), quadratic rather than linear dependence of energy use on control input results in different eco-driving patterns from gasoline engine vehicles. For EVs, analytical solution to the two point boundary value optimal control problem could be obtained after model simplification which is compared to numerical solution based on a more accurate model. We also evaluate optimal control solution in the presence of state constraints for EVs. Several simulation case studies are presented to showcase the energy efficiency gains with proposed eco driving strategies.

Control of Connected and Automated Vehicles: State of the Art and Future Challenges

Article

Apr 2018
ANNU REV CONTROL

Autonomous driving technology pledges safety, convenience, and energy efficiency. Challenges include the unknown intentions of other road users: communication between vehicles and with the road infrastructure is a possible approach to enhance awareness and enable cooperation. Connected and automated vehicles (CAVs) have the potential to disrupt mobility, extending what is possible with driving automation and connectivity alone. Applications include real-time control and planning with increased awareness, routing with micro-scale traffic information, coordinated platooning using traffic signals information, eco-mobility on demand with guaranteed parking. This paper introduces a control and planning architecture for CAVs, and surveys the state of the art on each functional block therein; the main focus is on techniques to improve energy efficiency. We provide an overview of existing algorithms and their mutual interactions, we present promising optimization-based approaches to CAVs control and identify future challenges.

A GPU Implementation of a Look-Ahead Optimal Controller for Eco-Driving Based on Dynamic Programming

Figures

Recommended publications

A Solution to the Optimal Tracking Problem for Linear Systems

Vectorization and multitasking of dynamic programming in control: Experiments on a CRAY-2

Model-free adaptive dynamic programming algorithms for H-infinity control of complex linear systems

Data-driven iterative adaptive dynamic programming algorithm for approximate optimal control of unkn...