ArticlePDF Available

Short-Term Highway Traffic State Prediction Using Structural State Space Models

June 2014
Journal of Intelligent Transportation Systems Technology Planning and Operations 18(3)

June 2014
18(3)

DOI:10.1080/15472450.2013.836929

Authors:

Chung-Cheng Lu

National Yang Ming Chiao Tung University

Xuesong Simon Zhou

Arizona State University

This research proposes a short-term highway traffic state prediction method based on a structural state space model, with the intention to provide a robust approach for obtaining accurate forecasts of traffic state under both recurring and non-recurring conditions. True traffic state is decomposed to three components, namely, regular traffic pattern, structural deviation, and random fluctuation. Particularly, the structural deviation term reflects the change of true traffic state from regular (historical) pattern, due to unexpected capacity reduction and/or demand variations. A polynomial trend is adopted to describe the temporal evolution of structural deviations across different time intervals. We derive an analytical form of structural deviations in a single bottleneck case based on cumulative flow count diagrams. The proposed model is incorporated in a Kalman filtering-based algorithmic framework, together with an adaptive scheme for determining the variances of random errors. A set of numerical experiments was conducted on two test beds in the northern Taiwan highway network. Experimental results show that the proposed approach is particularly superior to an ordinary Kalman filtering method presented in the literature under non-recurring conditions, highlighting the advantage of combining both the polynomial trend model and historical pattern into the proposed short-term traffic state prediction approach.

MAPEs (%) of the prediction results on test bed #1 (Highway No.5)

…

MAPEs of the prediction results on test bed #2 (Highway No.1)

…

Prediction performances for three selected incidents on Highway No.5 in June, 2011

…

Figures - uploaded by Xuesong Simon Zhou

Content may be subject to copyright.

Content uploaded by Xuesong Simon Zhou

Content may be subject to copyright.

Short-term Highway Traffic State Prediction Using Structural

State Space Models

Chung-Cheng Lu, Ph.D.

(Corresponding author)

Institute of Information and Logistics Management, National Taipei Univ. of Technology

1 Section 3 Chung-Hsiao East Road, Taipei, 10608, Taiwan

Tel: +886-2-27712171 ext 2306; E-Mail: jasoncclu@gmail.com

Xuesong Zhou, Ph.D.

School of Sustainable Engineering and Built Environment, Arizona State University

Tempe, Arizona, 85287, USA. E-Mail: xzhou@asu.edu

Abstract

This research proposes a short-term highway traffic state prediction method based on a

structural state space model, with the intention to provide a robust approach for obtaining

accurate forecasts of traffic state under both recurring and non-recurring conditions. True

traffic state is decomposed to three components, namely, regular traffic pattern, structural

deviation and random fluctuation. Particularly, the structural deviation term reflects the

change of true traffic state from regular (historical) pattern, due to unexpected capacity

reduction and/or demand variations. A polynomial trend is adopted to describe the temporal

evolution of structural deviations across different time intervals. We derive an analytical form

of structural deviations in a single bottleneck case based on cumulative flow count diagrams.

The proposed model is incorporated in a Kalman filtering-based algorithmic framework,

together with an adaptive scheme for determining the variances of random errors. A set of

numerical experiments were conducted on two test beds in the northern Taiwan highway

network. Experimental results show that the proposed approach is particularly superior to an

ordinary Kalman filtering method presented in the literature under non-recurring conditions,

highlighting the advantage of combining both the polynomial trend model and historical

pattern into the proposed short-term traffic state prediction approach.

Keywords: traffic state estimation and prediction; structural state space model; Kalman filter.

1. Introduction

Recent advances in route guidance systems (e.g., Mahmassani and Peeta, 1995;

Ben-Akiva, 1997; Bottom, 2000; Mahmassani et al., 2005), coordinated ramp metering (e.g.,

Hegyi et al., 2005; Papamichail et al., 2010), and state-dependent congestion and value

pricing (e.g., Dong et al., 2011; Lou et al., 2011) have exemplified that traffic control

measures and travel demand management strategies based on anticipatory traffic state

information produce better system performances than those based on prevailing traffic state

measurements. These research findings underscore the importance of traffic state estimation

and prediction to Advanced Traveler Information Systems (ATIS) and Advanced Traffic

Management Systems (ATMS) functionalities.

Existing traffic state estimation and prediction approaches can be classified into three

categories: (i) approaches based on statistical methods or other techniques for predicting link

traffic states or path travel times, (ii) approaches based on macroscopic models for estimating

traffic flows on freeway corridors, (iii) approaches based on dynamic traffic assignment for

estimating and predicting network-wide origin-destination (OD) trip demands and route

choices so as to obtain network flow patterns on links with and without observations (e.g.,

Zhou and Mahmassani, 2007). This research focuses on the first category approaches, in

particular, for predicting short-term highway link traffic states with point measurements. A

rich body of the literature has been devoted to traffic state prediction approaches in the past

decades, most of which focused on travel time prediction using, for instance, linear regression

(e.g., Zhang and Rice, 2003; van Hinsbergen and van Lint, 2008), Kalman filtering (e.g.,

Chen and Chien, 2001; Nanthawichit et al., 2003; Xia et al., 2011; Anand et al., 2013) neural

networks (e.g., Rilett and Park, 2001; van Lint et al., 2002; Abdulhai et al., 2002), Bayesian

forecasting (e.g., van Hinsbergen et al., 2009; Fei et al., 2011; Lu, 2012), support vector

regression (e.g., Lam and Toan, 2008; Chandra and Al-Deek, 2009), and Fuzzy methods (e.g.,

Zhang and Ye, 2008).

This research proposes a Kalman filtering-based algorithm for traffic state (e.g., link

density, speed, flow, or travel time) estimation and prediction on freeway corridors. Kalman

filter is a state estimation and prediction process applied to a dynamic system that involves

time-dependent system state transition and random perturbations. In this process, real-time

observations are obtained during each prediction period, and then processed through a

measurement equation to achieve a posteriori estimation of each period’s state variables. The

dynamic characteristics of state variables from one period to another are captured by a

transition equation, relating state variables across different periods. If the dynamic system

and measurement noise distributions are Gaussian, Kalman filter is an optimal filter with

minimum mean square error. This approach has been shown to be especially suitable for

real-time system applications (e.g., Chui and Chen, 1999). It has the capability of processing

real-time observations sequentially so that the computational burden and memory

requirements for large systems can be dramatically decreased. For instance, Chen and Chien

(2001) and Chien and Kuchipudi (2003) developed travel time prediction models using

Kalman filtering algorithm with simulated synthetic probe data. Nanthawichit et al. (2003)

used a macroscopic traffic flow model along with Kalman filtering technique to predict travel

time with synthetic detector data and probe vehicle data. Yang et al. (2004) presented a

Kalman filter method for online traffic speed prediction. Chu et al. (2005) developed an

adaptive Kalman filtering-based travel time prediction method that fuses both point detector

data and probe vehicle data. van Lint (2008) proposed censored extended Kalman filter (EKF)

and delayed EKF algorithms for online travel time prediction. Wang et al. (2005, 2006) also

developed a real-time freeway traffic estimation approach using an EKF algorithm.

The aforementioned literature demonstrates a wide spectrum of Kalman filtering

applications and constitutes the state-of-the-art techniques for traffic state or travel time

estimation and prediction. Nevertheless, most of them focused on predicting traffic states or

travel times under recurring conditions (in which traffic flow follows its regular daily pattern),

and may require special (or ad-hoc) treatments under non-recurring conditions (in which

traffic flow is impacted by irregular or unforeseen events, such as incidents, accidents, or bad

weather). Under recurring conditions, traffic states can be estimated and predicted accurately

using the existing methods with sensor measurements. However, under non-recurring traffic

conditions, because traffic states dramatically deviates from regular traffic patterns, traffic

state estimates and forecasts obtained by the existing methods may be significantly biased. A

robust highway traffic state estimation and prediction method should provide a unified

framework for obtaining accurate forecasts under both recurring and non-recurring traffic

conditions.

With a particular emphasis on highway traffic state estimation and prediction under both

recurring and non-recurring conditions, our work adapts a structural state space model (e.g.,

Chui and Chen, 1999) that decomposes true traffic state into three components, namely

regular traffic pattern, structural deviation and random fluctuation. Specifically, regular traffic

pattern is estimated using historical data (e.g., mean or median of historical data), and

structural deviation reflects the difference between true traffic state and regular traffic pattern,

due to unexpected capacity reduction and/or demand fluctuations (e.g., Lan et al., 2008). In

this modeling framework, structural deviation vanishes under recurring conditions, while

forecasting actual traffic states under non-recurring conditions focuses mainly on the

prediction of structural deviations based on real-time sensor measurements of traffic states. In

addition, random fluctuation (or error) is included to account for the effect of other

unobserved factors and the inherent stochastic nature of time-varying traffic states.

The structural deviation term captures the influence of combined external factors that

cause true traffic state to change from average pattern. This research adapts a polynomial

trend model to describe the temporal evolution of structural deviations across different time

intervals. Very few previous studies incorporated a similar term in modeling the impact of

various exogenous factors on regular traffic pattern. For instance, Zhou and Mahmassani

(2007) developed a structural state-space model for estimating and predicting dynamic OD

travel demands, and detrended structural deviations from the regular pattern of dynamic OD

travel demands represented by historical averages. Moreover, an analytical derivation of the

structural deviation term in a single bottleneck case is also proposed in this paper, based on

cumulative flow count diagrams.

The idea of the structural state space model with a linear (or polynomial) trend to

describe the structural deviations is illustrated in Fig. 1. In the figure, solid lines represent

true traffic state; dotted lines represent estimated historical pattern (i.e., regular pattern); red

arrows denote predicted pattern. The plot on the left hand side depicts a case of recurring

conditions in which the realized true state is similar to the regular pattern, while the plot on

the right hand side presents a case of non-recurring conditions in which the true state is

significantly different from the regular pattern. In the former case, the historical pattern and

linear trend model successfully captures the true state series and obtains accurate predictions.

On the other hand, in the latter case, the structural deviations between the true state and

regular pattern are significant due to an unexpected event. Our approach utilizes the

polynomial filter to absorb possible structural deviations, aiming to produce robust prediction

results under both recurring and non-recurring conditions.

[Insert Fig. 1 about here]

The next section describes the proposed structural state space model for traffic state

estimation and prediction, and the polynomial trend model based on a multiple-order Taylor’s

series which approximates a smooth function of structural deviation. Section 3 presents the

Kalman filtering-based solution algorithm, which integrates short-term traffic state estimation

and prediction equations under recurring and non-recurring conditions. An adaptive scheme

for updating measurement and system errors variances is also illustrated. Section 4 reports a

set of numerical experiments conducted on two test beds in northern Taiwan. Concluding

remarks are given in Section 5.

2. Structural state space model for traffic state estimation and prediction

Notation

a index for links with traffic measurements, a = 1,…, N

t index for prediction time intervals, t = 1,…, T



length of a prediction time interval



current time stamp

s, p, m order indices in a polynomial trend model

 traffic state on link a at time interval t

Kt vector of traffic states at time interval t



 a priori estimate (i.e., regular pattern) of traffic state based on historical data on link a

at time interval t, where the superscript h stands for historical data

 vector of a priori state estimates based on historical data at time interval t

 structural deviation from the a priori estimate 

 on link a at time interval t



󰆒, 

󰆒′, 

󰆒′′ the first-, second-, and third-order derivatives of structural deviation ,

respectively



, 

, 

 the pth-, mth-, and sth-order of derivatives of structural deviation ,

respectively

Zt vector of structural deviations from the a priori estimate  at time interval t

 observation on link a at time interval t

Yt vector of observations at time interval t



 system process error of traffic state  on link a at time interval t



 vector of system process errors at time interval t

 measurement error on link a at time interval t

 vector of measurement errors at time interval t

 state transition matrix from time interval t1 to time interval t

H measurement matrix

Qt vector of process error variances at time interval t

Rt vector of measurement error variances at time interval t

 vector of a priori traffic state predictions at time interval t

 vector of posteriori traffic state estimates at time interval t

 a priori prediction of state variance-covariance matrix at time interval t

 posteriori estimate of state variance-covariance matrix at time interval t

 Kalman gain matrix at interval t for state estimation under recurring conditions

 vector of a priori structural deviation predictions at time interval t

 vector of posteriori structural deviation estimates at time interval t



 a priori prediction of structural deviation variance of link a at time interval t

 a priori prediction of structural deviation variance matrix at time interval t



 posteriori estimate of structural deviation variance of link a at time interval t

 posteriori estimate of structural deviation variance matrix at time interval t

 Kalman gain matrix at interval t for state estimation under non-recurring conditions

 system process error (or noise) on link a at time interval t



 process error for pth-order derivative of structural deviation

Wt vector of system process errors at time interval t under non-recurring conditions

Vt vector of measurement errors at time interval t under non-recurring conditions

 vector of process error variances at time interval t under non-recurring conditions

 vector of measurement error variances at interval t under non-recurring conditions

var( ) the variance function

cov( ) the covariance function

2.1 Structural state space model

This research aims to estimate and predict traffic states, 󰇝󰇞, for a

near future (short-term) time interval t on a set of N highway links with real-time

observations available from traffic surveillance systems. As mentioned previously, this work

considers that true traffic state is composed of three components:

True traffic state = regular traffic pattern + structural deviations + random fluctuations.

Realistically, only the a priori estimates of the regular pattern, 󰇝

󰇞 based

on historical data, are available before forecasting real-time traffic states. These a priori

estimates, such as means or medians of historical data, are used to reflect regular traffic

pattern. Thus, the true traffic state on link a at time interval t, , is modeled as a linear

combination of the a priori estimate, structural deviation, and random disturbance, as shown

in Eq.(1).







. (1)

The proposed structural state space model consists of the following two equations.

State transition equation:





, where



t ~ N(0, Qt). (2)

Measurement equation:

, where



t ~ N(0, Rt). (3)

The state transition equation describes the temporal change in traffic state from time interval

t1 to time interval t, while the measurement equation models the relationship between the

state predictions, Kt, and real-world measurements at time t, Yt. In Eqs.(2) and (3),



t and



are white noise terms uncorrelated with initial state K0 and with each other.

In this modeling framework, for traffic state prediction under recurring conditions, the

structural deviation, , is considered as zero, and the regular pattern, 

, can be

constructed based on historical data under recurring traffic conditions. On the other hand, for

traffic state prediction under non-recurring conditions, a polynomial trend model can be

adopted to provide a close approximation for the short-term prediction of the structural

deviation at time interval t.

2.2 Traffic state estimation and prediction under recurring conditions

In the proposed approach, under recurring conditions, the a priori prediction of traffic

states at time t, , is determined as:

, (4)

and the a priori prediction of variances, , is determined as:

󰇛󰇜. (5)

Then, the posteriori traffic state estimates, , is updated through a linear function of the a

priori prediction, , and a weighted difference, .

󰇛󰇜. (6)

By assuming that the measurement error covariance Rt is uncorrelated with Gt and Yt, a

general formulation for the variance-covariance matrix of the posteriori estimate error can be

derived as follows:

󰇛󰇜󰇛󰇜. (7)

where I is an identity matrix with appropriate dimensions and Tr denotes the transpose of a

matrix. In Eqs.(6) and (7), Gt is known as the Kalman gain factor used to obtain the posteriori

estimate  and its covariance . The optimal form of Gt can be derived as:

󰇛󰇜. (8)

Under the optimal formulation of the Kalman gain matrix, a simplified expression of the

estimation error covariance is derived as follows:

󰇛󰇜. (9)

2.3 Traffic state estimation and prediction under non-recurring conditions

Under non-recurring conditions, structural deviations, Zt, may result from various

demand and/or capacity changes. In the proposed structural state space model, the transition

equation of structural deviations is as follows:

, where Wt ~ N(0, ). (10)

The measurement equation is as follows:

, where Vt ~ N(0, ). (11)

The prediction equations for the (a priori) structural deviation vector, , and its

variance-covariance matrix, , are presented in Eqs.(12) and (13), respectively.



, (12)





 

. (13)

Accordingly, the predicted traffic state vector and its variance-covariance matrix can be

obtained as follows:



. (14)





 

. (15)

The estimation equations for the (a posteriori) non-recurring structural deviation, , and its

variance-covariance matrix, , are given in Eqs.(16) and (17), respectively.

󰇛󰇜. (16)

󰇛󰇜. (17)

where

󰇛󰇜. (18)

2.4 Polynomial trend model

Let



be the current time. Consider the state prediction on link a under non-recurring

conditions, an m-order Taylor’s series can be used to approximate a smooth function of

structural deviation, z(a,





), expanded about the point z(a,



); that is,



󰆒



󰆒󰆒 







. (19)

The above representation assumes that structural deviation at time (





) can be captured

locally by an m-order polynomial function as Eq.(19) near time



for a small value of



, while

derivatives of higher orders are assumed to be zero. A general form for the pth-order

derivative of this polynomial trend model is given as follows (Chui and Chen, 1999):





󰇛󰇜



 . (20)

Based on the above representation of structural deviation, the temporal change of derivative



 from time



to (





) can be described as follows:





󰇛󰇜



 



󰇛

󰇜. (21)

As an example, the structural deviation transition equation based on the polynomial trend

model with order m = 3 for link a is given in Eq.(22), where the state vector consists of the

zeroth to third-order derivatives of structural deviation from the a priori regular pattern.









′



′′



′′′







   

  

 









′



′′



′′′

















′



′′



′′′







. (22)

Specifically, the transition equations with zeroth-order (p = 0), first-order (p =1), second-order

(p =2), and third-order (p =3) derivatives can be expanded as Eqs.(23)-(26), respectively.



′



′′ 



′′′ . (23)



󰆒

′

′′ 



′′′ 

′. (24)



󰆒󰆒 

′′ 

′′′ 

′′ . (25)



󰆒󰆒󰆒 

′′′ 

′′′ . (26)

Eq.(22) can be extended to include the derivatives of the structure deviations of the N

(independent) links with real-time measurements, and hence the complete transition equation

of structural deviations in Eq.(10) can be obtained by defining the state vector as:

󰇛

󰆒

󰆒󰆒

󰆒󰆒󰆒

󰆒

󰆒󰆒 

󰆒󰆒󰆒󰇜, (27)

the state transition matrix as:

󰇛󰇜, (28)

where    

  

 , for a = 1,…, N. (29)

and the evolution noise vector as:

󰇛

󰆒

󰆒󰆒

󰆒󰆒󰆒

󰆒

󰆒󰆒 

󰆒󰆒󰆒󰇜. (30)

Although a sophisticated higher-order polynomial describes structural deviations in

greater detail than a lower-order model, it might result in potential large prediction errors. As

shown in the following equation, prediction errors of the proposed model depends on the

order m of the model and the length of prediction interval,



  󰇟󰇛󰇜

 󰇡′󰇢

′󰇛

󰇛󰇜

󰇛′󰇜󰇜󰇠



′



 . (31)

Moreover, the computational complexity of Kalman filter is on the order of O(D3), where D =

N  m. A lower-order model can decrease the size of the state vector and hence improve

computational efficiency. However, if traffic state is drastically different from the historical

regular pattern, then a higher-order polynomial trend model will be necessary to approximate

the structural deviation.

2.5 Analytically-derived state transition matrix for a single bottleneck case

This section aims to offer an alternative approach to derive state transition matrix,

, for a particular single bottleneck case with a reduced capacity due to an incident (i.e.,

non-recurring congestion). For the sake of simplicity, the link subscript, a, is ignored in the

following discussion. As shown in Fig. 2, under recurrent congestion, the bottleneck has a

constant queue discharging rate C. An incident that causes a reduced capacity CR happens at

time s and ends at time e. The capacity restores back to C after time e and the queue vanishes

at time f.

[Insert Fig. 2 about here]

Consider the traffic state of interest as link density or number of vehicles on this link. To

derive the state transition matrix for this case, it is necessary to analyze the additional number

of vehicles in queue due to the incident. Depending on the relative position of the prediction

horizon, [





], to the incident time window, [s, e] there are three scenarios under

consideration. First, if the prediction horizon is inside the incident duration (i.e., s <



< e



then the structural deviations at time



: z



= (C  CR)  (



 s) and at time





: z





= (C 

CR)  (





 s). Thus, the corresponding state transition matrix (or factor) can be derived as:

 󰇛󰇜

󰇛󰇜 󰇛󰇜









. (32)

Second, if e 





< e, then the structural deviations at time



: z



= (C  CR)(



 s) and at

time





: z





= C  (





 s)  CR  (e  s)  C  (





 e) = (C  CR)(e  s). Hence,

󰇛󰇜 󰇛󰇜









. (33)

Lastly, if e <



< f, then the structural deviations at time



: z



= (C  CR)(e  s) and at time





: z





= (C  CR)(e  s). Therefore,

. (34)

The same approach could be applied to derive the state transition matrix for other traffic

states of interest, such as link travel time. Interestingly, the derived state transition matrices

for additional link travel time are the same as those for link density under the three scenarios

discussed above, as the link density and additional travel time are directly connected through

the same slope of output capacity. Similar matrices could also be derived for severs weather

and work zone cases. For a real-world corridor with multiple coupling bottlenecks, the matrix

 is time-dependent and situation-dependent. The derivation in this section also provides

strong evidences that the transition matrix under non-recurring conditions can be reasonably

approximated as a time-invariant constant transition coefficient of 1 shown in Eq.(34), or a

time-dependent transition coefficient shown in Eq.(33).

The above state transition matrix relies on the information of incident duration, so an

incident duration (or effect) prediction model needs to be incorporated in the traffic state

estimation and prediction framework to apply the derived method online. It is important to

note that predicting incident duration is not in the scope of our current paper. A variety of

models exist in the literature for predicting effects of incidents. Linear regression models with

independent variables, such as incident type, weather condition, and number of vehicles and

lanes involved, were commonly used in this regard. For example, Khattak et al. (1995), Garid

et al. (1997), Ozbay and Kachroo (1999) adopted variations of this approach. Sheu et al.

(2001) and Sheu et al. (2004) presented a stochastic modeling approach based on Kalman

filtering to real-time prediction of incident effects on freeway and surface street traffic

congestion, respectively. Li and Cheng (2011) presented an incident duration prediction

method based on latent Gaussian Naïve Bayesian classifier.

3. Solution Algorithm

3.1 Kalman filtering-based approach

The traffic state prediction process can be theoretically modeled as a probabilistic

combination of recurring and non-recurring conditions, but it is difficult to estimate or predict

the probability of non-recurring conditions (because supply reduction events and demand

fluctuations that cause non-recurrent traffic states are generally unexpected). To

accommodate both recurring and non-recurring traffic state predictions, the proposed

estimation and prediction equations under recurring and non-recurring conditions are

incorporated in a Kalman filtering-based algorithm. Essentially, for each link a and time

interval t, a threshold that consists of an upper bound (

) and a lower bound (

) is used

to define the range of regular traffic states. If an observation on link a at time t, ya,t, falls in

the range, then the corresponding traffic state ka,t is considered as under recurring conditions;

otherwise, ka,t is under non-recurring conditions. The bounds can be determined according to

the experiences of Traffic Control Center (TCC) or using historical data. For instance, the

lower bound and upper bound may be set as 30th and 70th percentiles, respectively, when the

regular pattern is defined as the median of historical data.

Algorithm: Kalman filtering-based algorithm (KF)

Step 0: (Initialization) Obtain the a priori estimate of regular traffic pattern, 

, and set up

the threshold, 󰇟



󰇠, for each link a and each time interval t = 1,…, T. For

each link a, set 



, 󰇛

󰇜, 

, and 

. Let t = 1.

Step 1: (State Prediction) Propagate a priori traffic state predictions (

) and its variances

() from interval t1 to t.







;





 

.

Step 2: (State Estimation) For each link a with real-time measurement, compute the Kalman

gain factor ( or 

) and update posteriori traffic state estimate (

) and its

variance (

) based on real-time link observation at current time interval t, ya,t.

Specifically,

If 



 (under recurring conditions) then



󰇛

󰇜;





󰇛

󰇜;





;



;



.

If 

 or 

 (under non-recurring conditions) then





;







 

;





󰇛

󰇜;







󰇛



󰇜;







;







;







.

Step 3: If t < T, then t = t + 1, and go to Step 1; otherwise (t = T), stop.

3.2 Adaptive scheme for updating measurement and system errors variances

Specifying the values of evolution error variances Qt and measurement error variances

Rt is an important task for operating the proposed recursive algorithm. This research adapts

the adaptive Kalman filtering approach (e.g., Chui and Chen, 1999) which utilizes statistical

properties of measurement residual sequence to dynamically estimate measurement error

variances Rt. Let





. (35)

By taking the variance on both sides, we can obtain the following

󰇛󰇜. (36)

Then,



 󰇛

󰇜󰇛

󰇜

 , where 



 (37)

Thus,

󰇡

 󰇛

󰇜󰇛

󰇜

 󰇢. (38)

Next, a constant signal-to-noise ratio , reflecting the relative weight of inherent

system error variance with respect to observational error variances, is assumed to obtain

evolution error variances Qt. The decision-maker at TCC can select an appropriate value of



so as minimize average prediction error in training data sets. According to West and Harrison

(1997), if a constant ratio



is assumed, the limiting behavior for the Kalman gain factor can

be derived as the following:



∞

󰇛

󰇜. (39)

For instance,



= 0.5 (i.e., 

∞) indicates that the a priori estimate and the real-time

(posteriori) estimate are equally important. The KF approach with the above adaptive scheme

for updating measurement and system error variances is denoted as Adaptive KF or AKF.

4. Numerical experiments

The proposed KF and AKF approaches were evaluated on two test beds in northern

Taiwan (see Fig. 3). One of them is a highway stretch between Nangang Interchange (0-km)

and Pinglin TCC (14-km) of National Highway No.5 (Jiang Wei-Shui Highway) eastbound.

Another is a highway segment between Taishan toll booth (35-km) and Taoyuan Interchange

(49-km) of National Highway No.1 (Sun Yat-Sen Highway) northbound. Taiwan’s National

Freeway Bureau has installed a dense set of detectors in these two major freeways to collect

traffic data, such as volume, speed and occupancy data (the spacing between two detectors is

less than 2-km). The traffic state of interest in the conducted experiments is density, defined

as number of vehicles per kilometer per lane. Traffic density forecasts have been used in

recently proposed coordinated ramp metering (e.g., Papamichail et al., 2010) and

state-dependent congestion pricing approaches based on anticipatory traffic state information

(e.g., Dong et al., 2011). The observed density data on the weekdays (Tuesday, Wednesday,

and Thursday) from February to June (five months), 2011, were collected and processed by

the TCCs of Highway No.5 and Highway No.1 using estimated average vehicle length.

[Insert Fig. 3 about here]

To guide against testing hypotheses suggested by the data, a cross-validation method

was used in the numerical experiments. Specifically, the weekday (Tuesday, Wednesday, and

Thursday) density data from February to May, 2011 were used as the historical data set to

estimate regular traffic patterns and the upper and lower bounds for recurring traffic

conditions for each (5-minute) prediction time interval. Because mean density may be

affected by the presence of extreme values in a statistical sample, median density was used as

the a priori estimate of regular density pattern. In addition, according to the experiences of

TCCs of Highway No.5 and Highway No.1, the upper and lower bounds of recurring

conditions were set as the 25th percentile and the 75th percentile of historical data in each time

interval. The weekday density data in June (fourteen days) were then employed as the test set

to evaluate the prediction results. For each weekday, there are two prediction periods under

consideration, i.e., 6:00-12:00 AM and 2:00-8:00 PM.

The proposed KF and AKF were implemented with the polynomial (or linear) trend

model with order m = 1 in the state transition equation. As discussed previously, a higher

order trend model was not considered, due to the tradeoff between better approximation of

structural deviations and prediction errors (see Eq.(31)). In the proposed approach, the state

vector in the model is (, 

󰆒), where the estimate of  was obtained as the median of

historical structural deviations in density of link a at time t = 0 (or the beginning of the

planning horizon) and 

󰆒 was approximated as the change of structural deviation per unit

time (

󰆒󰇛󰇜) using historical data. The initial state vector was then input to

the polynomial trend model with the state transition matrix 󰇣 

 󰇤 to determine the state

vector in subsequent time stamps: , , and so on.

In addition to evaluating the KF and AKF methods, this research also implements and

tests an ordinary Kalman filtering (OKF) approach (e.g., Chui and Chen, 1999; Chen and

Chien, 2001) that does not include historical data and the polynomial trend model. The state

transition and measurement equations of OKF are as follows.





, (40)

. (41)

where  denotes the state transition parameters from time t1 to t, which are externally

determined. Hence, there are three predictors under comparison: KF, AKF, and OKF.

The mean absolute percentage error (MAPE) was used to measure the effectiveness of

the three traffic state prediction methods. The MAPE is a commonly used measure of

accuracy of methods for constructing fitted time series values in statistics, particularly in

trend estimation and prediction (e.g., Xia et al., 2011). It usually expresses prediction

accuracy as a percentage, and is defined by the formula:





 



 (42)

where M is the number of observations. As shown in Eq.(42), the definition of the MAPE

involves the absolute operator which implicitly gives the same weight to the errors of

over-estimation/ prediction and under-estimation/ prediction, respectively. Typically, in

estimating and predicting a time series, both under-estimation and over-estimation errors

have to be taken into account when evaluating the performance of a method under

consideration (e.g., Lewis, 1982; Delurgio, 1998). The error of under-estimation cannot be

offset by the error of over-estimation, and vice versa. This principle is also found in several

other measures (indices) of prediction accuracy that are commonly-used in prediction, such

as mean absolute error (MAE) and root mean squared error (RMSE).



 



 (43)



󰇛󰇜



 (44)

Furthermore, one of the advantages over these other measures of prediction accuracy is that

the MAPE expresses prediction accuracy (or inaccuracy) using percentages, so that our

judgment would not be affected by the magnitude of prediction and observation values.

Typically, a prediction is considered as accurate if the MAPE is less than 20% and highly

accurate when the MAPE is lower than 10% (e.g., Lewis, 1982; Delurgio, 1998).

Table 1 and Table 2 present the MAPEs of the prediction results obtained by the three

predictors on test bed #1 (on Highway No. 5) and test bed #2 (on Highway No. 1),

respectively. As shown in the tables, the MAPEs of the three predictors are all smaller than

20%. On test bed #1, the average MAPE of KF is about 5% less than that of OKF and near

3% more than AKF. On test bed #2, the average MAPE of KF is about 6% less than that of

OKF and near 2.5% more than AKF. Furthermore, the maximum absolute percentage errors

(Max APE) for the three prediction methods (OKF, KF and AKF) under comparison in the

two test beds are reported in Table 1 and Table 2 (see the last row of the two tables),

respectively. The Max APEs of KF and AKF are much smaller than those of OKF, which

further highlights the advantage of the proposed approaches over the conventional KF

method in the literature. On both test beds, KF and AKF significantly outperform OKF in

terms of MAPE. This indicates the advantage of adopting the structural state space model

which incorporates historical data and considers structural deviations from regular traffic

patterns. Moreover, the performance of AKF is slightly better than KF, due to the adaptive

scheme, described in Section 3.2, for updating system and measurement error variances.

[Insert Table 1 and Table 2 about here]

To evaluate the performance of the proposed approach under non-recurring conditions,

this research selected three incident cases that occurred on National Highway No.5 in June,

2011, and investigated the prediction results in the immediate upstream segments of the

incident locations. The incident data, as shown in Table 3, were provided by Pinglin TCC,

which is responsible for the operation of National Highway No.5; the incident data on

National Highway No.1 were not available when the experiment was conducted. All the three

incidents involved multiple cars and caused one- or two-lane closure in the incident durations,

so the capacities were greatly reduced by 33% to 66% (there are three lanes in test bed #1).

[Insert Table 3 about here]

The MAPEs of OKF and AKF for these three incident cases are provided in the

rightmost two columns in Table 3, which show that AKF significantly outperforms OKF. In

the three cases, the MAPE of AKF is at least 8% lower than that of OKF. Figs. 4-1, 5-1, and

6-1 depict the prediction results of OKF, and Figs. 4-2, 5-2, and 6-2 plot the prediction results

of AKF for the three incident cases. As shown in the figures, the observed densities in the

incident durations (i.e., non-recurring conditions) deviate considerably from the historical

pattern and exceed the upper bounds (the 75th percentiles) of recurring conditions, while the

observed densities before and after incident durations follow closely with the historical

pattern. This observation lays a foundation for the proposed structural state space modeling

framework and the KF algorithm for traffic state prediction and estimation. Particularly, for

non-recurring conditions, the structural changes from the regular pattern are captured in the

proposed approach, while the forecasts for recurring conditions are based on the historical

pattern. The numerical results demonstrate that the prediction performance of AKF is better

than that of OKF under both recurring and non-recurring conditions. The superiority in

prediction performance of AKF over OKF is especially apparent under non-recurring

conditions, highlighting the advantage of incorporating the structural deviation term in the

model.

Essentially, under recurring conditions, the proposed approach based on the estimate of

regular pattern is capable of describing real-time traffic state time series and produces

accurate forecasts. However, in the presence of structural changes, deviations between actual

traffic states and regular pattern have non-zero means. The KF then combines the polynomial

trend filter with historical pattern to capture possible structural deviations, producing robust

forecasts under both recurring and non-recurring conditions. Therefore, a robust traffic state

predictor should incorporate both regular pattern and structural deviation components to

ensure prediction quality under a wide range of conditions.

[Insert Fig. 4-1, Fig. 4-2, Fig. 5-1, Fig. 5-2, Fig. 6-1, and Fig. 6-2 about here]

5. Concluding remarks

With a particular aim at developing a robust short-term traffic state prediction approach

that can produce accurate forecasts under both regular and irregular traffic conditions, this

research proposes the prediction model which integrates both the historical pattern and the

polynomial trend component. The proposed model decomposes actual freeway traffic state

into three components, namely, regular pattern, structural deviation, and random fluctuation,

where the median of historical data is used as the a priori estimate of regular pattern and the

polynomial trend model is adopted to describe structural changes due to unexpected capacity

reduction and demand fluctuations. In this modeling framework, freeway traffic states are

predicted by the proposed approach using a mixture of historical (median) data that capture

regular traffic patterns and real-time observations obtained from detectors that contain

intrinsic (structural deviation) information of current traffic states. We also show that the

structural deviation in a single bottleneck case can be analytically derived based on

cumulative flow counts, which provide theoretical supports on the use of polynomial trends

model under non-recurring congestion conditions .

The model is incorporated in the Kalman filtering-based algorithm. If a real-time

measurement falls within the range of recurring conditions, structural deviation vanishes and

the forecast of traffic state depends on historical pattern. Otherwise, structural change is

predicted and corrected based on that real-time observation. The proposed KF and AKF

algorithms were evaluated on the two test beds in the northern Taiwan highway network, and

compared with the OKF algorithm presented in the literature. The prediction results show that

the proposed approaches significantly outperform OKF in terms of MAPE on both test beds.

Furthermore, the forecasting results under incident (non-recurring) situations were

investigated. The prediction performance of AKF is particularly better than that of OKF

under non-recurring traffic conditions, highlighting the benefit of incorporating the structural

deviation term in the model.

Our future research is to compare the prediction performance of the proposed approach

with that of other state-of-the-art traffic state (or travel time) prediction methods, such as

Bayesian forecasting and neural networks, on the two test beds and some other test beds. A

set of sensitivity tests will also be conducted to investigate impacts and benefits of using

higher order polynomial trend models. Additionally, as a heterogeneous sensor network may

include a set of point, point-to-point and probe sensors, extending the proposed traffic state

predictor to fully utilize heterogeneous sources of measurement data will be an important

future research task for the deployment of AKF in traffic networks. With real-time

measurement data inevitably tagged with some degree of imprecision and uncertainty, a

robust traffic state prediction method should also explicitly address the issue of representing

and dealing with imprecise and uncertain data.

Acknowledgements

This paper is based on a project (NSC 102-2410-H-027-011-MY3) sponsored by the National

Science Council, Taiwan. The authors are grateful to three anonymous reviewers for many

insightful and constructive comments and suggestions which help improve the quality of this

paper. The authors are solely responsible for the content of this paper.

References

Abdulhai, B., Porwal, H. and Recker, W. (2002). Short-term traffic flow prediction using

Neuro-Genetic Algorithms. Journal of Intelligent Transportation Systems 7(1), 3-41.

Anand, A., Ramadurai, G. and Vanajakshi, L. (2013). Data fusion based traffic density

estimation and prediction. Journal of Intelligent Transportation Systems,

(DOI:10.1080/15472450.2013.806844).

Ben-Akiva, M. E., Bierlaire, M., Bottom, J., Koutsopoulos, H. N. and Mishalani, R. (1997).

Development of a route guidance generation system for real-time application. Presented at

the IFAC Conference, Chania, Greece.

Bottom, J. (2000). Consistent Anticipatory Route Guidance. Ph.D. dissertation,

Massachusetts Institute of Technology.

Chandra, S. R. and Al-Deek, H. (2009). Predictions of freeway traffic speeds and volumes

using vector autoregressive models. Journal of Intelligent Transportation Systems 13(2),

53-72.

Chen, M., and Chien, S. (2001). Dynamic freeway travel-time prediction with probe vehicle

data, link based versus path based. Transportation Research Record 1768, 157-161.

Chien, S., and Kuchipudi, C.M. (2003). Dynamic travel time prediction with real-time and

historic data. Journal of Transportation Engineering 129(6), 608-616.

Chu, L.Y., Oh, J.S., Recker, W. (2005). Adaptive kalman filter based freeway travel time

estimation. Paper presented in the 84th TRB Annual Meeting, Washington D.C., January,

2005.

Chui, C. and Chen, G. (1999). Kalman Filtering with Real-Time Applications.

Springer-Verlag.

Delurgio, S. A. (1998). Forecasting Principles and Applications, McGraw-Hill.

Dong, J., Mahmassani, H. S., Erdoğan, S., and Lu, C.-C. (2011). State-dependent pricing for

real-time freeway management: anticipatory versus reactive strategies. Transportation

Research Part C 19(4), 644-657.

Fei, X., Lu, C.-C., and Liu, K. (2011). A Bayesian dynamic linear model approach for

real-time short term freeway travel time prediction. Transportation Research Part C 19(6),

1306-1318.

Garib, A., Radwan, A. E. and Al-Deek, H. (1997). Estimating magnitude and duration of

incident delays. ASCE Journal of Transportation Engineering, 123, 459-466.

Hegyi, A., Schutter, B. D., and Hellendoorn, H. (2005). Model predictive control for optimal

coordination of ramp metering and variable speed limits. Transportation Research Part C

13(3), 185-209.

Khattak, A., Schofer, J. and Wang, M. (1995). A simple time-sequential procedure for

predicting freeway incident duration. IVHS Journal, 2, 113-138.

Lam, S. H., and Toan, T. D. (2008). Short-term travel time predictions using support vector

regression. Paper presented in the 87th TRB Annual Meeting, Washington D.C., January,

2008.

Lan, L. W., Sheu, J. B, and Huang, Y. S. (2008). Investigation of temporal freeway traffic

patterns in reconstructed state spaces. Transportation Research Part C 16(3), 116-136.

Lewis, C. D. (1982). Industrial and Business Forecasting Method. Butterworth Scientific.

Li, D. and Cheng, L. (2011). Incident duration prediction based on latent Gaussian naïve

Bayesian classifier. International Journal of Computational Intelligence Systems, 4(3),

345-352.

Lou, Y., Yin, Y. and Laval, J. A. (2011). Optimal dynamic pricing strategies for

high-occupancy/toll lanes. Transportation Research Part C, 19(1), 64-74.

Lu, C.-C. (2012). An adaptive system for predicting freeway travel time. International

Journal of Information Technology & Decision Making 11(4), 727-747.

Mahmassani, H. S. and Peeta, S. (1995). System optimal dynamic assignment for electronic

route guidance in a congested traffic network. In Gartner and Improta, (Eds.) Urban

Traffic Networks: Dynamic Flow Modeling and Control, pp.3-37, Springer-Verlag, Berlin.

Mahmassani, H. S., Lu, C.-C. and Dong J. (2005). Value of information: provision of

anticipatory descriptive travel information through a real-time traffic estimation and

prediction system. Proceedings of the 12th ITS World Congress, San Francisco, California,

USA.

Nanthawichit, C., Nakatsuji, T. and Suzuki, H. (2003). Application of probe-vehicle data for

real-time traffic-state estimation and short-term travel-time prediction on a freeway.

Transportation Research Record 2987, 49-59.

Ozbay, K. and Kachroo, P. (1999). Incident Management in Intelligent Transportation

Systems. Artech House, Boston.

Papamichail, I, Kotsialos, A., Margonis, I. and Papageorgiou, M. (2010), Coordinated ramp

metering for freeway networks – A model-predictive hierarchical control approach.

Transportation Research Part C 18(3), 311-331.

Rilett, L. R. and Park, D. J. (2001). Direct forecasting of freeway corridor travel times using

spectral basis neural networks. Transportation Research Record 1752, 140-147.

Sheu, J.-B., Chou, Y.-H. and Shen, L.-J. (2001). A stochastic estimation approach to

real-time prediction of incident effects on freeway traffic congestion. Transportation

Research Part B 35(6), 575-592.

Sheu, J.-B., Chou, Y.-H. and Chen, A. (2004). Stochastic modeling and real-time prediction

of incident effects on surface street traffic congestion. Applied Mathematical Modeling

28(5), 445-468.

van Hinsbergen, C. P. and van Lint, J.W.C. (2008). Bayesian combination of travel time

prediction models. Transportation Research Record 2064, 73-80.

van Hinsbergen, C. P., van Lint, J. W. C. and van Zuylen, H. J. (2009). Bayesian committee

of neural networks to predict travel times with confidence intervals. Transportation

Research Part C 17(5), 498-509.

van Lint, J. W. C., Hoogendoorn, S. P., and Van Zuylen, H. J. (2002). Freeway travel time

prediction with state-space neural networks-modeling state-space dynamics with recurrent

neural networks. Transportation Research Record 1811, 30-39.

Wang, Y. and Papageorgiou, M. (2005). Real-time freeway traffic state estimation based on

extended kalman filter: a general approach. Transportation Research Part B 39(2),

141-167.

Wang, Y., Papageorgiou, M. and Messmer, A. (2006). RENAISSANCE - A unified

macroscopic model based approach to real-time freeway network traffic surveillance.

Transportation Research Part C 14(3), 190–212.

West, M. and Harrison, J. (1999). Bayesian Forecasting and Dynamic Models. Springer

Series in Statistics, second edition.

Yang, F, Yin, Z. Z., Liu, H and Ran, B. (2004). Online recursive algorithm for short-term

traffic prediction. Transportation Research Record 1879, 1-8.

Zhang, X. Y. and Rice, J. A. (2003). Short-term travel time prediction using a time-varying

coefficient linear model. Transportation Research Part C 11(8), 187-210.

Zhang, Y. and Ye, Z. (2008). Short-term traffic flow forecasting using Fuzzy logic system

methods. Journal of Intelligent Transportation Systems 12(3), 102-112.

Zhou, X. and Mahmassani. H. S. (2007). A structural state space model for real-time

origin-destination demand estimation and prediction in a day-to-day updating

framework. Transportation Research Part B 41(8), 823-840.

Xia, J., Chen, M. and Huang, W. (2011). A Multistep corridor travel time prediction method

using presence-type vehicle detector data. Journal of Intelligent Transportation Systems

15(2), 104-113.

Fig. 1 Illustrative examples of the structural state space model

Fig. 2 Cumulative flow count curves for a single bottleneck with reduced capacity due to an

incident starting at time s

Time

Cumulative flow count

Departure curve

with incident (D')

Incident duration

Additional number

of vehicles in queue

(structural deviation)

Departure curve (D)

Arrival curve

Reduced capacity (CR)

Time

True state

Historical pattern

Deviation

Fig. 3 Two test beds in northern Taiwan area

(Source: National Freeway Bureau, Taiwan http://www.freeway.gov.tw/english/way_net.aspx)

Table 1 MAPEs (%) of the prediction results on test bed #1 (Highway No.5)

Methods

OKF

AKF

Date

2011/6/1

17.04

15.78

11.51

9.11

7.36

7.66

2011/6/2

17.64

17.96

13.64

11.01

10.31

8.41

2011/6/3

16.65

16.44

13.13

11.95

9.84

8.95

2011/6/8

17.16

15.37

9.24

9.11

7.55

8.52

2011/6/9

16.26

13.31

9.06

10.38

8.31

7.47

2011/6/10

17.16

17.06

11.15

10.87

7.98

7.95

2011/6/15

16.78

15.99

12.42

13.36

8.25

9.68

2011/6/16

16.58

17.32

8.73

13.66

4.83

9.63

2011/6/17

14.75

15.15

8.36

11.68

4.97

8.31

2011/6/22

15.81

15.87

10.12

9.23

8.34

7.85

2011/6/23

15.97

15.27

10.52

8.87

7.63

5.76

2011/6/24

16.32

17.89

12.43

10.72

9.32

6.33

2011/6/29

15.69

15.11

13.36

9.03

8.70

6.62

2011/6/30

15.34

14.63

10.55

12.29

8.02

8.09

Average

16.37

15.74

11.02

10.61

8.04

7.95

Max. APE

28.21

30.95

18.88

18.47

14.13

13.89

Table 2 MAPEs of the prediction results on test bed #2 (Highway No.1)

Methods

OKF

AKF

Date

2011/6/1

18.32

16.28

12.16

9.67

9.35

8.08

2011/6/2

17.50

15.84

9.44

9.48

9.32

8.27

2011/6/3

18.84

16.90

12.58

12.82

7.19

9.11

2011/6/8

14.91

16.72

9.63

8.82

8.94

8.14

2011/6/9

17.79

14.60

8.34

8.73

8.31

7.77

2011/6/10

17.51

14.59

10.77

8.24

8.35

6.62

2011/6/15

14.13

14.62

9.55

9.94

7.40

7.41

2011/6/16

14.80

15.39

8.19

11.22

7.76

9.22

2011/6/17

15.54

15.73

10.03

9.78

6.46

9.57

2011/6/22

14.74

15.03

9.38

10.38

7.16

9.37

2011/6/23

15.27

17.40

8.42

6.44

7.40

5.94

2011/6/24

16.33

15.71

12.97

8.51

6.18

7.02

2011/6/29

17.44

13.90

11.51

8.35

8.33

6.27

2011/6/30

18.29

14.24

11.19

11.37

7.23

8.77

Average

16.52

15.49

10.30

9.75

7.82

7.97

Max. APE

25.05

24.13

17.75

17.39

13.91

13.85

Table 3 Prediction performances for three selected incidents on Highway No.5 in June, 2011

No.

Location

Date

Start

Time

End

Time

# of Blocked

Lanes

OKF

(MAPE)

AKF

(MAPE)

11.2-km Eastbound

2011/6/1

16:50

17:45

17.87%

9.84%

7.1-km Eastbound

2011/6/9

9:10

10:05

18.12%

9.91%

4.5-km Eastbound

2011/6/15

11:10

12:15

18.15%

9.23%

Fig. 4-1 Prediction results of OKF for Incident No. 1

Fig. 4-2 Prediction results of AKF for Incident No. 1

100

120

140

Density (number of vehicles/lane-km)

Observations

Predictions

Historical median

100

120

140

Density (number of vehicles/lane-km)

Observations

Predictions

Historical median

Incident

Fig. 5-1 Prediction results of OKF for Incident No. 2

Fig. 5-2 Prediction results of AKF for Incident No. 2

Fig. 6-1 Prediction results of OKF for Incident No. 3

100

120

07:30

07:35

07:40

07:45

07:50

07:55

08:00

08:05

08:10

08:15

08:20

08:25

08:30

08:35

08:40

08:45

08:50

08:55

09:00

09:05

09:10

09:15

09:20

09:25

09:30

09:35

09:40

09:45

09:50

09:55

10:00

10:05

10:10

10:15

10:20

10:25

10:30

10:35

10:40

10:45

10:50

10:55

11:00

11:05

11:10

11:15

11:20

11:25

11:30

11:35

11:40

Density (number of vehicles/lane-km)

Observations

Predictions

Historical median

100

120

07:30

07:35

07:40

07:45

07:50

07:55

08:00

08:05

08:10

08:15

08:20

08:25

08:30

08:35

08:40

08:45

08:50

08:55

09:00

09:05

09:10

09:15

09:20

09:25

09:30

09:35

09:40

09:45

09:50

09:55

10:00

10:05

10:10

10:15

10:20

10:25

10:30

10:35

10:40

10:45

10:50

10:55

11:00

11:05

11:10

11:15

11:20

11:25

11:30

11:35

11:40

Density (number of vehicles/lane-km)

Observations

Predictions

Historical median

100

120

140

Density (number of vehicles/lane-km)

Observations

Predictions

Historical median

Incident

Fig. 6-2 Prediction results of AKF for Incident No. 3

100

120

140

Density (number of vehicles/lane-km)

Observations

Predictions

Historical median

Incident

Urban Traffic Congestion Prediction: A Multi-Step Approach Utilizing Sensor Data and Weather Information

Article

Full-text available

Jan 2024

Citation: Tsalikidis, N.; Mystakidis, A.; Koukaras, P.; Ivaškevičius, M.; Morkūnaitė, L.; Ioannidis, D.; Fokaides, P.A.; Tjortjis, C.; Tzovaras, D. Urban Traffic Congestion Prediction: A Multi-Step Approach Utilizing Sensor Data and Weather Information. Smart Cities 2024, 7, 233-253. https://doi. Abstract: The continuous growth of urban populations has led to the persistent problem of traffic congestion, which imposes adverse effects on quality of life, such as commute times, road safety, and the local air quality. Advancements in Internet of Things (IoT) sensor technology have contributed to a plethora of new data streams regarding traffic conditions. Therefore, the recognition and prediction of traffic congestion patterns utilizing such data have become crucial. To that end, the integration of Machine Learning (ML) algorithms can further enhance Intelligent Transportation Systems (ITS), contributing to the smart management of transportation systems and effectively tackling traffic congestion in cities. This study seeks to assess a wide range of models as potential solutions for an ML-based multi-step forecasting approach intended to improve traffic congestion prediction, particularly in areas with limited historical data. Various interpretable predictive algorithms, suitable for handling the complexity and spatiotemporal characteristics of urban traffic flow, were tested and eventually shortlisted based on their predictive performance. The forecasting approach selects the optimal model in each step to maximize the accuracy. The findings demonstrate that, in a 24 h step prediction, variating Ensemble Tree-Based (ETB) regressors like the Light Gradient Boosting Machine (LGBM) exhibit superior performances compared to traditional Deep Learning (DL) methods. Our work provides a valuable contribution to short-term traffic congestion predictions and can enable more efficient scheduling of daily urban transportation.

Adaptive Dimming of Highway Lights Using Recurrent Neural Networks

Chapter

Apr 2023

Highway operators are in a constant search of techniques and methodologies that can reduce their energy footprint. In this respect, the installation of dimmable light-emitting diode lights on the open road section of highways appears to be a promising solution, due to the reduced energy consumption (compared to high pressure sodium lamps) and the ability to adjust their brightness at various levels, based on the road’s traffic load. However, setting the desired level of light intensity cannot be performed instantly, due to safety and contractual reasons that a highway operator must follow. For this reason, an adaptive and intelligent system is proposed in this work, that models traffic load and is able to predict its future trend, based on current load and light intensity measurements. In this way, the unnecessary use of the lighting equipment is avoided, as brightness is dropped to a minimum level when traffic load is predicted to be low. The proposed model is based on recurrent neural networks and more specifically on long short-term memory cells that are able to model complex dependencies in data with temporal correlations, like traffic load measurements. The overall approach is evaluated on relevant data provided by Olympia Odos S.A. that operates the Elefsina-Korinthos-Patra highway in Greece, with promising results.KeywordsTraffic load predictionHighway LightingLED LightingIntelligent Traffic Control SystemsRecurrent Neural NetworksLong Short-Term Memory

A Multifeature Fusion Short-Term Traffic Flow Prediction Model Based on Deep Learnings

Article

Full-text available

May 2022
J ADV TRANSPORT

Short-term traffic flow prediction is an important component of intelligent transportation systems, which can support traffic trip planning and traffic management. Although existing predicting methods have been applied in the field of traffic flow prediction, they cannot capture the complex multifeatures of traffic flows resulting in unsatisfactory short-term traffic flow prediction results. In this paper, a multifeature fusion model based on deep learning methods is proposed, which consists of three modules, namely, a CNN-Bidirectional GRU module with an attention mechanism (CNN-BiGRU-attention) and two Bidirectional GRU modules with an attention mechanism (BiGRU-attention). The CNN-BiGRU-attention module is used to extract local trend features and long-term dependent features of the traffic flow, and the two BiGRU-attention modules are used to extract daily and weekly periodic features of the traffic flow. Moreover, a feature fusion layer in the model is used to fuse the features extracted by each module. And then, the number of neurons in the model, the loss function, and other parameters such as the optimization algorithm are discussed and set up through simulation experiments. Finally, the multifeature fusion model is trained and tested based on the training and test sets from the data collected from the field. And the results indicate that the proposed model can better achieve traffic flow prediction and has good robustness. Furthermore, the multifeature fusion model is compared and analyzed against the baseline models with the same dataset, and the experimental results show that the multifeature fusion model has superior predictive performance compared to the baseline models.

Examine the Prediction Error of Ride-Hailing Travel Demands with Various Ignored Sparse Demand Effects

Article

Full-text available

Apr 2022
J ADV TRANSPORT

The accurate short-term travel demand predictions of ride-hailing orders can promote the optimal dispatching of vehicles in space and time, which is the crucial issue to achieve sustainable development of such dynamic demand-responsive service. The sparse demands are always ignored in the previous models, and the uncertainties in the spatiotemporal distribution of the predictions induced by setting subjective thresholds are rarely explored. This paper attempts to fill this gap and examine the spatiotemporal sparsity effect on ride-hailing travel demand prediction by using Didi Chuxing order data recorded in Chengdu, China. To obtain the spatiotemporal characteristics of the travel demand, three hexagon-based deep learning models (H-CNN-LSTM, H-CNN-GRU, and H-ConvLSTM) are compared by setting various threshold values. The results show that the H-ConvLSTM model has better prediction performance than the others due to its ability to simultaneously capture spatiotemporal features, especially in areas with a high proportion of sparse demands. We found that increasing the minimum demand threshold to delete more sparse data improves the overall prediction accuracy to a certain extent, but the spatiotemporal coverage of the data is also significantly reduced. Results of this study could guide traffic operations in providing better travel services for different regions.

Joint predictions of multi-modal ride-hailing demands: A deep multi-task multi-graph learning-based approach

Article

Jun 2021
TRANSPORT RES C-EMER

Ride-hailing platforms generally provide various service options to customers, such as solo ride services, shared ride services, etc. It is generally expected that demands for different service modes are correlated, and the prediction of demand for one service mode can benefit from historical observations of demands for other service modes. Moreover, an accurate joint prediction of demands for multiple service modes can help the platforms better allocate and dispatch vehicle resources. Although there is a large stream of literature on ride-hailing demand predictions for one specific service mode, few efforts have been paid towards joint predictions of ride-hailing demands for multiple service modes. To address this issue, we propose a deep multi-task multi-graph learning approach, which combines two components: (1) multiple multi-graph convolutional (MGC) networks for predicting demands for different service modes, and (2) multi-task learning modules that enable knowledge sharing across multiple MGC networks. More specifically, two multi-task learning structures are established. The first one is the regularized cross-task learning, which builds cross-task connections among the inputs and outputs of multiple MGC networks. The second one is the multi-linear relationship learning, which imposes a prior tensor normal distribution on the weights of various MGC networks. Although there are no concrete bridges between different MGC networks, the weights of these networks are constrained by each other and subject to a common prior distribution. Evaluated with the for-hire-vehicle datasets in Manhattan, we show that our proposed approach outperforms the benchmark algorithms in prediction accuracy for different ride-hailing modes.

Traffic Demand Prediction Based on Multi-dimensional Graph Convolutional Network

Conference Paper

Dec 2023

Ridesplitting demand prediction via spatiotemporal multi-graph convolution network

Article

Aug 2024
EXPERT SYST APPL

On region-level travel demand forecasting using multi-task adaptive graph attention network

Article

Dec 2022
INFORM SCIENCES

Accurate travel demand forecasting at the regional level benefits to urban traffic management and service operations. Irregular regions can be naturally represented by graphs, and thus, graph neural network (GNN) is rapidly becoming the mainstream method in region-level travel demand forecasting. Most existing GNN-based methods encode the association between regions into single or multiple fixed adjacency matrices. The limitation of these methods is that they ignore that the correlation between regions may vary over time. Several studies have attempted to capture such change with purely data-driven dynamic graph approaches; however, they are highly dependent on data quality. Furthermore, regions with different functional distributions exhibit diverse demand patterns, which are not fully considered in demand forecasting. To address these issues, we propose a multi-task adaptive recurrent graph attention network, in which the spatio-temporal learning component combines the prior knowledge-driven graph learning mechanism with a novel recurrent graph attention network to capture the dynamic spatiotemporal dependencies automatically. Demand forecasting for all regions is divided into different learning tasks based on region function distributions, and an uncertainty-based multi-task learning component is then developed to coordinate multiple learning tasks adaptively. Experimental results based on two real-world datasets show the superiority of the proposed method in prediction accuracy. Compared with the state-of-the-art dynamic graph methods, the proposed method further reduces prediction error by 4%. In addition, various ablation experiments demonstrate the effectiveness of different components in the proposed method.

Prediction of Hot Spot Toll Stations based on Traffic Flow Model

Conference Paper

Aug 2022

Chen Jun

Improving Spatiotemporal Traffic Prediction in Adversary Weather Conditions Using Hierarchical Bayesian State Space Modeling

Conference Paper

Sep 2021

Direct Forecasting of Freeway Corridor Travel Times Using Spectral Basis Neural Networks

Article

Jan 2001

With the advent of advanced traveler information systems, the prediction of short-term link and corridor travel times has become increasingly important. The standard method for forecasting corridor travel times is a two-step process in which the link travel times are first forecast and then combined into a corridor travel time. If link travel times are not in dependent, however, there is the potential for erroneous corridor or route travel time estimates. As an alternative to the two-step approach, a direct or one-step approach for freeway corridor travel time forecasting is proposed that automatically takes into account interrelationships between link travel times. The use of spectral basis neural networks to directly forecast multiple-period freeway corridor travel times is examined first. The model is tested on observed travel times collected as part of the automatic vehicle identification component of the Houston Transtar system. The direct forecasting model is also compared with the two-step model, which uses forecast link travel times as input. It was found that the direct forecasting approach gave better results than any of the other models examined and that link travel time forecasting errors are not additive.

Bayesian Forecasting and Dynamic Models

Article

Feb 1992

Kalman Filtering

Book

Jan 1999

Optimal dynamic pricing strategies for high-occupancy/toll lanes

Article

Jan 2010

Direct forecasting of freeway corridor travel times using spectral basis neural networks

Conference Paper

Jan 2001

With the advent of advanced traveler information systems, the prediction of short-term link and corridor travel times has become increasingly important. The standard method for forecasting corridor travel times is a two-step process in which the link travel times are first forecast and then combined into a corridor travel time. If link travel times are not independent, however, there is the potential for erroneous corridor or route travel time estimates. As an alternative to the two-step approach, a direct or one-step approach for freeway corridor travel time forecasting is proposed that automatically takes into account interrelationships between link travel times. The use of spectral basis neural networks to directly forecast multiple-period freeway corridor travel times is examined first. The model is tested on observed travel times collected as part of the automatic vehicle identification component of the Houston Transtar system. The direct forecasting model is also compared with the two-step model, which uses forecast link travel times as input. It was found that the direct forecasting approach gave better results than my of the other models examined and that link travel time forecasting errors are not additive.

Value of information: Provision of anticipatory descriptive travel information through a real-time traffic estimation and prediction system

Article

Jan 2009

This paper demonstrates the capability of DYNASMART-X to provide anticipatory descriptive travel information, in the form of predictive time-dependent path or point-to-point travel times. For the sake of evaluating different information provision strategies, a closed-loop rolling horizon framework is implemented on the basis of original open loop system design. Two information provision strategies are compared in the conducted computational experiment, namely prevailing travel time information and predictive time-dependent travel time information. Consistency and value of the generated anticipatory travel time information are explored and discussed in the simulation results from a computational experiment.

Data Fusion-Based Traffic Density Estimation and Prediction

Article

Oct 2014

Traffic congestion has become a major challenge in recent years in many countries of the world. One way to alleviate congestion is to manage the traffic efficiently by applying intelligent transportation systems (ITS). One set of ITS technologies helps in diverting vehicles from congested parts of the network to alternate routes having less congestion. Congestion is often measured by traffic density, which is the number of vehicles per unit stretch of the roadway. Density, being a spatial characteristic, is difficult to measure in the field. Also, the general approach of estimating density from location-based measures may not capture the spatial variation in density. To capture the spatial variation better, density can be estimated using both location-based and spatial data sources using a data fusion approach. The present study uses a Kalman filter to fuse spatial and location-based data for the estimation of traffic density. Subsequently, the estimated data are utilized for predicting density to future time intervals using a time-series regression model. The models were estimated and validated using both field and simulated data. Both estimation and prediction models performed well, despite the challenges arising from heterogeneous traffic flow conditions prevalent in India.

An adaptive system for predicting freeway travel times

Article

Sep 2012

Chung-cheng Jasonlu

This paper presents an adaptive system that embeds a Bayesian inference-based dynamic model (BDM) for predicting real-time travel time on a freeway corridor. Bayesian forecasting is a learning process that revises sequentially the state of a priori knowledge of travel time based on newly available information. The prediction result is a posterior travel time distribution that can be employed to generate a single-value (typically but not necessarily the mean) travel time as well as a confidence interval representing the uncertainty of travel time prediction. To better track travel time fluctuations during nonrecurrent congestion due to unforeseen events (e.g., incidents, accidents, or bad weather), the BDM is integrated into an adaptive control framework that can automatically learn and adjust the system evolution noise level. The experimental results based on real loop detector data of a freeway stretch in Northern Taiwan suggest that the proposed method is able to provide accurate and reliable travel time prediction under both recurrent and nonrecurrent traffic conditions.

Incident Duration Prediction Based on Latent Gaussian Naive Bayesian classifier

Article

Mar 2012

The probability distribution of duration is a critical input for predicting the potential impact of traffic incidents. Most of the previous duration prediction models are discrete, which divide duration into several intervals. However, sometimes the continuous probability distribution is needed. Therefore a continuous model based on latent Gaussian naive Bayesian (LGNB) classifier is developed in this paper, assuming duration fits a lognormal distribution. The model is calibrated and tested by incident records from the Georgia Department of Transportation. The results show that LGNB can describe the continuous probability distribution of duration well. According to the evidence sensitivity analysis of LGNB, the four classes of incidents classified by LGNB can be interpreted by the level of severity and complexity.

A bayesian dynamic linear model approach for real-time short-term freeway travel time prediction

Article

Dec 2011
TRANSPORT RES C-EMER

This paper presents a Bayesian inference-based dynamic linear model (DLM) to predict online short-term travel time on a freeway stretch. The proposed method considers the predicted freeway travel time as the sum of the median of historical travel times, time-varying random variations in travel time, and a model evolution error, where the median is employed to recognize the primary travel time pattern while the variation captures unexpected supply (i.e. capacity) reduction and demand fluctuations. Bayesian forecasting is a learning process that revises sequentially the state of a priori knowledge of travel time based on newly available information. The prediction result is a posterior travel time distribution that can be employed to generate a single-value (typically but not necessarily the mean) travel time as well as a confidence interval representing the uncertainty of travel time prediction. To better track travel time fluctuations during non-recurrent congestion due to unforeseen events (e.g., incidents, accidents, or bad weather), the DLM is integrated into an adaptive control framework that can automatically learn and adjust the system evolution noise level. The experiment results based on the real loop detector data of an I-66 segment in Northern Virginia suggest that the proposed method is able to provide accurate and reliable travel time prediction under both recurrent and non-recurrent traffic conditions.

Short-Term Highway Traffic State Prediction Using Structural State Space Models

Abstract and Figures

Recommended publications

Path Flow Estimation Using Time Varying Coefficient State Space Model

Sensitivity Penalization Based Robust State Estimation for Uncertain Linear Systems

Kalman Filter

Receding Nonlinear Kalman (RNK) Filter for Nonlinear Constrained State Estimation