ArticlePDF Available

A Deep Learning Model for Off-Ramp Hourly Traffic Volume Estimation

Authors:

Abstract

This paper addresses estimation of traffic volume of freeway off-ramps. Freeways are the transportation network’s main corridors, serving a large portion of the traffic volume. This traffic passes into the lower-level roads through off-ramps. Therefore, the traffic condition of the off-ramps is an essential factor affecting the operation of the transportation network. The continuous collection of volume data is impractical, and transportation authorities install vehicle detectors permanently on only a few off-ramps and temporarily (e.g., a week) on some others. Thus, traffic volume is the most challenging to estimate among various traffic measures. Moreover, the existing literature on volume estimation is mainly concerned with evaluating traffic on the main road segments. However, the distinct characteristics of the connection links, such as off-ramps, demands specified modeling. This study estimates the hourly traffic volume of off-ramps using a deep learning model. It evaluates the advantages of inputting the connected lower-level road features to the model, and explores various detector installation strategies on the model training process. The primary data sources are volume counts, probe speeds, and road segment infrastructure characteristics. The model results indicate that the incorporation of traffic flow characteristics and infrastructure attributes of the lower-level road connected to the freeway significantly improves the accuracy of estimation off-ramp traffic volume. Further, analysis illustrated that the model trained with data from temporarily installed detectors on all interchanges outperformed models trained with permanently installed detectors on 90% of the interchanges, indicating the model’s ability in extracting temporal correlations significantly more than spatial correlations.
`
A Deep Learning Model for Off-ramp Hourly Traffic Volume Estimation
Amir Nohekhan1
Graduate Research Assistant
Ph.D. candidate
Email: amirn@umd.edu
(Corresponding Author)
ORCiD: 0000-0002-8745-6152
Sara Zahedian1
Graduate Research Assistant
Ph.D. candidate
Email: szahedi1@umd.edu
ORCiD: 0000-0002-6927-1189
Ali Haghani2
Professor
Email: haghani@umd.edu
ORCiD: 0000-0003-3181-7155
(1) Center for Advanced Transportation Technology
Department of Civil and Environmental Engineering
University of Maryland, College Park, MD, 20740
(2) Department of Civil and Environmental Engineering
University of Maryland, College Park, MD, 20740
Word Count: 6,050 words + 5 tables = 7,300 words
Submitted [11/29/2020]
1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
`
ABSTRACT
This paper aims to estimate the freeway off-ramps' traffic volume. Freeways are the
transportation network's main corridors, serving a large portion of the traffic volume. This traffic
passes into the lower-level roads through off-ramps. Therefore, the traffic condition of the off
ramps is an essential factor affecting the transportation network operation. The continuous
collection of volume data is impractical, and transportation authorities install vehicle detectors
permanently on only a few off-ramps and temporary (e.g., a week) on some others. Thus, traffic
volume is the most challenging to estimate among various traffic measures. Moreover, the
existing literature on volume estimation is mainly concerned with evaluating traffic on the main
road segments. However, distinct characteristics of the connection links, such as off-ramps,
demands specified modeling. This study estimates the off-ramps' hourly traffic volume using a
deep learning model and evaluates the advantages of inputting the connected lower-level road
features to the model, and explores various detector installation strategies on the model training
process. This study's primary data sources are volume counts, probe speeds, and road segments'
infrastructure characteristics. The model results indicate that the incorporation of traffic flow
characteristics and infrastructure attributes of the lower-level road connected to the freeway
significantly improves the off-ramp's traffic volume estimation accuracy. Further, analysis
illustrated that the model trained with data of the temporarily installed detectors on all
interchanges outperforms the models trained with permanently installed detectors on 90% of the
interchanges, indicating model ability in extracting temporal correlations significantly more than
spatial correlations.
Keywords: Traffic Volume Estimation, Off-ramp, Neural Network Model
2
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
`
INTRODUCTION AND BACKGROUND
This study aims to estimate the number of vehicles that exit a freeway in an hour and
enter the arterials using traffic counts data. As the backbone of a road network, freeways are the
essential corridors determining the entire network’s traffic conditions. The interaction of
freeways and the rest of the road network is through on and off-ramps. The large volume of
traffic using the freeways use off-ramps to go to lower-level roads and arrive at their destination.
The amount of traffic that exits a freeway, especially during peak hours, can create traffic
congestion, thereby severely impacting the performance of the downstream roads. The drop in
road performance is because arterials and collectors’ design is adequate for a much lower traffic
volume than freeways. Contrary to other road segments that typically have a high-resolution
speed record as an indicator of the traffic volume, such records are not available for on- and off-
ramps due to not having a Traffic Messaging Channel (TMC) segment designated to them. The
motivation of this study is to provide a framework for the system operators to estimate and
predict the traffic volume that exits a freeway and enters the local roads to plan accordingly.
The traditional method for estimating link flows relies on traffic assignment techniques.
These techniques aim to compute link flows based on the origin-destination (OD) matrix and
equilibrium assumptions [1]. Therefore, the OD matrix is a critical element in the process of
estimating link flows. Many studies focus on OD matrix estimation models and can be
categorized into two groups of static and dynamic based models [2]. Static models yield an
average OD matrix, which is time-independent and is useful for planning level flow estimations.
At the same time, dynamic methods aim to estimate the time-dependent OD matrix to be used at
the operational level ([3-6]). Most of the studies in this area have an outdated OD matrix as input
and aim to update that matrix using time-varying traffic counts [7]. Then, the modified OD
matrix is input into a dynamic traffic assignment (DTA) model to obtain link flows. There exist a
wide range of DTA models which also can be categorized into two class of analytical and
simulation-based. Interested readers may refer to [8] for more details on these models.
One other way to approach the link flow estimation problem is through macroscopic
traffic flow modeling. These models mainly aim to estimate various traffic state variables -
namely, flow, mean speed, and density - using data assimilation techniques such as Kalman
filtering or its variations such as extended Kalman filter (EKF) [9]. While these models may
provide acceptable estimations in a range of applications for instance in traffic speed estimation
[27], travel time estimation [31, 32], traffic volume estimation [28], and autonomous vehicle
driving [29, 30, 33, 40], their accuracy in capturing traffic flow’s dynamic pattern is debatable.
In recent years, advancements in machine learning-based methods besides the availability of
large-scale datasets such as probe vehicle data [39] provide the opportunity to attack the link
flow estimation problem from another aspect. Various machine learning-based models, such as
support vector regression (SVR) [10], random forest [11], neural networks (NN) [12], etc., are
utilized to estimate link flows. Several studies have illustrated the superior performance of NN
models in the estimation of various road traffic characteristics ([13-14, 34, 35]).
Contrary to the evident importance of freeway ramps flow, investigating this variable as a
stand-alone problem is overlooked in transportation literature [15]. While there are
methodologies for estimating networkwide hourly traffic volume, their application for ramps
volume estimation is limited. These methods generally require a high-resolution speed profile as
3
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
`
input ([13, 16]), which is not available for most ramps without a designated Traffic Messaging
Channel (TMC) segment.
This study trains neural network models to estimate traffic flow on freeways’ exits (i.e.,
off-ramps). An accurate estimation of the traffic counts on freeways’ ramps is beneficial from
two points of view. Firstly, the information on exiting vehicle counts can greatly benefit traffic
management systems to alleviate traffic congestion in the road network. Secondly, ramp flows,
which are an indicator of the traffic’s general movement, can be added as a useful input to the
OD matrix estimation models if estimated accurately. While the introduced methodology can be
simply modified to be applied on other connection links such as on-ramps, only off-ramps are
considered in this study to leave room for analyzing the Spatio-temporal nature of off-ramp
flows. These analyses are conducted considering two perspectives. The first one focuses on the
effects of several spatial features on the performance of the trained models. The second one
explores the impact of the ground truth volume data on off-ramp flows estimation accuracy. This
analysis is performed through training two models based on different strategies that exist for
volume data collection.
The current study findings are beneficial to the transportation network operators since it
provides them with a model framework, investigates the required input data and data collection
strategies to estimate off-ramp hourly traffic volume.
The rest of this paper is organized as follows. First, we provide an overview of the data
sources used in this study. Afterward, we explain the methodology used to estimate off-ramp
hourly traffic volume. The next section compares the performance of the introduced method
under various scenarios to analyze the effect of Spatio-temporal features on off-ramps flow
estimation accuracy. The paper concludes with some remarks regarding the current study and
suggestions for future research in this area.
DATA
This study's basic idea is to train a neural network model to estimate the hourly exiting flow
counts from a freeway off-ramp, exploring the impacts of feature space and evaluation of vehicle
detector installation strategies on the model performance. The focus of the analysis here is on the
interchanges on the national highway system in the state of California. The model inputs are
obtained from three primary sources and are described in this section. The used data are for the
entire year of 2019.
Vehicle detector flow counts: This input, as the ground-truth of the dependent variable, is
obtained from the Caltrans Performance Measurement System (PeMS) [17], where traffic
conditions are continuously collected every 30 seconds from almost 45,000 detectors
deployed on California's road network as shown in Figure 1. This network comprises
more than 41,000 directional miles, and more than 18,000 traffic count stations are
located on it [14]. Further, the collected data are aggregated into 5-minutes intervals and
uploaded to the PeMS website. However, as the uploaded data are raw, thus prone to
errors, measures are taken to omit the suspicious records described later. The sensor
counts are also used to compute the annual average daily traffic (AADT) which, will also
be input into the model.
4
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
`
Vehicle probe speeds: The vehicle probe speed is obtained from the Regional Integrated
Transportation Information System (RITIS) dashboard [18], where GPS data are used to
estimate the speeds. RITIS is a data-sharing repository created and maintained by the
CATT Lab at the University of Maryland. This website provides several measures from
various vendors for each TMC segment, of which two features of speed and free flow
speed (FFS) are used here. The advantage of having access to several vendors data in the
RITIS dashboard is minimizing the number of missing values for each TMC segment. By
definition, FFS is the speed that vehicles can traverse through the road in the absence of
any restriction on their movement. The computation method is described in the Urban
Congestion Report [19]. The closer the speed in a road segment is to the FFS, the lower
the congestion is and vice versa. Thus, this information can be a reference for the traffic
conditions in each segment.
Infrastructure data: Road characteristics are manually obtained from Google Maps
(2019) and OpenStreetMap (2019). The obtained features are the number of lanes for the
upstream, downstream, ramp, left-turn downstream, left-turn upstream, right turn
upstream, and right turn downstream, route number, and county. Additionally, the
FHWA-approved Functional Classification System (FCS) of each of the two roads in the
interchange is considered, which can be one of the following categories:
oInterstate
oPrincipal Arterial – Other Freeways and Expressways
oPrincipal Arterial – Other
oMinor Arterial
oMajor Collector
oMinor Collector
oLocal
The route number, county, and FCS are incorporated into the model with one-hot
encoding to account for the fact that these attributes are nominal categorical variables.
Temporal data: these attributes comprise the hour of the day (1, 2, …, 24), day of the
week (Monday, Tuesday, …, Sunday), and month of the year (January, February, …,
December). These variables are also fed to the model with one-hot encoding.
Here the interchange selection procedure and cleaning of the raw data are explained. The
movement speed of vehicles in a segment is an appropriate indicator of the traffic conditions, and
many agencies are utilizing this data to monitor and operate their traffic network. As traffic
volumes enter a road in an intersection, they can considerably change the segment's speed. Thus,
the information on speed profiles and how they vary with time can significantly benefit the
estimation of the traffic volumes and traffic conditions at road segments. Here, speeds at
different sections of an interchange reflect the highway's traffic flow patterns and the lower-level
road connected to the highway through this interchange. Figure 2 illustrates the approaches and
speeds that are considered in the model. Note that the illustrated off-ramp setting in this figure is
the most common among the study off-ramps in this study and the off-ramp types are not limited
to the illustrated one.
5
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
`
Figure 1 – Location of the traffic sensors in the road network of California
6
192
193
194
`
Figure 2 - Schematic illustration of the considered speed profiles
In this interchange, “Ramp-1” is the ramp that is under consideration. The speeds that
will be considered in the model are as follows:
- SRT-D, FFSRT-D: downstream speed and FFS of the exiting vehicles that are making a right
turn.
- SLT-U, FFSLT-U: upstream speed and FFS of the exiting vehicles that are making a left turn.
- SLT-D, FFSLT-D: downstream speed and FFS of the exiting vehicles that are making a left
turn.
- SRT-U, FFSRT-U: upstream speed, and FFS of the exiting vehicles that are making a right
turn.
Although the number of installed count stations in California's road network is extensive,
not all of them accurately record counts. Besides, the speed profiles for all segments of interest
are not available. Thus, before feeding the input to the model, a sanity check is required. This
study defines four steps for validating the selected interchanges as follows:
1. Installation of three traffic count sensors: The interchange should have installed traffic
count sensors on the off-ramp as well as upstream and downstream of the off-ramp, as
shown in Figure 3.
2. Defined six unique TMC segments: Designation of unique TMC segments for the
upstream and downstream of the off-ramp and the road connected to the freeway. In
figure 3, the yellow lines illustrate the unique TMC segments that must exist for the
interchange which allow extraction of movement speed at each time interval for each
segment.
3. Conservation of flow: Volume counts on the three detectors should conform with an
assumed maximum allowable deviation of %5 according to Equation 1.
V
Upstream
=V
Downstream
+V
Ramp
+ϵ ,
|
ϵ
|
0.05 V
Upstream
(1)
This equation must hold with
ϵ=0
if the vehicle detection is perfect. However, in real-
world situations there are errors in vehicle detection, which causes the equation to hold
with
|
ϵ
|
>0
. The threshold that we considered in the paper is
|
ϵ
|
max
=%5
.
4. A minimum number of observations: The interchange should have an appropriate number
of observations in the dataset to avoid biasing the dataset resulting in unreliable traffic
count estimates. Based on our experiments in capturing ramp traffic flow variations, we
found that at least one-month worth of data in an entire year (1×30×24=720) is an
appropriate minimum number of records for each interchange.
After cleaning the data, 79 interchanges with a total of 236,552 record rows are obtained as
illustrated in Figure 4 with 316 features, as provided in Table 1 with details. Figure 5 shows the
distribution of the off-ramp hourly counts. It can be seen in this figure that the distribution of
flow counts covers a vast range of volumes, thus making the estimation harder. Also, the highest
number of counts is located around 450 vehicles in an hour, and more than 90 percent of the
observations have an off-ramp hourly count of less than 1000 and more than 100 cars per hour.
7
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
`
Figure 3 – Criteria for interchange selection as model input
8
235
236
237
238
239
240
`
Figure 4 – Location of the 79 obtained interchanges
Table 1 – Input data number of attributes
Feature group Feature name Number of features
Speed features
Segment speed 6
Segment FFS 6
Segment speed relative to FFS 6
Volume features AADT 3
Infrastructure features Interchange number 79
Route number 19
9
241
242
243
`
County 12
FCS 26
Number of lanes 39
Temporal features
Hour of the day 24
Day of the week 7
Day of the month 31
Week of the year 52
Month of the year 12
Total 316
100
0
5
10
15
20
25
0
20
40
60
80
100
Hou rly off-ramp flow counts
Nu m be r of o bservation s 1 0 3)
Cum ulativ e d en s ity fu nc tio n (%)
Figure 5 - Distribution of off-ramp hourly vehicle counts.
To illustrate how the data is distributed temporally, Figure 6 demonstrates the number of
observations for each month, week, day, and hour. According to these radar charts, except for the
hour of the day, the observations are relatively uniformly distributed. The observations'
distribution in different hours of a day illustrates a skewness towards the AM and PM peak
periods and AM off-peak hours, which will not negatively impact this study's analysis as these
hours are the most critical intervals from the operator's and planner's perspective. Another crucial
factor in the distribution of input data is the number of observations per interchange. A summary
of this measure is as follows:
The average number of observations for each interchange: 2,994 observations
The median number of observations for each interchange: 2,650 observations
The standard deviation of observations for each interchange: 1,420 observations
Minimum number of observations for an interchange: 911 observations
Maximum number of observations for an interchange: 6,584 observations
10
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
`
Maximum possible number of observations for an interchange: 365×24=8,760
observations
Thus, on average, 34 percent (=
2994
8760 ¿
of the total possible observations for an interchange is
available for us. This sparsely distributed data makes it even harder for the modeler to train an
accurate and robust model and will be discussed later.
12
1
2
3
4
5
6
7
8
9
10
11
0
20000
40000
52 12345678
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
464748 4950 51
0
1500
3000
4500
6000
(a) (b)
Monday
Tuesday
Wednesday
ThursdayFriday
Saturday
Sunday
0
20000
40000
24 1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
0
10000
20000
(c) (d)
Figure 6 - Distribution of the total observations for (a) each month of the year, (b)
each week of the year, (c) each day of the week, and (d) each hour of the day.
METHODOLOGY
This study explores modeling the exiting flow counts from a freeway in an hour. The
model selected for this study is a fully connected feedforward multi-layer NN. As a deep
learning model class, neural networks have great potential in data analysis and forecasting
because of using distributed and hierarchical feature representation [20]. Especially in the case of
11
260
261
262
263
264
265
266
267
268
269
270
271
272
`
traffic volume estimation, previous studies [13] have shown the superiority of the fully
connected feedforward multi-layer NN relative to other models. Additionally, several studies
have illustrated the superiority of deep learning models relative to other statistical models as the
number of observations in the input data increases [13, 37, 38]. A neural network model consists
of neurons stacked in a number of layers, the first one being the input features, the last one
representing the model prediction, and the middle ones are hidden layers of the model. The
structure of these feedforward fully-connected models is in a manner that each neuron is
connected to all the neurons in its previous layer. Typically, two issues arise during the training
and testing of these models. The first problem is overfitting the model to the training set, in
which the model learns the patterns in the training set. However, it cannot produce reliable
estimates on other datasets. The overfitting problem is most often observed in models with too
many parameters; however, there are common approaches for tackling it, such as L1 and L2
regularizations, the addition of drop out layers, and using the k-fold cross-validation technique.
Generally, regularization methods tend to reduce overfitting by penalizing excess estimated
weights by incorporating them in the loss function [21]. In the addition of the dropout technique,
in each training step, randomly selected neurons are temporarily ignored, and weight updates are
not applied to the neuron in the backward pass [22]. The k-fold cross-validation technique
divides the entire input data into k subsets, and trains and tests the model in k steps. At each step,
one of the subsets forms the testing set while the rest of the subsets form the training set.
The second issue in deep learning models is computational efficiency during the training
procedure. While the most popular activation function for shallow networks is the sigmoid
function, this function is too slow for deep networks - due to having a derivative between -0.25
and 0.25 [13]. Thus, to increase the efficiency in the backpropagation process, the Rectified
Linear Unit (ReLU) function, a more efficient function [23], is used as the activation function
with the following Equation:
f
(
x
)
=max (0, x )
(2)
In this study, several NN models are trained with four hidden layers, each layer with 256
neurons to estimate off-ramps' hourly flow. For each model, the training set is randomly drawn
25 times and is tested on the rest of the data. This procedure ensures capturing the effects of data
variations on model estimates. The input variables are normalized to reduce the variations and
adjusting the scales [24], and the batch normalization technique is used to adjust the scales of
neurons [25]. As L2 regularization is not sufficient for tackling the overfitting issue in deep NNs
[13], we also add random dropout layers. The algorithm employed for training the models is the
Adaptive Moment Estimation (Adam) algorithm [26], a well-known stochastic gradient-based
optimization method capable of handling high-dimensional parameter spaces in non-convex
optimization problems.
The NN model structure in Figure 7 illustrates the input variables, four hidden layers,
each with 256 neurons, dropout layers, and the activation functions. The loss function of the NN
models is assumed to be the average of the squared errors between observed and estimated
turning movement counts plus the regularization term, according to the formulation in Equation
(3).
12
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
`
Loss=1
n
i=1
n
(
^
y
i
y
i
)
2
+λ
2n
l=1
L
i=1
n
[l1]
j=1
n
[l]
(w
ij
[l]
)
2
(3)
In equation 7,
n is the number of observations;
^
yi
is the estimated hourly turning movement count for observation i;
yi
is the observed hourly turning movement count for observation i;
λ
is the regularization parameter;
w
ij
[l]
is the weight factor associated with neuron j in layer l from neuron i in layer l-1.
The performance measures considered for evaluating the models are as follows:
Mean Squared Error (MSE):
A measure for quality of estimates with a higher penalty for higher errors.
Figure 7 – Structure of the NN models
MSE=1
n
i=1
n
(
^
yiyi)2
(4)
Mean Absolute Percentage Error (MAPE):
A measure that represents the relative accuracy of the model estimates.
MAPE=
(
1
n
i=1
n
|
^
y
i
y
i
y
i
|
)
100
(5)
Coefficient of determination (R2):
A measure indicating the proportion of traffic volume variance explained by the model.
R
2
=1
i=1
n
(
y
i
^
y
i
)
2
i=1
n
(
y
i
− ´y
)
2
(6)
where
´y
is the average of turning movement counts in an hour of all observations, and
other parameters are introduced earlier.
Typically, the installation of traffic count sensors by SHA and DOTs of each state is a
combination of permanently and temporarily installed detectors. The permanently installed
detectors continuously collect data for a long duration (e.g., more than a month). In contrast,
temporarily installed sensors collect data at a much shorter period and afterward replaced it to
13
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
`
another site. This study analyzes the exiting flow counts from two distinct perspectives of feature
space evaluations and detector installation strategies, contending with detector installation
strategies. The benefits of inputting the segments and traffic flow characteristics of the lower-
level road connected to the freeway are explored in the former perspective. The latter evaluates
the estimation performance of the models according to the sensor installation strategy.
ANALYSIS AND RESULTS
Explainability of Connected Lower-Level Road Inputs
A road network can be considered as a connected graph in which the links and vehicle
movements in the links interact. In the context of exit flow from a freeway, the traffic flow
conditions of the lower-level road connected to the freeway can play an important role.
Therefore, feeding the features of these roads to the ramps flow estimation model is intuitively
interpretable. However, to evaluate the explainability of these features, two separate models are
trained and compared in this study [12]. In the first model, the "base model," the lower level road
characteristics are not incorporated into the model. Thus, the model considers the attributes
presented in Figure 7 solely for the freeway upstream and downstream segments. The results of
training and testing this model are presented in Table 2. The second model, the "all-inclusive
model," incorporates the base model features in addition to the attributes in Figure 7 for each one
of the following segments of the lower-level road:
right-turn upstream,
right-turn downstream,
left-turn upstream, and
left-turn downstream.
To contend with the previously described detector installation scheme, the selection of data for
training the exiting flow count models is as follows:
permanent vehicle detector installation for nine interchanges yielding one year of data for
each one,
temporary sensor installation for 70 interchanges yielding one week of data for each one.
The summary statistics of this model are presented in Table 3. Note that each model is trained
and tested using k-fold cross-validation with k=25. In comparing the performance measures of
these models, it can be seen that the all-inclusive model has reduced the average validation
MAPE by almost nine percent from 32 percent to 23 percent. Also, the average validation R-
squared has risen from 0.73 to 0.89. additionally, in terms of variations in performance measures,
it is observed that the all-inclusive model is more stable, thus providing more reliable
estimations.
Table 2 – summary statistics of the results of the base model
Measure Average Standard
Deviation
Median Minimum Maximum
Training MSE 5,585 503 5,282 5,140 6,233
Validation MSE 23,944 2,922 23,413 19,884 28,209
Training MAPE 25.41 2.74 24.29 16.4 35.4
Validation
MAPE 32.19 3.72 33.6 23.71 38.32
Training R20.847 0.03 0.855 0.752 0.929
14
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
`
Validation R20.734 0.05 0.757 0.578 0.856
Training Set Size 25,646 794 25,982 23,713 26,606
Runtime (sec.) 1,487 210 1,523 1,218 1,693
Table 3 – summary statistics of the results of the all-inclusive model
Average Standard
Deviation
Median Minimum Maximum
Training MSE 4,739 906 4,654 3,788 6,236
Validation MSE 19,054 1,866 18,555 16,298 22,136
Training MAPE 17.29 1.25 16.2 12.19 22.91
Validation
MAPE 22.93 2.32 22.13 15.48 30.56
Training R20.919 0.01 0.923 0.846 0.969
Validation R20.888 0.022 0.895 0.838 0.923
Training Set Size 26,284 360 26,438 25,523 26,811
Runtime (sec.) 1984 228 1985 1820 2,064
Detector Installation Strategies
The installation of detectors for the previous models is assumed to be a combination of
permanently and temporarily installed sensors. However, to explore each installation scheme's
contribution to the model performance, two distinct models are trained and tested. In the first
model, the "permanent detector installation model," it is assumed that the vehicle detectors are
permanently installed on several interchanges; thus, the ground truth data for these interchanges
are available for the entire year. This strategy aims to estimate the hourly exiting flow counts at
interchanges without installed detectors in the whole hours of a year. In the second model, the
"temporary detector installation model," the sensors are installed for a short time interval at
several interchanges. Thus, ground truth data for these interchanges are available for a short
duration (e.g., with a maximum of one week). The goal here is to estimate hourly exiting flow
counts at these interchanges during the hours of the year when detectors were not installed. The
training and testing sets are 25 times randomly drawn accordingly to conform to each strategy.
Permanent detector installation model
In the strategy of installing sensors permanently, the model is trained on the data of the
interchanges with vehicle detectors. Further, the proportions of turning movements at other
interchanges are estimated to evaluate the model's accuracy. In this method, each random draw's
training set is obtained by dividing the data into two sets of training and testing, according to
interchange IDs; thus, resulting in two distinct sets of interchanges. For each random draw, 70
interchanges are selected (almost equivalent to 90 percent of all data interchanges) for model
training, and the rest is utilized for testing the model. The results of training and validating the
model for 25 training and validation sets drawn randomly from the whole dataset are shown in
Table 4. According to the table results, the average error percentage of the model over the
15
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
`
validation set is 50% and has relatively high variations, thus illustrating that the model estimates
are unstable. This finding is further repeated based on the R 2, which has a negative minimum
showing that on some validation sets, the model is performing worse than the reported average
off-ramp flow. This model's low performance is an interesting finding considering that the
number of input data to the model is more than nine times that of previous models. Thus, one
might reasonably deduce that the permanent detector installation's contribution to the all-
inclusive model is almost negligible.
Table 4- summary statistics of the results of the permanent detector installation model
Average Standard Deviation Median Minimum Maximum
Training MSE 21,296 1,639 20,134 17,479 25,383
Validation MSE 30,277 2,252 29,768 26,180 35,862
Training MAPE 45.38 2.43 46.34 41.24 48.94
Validation
MAPE 51.45 19.48 50.09 25.22 78.98
Training R20.517 0.043 0.509 0.411 0.627
Validation R20.417 0.250 0.433 -0.344 0.575
Training Set Size 213,119 2,922 214,372 206,950 217,392
Runtime (sec.) 2,211 148 1,013 2,107 2,395
Temporary detector installation model
In this method, for each interchange, one and only one week is selected from the whole
dataset and is added to the training dataset. Randomly selecting a week for each interchange in
the real world is equivalent to temporarily installing sensors for a single week on each
interchange. As described earlier, the average number of observations in a week for each
interchange in the dataset is almost 34 percent of the maximum possible observations in a week
due to a low-quality data collection procedure. This model is trained with 25 different random
training validation sets. The model performance summary is presented in Table 5. According to
the table results, this model is relatively stable; however, it is weaker than the all-inclusive
model. For instance, the average validation MAPE has increased from 23 to 29 percent, and the
average validation R-squared has reduced from 0.89 to 0.76. Interestingly, for the total number
of 79 interchanges, the average number of observations used for training the model is 3,253,
which means that on average, for each interchange, less than 42 hours worth of data (
¿3253
79
) is
employed for training the model. Thus, installing sensors on interchanges for two days and
nights enables training a model that can estimate the exiting flow counts with an average of 30
percent error for the entire year.
Table 5- summary statistics of the results of the temporary detector installation model
Average Standard Deviation Median Minimum Maximum
Training MSE 5,438 833 5,535 5,039 5,742
Validation MSE 21,756 2,319 22,173 17,625 25,451
16
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
`
Training MAPE 21.34 3.99 19.75 13.01 29.9
Validation
MAPE 29.12 4.14 35.92 19.44 38.09
Training R20.943 0.021 0.948 0.866 0.969
Validation R20.763 0.115 0.788 0.578 0.916
Training Set Size 3,253 212 3,253 2,770 3,697
Runtime (sec.) 2,758 1,152 2,457 1,227 4,643
SUMMARY AND CONCLUSION
Freeways are a significant determinant of the road network traffic conditions considering
that a large volume of vehicles traverse them. The interaction between freeways and the rest of
the network is through on and off-ramps. An accurate estimation of the ramps' traffic volume can
benefit the system operators in reducing traffic congestion, thus increasing mobility. This study
estimates the hourly off-ramp traffic volume using a fully connected feedforward multi-layer
NN. The proposed NN model has four fully connected hidden layers, each layer consisting of
256 neurons with the assumption of the ReLU activation function. For addressing the overfitting
and computational efficiency of the model, L2 regularization, batch normalization, and addition
of drop out layers are performed.
The data source of ramp traffic volume is the PeMS website, a repository for volume
count records, among other traffic flow measures of California's road network. After cleaning the
data, 79 interchanges' observations remain with 236,552 records for the year 2019. The average
number of observations for each interchange in the dataset is 2,994 indicating the data's sparsity
and a low-quality data collection. All models are trained and tested 25 times with random data
splits to account for the dataset variations and enable a more robust generalization of the
findings.
In the first step, we explored the impacts of inputting the connected lower-level road
features and its traffic flow characteristics on the model's performance. Thus, two models are
trained and tested; the first one, the "base model," without the mentioned attributes, and the
second one, the "all-inclusive model," with the inclusion of those attributes. As the detector
installation in real-world situations is a combination of permanent and temporarily installed
detectors, a combined detector installation is assumed for each of these models. Nine randomly
selected interchanges are considered to have permanent detectors installed; thus, their entire
year's data is added to the training set. For the rest of the 70 interchanges, only the data collected
during a single week is incorporated into the training set. The base model had an average
validation MAPE of 32 percent, while the same measure for the all-inclusive model was obtained
as 23 percent. The average validation R2 of the former model was 0.73, while the latter one was
0.89. Additionally, in terms of the model performance's stability, the all-inclusive model is more
stable than the base model in both measures. Therefore, it is concluded that a significant gain in
the flow estimation model's performance is achieved by incorporating the input features of the
connected lower-level roads.
In the second step of the analysis, the contribution of each detector installation strategy
(e.g., permanent or temporary) to the all-inclusive model is examined. In the permanent
17
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
`
installation model, yearly data of 70 randomly selected interchanges are used for training. The
results of hourly off-ramp flow estimation of this model on the nine remaining interchanges
illustrated a weak performance for this model – a minimum validation R2 of negative is reported.
This model's poor performance can be attributed to the fact that the model cannot construct the
dependencies and correlations between ramps' traffic volumes at different interchanges. The
temporary installation model yields an acceptable performance considering that the model uses
the data of less than 42 hours for each interchange on average. Thus, it is concluded that the
model can capture and establish the relations between the volume counts and the temporal
attributes.
This study sheds more light on an area that has not attained much attention in previous
studies. The findings help researchers gain more insight into the traffic volume estimation of off-
ramps regarding the required data and the modeling framework. Besides, this study provides
valuable information for the transportation system operators about the appropriate traffic sensor
installation strategies, yielding a more accurate traffic volume estimation model. Another
contribution of the current research is in ramp metering applications, where having a traffic
volume estimation is priceless. As a part of the future works, the introduced methodology will be
expanded to consider both on- and off-ramps to establish a more general framework for ramps'
traffic volume estimation. Besides, to address the detrimental impacts of the road network
structure absence in the proposed methodology, more advanced models capable of embedding
this structure and building the relations between link traffic flow characteristics will be employed
in future studies. Finally, the methodology of the current study will be developed to address
traffic flow prediction as well.
AUTHOR CONTRIBUTION STATEMENT
The author(s) do not have any conflicts of interest to declare.
REFERENCES
[1] Cascetta, Ennio. Transportation systems analysis: models and applications. Vol. 29. Springer
Science & Business Media, 2009.
[2] Bera, Sharminda, and K. V. Rao. "Estimation of origin-destination matrix from traffic counts:
the state of the art." (2011).
[3] Maher, M. J. "Inferences on trip matrices from observations on link volumes: a Bayesian
statistical approach." Transportation Research Part B: Methodological 17, no. 6 (1983): 435-447.
[4] Cascetta, Ennio, Domenico Inaudi, and Gerald Marquis. "Dynamic estimators of origin-
destination matrices using traffic counts." Transportation Science 27, no. 4 (1993): 363-373.
[5] Zhou, Xuesong, and Hani S. Mahmassani. "Dynamic origin-destination demand estimation
using automatic vehicle identification data." IEEE Transactions on intelligent transportation
systems 7, no. 1 (2006): 105-114.
18
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
`
[6] De Villa, Aleix Ruiz, Jordi Casas, Martijn Breen, and Josep Perarnau. "Minimising GEH in
static OD estimation." In Australasian Transport Research Forum Proceedings, vol. 4. 2013.
[7] Marzano, Vittorio, Andrea Papola, and Fulvio Simonelli. "Limits and perspectives of
effective O–D matrix correction using traffic counts." Transportation Research Part C: Emerging
Technologies 17, no. 2 (2009): 120-132.
[8] Peeta, Srinivas, and Athanasios K. Ziliaskopoulos. "Foundations of dynamic traffic
assignment: The past, the present and the future." Networks and spatial economics 1, no. 3-4
(2001): 233-265.
[9] Wang, Yibing, and Markos Papageorgiou. "Real-time freeway traffic state estimation based
on extended Kalman filter: a general approach." Transportation Research Part B:
Methodological 39, no. 2 (2005): 141-167.
[10] Vanajakshi, Lelitha, and Laurence R. Rilett. "A comparison of the performance of artificial
neural networks and support vector machines for the prediction of traffic speed." In IEEE
Intelligent Vehicles Symposium, 2004, pp. 194-199. IEEE, 2004.
[11] Ou, Jishun, Jingxin Xia, Yao-Jan Wu, and Wenming Rao. "Short-term traffic flow
forecasting for urban roads using data-driven feature selection strategy and Bias-corrected
random forests." Transportation Research Record 2645, no. 1 (2017): 157-167.
[12] Zahedian, Sara, Przemysław Sekuła, Amir Nohekhan, and Zachary Vander Laan.
"Estimating Hourly Traffic Volumes using Artificial Neural Network with Additional Inputs
from Automatic Traffic Recorders." Transportation Research Record 2674, no. 3 (2020): 272-
282.
[13] Sekuła, P., Marković, N., Laan, Z.V., Sadabadi, K.F., 2018. Estimating historical hourly
traffic volumes via machine learning and vehicle probe data: a Maryland case study. Transp.
Res. Part C 97, 147–158.
[14] Parsa, A.B., Taghipour, H., Derrible, S. and Mohammadian, A.K., 2019. Real-time accident
detection: coping with imbalanced data. Accident Analysis & Prevention, 129, pp.202-210.
[15] Kan, Yuheng, Yibing Wang, Dianhai Wang, Jian Sun, Chunfu Shao, and Markos
Papageorgiou. "A Novel Approach to Estimating Missing Pairs of On/Off Ramp Flows." IEEE
Transactions on Intelligent Transportation Systems (2020).
[16] Schrank, D., Eisele B., Lomax T., Bak J., 2015, Appendix A: Methodology for the 2015
Urban Mobility Scorecard. Technical Report. Texas Transportation Institute, Texas A&M
University. http://tti.tamu.edu/documents/mobility-scorecard-2015-wappx.pdf
[17] “Caltrans, Performance Measurement System (PeMS),” 2019. [Online]. Available:
http://pems.dot.ca.gov
[18] RITIS (2020), ‘The regional integrated transportation information system (RITIS),’
http://www.cattlab.umd.edu/?portfolio=ritis. Accessed: 2020-01-14.
19
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
`
[19] The Urban Congestion Report (UCR): Documentation and Definitions. Operations
Performance Measurement Program, FHWA, U.S. Department of Transportation.
https://ops.fhwa.dot.gov/perf_measurement/ucr/documentation.htm. Accessed November 3,
2020.
[20] Huang, W., Song, G., Hong, H., & Xie, K. (2014). Deep architecture for traffic flow
prediction: deep belief networks with multitask learning. IEEE Transactions on Intelligent
Transportation Systems, 15(5), 2191-2201
[21] Ng, A. Y. (2004, July). Feature selection, L 1 vs. L 2 regularization, and rotational
invariance. In Proceedings of the twenty-first international conference on Machine learning (p.
78).
[22] Dahl, G. E., Sainath, T. N., & Hinton, G. E. (2013, May). Improving deep neural networks
for LVCSR using rectified linear units and dropout. In 2013 IEEE international conference on
acoustics, speech and signal processing (pp. 8609-8613). IEEE.
[23] Krizhevsky, A., Sutskever, I. and Hinton, G.E. Imagenet classification with deep
convolutional neural networks. Communications of the ACM, 60(6), pp.84-90, (2017).
[24] Bishop, C. M. (1995). Neural networks for pattern recognition. Oxford university press.
[25] Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by
reducing internal covariate shift. arXiv preprint arXiv:1502.03167.
[26] Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint
arXiv:1412.6980.
[27] He S, Guo X, Ding F, Qi Y, Chen T. Freeway traffic speed estimation of mixed traffic using data from
connected and autonomous vehicles with a low penetration rate. Journal of advanced transportation.
2020 Jun 13;2020.
[28] Xie Y, Zhang Y, Ye Z. Shortterm traffic volume forecasting using Kalman filter with discrete wavelet
decomposition. ComputerAided Civil and Infrastructure Engineering. 2007 Jul;22(5):326-34.
20
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
... Freeways are also connected to the lower mobility and higher accessibility arterials via on-ramps and off-ramps. A major traffic flow on freeways uses the off-ramps to get access to arterials, and a major traffic flow on arterials uses the on-ramps to get into the freeways (Nohekhan et al., 2021). Road users expect to move along their daily trips with the highest level of speed and reliability (Jalali Khalilabadi et al., 2023;Karimpour et al., 2023;Ma et al., 2023a). ...
... However, traffic flows of on-ramps and off-ramps are not always readily available to transportation agencies. That is, not all ramps are equipped with data collection sensors (such as loop detectors) for data collection, and/or they might not have designated traffic messaging channels for data communication (Nohekhan et al., 2021). Usually, transportation agencies face two major types of missing traffic data at onramps and off-ramps: 1) instances where continuous communication loss happens, which could lead to complete or partial missing data, and 2) instances where no physical sensor is installed for data collection. ...
... Comparably, less effort has been made to estimate missing traffic data at on-ramps and off-ramps in locations where no physical sensors have been installed (Kan et al., 2021;Nohekhan et al., 2021). Traditional approaches mainly focused on the estimation of link flow using the origin-destination (OD) matrix. ...
... In deep learning approaches, Nohekhan et al. [10] proposed using a deep learning model to estimate the hourly traffic volume of highway off-ramps. The downstream traffic volumes, probe speeds, and infrastructure characteristics of the road segments were used as explanatory variables. ...
Article
Full-text available
This study aims to develop a method for estimating the traffic volume at locations and time periods where past data are unavailable. To achieve this, we constructed a model based on seven years of traffic volume observations on 22 main roads in Tokyo. The estimation method comprises two components: a traffic volume fluctuation pattern model that explains pre-clustered patterns and an annual average daily traffic model. Through our analysis, we made three key observations. First, the distance to the city center is a significant explanatory factor for monthly fluctuations, the month number for weekly fluctuations, the day-to-night population ratio for daily fluctuations, and the city center direction angle for hourly fluctuations. These calendar and road geometry variables account for 70-90% of the traffic volume fluctuation. Second, when estimating traffic volume for specific time periods using the traffic volume fluctuation pattern model, we explained approximately 60% of routes where no traffic volume observations were conducted and approximately 80% of routes where observations were made infrequently. Finally, the optimal number of clusters for the traffic volume fluctuation pattern, which maximized the coefficient of determination of the traffic volume model, was 8 for monthly fluctuations, 4 for weekly fluctuations, 6 for daily fluctuations, and 11 for hourly fluctuations. These findings made it possible to estimate traffic volumes on routes where traffic volume observations are not conducted and to interpolate traffic volumes for any given month or date using the results of periodic surveys conducted every few years.
... In missing data manipulation literature, three main strategies are established: (i) direct analysis, (ii) imputation, and (iii) weighting complete records (Little 1988). Complete-case and pairwise deletion are the most common approaches in direct analyses which are widely employed in crash data analysis (Nohekhan et al. 2021;Rahimi et al. 2020). Imputation is based on substituting missing values in a dataset with better estimates. ...
... Reference [34] estimated freeway off-ramps hourly traffic volume using an ANN. is paper illustrated that the offramp volume estimation can achieve acceptable accuracy when proper data are fed into the model. An exhaustive review and classification of the deep learning methods and models used in the estimation of road traffic parameters is also included in [2]. ...
Article
Full-text available
This paper focuses on the problem of model transferability for machine learning models used to estimate hourly traffic volumes. The presented findings enable not only an increase in the accuracy of existing models but also, simultaneously, reduce the cost of data needed for training the models—making statewide traffic volume estimation more economically feasible. Previous research indicates that machine learning volume estimation models that leverage GPS probe data can provide transportation agencies with accurate estimates of hourly traffic volumes—which are fundamental for both operational and planning purposes—and do so with a higher level of accuracy than the prevailing profiling method. However, this approach requires a large dataset for model calibration (i.e., input and continuous count station data), which involves significant monetary investment and data-processing effort. This paper proposes solutions, which allow the model to be prepared using a much smaller dataset, given that a previously collected dataset, which may be gathered in a different place and time period, exists. Based on a broad selection of experiments, the results indicate that the proposed approach is capable of achieving similar model performance while collecting data for a 5 times shorter time period and utilizing 1/4 of the number of continuous count stations. These findings will help reduce the cost of preparing and maintaining the traffic volume models and render the traffic volume estimations more financially appealing.
Article
Each freeway stretch’s traffic states are indispensable in freeway traffic modeling, surveillance, and control. However, the unmeasured ramp pairs always exist in real-world freeway systems, and how to estimate the flows of those ramps is a longstanding and tricky issue. Set the stretch with intact traffic states as Source Stretch while the stretch with the unmeasured ramp pair as Target Stretch; existing work tries to train the non-transfer machine learning model like Random Forest by Source Stretch and act on Target Stretch. However, the estimation accuracy of non-transfer machine learning models could not be guaranteed because the mainstream traffic state distributions of the above two stretches are not the same, and the model structure is too simple to capture traffic flow’s temporal dependencies. Note the great success of transfer learning in distribution-changed situations; this paper addresses this issue via transfer learning and deep learning. First, the Gated Recurrent Unit-Based Ramp Flow Estimator is designed to establish the relationship between the mainstream traffic states and ramp flows in Source Stretch. Then, taking the trained estimator as the backbone, we propose the Deep Domain Adaptation to match the marginal distribution difference between Source Stretch and Target Stretch; design the Model Transfer to reduce the conditional distribution difference (i.e., estimator difference) between Source Stretch and Target Stretch. The two approaches both improve the performance of the Source Stretch’s estimator in Target Stretch. Finally, we evaluate the processed approach in two real-world freeway traffic datasets and observe satisfactory results.
Article
Based on Nagel–Schreckenberg (NaSch) traffic flow model, a novel fuel consumption model of on-ramp with acceleration lane under open boundary condition is proposed. Phase diagram is determined by analysis of spatial–temporal patterns. The phase diagram is divided into free flow on the main upstream and the on-ramp; free flow on the main upstream and low-speed motion on the on-ramp; low-speed motion on the main upstream and free flow on the on-ramp; low-speed motion on the main upstream, and global congestion on the on-ramp. The difference in the appearance of phase diagram is found to be related to the length of acceleration lane. The longer of acceleration lane, the more prone to low-speed motion on the main upstream. The numerical computations for fuel consumption in the injected probability of the main road, the injected probability of the on-ramp, and the length of acceleration lane are carried out. Fuel consumption is explained by the traffic flow, average velocity, and lane-change frequency of the accelerated lane. The injected probability of the main road and the on-ramp has a great impact on fuel consumption in free flow. The highest fuel consumption occurs during low-speed motion, whereas the lowest fuel consumption is observed during global congestion. Moreover, the results indicate that fuel consumption on the main road increases potentially by curtailing the length of acceleration lane in low-speed motion of the main road. Graphical Abstract
Article
Full-text available
Nonrecurring traffic events, including improvement actions and vehicular accidents, cause traffic congestion and travel delay. The impact of a traffic event, even when the event is quickly cleared, may last considerably longer than the event itself and may be incurred over long distances. To assess traffic impacts, identification of this larger impact area defined over time and space is required. This paper presents a statistical learning-based traffic event impact area identification method based on concepts of k-means clustering and LOcal regrESSion (LOESS). The proposed method was applied to, and evaluated on, traffic data collected by probe vehicles for three corridors within the Commonwealth of Virginia at 1-min increments over a seven-month period, producing over 14 million data records. Additional traffic incident and work zone data were also used in the analyses. The methodology’s performance was evaluated through the concept of unit delay and was compared against other methods for which data requirements could be fulfilled. Results of the evaluation show that the proposed methodology accurately identifies the traffic event impact area as compared with static methods commonly used. Comparing with dynamic methods, the proposed method has simple data requirements and needs less computational effort. It is also applicable across event types with no need for calibration.
Article
Full-text available
To improve driving safety on superhighways, longitudinal profile design parameters of a superhighway are calculated via force analysis while a car is driven on a slope. The calculations consider characteristics of drivers, cars, and roads. According to the vehicle type, design speed, and natural conditions, the maximum longitudinal slope of a superhighway is calculated and compared with those of an ordinary superhighway and high-speed railway. Based on analysis of the vehicle climbing performance, braking performance, and driver visual characteristics, the maximum and minimum slope lengths of a superhighway are calculated. By analyzing the elements of vertical curves, the minimum radius and minimum length of the vertical curves of a superhighway are calculated by considering factors such as mitigating the impact at the slope bottom, driving at night, and driving time along vertical curves. Analysis and calculation results show that when the maximum longitudinal slope is 2.50%, 2.25%, and 2.00%, the minimum slope length is 450 m, 400 m, and 350 m, respectively, and the minimum vertical curve length is 145 m, 130 m, and 115 m, respectively, and the superhighway travel requirements can be satisfied at speeds of 180 km/h, 160 km/h, and 140 km/h, respectively.
Article
Full-text available
Having access to accurate travel time is of great importance for both highway network users and traffic engineers. The travel time which is currently reported on highways is usually estimated by employing naïve methods and using limited sources of data. This could result in unreliable and inaccurate travel time prediction which causes inconvenience for travelers. Despite abundant effort on predicting travel time, the dynamic nature of travel time was neglected in many studies. Therefore, the main objective of this study is to predict the short-term travel time of highways dynamically, using multiple data sources including loop detectors, probe vehicles, weather condition, geometry, accidents, road works, special events, and sun glare. To this end, three well known machine learning methods, Artificial Neural Network, K-Nearest Neighbors and Random Forest, are employed to predict and compare short-term travel time for prediction horizons of 5 min in one hour ahead. The results confirm satisfying performance of models, especially Random Forest, in short-term travel time prediction. Our analyses also show that traffic variables, especially occupancy, are the most effective variables in predicting the travel time. Finally, we proposed an approach for a dynamic prediction of travel time for a corridor. Interestingly, the results of our dynamic approach improved the accuracy of travel time prediction over the snapshot travel time prediction.
Article
Full-text available
Connected and autonomous vehicles (CAVs) are on the way to the field application. In the beginning stage, there will be a mixed traffic flow, containing the regular human-driven vehicles and CAVs with a low penetration rate. Recently, the discussion about the impact of a small proportion of CAVs in the mixed traffic is controversial. This paper investigated the possibility of applying the limited data from these lowly penetrated CAVs to estimate the average freeway link speeds based on the Kalman filtering (KF) method. First, this paper established a VISSIM-based microsimulation model to mimic the mixed traffic with different CAV penetration rates. The characteristics of this mixed traffic were then discussed based on the simulation data, including the sample size distribution, data-missing rate, speed difference, and fundamental diagram. Accordingly, the traditional KF-based method was introduced and modified to adapt data from CAVs. Finally, the evaluations of the estimation accuracy and the sensitive analysis of the proposed method were conducted. The results revealed the possibility and applicability of link speed estimation using data from a small proportion of CAVs.
Conference Paper
Full-text available
Recent advances in artificial neural networks (ANN) pave the way for the development of sophisticated models that helps to predict human movements more realistically compared to traditional models. Benefiting from that, this study aims to predict pedestrian walking behavior when passing through a bottleneck in a heterogeneous flow involving individuals with disabilities. A novel long-short-term memory (LSTM)-based deep neural network is designed, capable of handling different walking behaviors of individuals with and without disabilities. This network consists of three different modules: Disability Module, Environment module, and Trajectory Prediction Module. The first two modules try to extract the important information about the overall walking behaviors of individuals and their neighboring figures, respectively, while the last module uses that information along with the previous spatial and speed information of each individual to predict his/her future spatial information. For training and testing the network, the data of a large scale experiment on pedestrian walking behavior involving individuals with disabilities was used. This data consists of several experiments, each of which tries to capture the essence of individuals’ walking behavior in a different situation. By sequencing and normalizing the input data and applying regularization techniques, the network is trained successfully. The results are compared to that of the state-of-the-art models, demonstrating that the network is capable of predicting trajectories of pedestrians with and without disabilities more realistically. It outperforms other models such as the social force model (SFM), shallow fully connected neural networks, and plain LSTMs in term of mean square error.
Article
Providing the users of a dynamic tolling system with predictions of tolling prices and the travel time difference between the toll road and the alternative routes enables them to make their travel decisions before starting their trip. This study aims to provide accurate predictions of tolling price through training and testing random forest, multilayer perceptron, and long short-term memory models and compare them with the current situation that the best prediction is extending the current toll to the next timesteps. The prediction time horizon includes five 6-minute time intervals ahead of the present time. The prediction performance of models over the testing set reveals that while all the models were significantly better than the base model, the random forest outperforms all models. For instance, while in the trained models, the mean absolute error range is from $1.5 to $2.5 for the next six minutes to the next 30 minutes, respectively, the same measure in the base model is in the range of $2.5 to $6. The prediction of travel time difference along the toll road and its alternative route with the shortest travel time revealed that the multilayer perceptron performs marginally better than the base model. However, due to a relatively stable travel time difference, the current travel time difference is an acceptable prediction for the next 30 minutes prediction horizon.
Article
Road reconstruction and the resulting work zones are considered as a major source of traffic congestion and delays on freeways. The roadway capacity is decreased as a result of a reduced number of traffic lanes, narrower lanes, and work zone speed limits. Accurate prediction of construction work zone capacity helps traffic engineers to have a better estimation of the traffic flow characteristics. To this end, multiple methodologies have been developed to quantify the impacts of work zones on traffic flow. This paper presents a critical review of the three types of approaches to estimating construction work zone capacities, including parametric, non-parametric, and simulation. Then the most commonly considered factors and their frequency are presented. It also performs a detailed review of the approaches, their objectives, and weaknesses. Lastly, it provides recommendations for future research. The presented work could help researchers in the area of work zone capacity estimation by presenting all the previous methodologies in one place.
Article
Traffic volume is a critical piece of information in many applications, such as transportation long-range planning and traffic operation analysis. Effectively capturing traffic volumes on a network scale is beneficial to Transportation Systems Management & Operations (TSM&O). Yet it is impractical to install sensors to cover a large road network. To address this issue, spatial prediction techniques are widely performed to estimate traffic volumes at sites without sensors. In retrospect, most relevant studies resort to machine learning methods and treat each prediction location independently during the training process, ignoring the potential spatial dependency among them. This paper presents an innovative spatial prediction method of hourly traffic volume on a network scale. To achieve this, we applied a state-of-the-art tree ensemble model - extreme gradient boosting tree (XGBoost) - to handle the large-scale features and hourly traffic volume samples, due to the model's powerful scalability. Moreover, spatial dependency among road segments is taken into account in the proposed model using graph theory. Specifically, we created a traffic network graph leveraging probe trajectory data, and implemented a graph-based approach - breadth first search (BFS) - to search neighboring sites in this graph for computing spatial dependency. The proposed spatial dependency feature is subsequently incorporated as a new feature fed into XGBoost. The proposed model is tested on the road network in the state of Utah. Numerical results not only indicate high computational efficiency of the proposed model, but also demonstrate significant improvement in prediction accuracy of hourly traffic volume comparing with the benchmarked models.
Article
A freeway stretch with even one pair of unmeasured on/off-ramps is not fully observable in traffic states. Flow observability is essential for freeway traffic modeling, surveillance, and control. It is a longstanding and tricky issue to estimate flows for unmeasured ramp pairs. This problem seems to be hardly tractable in conventional approaches, and this paper intends to handle it based on machine learning. The work was partially inspired by transfer learning. Consider that no measurements are available for a target ramp pair, and the knowledge about ramp flow estimation may be drawn from other (measured) ramp pairs, provided that measured and unmeasured ramp pairs would share similarities in some key traffic flow patterns. Two simple machine learning algorithms, random forest (RF) and gradient boosting machine (GBM), were employed to this end. RF and GBM were driven by real measurement data to establish models that relate ramp flows to adjacent mainstream traffic conditions. The models were then applied for our task. The estimation performance was evaluated using real measurement data from the Shanghai Urban Expressway and the Intercity Highway in California, with satisfactory results obtained.
Article
Traffic volumes are an essential input to many highway planning and design models; however, collecting this data for all road network segments is neither practical nor cost-effective. Accordingly, transportation agencies must find ways to leverage limited ground truth volume data to obtain reasonable estimates at scale on the statewide network. This paper aims to investigate the impact of selecting a subset of available automatic traffic recorders (ATRs) (i.e., the ground truth volume data source) and incorporating their data as explanatory variables into a previously developed machine learning regression model for estimating hourly traffic volumes. The study introduces a handful of strategies for selecting this subset of ATRs and walks through the process of choosing them and training models using their data as additional inputs using the New Hampshire road network as a case study. The results reveal that the overall performance of the artificial neural network (ANN) machine learning model improves with the additional inputs of selected ATRs. However, this improvement is more significant if the ATRs are selected based on their spatial distribution over the traffic message channel (TMC) network. For instance, selecting eight ATR stations according to the TMC coverage-based strategy and training the ANN with their inputs leads to average relative reductions of 35.39% and 13.44% in the mean absolute percentage error (MAPE) and error to maximum flow ratio (EMFR), respectively. The results achieved by this study can be further expanded to create a practical strategy for optimizing the number and location of ATRs through transportation networks in a state.