Accident Prediction by Using Poisson Regression for Unsignalised Junction in Khulna Metropolitan City, Bangladesh

Proceedings of International Conference on Planning, Architecture and Civil Engineering, 07 - 09 February 2019,
Rajshahi University of Engineering & Technology, Rajshahi, Bangladesh
M. E. SHAIK1, Q. S. HOSSAIN2
1Department of Civil Engineering, KUET, Bangladesh (ebrahimkuet82@gmail.com)
2Department of Civil Engineering, KUET, Bangladesh (sazzad@ce.kuet.ac.bd)
Abstract
Poisson regression models are widely applied in accident analysis and can support the engineering aspects of
accident prevention in Khulna Metropolitan City. Accident data for each junction type over the period 2000-2015
were used in the model development process. The number of accidents occurring at each type of junction in a year
was selected as the outcome variable, while total accidents and junction type were selected as a continuous
predictor variable and a categorical predictor variable respectively. It was found that the model evaluated
2.19 times more variation in accidents at Not junctions and 1.80 times more variation at crossing junctions than
at T-junctions. The absence of dedicated left-turning lanes and the spot speeds of vehicles on the roads were
generally related to accident severity at each junction.
Keywords: Poisson Regression, Accident Prediction, Road, Unsignalised Junction, Variable.
1 Introduction
A road traffic accident is one of the most undesirable events a road user can experience, causing harm to people and
damage to valuable property. Road traffic accidents are now a global phenomenon affecting almost every country in the
world, and these countries are seriously concerned about the increasing number of people killed and injured on their
highways. According to the World Health Organization (WHO), more than 1.25 million lives are cut short by road traffic
accidents every year. Between 20 and 50 million more people suffer non-fatal injuries, with many incurring a
disability as a result of their injury. In Bangladesh, the severity and fatality of road accidents is now an alarming issue.
Khulna metropolitan city is the third largest city in Bangladesh, with an area of 45.65 square kilometers and a population
of more than 1.022 million. The city has 1215 roads with a total length of 356.65 kilometers (Khulna City
Corporation, 2015). The number of people killed and injured in Khulna metropolitan city has recently been increasing
significantly (Ebrahim and Hossain, 2018).
Road traffic accident prediction models are essential tools for transportation safety, given their effectiveness in
evaluating both accident frequency and the contributing factors that can then be addressed by transportation
methods (Azad, 2017). To determine accident frequency, accident severity and the factors generally responsible for
road traffic accidents, transportation authorities and research institutes often need accident data and contributing
factors to identify the most vulnerable road sites. Regression models, including the Poisson regression model, help to
explain the relationship between accident occurrence and the responsible risk factors. Although multiple linear
regression models are widely used for prediction, the Poisson regression model has often been found to fit accident
occurrence better. Road traffic accident prediction models can also make an important contribution to the development
of general theories of highway safety (Azad, 2017). Poisson regression has often been applied to count data in road
traffic and transportation policy studies, including accident frequency.
Poisson regression has advantages over other regression models for count data because it is based on a discrete
distribution and restricts predicted values to non-negative numbers (Glenberg, 1996). In accident frequency analysis,
most researchers have applied Poisson regression as a starting point, but they have often found that the data exhibit
over- or under-dispersion, which makes the application of the Poisson regression model problematic (Lord and
Mannering, 2010). As accidents do not occur frequently, a categorical representation of accident data expresses
accident response more efficiently, and with such categorized data the Poisson regression model allows significance
to be tested against a fixed value (Sonia, 2012).
M. E. Shaik, Q. S. Hossain
ICPACE 2019
The road accident situation in Khulna metropolitan city is alarming, and the loss of lives and damage to infrastructure
are likely to continue if corrective engineering measures, informed by proper research, are not taken. The situation is
similarly dangerous in all metropolitan cities of Bangladesh. About 20 percent of road accidents occurred in the
metropolitan cities, viz. Dhaka, Chittagong, Khulna and Rajshahi (Hoque, M.M., 1991). It is therefore important for
Khulna city to predict road accidents considering basic factors, including the different types of junctions.
2 Methodology
2.1 Data Collection
Five police station areas were selected as the study area, namely Khulna Sadar, Daulatpur, Khalishpur, Khanjahan Ali
and Sonadanga. Accident data from 2000 to 2015 were collected from the Khulna Metropolitan Police (KMP)
headquarters and the Accident Research Institute (ARI), BUET.
2.2 Accidents Distribution
Figure 1 shows the yearly accident distribution.
Figure 1. Yearly accidents distribution.
A total of 475 accidents were recorded from 2001 to 2015. The maximum of 67 accidents and the minimum of 15
accidents occurred in 2007 and 2004 respectively. Figure 1 also shows that the accident rate decreased in the
last three years.
2.3 Model Development
IBM SPSS Statistics 22 was used to develop the Poisson regression model. In this study, the number of accidents,
the total number of accidents and the junction type were selected as model variables: the number of accidents
was the outcome variable, the total number of accidents was a continuous predictor variable and the junction
type was a categorical predictor variable.
The Poisson regression model is expressed as (Lord and Mannering, 2010):
$$P(k_n) = \frac{\exp(-\lambda_n)\,\lambda_n^{k_n}}{k_n!} \qquad (1)$$
Where,
$n$ = roadway entity having $k_n$ accidents per some time period,
$k_n$ = non-negative integer,
$P(k_n)$ = probability of roadway entity $n$ having $k_n$ accidents per some time period,
$\lambda_n$ = Poisson parameter for roadway entity $n$,
which is equal to roadway entity $n$’s expected number of accidents per year, $E[k_n]$.
Poisson regression models were estimated by specifying the Poisson parameter $\lambda_n$ (the expected number of
accidents per period) as a function of explanatory variables, the most common functional form being
$$\lambda_n = \exp(\beta x_n),$$
where $x_n$ is a vector of explanatory variables and $\beta$ is a vector of estimable parameters.
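As a minimal sketch of Equation (1) and the log link, the following Python snippet evaluates the Poisson probability for a given expected count and builds that expected count from a linear predictor. The function names and the coefficient and predictor values are hypothetical, chosen only for illustration; they are not estimates from this study.

```python
import math

def expected_count(beta, x):
    """Poisson parameter under the log link: lambda = exp(beta . x)."""
    return math.exp(sum(b * xi for b, xi in zip(beta, x)))

def poisson_pmf(k, lam):
    """Equation (1): P(k) = exp(-lambda) * lambda**k / k!."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

# Hypothetical values for illustration only (not taken from the paper).
beta = [0.5, 0.02]   # intercept and one slope
x = [1.0, 30.0]      # the leading 1.0 multiplies the intercept
lam = expected_count(beta, x)

# Probability of observing 0, 1, 2, ... accidents in one period.
probabilities = [poisson_pmf(k, lam) for k in range(60)]
print(f"expected count = {lam:.3f}")
```

Because the link is exponential, the expected count is always positive whatever the sign of the linear predictor, which is one reason this form suits count outcomes.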
2.4 Akaike’s Information Criterion (AIC)
Akaike’s Information Criterion (AIC) measures the performance of a statistical model for a given data set
(Akaike, 1973). AIC compares the quality of candidate models relative to each other on the same data and is
therefore widely used for model selection. The smallest AIC value indicates the best model
(Omari-Sasu et al., 2016).
2.5 Bayesian Information Criterion (BIC)
The Bayesian Information Criterion (BIC) is generally used for model selection among a finite set of candidate
models. BIC is based, in part, on the likelihood function and is closely related to Akaike’s Information
Criterion (AIC). The smallest BIC value indicates the best model
(Omari-Sasu et al., 2016).
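For concreteness, both criteria can be computed directly from a model's maximized log-likelihood using the standard definitions AIC = 2k - 2 ln L and BIC = k ln(n) - 2 ln L. In the sketch below, the value -23.835 listed in Table 2 is taken to be the log-likelihood, and the parameter count k = 4 and sample size n = 16 are assumptions: they are not stated explicitly in the paper, although n = 16 matches the number of observations in Table 1.

```python
import math

def aic(log_likelihood, k):
    """Akaike's Information Criterion: 2k - 2 ln L."""
    return 2 * k - 2 * log_likelihood

def bic(log_likelihood, k, n):
    """Bayesian Information Criterion: k ln(n) - 2 ln L."""
    return k * math.log(n) - 2 * log_likelihood

log_likelihood = -23.835  # from Table 2 (assumed to be the log-likelihood row)
k = 4                     # assumed number of estimated parameters
n = 16                    # assumed number of observations (Table 1)

model_aic = aic(log_likelihood, k)
model_bic = bic(log_likelihood, k, n)
print(f"AIC = {model_aic:.3f}, BIC = {model_bic:.3f}")
```

Under these assumptions the results agree with the reported AIC of 55.669 and BIC of 58.760 to within rounding. Both criteria are only meaningful when compared across candidate models fitted to the same data.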
3 Results and Discussions
Totals of 231, 40 and 60 accidents were recorded at Not junctions, T junctions and crossing junctions
respectively for the study period 2000 to 2015. Only junction type was used in developing this model; most other
parameters and variables were not included. Therefore, the model can be used only for approximate determination
of accident severity in Khulna Metropolitan City.
Table 1 shows the descriptive statistics of the model data. Each variable has 16 valid observations, and their
distributions appear quite reasonable. The unconditional mean and variance of the outcome variable are not
greatly different. The model assumes that these values, conditional on the predictor variables, are equal.
Table 1. Descriptive Statistics
N
Min.
Max.
Mean
Standard Dev.
No. of accident
16
1.80
0
32
4.56
Total .accident
16
1.85
12
67
32.19
Valid N (list wise)
16
1.93
Table 2 shows the goodness of fit of the model; the model output starts with this table. The statistics listed in
Table 2 indicate that the model fits well. The first row of Table 2 gives the goodness-of-fit chi-squared test.
The Akaike’s Information Criterion (AIC) of this model was 55.669 and the Bayesian Information Criterion (BIC)
was 58.760. The deviance of 20.20 is chi-square distributed with 12 degrees of freedom. The low values of both
AIC and BIC indicate that the developed model is the best model.
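Table 2. Goodness of Fit
                                        Value     DF    Value/DF
Deviance                                20.200    12    1.683
Scaled Deviance                         20.200    12
Pearson Chi-Square                      14.301    12    1.192
Scaled Pearson Chi-Square               14.301    12
Log Likelihood                          -23.835
Akaike’s Information Criterion (AIC)    55.669
Finite Sample Corrected AIC (AICC)      59.306
Bayesian Information Criterion (BIC)    58.760
Consistent AIC (CAIC)                   62.760
Dependent Variable: Number of Accident, Model: (Intercept), Parameter, Total Accident.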
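The dispersion ratios and the goodness-of-fit test can be checked by hand from the values in Table 2. The sketch below is an illustration, not SPSS output: it recomputes Value/DF and evaluates the upper-tail chi-square probability of the deviance using the closed-form series that holds for an even number of degrees of freedom. The helper name `chi2_sf_even_df` is hypothetical, and 14.301 is taken here to be the Pearson chi-square statistic.

```python
import math

def chi2_sf_even_df(x, df):
    """Upper-tail probability P(X > x) for a chi-square variable with even df.

    For df = 2m the survival function has the closed form
    exp(-x/2) * sum_{i=0}^{m-1} (x/2)**i / i!.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form requires a positive even df")
    half = x / 2.0
    return math.exp(-half) * sum(half ** i / math.factorial(i) for i in range(df // 2))

deviance, pearson, df = 20.200, 14.301, 12   # values from Table 2

deviance_ratio = deviance / df   # values near 1 suggest equi-dispersion
pearson_ratio = pearson / df
p_value = chi2_sf_even_df(deviance, df)

print(f"Deviance/DF = {deviance_ratio:.3f}, Pearson/DF = {pearson_ratio:.3f}")
print(f"p-value of deviance test = {p_value:.3f}")
```

The resulting p-value is above 0.05, which is consistent with reading the goodness-of-fit test as not statistically significant. Ratios above 1 point towards mild over-dispersion rather than exact equality of mean and variance.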
Table 3 shows the estimates of the accident parameters. These comprise the Poisson regression coefficients for
each variable, together with their standard errors and 95% confidence intervals. The regression coefficient for
total accident was 0.082, which indicates that the expected log count increases by 0.082 for a one-unit increase
in total accident. The tests of model effects showed that the output of this study is statistically
significant.
Table 3. Estimates of Accident Parameters
                                          95% Wald confidence interval    Hypothesis test
Parameter        B          Standard      Lower        Upper              Wald Chi-     DF    Significance
                            error                                         square
Intercept        -28.395    .4423         -29.262      -27.528            4121.570      1     .000
Parameter=1      26.548     .4027         25.758       27.337             4346.635      1     .000
Parameter=2      26.350     .
Parameter=2      0          .
Total accident   .082       .0087         .065         .099
Scale            1
Dependent variable: Number of accident, Model: (intercept), parameter, total accident
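The interval columns of Table 3 follow from each coefficient and its standard error. As a hedged illustration (the helper name `wald_interval` is hypothetical, and 1.96 is the usual normal quantile for a 95% interval), the sketch below recomputes the Wald interval for the total accident coefficient and converts the coefficient into a multiplicative effect on the expected count.

```python
import math

def wald_interval(b, se, z=1.96):
    """Wald confidence interval: coefficient plus or minus z standard errors."""
    return b - z * se, b + z * se

# Total accident coefficient and standard error from Table 3.
b_total, se_total = 0.082, 0.0087
lower, upper = wald_interval(b_total, se_total)

# exp(B) is the factor by which the expected count changes per unit increase.
rate_ratio = math.exp(b_total)

print(f"95% Wald interval: ({lower:.3f}, {upper:.3f})")
print(f"exp(B) = {rate_ratio:.3f}")
```

The recomputed limits match the tabulated .065 and .099. Read on the count scale, exp(0.082) corresponds to an increase of roughly 8.5% in the expected number of accidents for each additional unit of total accident.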
Table 4 shows the parameter estimates for the different types of junction. The average predicted value for Not
junctions was 2.19. It indicates that the developed model evaluated 2.19 times more variation in road
accidents at Not junctions and 1.80 times more variation at Crossing junctions than at T junctions.
Not junctions are therefore the most accident-prone of these junction types in Khulna Metropolitan City.
Table 4. Different types of parameter estimates
                                                95% Wald confidence interval
Types of parameter     Mean    Standard error   Lower      Upper
Not Junction           2.19    .582             1.30       3.69
Crossing Junction      1.80    .577             .96        3.37
T Junction             .00     .000             .00        .00
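The means in Table 4 appear to be consistent with predictions from the coefficients in Table 3 evaluated at the mean of total accident. The sketch below checks this under assumptions that the paper does not confirm: the intercept is taken as -28.395 (the midpoint of its reported interval), Parameter=1 is taken to be Not junction, the first Parameter=2 row to be Crossing junction, T junction is taken as the reference category, and 32.19 is taken as the mean of total accident from Table 1.

```python
import math

# Coefficients from Table 3; the mapping to junction types is an assumption.
INTERCEPT = -28.395               # assumed sign, midpoint of the reported interval
JUNCTION_EFFECT = {
    "Not Junction": 26.548,       # assumed to be Parameter=1
    "Crossing Junction": 26.350,  # assumed to be the first Parameter=2 row
    "T Junction": 0.0,            # assumed reference category
}
TOTAL_ACCIDENT_SLOPE = 0.082
MEAN_TOTAL_ACCIDENT = 32.19       # assumed mean of total accident (Table 1)

def predicted_mean(junction):
    """Expected accidents: exp(intercept + junction effect + slope * mean)."""
    eta = (INTERCEPT + JUNCTION_EFFECT[junction]
           + TOTAL_ACCIDENT_SLOPE * MEAN_TOTAL_ACCIDENT)
    return math.exp(eta)

predicted = {name: predicted_mean(name) for name in JUNCTION_EFFECT}
for name, value in predicted.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the predictions come out close to the tabulated 2.19, 1.80 and .00; small differences are expected because the published coefficients are rounded.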
This Poisson regression model is similar to an ordinary regression model except that the random component
follows the Poisson distribution. The Poisson regression model is sometimes referred to as a Poisson log-linear
model. Most other count regression models perform better for over-dispersed data, while the Poisson regression
model performs better for equi-dispersed data.
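To make the log-linear structure concrete, here is a minimal sketch of how a Poisson regression with an intercept and one predictor can be fitted by Newton-Raphson maximum likelihood. It is an illustration only: the study itself used SPSS, the function name `fit_poisson` is hypothetical, and the six data points are made up for the example rather than drawn from the Khulna data.

```python
import math

def fit_poisson(x, y, iterations=25):
    """Newton-Raphson fit of log(mu) = b0 + b1 * x for count data y."""
    # Start from the log of the sample mean with a zero slope.
    b0, b1 = math.log(sum(y) / len(y)), 0.0
    for _ in range(iterations):
        mu = [math.exp(b0 + b1 * xi) for xi in x]
        # Score vector: X^T (y - mu)
        g0 = sum(yi - mi for yi, mi in zip(y, mu))
        g1 = sum(xi * (yi - mi) for xi, yi, mi in zip(x, y, mu))
        # Information matrix X^T diag(mu) X, stored as [[a, b], [b, c]]
        a = sum(mu)
        b = sum(xi * mi for xi, mi in zip(x, mu))
        c = sum(xi * xi * mi for xi, mi in zip(x, mu))
        det = a * c - b * b
        # Newton step: solve the 2 x 2 system with the explicit inverse
        b0 += (c * g0 - b * g1) / det
        b1 += (a * g1 - b * g0) / det
    return b0, b1

# Made-up data for illustration only (not from the study).
x = [0, 1, 2, 3, 4, 5]
y = [1, 1, 2, 4, 6, 10]
b0, b1 = fit_poisson(x, y)
fitted = [math.exp(b0 + b1 * xi) for xi in x]
print(f"log(mu) = {b0:.3f} + {b1:.3f} * x")
```

At the maximum likelihood estimate the fitted counts sum to the observed counts, a property of the log link with an intercept. The model in this paper additionally includes junction type as a categorical predictor, which this two-parameter sketch omits.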
4 Conclusions
This study examined the ability of the Poisson regression model to predict accidents in Khulna Metropolitan
City, Bangladesh. The paper developed a traffic accident prediction model to gain a better understanding of
road traffic accidents and their hazard factors. The results show that the developed model evaluated 2.19 times
more variation in road accidents at Not junctions and 1.80 times more variation at Crossing junctions than at
T junctions. Not junctions are therefore the most accident-prone of these junction types in Khulna Metropolitan
City. The Tests of Model Effects output shows that junction type, overall, is statistically significant. It can
be concluded that the model fits reasonably well because the goodness-of-fit chi-squared test is not
statistically significant with 12 degrees of freedom. This model is able to predict road accidents at
unsignalised junctions of Khulna Metropolitan City, Bangladesh.
References
Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In B. N. Petrov and
F. Csáki (eds.), Second International Symposium on Information Theory, Akadémiai Kiadó, Budapest.
Omari-Sasu, A. Y., Isaac, A. M. and Boadi, R. K. (2016). Statistical Models for Count Data with Applications to
Road Accidents in Ghana. International Journal of Statistics and Applications. vol. 6(3), 123-137.
Azad, A. (2017). Road Crash Prediction Models: Different Statistical Modeling Approaches. Journal of
Transportation Technologies. vol. 7, 190-205.
Ebrahim, S. and Hossain, Q. S. (2018). An Artificial Neural Network Model for Road Accident Prediction: A
Case Study of Khulna Metropolitan City. ICCESD 2018, 5193, 1-8.
Glenberg, A. (1996). Learning from Data: An Introduction to Statistical Reasoning. 2nd Edition, Lawrence
Erlbaum Associates, Mahwah.
Hoque, M. M. (1991). Accident investigation for the safety improvement of Dhaka-Aricha highway: A section of
Asian highway. Final report, Department of Civil Engineering, Bangladesh University of Engineering &
Technology, Dhaka, Bangladesh.
Khulna City Corporation (2015). Basic Statistics. Retrieved from:
http://www.khulnacity.org/Content/index.php?page=About_KCC&Z2Y&pid=30
Lord, D. and Mannering, F. (2010). The Statistical Analysis of Crash-Frequency Data: A Review and Assessment
of Methodological Alternatives. Transportation Research Part A: Policy and Practice. vol. 44(5), 291-305.
Sonia, R. (2012). Development of an Accident Prediction Model for Intersections of Dhaka City, Bangladesh.
International Journal of Computer Applications. vol. 47(16), 10-16.