Conference PaperPDF Available

Accident Prediction by Using Poisson Regression for Unsignalised Junction in Khulna Metropolitan City, Bangladesh

February 2019

February 2019

Conference: Proceedings of International Conference on Planning, Architecture and Civil Engineering, 07 - 09 February 2019, Rajshahi University of Engineering & Technology, Rajshahi, Bangladesh

Authors:

Md. Ebrahim Shaik

Bangabandhu Sheikh Mujibur Rahman Science & Technology University

Quazi Sazzad Hossain

Khulna University of Engineering and Technology

shows the narrative statistics of model data. Table 1 also shows that each variable contains 16 legal observations and from their distributions it indicates quite reasonable. The outcome variable for unconditional mean and variance are not immensely different. These values and predictor variable for different condition will be equal by the model assumption.

…

Qualities of Fit

…

Estimate of Accident Parameters

…

Different types of parameter estimates

…

Figures - uploaded by Md. Ebrahim Shaik

Content may be subject to copyright.

Content uploaded by Md. Ebrahim Shaik

Content may be subject to copyright.

Proceedings of International Conference on Planning, Architecture and Civil Engineering, 07 - 09 February 2019,

Rajshahi University of Engineering & Technology, Rajshahi, Bangladesh

Accident Prediction by Using Poisson Regression for Unsignalised

Junction in Khulna Metropolitan City, Bangladesh

M. E. SHAIK1, Q. S. HOSSAIN2

1Department of Civil Engineering, KUET, Bangladesh (ebrahimkuet82@gmail.com)

2Department of Civil Engineering, KUET, Bangladesh (sazzad@ce.kuet.ac.bd)

Abstract

The Poisson regression models were used in the large area of application and likely to improve the quality and

important for the engineering aspects of accident prevention in Khulna Metropolitan City. A wide range of

accident data at each junction for the fifteen-year period (2000-2015) were used in the model development

process. In this model, numbers of accident were selected as an outcome variable and indicate the number of

accidents occurred at different types of junction in a year. Also total accident and junction types were selected as

a continuous predictor variable and categorical predictor variable respectively. It was found that the models

evaluated 2.19 times more variation in accidents at Not junctions and 1.80 times more variation at crossing

junctions than at T-junctions respectively. In absence of available dedicated left turning lane and traffic spot

speeds vehicle on the roads generally related with accident severity in each junction.

Keywords: Poisson Regression, Accident Prediction, Road, Unsignalised Junction, Variable.

1 Introduction

Road traffic accident is one of the most undesirable situations to occur to a road user resulting harmful to the people and

damages to the valuable properties. Now road traffic accident is the global phenomenon to the almost all countries of the

world. They are seriously concerned about the increasing number of people killed and injured on their highways. According

to the World Health Organization (WHO), more than 1.25 million people lives are cut short for the result of road traffic

accident every year. More people between 20 and 50 million suffer simple or non- fatal injuries, with many incurring a

disability as a result of their injury. For Bangladesh, accident severity and fatality in road accident is now an alarming issue.

In Bangladesh Khulna metropolitan city is the 3rd largest city with an area of 45.65 square kilometers and more than 1.022

million people live here. In this city, Total numbers of roads are 1215 with a total length of 356.65 kilometers (Khulna City

Corporation, 2015). Recently significant numbers of killed and injured people are increasing enormously in Khulna

metropolitan city (Ebrahim and Hossain, 2018).

For transportation safety, road traffic accident prediction models are very necessary tools given their effective for evaluating

both the frequency of accident frequency and the contributing factors that could then be determined by transportation

methods (Azad, 2017). To determine the accident frequency, accident severity level and different factors which are generally

responsible for road traffic accident, transportation authority and many research institute are often want to accident data and

factor to identify the most vulnerable road environment site. The different regression model including Poisson regression

model helps to explain the relationship between accident occurrence and responsible risk factor. Though multiple linear

regression models are widely used for prediction, it has been found that the Poisson regression model can often be better

fitted for prediction accident occurrence. The development of generalized theories concerning highway safety, road traffic

accident prediction models can also keep important contribution (Azad, 2017). Poisson regression has been often applied for

count road traffic and different transportation policies including frequency.

Generally Poisson regression has more advantages than other regression also related to different discrete distribution and

constraint to predicted value to non-negative integer number (Glenberg, 1996). In accident frequency analysis for different

platform, almost all the researcher has normally applied Poisson regression as a beginning point for analysis, they have often

found that the model data exhibit over and under dispersion that make the application of Poisson regression model

problematic (Lord and Mannering, 2010).As accidents are not occurred frequently, categorical style of accident data is more

efficient for expression of accident reaction and using this categorized accident data Poisson regression model make the taste

significance as fixed value (Sonia, 2012).

M. E. Shaik, Q. S. Hossain

ICPACE 2019

The situation of road accident in Khulna metropolitan city is actually frightful and the death of lives and infrastructure

damages are desired to continue if necessary corrective measures are not taken accordingly by applying proper engineering

measures through proper research. These situations for all metropolitan cities in Bangladesh are very dangerous. About 20

percent of road accident occurred in metropolitan cities viz. Dhaka, Chittagong, Khulna and Rajshahi (Hoque, M.M., 1991).

So, it is more important for Khulna city to predict road accident considering the basic factors including different types of

junctions.

2 Methodology

2.1 Data Collection

Five police stations were selected as study area namely Khulna sadar, Daulatpur, Khalishpur, Khanjahan Ali and

Sonadanga. The accident data from 2000 to 2015 were collected from the Khulna Metropolitan Police (KMP)

head quarter and Accident Research Institute (ARI), BUET.

2.2 Accidents Distribution

Figure 1 shows the yearly accident distribution.

Figure 1. Yearly accidents distribution.

Total 475 accidents were recorded within the year 2001 to 2015. Maximum 67 accidents and minimum 15

accidents were occurred in the year 2007 and 2004 respectively. Figure1 also showed that the accident rate

decreases in the last three years.

2.3 Model Development

IBM SPSS statistics 22 software was used for the development of Poisson regression model. In this study,

number of accidents, total number of accident and junction types were selected as model parameter where

number of accident was a outcome variable, total number of accident was a continuous predictor variable and

junction types is a categorical predictor variable.

The Poisson regression model is expressed as (Lord and Mannering, 2010):

!)exp(

)P(knn







(1)

Where,

n = kn accidents per some time period,

kn = non- negative integer,

P(kn) = roadway entity probability, n having kn accidents per some time period,

M. E. Shaik & Q. S. Hossain

ICPACE 2019

n = Poisson parameter for roadway entity n,

Which is equal to roadway entity n’s expected number of accident per year, e [kn],

Poisson regression models were estimated by specifying the Poisson parameter n (the expected number of

accidents per period) as a function of explanatory variables, the most common functional form being

n = exp (xn),

Where xn is a vector of explanatory variables and  is a vector of estimable parameters.

2.4 Akaike’s Information Criterion (AIC)

The performance of different statistical model for given data is measured by Akaike’s Information Criterion

(Akaike, 1973). Akaike’s Information Criterion (AIC) compares the models quality relative to each other for

their collective data. Therefore, AIC is widely used for the selection of better model. Smallest value of AIC

indicates the best model (Omari-Sasu et al, 2016).

2.5 Bayesian Information Criterions (BIC)

Bayesian Information Criterion (BIC) is generally used for model selection among finite data sets of different

model. BIC is the exponential function and small parts of a likelihood function are closely related to Akaike’s

Information Criterion (AIC). Smallest value of Bayesian Information Criterions (BIC) indicates the best model

(Omari-Sasu et al, 2016).

3 Results and Discussions

The total accidents 231, 40 and 60 were recorded at the sites of Not junctions, T junctions and crossing junction

respectively for the study period 2000 to 2015. Only junction type is used and most of the parameter and variable

are not included for developing this model. Therefore, this model can be used only for approximate determinates

of accident severity in Khulna Metropolitan City.

Table 1 shows the narrative statistics of model data. Table 1 also shows that each variable contains 16 legal

observations and from their distributions it indicates quite reasonable. The outcome variable for unconditional

mean and variance are not immensely different. These values and predictor variable for different condition will

be equal by the model assumption.

Table 1. Narrative Statistics

Min.

Max.

Mean

Standard Dev.

No. of accident

1.80

4.56

Total .accident

1.85

32.19

Valid N (list wise)

1.93

Table 2 shows that the qualities fit of the model and the model output starts by this table. The statistics lists of

Table 2 indicate the model is quite fit. The first row of the Table 2 indicates the qualities of fit Chi- Squared test.

It was also found that the Akaike’s Information Criterion (AIC) of this model 55.669 and Bayesian Information

Criterions (BIC) equal to 58.760. Model also evaluates the deviance 20.20 as Chi-square distributed with the 12

degrees of freedom. Lowest value of both AIC and BIC indicates the developed model is the best model.

Table 2. Qualities of Fit

Value

Value/DF

Deviance

20.200

1.683

Scaled deviance

20.200

Chi- Square (Pearson)

14.301

1.192

Chi- Square (Scaled-Pearson)

14.301

Log-likelihood

-23.835

AIC

55.669

Corrected AIC (AICC)

59.306

BIC

58.760

Consistent AIC (CAIC)

62.760

Dependent Variable: Number of Accident, Model: (Intercept), Parameter, Total Accident.

M. E. Shaik, Q. S. Hossain

ICPACE 2019

Table 3 shows the Estimate of Accident Parameters. These comprise the Poisson regression coefficients for each

variable including standard error, 95% confidence interval due to the coefficients. The regression coefficient for

total accident was found 0.082. This indicates that the expected increase in the log count for one unit increase in

total accident is 0.082. It was observed from the model effects that the output of this study is statistically

significant.

Table 3. Estimate of Accident Parameters

Parameter

Standard

error

95% Wald interval

confidence level

Hypothesis test

Lower

Upper

Wald Chi-

square

Significance

Intercept

28.395

.4423

-29.262

-27.528

4121.570

.000

Parameter=1

26.548

.4027

25.758

27.337

4346.635

.000

Parameter=2

26.350

Parameter=2

Total

accident

.082

.0087

.065

.099

Scale

Dependent variable: Number of accident, Model: (intercept), parameter, total accident

Table 4 shows the parameter estimates of different types of junction. The average predicted value for Not

junctions was found 2.19. It indicates that the developed model evaluated 2.19 times more variation in road

accident at Not junctions and 1.80 times more variation at Crossing junctions than at T junctions respectively.

So, Not Junction is more severe for occurring accident among these junctions in Khulna Metropolitan City.

Table 4. Different types of parameter estimates

Types of parameter

Mean

Standard

error

95% Wald interval confidence

level

Lower

Upper

Not Junction

2.19

.582

1.30

3.69

Crossing Junction

1.80

.577

.96

3.37

T Junction

.00

.000

.00

This Poisson regression model is the similar as that used in ordinary regression model except that the random

component is the Poisson distribution. The Poisson Regression model prediction is sometimes mentioned as a

Poisson Log linear model. Most of the regression model provides better for over dispersed data while Poisson

regression model provides better for equal- dispersion data.

4 Conclusions

The study of this research work was carried out the ability of Poisson regression model for prediction of accident

in Khulna Metropolitan City, Bangladesh. This paper developed the traffic accident prediction models to gather a

better knowledge by using this useful systems for predict the road traffic accidents including their hazard factors.

From the results it was found that the developed model evaluated 2.19 times more variation in road accident at

Not junctions and 1.80 times more variation at Crossing junctions than at T junctions respectively. So, Not

Junction is more severe for occurring accident among these junctions in Khulna Metropolitan City. It was found

from the Tests of Model Effects output that all the junctions overall, is statistically significant. It can be

concluded that the model fits reasonably well because the goodness-of-fit chi-squared test is not statistically

significant with 12 degrees of freedom. This model can able to predict the road accident at unsignalised junction

of Khulna Metropolitan City, Bangladesh.

M. E. Shaik & Q. S. Hossain

ICPACE 2019

References

Akaike, H. (1973). Information theory as an extension of the maximum likelihood principle. In the second

international symposium on information theory, edited B.V Petrov and B.F Csaki, Academical Kiado.

A.Y. Omari- Sasu, A. M. Isaac and R. K Boadi (2016). Statistical Models for Count Data with Applications to

Road Accidents in Ghana. International Journal of Statistics and Applications. vol. 6(3), 123-137.

Azad, A. (2017). Road Crash Prediction Models: Different Statistical Modeling Approaches. Journal of

Transportation Technologies. vol. 7, 190-205.

Ebrahim, S. and Hossain, Q. S. (2018). An Artificial Neural Network Model for Road Accident Prediction: A

Case Study of Khulna Metropolitan City. ICCESD 2018, 5193, 1-8.

Glenberg, A. (1996). Learning from Data: An Introduction to Statistical Reasoning. 2nd Edition, Lawrence

Erlbaum Associates, Mahwah.

Hoque. M.M. (1991). Accident investigation for the safety improvement of Dhaka- Aricha highway: A section of

Asian highway. Final report, Department of Civil Engineering, Bangladesh University of Engineering &

Technology, Dhaka, Bangladesh.

Khulna City Corporation (2015), Basic Statistics, Retrieved from:

http://www.khulnacity.org/Content/index.php?page=About_KCC&Z2Y&pid=30

Lord, D. and Mannering. F (2010). The statistical Analysis of Crash Frequency Data: A Review and Assessment

of Methodological Alternatives. Accident Analysis and Prevention. vol. 44, 292-303.

Sonia, R. (2012). Development of an Accident Prediction Model for Intersections of Dhaka City, Bangladesh.

International Journal of Computer Applications. vol. 47(16), 10-16.

An Artificial Neural Network Model for Short Term Traffic Flow Prediction in Two Lane Highway in Khulna Metropolitan City, Bangladesh

Preprint

Full-text available

Dec 2022

Short Term traffic flow prediction is one of the most major topics of research in traffic engineering field. It's incredibly useful in the design of a more modern transport network that can manage traffic signals and reduce congestion. Short Term traffic flow is a challenge that a third-world country like Bangladesh is all too familiar with. Khulna Metropolitan City, like the other cities of Bangladesh, is gradually becoming more aware of this situation. The Khulna-Jashore National Highway (N-7), which runs through the city and provides it a linear shape, serves as the backbone of the Khulna Metropolitan City traffic flow. This study developed an Artificial Neural Network (ANN) model for the Short Term Traffic Flow Prediction in Two Lane Highway in Khulna Metropolitan City, Bangladesh. From March 1, 2021, through June 30, 2021, data was collected during 600–900 and 1200–1500 h. Extremely good quality electronic cameras were utilized to record the vehicles on the full designated length. In the regression graphs, the network outputs were displayed with targets for the training, validation, and test sets. The various speed level parameters for which the fit is reasonable for all data sets, with R values of 0.98426 in each case. The various traffic volume parameters for which the fit is reasonable for all data sets, with R values of 0.96758 in each case. The model's superiority is indicated by its low mean squared error values. This study provides an opportunity to provide a suitable alternative for Short Term traffic flow forecasting in Khulna Metropolitan City with traffic flow conditions for two-lane undivided highways.

Application of Statistical Models: Parameters Estimation of Road Accident in Bangladesh

Article

Full-text available

Aug 2020

Road traffic accident is the most unwanted situation and one of the significant reasons for death and injuries of people of all ages worldwide. Statistical analysis of highway-related accidents is of utmost importance to evaluate the severity of the problem and speed up taking decision toward its attenuation. In this research, three statistical models, namely negative binomial, gamma regression and Poisson regression model, were developed by using the statistical software IBM SPSS 25.0 to determine the various contributing factors which were significantly responsible for road accidents occurring in Bangladesh. The parameters selected to develop each of the models are collision type, junction type, vehicle type, weather conditions, and driver behaviors. The goodness of fit test of the Poisson regression model indicates that there was an overdispersion problem in the accident data. The value of deviance and Pearson Chi-square of negative binomial regression analysis were found to be approximately 1.00. This determination declines that the negative binomial regression model was the best fit for the data. The gamma regression analysis was selected due to the handle under dispersion data. The significant contributing factors for road traffic accident occurring in this city based on the appropriate model were head-on and sideswipe as a collision type; T junction and cross junction as a junction type; bus and truck as a vehicle type; high speed and loss of control as a driver behavior. The weather condition is the only factor that has no significant contribution to road traffic accident occurrence.

Conceptual Framework for the Application of the ANN Model in Accident Prediction: A Study of Central Kolkata

Chapter

Dec 2023

Conceptual framework for accident prediction is an essential toolkit to curb accidents and fatalities globally. Different statistical methods and soft computing techniques are used to develop accident prediction models. Accident prediction models have been developed using two approaches, i.e., multiple linear regression (MLR) and artificial neural network (ANN). ANN has been applied to predict the frequency of traffic accidents. Adaptive neuro-fuzzy inference system (ANFIS) has been used as the feature selection method. Feature selection using ANFIS gets more accuracy with ANN was considered the most suitable based on prediction accuracy and measuring errors. It gives around 81.81% accuracy. The framework of hybrid model proposed in this chapter concludes that the prediction accuracy is high when ANN is applied for accident prediction, followed by the ANFIS as a feature selection method.

A review on neural network techniques for the prediction of road traffic accident severity

Article

Full-text available

Nov 2021

The occurrence rate of death and injury due to road traffic accidents is rising increasingly globally day by day. For several decades, the focus of research has been on getting a deeper understanding of the significant factors that influence the risk of road traffic fatalities. In today's modern world, neural network (NN) approaches play a crucial role in identifying the contributing factors that describe the frequency and severity of road accidents. Over the years, many researchers used neural network models for predicting the impact of such factors on road accident injury severity. Deep learning methods such as the recurrent neural network (RNN) and the convolutional neural network (CNN) has recently been successfully used for the prediction of road accidents and demonstrate their high accuracy and efficiency. This study overview and summarizes the different forms of neural network models such as the single layer perceptron (SLP) neural network, the multilayer layer perceptron (MLP) neural network, the radial basis function (RBF) neural network, the recurrent neural network, and the convolutional neural network used as a prediction method for the severity of road crash injuries and includes a discussion of future planning and difficulties. This article also summarizes the model input parameter or independent variable and output or dependent variable, as well as various performance assessment methods.

AN ARTIFICIAL NEURAL NETWORK MODEL FOR ROAD ACCIDENT PREDICTION: A CASE STUDY OF KHULNA METROPOLITAN CITY

Conference Paper

Full-text available

Feb 2018

Road Crash Prediction Models: Different Statistical Modeling Approaches

Article

Full-text available

Jan 2017

Azad Abdulhafedh

Road crash prediction models are very useful tools in highway safety, given their potential for determining both the crash frequency occurrence and the degree severity of crashes. Crash frequency refers to the prediction of the number of crashes that would occur on a specific road segment or intersection in a time period, while crash severity models generally explore the relationship between crash severity injury and the contributing factors such as driver behavior, vehicle characteristics, roadway geometry, and road-environment conditions. Effective interventions to reduce crash toll include design of safer infrastructure and incorporation of road safety features into land-use and transportation planning; improvement of vehicle safety features; improvement of post-crash care for victims of road crashes; and improvement of driver behavior, such as setting and enforcing laws relating to key risk factors, and raising public awareness. Despite the great efforts that transportation agencies put into preventive measures, the annual number of traffic crashes has not yet significantly decreased. For in-stance, 35,092 traffic fatalities were recorded in the US in 2015, an increase of 7.2% as compared to the previous year. With such a trend, this paper presents an overview of road crash prediction models used by transportation agencies and researchers to gain a better understanding of the techniques used in predicting road accidents and the risk factors that contribute to crash occurrence.

Statistical Models for Count Data with Applications to Road Accidents in Ghana

Article

Full-text available

Jun 2016

Road accidents in Ghana seems to be on ascendency and the root causes have been attributed to issues such as human errors and superstitions. Since the occurrences of these accidents are discrete, they are often modelled using count regression models. It is therefore the purpose of this study to determine an appropriate count regression model that adequately fits road accidents in Ghana and determine the key predictors using the appropriate model with respect to the expected number of persons killed in road accidents. Several models were compared to fit count data that encounter the field of transportation. These models include Poisson, Negative Binomial (NB) and Conway-Maxwell-Poisson (CMP) models. In order to compare the performance of these models, the various model selection methods such as Deviance goodness of fit test, Akaike's Information Criterion (AIC) and Bayesian Information Criterion (BIC) were employed. Because the values of Deviance goodness of fit test, AIC and BIC for the NB model was the smallest as compared to that of the Poisson and CMP models, it appeared that, the NB model performed best than the Poisson and CMP models. Base on the appropriate model selected (NB model), the key predictors that contributed significantly and also had a high effect on the expected or mean number of persons killed in road accidents within a particular period were Head-on collision as Collision type, Improper overtaking and Loss of control as Driver errors, Bus/Minibus as Type of vehicle, Fog/Midst as Weather condition and Night with street lights off as Light condition.

The Statistical Analysis of Crash-Frequency Data: A Review and Assessment of Methodological Alternatives

Article

Full-text available

Jun 2010
TRANSPORT RES A-POL

Gaining a better understanding of the factors that affect the likelihood of a vehicle crash has been an area of research focus for many decades. However, in the absence of detailed driving data that would help improve the identification of cause and effect relationships with individual vehicle crashes, most researchers have addressed this problem by framing it in terms of understanding the factors that affect the frequency of crashes – the number of crashes occurring in some geographical space (usually a roadway segment or intersection) over some specified time period. This paper provides a detailed review of the key issues associated with crash-frequency data as well as the strengths and weaknesses of the various methodological approaches that researchers have used to address these problems. While the steady march of methodological innovation (including recent applications of random parameter and finite mixture models) has substantially improved our understanding of the factors that affect crash-frequencies, it is the prospect of combining evolving methodologies with far more detailed vehicle crash data that holds the greatest promise for the future.

Learning From Data: An Introduction to Statistical Reasoning

Book

Aug 2007

Development of an Accident Prediction Model for Intersections of Dhaka City, Bangladesh

Article

Jun 2012

Sonia M Atikur Rahman

Road accidents are increasing at an alarming rate. Every year more than 1.17 million people die in road crashes around the world. The majority of these deaths, about 70 percent occur in developing countries .As a developing country, Bangladesh is not out of this situation. The road safety situation in Bangladesh has been deteriorating with rapid growth in population, motorisation, urbanisation and lack of investment in road safety. The combination of rapid urbanization and motorization has made the problem even severe . For our paper at first we collect data to analysis the severity of accident in Bangladesh. We collect the data of accidents & rearrange these with respect to weather ,collision type, day of week etc .For this arrangement we collect the data of accident for the last five years . Our main concern were the intersections of Dhaka city. We then select twenty five intersections and collect data of road width and number of approaches. Then we develop a model of accident prediction with the collected data. The major findings of our project is we found that accidents increase in good weather ,it may be because in good weather drivers may be more relax and less conscious as they think that everything is seen clearly .Another finding is as Dhaka is a populated city, number of accident with pedestrian is much higher than other type (with vehicle ,other objects etc.) of accidents . Again number of accident increase with increase in number of approaches in a intersection and decrease with increase in road width. From this prediction model we can get the approximate number of accident that can happen per year and we can take proper steps and precautions such as speed breakers , road dividers ,proper signs , marking ,speed limit ,proper signal design to avoid such accidents . General Terms Road safety issue.

Accident Prediction by Using Poisson Regression for Unsignalised Junction in Khulna Metropolitan City, Bangladesh

Figures

Recommended publications

An Artificial Neural Network Model for Road Accident Prediction: A Case Study of Khulna Metropolitan...

Application of Statistical Models: Parameters Estimation of Road Accident in Bangladesh

A review on neural network techniques for the prediction of road traffic accident severity

AN ARTIFICIAL NEURAL NETWORK MODEL FOR ROAD ACCIDENT PREDICTION: A CASE STUDY OF KHULNA METROPOLITAN...