Conference PaperPDF Available

Capturing Uncertainty in Heavy Goods Vehicle Driving Behaviour

Authors:

Abstract and Figures

There is a growing interest in understanding and identifying risky driving behaviours due to the numerous deaths and fatalities attributed to road traffic accidents. For Heavy Goods Vehicles (HGVs), understanding driving behaviour and its impact on road safety is a subject of interest for researchers, governments and industrial sectors, as they rely on HGVs for the delivery of goods and services. The current literature on HGV driving behaviour uses machine learning techniques to uncover core driving incident stereotypes. However, human behaviour contains different levels of uncertainty and stereotyping driving behaviour with traditional crisp methods may cause information loss and establish unfair boundaries as they do not take context into consideration. Moreover, the sensor readings also have uncertainties, and the driving stereotypes may have different subjective interpretations. In order to capture those intermediate possibilities in driver stereotyping, we propose a data-driven Fuzzy Logic system that can capture the uncertainties within driving features (data) and between driving stereotypes, and classifies drivers according to the risk of their driving styles on a scale of 0 to 100, where 0 is a low risk driver and 100 high risk. The results from telematics data show that our proposed method provides a reliable, fair and explainable approach for real-time identification of HGV driving risk level.
Content may be subject to copyright.
Capturing Uncertainty in Heavy Goods Vehicles
Driving Behaviour
Jimiama Mafeni Mase1, Utkarsh Agrawal2, Direnc Pekaslan1, Mohammad Mesgarpour3,
Peter Chapman4, Mercedes Torres Torres1, Grazziela P. Figueredo1
1School of Computer Science, The University of Nottingham
2School of Medicine, University of St Andrews
3Microlise, Farrington Way, Eastwood, Nottingham
4School of Psychology, The University of Nottingham
Abstract—There is a growing interest in understanding and
identifying risky driving behaviours due to the numerous road
fatalities attributed to them. For Heavy Goods Vehicles (HGVs),
understanding driving behaviour and its impact on road safety
is a subject of interest for researchers, the government and
industrial sectors, as they rely on HGVs for the delivery of goods
and services. The current literature on HGV driving behaviour
uses machine learning techniques to uncover core driving incident
stereotypes. However, human behaviour contains different levels
of uncertainty and stereotyping driving behaviour with tradi-
tional crisp methods may cause information loss and establish
unfair boundaries as they do not take context into consideration.
Moreover, the sensor readings also have uncertainties, and the
driving stereotypes may have different subjective interpretations.
In order to capture those intermediate possibilities in driver
stereotyping, we propose a data-driven Fuzzy Logic system that
can capture the uncertainties within driving features (data) and
between driving stereotypes, and classifies drivers according to
the risk of their driving styles on a scale of 0 to 100, where 0 is
a low risk driver and 100 high risk. The results from telematics
data show that our proposed method provides a reliable, fair and
explainable approach for real-time identification of HGV driving
risk level.
Index Terms—Fuzzy Logic Systems, Driving Risk Level, Clus-
tering, Uncertainty, Classification, Heavy Goods Vehicles, Driving
Incidents, Explainable AI
I. INTRODUCTION
Every year the World Health Organisation releases a report
with statistics on road fatalities and the report consistently
attributes human errors and violations as one of the main
causes [1]. This has prompted a growing research interest in
understanding driving styles [2]–[5], distraction triggers [6]–
[8], driver interactions with vehicle technologies [9], [10] and
psychological factors affecting safe driving [11], [12]. One
area of specific interest is understanding the different types of
incidents caused by risky driving behaviours of Heavy Goods
Vehicles (HGVs) and its impact on road safety.
The current approaches to monitor driving behaviour are
based on sensors producing large volumes of data. Effective
data-driven tools for the detection and mitigation of driving
errors as well as approaches to report and understand vehicle
incidents due to human errors or violations are therefore neces-
sary. Furthermore, better interpretation of the data produced by
HGVs can assist stakeholders to develop actions, policies and
technologies for safe driving and to minimise risks and costs
within the HGV transportation network. In a recent research by
Agrawal et al. [5], the authors uncovered eleven HGV driving
incident stereotypes using telematics1data from three years
i.e. 2014, 2015 and 2016. Their study is limited with regards
to their methodology, which considers driving features and
stereotypes as crisp sets (where a driving incident value either
belongs to a particular category or not). Such a system may
be bias and cause information loss as driving features and
stereotypes contain different levels of uncertainty produced by
sensor readings and subjective interpretations. For example,
in traditional logic,‘50’ harsh braking incidents in a day can
be considered as ‘high-risk’ while ‘49’ can be regarded as
not ‘high-risk’, when it is obvious that if ‘50’ harsh braking
incidents has some degree of ‘high risk’, ‘49’ should also have
some degree of ‘high-risk’.
In this paper we build upon this aforementioned research
and extend it by presenting a novel data-driven Fuzzy Logic
System (FLS) for capturing the uncertainties in the available
telematics data and scoring HGV drivers according to the risk
of their driving incidents. In order to achieve this, a frame-
work consisting three steps is employed. First, a classification
framework and profile labelling algorithm (based on works in
[4] and [5]) is employed on telematics data. The data contains
driving incidents for four years, i.e., 2014, 2015, 2016 and
2017. The objective is to identify core driving stereotypes.
The driving incidents (antecedents) and driving stereotypes
(consequents) are subsequently used to obtain the fuzzy sets
for the FLS. Secondly, Wang-Mendel [13] method is employed
to learn the rules (knowledge base) that maps these antecedents
to consequents of the FLS. Lastly, a Mamdani inference [14]
approach is used to combine these rules to produce a scoring
system that can rate drivers’ risk level (between 0 and 100)
in real-time based on their driving incidents, where a score 0
represents a ‘low risk’ driving behaviour, while 100 is a ‘high
risk’ driving behaviour.
This paper is organised as follows, in Section II we re-
view the literature on understanding driving behaviour using
telematics data. Subsequently, we review FLS basic concepts
and their applications to driving behaviour. In Section III, we
1Telematics are communication systems which link to vehicles’ Controller
Area Network (CAN) databus for capturing vehicle data using sensors, e.g.,
engine revs, fuel consumption, accelerator position etc
provide an overview of the 2-stage classification framework,
profile labelling algorithm, introduce the datasets used in our
study and describe our methodology in detail. The results are
presented along with discussion in Section IV, and Section V
concludes the paper and establishes the opportunity for future
work.
II. BACKGROU ND
A. Related Work
Apart from interviews and questionaires, telematics have
been one of the main data sources for detecting driving
behaviour in modern studies. Researchers have analysed telem-
atics data using machine learning techniques to understand and
classify driving behaviour with promising results. For instance,
Constantinescu et al. [15] use Hierarchical Clustering Analysis
(HCA) to identify six profiles among 25 drivers in the city of
Bucharest, based on four telematics driving features i.e. speed,
acceleration, braking and kinetic energy. Similarly, Ellison
et al. [16] propose a driver risky index framework based
on three telematics driving incidents (i.e. over speed, harsh
acceleration and harsh braking) collected from 106 drivers
in Sidney within 25 days. These studies employ different
variations of clustering algorithms to identify groups of driving
incidents stereotypes, however, they are limited with regards to
the number of samples analysed, which is relatively small, and
their experiments are performed in controlled environments,
which is unrepresentative of real driving behaviour.
In a seminal work by Figueredo et al. [4], the authors
tackle these shortcomings by analysing driving incidents data
captured for more than 20,000 HGV drivers in the UK. The
authors use a 2-stage classification framework [17] and a
subjective profile labelling algorithm to uncover eight HGV
driving stereotypes from driving incidents data of the year
2015. Agrawal et al. [5] build upon this work to put forward
a classification system to classify HGV drivers in real time.
They use a decision tree classifier to learn driving behaviours
of drivers from the years 2014 to 2016 and test its performance
on new drivers from the year 2017. Although they achieve high
classification accuracy, their proposed model does not consider
the uncertainties within the driving incidents and stereotypes
injected by sensor readings and subjectivity, which can cause
information loss and unfair classification systems.
B. Fuzzy Logic Systems and their applications to driving
behaviour
Fuzzy Logic was introduced by [18] to address imprecision
or uncertainties in input and output variables directly by defin-
ing them using linguistic terms with their degrees of member-
ship between 0 and 1 (fuzzy sets). Fuzzy Logic Systems (FLS)
consist of three stages: 1) Fuzzification, 2) Fuzzy inference,
and 3) Defuzzification. In fuzzification, inputs (e.g. driving
incidents) and outputs (e.g. driving stereotypes of behaviour)
also known as antecedents and consequents respectively, are
represented by linguistic variables (e.g. ‘Low’, ‘Moderate’
and ‘High’ number of harsh braking incidents). The linguistic
variables have soft boundaries that allows them to better
Fig. 1: An example of fuzzy sets for a driving feature
capture the uncertainties in inputs and outputs. In Fig. 1, ‘35’
harsh braking incidents is considered ‘Moderate’ risk with a
high degree of membership and also considered ’High’ risk
with a low degree of membership.
Fuzzy rules are defined to capture the relationships between
the input and output variables and are usually based on IF-
THEN statements, which are easy to understand. For example,
a fuzzy rule for driving behaviour can be:
IF the number of harsh braking incidents is ‘Low’
risk AND the number of over speeding incidents is ‘Low’
risk THEN the driver’s risk level is ‘Low’.
Lastly, the fuzzy inference systems combine the input fuzzy
sets using the fuzzy rules to produce output fuzzy sets, and
defuzzification maps the output fuzzy sets to a real or crisp
number (e.g. a driving score in this paper).
Recently, FLS are examined in transport research to capture
the uncertainties in driving features, and to develop fair
and explainable classification systems [19]–[22]. Aljaafreh et
al. [19] employ FLS and two driving features (acceleration and
speed) to classify drivers into four predefined driving stereo-
types (below normal, normal, aggressive, and very aggressive
profiles). In a different study, Imkamon et al. [20] use FLS
for detecting unsafe driving behaviour. They use three sensors:
an engine control unit (ECU) reader, an accelerometer and a
camera, to extract features, such as vehicle movement. The
features are then combined using a fuzzy inference system
to identify a driver’s risk level ranging from 1 to 3. In a
similar study by Boonmee et al. [21], the authors use harsh
acceleration, sudden braking, and turning quickly incidents
and a FLS for rating drivers on 1 to 10 scale (where 10 is
the most risky driver) .
The above studies on the use of FLS to describe driving
behaviours are limited as their input fuzzy sets and mem-
bership functions are defined using small number of drivers
leading to fewer driving stereotypes. Thus, this limitation can
introduce bias and make their outcome unrepresentative of
more general driving patterns. In this work, we bridge these
gaps by capturing uncertainties in HGV driving incidents using
a FLS developed from four years of telematics data i.e. input
fuzzy sets will be defined by driving stereotypes captured by
real world data. We will use the FLS to score drivers according
to the risk of their driving behaviours in the range from 0 to
100, where 0 is a low risk driver while 100 is high risk.
III. METHODOLOGY
In this section we present our novel data-driven fuzzy logic
approach for scoring HGV drivers based on driving incidents
captured using telematics. First, the fuzzy sets and membership
functions are obtained from the driving profiles uncovered
from the classification framework. Subsequently, the fuzzy
rules are generated using the Wang-Mendel method. Lastly, the
Mamdani inference method are used to score drivers according
to the risk level of their driving behaviour. An overview of our
proposed method is shown in Fig. 2. Each stage is described
in detail below.
A. Dataset Description
The dataset utilised in this study is provided by our industry
partner Microlise [23]. Information produced by their telemat-
ics solutions are transmitted and collected from the HGVs in
real time. Data is captured by sensors connected to multiple
electronic control units using a Controller Area Network
(CAN) bus. The HGV drivers must complete a minimum of 10
journeys per quarter (i.e. minimum 40 journeys yearly) each
year on any road in the UK to be considered in our analysis.
The data was collected between the first of January and the
thirty first of December for the years 2014, 2015, 2016, and
2017. The dataset consists of four driving incidents: frequency
of Harsh Braking (HB) events, Over-Speed (OS) duration
in seconds, Excessive Throttle (ET) duration in seconds and
frequency of Over Revving (ORev) events of HGV drivers.
It is important to note that these incidents were chosen for
our study because they are the most relevant incidents present
in all of the HGVs, which are related to the risk of crashes
and vehicle costs. In total, 15,893 drivers in year 2014, 21,234
drivers in year 2015, 34,675 drivers in year 2016, and 35,432
drivers in year 2017 were collected using this criteria. The
drivers were grouped into three subgroups based on their daily
average mileage travelled as uncovered by [4] including short
subgroup drivers who cover an average daily mileage upto
136.70 miles, medium subgroup drivers who cover an average
daily mileage between 136.70 and 217.48 miles, and long
subgroup drivers who covers an average daily mileage more
than 217.48 miles. Table I presents the distribution of the
drivers for the four years within the three driving subgroups.
TABLE I: Distribution of drivers among Average Daily
Mileage groups for years 2014, 2015, 2016 and 2017
Average daily Distribution of Distribution of Distribution of Distribution of
mileage groups drivers in 2014 drivers in 2015 drivers in 2016 drivers in 2017
Short 3,327 5,076 16,281 10,745
Medium 6,419 8,392 10,232 14,833
Long 6,147 7,766 8,162 9.854
Total 15,893 21,234 34,675 35,432
B. Data Pre-processing
For unbiased results, we normalise the number of driving
incidents produced by each driver by dividing with the total
driving time (in seconds). To obtain stable driving stereotypes,
the drivers who were consistently present across all the years
and who did not change subgroups across these years are
considered for the analysis. For example, if a driver is present
in short average daily mileage subgroup in the year 2014,
that driver should also be in the short average daily mileage
subgroup for the years 2015, 2016 and 2017 respectively. As a
result, 2,462 drivers are considered across the four years with
569, 964 and 929 drivers distributed between short, medium
and long average daily mileage subgroups respectively.
C. Generating Driving Stereotypes for FLS
To generate Membership Functions (MFs) and fuzzy rules,
the first step is to obtain the driving stereotypes. The driving
stereotypes are obtained using 2-stage classification framework
followed by a profile labelling algorithm (introduced in [4],
[5]) on the datasets from four years. The classification frame-
work is a two step process: 1) Multiple clustering algorithms
are run to group the data. Using a consensus approach, most of
the data are assigned to one of the identified groups (profiles),
while some of the data remain unclustered due to lack of
consensus. 2) An ensemble of classification algorithms is
trained on the clustered data in the previous step. The trained
model is then run on the unclustered data to assign them to one
of the groups uncovered during clustering. The classification
framework uncovers eleven clusters. Subsequently, a profile
labelling algorithm sub-divides each driving incident into three
categories or linguistic terms (fuzzy sets); ‘low’, ‘moderate’
and ‘high’. Fig. 4 (a), (b), (c) and (d) show the boxplot
distributions of the four driving incidents HB, OS, ET and
ORev respectively, for short mileage subgroup (due to space
constraints, we only show the distributions of short mileage
subgroup). It is important to note that the boxplot for ORev
has no distribution for ‘high’ as the profile labelling algorithm
produced only two categories for ORevs of short mileage
drivers due to their relatively smaller values compared to the
other subgroups. The clusters obtained above represent the
eleven driving stereotypes, as shown in Table II. The first
column represents the driving profile number, columns two to
five represent the labels of the occurrence of incidents in each
profile and the last column indicates the mileage subgroups
in which these profiles are present. A more elaborate descrip-
tion of the classification framework and the profile labelling
algorithm can be found in [4] and [5].
TABLE II: HGV driving stereotypes and their feature labels
uncovered across 2014, 2015, 2016 and 2017
Profile Harsh Over-Speed Excessive Number of Driving
Number Braking duration Throttle Over Revs Subgroup
1 Low Low Low Moderate S,M,L
2 Low Low High Low M
3 Low Low High Moderate S
4 Moderate Moderate High Low L
5 Moderate Moderate High Moderate S,M,L
6 Moderate Moderate High High M
7 Moderate High Moderate Low S
8 Moderate High High Low L
9 High Moderate Moderate Low M
10 High High Moderate Low S,M,L
11 High High High Low M,L
Fig. 2: Our proposed Data-Driven Fuzzy Logic architecture to score HGV driving behaviour
Fig. 3: Driving profiles (Consequents) membership functions
D. Generating fuzzy sets and membership functions
For all subgroups, the quartiles of the boxplots for each driv-
ing incident are used to construct their respective membership
functions (antecedents MFs). For example, Fig. 5 shows the
membership functions for the four driving features of short
daily average mileage subgroup generated from their boxplot
distributions shown in Fig. 4. It is important to note that for
each feature, the three fuzzy sets represent the corresponding
low, moderate and high distributions respectively. The eleven
driving profiles i.e. the consequents of the FLS obtained
by the classification framework, are represented in Fig. 3.
The consequent membership functions are equally distributed
between 0 and 100, which represents the driving risk level.
E. Fuzzy rule generation, inference and defuzzification
The Wang-Mendel method [13], one of the most commonly
used rule generation technique, is employed to generate the
rules of the FLS. In our dataset, the four different features
and output (profile) pairs are denoted as:
(x1
HB , x1
OS , x1
ET , x1
ORev ;y1
P R),
(x2
HB , x2
OS , x2
ET , x2
ORev ;y2
P R),
.
.
..
.
.
(xN
HB , xN
OS , xN
ET , xN
ORev ;yN
P R),
(1)
where Nis the number of input-output data pairs and each
abbreviation denotes the corresponding feature (e.g. HB =
Harsh Braking incident).
By using the 2-stage classification framework (see details in
Section III-C), the domain and three Membership Functions
(MFs) i.e. Low, Medium, High, are defined for each feature in
the datasets. Then, the output domain is defined by splitting
11 MFs as shown in Fig. 3 and each driver is assigned by a
profile number between 1-11. After constructing the MFs and
consequents for all the input-output pairs, each pair is assigned
to the corresponding MFs so that the model rules are generated
along with their weights. By following the practice from
[13], the rule weights are utilised to implement rule reduction
procedure on the conflicting rules. Table III shows the rules for
the short mileage subgroup generated in a data-driven manner
(due to space constraints, we only show fuzzy rules of short
mileage subgroup). For a more elaborate description of the
Wang-Mendel rule generation method, please refer [13].
Next, we use the Mamdani inference technique [24], a
commonly used rule-based inference method, to combine
the antecedents fuzzy sets using the fuzzy rules and fuzzy
operators (min and max for T-norm and T-conorm operators) to
obtain output fuzzy sets. The output fuzzy sets are combined to
calculate a crisp value (driver’s score) using centroid defuzzi-
fication [25] (a common and fast defuzzification technique).
IV. RES ULT S AN D DISCUSSION
A. Driving stereotypes
The 2-stage classification framework and profile labelling
algorithm uncovers eleven HGV driving incident stereotypes
(Table II), which are used for generating the membership
functions (Fig. 3 and 5) to capture the uncertainties in the
driving incidents and stereotypes. Profile 1 represents a ‘very
low’ risk level stereotype with low incidents, except for over
revving that has a moderate number of incidents. Profiles 2 and
3 can be considered ‘low’ risk level stereotypes because of low
harsh braking and over-speeding incidents. Profiles 4, 5 and 6
are similar except for their economic behaviour (over revving
incidents) and can be described as ‘moderate’ risk level
stereotypes with moderate harsh braking and over-speeding
incidents. Profiles 7 and 8 are ‘high’ risk level and speedy
Fig. 4: Boxplots for the driving features of short mileage drivers
Fig. 5: Driving incidents membership functions for short mileage drivers
profiles with high duration of over speeding incidents while
profile 9 represents a ‘high’ risk level and aggressive profile
due to high harsh braking and excess throttling incidents.
Profile 10 can be observed to be a ‘very high’ risk level driving
stereotype with high harsh braking and over-speeding incidents
and profile 11 can be described as ’extremely’ risky and the
most aggressive of all the profiles with high harsh braking,
over-speeding and excessive throttling. The output domain (0
to 100) is defined by placing these 11 profiles according to
our subjective risk level interpretations from ‘very low’ to
‘extremely’ risky profiles.
B. Uncertainties in driving incidents
The MFs produced by the 11 profiles capture the uncertain-
ties within driving incidents caused by sensor readings and
multiple subjective interpretations (e.g., subjectivity produced
by the profile labelling algorithm). Such uncertainty can be
captured and visualised in the over revving incident MF in
Fig. 6 with the fuzzy sets: Low, Moderate and High. The
uncertainty of whether the over revving value, 0.01, should
be considered ‘Low’, ‘Moderate’ or ‘High’ is represented by
the MFs as follows:
TABLE III: Fuzzy rules for short mileage drivers
Rule Harsh braking Over speeding Excessive Throttle Over Revs Profile
No No
1 LOW AND (LOW OR MODERATE) AND (LOW OR MODERATE) AND (LOW OR MODERATE) 1
2 MODERATE AND LOW AND (LOW OR MODERATE) AND (LOW OR MODERATE) 1
3 MODERATE AND MODERATE AND (LOW OR MODERATE) AND MODERATE 1
4 HIGH AND LOW AND (LOW OR MODERATE) AND (LOW OR MODERATE) 1
5 HIGH AND MODERATE AND LOW AND MODERATE 1
6 LOW AND MODERATE AND HIGH AND LOW 3
7 (MODERATE OR HIGH) AND LOW AND HIGH AND MODERATE 3
8 LOW AND (LOW OR MODERATE) AND HIGH AND MODERATE 5
9 MODERATE AND MODERATE AND HIGH AND (LOW OR MODERATE) 5
10 (LOW OR MODERATE) AND LOW AND HIGH AND LOW 5
11 HIGH AND MODERATE AND HIGH AND LOW 5
12 LOW AND HIGH AND MODERATE AND LOW 7
13 LOW AND HIGH AND HIGH AND MODERATE 7
14 LOW AND HIGH AND LOW AND LOW 10
15 MODERATE AND MODERATE AND (LOW OR MODERATE) AND LOW 10
16 (MODERATE OR HIGH) AND HIGH AND (LOW OR MODERATE OR HIGH) AND (LOW OR MODERATE) 10
17 HIGH AND MODERATE AND (LOW OR MODERATE) AND LOW 10
18 HIGH AND MODERATE AND (MODERATE OR HIGH) AND MODERATE 10
Note: Profiles 2, 4, 6, 8, 9 and 11 are found in the other driving subgroups (i.e., Medium and Long) as shown in Table II.
Fig. 6: Over revving membership function produced by our
Fuzzy Logic approach
ORLow (0.01) = 0.75
ORM oderate(0.01) = 0.35
ORH igh(0.01) = 0
These fuzzy values are used during rule evaluation to
determine the driver’s risk level or score. For example, if we
consider Rule 7 of short mileage drivers in Table III:
IF HB is either ‘MODERATE’ OR ‘HIGH’ AND OS
is ‘Low’ AND ET is ‘HIGH’ AND ORev is ‘MODER-
ATE’ THEN the driver’s risk level is ‘LOW’
The degree of membership 0.35 represented by
ORModer ate(0.01) as well as those obtained from the
other driving incidents will be used to evaluate the firing
strength of the rule and the rules will be used to obtain the
driving score.
C. Scoring HGV driving behaviour
The Mamdani inference system combines the antecedents
(driving incidents) fuzzy sets using the knowledge base
(rules) generated by the Wang-Mendel method to produce
rule strengths. The rule strengths are then applied to the
consequents to produce inferred output fuzzy sets. These
output fuzzy sets are combined using centroid defuzzification,
which considers the center of the area of the output fuzzy sets
to produce the driving risk score. The scores or risk levels
range from 0 to 100, where 0 represents low risk drivers and
100 is high risk drivers. Fig. 7 shows an example of output
fuzzy sets obtained from the FIS. The shaded regions in the
figure represent the fired consequents i.e., for certain values
of the driving incidents, the rule strengths for the consequent
fuzzy sets ‘profile 5’ and ‘profile 10’ (denoted by the purple
and blue shades) are 0.8 and 0.05 respectively. This means
that the driver’s risky behaviour belongs to profile 5 with
a degree of 0.8 and to profile 10 with a degree of 0.05.
The defuzzification method computes the center of the area
between the shaded regions which represents the driver’s risk
level i.e., 39. Furthermore, the logical rule knowledge base
(IF-THEN rules) coupled with the linguistic fuzzy sets make
the proposed system explainable to end-users and stakeholders
in the HGV community. This framework can be applied to the
development of driver alert systems by providing feedback to
drivers about the risk level of their driving incidents when they
exceed predefined risk scores.
V. CONCLUSION AND FUTURE WORK
This study captures the uncertainties within driving features
and stereotypes produced by sensor readings and varying
subjective interpretations using a data-driven Fuzzy Logic
System. Our proposed framework provides a risk level for
driving incidents within the range 0 to 100, where 0 is a low
risk driver and 100 is a high risk driver.
To generate the antecedent and consequent MFs, the afore-
mentioned 2-stage classification framework is employed on
four years of HGV driving incidents data (number of harsh
braking events, over speeding duration, excessive throttling
duration and number of over revving events) in the UK i.e.
2014, 2015, 2016 and 2017. A total of eleven stereotypes
were uncovered. Subsequently, the Wang-Mendel method was
Fig. 7: A sample result of our FIS to determine the risk level
of a certain driving behaviour
employed to generate the fuzzy rules and the Mamdani infer-
ence system for scoring HGV drivers in the range 0 to 100.
Although the proposed framework requires further validation
in real time using experts, it can be incorporated into driver
assistance and alert systems for monitoring and providing
feedback to the drivers in real time when they exhibit risky
driving styles.
Lastly, some limitations and scope for future works are
discussed.
Limited number of features: This study is limited in
the number of driving features analysed (only four driving
features). In the future, we plan on including more driving
features such as harsh cornering, lane changing and close
following, which seem to be relevant for road safety.
Other facets of driver behaviour: This study only explores
telematics data. In future, we plan on capturing other facets
of driving behaviour such as driver distraction, workload,
situation awareness, stress, fatigue and attention, to provide
a more reliable holistic view of driving behaviour to assist
stakeholders in the decision making process.
ACK NOW LE DG EM EN T
The first author is supported by the Horizon Centre for
Doctoral Training at the University of Nottingham (UKRI
Grant No. EP/L015463/1) and by Microlise.
REFERENCES
[1] Who. global status report on road safety: World health organization.
2018. Last Accessed: 2-3-2020.
[2] R. Kalsoom and Z. Halim. Clustering the driving features based on data
streams. In Multi Topic Conference (INMIC), 2013 16th International,
pages 89–94, Dec 2013.
[3] C. Saiprasert, S. Thajchayapong, T. Pholprasit, and C. Tanprasert. Driver
behaviour profiling using smartphone sensory data in a v2i environment.
In 2014 Int Conf on Connected Vehicles and Expo, pages 552–557, 2014.
[4] G. P. Figueredo, U. Agrawal, J. M. Mase, M. Mesgarpour, C. Wagner,
D. Soria, J. M. Garibaldi, P.O. Siebers, and R. I. John. Identifying heavy
goods vehicle driving styles in the united kingdom. IEEE Transactions
on Intelligent Transportation Systems (in press), tbc:tbc, 2018.
[5] Utkarsh Agrawal, Jimiama Mafeni Mase, Grazziela P Figueredo, Chris-
tian Wagner, Mohammad Mesgarpour, and Robert I John. Towards real-
time heavy goods vehicle driving behaviour classification in the united
kingdom. In 2019 IEEE Intelligent Transportation Systems Conference
(ITSC), pages 2330–2336. IEEE, 2019.
[6] Jimiama M.M. Mafeni, Peter Chapman, Grazziela P. Figueredo, and
Mercedes T. Torres.
[7] Mohammed S Majdi, Sundaresh Ram, Jonathan T Gill, and Jeffrey J
Rodr´
ıguez. Drive-net: Convolutional network for driver distraction
detection. In 2018 IEEE Southwest Symposium on Image Analysis and
Interpretation (SSIAI), pages 1–4. IEEE, 2018.
[8] Jimiama M.M. Mafeni, Peter Chapman, Grazziela P. Figueredo, and
Mercedes T. Torres. A hybrid deep learning approach for driver
distraction detection. unpublished, 2020.
[9] Jeffrey S Hickman and Richard J Hanowski. Use of a video monitoring
approach to reduce at-risk driving behaviors in commercial vehicle
operations. Transportation research part F: traffic psychology and
behaviour, 14(3):189–198, 2011.
[10] Jennifer L Bell, Matthew A Taylor, Guang-Xiang Chen, Rachel D
Kirk, and Erin R Leatherman. Evaluation of an in-vehicle monitoring
system (ivms) to reduce risky driving behaviors in commercial drivers:
Comparison of in-cab warning lights and supervisory coaching with
videos of driving behavior. Journal of safety research, 60:125–136,
2017.
[11] Hannah J Foy and Peter Chapman. Mental workload is reflected in driver
behaviour, physiology, eye movements and prefrontal cortex activation.
Applied ergonomics, 73:90–99, 2018.
[12] Jimiama M.M. Mafeni, Shazmin Majid, Mohammad Mesgarpour, Mer-
cedes T. Torres, Grazziela P. Figueredo, and Peter Chapman. Evaluating
the impact of heavy goods vehicle driver monitoring and coaching to
reduce risky behaviour. unpublished, 2020.
[13] L-X Wang and Jerry M Mendel. Generating fuzzy rules by learning from
examples. IEEE Trans. Syst., Man, Cybern., 22(6):1414–1427, 1992.
[14] Arshdeep Kaur and Amrit Kaur. Comparison of mamdani-type and
sugeno-type fuzzy inference systems for air conditioning system. In-
ternational journal of soft computing and engineering, 2(2):323–325,
2012.
[15] Zoran Constantinescu, Cristian Marinoiu, and Monica Vladoiu. Driving
style analysis using data mining techniques. Int J of Comp Communi-
cations & Control, 5(5):654–663, 2010.
[16] Adrian B Ellison, SP Greaves, and Rhonda Daniels. Profiling drivers’
risky behaviour towards all road users. In A safe system: expanding the
reach: Australasian College of Road Safety national conference, 2012.
[17] Utkarsh Agrawal, Daniele Soria, Christian Wagner, Jonathan Garibaldi,
Ian O Ellis, John MS Bartlett, David Cameron, Emad A Rakha, and
Andrew R Green. Combining clustering and classification ensembles:
A novel pipeline to identify breast cancer profiles. Artificial Intelligence
in Medicine, 2019.
[18] Lotfi A Zadeh. Fuzzy sets. Information and control, 8(3):338–353,
1965.
[19] Ahmad Aljaafreh, Nabeel Alshabatat, and Munaf S Najim Al-Din.
Driving style recognition using fuzzy logic. In 2012 IEEE International
Conference on Vehicular Electronics and Safety (ICVES 2012), pages
460–463. IEEE, 2012.
[20] T Imkamon, P Saensom, P Tangamchit, and P Pongpaibool. Detection of
hazardous driving behavior using fuzzy logic. In 2008 5th International
Conference on Electrical Engineering/Electronics, Computer, Telecom-
munications and Information Technology, volume 2, pages 657–660.
IEEE, 2008.
[21] Sakchai Boonmee and Poj Tangamchit. Portable reckless driving
detection system. In 2009 6th International Conference on Electrical
Engineering/Electronics, Computer, Telecommunications and Informa-
tion Technology, volume 1, pages 412–415. IEEE, 2009.
[22] Rui Ara´
ujo, ˆ
Angela Igreja, Ricardo De Castro, and Rui Esteves Araujo.
Driving coach: A smartphone application to evaluate driving efficient
patterns. In 2012 IEEE Intelligent Vehicles Symposium, pages 1005–
1010. IEEE, 2012.
[23] Microlise. Microlise (telematics, transport and fleet management solu-
tions). Available at: http://www.microlise.com/, Last accessed 30th April
2018.
[24] Ebrahim H Mamdani and Sedrak Assilian. An experiment in linguistic
synthesis with a fuzzy logic controller. International journal of man-
machine studies, 7(1):1–13, 1975.
[25] Antonio Ginart, Gustavo Sanchez, I Links, and G Back. Fast defuzzi-
fication method based on centroid estimation. Applied Modelling and
Simulation, 58(1):20–25, 2002.
... The current literature on intelligence-supported road safety assessment of commercial driving are limited to the manner by which drivers operate vehicle controls (Figueredo et al. [2015(Figueredo et al. [ , 2018, Mase et al. [2020a], Agrawal et al. [2019], Mehdizadeh et al. [2021], , Hébert et al. [2021], Satrawala et al. [2022]) and do not consider the impact of inevitable contextual factors on driving performance, such as individual drivers' physical and mental states, weather conditions, traffic conditions, road geometry, road types, and work schedules. These factors influence drivers' responses, and therefore need to be considered to better understand the circumstances that led to a driver's performance or/and produce context-specific driving risk assessments. ...
... Bob was driving in favourable conditions e.g., good weather and no time pressure for delivery, while Alice was driving under poor weather conditions with pressure to deliver on time. Current driver-assessment approaches (such as Mase et al. [2020a], Agrawal et al. [2019], Mehdizadeh et al. [2021], , Hébert et al. [2021], Satrawala et al. [2022]) fail to include the contextual information and provide the same assessment for both drivers. In this paper, we argue that driving performance and risk in both cases is not the same and that context should be taken into account. ...
... The current literature on intelligent commercial driver-assessment employ different computational and artificial intelligence techniques on driver data that represents the manner by which drivers operate vehicle controls, such as, driving incident data (Figueredo et al. [2018], Mase et al. [2020a], Agrawal et al. [2019], Mehdizadeh et al. [2021]) and GPS data ], Hébert et al. [2021, Satrawala et al. [2022]) or/and a narrow subset of contextual factors , Satrawala et al. [2022], Feng et al. [2017], Öz et al. [2010]). This approach of analysing driving performance and risk using solely driver data could potentially lead to incomplete and unfair assessments as real-world commercial driving is mostly affected by drivers' personal traits and external conditions, and the influence of those contextual factors are not represented in driver data. ...
Preprint
Full-text available
Commercial driving is a complex multifaceted task influenced by drivers' personal traits and external contextual factors, such as weather, traffic, and road conditions. Current intelligent commercial driving assessment systems only focus on the manner by which drivers operate vehicle controls when analysing driving performance and the impact of driver behaviour on road safety, potentially producing incomplete and unfair assessments. In this paper, we introduce a stakeholder co-designed approach to complement current driver-assessment systems and provide more context-aware, comprehensive and we hope--fairer assessments of commercial driving. The approach engages key stakeholders in the commercial driving sector to capture the impact of personal traits and external contextual factors on driving performance and provides a systematic approach to embed this knowledge into the assessment of commercial drivers. We demonstrate the effectiveness and utility of the approach by illustrating its end-to-end deployment in the context of heavy goods vehicle driving assessment.
... Road traffic fatalities have been on the rise for the last few years [4]. In this regard, researchers have begun to explore the benefits of artificial intelligence when applied to a diverse range of problems, including, but not limited to, understanding driving behaviors, mitigating road incidents, and developing driver's assistance systems [5,6]. ...
... As opposed to using handcrafted features, the automatic extraction of deep features has cause a paradigm shift towards the usage of convolutional neural networks. Various studies have used recurrent neural networks (RNNs) for extraction of spectral information [6] and convolutional neural networks (CNNs) for spatial information extraction [13,14] to classify various driving postures, and they yielded better results. ...
Article
Full-text available
In the last decade, distraction detection of a driver gained a lot of significance due to increases in the number of accidents. Many solutions, such as feature based, statistical, holistic, etc., have been proposed to solve this problem. With the advent of high processing power at cheaper costs, deep learning-based driver distraction detection techniques have shown promising results. The study proposes ReSVM, an approach combining deep features of ResNet-50 with the SVM classifier, for distraction detection of a driver. ReSVM is compared with six state-of-the-art approaches on four datasets, namely: State Farm Distracted Driver Detection, Boston University, DrivFace, and FT-UMT. Experiments demonstrate that ReSVM outperforms the existing approaches and achieves a classification accuracy as high as 95.5%. The study also compares ReSVM with its variants on the aforementioned datasets.
... F. proposed an explainable model for drivers' fatigue prediction using Gaussian Process Boosting and SHAP. In another work, an explainable fuzzy logic system was proposed that can capture the uncertainties within driving features and classify drivers in terms of risky driving styles (Mase et al., 2020). An explainable model for detecting riding patterns of motorbikes was developed by Leyli abadi and Boubezoul (2021). ...
Thesis
Artificial Intelligence (AI) is recognized as advanced technology that assists in decision-making processes with high accuracy and precision. However, many AI models are generally appraised as black boxes due to their reliance on complex inference mechanisms. The intricacies of how and why these AI models reach a decision are often not comprehensible to human users, resulting in concerns about the acceptability of their decisions. Previous studies have shown that the lack of associated explanation in a human-understandable form makes the decisions unacceptable to end-users. Here, the research domain of Explainable AI (XAI) provides a wide range of methods with the common theme of investigating how AI models reach to a decision or explain it. These explanation methods aim to enhance transparency in Decision Support Systems (DSS), particularly crucial in safety-critical domains like Road Safety (RS) and Air Traffic Flow Management (ATFM). Despite ongoing developments, DSSs are still in the evolving phase for safety-critical applications. Improved transparency, facilitated by XAI, emerges as a key enabler for making these systems operationally viable in real-world applications, addressing acceptability and trust issues. Besides, certification authorities are less likely to approve the systems for general use following the current mandate of Right to Explanation from the European Commission and similar directives from organisations across the world. This urge to permeate the prevailing systems with explanations paves the way for research studies on XAI concentric to DSSs. To this end, this thesis work primarily developed explainable models for the application domains of RS and ATFM. Particularly, explainable models are developed for assessing drivers’ in-vehicle mental workload and driving behaviour through classification and regression tasks. In addition, a novel method is proposed for generating a hybrid feature set from vehicular and electroencephalography (EEG) signals using mutual information (MI). The use of this feature set is successfully demonstrated to reduce the efforts required for complex computations of EEG feature extraction. The concept of MI was further utilized in generating human-understandable explanations of mental workload classification. For the domain of ATFM, an explainable model for flight take-off time delay prediction from historical flight data is developed and presented in this thesis. The gained insights through the development and evaluation of the explainable applications for the two domains underscore the need for further research on the advancement of XAI methods. In this doctoral research, the explainable applications for the DSSs are developed with the additive feature attribution (AFA) methods, a class of XAI methods that are popular in current XAI research. Nevertheless, there are several sources from the literature that assert that feature attribution methods often yield inconsistent results that need plausible evaluation. However, the existing body of literature on evaluation techniques is still immature offering numerous suggested approaches without a standardized consensus on their optimal application in various scenarios. To address this issue, comprehensive evaluation criteria are also developed for AFA methods as the literature on XAI suggests. The proposed evaluation process considers the underlying characteristics of the data and utilizes the additive form of Case-based Reasoning, namely AddCBR. The AddCBR is proposed in this thesis and is demonstrated to complement the evaluation process as the baseline to compare the feature attributions produced by the AFA methods. Apart from generating an explanation with feature attribution, this thesis work also proposes the iXGB - interpretable XGBoost. iXGB generates decision rules and counterfactuals to support the output of an XGBoost model thus improving its interpretability. From the functional evaluation, iXGB demonstrates the potential to be used for interpreting arbitrary tree-ensemble methods. In essence, this doctoral thesis initially contributes to the development of ideally evaluated explainable models tailored for two distinct safety-critical domains. The aim is to augment transparency within the corresponding DSSs. Additionally, the thesis introduces novel methods for generating more comprehensible explanations in different forms, surpassing existing approaches. It also showcases a robust evaluation approach for XAI methods.
... However, their analysis only looked at two types of distractions: looking in front and not looking in front postures. Mase et al. (2020) proposed a hybrid deep learning model that combines pretrained InceptionV3 CNNs and stacked Bidirectional LSTMs for detecting the distracted behavior with an accuracy of around 92.70%. Ou & Karray (2019) proposed a generative adversarial network-based driver distraction detection system (GANs). ...
Article
Full-text available
Recently, the number of road accidents has been increased worldwide due to the distraction of the drivers. This rapid road crush often leads to injuries, loss of properties, even deaths of the people. Therefore, it is essential to monitor and analyze the driver's behavior during the driving time to detect the distraction and mitigate the number of road accident. To detect various kinds of behavior like- using cell phone, talking to others, eating, sleeping or lack of concentration during driving; machine learning/deep learning can play significant role. However, this process may need high computational capacity to train the model by huge number of training dataset. In this paper, we made an effort to develop CNN based method to detect distracted driver and identify the cause of distractions like talking, sleeping or eating by means of face and hand localization. Four architectures namely CNN, VGG-16, ResNet50 and MobileNetV2 have been adopted for transfer learning. To verify the effectiveness, the proposed model is trained with thousands of images from a publicly available dataset containing ten different postures or conditions of a distracted driver and analyzed the results using various performance metrics. The performance results showed that the pre-trained MobileNetV2 model has the best classification efficiency.
... Extensive advances in Machine Learning (ML) have demonstrated its potential in successfully addressing complex problems in safety-critical areas, such as in healthcare [1][2][3], aerospace [4][5][6], driver distraction [7][8][9], civil engineering [10,11], and manufacturing [12,13]. Historically, however, many ML models, especially those involving neural networks, are viewed as 'black boxes', where little is known about the decision-making process. ...
Article
Full-text available
When machine learning supports decision-making in safety-critical systems, it is important to verify and understand the reasons why a particular output is produced. Although feature importance calculation approaches assist in interpretation, there is a lack of consensus regarding how features’ importance is quantified, which makes the explanations offered for the outcomes mostly unreliable. A possible solution to address the lack of agreement is to combine the results from multiple feature importance quantifiers to reduce the variance in estimates and to improve the quality of explanations. Our hypothesis is that this leads to more robust and trustworthy explanations of the contribution of each feature to machine learning predictions. To test this hypothesis, we propose an extensible model-agnostic framework divided in four main parts: (i) traditional data pre-processing and preparation for predictive machine learning models, (ii) predictive machine learning, (iii) feature importance quantification, and (iv) feature importance decision fusion using an ensemble strategy. Our approach is tested on synthetic data, where the ground truth is known. We compare different fusion approaches and their results for both training and test sets. We also investigate how different characteristics within the datasets affect the quality of the feature importance ensembles studied. The results show that, overall, our feature importance ensemble framework produces 15% less feature importance errors compared with existing methods. Additionally, the results reveal that different levels of noise in the datasets do not affect the feature importance ensembles’ ability to accurately quantify feature importance, whereas the feature importance quantification error increases with the number of features and number of orthogonal informative features. We also discuss the implications of our findings on the quality of explanations provided to safety-critical systems.
... The growing proliferation of computers in cars makes it possible to use these computers to assess driver behavior. The literature shows an increasing number of applications that use forward-facing or driver-facing cameras, sometimes combined with accelerationbased triggers, to detect drowsiness and distraction (Chowdhury et al., 2018;Kashevnik et al., 2019;Lechner et al., 2019;Ramzan et al., 2019;Sikander & Anwar, 2018) and unsafe driving behavior (Hickman & Hanowski, 2011;Mase et al., 2020). Other types of systems rely on in-vehicle data recorders (Shimshoni et al., 2015) or smartphones for driver assessment (e.g., Bergasa et al., 2014;Farah et al., 2014;Musicant & Lotan, 2016;Shanly et al., 2018), and see Michelaraki et al. (2021) for a review on post-trip feedback solutions, including smartphone apps, gamification approaches, and reward schemes. ...
Article
Full-text available
Vehicles are increasingly equipped with sensors that capture the state of the driver, the vehicle, and the environment. These developments are relevant to formal driver testing, but little is known about the extent to which driving examiners would support the use of sensor data in their job. This semi-structured interview study examined the opinions of 37 driving examiners about data-driven assessment of test candidates. The results showed that the examiners were supportive of using data to explain their pass/fail verdict to the candidate. According to the examiners, data in an easily accessible form such as graphs of eye movements, headway, speed, or braking behavior, and color-coded scores, supplemented with camera images, would allow them to eliminate doubt or help them convince disagreeing test-takers. The examiners were skeptical about higher levels of decision support, noting that forming an overall picture of the candidate's abilities requires integrating multiple context-dependent sources of information. The interviews yielded other possible applications of data collection and sharing, such as selecting optimal routes, improving standardization, and training and pre-selecting candidates before they are allowed to take the driving test. Finally, the interviews focused on an increasingly viable form of data collection: simulator-based driver testing. This yielded a divided picture, with about half of the examiners being positive and half negative about using simulators in driver testing. In conclusion, this study has provided important insights regarding the use of data as an explanation aid for examiners.
... The growing proliferation of computers in cars makes it possible to use these computers to assess driver behavior. The literature shows an increasing number of applications that use forward-facing or driverfacing cameras, sometimes combined with acceleration-based triggers, to detect drowsiness and distraction (Chowdhury et al., 2018;Kashevnik et al., 2019;Lechner et al., 2019;Ramzan et al., 2019;Sikander & Anwar, 2018) and unsafe driving behavior (Hickman & Hanowski, 2011;Mase et al., 2020). ...
Preprint
Full-text available
Vehicles are increasingly equipped with sensors that capture the state of the driver, the vehicle, and the environment. These developments are relevant to formal driver testing, but little is known about the extent to which driving examiners would support the use of sensor data in their job. This semi-structured interview study examined the opinions of 37 driving examiners about data-driven assessment of test candidates. The results showed that the examiners were supportive of using data to explain their pass/fail verdict to the candidate. According to the examiners, data in an easily accessible form such as graphs of eye movements, headway, speed, or braking behavior, and color-coded scores, supplemented with camera images, would allow them to eliminate doubt or help them convince disagreeing test-takers. The examiners were skeptical about higher levels of decision support, noting that forming an overall picture of the candidate's abilities requires integrating multiple context-dependent sources of information. The interviews yielded other possible applications of data collection and sharing, such as selecting optimal routes, improving standardization, and training and pre-selecting candidates before they are allowed to take the driving test. Finally, the interviews focused on an increasingly viable form of data collection: simulator-based driver testing. This yielded a divided picture, with about half of the examiners being positive and half negative about using simulators in driver testing. In conclusion, this study has provided important insights regarding the use of data as an explanation aid for examiners. Future research should consider the views of test candidates and experimentally evaluate different forms of data-driven support in the driving test.
Article
Full-text available
With the widespread use of machine learning to support decision-making, it is increasingly important to verify and understand the reasons why a particular output is produced. Although post-training feature importance approaches assist this interpretation, there is an overall lack of consensus regarding how feature importance should be quantified, making explanations of model predictions unreliable. In addition, many of these explanations depend on the specific machine learning approach employed and on the subset of data used when calculating feature importance. A possible solution to improve the reliability of explanations is to combine results from multiple feature importance quantifiers from different machine learning approaches coupled with re-sampling. Current state-of-the-art ensemble feature importance fusion uses crisp techniques to fuse results from different approaches. There is, however, significant loss of information as these approaches are not context-aware and reduce several quantifiers to a single crisp output. More importantly, their representation of “importance” as coefficients may be difficult to comprehend by end-users and decision makers. Here we show how the use of fuzzy data fusion methods can overcome some of the important limitations of crisp fusion methods by making the importance of features easily understandable.
Conference Paper
Full-text available
The World Health Organisation reports distracted driving actions as the main cause of road traffic accidents. Current studies to detect distraction postures focus on analysing spatial features of images using Convolutional Neural Networks (CNN). However, approaches addressing both spectral and spatial features of images for driving distraction are scarce. Our hypothesis is that deep learning approaches can further be exploited to consider spatial and spectral features, so that the spatial features capture the spatial information within the image and the spectral features capture the spectral correlations among the image channels. This paper introduces a novel driver distraction posture detection method using CNNs and stacked Bidirectional Long Short Term Memory (BiLSTM) Networks to capture the spectral-spatio features of the images. The proposed methodology consists of two stages: first, we automatically learn the spatial posture features using pre-trained CNNs. Subsequently, we utilise BiLSTMs architecture to extract the spectral features amongst the stacked feature maps from the pre-trained CNNs. Our proposed approach is evaluated on the American University in Cairo (AUC) Distracted Driver Dataset, the most comprehensive and detailed dataset on driver distraction postures to date. Results show that our approach beats state-of-the-art CNN models with an average classification accuracy of 92.7%.
Article
Full-text available
Determining the impact of driver-monitoring technologies to improve risky driving behaviours allows stakeholders to understand which aspects of on-board sensors and feedback need enhancement to promote road safety and education. This study investigates the influence of camera monitoring on Heavy Goods Vehicle (HGV) drivers' risky behaviours. We also assess whether monitoring affects individual driving events further when coupled with safe driving practices coaching. We evaluate the outcome of those practices on three telematics incidents heavily reliant on driving errors and violations, i.e., the number of vehicle harsh braking, harsh cornering and over speeding incidents. The objective is to understand how frequently individual incidents caused by risky driving behaviour occur (a) without camera monitoring and without any coaching; (b) after camera installation; and (c) after camera installation and coaching. We investigate two commercial HGV companies (Company 1 and Company 2) with 263 and 269 vehicles, respectively, over a 16 months period, from which the first 8 months contain data collected before the installation of cameras (baseline) and the rest of the dataset contains incident counts after the installation of cameras (intervention). Company 1 provides coaching during the intervention phase while Company 2 does not offer coaching. Our analysis considers the baseline and the intervention phases during the same seasons to eliminate any possible bias due to the influence of weather on driving behaviour. Results show an overall significant reduction in the mean frequency of harsh braking incidents from baseline to intervention by 16.82% in Company 1 and 4.62% in Company 2, and a significant reduction in the mean frequency of over speeding incidents from baseline to intervention by 34.29% in Company 1 and 28.13% in Company 2. Furthermore, the effect of coaching has a significant difference in reducing the frequency of harsh braking (p = 0.011) and harsh cornering (p < 0.001) compared to just camera monitoring. These results suggest that coaching interventions are more effective in reducing driving errors while monitoring reduces both driving errors and violations.
Conference Paper
Full-text available
Determining the driving styles and the factors causing incidents in real time could assist stakeholders to promote actions and develop feedback systems to reduce risks, costs and to increase safety in roads. This paper presents a classification system for Heavy Goods Vehicles (HGVs) drivers, using a core set of driving pattern stereotypes which were uncovered from driving incidents across three years i.e. 2014, 2015 and 2016. To achieve that, the driving stereotypes are established by employing a 2-stage ensemble classification framework followed by a profile labelling algorithm to define the set of driving stereotypes. Very similar stereotypes are later merged to form the core driving stereotypes for UK HGV drivers. Upon establishing core driving stereotypes across these three years, a decision tree classifier learns the classification rules to identify the driving stereotypes for the HGV drivers in a new dataset. High accuracy is achieved, indicating that the core driving patterns uncovered in this work could potentially be employed to identify UK HGV driving patterns in real-time.
Article
Full-text available
Breast Cancer is one of the most common causes of cancer death in women, representing a very complex disease with varied molecular alterations. To assist breast cancer prognosis, the classification of patients into biological groups is of great significance for treatment strategies. Recent studies have used an ensemble of multiple clustering algorithms to elucidate the most characteristic biological groups of breast cancer. However, the combination of various clustering methods resulted in a number of patients remaining unclustered. Therefore, a framework still needs to be developed which can assign as many unclustered (i.e. biologically diverse) patients to one of the identified groups in order to improve classification. Therefore, in this paper we develop a novel classification framework which introduces a new ensemble classification stage after the ensemble clustering stage to target the unclustered patients. Thus, a step-by-step pipeline is introduced which couples ensemble clustering with ensemble classification for the identification of core groups, data distribution in them and improvement in final classification results by targeting the unclustered data. The proposed pipeline is employed on a novel real world breast cancer dataset and subsequently its robustness and stability are examined by testing it on standard datasets. The results show that by using the presented framework, an improved classification is obtained. Finally, the results have been verified using statistical tests, visualisation techniques, cluster quality assessment and interpretation from clinical experts.
Article
Full-text available
Although driving behaviour has been largely studied amongst private motor vehicles drivers, the literature addressing heavy goods vehicle (HGV) drivers is scarce. Identifying the existing groups of driving stereotypes and their proportions enables researchers, companies and policy makers to establish group-specific strategies to improve safety and economy. In addition, insights into driving styles can assist predicting drivers' reactions and therefore enable the modelling of interactions between vehicles and the possible obstacles encountered on a journey. Consequently, there are also contributions to the research and development of autonomous vehicles and smart roads. In this study our interest lies in investigating driving behaviour within the HGV community in the United Kingdom (UK). We conduct the analysis of a telematics dataset containing incident information on 21,193 HGV drivers across the UK. We are interested in answering two research questions: (i) What groups of behaviour are we able to uncover? (ii) How do these groups complement current findings in the literature? To answer these questions we apply a two-stage data analysis methodology involving consensus clustering and ensemble classification to the dataset. Through the analysis, eight patterns of behaviour are uncovered. It is also observed that although our findings have similarities to those from previous work on driving behaviour, further knowledge is obtained, such as extra patterns and driving traits arising from vehicle and road characteristics.
Conference Paper
Full-text available
To help prevent motor vehicle accidents, there has been significant interest in finding an automated method to recognize signs of driver distraction, such as talking to passengers, fixing hair and makeup, eating and drinking, and using a mobile phone. In this paper, we present an automated supervised learning method called Drive-Net for driver distraction detection. Drive-Net uses a combination of a convolutional neural network (CNN) and a random decision forest for classifying images of a driver. We compare the performance of our proposed Drive-Net to two other popular machine-learning approaches: a recurrent neural network (RNN), and a multi-layer perceptron (MLP). We test the methods on a publicly available database of images acquired under a controlled environment containing about 22425 images manually annotated by an expert. Results show that Drive-Net achieves a detection accuracy of 95%, which is 2% more than the best results obtained on the same database using other methods.
Article
Full-text available
Problem: Roadway incidents are the leading cause of work-related death in the United States. Methods: The objective of this research was to evaluate whether two types of feedback from a commercially available in-vehicle monitoring system (IVMS) would reduce the incidence of risky driving behaviors in drivers from two companies. IVMS were installed in 315 vehicles representing the industries of local truck transportation and oil and gas support operations, and data were collected over an approximate two-year period in intervention and control groups. In one period, intervention group drivers were given feedback from in-cab warning lights from an IVMS that indicated occurrence of harsh vehicle maneuvers. In another period, intervention group drivers viewed video recordings of their risky driving behaviors with supervisors, and were coached by supervisors on safe driving practices. Results: Risky driving behaviors declined significantly more during the period with coaching plus instant feedback with lights in comparison to the period with lights-only feedback (ORadj=0.61 95% CI 0.43-0.86; Holm-adjusted p=0.035) and the control group (ORadj=0.52 95% CI 0.33-0.82; Holm-adjusted p=0.032). Lights-only feedback was not found to be significantly different than the control group's decline from baseline (ORadj=0.86 95% CI 0.51-1.43; Holm-adjusted p>0.05). Conclusions: The largest decline in the rate of risky driving behaviors occurred when feedback included both supervisory coaching and lights. Practical applications: Supervisory coaching is an effective form of feedback to improve driving habits in the workplace. The potential advantages and limitations of this IVMS-based intervention program are discussed.
Conference Paper
Full-text available
Road traffic accidents prevention and mitigation are important issues that appear on top of the priority list of many countries around the world today. Many measures and approaches have been put in place in terms of policy level as well as technical level. Driver behaviour is one of many key factors that should be seriously considered to improve road safety. This paper proposes a method for driver behaviour profiling using sensory data on smartphones in a vehicle-to-infrastructure environment. Based on driving behaviours with the most risk to causing accidents, the profiling algorithm takes into account sudden driving events which occur during a journey to categorise drivers into different profiles according to their safety levels. The profiling algorithm offers the flexibility to adjust the parameters weightings in order to put an emphasis on specific driving events for different scenarios and applications. The impact on vehicle-to-infrastructure is that the stored driving profiles can be used to generate a norm for a given road section. Approaching vehicles deviating from the norm can be notified in real-time. Moreover, localised dangerous driving events can be clustered together to form a potential blackspot which can be deployed as an advanced warning for approaching vehicles as a location based service. As a result, the risk of road traffic accidents can be reduced. Real-world driving data was collected over two major routes in Thailand with four distinct profiles and five major factors to road accidents.
Article
Mental workload is an important factor during driving, as both high and low levels may result in driver error. This research examined the mental workload of drivers caused by changes in road environment and how such changes impact upon behaviour, physiological responses, eye movements and brain activity. The experiment used functional near infrared spectroscopy to record prefrontal cortex activation associated with changes in mental workload during simulated driving. Increases in subjective ratings of mental workload caused by changes in road type were accompanied by increases in skin conductance, acceleration signatures and horizontal spread of search. Such changes were also associated with increases in the concentration of oxygenated haemoglobin in the prefrontal cortex. Mental workload fluctuates during driving. Such changes can be identified using a range of measures which could be used to inform the development of in-vehicle devices and partially autonomous systems.