Conference PaperPDF Available

Capturing Uncertainty in Heavy Goods Vehicle Driving Behaviour

May 2020

May 2020

DOI:10.1109/ITSC45102.2020.9294378

Conference: IEEE International Conference on Intelligent Transportation Systems
At: Virtual

Authors:

Jimiama Mafeni Mase

University of Nottingham

Utkarsh Agrawal

University of Oxford

Direnc Pekaslan

University of Nottingham

Mercedes Torres Torres

University of Nottingham

Show all 7 authorsHide

There is a growing interest in understanding and identifying risky driving behaviours due to the numerous deaths and fatalities attributed to road traffic accidents. For Heavy Goods Vehicles (HGVs), understanding driving behaviour and its impact on road safety is a subject of interest for researchers, governments and industrial sectors, as they rely on HGVs for the delivery of goods and services. The current literature on HGV driving behaviour uses machine learning techniques to uncover core driving incident stereotypes. However, human behaviour contains different levels of uncertainty and stereotyping driving behaviour with traditional crisp methods may cause information loss and establish unfair boundaries as they do not take context into consideration. Moreover, the sensor readings also have uncertainties, and the driving stereotypes may have different subjective interpretations. In order to capture those intermediate possibilities in driver stereotyping, we propose a data-driven Fuzzy Logic system that can capture the uncertainties within driving features (data) and between driving stereotypes, and classifies drivers according to the risk of their driving styles on a scale of 0 to 100, where 0 is a low risk driver and 100 high risk. The results from telematics data show that our proposed method provides a reliable, fair and explainable approach for real-time identification of HGV driving risk level.

An example of fuzzy sets for a driving feature

…

Our proposed Data-Driven Fuzzy Logic architecture to score HGV driving behaviour

…

Boxplots for the driving features of short mileage drivers

…

Over revving membership function produced by our Fuzzy Logic approach

…

Figures - uploaded by Jimiama Mafeni Mase

Content may be subject to copyright.

Content uploaded by Jimiama Mafeni Mase

Content may be subject to copyright.

Capturing Uncertainty in Heavy Goods Vehicles

Driving Behaviour

Jimiama Mafeni Mase1, Utkarsh Agrawal2, Direnc Pekaslan1, Mohammad Mesgarpour3,

Peter Chapman4, Mercedes Torres Torres1, Grazziela P. Figueredo1

1School of Computer Science, The University of Nottingham

2School of Medicine, University of St Andrews

3Microlise, Farrington Way, Eastwood, Nottingham

4School of Psychology, The University of Nottingham

Abstract—There is a growing interest in understanding and

identifying risky driving behaviours due to the numerous road

fatalities attributed to them. For Heavy Goods Vehicles (HGVs),

understanding driving behaviour and its impact on road safety

is a subject of interest for researchers, the government and

industrial sectors, as they rely on HGVs for the delivery of goods

and services. The current literature on HGV driving behaviour

uses machine learning techniques to uncover core driving incident

stereotypes. However, human behaviour contains different levels

of uncertainty and stereotyping driving behaviour with tradi-

tional crisp methods may cause information loss and establish

unfair boundaries as they do not take context into consideration.

Moreover, the sensor readings also have uncertainties, and the

driving stereotypes may have different subjective interpretations.

In order to capture those intermediate possibilities in driver

stereotyping, we propose a data-driven Fuzzy Logic system that

can capture the uncertainties within driving features (data) and

between driving stereotypes, and classiﬁes drivers according to

the risk of their driving styles on a scale of 0 to 100, where 0 is

a low risk driver and 100 high risk. The results from telematics

data show that our proposed method provides a reliable, fair and

explainable approach for real-time identiﬁcation of HGV driving

risk level.

Index Terms—Fuzzy Logic Systems, Driving Risk Level, Clus-

tering, Uncertainty, Classiﬁcation, Heavy Goods Vehicles, Driving

Incidents, Explainable AI

I. INTRODUCTION

Every year the World Health Organisation releases a report

with statistics on road fatalities and the report consistently

attributes human errors and violations as one of the main

causes [1]. This has prompted a growing research interest in

understanding driving styles [2]–[5], distraction triggers [6]–

[8], driver interactions with vehicle technologies [9], [10] and

psychological factors affecting safe driving [11], [12]. One

area of speciﬁc interest is understanding the different types of

incidents caused by risky driving behaviours of Heavy Goods

Vehicles (HGVs) and its impact on road safety.

The current approaches to monitor driving behaviour are

based on sensors producing large volumes of data. Effective

data-driven tools for the detection and mitigation of driving

errors as well as approaches to report and understand vehicle

incidents due to human errors or violations are therefore neces-

sary. Furthermore, better interpretation of the data produced by

HGVs can assist stakeholders to develop actions, policies and

technologies for safe driving and to minimise risks and costs

within the HGV transportation network. In a recent research by

Agrawal et al. [5], the authors uncovered eleven HGV driving

incident stereotypes using telematics1data from three years

i.e. 2014, 2015 and 2016. Their study is limited with regards

to their methodology, which considers driving features and

stereotypes as crisp sets (where a driving incident value either

belongs to a particular category or not). Such a system may

be bias and cause information loss as driving features and

stereotypes contain different levels of uncertainty produced by

sensor readings and subjective interpretations. For example,

in traditional logic,‘50’ harsh braking incidents in a day can

be considered as ‘high-risk’ while ‘49’ can be regarded as

not ‘high-risk’, when it is obvious that if ‘50’ harsh braking

incidents has some degree of ‘high risk’, ‘49’ should also have

some degree of ‘high-risk’.

In this paper we build upon this aforementioned research

and extend it by presenting a novel data-driven Fuzzy Logic

System (FLS) for capturing the uncertainties in the available

telematics data and scoring HGV drivers according to the risk

of their driving incidents. In order to achieve this, a frame-

work consisting three steps is employed. First, a classiﬁcation

framework and proﬁle labelling algorithm (based on works in

[4] and [5]) is employed on telematics data. The data contains

driving incidents for four years, i.e., 2014, 2015, 2016 and

2017. The objective is to identify core driving stereotypes.

The driving incidents (antecedents) and driving stereotypes

(consequents) are subsequently used to obtain the fuzzy sets

for the FLS. Secondly, Wang-Mendel [13] method is employed

to learn the rules (knowledge base) that maps these antecedents

to consequents of the FLS. Lastly, a Mamdani inference [14]

approach is used to combine these rules to produce a scoring

system that can rate drivers’ risk level (between 0 and 100)

in real-time based on their driving incidents, where a score 0

represents a ‘low risk’ driving behaviour, while 100 is a ‘high

risk’ driving behaviour.

This paper is organised as follows, in Section II we re-

view the literature on understanding driving behaviour using

telematics data. Subsequently, we review FLS basic concepts

and their applications to driving behaviour. In Section III, we

1Telematics are communication systems which link to vehicles’ Controller

Area Network (CAN) databus for capturing vehicle data using sensors, e.g.,

engine revs, fuel consumption, accelerator position etc

provide an overview of the 2-stage classiﬁcation framework,

proﬁle labelling algorithm, introduce the datasets used in our

study and describe our methodology in detail. The results are

presented along with discussion in Section IV, and Section V

concludes the paper and establishes the opportunity for future

work.

II. BACKGROU ND

A. Related Work

Apart from interviews and questionaires, telematics have

been one of the main data sources for detecting driving

behaviour in modern studies. Researchers have analysed telem-

atics data using machine learning techniques to understand and

classify driving behaviour with promising results. For instance,

Constantinescu et al. [15] use Hierarchical Clustering Analysis

(HCA) to identify six proﬁles among 25 drivers in the city of

Bucharest, based on four telematics driving features i.e. speed,

acceleration, braking and kinetic energy. Similarly, Ellison

et al. [16] propose a driver risky index framework based

on three telematics driving incidents (i.e. over speed, harsh

acceleration and harsh braking) collected from 106 drivers

in Sidney within 25 days. These studies employ different

variations of clustering algorithms to identify groups of driving

incidents stereotypes, however, they are limited with regards to

the number of samples analysed, which is relatively small, and

their experiments are performed in controlled environments,

which is unrepresentative of real driving behaviour.

In a seminal work by Figueredo et al. [4], the authors

tackle these shortcomings by analysing driving incidents data

captured for more than 20,000 HGV drivers in the UK. The

authors use a 2-stage classiﬁcation framework [17] and a

subjective proﬁle labelling algorithm to uncover eight HGV

driving stereotypes from driving incidents data of the year

2015. Agrawal et al. [5] build upon this work to put forward

a classiﬁcation system to classify HGV drivers in real time.

They use a decision tree classiﬁer to learn driving behaviours

of drivers from the years 2014 to 2016 and test its performance

on new drivers from the year 2017. Although they achieve high

classiﬁcation accuracy, their proposed model does not consider

the uncertainties within the driving incidents and stereotypes

injected by sensor readings and subjectivity, which can cause

information loss and unfair classiﬁcation systems.

B. Fuzzy Logic Systems and their applications to driving

behaviour

Fuzzy Logic was introduced by [18] to address imprecision

or uncertainties in input and output variables directly by deﬁn-

ing them using linguistic terms with their degrees of member-

ship between 0 and 1 (fuzzy sets). Fuzzy Logic Systems (FLS)

consist of three stages: 1) Fuzziﬁcation, 2) Fuzzy inference,

and 3) Defuzziﬁcation. In fuzziﬁcation, inputs (e.g. driving

incidents) and outputs (e.g. driving stereotypes of behaviour)

also known as antecedents and consequents respectively, are

represented by linguistic variables (e.g. ‘Low’, ‘Moderate’

and ‘High’ number of harsh braking incidents). The linguistic

variables have soft boundaries that allows them to better

Fig. 1: An example of fuzzy sets for a driving feature

capture the uncertainties in inputs and outputs. In Fig. 1, ‘35’

harsh braking incidents is considered ‘Moderate’ risk with a

high degree of membership and also considered ’High’ risk

with a low degree of membership.

Fuzzy rules are deﬁned to capture the relationships between

the input and output variables and are usually based on IF-

THEN statements, which are easy to understand. For example,

a fuzzy rule for driving behaviour can be:

IF the number of harsh braking incidents is ‘Low’

risk AND the number of over speeding incidents is ‘Low’

risk THEN the driver’s risk level is ‘Low’.

Lastly, the fuzzy inference systems combine the input fuzzy

sets using the fuzzy rules to produce output fuzzy sets, and

defuzziﬁcation maps the output fuzzy sets to a real or crisp

number (e.g. a driving score in this paper).

Recently, FLS are examined in transport research to capture

the uncertainties in driving features, and to develop fair

and explainable classiﬁcation systems [19]–[22]. Aljaafreh et

al. [19] employ FLS and two driving features (acceleration and

speed) to classify drivers into four predeﬁned driving stereo-

types (below normal, normal, aggressive, and very aggressive

proﬁles). In a different study, Imkamon et al. [20] use FLS

for detecting unsafe driving behaviour. They use three sensors:

an engine control unit (ECU) reader, an accelerometer and a

camera, to extract features, such as vehicle movement. The

features are then combined using a fuzzy inference system

to identify a driver’s risk level ranging from 1 to 3. In a

similar study by Boonmee et al. [21], the authors use harsh

acceleration, sudden braking, and turning quickly incidents

and a FLS for rating drivers on 1 to 10 scale (where 10 is

the most risky driver) .

The above studies on the use of FLS to describe driving

behaviours are limited as their input fuzzy sets and mem-

bership functions are deﬁned using small number of drivers

leading to fewer driving stereotypes. Thus, this limitation can

introduce bias and make their outcome unrepresentative of

more general driving patterns. In this work, we bridge these

gaps by capturing uncertainties in HGV driving incidents using

a FLS developed from four years of telematics data i.e. input

fuzzy sets will be deﬁned by driving stereotypes captured by

real world data. We will use the FLS to score drivers according

to the risk of their driving behaviours in the range from 0 to

100, where 0 is a low risk driver while 100 is high risk.

III. METHODOLOGY

In this section we present our novel data-driven fuzzy logic

approach for scoring HGV drivers based on driving incidents

captured using telematics. First, the fuzzy sets and membership

functions are obtained from the driving proﬁles uncovered

from the classiﬁcation framework. Subsequently, the fuzzy

rules are generated using the Wang-Mendel method. Lastly, the

Mamdani inference method are used to score drivers according

to the risk level of their driving behaviour. An overview of our

proposed method is shown in Fig. 2. Each stage is described

in detail below.

A. Dataset Description

The dataset utilised in this study is provided by our industry

partner Microlise [23]. Information produced by their telemat-

ics solutions are transmitted and collected from the HGVs in

real time. Data is captured by sensors connected to multiple

electronic control units using a Controller Area Network

(CAN) bus. The HGV drivers must complete a minimum of 10

journeys per quarter (i.e. minimum 40 journeys yearly) each

year on any road in the UK to be considered in our analysis.

The data was collected between the ﬁrst of January and the

thirty ﬁrst of December for the years 2014, 2015, 2016, and

2017. The dataset consists of four driving incidents: frequency

of Harsh Braking (HB) events, Over-Speed (OS) duration

in seconds, Excessive Throttle (ET) duration in seconds and

frequency of Over Revving (ORev) events of HGV drivers.

It is important to note that these incidents were chosen for

our study because they are the most relevant incidents present

in all of the HGVs, which are related to the risk of crashes

and vehicle costs. In total, 15,893 drivers in year 2014, 21,234

drivers in year 2015, 34,675 drivers in year 2016, and 35,432

drivers in year 2017 were collected using this criteria. The

drivers were grouped into three subgroups based on their daily

average mileage travelled as uncovered by [4] including short

subgroup drivers who cover an average daily mileage upto

136.70 miles, medium subgroup drivers who cover an average

daily mileage between 136.70 and 217.48 miles, and long

subgroup drivers who covers an average daily mileage more

than 217.48 miles. Table I presents the distribution of the

drivers for the four years within the three driving subgroups.

TABLE I: Distribution of drivers among Average Daily

Mileage groups for years 2014, 2015, 2016 and 2017

Average daily Distribution of Distribution of Distribution of Distribution of

mileage groups drivers in 2014 drivers in 2015 drivers in 2016 drivers in 2017

Short 3,327 5,076 16,281 10,745

Medium 6,419 8,392 10,232 14,833

Long 6,147 7,766 8,162 9.854

Total 15,893 21,234 34,675 35,432

B. Data Pre-processing

For unbiased results, we normalise the number of driving

incidents produced by each driver by dividing with the total

driving time (in seconds). To obtain stable driving stereotypes,

the drivers who were consistently present across all the years

and who did not change subgroups across these years are

considered for the analysis. For example, if a driver is present

in short average daily mileage subgroup in the year 2014,

that driver should also be in the short average daily mileage

subgroup for the years 2015, 2016 and 2017 respectively. As a

result, 2,462 drivers are considered across the four years with

569, 964 and 929 drivers distributed between short, medium

and long average daily mileage subgroups respectively.

C. Generating Driving Stereotypes for FLS

To generate Membership Functions (MFs) and fuzzy rules,

the ﬁrst step is to obtain the driving stereotypes. The driving

stereotypes are obtained using 2-stage classiﬁcation framework

followed by a proﬁle labelling algorithm (introduced in [4],

[5]) on the datasets from four years. The classiﬁcation frame-

work is a two step process: 1) Multiple clustering algorithms

are run to group the data. Using a consensus approach, most of

the data are assigned to one of the identiﬁed groups (proﬁles),

while some of the data remain unclustered due to lack of

consensus. 2) An ensemble of classiﬁcation algorithms is

trained on the clustered data in the previous step. The trained

model is then run on the unclustered data to assign them to one

of the groups uncovered during clustering. The classiﬁcation

framework uncovers eleven clusters. Subsequently, a proﬁle

labelling algorithm sub-divides each driving incident into three

categories or linguistic terms (fuzzy sets); ‘low’, ‘moderate’

and ‘high’. Fig. 4 (a), (b), (c) and (d) show the boxplot

distributions of the four driving incidents HB, OS, ET and

ORev respectively, for short mileage subgroup (due to space

constraints, we only show the distributions of short mileage

subgroup). It is important to note that the boxplot for ORev

has no distribution for ‘high’ as the proﬁle labelling algorithm

produced only two categories for ORevs of short mileage

drivers due to their relatively smaller values compared to the

other subgroups. The clusters obtained above represent the

eleven driving stereotypes, as shown in Table II. The ﬁrst

column represents the driving proﬁle number, columns two to

ﬁve represent the labels of the occurrence of incidents in each

proﬁle and the last column indicates the mileage subgroups

in which these proﬁles are present. A more elaborate descrip-

tion of the classiﬁcation framework and the proﬁle labelling

algorithm can be found in [4] and [5].

TABLE II: HGV driving stereotypes and their feature labels

uncovered across 2014, 2015, 2016 and 2017

Proﬁle Harsh Over-Speed Excessive Number of Driving

Number Braking duration Throttle Over Revs Subgroup

1 Low Low Low Moderate S,M,L

2 Low Low High Low M

3 Low Low High Moderate S

4 Moderate Moderate High Low L

5 Moderate Moderate High Moderate S,M,L

6 Moderate Moderate High High M

7 Moderate High Moderate Low S

8 Moderate High High Low L

9 High Moderate Moderate Low M

10 High High Moderate Low S,M,L

11 High High High Low M,L

Fig. 2: Our proposed Data-Driven Fuzzy Logic architecture to score HGV driving behaviour

Fig. 3: Driving proﬁles (Consequents) membership functions

D. Generating fuzzy sets and membership functions

For all subgroups, the quartiles of the boxplots for each driv-

ing incident are used to construct their respective membership

functions (antecedents MFs). For example, Fig. 5 shows the

membership functions for the four driving features of short

daily average mileage subgroup generated from their boxplot

distributions shown in Fig. 4. It is important to note that for

each feature, the three fuzzy sets represent the corresponding

low, moderate and high distributions respectively. The eleven

driving proﬁles i.e. the consequents of the FLS obtained

by the classiﬁcation framework, are represented in Fig. 3.

The consequent membership functions are equally distributed

between 0 and 100, which represents the driving risk level.

E. Fuzzy rule generation, inference and defuzziﬁcation

The Wang-Mendel method [13], one of the most commonly

used rule generation technique, is employed to generate the

rules of the FLS. In our dataset, the four different features

and output (proﬁle) pairs are denoted as:

(x1

HB , x1

OS , x1

ET , x1

ORev ;y1

P R),

(x2

HB , x2

OS , x2

ET , x2

ORev ;y2

P R),

(xN

HB , xN

OS , xN

ET , xN

ORev ;yN

P R),

(1)

where Nis the number of input-output data pairs and each

abbreviation denotes the corresponding feature (e.g. HB =

Harsh Braking incident).

By using the 2-stage classiﬁcation framework (see details in

Section III-C), the domain and three Membership Functions

(MFs) i.e. Low, Medium, High, are deﬁned for each feature in

the datasets. Then, the output domain is deﬁned by splitting

11 MFs as shown in Fig. 3 and each driver is assigned by a

proﬁle number between 1-11. After constructing the MFs and

consequents for all the input-output pairs, each pair is assigned

to the corresponding MFs so that the model rules are generated

along with their weights. By following the practice from

[13], the rule weights are utilised to implement rule reduction

procedure on the conﬂicting rules. Table III shows the rules for

the short mileage subgroup generated in a data-driven manner

(due to space constraints, we only show fuzzy rules of short

mileage subgroup). For a more elaborate description of the

Wang-Mendel rule generation method, please refer [13].

Next, we use the Mamdani inference technique [24], a

commonly used rule-based inference method, to combine

the antecedents fuzzy sets using the fuzzy rules and fuzzy

operators (min and max for T-norm and T-conorm operators) to

obtain output fuzzy sets. The output fuzzy sets are combined to

calculate a crisp value (driver’s score) using centroid defuzzi-

ﬁcation [25] (a common and fast defuzziﬁcation technique).

IV. RES ULT S AN D DISCUSSION

A. Driving stereotypes

The 2-stage classiﬁcation framework and proﬁle labelling

algorithm uncovers eleven HGV driving incident stereotypes

(Table II), which are used for generating the membership

functions (Fig. 3 and 5) to capture the uncertainties in the

driving incidents and stereotypes. Proﬁle 1 represents a ‘very

low’ risk level stereotype with low incidents, except for over

revving that has a moderate number of incidents. Proﬁles 2 and

3 can be considered ‘low’ risk level stereotypes because of low

harsh braking and over-speeding incidents. Proﬁles 4, 5 and 6

are similar except for their economic behaviour (over revving

incidents) and can be described as ‘moderate’ risk level

stereotypes with moderate harsh braking and over-speeding

incidents. Proﬁles 7 and 8 are ‘high’ risk level and speedy

Fig. 4: Boxplots for the driving features of short mileage drivers

Fig. 5: Driving incidents membership functions for short mileage drivers

proﬁles with high duration of over speeding incidents while

proﬁle 9 represents a ‘high’ risk level and aggressive proﬁle

due to high harsh braking and excess throttling incidents.

Proﬁle 10 can be observed to be a ‘very high’ risk level driving

stereotype with high harsh braking and over-speeding incidents

and proﬁle 11 can be described as ’extremely’ risky and the

most aggressive of all the proﬁles with high harsh braking,

over-speeding and excessive throttling. The output domain (0

to 100) is deﬁned by placing these 11 proﬁles according to

our subjective risk level interpretations from ‘very low’ to

‘extremely’ risky proﬁles.

B. Uncertainties in driving incidents

The MFs produced by the 11 proﬁles capture the uncertain-

ties within driving incidents caused by sensor readings and

multiple subjective interpretations (e.g., subjectivity produced

by the proﬁle labelling algorithm). Such uncertainty can be

captured and visualised in the over revving incident MF in

Fig. 6 with the fuzzy sets: Low, Moderate and High. The

uncertainty of whether the over revving value, 0.01, should

be considered ‘Low’, ‘Moderate’ or ‘High’ is represented by

the MFs as follows:

TABLE III: Fuzzy rules for short mileage drivers

Rule Harsh braking Over speeding Excessive Throttle Over Revs Proﬁle

No No

1 LOW AND (LOW OR MODERATE) AND (LOW OR MODERATE) AND (LOW OR MODERATE) 1

2 MODERATE AND LOW AND (LOW OR MODERATE) AND (LOW OR MODERATE) 1

3 MODERATE AND MODERATE AND (LOW OR MODERATE) AND MODERATE 1

4 HIGH AND LOW AND (LOW OR MODERATE) AND (LOW OR MODERATE) 1

5 HIGH AND MODERATE AND LOW AND MODERATE 1

6 LOW AND MODERATE AND HIGH AND LOW 3

7 (MODERATE OR HIGH) AND LOW AND HIGH AND MODERATE 3

8 LOW AND (LOW OR MODERATE) AND HIGH AND MODERATE 5

9 MODERATE AND MODERATE AND HIGH AND (LOW OR MODERATE) 5

10 (LOW OR MODERATE) AND LOW AND HIGH AND LOW 5

11 HIGH AND MODERATE AND HIGH AND LOW 5

12 LOW AND HIGH AND MODERATE AND LOW 7

13 LOW AND HIGH AND HIGH AND MODERATE 7

14 LOW AND HIGH AND LOW AND LOW 10

15 MODERATE AND MODERATE AND (LOW OR MODERATE) AND LOW 10

16 (MODERATE OR HIGH) AND HIGH AND (LOW OR MODERATE OR HIGH) AND (LOW OR MODERATE) 10

17 HIGH AND MODERATE AND (LOW OR MODERATE) AND LOW 10

18 HIGH AND MODERATE AND (MODERATE OR HIGH) AND MODERATE 10

Note: Proﬁles 2, 4, 6, 8, 9 and 11 are found in the other driving subgroups (i.e., Medium and Long) as shown in Table II.

Fig. 6: Over revving membership function produced by our

Fuzzy Logic approach

ORLow (0.01) = 0.75

ORM oderate(0.01) = 0.35

ORH igh(0.01) = 0

These fuzzy values are used during rule evaluation to

determine the driver’s risk level or score. For example, if we

consider Rule 7 of short mileage drivers in Table III:

IF HB is either ‘MODERATE’ OR ‘HIGH’ AND OS

is ‘Low’ AND ET is ‘HIGH’ AND ORev is ‘MODER-

ATE’ THEN the driver’s risk level is ‘LOW’

The degree of membership 0.35 represented by

ORModer ate(0.01) as well as those obtained from the

other driving incidents will be used to evaluate the ﬁring

strength of the rule and the rules will be used to obtain the

driving score.

C. Scoring HGV driving behaviour

The Mamdani inference system combines the antecedents

(driving incidents) fuzzy sets using the knowledge base

(rules) generated by the Wang-Mendel method to produce

rule strengths. The rule strengths are then applied to the

consequents to produce inferred output fuzzy sets. These

output fuzzy sets are combined using centroid defuzziﬁcation,

which considers the center of the area of the output fuzzy sets

to produce the driving risk score. The scores or risk levels

range from 0 to 100, where 0 represents low risk drivers and

100 is high risk drivers. Fig. 7 shows an example of output

fuzzy sets obtained from the FIS. The shaded regions in the

ﬁgure represent the ﬁred consequents i.e., for certain values

of the driving incidents, the rule strengths for the consequent

fuzzy sets ‘proﬁle 5’ and ‘proﬁle 10’ (denoted by the purple

and blue shades) are 0.8 and 0.05 respectively. This means

that the driver’s risky behaviour belongs to proﬁle 5 with

a degree of 0.8 and to proﬁle 10 with a degree of 0.05.

The defuzziﬁcation method computes the center of the area

between the shaded regions which represents the driver’s risk

level i.e., 39. Furthermore, the logical rule knowledge base

(IF-THEN rules) coupled with the linguistic fuzzy sets make

the proposed system explainable to end-users and stakeholders

in the HGV community. This framework can be applied to the

development of driver alert systems by providing feedback to

drivers about the risk level of their driving incidents when they

exceed predeﬁned risk scores.

V. CONCLUSION AND FUTURE WORK

This study captures the uncertainties within driving features

and stereotypes produced by sensor readings and varying

subjective interpretations using a data-driven Fuzzy Logic

System. Our proposed framework provides a risk level for

driving incidents within the range 0 to 100, where 0 is a low

risk driver and 100 is a high risk driver.

To generate the antecedent and consequent MFs, the afore-

mentioned 2-stage classiﬁcation framework is employed on

four years of HGV driving incidents data (number of harsh

braking events, over speeding duration, excessive throttling

duration and number of over revving events) in the UK i.e.

2014, 2015, 2016 and 2017. A total of eleven stereotypes

were uncovered. Subsequently, the Wang-Mendel method was

Fig. 7: A sample result of our FIS to determine the risk level

of a certain driving behaviour

employed to generate the fuzzy rules and the Mamdani infer-

ence system for scoring HGV drivers in the range 0 to 100.

Although the proposed framework requires further validation

in real time using experts, it can be incorporated into driver

assistance and alert systems for monitoring and providing

feedback to the drivers in real time when they exhibit risky

driving styles.

Lastly, some limitations and scope for future works are

discussed.

Limited number of features: This study is limited in

the number of driving features analysed (only four driving

features). In the future, we plan on including more driving

features such as harsh cornering, lane changing and close

following, which seem to be relevant for road safety.

Other facets of driver behaviour: This study only explores

telematics data. In future, we plan on capturing other facets

of driving behaviour such as driver distraction, workload,

situation awareness, stress, fatigue and attention, to provide

a more reliable holistic view of driving behaviour to assist

stakeholders in the decision making process.

ACK NOW LE DG EM EN T

The ﬁrst author is supported by the Horizon Centre for

Doctoral Training at the University of Nottingham (UKRI

Grant No. EP/L015463/1) and by Microlise.

REFERENCES

[1] Who. global status report on road safety: World health organization.

2018. Last Accessed: 2-3-2020.

[2] R. Kalsoom and Z. Halim. Clustering the driving features based on data

streams. In Multi Topic Conference (INMIC), 2013 16th International,

pages 89–94, Dec 2013.

[3] C. Saiprasert, S. Thajchayapong, T. Pholprasit, and C. Tanprasert. Driver

behaviour proﬁling using smartphone sensory data in a v2i environment.

In 2014 Int Conf on Connected Vehicles and Expo, pages 552–557, 2014.

[4] G. P. Figueredo, U. Agrawal, J. M. Mase, M. Mesgarpour, C. Wagner,

D. Soria, J. M. Garibaldi, P.O. Siebers, and R. I. John. Identifying heavy

goods vehicle driving styles in the united kingdom. IEEE Transactions

on Intelligent Transportation Systems (in press), tbc:tbc, 2018.

[5] Utkarsh Agrawal, Jimiama Mafeni Mase, Grazziela P Figueredo, Chris-

tian Wagner, Mohammad Mesgarpour, and Robert I John. Towards real-

time heavy goods vehicle driving behaviour classiﬁcation in the united

kingdom. In 2019 IEEE Intelligent Transportation Systems Conference

(ITSC), pages 2330–2336. IEEE, 2019.

[6] Jimiama M.M. Mafeni, Peter Chapman, Grazziela P. Figueredo, and

Mercedes T. Torres.

[7] Mohammed S Majdi, Sundaresh Ram, Jonathan T Gill, and Jeffrey J

Rodr´

ıguez. Drive-net: Convolutional network for driver distraction

detection. In 2018 IEEE Southwest Symposium on Image Analysis and

Interpretation (SSIAI), pages 1–4. IEEE, 2018.

[8] Jimiama M.M. Mafeni, Peter Chapman, Grazziela P. Figueredo, and

Mercedes T. Torres. A hybrid deep learning approach for driver

distraction detection. unpublished, 2020.

[9] Jeffrey S Hickman and Richard J Hanowski. Use of a video monitoring

approach to reduce at-risk driving behaviors in commercial vehicle

operations. Transportation research part F: trafﬁc psychology and

behaviour, 14(3):189–198, 2011.

[10] Jennifer L Bell, Matthew A Taylor, Guang-Xiang Chen, Rachel D

Kirk, and Erin R Leatherman. Evaluation of an in-vehicle monitoring

system (ivms) to reduce risky driving behaviors in commercial drivers:

Comparison of in-cab warning lights and supervisory coaching with

videos of driving behavior. Journal of safety research, 60:125–136,

2017.

[11] Hannah J Foy and Peter Chapman. Mental workload is reﬂected in driver

behaviour, physiology, eye movements and prefrontal cortex activation.

Applied ergonomics, 73:90–99, 2018.

[12] Jimiama M.M. Mafeni, Shazmin Majid, Mohammad Mesgarpour, Mer-

cedes T. Torres, Grazziela P. Figueredo, and Peter Chapman. Evaluating

the impact of heavy goods vehicle driver monitoring and coaching to

reduce risky behaviour. unpublished, 2020.

[13] L-X Wang and Jerry M Mendel. Generating fuzzy rules by learning from

examples. IEEE Trans. Syst., Man, Cybern., 22(6):1414–1427, 1992.

[14] Arshdeep Kaur and Amrit Kaur. Comparison of mamdani-type and

sugeno-type fuzzy inference systems for air conditioning system. In-

ternational journal of soft computing and engineering, 2(2):323–325,

2012.

[15] Zoran Constantinescu, Cristian Marinoiu, and Monica Vladoiu. Driving

style analysis using data mining techniques. Int J of Comp Communi-

cations & Control, 5(5):654–663, 2010.

[16] Adrian B Ellison, SP Greaves, and Rhonda Daniels. Proﬁling drivers’

risky behaviour towards all road users. In A safe system: expanding the

reach: Australasian College of Road Safety national conference, 2012.

[17] Utkarsh Agrawal, Daniele Soria, Christian Wagner, Jonathan Garibaldi,

Ian O Ellis, John MS Bartlett, David Cameron, Emad A Rakha, and

Andrew R Green. Combining clustering and classiﬁcation ensembles:

A novel pipeline to identify breast cancer proﬁles. Artiﬁcial Intelligence

in Medicine, 2019.

[18] Lotﬁ A Zadeh. Fuzzy sets. Information and control, 8(3):338–353,

1965.

[19] Ahmad Aljaafreh, Nabeel Alshabatat, and Munaf S Najim Al-Din.

Driving style recognition using fuzzy logic. In 2012 IEEE International

Conference on Vehicular Electronics and Safety (ICVES 2012), pages

460–463. IEEE, 2012.

[20] T Imkamon, P Saensom, P Tangamchit, and P Pongpaibool. Detection of

hazardous driving behavior using fuzzy logic. In 2008 5th International

Conference on Electrical Engineering/Electronics, Computer, Telecom-

munications and Information Technology, volume 2, pages 657–660.

IEEE, 2008.

[21] Sakchai Boonmee and Poj Tangamchit. Portable reckless driving

detection system. In 2009 6th International Conference on Electrical

Engineering/Electronics, Computer, Telecommunications and Informa-

tion Technology, volume 1, pages 412–415. IEEE, 2009.

[22] Rui Ara´

ujo, ˆ

Angela Igreja, Ricardo De Castro, and Rui Esteves Araujo.

Driving coach: A smartphone application to evaluate driving efﬁcient

patterns. In 2012 IEEE Intelligent Vehicles Symposium, pages 1005–

1010. IEEE, 2012.

[23] Microlise. Microlise (telematics, transport and ﬂeet management solu-

tions). Available at: http://www.microlise.com/, Last accessed 30th April

2018.

[24] Ebrahim H Mamdani and Sedrak Assilian. An experiment in linguistic

synthesis with a fuzzy logic controller. International journal of man-

machine studies, 7(1):1–13, 1975.

[25] Antonio Ginart, Gustavo Sanchez, I Links, and G Back. Fast defuzzi-

ﬁcation method based on centroid estimation. Applied Modelling and

Simulation, 58(1):20–25, 2002.

Contextually Intelligent Decisions: A Co-designed Approach for Commercial Driving Assessment

Preprint

Full-text available

Nov 2022

Commercial driving is a complex multifaceted task influenced by drivers' personal traits and external contextual factors, such as weather, traffic, and road conditions. Current intelligent commercial driving assessment systems only focus on the manner by which drivers operate vehicle controls when analysing driving performance and the impact of driver behaviour on road safety, potentially producing incomplete and unfair assessments. In this paper, we introduce a stakeholder co-designed approach to complement current driver-assessment systems and provide more context-aware, comprehensive and we hope--fairer assessments of commercial driving. The approach engages key stakeholders in the commercial driving sector to capture the impact of personal traits and external contextual factors on driving performance and provides a systematic approach to embed this knowledge into the assessment of commercial drivers. We demonstrate the effectiveness and utility of the approach by illustrating its end-to-end deployment in the context of heavy goods vehicle driving assessment.

Deep Learning Approach Based on Residual Neural Network and SVM Classifier for Driver’s Distraction Detection

Article

Full-text available

Jun 2022

In the last decade, distraction detection of a driver gained a lot of significance due to increases in the number of accidents. Many solutions, such as feature based, statistical, holistic, etc., have been proposed to solve this problem. With the advent of high processing power at cheaper costs, deep learning-based driver distraction detection techniques have shown promising results. The study proposes ReSVM, an approach combining deep features of ResNet-50 with the SVM classifier, for distraction detection of a driver. ReSVM is compared with six state-of-the-art approaches on four datasets, namely: State Farm Distracted Driver Detection, Boston University, DrivFace, and FT-UMT. Experiments demonstrate that ReSVM outperforms the existing approaches and achieves a classification accuracy as high as 95.5%. The study also compares ReSVM with its variants on the aforementioned datasets.

Explainable Artificial Intelligence for Enhancing Transparency in Decision Support Systems

Thesis

Jan 2024

Mir Riyanul Islam

Artificial Intelligence (AI) is recognized as advanced technology that assists in decision-making processes with high accuracy and precision. However, many AI models are generally appraised as black boxes due to their reliance on complex inference mechanisms. The intricacies of how and why these AI models reach a decision are often not comprehensible to human users, resulting in concerns about the acceptability of their decisions. Previous studies have shown that the lack of associated explanation in a human-understandable form makes the decisions unacceptable to end-users. Here, the research domain of Explainable AI (XAI) provides a wide range of methods with the common theme of investigating how AI models reach to a decision or explain it. These explanation methods aim to enhance transparency in Decision Support Systems (DSS), particularly crucial in safety-critical domains like Road Safety (RS) and Air Traffic Flow Management (ATFM). Despite ongoing developments, DSSs are still in the evolving phase for safety-critical applications. Improved transparency, facilitated by XAI, emerges as a key enabler for making these systems operationally viable in real-world applications, addressing acceptability and trust issues. Besides, certification authorities are less likely to approve the systems for general use following the current mandate of Right to Explanation from the European Commission and similar directives from organisations across the world. This urge to permeate the prevailing systems with explanations paves the way for research studies on XAI concentric to DSSs. To this end, this thesis work primarily developed explainable models for the application domains of RS and ATFM. Particularly, explainable models are developed for assessing drivers’ in-vehicle mental workload and driving behaviour through classification and regression tasks. In addition, a novel method is proposed for generating a hybrid feature set from vehicular and electroencephalography (EEG) signals using mutual information (MI). The use of this feature set is successfully demonstrated to reduce the efforts required for complex computations of EEG feature extraction. The concept of MI was further utilized in generating human-understandable explanations of mental workload classification. For the domain of ATFM, an explainable model for flight take-off time delay prediction from historical flight data is developed and presented in this thesis. The gained insights through the development and evaluation of the explainable applications for the two domains underscore the need for further research on the advancement of XAI methods. In this doctoral research, the explainable applications for the DSSs are developed with the additive feature attribution (AFA) methods, a class of XAI methods that are popular in current XAI research. Nevertheless, there are several sources from the literature that assert that feature attribution methods often yield inconsistent results that need plausible evaluation. However, the existing body of literature on evaluation techniques is still immature offering numerous suggested approaches without a standardized consensus on their optimal application in various scenarios. To address this issue, comprehensive evaluation criteria are also developed for AFA methods as the literature on XAI suggests. The proposed evaluation process considers the underlying characteristics of the data and utilizes the additive form of Case-based Reasoning, namely AddCBR. The AddCBR is proposed in this thesis and is demonstrated to complement the evaluation process as the baseline to compare the feature attributions produced by the AFA methods. Apart from generating an explanation with feature attribution, this thesis work also proposes the iXGB - interpretable XGBoost. iXGB generates decision rules and counterfactuals to support the output of an XGBoost model thus improving its interpretability. From the functional evaluation, iXGB demonstrates the potential to be used for interpreting arbitrary tree-ensemble methods. In essence, this doctoral thesis initially contributes to the development of ideally evaluated explainable models tailored for two distinct safety-critical domains. The aim is to augment transparency within the corresponding DSSs. Additionally, the thesis introduces novel methods for generating more comprehensible explanations in different forms, surpassing existing approaches. It also showcases a robust evaluation approach for XAI methods.

Automatic Driver Distraction Detection using Deep Convolutional Neural Networks

Article

Full-text available

Apr 2022

Recently, the number of road accidents has been increased worldwide due to the distraction of the drivers. This rapid road crush often leads to injuries, loss of properties, even deaths of the people. Therefore, it is essential to monitor and analyze the driver's behavior during the driving time to detect the distraction and mitigate the number of road accident. To detect various kinds of behavior like- using cell phone, talking to others, eating, sleeping or lack of concentration during driving; machine learning/deep learning can play significant role. However, this process may need high computational capacity to train the model by huge number of training dataset. In this paper, we made an effort to develop CNN based method to detect distracted driver and identify the cause of distractions like talking, sleeping or eating by means of face and hand localization. Four architectures namely CNN, VGG-16, ResNet50 and MobileNetV2 have been adopted for transfer learning. To verify the effectiveness, the proposed model is trained with thousands of images from a publicly available dataset containing ten different postures or conditions of a distracted driver and analyzed the results using various performance metrics. The performance results showed that the pre-trained MobileNetV2 model has the best classification efficiency.

Towards a More Reliable Interpretation of Machine Learning Outputs for Safety-Critical Systems Using Feature Importance Fusion

Article

Full-text available

Dec 2021

When machine learning supports decision-making in safety-critical systems, it is important to verify and understand the reasons why a particular output is produced. Although feature importance calculation approaches assist in interpretation, there is a lack of consensus regarding how features’ importance is quantified, which makes the explanations offered for the outcomes mostly unreliable. A possible solution to address the lack of agreement is to combine the results from multiple feature importance quantifiers to reduce the variance in estimates and to improve the quality of explanations. Our hypothesis is that this leads to more robust and trustworthy explanations of the contribution of each feature to machine learning predictions. To test this hypothesis, we propose an extensible model-agnostic framework divided in four main parts: (i) traditional data pre-processing and preparation for predictive machine learning models, (ii) predictive machine learning, (iii) feature importance quantification, and (iv) feature importance decision fusion using an ensemble strategy. Our approach is tested on synthetic data, where the ground truth is known. We compare different fusion approaches and their results for both training and test sets. We also investigate how different characteristics within the datasets affect the quality of the feature importance ensembles studied. The results show that, overall, our feature importance ensemble framework produces 15% less feature importance errors compared with existing methods. Additionally, the results reveal that different levels of noise in the datasets do not affect the feature importance ensembles’ ability to accurately quantify feature importance, whereas the feature importance quantification error increases with the number of features and number of orthogonal informative features. We also discuss the implications of our findings on the quality of explanations provided to safety-critical systems.

Driving examiners' views on data-driven assessment of test candidates: An interview study

Article

Full-text available

Oct 2021
TRANSPORT RES F-TRAF

Vehicles are increasingly equipped with sensors that capture the state of the driver, the vehicle, and the environment. These developments are relevant to formal driver testing, but little is known about the extent to which driving examiners would support the use of sensor data in their job. This semi-structured interview study examined the opinions of 37 driving examiners about data-driven assessment of test candidates. The results showed that the examiners were supportive of using data to explain their pass/fail verdict to the candidate. According to the examiners, data in an easily accessible form such as graphs of eye movements, headway, speed, or braking behavior, and color-coded scores, supplemented with camera images, would allow them to eliminate doubt or help them convince disagreeing test-takers. The examiners were skeptical about higher levels of decision support, noting that forming an overall picture of the candidate's abilities requires integrating multiple context-dependent sources of information. The interviews yielded other possible applications of data collection and sharing, such as selecting optimal routes, improving standardization, and training and pre-selecting candidates before they are allowed to take the driving test. Finally, the interviews focused on an increasingly viable form of data collection: simulator-based driver testing. This yielded a divided picture, with about half of the examiners being positive and half negative about using simulators in driver testing. In conclusion, this study has provided important insights regarding the use of data as an explanation aid for examiners.

Driving Examiners' Views on Data-Driven Assessment of Test Candidates: An Interview Study

Preprint

Full-text available

Aug 2021

Investigating Uncertainty in Behavioral Modeling for C-ITS Services

Conference Paper

Sep 2023

Driving Risk Assessment Using Intervals and Weighted Fuzzy Rules

Conference Paper

Aug 2023

Feature Importance in Machine Learning Models: A Fuzzy Information Fusion Approach

Article

Full-text available

Sep 2022
NEUROCOMPUTING

With the widespread use of machine learning to support decision-making, it is increasingly important to verify and understand the reasons why a particular output is produced. Although post-training feature importance approaches assist this interpretation, there is an overall lack of consensus regarding how feature importance should be quantified, making explanations of model predictions unreliable. In addition, many of these explanations depend on the specific machine learning approach employed and on the subset of data used when calculating feature importance. A possible solution to improve the reliability of explanations is to combine results from multiple feature importance quantifiers from different machine learning approaches coupled with re-sampling. Current state-of-the-art ensemble feature importance fusion uses crisp techniques to fuse results from different approaches. There is, however, significant loss of information as these approaches are not context-aware and reduce several quantifiers to a single crisp output. More importantly, their representation of “importance” as coefficients may be difficult to comprehend by end-users and decision makers. Here we show how the use of fuzzy data fusion methods can overcome some of the important limitations of crisp fusion methods by making the importance of features easily understandable.

A Hybrid Deep Learning Approach for Driver Distraction Detection

Conference Paper

Full-text available

Sep 2020

The World Health Organisation reports distracted driving actions as the main cause of road traffic accidents. Current studies to detect distraction postures focus on analysing spatial features of images using Convolutional Neural Networks (CNN). However, approaches addressing both spectral and spatial features of images for driving distraction are scarce. Our hypothesis is that deep learning approaches can further be exploited to consider spatial and spectral features, so that the spatial features capture the spatial information within the image and the spectral features capture the spectral correlations among the image channels. This paper introduces a novel driver distraction posture detection method using CNNs and stacked Bidirectional Long Short Term Memory (BiLSTM) Networks to capture the spectral-spatio features of the images. The proposed methodology consists of two stages: first, we automatically learn the spatial posture features using pre-trained CNNs. Subsequently, we utilise BiLSTMs architecture to extract the spectral features amongst the stacked feature maps from the pre-trained CNNs. Our proposed approach is evaluated on the American University in Cairo (AUC) Distracted Driver Dataset, the most comprehensive and detailed dataset on driver distraction postures to date. Results show that our approach beats state-of-the-art CNN models with an average classification accuracy of 92.7%.

Evaluating the Impact of Heavy Goods Vehicle Driver Monitoring and Coaching to Reduce Risky Behaviour

Article

Full-text available

Aug 2020
ACCIDENT ANAL PREV

Determining the impact of driver-monitoring technologies to improve risky driving behaviours allows stakeholders to understand which aspects of on-board sensors and feedback need enhancement to promote road safety and education. This study investigates the influence of camera monitoring on Heavy Goods Vehicle (HGV) drivers' risky behaviours. We also assess whether monitoring affects individual driving events further when coupled with safe driving practices coaching. We evaluate the outcome of those practices on three telematics incidents heavily reliant on driving errors and violations, i.e., the number of vehicle harsh braking, harsh cornering and over speeding incidents. The objective is to understand how frequently individual incidents caused by risky driving behaviour occur (a) without camera monitoring and without any coaching; (b) after camera installation; and (c) after camera installation and coaching. We investigate two commercial HGV companies (Company 1 and Company 2) with 263 and 269 vehicles, respectively, over a 16 months period, from which the first 8 months contain data collected before the installation of cameras (baseline) and the rest of the dataset contains incident counts after the installation of cameras (intervention). Company 1 provides coaching during the intervention phase while Company 2 does not offer coaching. Our analysis considers the baseline and the intervention phases during the same seasons to eliminate any possible bias due to the influence of weather on driving behaviour. Results show an overall significant reduction in the mean frequency of harsh braking incidents from baseline to intervention by 16.82% in Company 1 and 4.62% in Company 2, and a significant reduction in the mean frequency of over speeding incidents from baseline to intervention by 34.29% in Company 1 and 28.13% in Company 2. Furthermore, the effect of coaching has a significant difference in reducing the frequency of harsh braking (p = 0.011) and harsh cornering (p < 0.001) compared to just camera monitoring. These results suggest that coaching interventions are more effective in reducing driving errors while monitoring reduces both driving errors and violations.

Towards Real-Time Heavy Goods Vehicle Driving Behaviour Classification in the United Kingdom

Conference Paper

Full-text available

Jul 2019

Determining the driving styles and the factors causing incidents in real time could assist stakeholders to promote actions and develop feedback systems to reduce risks, costs and to increase safety in roads. This paper presents a classification system for Heavy Goods Vehicles (HGVs) drivers, using a core set of driving pattern stereotypes which were uncovered from driving incidents across three years i.e. 2014, 2015 and 2016. To achieve that, the driving stereotypes are established by employing a 2-stage ensemble classification framework followed by a profile labelling algorithm to define the set of driving stereotypes. Very similar stereotypes are later merged to form the core driving stereotypes for UK HGV drivers. Upon establishing core driving stereotypes across these three years, a decision tree classifier learns the classification rules to identify the driving stereotypes for the HGV drivers in a new dataset. High accuracy is achieved, indicating that the core driving patterns uncovered in this work could potentially be employed to identify UK HGV driving patterns in real-time.

Combining Clustering and Classification Ensembles: A Novel Pipeline to Identify Breast Cancer Profiles

Article

Full-text available

May 2019
ARTIF INTELL MED

Breast Cancer is one of the most common causes of cancer death in women, representing a very complex disease with varied molecular alterations. To assist breast cancer prognosis, the classification of patients into biological groups is of great significance for treatment strategies. Recent studies have used an ensemble of multiple clustering algorithms to elucidate the most characteristic biological groups of breast cancer. However, the combination of various clustering methods resulted in a number of patients remaining unclustered. Therefore, a framework still needs to be developed which can assign as many unclustered (i.e. biologically diverse) patients to one of the identified groups in order to improve classification. Therefore, in this paper we develop a novel classification framework which introduces a new ensemble classification stage after the ensemble clustering stage to target the unclustered patients. Thus, a step-by-step pipeline is introduced which couples ensemble clustering with ensemble classification for the identification of core groups, data distribution in them and improvement in final classification results by targeting the unclustered data. The proposed pipeline is employed on a novel real world breast cancer dataset and subsequently its robustness and stability are examined by testing it on standard datasets. The results show that by using the presented framework, an improved classification is obtained. Finally, the results have been verified using statistical tests, visualisation techniques, cluster quality assessment and interpretation from clinical experts.

Identifying Heavy Good Vehicle Driving Styles in the United Kingdom

Article

Full-text available

Oct 2018

Although driving behaviour has been largely studied amongst private motor vehicles drivers, the literature addressing heavy goods vehicle (HGV) drivers is scarce. Identifying the existing groups of driving stereotypes and their proportions enables researchers, companies and policy makers to establish group-specific strategies to improve safety and economy. In addition, insights into driving styles can assist predicting drivers' reactions and therefore enable the modelling of interactions between vehicles and the possible obstacles encountered on a journey. Consequently, there are also contributions to the research and development of autonomous vehicles and smart roads. In this study our interest lies in investigating driving behaviour within the HGV community in the United Kingdom (UK). We conduct the analysis of a telematics dataset containing incident information on 21,193 HGV drivers across the UK. We are interested in answering two research questions: (i) What groups of behaviour are we able to uncover? (ii) How do these groups complement current findings in the literature? To answer these questions we apply a two-stage data analysis methodology involving consensus clustering and ensemble classification to the dataset. Through the analysis, eight patterns of behaviour are uncovered. It is also observed that although our findings have similarities to those from previous work on driving behaviour, further knowledge is obtained, such as extra patterns and driving traits arising from vehicle and road characteristics.

Drive-Net: Convolutional Network for Driver Distraction Detection

Conference Paper

Full-text available

Apr 2018

To help prevent motor vehicle accidents, there has been significant interest in finding an automated method to recognize signs of driver distraction, such as talking to passengers, fixing hair and makeup, eating and drinking, and using a mobile phone. In this paper, we present an automated supervised learning method called Drive-Net for driver distraction detection. Drive-Net uses a combination of a convolutional neural network (CNN) and a random decision forest for classifying images of a driver. We compare the performance of our proposed Drive-Net to two other popular machine-learning approaches: a recurrent neural network (RNN), and a multi-layer perceptron (MLP). We test the methods on a publicly available database of images acquired under a controlled environment containing about 22425 images manually annotated by an expert. Results show that Drive-Net achieves a detection accuracy of 95%, which is 2% more than the best results obtained on the same database using other methods.

Evaluation of an In-Vehicle Monitoring System (IVMS) to Reduce Risky Driving Behaviors in Commercial Drivers: Comparison of In-Cab Warning Lights and Supervisory Coaching with Videos of Driving Behavior

Article

Full-text available

Dec 2016

Problem: Roadway incidents are the leading cause of work-related death in the United States. Methods: The objective of this research was to evaluate whether two types of feedback from a commercially available in-vehicle monitoring system (IVMS) would reduce the incidence of risky driving behaviors in drivers from two companies. IVMS were installed in 315 vehicles representing the industries of local truck transportation and oil and gas support operations, and data were collected over an approximate two-year period in intervention and control groups. In one period, intervention group drivers were given feedback from in-cab warning lights from an IVMS that indicated occurrence of harsh vehicle maneuvers. In another period, intervention group drivers viewed video recordings of their risky driving behaviors with supervisors, and were coached by supervisors on safe driving practices. Results: Risky driving behaviors declined significantly more during the period with coaching plus instant feedback with lights in comparison to the period with lights-only feedback (ORadj=0.61 95% CI 0.43-0.86; Holm-adjusted p=0.035) and the control group (ORadj=0.52 95% CI 0.33-0.82; Holm-adjusted p=0.032). Lights-only feedback was not found to be significantly different than the control group's decline from baseline (ORadj=0.86 95% CI 0.51-1.43; Holm-adjusted p>0.05). Conclusions: The largest decline in the rate of risky driving behaviors occurred when feedback included both supervisory coaching and lights. Practical applications: Supervisory coaching is an effective form of feedback to improve driving habits in the workplace. The potential advantages and limitations of this IVMS-based intervention program are discussed.

Driver behaviour profiling using smartphone sensory data in a V2I environment

Conference Paper

Full-text available

Nov 2014

Road traffic accidents prevention and mitigation are important issues that appear on top of the priority list of many countries around the world today. Many measures and approaches have been put in place in terms of policy level as well as technical level. Driver behaviour is one of many key factors that should be seriously considered to improve road safety. This paper proposes a method for driver behaviour profiling using sensory data on smartphones in a vehicle-to-infrastructure environment. Based on driving behaviours with the most risk to causing accidents, the profiling algorithm takes into account sudden driving events which occur during a journey to categorise drivers into different profiles according to their safety levels. The profiling algorithm offers the flexibility to adjust the parameters weightings in order to put an emphasis on specific driving events for different scenarios and applications. The impact on vehicle-to-infrastructure is that the stored driving profiles can be used to generate a norm for a given road section. Approaching vehicles deviating from the norm can be notified in real-time. Moreover, localised dangerous driving events can be clustered together to form a potential blackspot which can be deployed as an advanced warning for approaching vehicles as a location based service. As a result, the risk of road traffic accidents can be reduced. Real-world driving data was collected over two major routes in Thailand with four distinct profiles and five major factors to road accidents.

Mental workload is reflected in driver behaviour, physiology, eye movements and prefrontal cortex activation

Article

Nov 2018

Mental workload is an important factor during driving, as both high and low levels may result in driver error. This research examined the mental workload of drivers caused by changes in road environment and how such changes impact upon behaviour, physiological responses, eye movements and brain activity. The experiment used functional near infrared spectroscopy to record prefrontal cortex activation associated with changes in mental workload during simulated driving. Increases in subjective ratings of mental workload caused by changes in road type were accompanied by increases in skin conductance, acceleration signatures and horizontal spread of search. Such changes were also associated with increases in the concentration of oxygenated haemoglobin in the prefrontal cortex. Mental workload fluctuates during driving. Such changes can be identified using a range of measures which could be used to inform the development of in-vehicle devices and partially autonomous systems.

Driver distraction detection using single convolutional neural network

Conference Paper

Oct 2017

Capturing Uncertainty in Heavy Goods Vehicle Driving Behaviour

Abstract and Figures

Recommended publications

Evaluating the Impact of Heavy Goods Vehicle Driver Monitoring and Coaching to Reduce Risky Behaviou...

Contextual Intelligent Decisions: Expert Moderation of Machine Outputs for Fair Assessment of Commer...

Benchmarking Deep Learning Models for Driver Distraction Detection

Towards Real-Time Heavy Goods Vehicle Driving Behaviour Classification in the United Kingdom