ArticlePDF Available

Survey on Big Data Techniques in Intelligent Transportation System (ITS)

Authors:

Abstract and Figures

Big Data is an evolving exemplar moreover, particularly within in the transportation system, has now become a powerful enticement of global interest. On the basis, Big Data is often seen intact undertaking whereas transit system fore efficiently handle utterly the information needed by aforementioned part to provide secure, unstained including reliable modes of transit, and to create custom the recipient's transport experience. By having access to centralized services, a smart city enhances the effectiveness of its inhabitants. Intelligent Transportation Systems (ITS) play an essential role in the transformation of a cosmopolitan environment into a digital city. With the last two decennary, numerous ITS technologies have been deployed, such as city-wide traffic regulation control, smart parking, general transport information assistance (transit vehicle, rail, compact, air, etc.), shipping, legitimate traffic, measuring of highway speed restrictions, etc. Most subsequent Data Analytics insight is needed for transport and movability industry, ITS implementations, threshold instances, areas and use cases, including the routing, planning, ability to track of infrastructure, platform architecture, and more, are examined in this article. Finally, the paper tackles some open problems in ITS with the use of big data analytics.
Content may be subject to copyright.
Survey on Big Data Techniques in Intelligent Transportation System (ITS)
Anjaneyulu Mohandu, Mohan Kubendiran
School of Computer Science and Engineering (SCOPE), Vellore Institute of Technology (VIT), Vellore, India
article info
Article history:
Received 7 March 2021
Accepted 17 March 2021
Available online xxxx
Keywords:
Big data analytics
Smart transport
Intelligent transport system
abstract
Big Data is an evolving exemplar moreover, particularly within in the transportation system, has now
become a powerful enticement of global interest. On the basis, Big Data is often seen intact undertaking
whereas transit system fore efficiently handle utterly the information needed by aforementioned part to
provide secure, unstained including reliable modes of transit, and to create custom the recipient’s trans-
port experience. By having access to centralized services, a smart city enhances the effectiveness of its
inhabitants. Intelligent Transportation Systems (ITS) play an essential role in the transformation of a cos-
mopolitan environment into a digital city. With the last two decennary, numerous ITS technologies have
been deployed, such as city-wide traffic regulation control, smart parking, general transport information
assistance (transit vehicle, rail, compact, air, etc.), shipping, legitimate traffic, measuring of highway
speed restrictions, etc. Most subsequent Data Analytics insight is needed for transport and movability
industry, ITS implementations, threshold instances, areas and use cases, including the routing, planning,
ability to track of infrastructure, platform architecture, and more, are examined in this article. Finally, the
paper tackles some open problems in ITS with the use of big data analytics.
Ó2021 Elsevier Ltd. All rights reserved.
Selection and peer-review under responsibility of the scientific committee of the 12th National Confer-
ence on Recent Advancements in Biomedical Engineering.
1. Introduction
In both academia and industry, Data analytics is now a debating
point. It contains massive data out-of various sources. Many of the
Big Data mechanisms include text mining, data science, robotics,
socioeconomic networking, knowledge discovery and much more.
[1]. For economies and characteristics of life around the world, suc-
cessful expansion of cities and automobiles is always important.
Inefficiencies are costing money, rising emissions and taking time
out of the lives of people. The problem is that transport infrastruc-
ture supplies are rising shorter unlike trend. Automobiles has been
designed quicker than tracks. Urban areas are growing smarter like
a roadway, but it seems the automobile industry is shifting to data
analytics to search alternative ways to existing capital, reduce con-
gestion and enhance the experience of travel. Since the start of the
1970s, intelligent transportation systems (ITS) have been built.
That’s the long term future including its transportation network.
Advanced technologies, including automatic innovations, wireless
data tools and intelligent control techniques, are integrated even
in transportation networks upon the ITS [3].
Objective of ITS is to provide good infrastructure to road users
and riders in transport systems. Information collected diverse
sources in the ITS, for instance cash plus card, GPS, actuators, mul-
timedia locater, internet community, etc. using detailed efficient
content analysis with apparently undisciplined records, the infor-
mation created in ITS expanding Trillion byte to Petabyte with
expansion of ITS. Conventional database frameworks are inade-
quate, with this volume of data, and do not fulfil the requirements
for data analytics. This is because the exponential growth of the
volume and sophistication of data is not foreseen. Big Data analyt-
ics offers a modern technological approach for ITS. ITS may benefit
from the analytics of Big Data. Large data systems including HDFS
and Spark’s, which are being widely used in academic and com-
mercial, are capable of processing large quantities of data. Top
social media sites like Facebook, Twitter, Wei bo, and We Chat deli-
ver citizens with pervasive opportunities through exchange
thoughts, emotions and knowledge officially or in learning associ-
ations, producing enormous amounts respecting concurrent gen-
eral waving [5].
Social network collects the mortal portability data through
smartphones, and even evoke vast number with social signals.
https://doi.org/10.1016/j.matpr.2021.03.479
2214-7853/Ó2021 Elsevier Ltd. All rights reserved.
Selection and peer-review under responsibility of the scientific committee of the 12th National Conference on Recent Advancements in Biomedical Engineering.
Corresponding author.
E-mail addresses: anjaneyulu.m@vit.ac.in (A. Mohandu), mohan.k@vit.ac.in (M.
Kubendiran).
Materials Today: Proceedings xxx (xxxx) xxx
Contents lists available at ScienceDirect
Materials Today: Proceedings
journal homepage: www.elsevier.com/locate/matpr
Please cite this article as: A. Mohandu and M. Kubendiran, Survey on Big Data Techniques in Intelligent Transportation System (ITS), Materials Today: Pro-
ceedings, https://doi.org/10.1016/j.matpr.2021.03.479
Compact and flexible equipment (e.g. smart phones) can access
information [6]. During this previous assertion, we will propose
that the new field of social transportation should concentrate pri-
marily on five areas: 1) big data and social signal traffic or trans-
portation analytics using pattern discovery, data science and
speech recognition tactics; 2) Publicizing frameworks upon social
media, Personalization, the Internet of Things (IoT) or perhaps
(IoT) based transportation mechanisms. 3) New location-based ser-
vices (LBS), like automation of transit expertise, in particular
decision-based services (DBS) or task-based services (TBS) compre-
hensive information needed through timely manner for transit
actions or objectives, but instead communication or intelligence
based services (IBS) or knowledge-based services (KBS) which sug-
gest operators or activist groups that may obtain information. 4)
Web-based mobility regulation and strategic agent technology
[8], other than intelligent machines, information automation or
app advisers besides monitoring system, preventative mainte-
nance, health and energy governance, should seek to develop dif-
ferent IoT devices which really accumulate community road
connect individuals with congestion and automobiles through
legitimate time; and 5) actual solutions or even reviews towards
further growth as well as analysis. Incorporating commute evalua-
tions, each linear motion model and perhaps cognitive science,
researchers have modelled various aspects of transport. It is impor-
tant for advanced analytics structures that provide a distributed
design that mould simple toward build along with address when-
ever a problem occurs.
Advanced Analytics be comic method about analyzing vast
quantities of data containing aforetime information categories triv-
ial unknown trends, hidden equate, retail fashion, consumer trends
or other business intelligence evidence; [23]. Classification of ana-
lytics, namely, descriptive analytics, predictive analytics, and per-
spective analytics. Descriptive analytics that describe ‘‘what has
happened or what is going on” help to discovery modern event
and threat for business. Predictive analytics that specify ‘‘What’s
going to happen and why”, facilitated applying multiple techniques
[24]. Prescriptive analytics, that specify ‘‘what should I do and why
should I do it,” refers to the systematic of Visualization, compe-
tence as well as analyzing through examine different preference
and implement decision makers with proposal. To predict
exactly future circumstances and states, like text/web/data mining
[25].
The volume and speed by transport and movability knowledge
endure produced today has far exceeded the scales at which at the
beginning of this century they used to be collected, processed and
analyzed. The combination of modern digitalization ideologies
including Internet of Things (IoT), the rise of digital Communities’,
the sharp decline in data transfer expenses of sensor industry, the
economical also popular use of sensors or personal computers,
each issues raised in digital networks, increased drastically
enhanced the intelligence of humans facing gain further compre-
hensive understanding of travel. A transport network persist sev-
eral platform that enables individuals with different aspects
include automobile, bus, train, walking, bicycle, etc. to travel from
an origin to a destination. Despite the accessibility and connectiv-
ity advantages of transport networks, as the usually-increasing rise
in community but rather volume of traffic, cities have always been
dealing with mobility concerns (e.g., traffic congestion and air pol-
lution). Brilliant transportation, as a savvy and huge segment of the
shrewd civic, carry emotional auspicious objective to ease concern
and enhance urban communities live ability, usefulness, and main-
tainability by creating astute and novel vehicle models [12].A
definitive viability of keen transportation models is profoundly
dependent upon the nature of sent information and qualities of
the insightful methods into which the information are taken care
of.
2. Background
Smart transport’s comprehensive development draws analyzer
often in context of an insightful architecture of Intelligent Trans-
portation System. A variety of benefits can be taken forward by
the regular smart transport system. Investigator move currently
engaged to define the overall result with intelligent transit in order
to attain meaningful computing [13]. Similarly, in the current liter-
ature, a range of suggestions have been presented that seek rigor-
ous experimentation and simulations to overcome the issues based
on the test bed. Whenever the concerns regarding commuter traffic
being examined, therefore concentrate on traffic density, remains
factors such as traffic speed, road etc. are also considered to affect
traffic. Freight performs an identical decisive aspect in every soci-
ety earnings including every resident growth. This article explains
the technique of reducing RFID congestion problems instead of the
conventional adhoc gateway method. It is also clean and cost-
effective. The aim of aforementioned procedure intend to manage
and track roadblock in the event of a necessity.
Smart transportation based on IoT adoption certain data analyt-
ics frameworks do important whereas information refine, such as
MapReduce, Cassandra, NoSQL, etc. For information analysis,
Hadoop uses the Map Reduce mechanism [14]. MapReduce the
job in2 distinct phases: (1) mapping. (2) reduce. The mapping task
convert integrate information gathered in the mapping in a modi-
fied ways used to reduce the process. MapReduce produces several
data sets that are responsible for a comprehensive selection of rou-
tine activity. The information is separated by MapReduce within
autonomous are handled simultaneously. The framework pattern
classifies the outputs (map) and transfer them toward following
assignment (reduce). These tasks are performed simultaneously
by MapReduce, i.e. integrity, error leniency, and load distribution.
Society now turn to innovation mechanics and intelligent transport
methods are mere informative. Events move related also expressed
along one another via the internet and this vision of the internet is
assumed to be Ubiquitous IoT [16]. In this literature, found to man-
age a massive volume of data as well as provides IoT applications
to a diverse customer base. However, massive Data bear different
impenetrability, IoT is beginning a valuable examine analysis. In
addition, the refine of massive data and the direction of the enor-
mous online-offline input against the IoT-based domain are used
by different architectures.
3. Smart city and smart transportation
3.1. Smart city
A huge complex system has been created by the brisk expansion
like different metropolis aspects involve, with digital government,
IT proposal, machinery, framework, and habitat. Such a compound
structure presents a range of threat and liability, with examples
vary from carbon emissions and blockage, to rising unemployment
rates and negative social impacts. The use of information and com-
munication technology (ICT) to make cities ‘‘smart” is one solution
for managing urban problems and improving the livability, oper-
ability and sustainability of cities. There are three layers in the
specific context of intelligent community: 1) data acquisition and
manipulation, 2) systems analyst, 3) resource provision. First, data
must be collected with the help of smart devices located through-
out the city, such as traffic sensors, utility usage sensors, weather
stations, mobile phones, and social media networks, when building
a smart city. In order to prepare temporal information, heteroge-
neous data from different sources must be organized using data
normalization and data management techniques as the main
inputs for the analytics phase are shown in Fig. 1.
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
2
3.2. Smart transportation
Every out-of-home activity today depends on the transportation
system, which includes roads, train line, underpass, control signal,
automobile, etc. Setup that shift nation to work, shopping, travel-
ling, going to schools, and spending time with friends around the
city. On the other hand, important problems are caused by the
rapid adaption about transport channels across urban areas related
to the fast expansion in methods of travel and economy, the main
challenge is traffic congestion. Use of such electricity, pollution
levels, economic growth, and Social care has tight harmful impacts
on power depletion, environmental impact, productivity, global
health and freight collision. [17].
Freight input origin are categorized in six division: 1) data from
traffic flow sensors, 2) data from video image processors (VIP), 3)
data from vehicles and people, 4) data from locality placed internet
community, 5) transit data centering on intelligent cards, and 6)
habitat knowledge. VIP has introduced far more traffic applications
compared to other traffic flow sensors due to rapid advances in
computer vision and image processing fields. So, though it belongs
to the traffic flow sensor group, we consider VIP as a separate
group. The probe vehicle and the group of people referring to
motility detector receiving stochastic traffic information is seg-
mented mostly on basis of the data collection technology used:
Global Positioning Systems (GPS), bluetooth, and mobile cellular
networks.
4. Architectural design of data analytics in ITS
The design for performing ITS Data analysis is demonstarted in
Fig. 2 [35]. This can be divided into 3 layers: the data collection
layer, the data analytics layer, and the application layer.
4.1. Data analytics architecture in ITS
4.1.1. Data collection layer
The architecture is based on the data collection layer, although
it gather the data requested only for top layer. Information comes
against various derivation, detectors for inductance loops, radio
waves, broadcast surveillance, acquiring information, radio waves,
and GPS. In the following sections, details on Big Data collection
will be presented.
4.1.2. Data analytics layer
The core architecture layer is the data analytics layer. The pre-
sent layer in essence usually designed to capture information
from data gathering and implementing various approaches to data
analytics and the relevant framework for storage devices, mainte-
nance, mining, data presentation and accessing.. In next sections,
details of massive Data analytics interface and models are
described.
4.1.3. Application layer
The application layer is uppermost layer of architecture. Statis-
tical model outcomes from data analytics layer are applied in
diverse transport conditions including traffic forecast and traffic
assistance.
4.2 A global review
Aims to provide a critical review of data fusion initiatives based
on smart objects in mobile devices, with a special focus on the clas-
Fig. 1. Framework for smart transportation with data sources [36].
Fig. 2. Architecture of Big Data analytics in ITS [35].
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
3
sification of human activities [18]. The surveys relevant to our
paper are further developed in Table 1. A summary of each relevant
survey paper’s main focus is given. It is clear that, although there
are a number of data fusion surveys, these are mainly focused on
areas or classifications of specific applications (e.g. ITS embedded
sensors in mobile devices).
4.3. Evaluation of data analytics techniques in C-ITS
The Cooperative Intelligent Transport (C-ITS) System endure an
essential facilitator of inevitable state traffic regulation. An innova-
tive C-ITS study implemented automobile, street side entities and
control zones for traffic they produce a lot of traffic, that incorpo-
rates contextual movability and infrastructure. [19]. In C-ITS, data
analytics tools can be used to assess the accuracy of services by
means of congestion control and competitive transmission. Certain
C-ITS services that mostly depend on intelligent choices informa-
tion gathered, a few data analytics strategies often optimize accu-
rate simulation in Big Data computing. Several more C-ITS
technologies obtain and examine but provide user-based data.
Two such applications are discussed here, namely the smart park-
ing system and monitoring of road conditions, study offers an
intelligent, cloud-based parking via the Internet of Things (IoT).
Thus every car has quite an RFID tag and RFID reader is accessible
there in car at both entry and exit points. The system is imple-
mented collects data on approaching traffic employing RFID read-
ers. Users allows to book a parking space with a mobile app until a
database controller intends to assign parking space.
4.4. Cloud storage
Cloud based data is quite well categorized in online storage that
attains the cloud. Data Manager (DM) is responsible for classifying
the different datasets referred on the basis of their sources. The
Time Calculator (TC) and Navigator (NG) of cloud elements utilize
data sources labeled by DM [32], to build innovative data points in
cloud services. The data sets operated in cloud storage were indeed
illustrated as follows.
4.4.1. Mobility datasets
Mobility datasets (mdata) describes RSU data based on their
Lane Idd in different datasets (lanId). These datasets of distinct
routes are processed in the Mobility Data Repository (MD). The
mdata append vehicle id (vId), vehicle velocity (v), vehicle air resis-
tance (a), vehicle location (vLoc) and vehicle distance (d) from lane
joining communication phase. On-board sensors capture the
rapidity, since this acceleration is analyzed by the OBU [20]. The
spacing is inferred from the vehicle’s spot (vLoc) and the signal
point’s location down the lane. The GPS encoded as in vehicle cor-
respond the sensor values, whereas position of signal point is
determined using corresponding Lane Id (lanId), about every pay-
load heading against RSUs, the mobility datasets are kept updated.
4.4.2. Destination list dataset
Destination List dataset (dList) Specifies a data source repre-
senting all the different destination locations (desLoc) that the
vehicles are proceeding. The riders sustains device location in the
vehicle OBU either at beginning, so every new service from RSUs
is reviewed mostly with RSUs travel desired place and including
mobility data mostly from destination list (dListData).
4.4.3. Traffic flow dataset
Traffic flow dataset (tFlowdata) organizes information in a tra-
vel time database from different LDSs (tFlowdata), lane id (lanId)
but the differing volume of vehicles are included (vCount). A loop
detector for stationary vehicles capture levels of automation resid-
ing at the frequency (LDS). After each period of time, the traffic
flow data (tFlowdata) is retained up to date.
4.4.4. Pre-decision datasets
Pre-decision Datasets (pDecData) cloud aspect of Time Calcula-
tor (TC) quantify time to reach (TTR) of each automobile using cor-
responding mData refer the corresponding sensor abandoned. The
obtained values track-wise pre-decision (pDecData) time-series
embedded in various pre-decision data repositories (PDD) follows
TTR is determined.
4.4.5. The Navigator (NG)
Cloud module takes dListData but the online service to deter-
mine all optimal routes with each dListData destination. A new
dataset, a sequence about all the adjacent nodes to associated
places, is deposited there in shortest path (sPathdata) for every
registry, comprised of a desired destination even a specify of all
the nearest neighbors along their intervals, contribute to subse-
quent given destination. To determine all quickest pathways to
the route, NG benifit Dijkstra’s single-destination shortest path
algorithm.
5. Literature survey
The premise of big data has been interpreted throughout the
framework of transit and mobility, a research study of smart
Table 1
Comprehensive review of work associated with data analytics in Intelligent Transportation System.
Author(s) Description CC IoT BDA ITS
Tene, Omer[2] Entities need significant privileges inside a functional, digital format for accessing data to solve big data privacy
predicament.
U✘✘
Yuan Wei et al [7] D2ITS (Data-Driven ITS) systems, through use of actual data, along with navigation system, feeling absorber and
pictures, probably makes better practice of data sources. .
UUU
Shi Qi [11] Microwave Vehicle Detection System (MVDS) familiar to optimize the functionality of metropolitan overpass order
to decrease traffic problems and the consequences of collisions.
✘✘ UU
Amini,Sasan [22] To adopt the emerging approaches to Big Data, Kafka for the construction of a portable real-time traffic control
system.
UU U
Marjani, Mohsen[23] Big IoT data analytics UU U
Liu, Yang [2 6] Analyzing the smart urban transport system using GPS technology, GIS technology and structure ✘✘ UU
Tomar, Pradeep [27] IoT-based Real Time Smart Street Parking System Prototype (RTSSPS) UU U U
Siow, Eugen [28] Analytics for Internet of Things UU U
Ge, Mouzhi [29] Usage of Big Data Analytics tools in IoT Domains UU U U
Hopkins Jon [33] In the supply chain, real-world deployment of BDA and IoT technologies UU U
Shadroo, Shabnam [37] Intelligent transport applications and concepts were associated in making an investment in the ideologies. UU U U
Legend: CC = Cloud Computing, IoT = Internet of Things, BDA = Big Data Analytics, ITS = Intelligent Transport System.
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
4
transportation systems would be provided, emphasizing the diver-
sity of existing literature in recent decades, putting forward a ques-
tion of route map for deploying large traffic data on the cloud
(Table 2).
5.1. Big data in transportation and mobility
Big Data is coming to transportation industry to solve conven-
tional digitize issues to provide modern scope, application mainte-
nance through unique data origin and applied science. The
legitimate implementation of traffic influencing data streams is
indeed ideal example; such a specific problem is not revolutionary,
but its volume, velocity and range seem to be attributes which
compromise the information processing. Contextualized descrip-
tion towards Big Data mixes collectively assortment of huge mea-
sure of immense speed, heterogeneous, developing area
information and utilization of cutting edge methods and exem-
plary to repository, recover, oversee, measures break down the
caught data [26].
Big data means large quantities, instant rather than realistically
provides context infrastructure demanding multiple categories of
optimizing complexities, the inclusion with expertise and process
simplification. From the point of view of transport, apparent as a
listing of applied science enable all the data needed via progress
different path of providing impregnable, further effectual transport
to be managed effectively (store, process and access), representing
consumer to illustrate and customize transport familiarity. Three
transportation features and scales are major aspect quote in refer-
ence to Big Data characteristics 3 Vs volume, velocity and variety
model defined by Gartner. There are other aspects, such as veracity,
visualization and visibility, referred to by other authors.
5.2. Features and scales of transportation Big data
Additionally, we will concentrate on the previous variables, as
they are important in the manner in which information is col-
lected, analyzed, organize, repository, diffused and treated [29].
The ensuing inventory outlines in designing 3 majority important
marker that distinguish large data set from typical data.
5.3. Data-fusion architectures
Attempt to acquire a deep assessment of flexibility behavior and
traveler trends which cannot be extracted from a given criterion,
data fusion algorithms are employed to individual but supplemen-
tary data sources. The main objective of smart transportation data
fusion algorithms to establish an advanced automation strategy
that connects expertise from disparate pieces of knowledge. The
types of input/output, which are subdivided into different data/fea-
ture/decision combinations versus data/feature/decision output. In
fusing techniques, the protocol layers of the data installed, where
raw data, features, and decisions are three levels of abstraction that
are transformed by feeding into fusion algorithms to the more pre-
cise levels. In the transportation system [32], Big Data comes from
many sources, such as road and inside vehicle traffic monitoring
systems. In addition, during normal, uncertain or crash conditions,
detector set up in automobile play a significant role in analyzing
information. In addition to the network architecture, technologies
such as Global Positioning System, mobile, WPAN, Radio Naviga-
tion, Automatic Vehicle Identification performs significant job.
5.4. Collection of transportation and mobility data
Consequently big data gathering are being comprehended as
from the assignment of accumulating data gathered by sensing
devices that have been triggered via various transportation and
mobility platforms by residents and aircrafts. Information gather-
ing comprises a number of assignments to obtain data from fre-
quencies. Implementation able to contribute of the actual data
structure even these electronic data, specimen which can be orga-
nized via a computer within electronic signals.
Recently, Data Analytics in transit and movability, global tech-
nological trend recognized as elite rate of model for which pattern
assumption, learning and optimization system are used.
Table 2
A summary of existing primary work focus on Application.
Article Main Idea Advantages Disadvantages
Wang, Chao [6] Intelligent transport systems don’t ever end
utilizing evolutionary computation and
feedback control system in larger datasets,
designed with sensing to instantly monitor
traffic lights at the crossroads.
Designed to reduce driving or waiting time
to solve nonlinear control issues
Neural networks could increase uncertainty,
not adaptive to management team
Engelbrecht, Jarret [9] Smartphones include a digital system for the
application of sensing applications and
connected cars, other devices of the
Intelligent Transportation System (ITS)
Links actions to an entity, instead of to a
vehicle of which the driver might be unclear
Lane positioning problems and sudden
acceleration or erratic braking.
Zheng, Xinhu [10] Agent technology is widely used in transit
control and management, and more digital
tools and human intelligence techniques is
often used in automobiles to strengthen
comfortable ride.
Distributed Advanced Driver Assistance
Systems (ADAS) control and strategic plan
premised on connected devices connecting
automobile, residences, and retail outlets.
Unknown road disturbances are dealt with
ensures a realistic method that allows
roadway connectivity and intelligent
platform obtain up-to-date information on
the road.
Rathore, Muhammad
Mazhar [21]
The main objective of a Smart City is just to
bring precise data there in right time to make
moral choice easier.
To mitigate the design patterns, the
outcomes of every digraph are consolidated
with input size of the Hadoop Map
generating the final result.
Incapable of making knowledgeable opinion.
Gohar, Moneeb [30] Designing future intelligent cities aim to
incorporate new buildings and design
effective ways to enhance use of upgrading
the eco system of the city, like connected
phones and data analytics.
ITS relevant toward reduced traffic
congestion, additionally to significant lower
fuel economy relevantly the costing of
lifetimes by limiting traffic fatalities and
acquiring driver emergency aid.
Accuracy, reliability, actual interaction
among customers and facilities are all
concerns that have to be addressed.
Iqbal, Rahat [34] Machine learning (ML) methods provide a
means to model Data Trends and inferences
can be used to explore interactions that is
used to generate forecasts based on earlier
invisible events.
The acquisition of more complex insights
into consumer preferences will give modern
organizations and companies a major benefit
over the rivals
Due to high complexity, mathematical and
conventional modelling are unable to work.
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
5
5.4.1. Descriptive analytics
It includes elements with knowledge discovery (clustering,
detection of outliers), identification of patterns and significant to
contrive conventional data design. Models falling within this cate-
gory essentially try to summarize, condense either explain the
knowledge gained, that agreed take off entire useless documenta-
tion and implied phenomena from the context of categories or
shaped coefficients of facts. There is an abundance of applications
for descriptive statistic with in automotive sector also in studies,
except for autonomous driving counter measures and safety proto-
cols, vehicle safety analysis, incident detection or autonomous
vehicles, since these zones are quite widely arrived employing
descriptive methodologies.
5.4.2. Predictive analytics
To identify the correlation between two possible outcomes or
consequences and a regression line, taining schemes are extended
to knowledge representation. Once trained, such models are possi-
ble to forecast the costs for new input data of an unknown target
variable, refusal inevitably similar as all the current facts [30] on
the concept was constructed. In the field of transportation, Frame-
works of certain classification algorithms are numerous, along with
self-driving in instantaneous recognition, forecast of traffic event,
e-taxi objectives like Uber, Ola (challenge prediction) or Commu-
nity content incident examination firms like Nexar (comprehen-
sion and ignoring collisions). The Interaction discerned mostly in
perspective with predictive modeling with traffic forecast (supply,
environment and efficiency) when revealed in early findings in the
subject is of particular concern to highlight.
5.4.3. Prescriptive analytics
Through optimization procedure, intelligent retrieval moreover
different aspects from reckoning Intelligence, Optimization theory
and performance Research, they suggest a better influence or out-
come taken between collections of potentiality. To this end, the
insight accumulated by descriptive and predictive adversaries from
previous ones is used by prescriptive analytics [31]. Issues in trans-
port and logistics enhancement have consistently taken advantage
of prescriptive analytical tools such as logistics, influential transit
planning or data services.
As a consequence, deep learning techniques said to be well
adapted for descriptive, predictive and prescriptive analysis.
Exceeding massive, composite datasets in particular predicting
traffic fatalities from internet community including sequential
semantic network, intelligent retrieval, adversarial networks fur-
ther proposal.For example, the HBase-based massive data refine
concept arrange to analyze data provided by intelligent transporta-
tion systems for recording and monitoring. The task, where analyt-
ics attached to build models of recognition that detects
unregistered taxis, is another contributors of interest.
5.4.4. Visualization of data in transit and mobility
A few of data analysis firms greatest powerful sectors, currently
is data visualization. The full examination of data has evolved into
an architectural discipline aimed at contextualizing data from the
derivation of methodologies for attaining insights into complex
datasets. During this segment authors may attract especially on
graphic data sources since an extensive assessment in those tech-
nologies and standards besides computing tube. Significant analy-
sis that asserts data extraction like an ambitious plan towards
expressing immense transport databases with data envision regard
to visualization as a mechanism for knowledge discovery. For
aggregate data representation, storage capacity moreover fre-
quency areas of responsibility will be functioning strengthened
and incorporated.
A study focused on information recognition is designed to visu-
alize frequented venues and behavior tendencies of along with
entity performers to analyze meetings with both local organiza-
tions. In particular, Visualization have been categorized into
descriptive, predictive and prescriptive analytics, represent the
characteristics selected from data for discriminating patterns.
5.5. Data origin in transport
Data comes with many forms within the transit platform, such
as road or within vehicle traffic surveillance systems. In addition,
as indeed automobile perform a significant job in information
gathering throughout unpredictable or collision circumstances.
Until then, practical such as GPS, mobile, WPAN, radio navigation,
automatic identification vehicles, RFID. The demand for traffic on
different paths differ slightly at various route.
Quantity margin and level of service has traditionally imple-
mented a signifier of traffic jam by the transport authorities. This
would lead to comprehensive analyses like mailing alerts to rider,
employing different techniques for smoothing traffic, proportional
traffic laws, renovations of sections, strategizing rider, much of this
initiative may offer the congestion down.
5.5.1. Automobile path planning
Several variants in rider down force are recommended mostly
on premise of a design characteristic in effort to stop certain auto-
mobiles and maintain individuals in driveway, a range of
approaches are addressed. A technique for attaching obstacles to
a potential field protecting fatal accidents whenever the cargo
approaches the same distance into the potential field area. How-
ever, in event of unexpected fatal accidents it’s an abnormal
moment on a route including a sharp curve either at expressway
can eventuate to be difficult.
5.5.2. Vehicle position detection
If Vehicle position calculated accurately in advance, the position
or location of the vehicle conserve powerful influence, assumption
of such prospects as accident. Conversely, developing a visualiza-
tion of capability automobiles is often a major obstacle since the
speed of the vehicle in evaluation will also be involved. Strategy
being used to simulate an automobile 3-d input images is sug-
gested, besides able to design a detailed plan apart from a few
commuting features or even to estimate a collision, data obtained
by the 3D map could be used.
5.6. Data storage and Interpretation using spark
Main stage in the detailed layout which intends the entire oper-
ation is analysis phases of Big Data. Fig. 3 demonstrates the compo-
nents and mechanisms involved in this phase. The whole retrieval
is premised on the idea of data programming use different process
executions are done simultaneously to save time. Multiple proces-
sors usually makes a programme run faster because it is run by a
lot of engines (CPUs). With parallel computing, the principle clus-
tering is evolving Load balancing improves the availability of
bridge tasks to several supercomputers. The objective of data
transmission include to minimize use of microcontroller, mitigate
response time, maximize output nor avoid afflict. At first, the col-
lection and analysis module of Data Analytics delivers load balanc-
ing methodology that is used to split the load into same density for
each server. Equivalent portion of segment increased the quality of
the process, extremely similar output will be processed and an out-
put produced simultaneously by each and every server. In attempt
mostly in data acquisition context to be quite consistent, enforce-
ment is reached by computing technique of MapReduce.
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
6
Since the present stage, equivalent instruction of MapReduce
and HDFS will be addressed. In addition to HDFS, SQL’s supposed
HBASE, HIVE need into store historical data for database adminis-
tration (offline or in-memory). In the Hadoop cluster, MapReduce
and HDFS overall function. Consequently, we also need a data
stream in real time as well as offline data analysis. In addition,
we need a 3rd party software to combine Hadoop computing pro-
tocol for legitimate computation in order to deliver implementa-
tion in actual environments. Spark is used for legitimate
implementation, mostly with the Hadoop platform. For extensive
data processing, Apache Spark is a specific engine. It provides Big
Data with faster access and allows recyclability across streaming
applications. Usually used for big data workloads, interface of
advanced parallel computing. Fast performance, combines buffer-
ing and enriched effectiveness in cognition. Apache Spark is
assisted by Amazon on Hadoop YARN patterns are being checked
and performed quickly.
5.7. Big data approaches used in ITS application
Now it is possible to access and share detailed transport data on
unprecedented scales, which reveals new paradigms of transporta-
tion and smart mobility advantages. A multitude of transport and
mobility applications are supported by the Big Data framework,
which includes some including all of the key stakeholders in the
study entry:[34] Residents (walker), the civil service (executives)
and capitalism. The encompassing of massive data approach proce-
dure in real applications derived from data acquisition, observation
and manipulation does not vary significantly between systems and
services associated with automotive sector. Conversely, if there are
changes in the findings as well as scientific strategies incorporated
for those technologies, each stakeholder’s commitment will cover
the full massive data phase, from research process to its assertion
with intellect records (that had importantly provoked a new set of
issues devised with distributed computing.
5.8. Data analysis issues and implementations
Therefore, in this category experts evaluate several essential
questions pertaining to C-ITS data science. Such difficulties and
their potential solutions are discussed below [35] (Table 3).
5.8.1. Data collection
Each C-ITS station collects data based on information on traffic
and mobility. Adequate originate of continuum 5.9 GHz as the
exchange of C-ITS relevant information has been assigned by both
the IEEE and ETSI norms. The DSRC is again split in to communica-
tion link but a dedicated short-range communication continuum
different broadcasts facility besides the respective exchange of
security non-security information.
5.8.2. Data diffusion
Additional challenging task connection with information
insights is successful data diffusion. Automobiles accept recurrent
CAMs for making choices related to security implementations, as
discussed earlier. As assumptions on reliability are taken on basis
of community movability information, these messages must be
received with high reliability by vehicles. The default wireless
technology for the dissemination of these messages within the
local community region is IEEE 802.11p in this context. Along with
large matrices, from the instance alert messages, might be used a
multi-hop approach respectively vehicles and RSUs may be
informative.
5.8.3. Data calculation
ITS data calculation can be executed with a distributed or cen-
tralized way. Instead every driver preserve to evaluate automobiles
have been associated with a virtualized environment but rather
communicate periodic Recorders, results to give itself a safety deci-
sions including utilizing brake system, oncoming traffic includes
several interventions.
5.8.4. Metrics
Using effective metrics is an additional question with using C-
ITS analytics. In evaluating a reliability on safety applications in a
C-ITS, primary consequence is identifying the level of consensus
of automobiles and its nature by enhancing effective safety mea-
sure on automobile integrity may be assured metric.
5.8.5. Data analysis
Concise data analysis is another significant aspect of data ana-
lytics. Even more information could be received by RSUs and
TCC, the volume of automobiles and expected amount of traffic
on the road and essential security algorithm stronger key lengths
implies secure automobile methodologies, suitable inferences can
be made on such data. A k - mean clustering, correlated transition
will result in a drop as in data transmission unless the volume of
the automobile is maximum than every interval. Within city, fuzzy
inference principles can been required to develop effectively trav-
elling determinations of essential contemporaries.
6. Open challenges
The substantial interaction evinced within that existing litera-
ture conducted around in the subsequent section could be a clear
sign of the innovations recently introduced with data analytics
Fig. 3. Data Storage and Interpretation using spark [40].
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
7
implemented to urban mobility areas. In addition, indeed a variety
case studies hardly unresolved but never identified with moment,
and yet major innovations persist also emerging a consequence of
data acquire and impoverishment all over heterogeneous transport
regions, adaptation to the latest Big Data features and actual data
analytics. Creativity as well as performance were further regulated
while being consistent in providing actual data refine and applica-
tion development.
6.1. Security and privacy
Data security is a long-standing issue that is widely studied in
the fields of mobility and transport. In this respect, the survey pro-
vides guidance for future research in which the use of sensor data
to improve battery power and time is proposed for smart e-bike
control system. The statistical analysis was done and merged to
ensure effective usability purposes along with approval organize
aggregate advice needs in this area.
6.2. Data collection
The data collected during mass transit could be inconsistent,
inadequate, or incompetent, despite the enormous momentum of
vehicles and pedestrians, in particular locations perhaps at specific
intervals. Instance, hardly complete automobiles will be integrated
with in key quality that implement actual position data and road
traffic statistics against traffic detector could be ignoring. Another
conceivable solution to evaluate the contest will be to improve
existing quantitative strategies to improve data collection capabil-
ities with the adoption of IoT, promote specific detector tools
through manually that can significantly improve information gath-
ering and efficiency.
6.3. Data privacy
The most demanding and fretting issue in the Big Data era is
privacy. During data transmission, storage and individual informa-
tion is sometimes revealed. Collected information from transport
systems, governments should review current data confidentiality
policy which comprises personal information are being reported
with context of manuscript or use of records the general rules for
dataset and sharing of resources and several aspects to avoid addi-
tional loss of relevant sensitive data.
6.4. Data storage
At present, the volume of data stepped against Terabyte to Peta-
byte, advancement in computing capacity is still initiating. Each
day, particularly ITS will have a huge range of information from
various actuators. Conventional database facilities and server
equipment were not capable of dealing via an excessive lot of
issues along with complicated records infrastructure and database
tools. [35].
Implementing the most appropriate memory structure is now a
crucial issue for cloud services like Google, Microsoft are continu-
ing through enhance everyone assistance in a built-in efficiency of
data analytics Cloud assets, Solid State Drives storage are growing
as major aspects for huge information storage.
6.5. Data processing
Predictability is essential for ITS Real - world applications, along
with traffic data classification, traffic status identification, actual
traffic control, vehicle routing guidance and the scheduling of
actual public transport. Traffic data using multiple platforms via
a lot of methods can be evaluated to historical data and Analyzed,
with a minute period of moment. Data acquisition method is cap-
able of processing more complex and increasingly extensive data
[36]. Maintaining the reliability of the process with these strong
and powerful statistics is a critical concern. A range of different
massive data architectures which deal with actual datasets, like
Apache Storm, Apache Flink, Apache Samza, Apache Spark Stream-
ing and Kafka Streams gained prominence.
6.6. Data opening
Information will be modified and shared with the public in high
quality to allow consumers of mass transit content providers to
consider reusable relevant data. The integrity of the information
pertains with its reliability, truthfulness, data analytics may delude
decision-making or even generate information without better
quality performance with negative impacts. Setting up data of good
quality however may necessitate moment as well as payments. In
respect, swap among establishing information sensitive to a
change expenses and creating qualitative information applicable
on massive cost, making this a difficult task to open up good qual-
ity data [37]. Technical programs provide the acceptance of artifi-
cial intelligence for test execution by automatic data acquisition
and/or use. In addition, to ensure vibrant and accurate data on
transport entities might have implemented a trade activities.
6.7. Big data analytics
Infrastructure methodology is often applied in analytics unit.
The ITS input dataset is transferred into portions in the map-
reduce framework, therefore the mapper clusters are analyzed.
Table 3
Study of ITS data analytics technologies.
References Objective of the technique Classification Analytics Techniques
used
Data collected
Wang, X [15] business objectives and technical requirements on demand behavior Traffic
congestion
Visualization
techniques
Open source big data
Hua, Xingcheng [17] On the basis of HDFS, HBase supports a wide tables also as extensible
data structure.
Congestion Data mining
Techniques, Cloud
Computing techniques
Hadoop Traffic Data
Management System
Vankudre, Ashish [20] For real-time traffic object tracking from Tweets flow analysis, Smart
framework based on textual extraction and machine learning..
Traffic
congestion
or crash
time-based evasion
and crawler evasion
exist
Big data
Javed, Muhammad
Awais [31]
Equipment to minimize the time complexity of accepted protection
mails.
Congestion Parallel and distributed
computing techniques
Bigdata
Neelam, Sahil [39] The sustainable urban transport system that relies on the Cloud And
edge IoT is required to determine traffic inflow and time-optimized
smart vehicle navigation.
Congestion data classification
techniques
Road Side Unit Sensor
(RSUs), Laser Distance
Sensor (LDSs)
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
8
Mostly on assigned records, thus every mapper module performs
independently, intermediate outcomes remain generated. Includ-
ing modifiers which operate the clustering, the result obtained
by reducers were integrated into final outcome which carried onto
simulation component, while the result obtained by the mappers
are essential. [38].
7. Discussion and future research directions
The constraints of existing research are noted out in this sec-
tion. As a consequence, few essential issues could arise to be
addressed in future research support full use of big data trans-
portation [22].
Information gathered by ITS mechanisms is intended to opti-
mize opportunities with more extensive traffic congestion, further
exploration is meaningful. Most of the recent findings depend with
1 or 2 of a substantial dataset for transport, while certain numer-
ous factors of valuable insights. [39].
Data on transportation is being obtained through various chan-
nels. Even so, there has been highly heterogeneous data. Data
aggregation would therefore throughout the longer term serve a
powerful transport systems of data analytic effort.
Simulators of urban transit systems possibly vastly upgrade
upon basis of actual data via a smart transport collected device
[41]. Aforesaid program for simulation a significant role in ramping
up ITS advancement whereas limiting new platform expenditures
being tested.
Some of the Big data analytics challenges are expressed in
Table 4
8. Conclusion
Intelligent transit stimulates the development of nation’s
infrastructure and improves the sense of happiness of residents.
Through this article, developers discussed significant growth along
with data Analytics and also the recognizing with ITS. Effective
structure for the implementation as concerns ITS data analytics
has been explored. A lot of ITS Data Analytics platforms are being
addressed, along with route optimization, traffic congestion fore-
casting, fatal crash analysis, massive transit organizing, private
route planning and transport infrastructure management. Smart
mobility framework is developed for actual storage employing
Advanced Analytics tools. We are indeed enhancing the tasks by
building a strong automobile data assets in place to ensure real-
time decisions on real-world live traffic.
Declaration of Competing Interest
The authors declare that they have no known competing finan-
cial interests or personal relationships that could have appeared
to influence the work reported in this paper.
References
[1] J. Zhang, F.-Y. Wang, K. Wang, W.-H. Lin, X. Xu, C. Chen, Data-Driven Intelligent
Transportation Systems: A Survey, IEEE Transactions on Intelligent
Transportation Systems. 12 (4) (2011) 1624–1639, https://doi.org/10.1109/
TITS.2011.2158001.
[2] O. Tene, J. Polonetsky, Big Data for All: Privacy and User Control in the Age of
Analytics, Northwestern Journal of Technology and Intellectual Property. 11
(2012).
[3] M. Shtern, R. Mian, M. Litoiu, S. Zareian, H. Abdelgawad, A. Tizghadam, in:
Towards a multi-cluster analytical engine for transportation data, IEEE, 2014,
pp. 249–257.
[4] R., Ghanbari, H., Zareian, S., Shtern, M., Litoiu, M.: A data platform for the
highway traffic data. In: 2014 IEEE 8th International Symposium on the
Maintenance and Evolution of Service-Oriented and Cloud-Based Systems
(MESOCA), pp. 47–52. IEEE (2014)
[5] Bello Orgaz, Gema & Jung, Jason & Camacho, David. (2015). Social Big Data:
Recent achievements and new challenges. Information Fusion. 28. 10.1016/j.
inffus.2015.08.005.
[6] C. Wang, X. Li, X. Zhou, A. Wang, N. Nedjah, Soft computing in big data
intelligent transportation systems, Appl. Soft Comput. 38 (2015), https://doi.
org/10.1016/j.asoc.2015.06.006.
[7] W. Yuan, P. Deng, T. Taleb, J. Wan, C. Bi, An Unlicensed Taxi Identification
Model Based on Big Data Analysis, IEEE Trans. Intell. Transp. Syst. 17 (2015) 1–
11, https://doi.org/10.1109/TITS.2015.2498180.
[8] L. Rodríguez, C. Rodriguez-Enriquez, J. Sánchez-Cervantes, J. Cervantes, J.
García-Alcaraz, G. Alor-Hernández, A general perspective of Big Data:
applications, tools, challenges and trends, The Journal of Supercomputing. 72
(2015), https://doi.org/10.1007/s11227-015-1501-1.
[9] Engelbrecht, Jarret & Booysen, M.J. (Thinus) & van Rooyen, G-J & Bruwer, F.J..
(2015). A Survey of Smartphone-based Sensing in Vehicles for ITS Applications.
IET Intelligent Transport Systems. 9. 10.1049/iet-its.2014.0248.
[10] X. Zheng, W. Chen, P. Wang, D. Shen, S. Chen, X. Wang, Q. Zhang, L. Yang, Big
Data for Social Transportation, IEEE Trans. Intell. Transp. Syst. (2015), https://
doi.org/10.1109/TITS.2015.2480157.
[11] Q. Shi, M. Abdel-Aty, Big Data applications in real-time traffic operation and
safety monitoring and improvement on urban expressways, Transportation
Research Part C: Emerging Technologies. 58 (2015), https://doi.org/10.1016/j.
trc.2015.02.022.
[12] Vlahogianni, Eleni. (2015). Computational Intelligence and Optimization for
Transportation Big Data: Challenges and Opportunities. 10.1007/978-3-319-
18320-6_7.
[13] Y. Xia, J. Chen, X. Lu, C. Wang, C. Xu, Big Traffic Data Processing Framework for
Intelligent Monitoring and Recording Systems, Neurocomputing. 181 (2015),
https://doi.org/10.1016/j.neucom.2015.07.140.
[14] Rathore, Muhammad Mazhar & Ahmad, Awais & Paul, Anand & Thikshaja,
Uthra. (2016). Exploiting real-time big data to empower smart transportation
using big graphs. 135-139. 10.1109/TENCONSpring.2016.7519392.
[15] X. Wang, Z. Li, Traffic and transportation smart with cloud computing on big
data. 13 (2016) 1–16.
[16] I.A.T. Hashem, V. Chang, N.B. Anuar, K. Adewole, I. Yaqoob, A. Gani, E. Ahmed,
H. Chiroma, The Role of Big Data in Smart City, Int. J. Inf. Manage. 36 (5) (2016)
748–758, https://doi.org/10.1016/j.ijinfomgt.2016.05.002.
[17] Hua, Xingcheng & Wang, Jierui & Lei, Li & Zhou, Bin & Zhang, Xiaolin & Liu,
Peng. (2016). H-TDMS: A System for Traffic Big Data Management. 626. 85-96.
10.1007/978-981-10-2209-8_8.
[18] El Faouzi, Nour-Eddin & Klein, Lawrence. (2016). Data Fusion for ITS:
Techniques and Research Needs. Transportation Research Procedia. 15. 495-
512. 10.1016/j.trpro.2016.06.042.
[19] Nandury, Satyanarayana & Ara Begum, Beneyaz. (2016). Strategies to handle
big data for traffic management in smart cities. 356-364. 10.1109/
ICACCI.2016.7732072.
[20] A. Vankudre, M. Bhosale, Detection of Real-Time Traffic through Twitter
Stream Analysis, International Research Journal of Advanced Engineering and
Science ISSN (Online) 2455–9024 (2017) 2.
Table 4
Data Analytics Issues in ITS.
Reference Data Analytics Issues in ITS
Ghanbari [4] 1- Vertical and Horizontal Scaling
2- Data Normalization
3- Data Representation
El Faouzi [18] 1- Integration scheme of trustworthiness and
dependability
2- Evaluation of Integration scheme with key
metrics
2–1 Adaptive nature of MOPs.
2–2 Relying on MOPs cluster based context.
2–3 Ground truth
3- File systems as well as promotional system
software.
4-Design for worst-case scenarios.
Nandury,
Satyanarayana [19]
1- Actual traffic information involvement with
knowledge review
2- Strategies to handle Big data
3-Supervised Learning Model
4-Elementary Tools Assessment Perceptron and
Smarter Modem
5-Data Fusion Engine
6-Decision Logic
7-Smart Decision & Control Enabler
Torre-Bastida [32] 1- Innovative revolutionary kinds of transit.
2- Prospect of transit in urban communities
3- Infrastructure upon motility
4- Modern Intelligent Transport system
5- Economic and Regulatory trends.
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
9
[21] M.M. Rathore, A. Paul, W.-H. Hong, H. Seo, I. Awan, S. Saeed, Exploiting IoT and
Big Data Analytics: Defining Smart Digital City using Real-Time Urban Data,
Sustainable Cities and Society. 40 (2017), https://doi.org/10.1016/j.
scs.2017.12.022.
[22] Amini, Sasan & Gerostathopoulos, Ilias & Prehofer, Christian. (2017). Big data
analytics architecture for real-time traffic control. 710-715. 10.1109/
MTITS.2017.8005605.
[23] M. Marjani, F. Nasaruddin, A. Gani, A. Karim, I. Hashem, A. Siddiqa, I. Yaqoob,
Big IoT Data Analytics: Architecture, Opportunities, and Open Research
Challenges, IEEE Access 5 (2017), https://doi.org/10.1109/
access.2017.2689040.
[24] Shukla, Sumit & Champaneria, Tushar. (2017). Survey of various data
collection ways for smart transportation domain of smart city. 681-685.
10.1109/I-SMAC.2017.8058265.
[25] E. Ahmed, I. Yaqoob, I.A.T. Hashem, I. Khan, A.I.A. Ahmed, M. Imran, A.V.
Vasilakos, The role of big data analytics in Internet of Things, Comput. Netw.
129 (2017) 459–471, https://doi.org/10.1016/j.comnet.2017.06.013.
[26] Liu, Yang. (2018). Big Data Technology and Its Analysis of Application in Urban
Intelligent Transportation System. 17-19. 10.1109/ICITBS.2018.00012.
[27] Tomar, Pradeep & Kaur, Gurjit & Singh, Prabhjot. (2018). A Prototype of IoT-
Based Real Time Smart Street Parking System for Smart Cities. 10.1007/978-3-
319-60435-0_10.
[28] E. Siow, T. Tiropanis, W. Hall, Analytics for the Internet of Things: A Survey,
ACM Comput. Surv. 51 (4) (2018) 1–36, https://doi.org/10.1145/3204947.
[29] M. Ge, H. Bangui, B. Buhnova, Big Data for Internet of Things: A Survey, Future
Generation Computer Systems. 87 (2018) 601–614, https://doi.org/10.1016/
j.future.2018.04.053.
[30] M. Gohar, M. Muzammal, A. Ur Rahman, SMART TSS: Defining Transportation
System Behavior using Big Data Analytics in Smart Cities, Sustainable Cities
and Society. 41 (2018) 114–119, https://doi.org/10.1016/j.scs.2018.05.008.
[31] M.A. Javed, S. Zeadally, Data Analytics for Cooperative Intelligent Transport
Systems, Veh. Commun. 15 (2018), https://doi.org/10.1016/j.
vehcom.2018.10.004.
[32] A.I. Torre-Bastida, J. Del Ser, I. Laña, M. Ilardia, M.N. Bilbao, S. Campos-
Cordobés, Big Data for Transportation and Mobility: Recent Advances, Trends
and Challenges, IET Intel. Transport Syst. 12 (8) (2018) 742–755, https://doi.
org/10.1049/itr2.v12.810.1049/iet-its.2018.5188.
[33] J. Hopkins, P. Hawking, Big Data Analytics and IoT in logistics: a case study, The
International Journal of Logistics Management. 29 (2) (2018) 575–591, https://
doi.org/10.1108/IJLM-05-2017-0109.
[34] R. Iqbal, F. Doctor, B. More, S. Mahmud, U. Yousuf, Big data analytics:
Computational intelligence techniques and application areas, Technol.
Forecast. Soc. Chang. 153 (2018), https://doi.org/10.1016/j.
techfore.2018.03.024.
[35] Zhu, Li & Yu, Fei & Wang, Yige & Ning, Bin & Tang, Tao. (2018). Big Data
Analytics in Intelligent Transportation Systems: A Survey. IEEE Transactions
on Intelligent Transportation Systems. PP. 1-16. 10.1109/TITS.2018.2815678.
[36] S. Dabiri, K. Heaslip, Transport-domain applications of widely used data
sources in the smart transportation: A survey, Researchgate. (2018) 1–53.
[37] S. Shadroo, A.M. Rahmani, Systematic Survey of Big Data and Data Mining in
Internet of Things, Comput. Netw. 139 (2018) 19–47, https://doi.org/10.1016/
j.comnet.2018.04.001.
[38] Ana Isabel Torre-Bastida1, Javier Del Ser et al. Big Data for transportation and
mobility: recent advances, trends and challenges. IET Journals. June (2018),
Bizana, Spain, pp. 742-755
[39] Neelam, Sahil & Sood, Sandeep. (2019). Smart Vehicular Traffic Management:
An Edge Cloud centric IoT based Framework. 10.1016/j.iot.2019.100140.
[40] M. Babar, F. Arif, Real-time data processing scheme using big data analytics in
internet of things based smart transportation environment, J. Ambient Intell.
Hum. Comput. 10 (10) (2019) 4167–4177, https://doi.org/10.1007/s12652-
018-0820-5.
[41] Amin, Mohammed & Hadouaj, Sameh & Darwish, Tasneem. (2019). Big Data
Role in Improving Intelligent Transportation Systems Safety: A Survey.
10.1007/978-3-030-12839-5_18.
A. Mohandu and M. Kubendiran Materials Today: Proceedings xxx (xxxx) xxx
10
... In this light, Kaffash et al. [8] performed a detailed review of big data algorithms and applications for ITS. In addition, Mohandu and Kubendiran [9] presented an extensive literature review on big data in ITS by describing different architectural approaches and applications. In their study, they provided an overview of various architectural schemes and, finally, adopted a similar scheme as the three-layer architecture proposed by Zhu et al. [7]. ...
... Our proposed DBDA platform, presented in detail in Section 2, is divided into three layers, namely the data exchange and collection layer, the data processing, analysis and cloud layer and the monitoring-application layer. The three layer architectural scheme, adopted by other studies (e.g., [7,9,36]) as well, was selected as it efficiently materializes the notion of conceptualization in such systems. Considering the study of related works, the main limitations identified included the design only at the conceptual level of the suggested solutions without adequate validation and testing, the over-focusing on specific tasks such as traffic prediction, the constraints due to lack of high-quality data and the non-validation in real-life operating environments or for a time duration adequate to support their feasibility, efficiency and performance. ...
Article
Full-text available
The unceasingly increasing needs for data acquisition, storage and analysis in transportation systems have led to the adoption of new technologies and methods in order to provide efficient and reliable solutions. Both highways and vehicles, nowadays, host a vast variety of sensors collecting different types of highly fluctuating data such as speed, acceleration, direction, and so on. From the vast volume and variety of these data emerges the need for the employment of big data techniques and analytics in the context of state-of-the-art intelligent transportation systems (ITS). Moreover, the scalability needs of fleet and traffic management systems point to the direction of designing and deploying distributed architecture solutions that can be expanded in order to avoid technological and/or technical entrapments. Based on the needs and gaps detected in the literature as well as the available technologies for data gathering, storage and analysis for ITS, the aim of this study is to provide a distributed architecture platform to address these deficiencies. The architectural design of the system proposed, engages big data frameworks and tools (e.g., NoSQL Mongo DB, Apache Hadoop, etc.) as well as analytics tools (e.g., Apache Spark). The main contribution of this study is the introduction of a holistic platform that can be used for the needs of the ITS domain offering continuous collection, storage and data analysis capabilities. To achieve that, different modules of state-of-the-art methods and tools were utilized and combined in a unified platform that supports the entire cycle of data acquisition, storage and analysis in a single point. This leads to a complete solution for ITS applications which lifts the limitations imposed in legacy and current systems by the vast amounts of rapidly changing data, while offering a reliable system for acquisition, storage as well as timely analysis and reporting capabilities of these data.
... The given framework is verified by a twin-rotor aerodynamic system (TRAS) model for operations defined within a finite duration. Another important category of ML-based data collection protocols are the ones with Apache Spark integration [43][44][45][46][47][48][49]. In Apache Spark, cluster environment, R, a statistical computing and graphics software within Spark, is used to give the user the capability to construct statistical and prediction models using the traffic data. ...
Article
Full-text available
The continuous global urbanization with rapid and dynamic transitioning in traffic situations among highly populated cities results in difficulty for data collection and communication. Data collection for millions of vehicles hinders by various problems, i.e., higher cost of energy, time, space, and storage resources. Moreover, higher data traffic results in higher delays, larger throughput, excessive bottlenecks, and frequent repetition of data. To better facilitate the aforementioned challenges and to provide a solution, we have proposed a lightweight Machine Learning based data collection protocol named ML-TDG to effectively deal with higher data volumes in a real-time traffic environment capable of bringing the least burden on the network while utilizing less space, time, and energy. ML-TDG is functional based on Apache Spark, an effective data processing engine that indexes the data based on two logs, i.e., old commuters or frequent/daily commuters and second new/occasional commuters. The proposed protocol’s main idea is to utilize real-time traffic, distinguish the indexes in parallel based on two assigned logs criteria to train the network, and collect data with the least sources. For energy and time optimization, dynamic segmentation switching is introduced which is an intelligent road segments division and switching for reducing bottlenecks and replication. ML-TDG is tested and verified on Dublin, Ireland’s busiest motorway M50. ML-TDG performs the data collection, data sorting, and network training to decide the next execution altogether for better optimization every time. The experimental results verify that our proposed protocol is attaining higher performance with lower resource requirements along with rich and time-efficient sustainable data collection clusters in comparison with baseline protocols.
... The data collected by smart objects is used to analyze their users' activities and interests. Aside from the devices mobile phone and Internet users, the volume of data collected by these gadgets is growing (Aloini et al., 2007;Mohandu and Kubendiran, 2021). This data can be used to extract useful information from the various systems and equipment that are used (Mikalef et al., 2020). ...
Article
Full-text available
Big data analysis allows analysts, researchers, and business users to make better and faster decisions using data that was previously inaccessible or used. Companies can use advanced analytics techniques such as text analysis, machine learning, predictive analytics, data mining, statistics, and natural language processing to gain new insights and insights from previously untapped data sources independently or with existing enterprise data. Significant challenges are still in how to deal with this data and maximize their use, so large data have become the clock that has become the most national, regional, and international institutions, and a broad range of database management, methodologies and measures that can be adopted for the exploitation of large data in all areas of life. This paper investigates the effects of big data analytics on project risk management with examples from Healthcare sector in UAE. Inclusive research has been done by searching approximately more than 20 references resulted in a literature review studied the effect of implementing data analytics in business, technology, industry, and society businesses aspects. A research methodology has been done by interviewing professionals from healthcare field investigating further the role of Data analytics in analyzing and managing data in healthcare, its benefits in predicting risks and improving healthcare outcomes and future insights. This research’s result reveals that collecting data and applying data analytics even in businesses or healthcare represent an important kind of digital transformation. An obvious finding from this research is that business intelligence and data analytics has been implemented widely in UAE healthcare sector by both government and privet sectors resulting in opportunities that develop and emphasize positive changes to this sector.
Article
Full-text available
Efficient traffic management has become a major concern within the framework of smart city projects. However, the increasing complexity of data exchanges and the growing importance of big data makes this task more challenging. Vehicular ad hoc networks (VANETs) face various challenges, including the management of massive data generated by different entities in their environment. In this context, a proposal is put forth for a real-time anomaly detection system with parallel data processing, thereby speeding up data processing. This approach accurately computes vehicle density for each section at any given time, enabling precise traffic management and the provision of information to vehicles regarding traffic density and the safest route to their destination. Furthermore, a machine learning-based prediction system has been developed to mitigate congestion problems and reduce accident risks. Simulations demonstrate that the proposed solution effectively addresses transportation issues while maintaining low latency and high precision.
Article
Full-text available
The advent of technology has facilitated communication among individuals, structures, and transport, thereby revolutionizing contemporary living. In this regard, the Internet of Things has emerged as a transformative approach. Smart vehicles and the Internet of Vehicles (IoV) play a vital role in the development of smart cities and intelligent mobility systems, relying heavily on connectivity and data exchange. The storage, processing, and collection of data generated by smart cars are presently achievable through cloud computing. This study aims to utilize weight identification and ranking algorithms to explore, evaluate, and prioritize the essential transportation attributes for enhancing intelligent transportation systems. This study uses the VIKOR approach to identify top-ranking options for Intelligent Transportation Systems, focusing on their effectiveness, safety, efficiency, and sustainability in transportation. The VIKOR technique is useful in urban planning and IoV settings, providing a balanced decision-making framework. Future recommendations include refining criteria, integrating AI, fostering collaboration, ensuring security, promoting public awareness, and making transportation reliable, convenient and easy for use.
Chapter
This research model proposed to empirically examine the impact of big data security on digital operations with the mediating role of supply chain risk in the UAE transportation and shipment industry. This is the first study linking perspectives on big data security and digital operations with a mediating effect of supply chain risk. This research does not claim to be comprehensive; instead, it analyses empirical research and literature to further the conversation by using a conceptual framework for examining the UAE transportation and shipment industry. An online survey was administered to the 226 employees of the UAE’s Dubai transportation and shipment companies. Exploratory research proposed data analysis performed by regression ANOVA using SPSS. Supply chain risk as a mediation link in the research originated as positively significant, whereas there was a direct relationship between big data security and digital operations. To support long- and short-term strategic decision-making, it is necessary to examine risk management procedures as supply chain and transportation networks develop in a dynamic environment. Organizational risk exposure must be rigorously analyzed using objective, transparent criteria. The costs and advantages of different risk mitigation strategies must be considered with digital criteria.
Article
A thorough understanding of carbon lock-in is an essential precondition for the effective design and continuous improvement of climate policy. Based on a balanced panel dataset of 30 provinces in China during the period 2002–2021, we explore the nexus between smart transportation and carbon lock-in using the System-Generalized Method of Moments (SYS-GMM) model. We also investigate the heterogeneous, asymmetric, and threshold effects among the above two issues, and examine three internal impact mechanisms. We thus arrive at the following four main conclusions: (1) Smart transportation significantly reduces carbon lock-in, highlighting its importance in eradicating carbon lock-in. (2) Smart transportation has the most pronounced impact on carbon lock-in in the central region, and can effectively mitigate all aspects of carbon lock-in, especially industry lock-in and institution lock-in. (3) Smart transportation is more effective in alleviating carbon lock-in in provinces with a higher level of carbon lock-in. Moreover, a threshold of environmental regulation exists between smart transportation and carbon lock-in, with stricter environmental regulation leading to a stronger carbon lock-in reduction effect of smart transportation. (4) Smart transportation indirectly influences carbon lock-in through three channels of economic scale, industrial structure upgrading, and technological innovation. Based on these findings, we propose some policy recommendations for smart transportation development and carbon lock-in mitigation.
Article
Full-text available
Cooperative Intelligent Transport System (C-ITS) is a key enabler of future road traffic management systems. The core component of C-ITS includes vehicles, road side units and traffic command centers. They generate a large amount of traffic that is made up of both mobility and service-related data. To extract useful and relevant information out of this data, data analytics will play a crucial role in future C-ITS applications. We present a review of how data analytics can benefit C-ITS applications. We describe the various types of data generated by C-ITS applications and potential dissemination techniques using various wireless technologies. We demonstrate how meaningful results from this data could be beneficial to C-ITS. We also demonstrate the improved reliability of C-ITS applications that can be achieved with data analytics using simulation results. Finally, we discuss future possible applications of data analytics in C-ITS.
Article
Full-text available
In recent times, a massive amount of smart devices or objects are connected that enhances the scale of the digital world. These smart objects are referred as “things” or physical devices that have the potential to sense the real-world physical objects, collect the data, and network with others. The objects are connected through the internet, which crafts the terminology of Internet of Things (IoT). IoT has been developed and become the center of consideration due to the novelty of embedded device and a rapid enhancement in its number. This increase is resulting in the creative applications of smart environments. Smart transportation is a central stake for the quality of life of citizens in smart environment. Smart transportation involves the use of devices and sensors in the control system of vehicle; for example navigation system of cars, traffic signal management system, number recognition system and speed monitoring system. In this research article, we propose architecture for smart transportation system using Big Data analytics, in order to achieve real time processing and facilitate a friendly communication in the environment of IoT based smart transportation. The proposed architecture is a 3-phase scheme which is responsible for the organization and management of Big Data, real-time processing of Big Data and service management. The proposed architecture is a generic solution for the smart transportation planning using real time Big Data processing. The proposed scheme is realized using Spark over single node Hadoop setup with various input libraries. A huge amount of data from different authentic and reliable sources is measured to validate the proposed architecture. In addition, the effectiveness of proposed scheme also highlighted with regard to throughput.
Article
Urbanization is imposing many challenges, and vehicular traffic management is one such challenge. It hampers smooth traffic flow, wastes time, and threats of road safety. Moreover, it also impacts the environment, economy, health, and other essential services. The root cause of traffic mismanagement is the dynamic nature of the traffic on roads and the incapability of legacy systems to interpret such dynamics in real-time. The data analysis services based on Edge Cloud frameworks in technology outfitted urban spaces provide real-time robust and smart solutions to ever facing challenges from urbanization. Hence, Edge Cloud-based traffic management can be used to manage urban vehicular traffic in real-time. In this paper, Edge Cloud-centric IoT based smart traffic management system is developed for traffic inflow prediction and time optimized smart navigation of the vehicles. The traffic inflow prediction adapts the traffic movement phase time accordingly and avoids long waiting queues and congestions at intersections. The smart navigation enables the optimal distribution of traffic to possible paths and subsequently improves road safety at intersections. Baseline classifiers are used to predict the traffic inflow, and the statistical analyses acknowledge the prediction efficiency of the J48 decision tree as compared to other utilized classifiers. Edge Computing is used for time optimized smart navigation of vehicles and, subsequently, optimal traffic load balancing in real-time. The road safety perspective of vehicles at intersections is also get benefited from the optimal traffic load balancing. The results depict the efficiency of the proposed system for smart navigation, optimal traffic load balancing, and improved road safety at intersections.
Chapter
Achieving smart and intelligent transportation requires the use of millions of devices which generate a huge volume of data, termed as Big Data. With the flourishing of Big Data analytics, intelligent transportation management and control is now becoming more data driven. Big data can provide ample information obtained from vehicles, traffic infrastructure, smart phones and weather stations. Such data has promising applications in intelligent transportation systems, especially in the road safety sector. In particular, utilizing Big Data analytics assesses accident prevention and detection, thereby reducing causalities, loses and damage. This survey paper explores the role of big data in shaping the intelligent transportation systems with a focus on the road safety sector. In addition, the limitations of existing studies are discussed and future research directions are suggested.
Article
Big Data is an emerging paradigm and has currently become a strong attractor of global interest, specially within the transportation industry. The combination of disruptive technologies and new concepts such as the Smart City upgrades the transport data life cycle. In this context, Big Data is considered as a new pledge for the transportation industry to effectively manage all data this sector required for providing safer, cleaner and more efficient transport means, as well as for users to personalize their transport experience. However, Big Data comes along with its own set of technological challenges, stemming from the multiple and heterogeneous transportation/mobility application scenarios. In this survey we analyze the latest research efforts revolving on Big Data for the transportation and mobility industry, its applications, baselines scenarios, fields and use case such as routing, planning, infrastructure monitoring, network design, among others. This analysis will be done strictly from the Big Data perspective, focusing on those contributions gravitating on techniques, tools and methods for modeling, processing, analyzing and visualizing transport and mobility Big Data. From the literature review a set of trends and challenges is extracted so as to provide researchers with an insightful outlook on the field of transport and mobility.
Article
A smart city improves the quality of its citizens by providing access to ubiquitous services. Intelligent Transportation Systems (ITS) have a fundamental role in transforming a metropolitan area into a smart city. In the past two decades, many applications of ITS, e.g. city-wide traffic management and monitoring, smart parking, public transportation information services (bus, train, taxi, plane, etc.), logistics, real-time traffic, road speed limit monitoring and management etc., are deployed in smart cities. The sensors or mobile objects in ITS constantly generate mobility data and the scale at which this data is generated is witnessing an exponential increase in volumes. To store and subsequently analyze such massive data generated by sensors, new architectures are needed which are primarily designed for working with big data. In this work, we propose a big data analytics architecture for ITS. The proposed architecture has a built-in storage and analysis capability to work with ITS data and is composed of four modules, namely (1) Big Data Acquisition and Preprocessing Unit (2) Big Data Processing Unit (3) Big Data Analytics Unit and (4) Data Visualization Unit. A detailed analysis of ITS big data for monitoring the average speed of a vehicle at w.r.t. the time attribute is provided. The proposed architecture is evaluated using Hadoop thus validating the proof of concept. The empirical results are encouraging and open directions for future research.
Article
With the rapid development of the Internet of Things (IoT), Big Data technologies have emerged as a critical data analytics tool to bring the knowledge within IoT infrastructures to better meet the purpose of the IoT systems and support critical decision making. Although the topic of Big Data analytics itself is extensively researched, the disparity between IoT domains (such as healthcare, energy, transportation and others) has isolated the evolution of Big Data approaches in each IoT domain. Thus, the mutual understanding across IoT domains can possibly advance the evolution of Big Data research in IoT. In this work, we therefore conduct a survey on Big Data technologies in different IoT domains to facilitate and stimulate knowledge sharing across the IoT domains. Based on our review, this paper discusses the similarities and differences among Big Data technologies used in different IoT domains, suggests how certain Big Data technology used in one IoT domain can be re-used in another IoT domain, and develops a conceptual framework to outline the critical Big Data technologies across all the reviewed IoT domains.
Article
Big Data has significant impact in developing functional smart cities and supporting modern societies. In this paper, we investigate the importance of Big Data in modern life and economy, and discuss challenges arising from Big Data utilization. Different computational intelligence techniques have been considered as tools for Big Data analytics. We also explore the powerful combination of Big Data and Computational Intelligence (CI) and identify a number of areas, where novel applications in real world smart city problems can be developed by utilizing these powerful tools and techniques. We present a case study for intelligent transportation in the context of a smart city, and a novel data modelling methodology based on a biologically inspired universal generative modelling approach called Hierarchical Spatial-Temporal State Machine (HSTSM). We further discuss various implications of policy, protection, valuation and commercialization related to Big Data, its applications and deployment.
Article
Big data is becoming a research focus in intelligent transportation systems (ITS), which can be seen in many projects around the world. Intelligent transportation systems will produce a large amount of data. The produced big data will have profound impacts on the design and application of intelligent transportation systems, which makes ITS safer, more efficient, and profitable. Studying big data analytics in ITS is a flourishing field. This paper first reviews the history and characteristics of big data and intelligent transportation systems. The framework of conducting big data analytics in ITS is discussed next, where the data source and collection methods, data analytics methods and platforms, and big data analytics application categories are summarized. Several case studies of big data analytics applications in intelligent transportation systems, including road traffic accidents analysis, road traffic flow prediction, public transportation service plan, personal travel route plan, rail transportation management and control, and assets maintenance are introduced. Finally, this paper discusses some open challenges of using big data analytics in ITS.
Article
The Internet of Things (IoT) envisions a world-wide, interconnected network of smart physical entities. These physical entities generate a large amount of data in operation and as the IoT gains momentum in terms of deployment, the combined scale of those data seems destined to continue to grow. Increasingly, applications for the IoT involve analytics. Data analytics is the process of deriving knowledge from data, generating value like actionable insights from them. This article reviews work in the IoT and big data analytics from the perspective of their utility in creating efficient, effective and innovative applications and services for a wide spectrum of domains. We review the broad vision for the IoT as it is shaped in various communities, examine the application of data analytics across IoT domains, provide a categorisation of analytic approaches and propose a layered taxonomy from IoT data to analytics. This taxonomy provides us with insights on the appropriateness of analytical techniques, which in turn shapes a survey of enabling technology and infrastructure for IoT analytics. Finally, we look at some tradeoffs for analytics in the IoT that can shape future research.