PreprintPDF Available

Risk-Aware Resource Allocation for URLLC: Challenges and Strategies with Machine Learning

December 2018

December 2018

Authors:

Amin Azari

KTH Royal Institute of Technology

Mustafa Ozger

KTH Royal Institute of Technology

Cicek Cavdar

KTH Royal Institute of Technology

Preprints and early-stage research may not have been peer reviewed yet.

Supporting ultra-reliable low-latency communications (URLLC) is a major challenge of 5G wireless networks. Stringent delay and reliability requirements need to be satisfied for both scheduled and non-scheduled URLLC traffic to enable a diverse set of 5G applications. Although physical and media access control layer solutions have been investigated to satisfy only scheduled URLLC traffic, there is a lack of study on enabling transmission of non-scheduled URLLC traffic, especially in coexistence with the scheduled URLLC traffic. Machine learning (ML) is an important enabler for such a co-existence scenario due to its ability to exploit spatial/temporal correlation in user behaviors and use of radio resources. Hence, in this paper, we first study the coexistence design challenges, especially the radio resource management (RRM) problem and propose a distributed risk-aware ML solution for RRM. The proposed solution benefits from hybrid orthogonal/non-orthogonal radio resource slicing, and proactively regulates the spectrum needed for satisfying delay/reliability requirement of each URLLC traffic type. A case study is introduced to investigate the potential of the proposed RRM in serving coexisting URLLC traffic types. The results further provide insights on the benefits of leveraging intelligent RRM, e.g. a 75% increase in data rate with respect to the conservative design approach for the scheduled traffic is achieved, while the 99.99% reliability of both scheduled and nonscheduled traffic types is satisfied.

Content uploaded by Amin Azari

Content may be subject to copyright.

Risk-Aware Resource Allocation for URLLC:

Challenges and Strategies with Machine Learning

Amin Azari, Mustafa Ozger, and Cicek Cavdar

Abstract—Supporting ultra-reliable low-latency communica-

tions (URLLC) is a major challenge of 5G wireless networks.

Stringent delay and reliability requirements need to be satisﬁed

for both scheduled and non-scheduled URLLC trafﬁc to enable

a diverse set of 5G applications. Although physical and media

access control layer solutions have been investigated to satisfy

only scheduled URLLC trafﬁc, there is a lack of study on

enabling transmission of non-scheduled URLLC trafﬁc, especially

in coexistence with the scheduled URLLC trafﬁc. Machine

learning (ML) is an important enabler for such a co-existence

scenario due to its ability to exploit spatial/temporal correlation

in user behaviors and use of radio resources. Hence, in this

paper, we ﬁrst study the coexistence design challenges, especially

the radio resource management (RRM) problem and propose

a distributed risk- aware ML solution for RRM. The proposed

solution beneﬁts from hybrid orthogonal/non-orthogonal radio

resource slicing, and proactively regulates the spectrum needed

for satisfying delay/reliability requirement of each URLLC trafﬁc

type. A case study is introduced to investigate the potential of

the proposed RRM in serving coexisting URLLC trafﬁc types.

The results further provide insights on the beneﬁts of leveraging

intelligent RRM, e.g. a 75% increase in data rate with respect

to the conservative design approach for the scheduled trafﬁc is

achieved, while the 99.99% reliability of both scheduled and non-

scheduled trafﬁc types is satisﬁed.

Index Terms—5G, non-scheduled trafﬁc, IoT, machine learn-

ing, proactive resource provisioning, URLLC.

I. INTRODUCTION

THE ﬁfth generation of wireless networks (5G) has tar-

geted a diverse set of wireless services, from enhanced

mobile broadband (eMBB) to the Internet of Things (IoT) [1].

The latter itself could be categorized into two distinct service

types, i.e. massive machine type communications (mMTC),

which aims at connecting everything that beneﬁts from be-

ing connected, and ultra-reliable low-latency communications

(URLLC), which requires successful data transmission within

a strictly bounded short time interval [2]. URRLC is consid-

ered as an essential prerequisite of a new wave of services

including drone-based delivery, smart factory, remote control,

and intelligent transportation systems (ITS) [3].

Different URLLC applications may have different setup

delay tolerances, where the setup delay is deﬁned as the time

from generation of the ﬁrst URLLC packet at the devices

until when the packet is transmitted. Serving URLLC trafﬁc

with very-low setup delay tolerance is extremely challenging

because this type of trafﬁc cannot wait to be scheduled, e.g.

an accident report by a vehicle. Let us denote this category

The authors are with the School of Electrical Engineering and Computer

Science, KTH - The Royal Institute of Technology, Stockholm, Sweden (e-

mail: {aazari, ozger, cavdar}@kth.se).

of trafﬁc as non-scheduled URLLC trafﬁc. This could be (i)

from connected/disconnected devices which have critical data,

usually of short payload size; (ii) from devices which ask

access reservation for subsequent scheduled transmissions of

critical data; or (iii) critical device-to-device communications.

In the uplink direction, enabling URLLC mandates guaran-

teeing delay and reliability requirements for both scheduled

URLLC transmissions, e.g. continuous control of a drone,

as well as the non-scheduled URLLC transmissions, e.g.

immediate accident reports. The coexistence of scheduled and

non-scheduled trafﬁc could happen in many scenarios like

vehicular networks, and monitoring and alarm systems [3, 4].

Then, the set of resources allocated to URLLC should be

managed to be used for both scheduled and non-scheduled

URLLC trafﬁc. Isolation among services via allocating orthog-

onal slices of resources is a common practice when dealing

with ergodic objectives like throughput. However, in URLLC

applications, due to the limitations of time-diversity and cru-

cial need to large spectrum bands for providing frequency

diversity, isolation through orthogonal slicing is challenging

[3, 5]. Recently, serving coexisting eMBB/URLLC services

over limited radio resources has attracted attentions, and non-

orthogonal RRM for serving URLLC has been proposed

for spectrum efﬁciency [1, 6]. While the orthogonal RRM is

itself a complex problem, the non-orthogonal RRM, which

requires specifying shared/dedicated resources and scheduling

rules over shared ones, is a much more complex problem.

Furthermore, to comply with URLLC delay requirements, this

problem is needed to be solved in very short time scales.

Among candidate enablers for solving such complex, dy-

namic and time-limited RRM problem, artiﬁcial intelligence

(AI) is promising [7]. In the light of recent advances in

computing and storage technologies, AI is making the leap

from traditional pattern recognition use cases to governing

complex systems through advanced machine learning (ML)

approaches. While in previous decades, ML, as a major branch

of AI, has experienced several up and down periods, this time

its technology readiness level is so high that it has already

penetrated to the design of many complex systems [7]. This

motivates us to investigate usefulness of leveraging ML in

RRM for serving the URLLC trafﬁc. While the traditional

ML consists in a single node which makes decisions in a cen-

tralized manner, the delay/reliability requirements of URLLC

call for novel, scalable and distributed learning approaches.

In this paper, our main focus is on serving urgent

short packet URLLC transmissions along with the sched-

uled URLLC transmissions. The results derived within this

work could be easily extended to the coexistence of URLLC

arXiv:1901.04292v1 [cs.NI] 22 Dec 2018

Power

Frequency

20$120$·5DGLRXVHUPRELOLW\PDSV·,QWHUIHUHQFHDQGFKDQQHOTXDOLW\PHDVXUHPHQWVFrequency

120$20$hƌŐĞŶƚƚƌĂĨĨŝĐ

6WDWH6WDWHRIWKHQHWZRUN^

$FWLRQVFKHGXOLQJRYHUVKDUHGUHVRXUFHVFrequency

0/SRZHUHG+0$U: Urgent WUDIILFS : Scheduled WUDIILFUU

Power

5DGLR(QYLURQPHQW,QWHOOLJHQW553556DQG558([DPSOHWRRO4OHDUQLQJIRU556Traditional

550120$20$3URSRVHG5500/SRZHUHG+0$

^ĐŚĞĚƵůĞĚƚƌĂĨĨŝĐ

Power

Fig. 1: A graphical illustration of the proposed intelligent

RRM solution. The network status, including measurements

in the radio environment, is fed to the intelligent RRM

modules, which enable more efﬁcient use of radio resources

for scheduled and non-scheduled URLLC trafﬁc types.

and non-URLLC trafﬁc by modifying the delay/reliability

constraints of the scheduled trafﬁc. The main contributions

of this work include: (i) introduce coexistence of sched-

uled and non-scheduled URLLC trafﬁc as a challenge to be

tackled for realization of URLLC; (ii) introduce a hybrid

orthogonal/non-orthogonal multiple access (OMA/NOMA),

called HMA scheme, to serve coexisting URLLC trafﬁc;

(iii) investigate a distributed learning-powered RRM solution,

including intelligent resource provisioning (RRP), scheduling

(RRS), and utilizing (RRU) modules, for spectrum efﬁcient yet

reliable coexistence management; and (iv) investigate different

sources of uncertainty in proactive RRM, design of a risk-

aware ML-powered RRM for compensating such uncertainties,

and ﬁguring out the amount of radio resources which are

sacriﬁced because of such uncertainties. Fig. 1 presents a

graphical illustration of the proposed RRM solution and how

ML can be mapped to RRM problem and HMA to satisfy

URLLC requirements for both scheduled and non-scheduled

trafﬁc. We have also present open challenges and potential

solutions in RRM for URLLC coexistence.

II. ENABLING URLLC: THE RO LE O F EFFIC IE NT RA DI O

RESOURCE MANAGEMENT

In URLLC applications, we usually deal with infrequent

bursty trafﬁc, where the arrival time of the burst could

not be predicted [1, 6]. Due to stringent delay constraints,

once a URLLC packet is generated, the packet must be

transmitted immediately without any delay [3, 6]. This abrupt

transmission usually having a short payload size, could be an

alarm message, or the ﬁrst packet transmission (FPT) which

will be followed by upcoming scheduled transmissions. Apart

from the non-scheduled URLLC trafﬁc, we also need serving

scheduled URLLC trafﬁc which is not necessary of short

payload size [3, 10]. The resource allocation to this trafﬁc

type should guarantee no packet drop at the device due to

the expiration of data [10].

Fig. 2(a) represents a graphical illustration of the scheduled

and non-scheduled URLLC trafﬁc coexistence in the ITS use-

case. In this use case, intelligent vehicles communicate with

each other (V2V) as well as with the communication infras-

tructure (V2I) for trafﬁc efﬁciency and safety. One observes

that non-scheduled trafﬁc transmissions include (i) FPT of

devices which transmit critical data and reserve access for

further messages [3] (refer to FPT-NS in Fig. 2(b)); (ii) critical

message from a device to the access network [6] (refer to

V2I-NS in Fig. 2(b)); and (iii) critical V2V communications

[4] (refer to V2V-NS in Fig. 2(b)). One further observes that

non-scheduled trafﬁc transmissions are infrequent and could

have temporal correlation with the scheduled transmissions

(the FPT-NS in Fig. 2(b)).

Based on the above discussions, enabling URLLC requires

tackling three major issues: (i) enabling ultra-reliability in

communications, which on the other hand requires diversity;

(ii) enabling low-latency communications, which limits time

diversity and enforces frequency diversity; and (iii) coexis-

tence management of scheduled and non-scheduled trafﬁc,

which is due to the fact that frequency resources could not

be scaled with the amount of URLLC trafﬁc. The coexistence

management calls for an efﬁcient resource management strat-

egy, without which, scalability of communications systems in

serving URLLC service is challenging. While in prior studies,

the ﬁrst two issues have been investigated, there is lack of

research on the coexistence management within the URLLC

services [6, 10]. Towards serving a coexistence of scheduled

and non- scheduled URLLC trafﬁc, not only the queuing delay

violation for scheduled trafﬁc should be minimized, but also

the delay and packet error probability for the ﬁrst transmission

(non-scheduled trafﬁc) should be reduced dramatically. Es-

pecially, serving non-scheduled trafﬁc requires revolutionary

multiplexing schemes breaking the barrier of interference-free

orthogonal transmissions [1, 6]. The need for prompt access

may occur in any time instant, e.g. while resources have been

reserved for the scheduled-users, hence, serving these two

trafﬁc types together poses coexistence challenges.

A. The Hybrid Multiple Access Solution

The straightforward way of co-serving scheduled and non-

scheduled trafﬁc is to allocate them orthogonal sets of radio

resources, i.e. OMA. The OMA can be very suboptimal in

spectral efﬁciency because the set of allocated resources to

non-scheduled trafﬁc could be unused most of the time due

to the bursty nature of such safety-critical messages [1]. On

the other hand, insufﬁcient resource allocation to this trafﬁc

type results in reliability degradation. In order to compensate

spectral inefﬁciency of OMA, NOMA multiplexes different

trafﬁc types over a set of shared resources to overcome

spectral inefﬁciency of OMA. However, keeping isolation

among URLLC services with NOMA is challenging due to

potential temporal overloading in one service.

BS!

(a) V2X communications within the ITS use case

signaling

scheduled

transmission

V2V-NS

V2I-NS

FPT-NS

signaling

scheduled

transmission

V2V-NS

downlink data /control/ACK

(b) Communications exchanges

Fig. 2: An illustrative example of system model in the ITS use case, including scheduled (green dashed arrows) and

non-scheduled (red dotted arrows) URLLC communications.

Here, we consider a hybrid of OMA/NOMA, namely

HMA, in which, frequency resources, are divided into three

groups: (i) resources allocated to serving the non-scheduled

URLLC trafﬁc; (ii) resources allocated to serving the sched-

uled URLLC trafﬁc; and (iii) resources shared among them.

The HMA could be reduced to OMA and NOMA by tuning

the amount of shared resources. Over the shared resources,

we need to limit the aggregated received interference from the

scheduled users. Furthermore, in order to prevent performance

degradation for the scheduled trafﬁc over the shared resources,

we need to assure reliability of scheduled data transmission

even in presence of non-scheduled URLLC trafﬁc. Towards

this end, proactive or reactive strategies could be used [15].

The former aims at making the transmission robust, e.g. by

planning for occurring erasure in the channel (erasure coding),

while the latter consists in cancellation of received interference

from the non-scheduled trafﬁc. Regarding the stringent delay

requirement of URLLC, the reactive strategy increases the

delay violation probability for the scheduled trafﬁc. Thus, risk-

aware proactive RRM for minimizing reliability degradation is

required.

B. RRM for the Hybrid Multiple Access

To make the HMA solution working, the radio access

network (RAN) control center needs to control the trafﬁc load

dynamically, and adjust proactively the size of resource pools

associated to each trafﬁc stream. Furthermore, coordination

among the BSs is needed to make sure the aggregated intra-

and inter-cell interference will not cause an outage. Then,

the RRM must be done in a multi-cell level to compensate

the inter-cell interference impact on the outage probability.

While a centralized solution seems promising for satisfying the

reliability constraint even in cell-edges; the required signaling,

including data gathering, forwarding to the center, and decision

feedback, can potentially violate the URLLC delay constraints

[7]. Even in a single cell scenario, due to the potential

inter-dependence of the scheduled and non-scheduled trafﬁc,

keeping the outage risk for both trafﬁc types lower than the

URLLC requirement is not an easy task. All in all, regard-

ing the high-mobility of users, their time/location-dependent

distributions within the service area, and the bursty nature of

the service requests, the RRM for assuring a minimum service

requirement over a wide service area is highly challenging [5,

7, 10].

Action:

RRP: (i) inter-service/cell

resource allocation, (ii) BSs’

activity management.

Learning:

(i) risk-aware online learning for RRP update,

(ii) inter-BS transfer learning for RRS,

(iii) federated learning of mobility pattern, etc.

Action:

RRU: (i) intelligent resource

utilization for data transmission

and reception.

Action:

RRS: (i) inter and intra service

resource scheduling.

Learning:

(i) risk-aware online learning for RRS update,

(ii) regression-based user/radio map update.

Learning:

(i) risk-aware online learning for RRU update.

RAN

control

center

Intelligent RRP at the RAN control center

Intelligent RRS at the BSs

Intelligent RRU at the end-devices

Fig. 3: Distributed ML for radio resource management. The

proposed RRM includes (i) RRP at the RAN control center,

(ii) RRS at the BSs, and (iii) RRU at the end-devices.

III. ML IN SU PP ORT O F EFFIC IE NT RRM

By broadening the set of services and resources in 5G,

control and troubleshooting of radio access network (RAN) by

human resources become too complicated and costly [7]. Fur-

thermore, the stringent delay/reliability requirements of newly

established URLLC services make them more challenging.

Moreover, many decisions taken in serving users’ requests in

legacy networks have temporal/spatial correlations, i.e. they

are repeated in different locations and time instances. Then,

it is wise to leverage the huge volumes of available data

in cellular networks, and extract patterns of request arrivals,

channel variations of users along mobility patterns, and outage

statistics [7–9]. The derived patterns could be beneﬁcial in

reliable resource provisioning for future service requests. This

brings the idea of leveraging ML tools in design and operation

control of future wireless networks. ML aims at estimating a

function, e.g. a rule or pattern, from a set of noisy data which

has been generated by the true function. The recent advances

in centralized and distributed computing, storage, and learning

algorithms have improved the position of ML as a design and

optimization tool signiﬁcantly [7].

The Proposed ML-Powered RRM

To manage network resources for serving scheduled and

non-scheduled URLLC trafﬁc, one needs to identify the set of

dedicated resources to non-scheduled trafﬁc, the set of shared

resources, and the scheduling policy over the shared and

dedicated resources. Once the set of resources and scheduling

rules are determined, the scheduler allocates resources to users

based on their rankings. The rank of each user depends on its

delay budget and the expected level of received interference

from the user over the shared resources. We propose to

break the intelligent RRM problem into 3 sub-problems: radio

resource provisioning (RRP), radio resource scheduling (RRS),

and radio resource utilizing (RRU).

Fig. 3 represents a graphical representation of the proposed

hierarchical learning solution. In the proposed solution, a

proactive resource provisioning based on training and pre-

diction is done at the RAN control center. RRP at the RAN

control center beneﬁts from (i) transfer learning, i.e. making

decision for a service area based on decisions taken in neigh-

boring areas or previous time instances [11]; (ii) federated

learning, i.e. leveraging the huge volumes of available records

of requests/responses at the edges for extracting common rules

and policy sets [7]; and (iii) risk-aware learning from past

RRP decisions for outage minimization of coexisting services

and trafﬁc types. Distributing the learning task between BSs

and the RAN control center aims at compensating partial

observability of BSs in cellular networks. As BSs have access

to local information, i.e. they observe only a limited part of

the service area, and URLLC users could be highly mobile,

resource provisioning at the BS-level could be less effective.

For example, a BS cannot predict the arriving of a vehicle pla-

toon which is currently in the neighboring service area. Then,

distributed learning can further enable transfer knowledge of

arrival of such demanding users, and sharing of respective

effective resource scheduling policies. As an example, such

learning may result in the knowledge for the RAN control

center to activate some dormant BSs to balance the scheduled

and non-scheduled trafﬁc serving responsibilities among BSs,

and hence, to minimize the probability of outage.

Furthermore, in the proposed intelligent RRM solution, BSs

decide how to control access of the scheduled users over

the shared and dedicated resources to minimize the queuing

delay violation for them, while complying with the maximum

allowed interference constraint over the shared resources. This

distributed control allows BSs to react to instant changes in the

channel quality of connected users, and schedule them based

on their remaining delay budgets. Finally, the intelligent usage

of radio resources for reliable data transmission/reception is

done at the device side. Devices can decide how to allocate

their power resources to the set of shared and dedicated

frequency resources to ensure the highest possible reliability.

This is especially the case in non-scheduled URLLC trans-

missions, either to the BS or to another device/vehicle, where

devices can beneﬁt from reinforcement learning for improving

reliability of their transmissions [12].

The proposed RRM requires a set of ML solutions at the

RAN control center, BS, and device-side for addressing the

challenges posed in serving the URLLC trafﬁc, as described

in Section I. In the following section, we describe some of

these solutions, as well as the open problems. These solutions

have been also depicted in the learning boxes of Fig. 3.

IV. ML S OLUTIONS EMBEDDED IN THE PROPOSED RRM

A. Federated Learning of User and Radio Maps for RRP

The RRP for URLLC trafﬁc at the RAN control center

requires knowledge about the outage risk of potential users.

When we consider no a priori information about the users,

the design is based on the worst case scenario, and hence,

huge volumes of frequency resources are required to satisfy a

level of QoS. Here, we investigate the combined use of radio

and user maps in achieving an estimate of the outage risk of

URLLC users.

1) Radio Map: They are expected to be an essential 5G

component for enabling agile yet efﬁcient resource allocation

for mobile use cases like autonomous driving [13]. For a given

service area, radio map includes information like radio signal

strength, delay spread, and coexisting interference level. Re-

garding the inevitable changes in the environment, radio maps

need to be updated once a time. In the proposed hierarchical

RRM solution, this is the role of the RAN control center to

coordinate continuous update of the radio map, and to manage

location-based inter-update time intervals. Furthermore, there

might be time periods during which, no update of signal

strength from a speciﬁc region is sent to the BSs. This could

be due to temporal absence of users in a location for an

extended period of time. In such cases, this is the role of

the involving BSs to predict and update the received signal

strength in that location, e.g. by leveraging temporal/spatial

regression as described in [13].

2) User Map: With the ever increasing number of con-

nected devices and the volume of online data, the locations

and mobility patterns of many vehicles, specially public trans-

portation systems, are known or could be learnt [14]. Further

than ofﬂine learning of mobility patterns, the RAN control

center can coordinate information exchange among BSs in a

service area to let them collaboratively construct and update

a common user map. Having a map of users and mobility

patterns, enables prediction of presence of users in different

regions of the service area for an extended period of time [14].

The map of users requires continuous update to be effective

in URLLC applications. In our proposed structure, this is the

role of the RAN control center to keep the consistency of the

global user map, and coordinate the inter-update time intervals

among the involving BSs.

Once the position/speed/direction information of a user is

known, and the radio map of the respective environment

is given, we are able to estimate path-loss and shadowing

components of its wireless communication channel. Then, the

uncertainty of the channel scales down to the small-scale

fading. Thus, we will be able to predict the received power

at the BS as a function of transmit power of users. Within

the context of URLLC, this type of information is useful

for early identiﬁcation of a user which is expected to enter

a crowded/low-coverage area. Thus, the RAN would be able

to conﬁgure the network -manage the radio/energy resources-

to lower the risk of communication outage for the respective

users. The radio maps will further enable user management

based on regional characteristics. This means that once devices

enter a speciﬁc region, they could be granted higher-level

support, e.g. dual connectivity, or extra resources for their

transmissions could be reserved, or their urgent transmissions

could be handed over to multiple neighboring BSs.

B. Risk-Aware Learning for RRS

Introduction of URLLC into cellular networks mandates a

transition from average-utility network design into risk-aware

network design, where a rare event may incur a huge loss [2].

Instead of legacy ML algorithms, which aim at maximizing

the average return of agents, or equivalently minimizing the

average regret of agents, we aim at minimizing the risk of huge

loss. The loss could be deﬁned in different ways as a function

of the return. When serving solely one trafﬁc type, the risk-

aware learning-powered scheduling enforces maximizing the

ﬁrst moment of the return, i.e. reliability, while minimizing

its higher moments, e.g. the variance. Minimizing higher

moments of the return, i.e. optimizing the worst case return,

could be achieved by deﬁning the utility function as a non-

linear function of the return, R. A well-known example is

exponential utility function in which, instead of maximizing

the expected value of R, the objective is to maximize the

expected value of exp(-R)[2]. In our proposed RRM solution,

scheduling of shared radio resources is done by leveraging

risk-aware learning at the BSs. In other words, the scheduling

task is performed such that the risk of outage for scheduled

and non-scheduled trafﬁc remains meets the URLLC’s require-

ments.

C. Open Challenges and Potential Solutions

The main challenge in utilizing intelligent RRM consists

in large dimensionality and complexity. The heterogeneity

of services, QoS requirements, and diverse set of available

resources make the RRM a very complex problem. This com-

plexity has a signiﬁcant impact on the convergence time of the

learning algorithms. Furthermore, the high mobility of users

demanding URLLC service, especially in the ITS use case,

and the partial observability of the network by each BS, put

further constraints to the RRM problem. In order to overcome

the shortcomings of ML in timely ﬁnding optimal decisions,

it is important to compare it against the way human beings

learn. Human beings beneﬁt from planning and reasoning for

reducing the search space and explore the solution in most

probable areas. In contrast, ML approaches treat the search

space unbiased, and in the exploration phase, select all actions

randomly to get an estimate of their respective payoffs. One

way to reduce the search space, and speed up the learning

process, is to add smart planning to the learning algorithms.

Towards this end, one may beneﬁt from the human-resources,

and integrate them in the learning loop, to reduce the search

space, and prevent BSs in taking risky/time-consuming actions

during the exploration process. Another major challenge in

utilizing intelligent RRM consists in exchanging information

among different network entities, e.g. from a BS to the RAN

control center, which is energy and radio resource consuming.

Towards addressing this problem, designing communication-

efﬁcient learning algorithms, e.g. federated learning in which

only the learnt models are communicated [7], are of crucial

importance.

V. ML-POWE RE D RR M: A C AS E STU DY

In this section, we demonstrate how to take advantage of

intelligent RRM for spectrum-efﬁcient yet reliable serving

of scheduled and non-scheduled URLLC trafﬁc. First, we

investigate performance of intelligent RRP and RRS (Fig.

4(a)). Consider a single-cell scenario in which URLLC users

generate and transmit non-scheduled URLLC trafﬁc (infre-

quent) as well as scheduled URLLC trafﬁc. The minimum

and maximum experienced path-loss in the service area are

denoted by -70 and -120 dB, respectively. Other simulation

parameters could be found in Table I. Based on the outage

risk of the most critical device with potential non-scheduled

URLLC trafﬁc, the required outage probability for the non-

scheduled trafﬁc, and the learnt risk of outage in the service

area, the ML-powered RRP and RRS modules are imple-

mented and used for resource management, as described in

Section IV and Table I. Fig. 4(a) represents the level of

reliable data-rate achieved for the scheduled trafﬁc, versus

the required outage probability for the non-scheduled trafﬁc.

The dashed-curve represents a conservative RRM solution in

which, resource allocation is done based on the risk to the

cell-edge user. For easier interpretation of the results, the

decoupled gains of RRM with OMA, NOMA, and HMA

have been presented. OMA represents orthogonal allocation of

RRBs to scheduled/non-scheduled trafﬁc based on the online

outage risk-level, in contrast with the conservative design

(the dashed curve). NOMA represents the case in which, all

RRBs could be reused by the scheduled trafﬁc, and hence,

there is no dedicated RRBs for the non-scheduled trafﬁc.

HMA represents the advanced form of NOMA in which, once

reliability requirement of the non-scheduled trafﬁc increases,

it dedicates some resources to the non-scheduled trafﬁc. To

quantify the achieved gain by intelligent RRM, let us focus

on the 99.99% reliability requirement for both scheduled and

non-scheduled trafﬁc types. One observes that 75% increase

in reliable data rate for the scheduled trafﬁc is achieved by

TABLE I: Parameters of the case study.

Parameters Values

Service area Circular area, radius 3 Km, BS at the center

Trafﬁc Scheduled: saturated; Non-scheduled: exp(0.01)

Radio resources 5 radio resource blocks (RBBs), each 180 KHz

Transmit power Scheduled: 21 dBm, Non-scheduled: 23 dBm

Intelligent RRM Q-learning: learning rate=0.1, exploration

rate=0.85, Q(S,A): value of state-action

combinations

Q-learning for

RRP and RRS

S: state of the network including reliability require-

ments and position of users, A: scheduling of a user

over shared resources

Q-learning for

RRU

S: state of the network including interference level

on RRBs, A: power allocation over RRBs

1e-1 1e-2 1e-3 1e-4 1e-5 1e-6 1e-7

Target outage probaility for urgent traffic

0.5

1.5

2.5

0.9999-reliable datarate (b/s), sched. traffic

106

No-learning (conservative design)

Intelligent RRP and RRS (OMA)

Intelligent RRP and RRS (NOMA)

Intelligent RRP and RRS (HMA)

(a) Intelligent RRP and RRS: reliable data rate for the scheduled

trafﬁc versus the outage probability for the non-scheduled trafﬁc

(single-shot transmissions). Increase in the required reliability

highlights the merits of intelligent RRM and HMA.

0 0.2 0.4 0.6 0.8 1

α: fraction of allocated power to RRB1

10-7

10-6

10-5

10-4

10-3

10-2

Outage probability for non-sched traffic

PA= [1,0,0]

PA= [α, 1-α ,0]

PA= [α, (1-α)/2, (1- α)/2]

PA= [α, 1-α, 0]

PA= [α, (1-α)/2, (1- α)/2]

PSD of Interference=

[0,-164,-164] dBm/Hz

PSD of Interference=

[0,-172,-172] dBm/Hz

(b) Intelligent RRU: outage probability for different power

allocation (PA) decisions. PA=[α, x, y]represents a decision in

which, αfraction of power is used for the dedicated RRB, and x

and yfraction is used for the other two shared RRBs.

Fig. 4: Performance evaluation of the proposed RRM.

intelligent RRM, while the reliability of both trafﬁc types is

satisﬁed. One further observes, the higher the outage risk,

the higher room for spectrum efﬁciency by intelligent RRM.

Also, we see that ultra-reliable serving of the non-scheduled

trafﬁc mandates dedicating some RRBs, and beyond the target

reliability level of 99%, the NOMA scheme fails in serving

the non-scheduled trafﬁc. Furthermore, the beneﬁts of HMA

increase by increase in the required reliability level. Finally,

we observe that uncertainty in the wireless channel, as well

as user demand, mandates reserving more radio resources for

guaranteeing reliability performance, which on the other hand

reduces the overall spectrum efﬁciency.

Now, let us examine the beneﬁt of intelligent radio resource

utilization. We assume a typical device with non-scheduled

URLLC trafﬁc experiences a distance-dependent path-loss of

-80 dB, and needs to combat small-scale fading as well as

path-loss for achieving reliable communications. Out of 5

RRBs, 3could be used for urgent transmission of this device,

where the power spectral density (PSD) of scheduled trafﬁc’s

interference over them is denoted by [0,-172,-172] dBm/Hz.

In other words, the ﬁrst one is dedicated and the other two

are shared with the scheduled trafﬁc. Fig. 4(b) represents

the outage probability versus the fraction of allocated power

to the dedicated RRB, i.e. α. In this ﬁgure, PA=[α, x, y]

represents the power allocation decision over the RRBs by

HMA, which reduces to OMA and NOMA when PA= [1,0,0]

and PA=[1/3,1/3,1/3], respectively. One sees that the best

action is to distribute power equally on all RRBs. This is

because the PSD of interference is comparable with the PSD

of noise, i.e. -174 dBm/Hz. On the other hand, one observe

that when the PSD of interference is [0,-164,-164] dBm/Hz,

i.e. interference is much stronger than the noise, the best action

is to use half of the power over the dedicated RRB, and divide

the other half among the two shared RRBs. In both cases, one

observes that an intelligent selection of α, could bring a huge

gain for HMA in comparison with the OMA and NOMA.

VI. CONCLUSION

Regarding the stringent reliability and delay requirements

of URLLC, there is still lack of communications solutions

enabling the URLLC service. One major challenge consists in

addressing URLLCs scalability and co-existence with other

URLLC/non-URLLC services. Here, we have investigated

coexistence of scheduled and non-scheduled URLLC services

and discussed challenges to be addressed for satisfying their

stringent requirements in co-existence scenarios. Then, we

have proposed a hybrid multiple access (HMA) solution for

addressing such issues in a spectral/energy efﬁcient way. In

the heart of our proposed solution, we have further leveraged

a distributed hierarchical ML approach for proactive RRM to

different URLLC trafﬁc streams. Finally, we have provided a

case study to demonstrate the potential of leveraging ML in

serving coexisting URLLC trafﬁc over limited radio resources.

The results indicated that even for target outage probability

of 10−7for non-scheduled URLLC trafﬁc, approximately

more than 6times data rate is achieved in comparison to

conservative design for scheduled URLLC trafﬁc with 99.99%

reliability thanks to our ML-powered HMA solution.

ACKNOWLEDGMENT

This work is supported in part by the Celtic Plus Project

SooGreen (Service Oriented Optimization of Green Mobile

Networks).

REFERENCES

[1] P. Popovski, et al., “5G Wireless Network Slicing for eMBB, URLLC,

and mMTC: A Communication-Theoretic View,” IEEE Access, vol. 6, pp.

55765-55779, Sept. 2018.

[2] M. Bennis, et al., “Ultra-reliable and Low-latency Wireless Communica-

tion: Tail, risk and scale,” Proceedings of the IEEE, vol. 106, no. 10, pp.

1834-1853, Oct. 2018.

[3] I. Parvez, et al., “A Survey on Low Latency towards 5G: RAN, core

network and caching solutions,” IEEE Communications Surveys and

Tutorials, vol. 10, no. 2, pp. 36-49, May. 2018.

[4] H. Ye and G. Li, “Deep Reinforcement Learning for Resource Allocation

in V2V Communications,” in Proc. of IEEE ICC, May. 2018.

[5] A. Kasgari, et al., “Stochastic Optimization and Control Famework for

5G Network Slicing with Effective Isolation,” in Proc. of IEEE CISS,

Mar. 2018.

[6] H. Ji, et al., “Ultra-Reliable and Low-Latency Communications in 5G

Downlink: Physical Layer Aspects,” IEEE Wireless Communications, vol.

25, no. 3, pp. 124-130, June 2018.

[7] F. Calabrese, et al., “Learning Radio Resource Management in RANs:

Framework, Opportunities, and Challenges,” IEEE Communications Mag-

azine, vol. 56, no. 9, pp. 138-145, Sept. 2018.

[8] R. Li, et al., “The Prediction Analysis of Cellular Radio Access Network

Trafﬁc: From entropy theory to networking practice,” IEEE Communica-

tions Magazine, vol. 52, no. 6, pp. 234-240, June 2014.

[9] C. Yuan, et al., “On the Emerging of Scaling Law, Fractality and Small-

World in Cellular Networks,” in Proc. of IEEE VTC-Spring, June 2017.

[10] C. She, C. Yang, and T. Quek, “Radio Resource Management for

Ultra-reliable and Low-latency Communications,” IEEE Communications

Magazine, vol. 55, no. 6, pp. 72-78, June. 2017.

[11] K. Weiss, et al., “A Survey of Transfer Learning,” Journal of Big Data,

vol. 3, no. 1, pp. 1-40, May 2016.

[12] A. Azari and C. Cavdar, “Self-organized Low-power IoT Networks: A

distributed learning approach,” in Proc. of IEEE GLOBECOM, Dec. 2018.

[13] V. Chowdappa, et al., “Distributed Radio Map Reconstruction for 5G

Automotive,” IEEE Intelligent Transportation Systems Magazine, vol.

10, no. 2, pp. 36-49, Apr. 2018.

[14] R. Taranto, et al., “Location-aware Communications for 5G Networks,”

IEEE Signal Processing Magazine, vol. 31, no. 6, pp. 102-112, Oct. 2014.

[15] R. Alieiev, et al., “Predictive Communication and its Application to

Vehicular Environments: Doppler-shift compensation,” IEEE Transactions

on Vehicular Technology, May. 2018.

Amin Azari received the B.Sc. and M.Sc. degrees in electrical and communi-

cation systems engineering from University of Tehran, Iran in 2011 and 2013,

respectively, and the Ph.D. degree in information and communication technol-

ogy from KTH Royal Institute of Technology, Sweden in 2018. Currently, he

is a postdoctoral researcher at Stockholm University. His research interests

include 5G radio access network design, Internet of Things and machine

learning.

Mustafa Ozger received his B.Sc. degree in electrical and electronics

engineering from Middle East Technical University, Ankara, Turkey, in 2011,

and his M.Sc. and Ph.D. degrees in electrical and electronics engineering from

Koc University, Istanbul, Turkey, in 2013 and 2017, respectively. Currently,

he is a postdoctoral researcher at KTH Royal Institute of Technology. His

research interests include wireless communications and the Internet of Things.

Cicek Cavdar is an assistant professor at the School of EECS at KTH in

Sweden. She has been leading the Intelligent Network Systems research group

under the Radio Systems Lab focusing on design and planning of intel-

ligent network architectures, direct air-to-ground communications, and IoT

connectivity platforms. She has been coordinating the EU EIT Digital project

Seamless DA2GC in Europe, which has resulted in successful technology

transfer cases to industry. She served as Symposium Chair for IEEE ICC

GCSN 2017.

ResearchGate has not been able to resolve any citations for this publication.

Self-Organized Low-Power IoT Networks: A Distributed Learning Approach

Conference Paper

Full-text available

Jul 2018

Enabling large-scale energy-efficient Internet-of-things (IoT) connectivity is an essential step towards realization of networked society. While legacy wide-area wireless systems are highly dependent on network-side coordination, the level of consumed energy in signaling, as well as the expected increase in the number of IoT devices, makes such centralized approaches infeasible in future. Here, we address this problem by self-coordination for IoT networks through learning from past communications. To this end, we first study low-complexity distributed learning approaches applicable in IoT communications. Then, we present a learning solution to adapt communication parameters of devices to the environment for maximizing energy efficiency and reliability in data transmissions. Furthermore, leveraging tools from stochastic geometry, we evaluate the performance of proposed distributed learning solution against the centralized coordination. Finally, we analyze the interplay amongst energy efficiency, reliability of communications against noise and interference over data channel, and reliability against adversarial interference over data and feedback channels. The simulation results indicate that compared to the state of the art approaches, both energy efficiency and reliability in IoT communications could be significantly improved using the proposed learning approach. These promising results, which are achieved using lightweight learning, make our solution favorable in many low-cost low-power IoT applications.

Stochastic optimization and control framework for 5G network slicing with effective isolation

Conference Paper

Full-text available

Mar 2018

5G Wireless Network Slicing for eMBB, URLLC, and mMTC: A Communication-Theoretic View

Article

Full-text available

Apr 2018

The grand objective of 5G wireless technology is to support services with vastly heterogeneous requirements. Network slicing, in which each service operates within an exclusive slice of allocated resources, is seen as a way to cope with this heterogeneity. However, the shared nature of the wireless channel allows non-orthogonal slicing, where services us overlapping slices of resources at the cost of interference. This paper investigates the performance of orthogonal and non-orthogonal slicing of radio resources for the provisioning of the three generic services of 5G: enhanced mobile broadband (eMBB), massive machine-type communications (mMTC), and ultra-reliable low-latency communications (URLLC). We consider uplink communications from a set of eMBB, mMTC and URLLC devices to a common base station. A communication-theoretic model is proposed that accounts for the heterogeneous requirements and characteristics of the three services. For non-orthogonal slicing, different decoding architectures are considered, such as puncturing and successive interference cancellation. The concept of reliability diversity is introduced here as a design principle that takes advantage of the vastly different reliability requirements across the services. This study reveals that non-orthogonal slicing can lead, in some regimes, to significant gains in terms of performance trade-offs among the three generic services compared to orthogonal slicing.

Ultra-Reliable and Low-Latency Wireless Communication: Tail, Risk and Scale

Article

Full-text available

Jan 2018

Ensuring ultra-reliable and low-latency communication (URLLC) for 5G wireless networks and beyond is of capital importance and is currently receiving tremendous attention in academia and industry. At its core, URLLC mandates a departure from expected utility-based network design approaches, in which relying on average quantities (e.g., average throughput, average delay and average response time) is no longer an option but a necessity. Instead, a principled and scalable framework which takes into account delay, reliability, packet size, network architecture, and topology (across access, edge, and core) and decision-making under uncertainty is sorely lacking. The overarching goal of this article is a first step to fill this void. Towards this vision, after providing definitions of latency and reliability, we closely examine various enablers of URLLC and their inherent tradeoffs. Subsequently, we focus our attention on a plethora of techniques and methodologies pertaining to the requirements of ultra-reliable and low-latency communication, as well as their applications through selected use cases. These results provide crisp insights for the design of low-latency and high-reliable wireless networks.

A Survey on Low Latency Towards 5G: RAN, Core Network and Caching Solutions

Article

Full-text available

Aug 2017

The fifth generation (5G) wireless network technology is to be standardized by 2020, where main goals are to improve capacity, reliability, and energy efficiency, while reducing latency and massively increasing connection density. An integral part of 5G is the capability to transmit touch perception type real-time communication empowered by applicable robotics and haptics equipment at the network edge. In this regard, we need drastic changes in network architecture including core and radio access network (RAN) for achieving end-to-end latency on the order of 1 ms. In this paper, we present a detailed survey on the emerging technologies to achieve low latency communications considering three different solution domains: RAN, core network, and caching. We also present a general overview of 5G cellular networks composed of software defined network (SDN), network function virtualization (NFV), caching, and mobile edge computing (MEC) capable of meeting latency and other 5G requirements.

Deep Reinforcement Learning for Resource Allocation in V2V Communications

Conference Paper

May 2018

Ultra-Reliable and Low-Latency Communications in 5G Downlink: Physical Layer Aspects

Article

Jun 2018

URLLC is a new service category in 5G to accommodate emerging services and applications having stringent latency and reliability requirements. In order to support URLLC, there should be both evolutionary and revolutionary changes in the air interface named 5G NR. In this article, we provide an up-to-date overview of URLLC with an emphasis on the physical layer challenges and solutions in 5G NR downlink. We highlight key requirements of URLLC and then elaborate the physical layer issues and enabling technologies including packet and frame structure, scheduling schemes, and reliability improvement techniques, which have been discussed in the 3GPP Release 15 standardization.

Predictive Communication and Its Application to Vehicular Environments: Doppler-Shift Compensation

Article

May 2018

We introduce predictive communication algorithms for channel estimation and equalization in vehicular environments. The proposed concept makes use of vehicular sensors combined with unique properties of surrounding environment to predict future changes in the communication scenario. This combination enables a new range of improvements in the direct-link vehicular communications. Among others it allows predicting the Doppler frequency shift for some strong channel components in highly dynamic scenarios and estimating the likelihood for the communicating vehicles to remain in the line-of-sight conditions. A predictive Doppler-shift compensation is presented as an illustrative example of one of potential benefits for future vehicular communications. The algorithm is applied to compensate the strong line-of-sight component as well as the line-of-sight with one strong reflection which is received at the same sampling interval. The obtained simulation results show that the proposed sensor-based Doppler-shift prediction and compensation approach outperforms a benchmark least-squares channel estimation with linear interpolation algorithm in various road scenarios (urban, rural, highway) and relative driving directions (same and oncoming) of communicating vehicles.

Distributed Radio Map Reconstruction for 5G Automotive

Article

Apr 2018

Radio maps are expected to be an essential tool for the resource optimization and management of 5G automotive. In this paper, we consider the problem of radio map reconstruction using a wireless sensor network formed by sensor nodes in vehicles, access nodes from a smart city infrastructure, etc. Due to limited resource constraints in sensor networks, it is crucial to select a small number of sensor measurements for field reconstruction. In this context, we present a novel distributed incremental clustering algorithm based on the regression Kriging method for radio map reconstruction in terms of average received power at locations where no sensor measurements are available. The path-loss and shadowing components of the wireless channel are separately estimated. For shadowing estimation, clusters of sensors are adaptively formed and their size is optimized in terms of the least number of sensors by minimizing the ordinary Kriging variance. The complexity of the proposed algorithm is analyzed and simulation results are presented to showcase the algorithm efficacy to field reconstruction.

On the Emerging of Scaling Law, Fractality and Small-World in Cellular Networks

Conference Paper

Jun 2017

Risk-Aware Resource Allocation for URLLC: Challenges and Strategies with Machine Learning

Abstract

Recommended publications

Resource Allocation Algorithm with Dynamic Subcarrier Assignment in OFDMA-based Wireless Networks

Analysis of Radio Resource Allocation in LTE Uplink

Risk-Aware Resource Allocation for URLLC: Challenges and Strategies with Machine Learning

PHY- and MAC-aware resource allocation and packet scheduling for single-cell OFDMA packet networks