
Reinforcement learning for licensed-assisted access of LTE in the unlicensed spectrum

June 2015


DOI:10.1109/WCNC.2015.7127653

Conference: 2015 IEEE Wireless Communications and Networking Conference (WCNC)

Authors:

Nadisanka Rupasinghe

Samsung



Reinforcement Learning for Licensed-Assisted Access of LTE in the Unlicensed Spectrum

Nadisanka Rupasinghe and İsmail Güvenç

Department of Electrical and Computer Engineering, Florida International University, Miami, FL 33174

Email: {rrupa001, iguvenc}@fiu.edu

Abstract—In order to coexist with the WiFi systems in the

unlicensed spectrum, Long Term Evolution (LTE) networks

can utilize periodically conﬁgured transmission gaps. In this

paper, considering a time division duplex (TDD)-LTE system,

we propose a Q-Learning based dynamic duty cycle selection

technique for conﬁguring LTE transmission gaps, so that a

satisfactory throughput is maintained both for LTE and WiFi

systems. By explicitly taking the impact of IEEE 802.11n beacon

transmission mechanism into account, we evaluate the coexistence

performance of WiFi and LTE using the proposed technique.

Simulation results show that the proposed approach can enhance

the overall capacity performance by 19% and WiFi capacity

performance by 77%, hence enabling effective coexistence of LTE

and WiFi systems in the unlicensed band.

Index Terms—Beacon, licensed-assisted access (LAA), Q-

Learning, reinforcement learning, TDD-LTE, WiFi 802.11n.

I. INTRODUCTION

The use of Long Term Evolution (LTE) technology in the unlicensed spectrum has recently been gaining significant attention as a way to enable higher throughput, cater to insatiable traffic demand, and allow a better quality of service for cellular users [1]–[4]. The unlicensed spectrum is traditionally occupied by wireless technologies such as WiFi, Bluetooth, and radar. Since WiFi provides a higher user throughput, network operators currently prefer to expand their capacity by offloading traffic to WiFi. Due to its coordinated deployment and operation, LTE has the potential to provide higher capacity and better coverage than WiFi for the same transmit power, while providing seamless connectivity [3]. Motivated by this potential, the 3GPP standardization group has recently initiated a study item on licensed-assisted access (LAA) using LTE in the unlicensed spectrum [1].

To enable the operation of LTE in the unlicensed band, coexistence with WiFi technology is critically important. Different coexistence mechanisms between WiFi and LAA have been studied in the literature. In [5], coexistence of LAA and WiFi as secondary users in TV white space is investigated, and two techniques are proposed to facilitate interference management: 1) spectrum sensing (Listen-Before-Talk (LBT)) by LAA, and 2) a coexistence gap during which LAA refrains from transmitting. LBT based approaches have also been considered in [6], [7] for LTE systems, to facilitate operation in the unlicensed spectrum. In [6], carrier aggregation for LTE from licensed and license-exempt bands is proposed. There, to access the license-exempt band, LTE systems use LBT along with a request-to-send (RTS) and clear-to-send (CTS) message exchange prior to starting the original LTE transmission. The LBT based approach proposed in [7] handles both inter-radio access technology (RAT) interference and intra-RAT interference. To handle inter-RAT interference, an energy detection based LBT approach is proposed, whereas to handle intra-RAT interference, LBT based on cross correlation detection is proposed. Exchanging spectrum allocation information between WiFi and LAA via a common database is considered in [8] for enabling simultaneous access to the unlicensed spectrum by LTE and WiFi.

This research was supported in part by the U.S. National Science Foundation under the grants CNS-1406968 and AST-1443999.

In [9], a blank subframe allocation technique for LAA is introduced to facilitate simultaneous WiFi and LTE operation in the unlicensed spectrum. During silent subframes, referred to as blank subframes, LAA refrains from transmitting and, as a result, WiFi gets more opportunities to access the channel. A similar approach is considered in [10], in which LAA allocates silent gaps with a predefined duty cycle to facilitate better coexistence with WiFi. An uplink (UL) power control based mechanism is evaluated for LAA systems in [11] to allow simultaneous operation of WiFi and LTE in the unlicensed spectrum. There, the LAA UL transmit power is reduced in a controlled manner based on interference measurements, generating more transmission opportunities for WiFi.

In this paper, we introduce a reinforcement learning based dynamic duty cycle selection technique for LAA to facilitate WiFi-LAA simultaneous operation in the unlicensed spectrum. In particular, we use Q-Learning to dynamically configure transmission gaps in LAA periodically, based on its learnings from the environment. First, using a 3GPP-compliant simulation setting, we evaluate the system performance under different duty cycles of the transmission gaps. Then, the performance of the Q-Learning based dynamic duty cycle selection technique is evaluated. The simulation results show that the Q-Learning based approach improves overall system capacity performance by 19% and WiFi capacity performance by 77% compared to the scenario with fixed duty cycles that yields the highest aggregate capacity.

The rest of the paper is organized as follows. In Section II, we provide details of the considered system model. Section III introduces the proposed Q-Learning based dynamic duty cycle selection approach for LTE transmission gaps. Simulation results with various parameter configurations are presented in Section IV. Finally, Section V provides concluding remarks.

2015 IEEE Wireless Communications and Networking Conference (WCNC): - Track 3: Mobile and Wireless Networks

Fig. 1: WiFi APs and LAA BSs operating simultaneously in the unlicensed spectrum.

II. SYSTEM MODEL

A. Deployment Scenario

In order to evaluate the coexistence challenges and related interference management methods for LAA operation of LTE, we consider the scenario shown in Fig. 1, where M LAA base stations (BSs) and WiFi access points (APs) are operating simultaneously in the unlicensed band. Each WiFi AP (LAA BS) serves N WiFi stations (STAs) (LAA user equipment (UEs)), which are uniformly randomly distributed within the cell coverage area. Time division duplex (TDD)-LTE is considered, and it is assumed that LAA BSs and LAA UEs are synchronized at all times.

As shown in Fig. 1, due to the simultaneous operation of WiFi and LAA in the unlicensed spectrum, a targeted WiFi STA experiences interference from LAA DL/UL transmissions and from other WiFi DL/UL transmissions. This degrades the signal to interference plus noise ratio (SINR) at the targeted WiFi STA. In the same way, for WiFi UL transmissions and LAA DL/UL transmissions, simultaneous WiFi and LAA operation increases interference and hence reduces SINR, which in turn degrades capacity performance. Moreover, due to the carrier sense multiple access with collision avoidance (CSMA/CA) mechanism in WiFi [12], WiFi transmissions get delayed when coexisting with LTE, further degrading WiFi capacity performance.

For both WiFi and LAA, we consider a non-full-buffer traffic model as given in 3GPP FTP traffic model-2 [13]. In order to evaluate the capacity of WiFi and LAA for different simulation scenarios, a physical (PHY) layer abstraction is used. In particular, Shannon capacity is calculated at the granularity of each WiFi OFDM symbol duration (4 µs) to obtain the number of successfully received bits [14]. In all the simulations, the wireless channel is modeled according to [13]. For both WiFi and LAA, the Indoor Hotspot (InH) scenario is considered when determining the path loss and shadowing parameters used in the simulations.
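The PHY abstraction described above can be illustrated with a minimal sketch. It assumes that the full 20 MHz channel bandwidth listed in the parameter tables is used for every symbol and that a per-symbol SINR trace is available; the function names are hypothetical and are not taken from the simulator of [14].

```python
import math

OFDM_SYMBOL_DURATION_S = 4e-6  # WiFi OFDM symbol duration stated in the text
BANDWIDTH_HZ = 20e6            # assumed: the 20 MHz bandwidth from the parameter tables


def bits_per_symbol(sinr_db: float) -> float:
    """Shannon-bound number of bits deliverable in one OFDM symbol duration."""
    sinr_linear = 10.0 ** (sinr_db / 10.0)
    return BANDWIDTH_HZ * math.log2(1.0 + sinr_linear) * OFDM_SYMBOL_DURATION_S


def received_bits(sinr_trace_db) -> float:
    """Accumulate the per-symbol Shannon bits over a trace of SINR values (dB)."""
    return sum(bits_per_symbol(s) for s in sinr_trace_db)


if __name__ == "__main__":
    # Hypothetical SINR trace for three consecutive OFDM symbols.
    print(received_bits([0.0, 10.0, 20.0]))
```

Evaluating capacity symbol by symbol lets the abstraction follow interference that changes within a frame, for example when an LAA subframe starts during a WiFi transmission.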

B. Beacon Transmission Model

Beacon transmissions in WiFi networks are utilized by the WiFi STAs to detect WiFi APs. Reception of the beacon frame is important since it contains information such as the beacon interval, the rates supported by the WiFi AP, and a time stamp that a WiFi STA uses to synchronize with the WiFi AP for transmission/reception of data to/from the WiFi AP. Fig. 2 shows the beacon physical protocol data unit (PPDU) considered in the paper. The beacon is a management medium access control (MAC) frame. In Fig. 2, the beacon payload represents that MAC frame. The beacon frame is always transmitted using BPSK modulation with a code rate of 1/2.

Fig. 2: Beacon PPDU.

Beacon transmission based WiFi STA/AP association is used in an infrastructure basic service set (BSS)¹ with passive scanning. In this mode, the WiFi AP periodically broadcasts beacon frames, and WiFi STAs can associate with that WiFi AP if they receive the beacon frames properly [12]. Also, as shown in Fig. 3, all the WiFi STAs that are associated with a WiFi AP wait for a beacon frame when the target beacon transmission time (TBTT), the scheduled time for beacon transmission, is reached. Before transmitting a beacon frame, the WiFi AP waits for a time duration specified by the point coordination function interframe space (PIFS) to ensure that the medium is free. Successful reception of the beacon frame is important because, without it, an STA cannot transmit or receive data. In this paper, we consider an infrastructure BSS with passive scanning for WiFi transmission as explained here.

Fig. 3: The beacon frame is expected to be transmitted at the end of the beacon interval. However, this is not always possible, as the WiFi AP has to wait for the completion of all ongoing WiFi transmissions.

Since a PHY layer abstraction is used in this paper to calculate the capacity of WiFi and LTE transmissions, we implement the following method to identify successful reception of a beacon PPDU (frame) at an STA. First, to determine whether an orthogonal frequency division multiplexing (OFDM) symbol carrying a portion of the beacon PPDU was received at an STA, the observed SINR of that OFDM symbol is compared with a threshold; if it is larger than the threshold, it is assumed that the information in that OFDM symbol was properly received by the STA². The same detection mechanism is used by the STA for all the OFDM symbols belonging to a particular beacon PPDU. At the end of the beacon PPDU transmission, the WiFi STA calculates the number of erroneously received beacon PPDU bits by summing up the bits in all the unsuccessfully received beacon OFDM symbols. Then, the ratio ($\rho$) of erroneously received bits to all the transmitted beacon bits ($N_B$) of the beacon PPDU is calculated as

$$\rho = \frac{N_{\mathrm{err}} \times N_{B}^{\mathrm{OFDM}}}{N_{B}}, \tag{1}$$

where $N_{\mathrm{err}}$ and $N_{B}^{\mathrm{OFDM}}$ are the number of erroneously received beacon OFDM symbols and the number of bits in a beacon OFDM symbol, respectively. The ratio in (1) is then compared with a predefined threshold for the acceptable bit error ratio of a beacon PPDU to determine whether the beacon PPDU is successfully received at the WiFi STA.

¹ A BSS is formed in IEEE 802.11 systems when an association is created by STAs which are located within a certain coverage area.

² Each OFDM symbol carries a fixed number of symbols at all times, since the modulation scheme and code rate are fixed for a beacon PPDU.
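The beacon reception test built around (1) can be sketched as follows. This is an illustration under stated assumptions rather than the simulator's code: the function name and arguments are hypothetical, 24 bits per beacon OFDM symbol is assumed (the figure for BPSK with rate 1/2 over 48 data subcarriers), and the beacon is assumed to be accepted when $\rho$ does not exceed the threshold.

```python
def beacon_received(symbol_sinrs_db, bits_per_symbol, sinr_threshold_db, max_error_ratio):
    """Return (accepted, rho) for one beacon PPDU.

    symbol_sinrs_db   : observed SINR (dB) of each OFDM symbol of the beacon
    bits_per_symbol   : N_B^OFDM, bits carried by one beacon OFDM symbol
    sinr_threshold_db : a symbol counts as received only if its SINR is larger
    max_error_ratio   : acceptable ratio of erroneous bits (assumed inclusive)
    """
    if not symbol_sinrs_db:
        raise ValueError("a beacon PPDU needs at least one OFDM symbol")
    # N_err: symbols whose SINR is not larger than the detection threshold.
    n_err = sum(1 for s in symbol_sinrs_db if s <= sinr_threshold_db)
    # N_B: all transmitted beacon bits.
    n_b = len(symbol_sinrs_db) * bits_per_symbol
    rho = (n_err * bits_per_symbol) / n_b  # equation (1)
    return rho <= max_error_ratio, rho


if __name__ == "__main__":
    # Hypothetical 4-symbol beacon; one symbol falls below a 10 dB threshold.
    print(beacon_received([12.0, 11.0, 9.0, 15.0], 24, 10.0, 0.15))
```

Because every beacon symbol carries the same number of bits, $\rho$ reduces to the fraction of symbols that fail the SINR test, which is why a single per-symbol threshold is enough to drive the decision.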

C. Duty cycle implementation for LAA

To implement duty cycle based LAA transmission, we consider a TDD configuration as shown in Fig. 4. The rationale behind selecting this type of TDD configuration is to keep the UL to DL subframe ratio constant irrespective of the selected duty cycle. In particular, a sequence of four subframes is always assumed to consist of two DL subframes, one UL subframe, and one guard subframe.

Fig. 4: A 20 ms duty cycle period is considered with four duty cycles (20%, 40%, 60%, 80%). Here x% represents the percentage of time during which the LTE network is transmitting.

Four different duty cycles are considered with a transmission gap duty cycle period of 20 ms. As shown in Fig. 4, LTE transmits for x percent of the allocated duty cycle period. For example, with a 60% duty cycle, LTE transmits for 12 ms out of the 20 ms duty cycle period. When moving between adjacent duty cycles (e.g., from 20% to 40%), the LTE transmission duration is increased or decreased with a granularity of 4 ms. As the subframe pattern repeats every 4 ms, changing between duty cycles adds or removes block(s) of the considered subframe pattern while keeping the DL to UL subframe ratio constant.
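The mapping from a duty cycle to a subframe schedule can be sketched as below. Two points are assumptions made only for illustration: the order of subframes inside the 4 ms block, and the placement of the transmitting blocks at the start of the period (the text fixes only the composition of the block and the 20 ms period).

```python
# Assumed order inside the 4 ms block; the text states only its composition
# (two DL subframes, one UL subframe, one guard subframe).
BLOCK = ("DL", "DL", "GUARD", "UL")
PERIOD_MS = 20  # duty cycle period; one subframe lasts 1 ms


def subframe_schedule(duty_cycle_percent: int) -> list:
    """Return the 20 per-millisecond subframe labels of one duty cycle period."""
    if duty_cycle_percent not in (20, 40, 60, 80):
        raise ValueError("only the four duty cycles considered in the text are supported")
    on_ms = PERIOD_MS * duty_cycle_percent // 100  # 4, 8, 12 or 16 ms
    n_blocks = on_ms // len(BLOCK)                 # whole 4 ms blocks
    # Assumed: transmitting blocks come first, followed by the silent gap.
    return list(BLOCK) * n_blocks + ["OFF"] * (PERIOD_MS - on_ms)


if __name__ == "__main__":
    print(subframe_schedule(60))
```

Since every duty cycle is built from whole blocks, the DL to UL subframe ratio stays at 2:1 for all four settings, which is the property the configuration is designed to preserve.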

III. Q-LEARNING BASED DYNAMIC DUTY CYCLE SELECTION FOR LAA

In this section, we present a Q-Learning based dynamic duty cycle selection algorithm for LAA transmission. Dynamic duty cycle selection is important since network traffic is bursty in realistic systems. Hence, the proposed approach can help in enhancing LTE operation in the unlicensed spectrum while providing more opportunities for WiFi transmission. As proposed in [15], we consider a Q-Learning algorithm with an $\epsilon$-greedy policy. A predefined target capacity value ($C_{\mathrm{tar}}$) is set for the LAA DL, and LAA BSs autonomously aim to operate at a capacity close to $C_{\mathrm{tar}}$ by dynamically adjusting their duty cycles.

When formulating the proposed Q-Learning algorithm, we consider the set of LAA BSs ($\mathcal{B}$) as the players/agents of the multi-agent system. Each player $i \in \mathcal{B}$ has a set of actions $\mathcal{A}_i = \{a_{i,1}, a_{i,2}, \ldots, a_{i,|\mathcal{A}_i|}\}$ and a set of states $\mathcal{S}_i = \{s_{i,1}, s_{i,2}, \ldots, s_{i,|\mathcal{S}_i|}\}$, where $a_{i,j}$ and $s_{i,k}$ represent a possible action and a state of player $i$, respectively. In Q-Learning, each player $i \in \mathcal{B}$ keeps a Q-table with Q-values $Q_i(s_{i,j}, a_{i,k})$ for each pair of state $s_{i,j} \in \mathcal{S}_i$, $1 \le j \le |\mathcal{S}_i|$, and action $a_{i,k} \in \mathcal{A}_i$, $1 \le k \le |\mathcal{A}_i|$. This Q-value provides an estimate of future costs if player $i$ selects action $a_{i,k}$ when it is in state $s_{i,j}$.

A player $i$ in a particular state $s_{i,j}$ selects and deploys an action $a_{i,k}$. Then, based on the feedback from the environment, the player learns about the outcome of the deployed action $a_{i,k}$ in state $s_{i,j}$. This feedback is given as a cost value $c_i$, $i \in \mathcal{B}$, defined as the absolute difference between the LAA DL capacity $C_{\mathrm{LAA},i}$ achieved during the previous duty cycle period and the target capacity $C_{\mathrm{tar}}$. Using $C_{\mathrm{LAA},i}$, the new state of player $i$, $s_{i,l} \in \mathcal{S}_i$, $1 \le l \le |\mathcal{S}_i|$, is also identified. Then, using the identified next state $s_{i,l}$ and the calculated cost value $c_i$, the Q-value of the current state ($s_{i,j}$) and action ($a_{i,k}$) pair is updated as follows:

$$Q_i(s_{i,j}, a_{i,k}) \leftarrow (1-\alpha)\, Q_i(s_{i,j}, a_{i,k}) + \alpha \left[ c_i + \gamma \min_{a_{i,m}} Q_i(s_{i,l}, a_{i,m}) \right], \tag{2}$$

where $\alpha$ and $\gamma$ are the learning rate and the discount factor, respectively. As can be seen from (2), the new Q-value of the current state/action pair depends on the current Q-value of that state/action pair ($Q_i(s_{i,j}, a_{i,k})$), the calculated cost ($c_i$), and the minimum Q-value of the identified next state, $\min_{a_{i,m}} Q_i(s_{i,l}, a_{i,m})$. In this way, learning is achieved in the proposed algorithm.
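As a worked illustration of the update in (2), consider $\alpha = 0.5$ and $\gamma = 0.9$ (the values used later in the simulations) together with purely hypothetical numbers: a current Q-value of 4, an observed cost of 5, and a minimum Q-value of 2 in the next state.

```latex
\begin{align*}
Q_i(s_{i,j}, a_{i,k})
  &\leftarrow (1 - 0.5)\cdot 4 + 0.5\,\bigl[\,5 + 0.9 \cdot 2\,\bigr] \\
  &= 2 + 0.5 \cdot 6.8 \\
  &= 5.4 .
\end{align*}
```

The Q-value moves halfway from its old value (4) toward the new cost estimate (6.8), which is the role the learning rate plays in the update.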

The learning rate $\alpha$ ($0 \le \alpha \le 1$) determines how quickly learning can occur. If $\alpha$ is too small, the learning process takes a long time to complete, while if it is too high, the algorithm might not converge. The discount factor $\gamma$ ($0 \le \gamma \le 1$) controls the value placed on future costs. If $\gamma$ is too small, learning depends little on future costs and immediate costs are optimized. On the other hand, if it is too high, learning counts heavily on future costs. Through a careful selection of these two parameters, it is possible to effectively control the learning process of the proposed Q-Learning approach.

Once the Q-value of the current state ($s_{i,j}$) and action ($a_{i,k}$) pair is updated, an action $a_{i,m} \in \mathcal{A}_i$, $1 \le m \le |\mathcal{A}_i|$, is selected for the next state $s_{i,l}$. A random number $r \sim U(0,1)$ is generated first and compared against the $\epsilon$-greedy parameter, which is usually a very small value ($0.01 \le \epsilon \le 0.05$). If $r$ is smaller than the $\epsilon$-greedy parameter, an action is selected randomly. Otherwise, the action with the minimum Q-value in the identified next state ($s_{i,l}$), $a_{i,m} = \arg\min_{a_{i,m}} Q_i(s_{i,l}, a_{i,m})$, is selected. The $\epsilon$-greedy parameter allows selecting an action in an exploratory way, and ensures that all state/action pairs will be explored as the number of trials goes to infinity. The proposed Q-Learning algorithm is summarized in Algorithm 1.

Algorithm 1 Q-Learning for duty cycle selection of LAA BS $i \in \mathcal{B}$

Initialize:
    for each $s_{i,j} \in \mathcal{S}_i$, $1 \le j \le |\mathcal{S}_i|$, $a_{i,k} \in \mathcal{A}_i$, $1 \le k \le |\mathcal{A}_i|$ do
        Initialize the Q-value representation mechanism $Q_i(s_{i,j}, a_{i,k})$
    end for
    Evaluate the starting state $s = s_{i,j} \in \mathcal{S}_i$, $1 \le j \le |\mathcal{S}_i|$
Learning:
    loop
        Generate a random number $r \sim U(0,1)$
        if ($r < \epsilon$) then
            Select action randomly
        else
            Select the action $a_{i,m} \in \mathcal{A}_i$ characterized by the min(Q-value)
        end if
        Execute $a_{i,m}$
        Receive an immediate capacity $C_{\mathrm{LAA},i}$ and cost $c_i$
        Observe the next state $s_{i,l} \in \mathcal{S}_i$, $1 \le l \le |\mathcal{S}_i|$
        Update the Q-table entry as follows:
            $Q_i(s_{i,j}, a_{i,k}) \leftarrow (1-\alpha)\, Q_i(s_{i,j}, a_{i,k}) + \alpha \left[ c_i + \gamma \min_{a_{i,m}} Q_i(s_{i,l}, a_{i,m}) \right]$
        $s = s_{i,l}$
    end loop
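Algorithm 1 can be sketched for a single LAA BS as a small tabular agent. The class and method names are hypothetical, the all-zero Q-table initialization is an assumption (the algorithm does not fix the initial values), and the default parameters follow the values reported later for the simulations.

```python
import random


class DutyCycleAgent:
    """Tabular, cost-minimizing Q-Learning agent with an epsilon-greedy policy."""

    def __init__(self, n_states, actions, alpha=0.5, gamma=0.9, epsilon=0.03, rng=None):
        self.actions = list(actions)  # e.g. duty cycles in percent
        self.alpha = alpha            # learning rate
        self.gamma = gamma            # discount factor
        self.epsilon = epsilon        # exploration probability
        self.rng = rng if rng is not None else random.Random()
        # Assumed initialization: every Q-value starts at zero.
        self.q = [[0.0] * len(self.actions) for _ in range(n_states)]

    def select_action(self, state):
        """Return the index of a random action with probability epsilon,
        otherwise the index of the action with the minimum Q-value."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.actions))
        row = self.q[state]
        return row.index(min(row))

    def update(self, state, action, cost, next_state):
        """Apply the update in (2) to the (state, action) entry."""
        target = cost + self.gamma * min(self.q[next_state])
        self.q[state][action] = (1 - self.alpha) * self.q[state][action] + self.alpha * target


if __name__ == "__main__":
    agent = DutyCycleAgent(n_states=6, actions=[20, 40, 60, 80])
    a = agent.select_action(0)
    # Hypothetical feedback: cost 5.0 observed, next state is 2.
    agent.update(0, a, 5.0, 2)
    print(agent.actions[a], agent.q[0])
```

Because the feedback is a cost rather than a reward, both the greedy choice and the bootstrap term take a minimum over actions, where the more common reward-based formulation takes a maximum.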

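Without any loss of generality, the action, state, and cost in the proposed algorithm are defined as follows.

• Action: $\mathcal{A}_i = \{20\%, 40\%, 60\%, 80\%\}$.

• State:

$$s_{i,j} = \begin{cases}
0, & C_{\mathrm{LAA},i} < 1 \text{ Mbps} \\
1, & 1 \text{ Mbps} \le C_{\mathrm{LAA},i} < 10 \text{ Mbps} \\
2, & 10 \text{ Mbps} \le C_{\mathrm{LAA},i} < 20 \text{ Mbps} \\
3, & 20 \text{ Mbps} \le C_{\mathrm{LAA},i} < 30 \text{ Mbps} \\
4, & 30 \text{ Mbps} \le C_{\mathrm{LAA},i} < 40 \text{ Mbps} \\
5, & C_{\mathrm{LAA},i} \ge 40 \text{ Mbps}
\end{cases} \tag{3}$$

• Cost:

$$c_i = |C_{\mathrm{tar}} - C_{\mathrm{LAA},i}|, \tag{4}$$

where $C_{\mathrm{LAA},i}$ is given by

$$C_{\mathrm{LAA},i} = \frac{N_{\mathrm{Bits},i}^{\mathrm{DC}}}{T_{\mathrm{Tx},i}^{\mathrm{DC}} + T_{\mathrm{Wait},i}^{\mathrm{DC}}}. \tag{5}$$

In (5), for LAA BS $i \in \mathcal{B}$, $N_{\mathrm{Bits},i}^{\mathrm{DC}}$ represents the number of bits successfully transmitted during the previous duty cycle period. $T_{\mathrm{Tx},i}^{\mathrm{DC}}$ and $T_{\mathrm{Wait},i}^{\mathrm{DC}}$ are the total transmitting time and the waiting time due to silent subframe allocation³, respectively, during the previous duty cycle period.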
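The definitions in (3) to (5) translate directly into three small helper functions. The sketch below uses hypothetical function names and takes the default target of 30 Mbps from the value used in the simulations.

```python
import bisect

# Lower edges (in Mbps) of states 1..5 in equation (3).
STATE_EDGES_MBPS = (1.0, 10.0, 20.0, 30.0, 40.0)


def laa_capacity_mbps(n_bits: float, t_tx_s: float, t_wait_s: float) -> float:
    """Equation (5): bits delivered over transmit plus waiting time, in Mbps."""
    total_s = t_tx_s + t_wait_s
    if total_s <= 0:
        raise ValueError("transmit plus waiting time must be positive")
    return n_bits / total_s / 1e6


def state_index(capacity_mbps: float) -> int:
    """Equation (3): quantize the achieved capacity into one of six states."""
    # bisect_right counts the edges that are <= capacity, which matches the
    # half-open intervals [edge_k, edge_{k+1}) of equation (3).
    return bisect.bisect_right(STATE_EDGES_MBPS, capacity_mbps)


def cost(capacity_mbps: float, target_mbps: float = 30.0) -> float:
    """Equation (4): absolute distance from the target capacity."""
    return abs(target_mbps - capacity_mbps)


if __name__ == "__main__":
    # Hypothetical period: 300 kbit delivered, 12 ms transmitting, 8 ms waiting.
    c = laa_capacity_mbps(300_000, 0.012, 0.008)
    print(c, state_index(c), cost(c))
```

Including the waiting time in the denominator of (5) means that a BS with queued data pays for its own silent subframes, so the cost reflects the capacity actually seen by the traffic and not just the rate while transmitting.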

IV. SIMULATION RESULTS

In simulations, we consider a two-layer cell layout as shown in Fig. 5. Each layer consists of M = 7 cells. There are N = 10 WiFi STAs (LAA UEs) associated with each WiFi AP (LAA BS). WiFi STAs (LAA UEs) move within the cell with a speed of 3 km/h. WiFi and LAA traffic arrival rates of $\lambda_{\mathrm{WiFi}} = \lambda_{\mathrm{LAA}} = 2.5$ are considered in all the simulations. LTE and WiFi 802.11n MAC and PHY layers are implemented as described in [14]. Round robin user scheduling is considered in LAA DL transmission, and only one user is scheduled during each transmission time interval (TTI). The LAA UEs report the observed DL SINR value during a DL transmission to the LAA BS, which then uses it to determine the number of resource blocks (RBs) to be allocated for the next DL transmission. In the UL, the bandwidth is divided equally among the LAA UEs requesting UL transmission during a subframe. All the configuration parameters used for LAA in simulations are given in Table I.

³ LAA BS $i \in \mathcal{B}$ has data to schedule in DL. However, due to silent subframe allocation, it has to wait.

Fig. 5: WiFi APs and LAA BSs in a two-layer cell layout. [Plot of node positions; axes: distance (m); legend: WiFi AP, WiFi STA, LAA BS, LAA UE.]

TABLE I: LTE PHY/MAC parameters.

Parameter                            Value
Transmission scheme                  OFDM
Bandwidth                            20 MHz
DL Tx power                          23 dBm
UL Tx power                          Path loss based TPC
Frame duration                       10 ms
Scheduling                           Round robin
P0                                   -106 dBm
Path loss compensation factor (α)    1
Transmission time interval           1 ms
Traffic model                        FTP traffic model-2 [13]

For WiFi, CSMA/CA is implemented with enhanced distributed channel access (EDCA) and clear channel assessment (CCA) [14]. WiFi beacon transmission is implemented, as discussed in Section II-B, for realistic performance evaluations. All the STAs (including the WiFi AP) having data in their respective queues can compete for channel access when no transmission is ongoing in the cell. The WiFi STA (or the WiFi AP) that senses the channel to be idle and has the shortest back-off time gains access to the channel, provided that it has received the most recent beacon successfully. If the beacon has not been received successfully, the WiFi STA cannot initiate any transmission or reception. All the configuration parameters used for WiFi in simulations are summarized in Table II. In all performance evaluations, we focus on the performance of the center cell in both the WiFi and LAA cell layouts.
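The beacon-gated channel access rule described above can be sketched as a single contention round. This is a simplified illustration and not the EDCA implementation of [14]: the function name and node representation are hypothetical, the back-off is drawn from U(0, 31) slots of 9 µs as listed in Table II, and ties are broken arbitrarily instead of being modeled as collisions.

```python
import random

CW_MAX_SLOTS = 31  # contention window U(0, 31) from Table II
SLOT_TIME_US = 9   # slot time from Table II


def pick_transmitter(nodes, rng):
    """Return (name, backoff_us) of the node that wins the idle channel, or None.

    nodes: list of dicts with keys 'name', 'has_data' and 'beacon_ok'.
    A node may contend only if it has queued data and has successfully
    received the most recent beacon (an AP is passed with beacon_ok=True).
    """
    eligible = [n for n in nodes if n["has_data"] and n["beacon_ok"]]
    if not eligible:
        return None
    # Each eligible node draws a uniform back-off; the shortest one wins.
    # Simplification: equal draws are resolved by name, not as a collision.
    draws = [(rng.randint(0, CW_MAX_SLOTS), n["name"]) for n in eligible]
    slots, name = min(draws)
    return name, slots * SLOT_TIME_US


if __name__ == "__main__":
    cell = [
        {"name": "AP", "has_data": True, "beacon_ok": True},
        {"name": "STA1", "has_data": True, "beacon_ok": False},  # missed beacon
        {"name": "STA2", "has_data": True, "beacon_ok": True},
    ]
    print(pick_transmitter(cell, random.Random()))
```

A node that missed the beacon is removed from contention before the back-off draw, so missed beacons lower the number of simultaneous contenders as well as delaying the affected STA.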

A. Performance analysis with WiFi beacon transmission

We evaluate WiFi and LAA performance with WiFi beacon transmission considering TDD configuration 2. Fig. 6 shows the WiFi and LAA DL aggregate capacity with and without beacon transmission. There is an improvement in LAA DL capacity and a degradation in WiFi capacity when beacon transmission exists. The reason for this is that, when an STA misses a beacon, it cannot transmit or receive until a beacon is received successfully. Therefore, when WiFi beacon transmission exists, the number of simultaneous WiFi data transmissions is reduced. As a result, WiFi interference on the LAA DL is reduced and the LAA DL capacity improves. Moreover, missing a beacon at a WiFi STA further delays WiFi transmission. This increases the WiFi waiting time and hence reduces WiFi capacity.

TABLE II: WiFi PHY/MAC parameters.

Parameter                                  Value
Transmission scheme                        OFDM
Bandwidth                                  20 MHz
DL/UL Tx power                             23 dBm
Access category                            Best Effort
MAC protocol                               EDCA
Slot time                                  9 µs
CCA carrier sensing threshold              -82 dBm
CCA energy detection threshold             -62 dBm
No. of service bits in PPDU                16 bits
No. of tail bits in PPDU                   12 bits
Contention window size                     U(0,31)
Noise figure                               6 [12]
Beacon interval                            100 ms
Beacon OFDM symbol detection threshold     10 dB
Beacon error ratio threshold               15
Traffic model                              FTP traffic model-2 [13]

Fig. 6: WiFi and LAA DL capacity variation with/without beacon transmission. [Two bar charts: average WiFi capacity and average LAA DL capacity (bits/s), with beacon and without beacon; $\lambda_{\mathrm{LAA}} = \lambda_{\mathrm{WiFi}} = 2.5$.]

Fig. 7 shows the SINR distributions of the WiFi and LAA DL with and without WiFi beacon transmission. The LAA DL SINR improves with WiFi beacon transmission, since the WiFi interference on LAA is reduced due to the smaller number of simultaneous WiFi transmissions. The WiFi SINR distribution with and without WiFi beacon transmission is also shown in Fig. 7, where an improvement in WiFi SINR can be seen with beacon transmission. This is due to the lower WiFi interference with the reduced number of simultaneous WiFi transmissions. Note that the SINR is captured during WiFi transmission, and this improvement does not help much in improving WiFi capacity, as the waiting time for WiFi increases with missed beacons. That is why we see a capacity reduction for WiFi in Fig. 6 when WiFi beacon transmission exists.

Fig. 7: WiFi/LAA DL SINR distributions with/without WiFi beacon transmission. [CDF versus SINR (dB); curves: LAA SINR with beacon, LAA SINR without beacon, WiFi SINR with beacon, WiFi SINR without beacon; $\lambda_{\mathrm{WiFi}} = \lambda_{\mathrm{LAA}} = 2.5$.]

B. Performance analysis with different LAA duty cycles

In this section, we evaluate WiFi and LAA performance under four different duty cycles considering the TDD configuration presented in Section II-C. Fig. 8 shows the WiFi and LAA capacity variation under different LAA duty cycles. While the WiFi capacity decreases with larger LAA duty cycles, the LAA capacity increases. This is because, with larger LAA duty cycles, LAA interference on WiFi increases and, as a result, WiFi capacity decreases. On the other hand, LAA capacity increases with higher duty cycles due to more transmission opportunities. Note that the rate of WiFi capacity degradation decreases as the LAA duty cycle increases. The reason for this observation is that, with higher duty cycles, the number of simultaneous WiFi transmissions is reduced. Therefore, WiFi interference is reduced, decreasing the WiFi capacity degradation rate.

Fig. 8: Average LAA DL and WiFi capacity variations with different duty cycles for a duty cycle period of 20 ms. [Capacity (bits/s) versus duty cycle of LAA transmission (0.2, 0.4, 0.6, 0.8, 1); curves: average WiFi capacity, average LAA DL capacity; $\lambda_{\mathrm{WiFi}} = \lambda_{\mathrm{LAA}} = 2.5$.]

Fig. 9 captures the WiFi DL SINR distributions with four different duty cycles. The results show that WiFi SINR degrades with higher LAA duty cycles. This is because the interference coming from LAA increases with higher LAA duty cycles. A step-like behavior can be observed in the WiFi DL SINR distribution. This is due to the difference between LTE DL and UL interference on WiFi [14].

Fig. 9: WiFi DL SINR distribution with different LAA duty cycles. [CDF versus SINR (dB) for duty cycles 0.2, 0.4, 0.6, 0.8; $\lambda_{\mathrm{WiFi}} = \lambda_{\mathrm{LAA}} = 2.5$. Annotation: when the duty cycle is reduced, LAA interference on WiFi is reduced; hence, WiFi DL SINR improves.]

Fig. 10: LAA DL SINR distribution with different LAA duty cycles. [CDF versus SINR (dB) for duty cycles 0.2, 0.4, 0.6, 0.8; $\lambda_{\mathrm{WiFi}} = \lambda_{\mathrm{LAA}} = 2.5$. Annotation: when the duty cycle is large, WiFi interference on LAA is reduced; hence, LAA DL SINR improves.]

Fig. 11: Aggregate capacity (WiFi + LAA DL) variation with different duty cycles and Q-Learning algorithm (Dynamic). [Capacity (bits/s) versus duty cycle of LAA transmission (0.2, 0.4, 0.6, 0.8, 1, Dynamic); $\lambda_{\mathrm{WiFi}} = \lambda_{\mathrm{LAA}} = 2.5$.]

Fig. 10 shows the LAA DL SINR distribution with four different duty cycles. The LAA SINR improves with larger duty cycles. This is because of the lower WiFi interference experienced due to the reduced number of simultaneous WiFi transmissions.

C. Performance analysis with Q-Learning based dynamic duty cycle selection for LAA

In this section, we evaluate the performance of the proposed Q-Learning based dynamic duty cycle selection technique. For simulations, we consider $\alpha = 0.5$, $\gamma = 0.9$, $\epsilon = 0.03$, and $C_{\mathrm{tar}} = 30$ Mbps with the TDD configuration introduced in Section II-C.

Fig. 11 shows the aggregate capacity variation (WiFi and LAA DL) with different duty cycles and with the Q-Learning based dynamic duty cycle selection technique. The Q-Learning based technique provides the highest total capacity when compared with the fixed duty cycle and full LAA transmission scenarios. The reason for this capacity gain is that, as the LAA BSs dynamically adjust their operating duty cycles based on the bursty traffic arrival given the capacity constraint $C_{\mathrm{tar}}$, WiFi gets a fair amount of transmission opportunities. As the medium sensing procedure in WiFi is one of the main barriers that prevents WiFi from achieving higher capacities, the proposed technique provides a way to overcome that problem. According to Fig. 11, the next highest total capacity is achieved when operating without any transmission gaps. However, as can be seen from Fig. 8, the achievable WiFi capacity is the lowest (21.45 Mbps) in this case, whereas with the Q-Learning based approach, a WiFi capacity of 39.7 Mbps could be achieved while keeping the LAA capacity around $C_{\mathrm{tar}} = 30$ Mbps.

V. CONCLUDING REMARKS

In this paper, we have proposed a Q-Learning based dynamic duty cycle selection approach in which periodic transmission gaps are configured by LAA, so as to effectively coexist with WiFi systems in the unlicensed spectrum. First, we evaluate WiFi and LAA performance with a fixed value of the transmission gap. Then, the overall system performance with the proposed Q-Learning based dynamic duty cycle selection approach is evaluated. Simulation results show that the proposed dynamic duty cycle selection approach for LAA can effectively enhance the overall capacity performance.

ACKNOWLEDGMENT

The authors would like to thank Fujio Watanabe from DOCOMO Innovations, Inc., for fruitful discussions and his useful feedback on the final version of the manuscript.

REFERENCES

[1] “Study on Licensed-Assisted Access using LTE,” 3GPP Study Item -

RP-141397, Edinburgh, Scotland, Sep. 2014.

[2] A. Cavalcante, E. Almeida, R. Vieira, F. Chaves, R. Paiva, F. Abinader,

S. Choudhury, E. Tuomaala, and K. Doppler, “Performance Evaluation

of LTE and Wi-Fi Coexistence in Unlicensed Bands,” in Proc. IEEE

Vehi. Technol. Conf. (VTC), Jun. 2013, pp. 1–6.

[3] Qualcomm, “Extending LTE Advanced to unlicensed spectrum,” Dec.

2013, White Paper.

[4] T. Nihtila, V. Tykhomyrov, O. Alanen, M. Uusitalo, A. Sorri, M. Moisio,

S. Iraji, R. Ratasuk, and N. Mangalvedhe, “System performance of LTE

and IEEE 802.11 coexisting on a shared frequency band,” in Proc. IEEE

Wireless Commun. Networking Conf. (WCNC), Apr. 2013.

[5] M. Beluri, E. Bala, Y. Dai, R. Di Girolamo, M. Freda, J. Gauvreau,

S. Laughlin, D. Purkayastha, and A. Touag, “Mechanisms for LTE

Coexistence in TV White Space,” in Proc. IEEE Int. Symp. on Dynamic

Spectrum Access Networks (DYSPAN), Oct. 2012, pp. 317–326.

[6] R. Ratasuk, M. Uusitalo, N. Mangalvedhe, A. Sorri, S. Iraji, C. Wijting,

and A. Ghosh, “License-exempt LTE deployment in heterogeneous

network,” in Proc. Int. Symp. Wireless Commun. Sys. (ISWCS), Aug.

2012.

[7] NTT DOCOMO, “Views on LAA for Unlicensed Spectrum - Scenarios

and Initial Evaluation Results,” 3GPP RAN1 standard contribution -

RWS-140026, Sophia Antipolis, France, Jun. 2014.

[8] SONY, “Requirements and Coexistence Topics for LTE-U,” 3GPP

RAN1 standard contribution - RWS-140010, Sophia Antipolis, France,

Jun. 2014.

[9] E. Almeida, A. Cavalcante, R. Paiva, F. Chaves, F. Abinader, R. Vieira,

S. Choudhury, E. Tuomaala, and K. Doppler, “Enabling LTE/WiFi

Coexistence by LTE blank subframe allocation,” in Proc. IEEE Int. Conf.

on Commun. (ICC), Jun. 2013, pp. 5083–5088.

[10] CableLabs, “Cable Labs perspective on LTE-U Coexistence with Wi-Fi

and Operational Modes for LTE-U,” 3GPP RAN1 standard contribution

- RWS-140004, Sophia Antipolis, France, Jun. 2014.

[11] F. Chaves, E. Almeida, R. Vieira, A. Cavalcante, F. Abinader,

S. Choudhury, and K. Doppler, “LTE UL Power Control for the Improvement of

LTE/Wi-Fi Coexistence,” in Proc. IEEE Vehic. Technol. Conf. (VTC),

Sep. 2013, pp. 1–6.

[12] E. Perahia and R. Stacey, Next Generation Wireless LANs: Throughput,

Robustness, and Reliability in 802.11n. Cambridge Univ. Press, 2008.

[13] “Evolved Universal Terrestrial Radio Access (E-UTRA); Further

advancements for E-UTRA physical layer aspects (Release 9),” Tech. Rep.

3GPP TR36.814, V9.0.0, Mar. 2010.

[14] N. Rupasinghe and I. Güvenç, “Licensed-Assisted Access for

WiFi-LTE Coexistence in the Unlicensed Spectrum,” in Proc. IEEE Global

Telecommun. Conf. (GLOBECOM) Workshops - Emerging Technologies

for 5G Wireless Cellular Networks, Dec. 2014.

[15] M. Simsek, A. Czylwik, A. Galindo Serrano, and L. Giupponi,

“Improved Decentralized Q-learning Algorithm for Interference Reduction

in LTE-femtocells,” in Proc. Wireless Adv., Jun. 2011, pp. 138–143.


