ArticlePDF Available

Dimming-Aware Deep Learning Approach for OOK-Based Visible Light Communication

June 2020
Journal of Lightwave Technology PP(99):1-1

June 2020
PP(99):1-1

DOI:10.1109/JLT.2020.3004664

Authors:

Tsinghua University

Visible light communication (VLC) is a secure, low-cost and high-rate communication method. On-off keying (OOK) is one of the modulation schemes of VLC, turning each light either on or off to generate binary signals. Recently, deep learning (DL) technologies have made a series of breakthroughs for dimming in VLC system. This task is actually quite challenging for DL, since the VLC system needs to be able to support various dimming targets on account of the different preferences from users in practical applications, resulting in an optimization problem with multiple constraints. This paper presents a DL framework for the dimming-aware binary VLC system, which can meet arbitrary dimming requirements by a universal neural network, named universal auto-encoder (UAE). The proposed UAE creatively utilizes a multi-branch architecture with several carefully designed concatenated patches, and a novel multi-stage training strategy for the optimization problem with multiple dimming constraints. The experiments indicate that the proposed DL approach outperforms existing techniques in terms of the average bit error rate, the satisfaction of the dimming constraints and the robustness for imperfect optical channels.

The ISI problem from signal reflection.

…

The whole structure of the proposed UAE while training.

…

The operation of the patch concatenation layer.

…

The structure of the trained UAE for practical application.

…

The convergence behavior with N = 16, H = I N , ψ 2 = 0 and SNR = 10 dB.

…

Figures - uploaded by Fang Yang

Content may be subject to copyright.

Content uploaded by Fang Yang

Content may be subject to copyright.

JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 38, NO. 20, OCTOBER 15, 2020 5733

Dimming-Aware Deep Learning Approach for

OOK-Based Visible Light Communication

Cong Zou, Student Member, IEEE, and Fang Yang , Senior Member, IEEE

Abstract—Visible light communication (VLC) is a secure, low-

cost, and high-rate communication method. On-off keying (OOK)

is one of the modulation schemes of VLC, turning each light either

on or off to generate binary signals. Recently, deep learning (DL)

technologies have made a series of breakthroughs for dimming

in VLC system. This task is actually quite challenging for DL,

since the VLC system needs to be able to support various dim-

ming targets on account of the different preferences from users in

practical applications, resulting in an optimization problem with

multiple constraints. This article presents a DL framework for

the dimming-aware binary VLC system, which can meet arbitrary

dimming requirements by a universal neural network, named uni-

versal auto-encoder (UAE). The proposed UAE creatively utilizes

a multi-branch architecture with several carefully designed con-

catenated patches, and a novel multi-stage training strategy for

the optimization problem with multiple dimming constraints. The

experiments indicate that the proposed DL approach outperforms

existing techniques in terms of the average bit error rate, the

satisfaction of the dimming constraints, and the robustness for

imperfect optical channels.

Index Terms—Constant weight code, constrained optimization,

deep learning, on-off keying, visible light communication.

I. INTRODUCTION

VISIBLE light communication (VLC) [1]–[3], using light-

emitting diodes (LEDs) to transmit information, becomes

an emerging and promising optical wireless communication

(OWC) technology. Compared with radio frequency (RF) com-

munication, VLC has a number of strengths, such as high-

speed transmission, low cost, no electromagnetic interference,

enhanced security, etc., which result in widespread concern and

research.

In VLC, the message is conveyed via high-rate temporal

changes in light intensity of LEDs, so that intensity modulation

and direct detection (IM/DD) is the critical technique. The

Manuscript received March 20, 2020; revised May 29, 2020; accepted June

22, 2020. Date of publication June 24, 2020; date of current version October

15, 2020. This work was supported in part by the National Natural Science

Foundation of China under Grant 61871255, in part by the Natural Science

Foundation of Guangdong Province under Grant 2015A030312006, in part by

the National Key Research and Development Program of China under Grant

2017YFE0113300, and in part by the Fok Ying-Tung Education Foundation

(Corresponding author: Fang Yang.)

Cong Zou and Fang Yang are with the Department of Electronic Engineering

Beijing National Research Center for Information Science and Technology,

Tsinghua University, Beijing 100084, China, and also with the Key Laboratory

of Digital TV System of Guangdong Province and Shenzhen City, Research

Institute of Tsinghua University in Shenzhen, Shenzhen 518057, China (e-mail:

zouc19@mails.tsinghua.edu.cn; fangyang@tsinghua.edu.cn).

Color versions of one or more of the ﬁgures in this article are available online

at https://ieeexplore.ieee.org.

Digital Object Identiﬁer 10.1109/JLT.2020.3004664

modulation schemes [4], [5] include on-off keying (OOK) [6]–

[9], pulse position modulation (PPM) [10], [11], pulse width

modulation (PWM) [12] and orthogonal frequency division mul-

tiplexing (OFDM) [13], [14], and this paper will pay attention to

the OOK-based VLC systems generating binary messages. At

the same time, the LEDs also serve as lighting sources, giving

rise to intensity constraint problem because of the illumination

requirements from users and LED property. Thus, for the OOK-

based VLC systems, the key issue is the constant weight codes

(CWCs) designing problem.

However, the optimal design of CWCs [15], [16] only fo-

cused on the mathematical properties like minimum Hamming

distance, instead of the system performance like bit error rate

(BER), which is still a challenging task. Furthermore, the im-

perfections of the optical channel [17] in the practical imple-

mentation, such as the inter-symbol interference (ISI) problem,

the thermal noise and the signal-dependent shot noise, also

raise much difﬁculties for these coding techniques. To deal

with this issue, recently, a number of deep learning (DL) based

methods have made a series of breakthroughs for the VLC

system design [18]–[21]. In [18] and [19], an auto-encoder

(AE) and a convolutional AE are employed to learn a set of

CWCs, respectively. However, the trained neural network can

only satisfy a speciﬁc dimming intensity. In [20], though the

universal support of arbitrary dimming targets is achieved, it can

only learn a set of semi-CWC [22], which means the dimming

intensity is controlled by the average Hamming weight over a

codebook, resulting in an unstable dimming intensity. Therefore,

the goal of this paper is to explore a dimming-aware CWC-based

VLC system using a novel DL framework.

In this paper, an AE with multiple branches aiming to meet

different dimming intensities, named universal auto-encoder

(UAE), is creatively proposed. As the difference among the AE

with different dimming constraints in [18] is only the regular-

ization term, it is not necessary to retrain another network for a

different dimming target, which is quite time-consuming. Hence,

in the proposed UAE, a certain part of the structure is shared in

various branches. This idea is similar to hard parameter sharing

in multi-task learning (MLT) [23], which shares the hidden lay-

ers among all tasks, while keeping several task-speciﬁc output

layers. But in the proposed UAE, the output layers are shared,

while the hidden layers are dimming-speciﬁc. By this means, the

risk of overﬁtting as well as the Rademacher complexity can be

reduced, which is crucial for VLC system with imperfect chan-

nels. Furthermore, there are two performance improvements can

be made based on UAE. Firstly, too many branches still lead to

See https://www.ieee.org/publications/rights/index.html for more information.

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

5734 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 38, NO. 20, OCTOBER 15, 2020

considerable computational complexity. Therefore, a specially

designed binary patch is concatenated to the codeword, so that

the weight of the spliced codeword can be adjusted by changing

the number of ones in the patch. In this way, a few branches

are sufﬁcient to meet arbitrary dimming requirements, which

signiﬁcantly reduces the computational complexity. Secondly,

since there are multiple dimming constraints in UAE, it is

unpractical to preset an appropriate penalty parameter for each

constraint artiﬁcially. Hence, to avoid the process of selecting

hyper-parameters, we propose a multi-stage training strategy,

which jointly optimizes the network parameters and the penalty

parameters via a single stochastic gradient descent (SGD) up-

date, making the proposed UAE more feasible and adaptive.

In summary, the contributions of this paper are as fol-

lows. Firstly, the DL framework is utilized to avoid the time-

consuming search process of optimizing the binary CWC code-

words in conventional VLC systems, and to improve system

robustness against the imperfect channels. Secondly, we propose

an innovative universal AE with several branches, where for each

branch, the other branches act as regularizer. Thus, the risk of

overﬁtting and the ability to ﬁt the noise are reduced, resulting in

better BER performance over the channels with thermal noise,

shot noise and ISI problem. Thirdly, the multi-branch structure

with concatenated patches makes the proposed UAE able to

meet arbitrary dimming requirements with a single training

process, which reduces computational complexity and improves

practicability. Finally, a novel multi-stage training strategy is

investigated for optimization problem with multiple constraints,

which enhances the satisfaction of the dimming targets.

The rest of the paper is organized as follows. Section II

contains a brief review of OOK-based VLC system. The network

architecture of the proposed UAE, the corresponding training

strategy and practical application are investigated in Section III.

Section IV contains the implementation details and the experi-

mental results. Finally, Section V includes the conclusion of the

paper.

Notations: Throughout this paper, boldface lowercase letters

(e.g., a) denote vectors, boldface capital letters (e.g., A) denote

matrices, and boldface Euler script letters (e.g., A) denote sets.

[a]krepresents the k-th element of vector a,[A]ij represents

the (i, j)-th element of matrix A. Besides, |A|represents the

number of elements in set A.

II. SYSTEM MODEL

The OOK-based VLC system is mainly concerned with

transmitting Mdifferent messages bi∈M={b1,...,b

M}in

the form of OOK optical pulses emitted by LEDs, which can

be symbolized as binary codewords s∈Sof dimension N,

where Srepresents the code-book. To reﬂect the imperfec-

tions of the optical channel in the practical implementation,

the signal is added with thermal noise nth ∈RN∼N(0,σ

and signal-dependent shot noise nsh ∈RN∼N(0,sψ2σ2),

where ψ2stands for the shot noise variance scaling factor.

Thus, the received signal ydetected by a photo-detector (PD) is

given by

y=Hs +nth +nsh,(1)

Fig. 1. The ISI problem from signal reﬂection.

where H∈RN×Nstands for an optical communication channel

matrix. When H=IN, the channel is an additive noise channel.

However, in practical indoor VLC environment, the reﬂection

of wall will lead to ISI problem, as shown in Fig. 1 [24]. In this

way, the received signal is

y(t)=h(1)(t)⊗x(t)+h(2) (t)⊗x(t−τd)+nth(t)+nsh (t),

(2)

where h(1)(t)and h(2) (t)are the corresponding impulse re-

sponses for LED and reﬂection on the wall. τdis time delay

calculated as τd=(d1+d2−d)/c, and cis the speed of light.

Thus, the optical communication channel matrix His expressed

[H]ij =⎧

⎨

⎩

1+γ(1 −Δ),for j=i

γΔ,for j=i−1

0,else

(3)

where γ=h(2)/h(1) =d4/(d1+d2)4,Δ=τd/T , and Tis the

bit time interval.

To meet the illumination requirements, the number of ones

in the binary codewords s, i.e., the Hamming weight, should be

equal to the required dimming intensity I∈0,1,...,N, that is



i=1

[s]i=I, for s∈S,(4)

and the code-book Sis called CWC.

III. THE PROPOSED UNIVERSAL AUTOENCODER

The structure of the proposed dimming-aware neural network,

UAE, is introduced in this section. While training, the proposed

UAE is constructed with an encoding network, a binarization

operation, a dimming constraint operation, a patch concatenation

operation, an optical channel and a decoding network, as shown

in Fig. 2, which will be discussed in detail in the following.

A. Encoding and Decoding Network

The input data of the encoding network is the one-hot repre-

sentation xof message bi∈M, which sets the value of the i-th

element of a M-dimensional zero vector to one. The encoding

network consists of two fully connected (FC) layers: The ﬁrst

hidden layer has input size of M(Mis the number of different

messages) and output size of N(Nis the dimension of the binary

codeword s), with the rectiﬁed linear unit (ReLU) applied as

the activation function. To make UAE able to satisfy various

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

ZOU AND YANG: DIMMING-AWARE DEEP LEARNING APPROACH FOR OOK-BASED VISIBLE LIGHT COMMUNICATION 5735

Fig. 2. The whole structure of the proposed UAE while training.

dimming constraints, the second hidden layer is divided into B

branches, each of which is trained to meet a speciﬁc dimming

target, and is provided with input size of N, output size of N.

Thus, the output of the encoding network can be expressed as

h2[k]=W2[k]×ReLU(W1x+b1)+b2[k],k=1,...,B.

(5)

The input data of the decoding network is the received sig-

nal y[k]as (1). The decoding network is almost symmetrical

to the encoding network, which also consists of two FC layers:

The third hidden layer is divided into Bbranches, each of which

has its own parameters {W3[k],b3[k]}(k=1,...,B)with the

ReLU function applied as the activation function, and is provided

with input size of N, output size of M. The fourth hidden layer

has input size of Mand output size of N, with the softmax

function applied as the activation function. Thus, the output of

the decoding network can be expressed as

h4[k]=softmax W4×ReLU W3[k]+b3[k]+b4,(6)

and each element of h4[k], denoted as [h4[k]]j, stands for the

probability that the input message bbelongs to the j-th one.

Hence, according to the maximum likelihood theorem, the re-

constructed message is obtained as

b[k]=arg max

1≤j≤M[h4[k]]j,k=1,...,B. (7)

By optimizing the network parameters Θ={Wl,bl|l=

1,...,4}, our goal is to minimize the cost function, i.e.,

min

ΘC(Θ)= 1



k=1

log [h4[k]]bi,(8)

which is the cross entropy between the probability distribution

vector h4[k]and the input message bi. The parameters Θcan be

optimized by SGD algorithm step by step as follows

Θt=Θt−1−η∇ΘCΘt−1,(9)

where Θtrepresents the value of Θat the t-th step, ηdenotes

the learning rate and the gradient ∇C(Θ)is obtained by the back

propagation algorithm [25].

In this way, training only one network, instead of training

different networks for different dimming constraints like

[18], [19], can meet multiple dimming needs at the same

time. Thus, to meet Bdifferent dimming intensities, the

computational complexity of the encoding network is

reduced from O(BMN +BN2)to O(MN +BN2), and

the same is true for decoding network. Besides, the ﬁrst

and the last hidden layer are shared among all branches,

that decreases the number of network parameters to be

trained from {Wl[k],bl[k]|l=1,...,4,k=1,...,B}to

{W1,b1,W2[k],b2[k],W3[k],b3[k],W4,b4|k=1,...,B}.

Further, for any of these branches, the other branches act as

regularizer, that reduces the risk of overﬁtting and the ability to

ﬁt the additive noise from the imperfect channels.

B. Binarization

To implement OOK modulation, the binarization operation

needs to convert the output of the encoding network h2[k](k=

1,...,B), which is continuous in nature, to binary code e[k].

The most direct way is to apply the unit step function on h2[k].

However, since the unit step function is unsmooth and the

gradient is zero for all nonzero [h2[k]]j(j=1,...,N), which

further leads to the gradient of cost function with respect to the

parameters of encoding network diminishing to zero. According

to (9), these parameters cannot be optimized, which gives rise

to the vanishing gradient issue [26].

To tackle this problem, this paper implements a multi-stage

training strategy for binarization [27], which gradually anneals

a soft binarization procedure to a hard one. Since it is known

that the scaled sigmoid function with a scaling parameter β,

sigmoid(βz)= 1

1+e−βz ,β>0,(10)

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

5736 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 38, NO. 20, OCTOBER 15, 2020

Algorithm 1: The Binarization Algorithm.

Input:Θ[0] and β[i],i=1,...,P;

1: for g=1to Pdo

2: Initialize Θ0=Θ[g−1];

3: Set β=β[g];

4: Set t=1;

5: repeat

6: Θt=Θt−1−η∇ΘC(Θt−1);

7: t=t+1;

8: until the converged parameters Θ[g]are obtained;

9: end for

is smooth and as the value of βincreases, the scaled sigmoid

function will become more unsmooth and closer to the unit step

function. Thus, we can replace the unit step function with the

scaled sigmoid function, i.e., [e[k]]j=sigmoid(β[h2[k]]j), and

employ a multi-stage training strategy for binarization to obtain

binary outputs.

As the sigmoid function is a common and effective activa-

tion function which can be successfully trained, the scaling

parameter βis initialized as β[0] = 1, then, for the following

stages, βis increased such that β[0] <β[1] <··· <β[P].At

each stage g, the parameters of the proposed UAE is initialized

with the converged parameters Θ[g−1] obtained at the previous

stage g−1, and the proposed UAE at stage gis trained until

convergence with β=β[g]. After that, the converged param-

eters Θ[g]of stage gwill be used as initialization in the next

stage, and the overall binarization algorithm is illustrated in

Algorithm 1. Thus, with the previous pre-training as base-

ment, the parameters just need to be ﬁne-tuned at each stage,

which makes the binarization operation easier to converge and

equipped with better performance. When βis large enough, this

binarization operation can generate exactly binary codes just as

the unit step function does.

C. Dimming Constraint

To satisfy the dimming constraints, the unconstrained opti-

mization problem in (8) is converted to the one with multiple

constraints, i.e.,

min

ΘC(Θ)

s.t. Hk(Θ)=Ik,k=1,2,...,B,

(11)

where Hk(Θ)=N

i=1[d[k]]i, and Ik(k=1,2,...,B)are B

various dimming constraints in Fig. 2. Since DL approaches

are only effective for unconstrained optimization problem, a

popular way to remove these constraints is adding regularization

terms Rk=Hk(Θ)−Ik(k=1,2,...,B)behind the original

cost function as

min

ΘC(Θ)+



k=1

λkR2

k,(12)

Algorithm 2: The Proposed Constrained Training Algo-

rithm.

Input:Θ[0] and λ[0];

1: for g=1to pdo

2: Initialize Θ0=Θ[g−1];

3: Set t=1;

4: repeat

5: Θt=Θt−1−η(∇ΘC(Θt−1)+B

k=1 2λk[g−

1] ×(Hk(Θt−1)−Ik)∇ΘHk(Θt−1));

6: t=t+1;

7: until the converged parameters Θ[g]are obtained;

8: Initialize λ0=λ[g−1];

9: Set t=1;

10: repeat

11: λt=[λt−1

1,...,λt−1

B]T+η[(H1(Θ[g]) −I1)2,··· ,

(HB(Θ[g]) −IB)2]T;

12: t=t+1;

13: until the converged parameters λ[g]are obtained;

14: end for

where λ=[λ1,...,λB]Tare positive hyper parameters, con-

trolling the weigh between the cost function and the regular-

ization terms. On the one hand, if λis too large, the output

reconstructed message will be quite likely to differ from the input

message b, leading to high BER. On the other hand, if λis too

small, the dimming targets cannot be achieved. The determina-

tion of the value of λis usually relied on a trial-and-error based

searching process, which has shown a satisfying performance

for DL with a single constraint like [18], [19]. However, with

multiple constraints, the proposed UAE using trial-and-error is

extremely time-consuming and troublesome. Thus, instead of

presetting the value of λby hand, we improve the constrained

training algorithm in [28], which will be discussed in detailed

next.

The Lagrange duality method [29] is an effective way to

achieve our goal. The Lagrange function is just the cost function

with regularization terms as (12), which is given by

L(Θ,λ)=C(Θ)+



k=1

λkR2

k,(13)

Then, the dual function is represented as

G(λ)=min

ΘL(Θ,λ),(14)

and the dual problem is given by

max

λG(λ)

s.t. λk≥0,k=1,...,B.

(15)

Based on the weak duality, the Lagrange function and the dual

function can be iteratively optimized to gradually approximate

the optimal solution. Whereas instead of updating Θand λonly

once in each iteration as [28], which is not easy to converge,

we also propose a multi-stage training strategy for dimming

constraint in this section. At each stage g, the network param-

eters and penalty parameters are initialized as Θ0=Θ[g−1]

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

ZOU AND YANG: DIMMING-AWARE DEEP LEARNING APPROACH FOR OOK-BASED VISIBLE LIGHT COMMUNICATION 5737

Fig. 3. The operation of the patch concatenation layer.

and λ0=λ[g−1], respectively, where Θ[g−1] and λ[g−1]

are the converged parameters obtained at the previous stage g−

1. The Lagrange function is optimized ﬁrstly, where the network

parameters Θare updated by SGD algorithm step by step until

getting Θ[g]as follows

Θt=Θt−1−η∂ΘL(Θt−1,λ[g−1])

=Θt−1−η∇ΘC(Θt−1)+



k=1

2λk[g−1]

×Hk(Θt−1)−Ik∇ΘHk(Θt−1)

.(16)

Then, the dual function is optimized based on Θ[g], where the

penalty parameters λare also updated by SGD algorithm till

getting λ[g]as

λt=λt−1+η∂λG(λ)

=⎡

⎣

λt−1

···

λt−1

⎤

⎦+η⎡

⎣

(H1(Θ[g]) −I1)2

···

(HB(Θ[g]) −IB)2⎤

⎦.(17)

After that, the converged parameters Θ[g]and λ[g]of stage g

will be used as initialization in the next stage, and the overall

constrained training algorithm is illustrated in Algorithm 2.

This algorithm makes the penalty parameters λ, which need

to be preset in [18], [19], also become trainable and adjustable.

In this way, the time-consuming trial-and-error based searching

process can be avoided, and the penalty parameters are automat-

ically adjusted according to the change of the tradeoff between

cost function and regularization terms during the training pro-

cess, bringing about more outstanding performance that will be

shown in the experimental results.

D. Patch Concatenation

After dimming constraint, a binary patch of length Lis con-

catenated to the codeword d[i]as Fig. 3, so that the Hamming

weight of the spliced codeword can be adjusted by changing

the number of ones in the patch, in other words, the dimming

intensity Iis adjustable, where I∈{0,1,...,N +L}.It’s

worth noting that, the length of the patch should be carefully

determined, because if Lis too large, the length of the spliced

codeword will become too long to cause excessive pressure on

the channel capacity. Conversely, if Lis too small, the adjustable

range of dimming intensities will become pretty limited, so that

the goal of dimming-aware VLC system cannot be achieved.

Thus, an appropriate value of Lneed to be settled, which is the

Algorithm 3: The Overall Training Algorithm.

Input: Θ[0],λ[0] and β[i],i=1,...,P;

1: for g=1to Pdo

2: Set β=β[g];

3: Initialize Θ0=Θ[g−1];

4: Set t=1;

5: repeat

6: Sample a mini-batch set Tm⊂T;

7: Θt=Θt−1−η(∇ΘC(Θt−1)+B

k=1 2λk[g−

1] ×(Hk(Θt−1)−Ik)∇ΘHk(Θt−1));

8: t=t+1;

9: until the converged parameters Θ[g]is obtained;

10: Initialize λ0=λ[g−1];

11: Set t=1;

12: repeat

13: Sample a mini-batch set Tm⊂T;

14: λt=[λt−1

1,...,λt−1

B]T+η[(H1(Θ[g]) −I1)2,··· ,

(HB(Θ[g]) −IB)2]T;

15: t=t+1;

16: until the converged parameters λ[g]is obtained;

17: end for

minimum value that can be achieved when most of the range of

dimming intensities can be covered.

For the k-th branch, suppose the dimming target is Ik, and

through changing the number of ones in the patch, the set of

the achievable dimming intensities is Ik={I∈N|Ik≤I≤

Ik+L}, where Nis the natural number set. In order to get as

many dimming intensities as possible, the maximum value of the

elements in set Ikshould not be less than the minimum value

of the elements in set Ik+1, then the optimization problem with

regard to Lis

min L

s.t. Ik+L≥Ik+1,k=1,...,B, (18)

and it is obvious that

min L=max{Ik+1 −Ik,k=1,...,B−1}.(19)

Then, if Ik+1 −Ik=L,|IkIk+1|=1. While if Ik+1 −Ik<

L,|IkIk+1|>1, which will result in a waste of resources. To

avoid this issue, we deﬁne that

Ik+1 −Ik=L, ∀k=1,...,B, (20)

and it can be obtained that Ik=I1+(k−1)L.

To meet all of the dimming requirements, the following equa-

tions need to be satisﬁed

I1=0

IB+L=N+L⇒L=N

B−1.(21)

As the number of binary length-Nsequences of Hamming

weight Iis N

I, to get more CWCs, the dimming requirement is

generally not less than 0.1 Nand not larger than 0.9 N. Hence,

to further optimize the value of Land relieve the pressure of the

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

5738 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 38, NO. 20, OCTOBER 15, 2020

optical channel, eqn.(21) is improved as

⎧

⎪

⎨

⎪

⎩

I1=1

10(N+L)

IB+L=9

10(N+L)

⇒L=N

4B−1,(22)

from which it can be noticed that Lis negatively correlated

with the number of branches B. Therefore, Bshould also be

carefully designed, as a large Bwill lead to quite considerable

computational complexity and network storage space, while a

small Bwill bring about a long patch. In the experiments, we

ﬁnd that when B=4,Lis just N/B, which is more concise,

and the results prove that it achieves a good tradeoff.

E. The Overall Training Strategy

The overall training strategy is summarized in this section,

with the training set constructed as T={b(1),...,b(Nt)},

where b(j)∈Mis the randomly generated message and Nt

is the number of training data. We apply the mini-batch SGD

algorithm, which is a stochastic optimization tool to reduce

memory space via randomly sampling a mini-batch set Tm⊂T

of size Jat each training epoch. Then, the objection function to

be minimized, which is just the improved Lagrange function in

(13), is deﬁned as

L(Θ,λ)= 1



j=1



k=1 log [h4[k]]b(j)

+λkN



i=1 db(j)

[k]i−Ik2,

(23)

where db(j)

[k]is the binary code of the k-th branch and input b(j).

The ﬁrst term of (23) is the cost function C(Θ)as (8), and the

second term is the regularization terms Rk(k=1,...,B).

Next, as mentioned in Section III-B and III-C, there are two

multi-stage training strategies in our proposed UAE, one for

binarization and one for dimming constraint. In the overall

training strategy we proposed, these two multi-stage methods

are merged into a single one, as shown in Algorithm 3. Thus,

the trained UAE can be used as a dimming-aware VLC system,

which is more effective with lower BER and computational

complexity.

F. Practical Application

In practical application, the binarization and dimming con-

straint operation can be removed from the trained UAE, whose

structure is shown in Fig. 4. As mentioned above, the set of

the achievable dimming intensities of the k-th branch is Ik=

{I∈N|Ik≤I≤Ik+L}. Therefore, if the desired dimming

intensity Ibelongs to Ik, then the one-hot representation x

will be encoded by the k-th branch of the encoding network,

and be decoded by the k-th branch of the decoding network.

The output of the encoding network is directly binarized by the

unit step function to get the binary code d[k], whose Hamming

weight is just the dimming target of the k-th branch Ik.To

meet the desired dimming intensity I, the number of ones in the

Fig. 4. The structure of the trained UAE for practical application.

concatenated patch is set as I−Ik. Thus, the trained encoding

network as well as the patch concatenation operation can map

message bto (N+L)-length CWCs with desired Hamming

weight I, and the trained decoding network can be used for

message reconstruction.

IV. EXPERIMENTAL RESULTS

In this section, we describe the concrete structure of the

proposed UAE and compare the performance of our method

to that of other state-of-the-art VLC systems in term of BER,

the satisfaction of the dimming constraints and the robustness

for imperfect optical channels.

A. Implementation Details

Our proposed UAE network is constructed based on the

Tensorﬂow package [30]. We randomly generate a training set

with 104samples and a test set with 2×104samples. While

the validation set is a reference set consisting of Mmessages,

e.g., M={b1,...,b

M}. The network parameters Θ[0] and

the penalty parameters λ[0] are updated using the Adam opti-

mizer [31] with learning rate ηattenuating from 0.001 to 0.0001,

and the mini-batch size is set as 256. The number of training

stage P=15and the scaling parameter βis multiplied by 21/3

at each stage with β[0] = 1. Thus, at the ﬁnal stage, the scaled

sigmoid function is almost the same with the unit step func-

tion, so that the scaled sigmoid function can be replaced with

the unit step function while testing. To simulate the practical

optical channel, the encoded signal s[k]is added with thermal

noise nth ∼N(0,σ

2)and shot noise nsh ∼N(0,s[k]ψ2σ2).

SNR is deﬁned as SNR =Es/σ2=I/(Nσ2), and the pro-

posed UAE is trained at a certain SNR, which is determined

by the validation process, while it is tested for SNR from 0 dB

to 20 dB.

B. The Superiority of the Proposed Training Algorithm

To show the impact of out proposed training algorithm as

Algorithm 3, we compare it with two conventional training ap-

proaches. The ﬁrst one is the proposed training strategy without

the multi-stage for binarization as Algorithm 1, which trains the

proposed UAE with βalways equaling to β[P]. Fig. 5 illustrates

the convergence behavior of our proposed training algorithm

and the ﬁrst conventional training approach with same training

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

ZOU AND YANG: DIMMING-AWARE DEEP LEARNING APPROACH FOR OOK-BASED VISIBLE LIGHT COMMUNICATION 5739

Fig. 5. The convergence behavior with N=16,H=IN,ψ2=0 and

SNR =10dB.

TAB L E I

THE DIMMING ACCURACY WITH M=64,N=16,H=IN,ψ2=0,

SNR =10dB, AND DIMMING TARGET =14

epochs, M={64,32},N=16,H=IN,ψ2=0and SNR =

10 dB. It can be found that the value of the cost function can

converge to a lower value by our proposed training algorithm,

because it make the easy-to-train sigmoid function gradually

approach to the hard-to-train unit step function, which leads to a

smoother training process. While the ﬁrst conventional training

approach encounters the vanishing gradient problem caused by

the unit step function, which results in a poor performance.

Notice that the increase in cost function value at the beginning of

the training process comes from maximizing the dual function,

and in the following epochs, both the value of the cost function

and the dual function are close to optimal, which is almost

the same, so that the cost function converges according to the

Karush-Kuhn-Tucker Conditions(KKT).

The second conventional training approach is the proposed

training strategy without the multi-stage for dimming constraints

as Algorithm 2, which trains the proposed UAE with ﬁxed

penalty parameters λas (12). Fig. 6 and Table I shows the

performance of our proposed training algorithm and the second

conventional training approach in each validation stage with

M=64,N=16,H=IN,ψ2=0,SNR=10dB and dim-

ming target I=14. The dimming accuracy is deﬁned as Nd/M ,

where Ndis the amount of validation data that meet the dimming

targets as (4). It is discovered that, with smaller value of λsuch

as 0.03, the second conventional training approach can achieve

lower BER, but the dimming accuracy is always zero, which

means this approach does not constrain the dimming intensity,

Fig. 6. The average BER with M=64,N=16,H=IN,ψ2=0, SNR =

10 dB, and dimming target =14.

Fig. 7. The variety of λduring training process.

whereas the second conventional approach with larger value of λ

reﬂects an opposite situation. However, our proposed training

strategy outperforms the second conventional one in dimming

accuracy, no matter what the value of λis, which implies that

ﬁxed penalty parameters are not sufﬁcient and adjustable for

multi-constraint problems, and the variety of λin our proposed

training strategy is demonstrated in Fig. 7.

C. Comparison With Existing Methods

To show the performance of our proposed UAE, we compare

it with the AE network in [18], which is taken as the baseline

in this paper. We set B=4,N=16 and L=4. It can be

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

5740 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 38, NO. 20, OCTOBER 15, 2020

Fig. 8. The average BER for various test SNR with H=INand N=16.

TAB L E I I

THE DIMMING ACCURACY FOR EACH INTENSITY WITH M=64,

N=16,H=INAND ψ2=0

calculated that the dimming constraints of these four branches

are {2,6,10,14}and the achievable dimming intensities of

each branch are I1={I∈N|2≤I≤6},I2={I∈N|

6≤I≤10},I3={I∈N|10 ≤I≤14}, and I4={I∈

N|14 ≤I≤18}, respectively. The codewords with various

dimming intensities can be obtained by training UAE only

once, whereas the baseline can only satisfy a speciﬁc dimming

intensity, in other words, to compare with the proposed UAE,

the baseline needs to be trained with different parameters for

different dimming constraints. That is, the baseline requires to

be trained for a total of 18 −2+1=17times, which leads to

high computational complexity.

With H=IN,ψ2={0,5},M={64,32}, Fig. 8 depicts

the average BER for various test SNR, in addition, Table II

TABLE III

THE DIMMING ACCURACY FOR EACH INTENSITY WITH M=32,

N=16,H=INAND ψ2=5

and Table III illustrate the dimming accuracy for each dimming

intensity of the UAE and the baseline schemes. It is worth noting

that, no matter what the value of ψ2is, the proposed UAE

performs better than the baseline in terms of BER, which proves

that the proposed UAE is efﬁcient for both thermal noise and

shot noise. This is because the multi-branch structure acts as a

regularizer, which reduces the ability to ﬁt the additive noise

from the imperfect channels. Besides, it can be found from

Table II that, when the dimming constraint is larger than 12, the

baseline with ﬁxed small penalty parameter λ=0.03 is unable to

effectively control the dimming intensity. This is because higher

dimming targets require larger penalty parameters, whereas

larger penalty parameters cause poorer BER performance, that

means ﬁxed penalty parameters cannot lead to a good tradeoff

between BER and dimming accuracy performance. But with

trainable penalty parameters, whose variety is just shown in

Fig. 7, the proposed UAE demonstrates satisfying performance

in the aspect of both BER and dimming accuracy regardless of

the dimming constraints. Moreover, Table III illustrates that the

dimming accuracy of the proposed UAE is still pretty high with

signal-dependent noise, while that of the baseline is below 50%

for all dimming intensity regimes.

Also, it can be realized that the dimming accuracy of the

second and the third branches are higher than the ﬁrst and

the fourth branches. As mentioned above, the number of bi-

nary length-Nsequences of Hamming weight Iis N

I, which

is higher with I={6,10}than with I={2,14}. Thus, the

ratio M/N

Iare different among these branches, leading to

different minimum Hamming distances, further resulting in

different dimming accuracy. While with N=24and various M

and I, similar ratio M/N

Iare obtained in Fig. 9, where the

BER performances of the proposed UAE are pretty close, and

the dimming accuracy under these conditions are all 100%.

Again, both the BER and dimming accuracy performance of

the proposed UAE are superior than those of the baseline.

To stimulate the ISI problem in realistic VLC system, we

assume that LED and PD are equipped on the center of the

ceiling and the ﬂoor of a two-dimensional 3 m-by-3 m room,

that is the coordinate of LED and PD are (1.5, 3) and (1.5,

0), respectively. The detector physical length of the PD is

0.2 m, which means the light can be detected in the range

of (1.5±0.1,0). Thus, to capture the general feature of the ISI

channel, we randomly generate the location of the detected light

as [p, 0] while training, where pis the uniform random variable

within [1.4, 1.6]. Fig. 10 shows the average BER over ISI channel

speciﬁed by (3) with ψ2=0,N=32,M= 128,B=4and

the bit time interval T=10

−8sec. It can be calculated that

the patch length L=8and the dimming constraints of these

four branches are {4,12,20,28}. Compared with the baseline,

the proposed baseline can still generate efﬁcient OOK symbol

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

ZOU AND YANG: DIMMING-AWARE DEEP LEARNING APPROACH FOR OOK-BASED VISIBLE LIGHT COMMUNICATION 5741

Fig. 9. The average BER for various test SNR with H=IN,ψ2=0,N=

24 and similar ratio M/N

I.

Fig. 10. The average BER over ISI channel with ψ2=0,N=32 and

M= 128.

set and learn the general feature of a more complex channel

environment. Besides, as the ratio M/N

Iof these four branches

are all relatively small, the dimming accuracy of the achievable

dimming intensities 4≤I≤36 are all almost 100%.

V. C ONCLUSION

In this paper, we proposed a novel dimming-aware DL frame-

work named UAE for VLC system, which is equipped with a

innovative structure with several branches and binary patches

and a novel multi-stage training strategy to solve this optimiza-

tion problem with multiple dimming constraints. Compared with

existing approaches, our proposed UAE can meet arbitrary dim-

ming targets with a single network, instead of training different

networks for different dimming constraints. Experiments show

that the proposed UAE achieves a superior performance with

lower BER and higher dimming accuracy, and has satisfying

robustness to channel noise.

REFERENCES

[1] T. Komine and M. Nakagawa, “Fundamental analysis for visible-light

communication system using led lights,” IEEE Trans. Consum. Electron.,

vol. 50, no. 1, pp. 100–107, Feb. 2004.

[2] S. H. Lee, S.-Y. Jung, and J. K. Kwon, “Modulation and coding for

dimmable visible light communication,” IEEE Commun. Mag., vol. 53,

no. 2, pp. 136–143, Feb. 2015.

[3] S. Rajagopal, R. D. Roberts, and S.-K. Lim, “IEEE 802.15. 7 visible

light communication: Modulation schemes and dimming support,” IEEE

Commun. Mag., vol. 50, no. 3, pp. 72–82, Mar. 2012.

[4] L. Wu, Z. Zhang, J. Dang, and H. Liu, “Adaptive modulation schemes

for visible light communications,” J. Lightw. Technol., vol. 33, no. 1,

pp. 117–125, Jan. 2015.

[5] Q. Gao, S. Hu, C. Gong, and Z. Xu, “Modulation designs for visible light

communications with signal-dependent noise,”J. Lightw. Technol., vol. 34,

no. 23, pp. 5516–5525, Dec. 2016.

[6] D.-F. Zhang, Y.-J. Zhu, and Y.-Y. Zhang, “Multi-led phase-shifted OOK

modulation based visible light communication systems,” IEEE Photon.

Technol. Lett., vol. 25, no. 23, pp. 2251–2254, Dec. 2013.

[7] S. Zhao, “A serial concatenation-based coding scheme for dimmable vis-

ible light communication systems,” IEEE Commun. Lett., vol. 20, no. 10,

pp. 1951–1954, Oct. 2016.

[8] A. B. Siddique and M. Tahir, “Joint error-brightness control coding for

led based vlc link,” in Proc. IEEE Wireless Commun. Netw. Conf., 2014,

pp. 400–404.

[9] B. Fahs, A. J. Chowdhury, and M. M. Hella, “A 12-m 2.5-gb/s light-

ing compatible integrated receiver for OOK visible light communication

links,” J. Lightw. Technol., vol. 34, no. 16, pp. 3768–3775, Aug. 2016.

[10] B. Bai, Z. Xu, and Y. Fan, “Joint led dimming and high capacity visible

light communication by overlapping ppm,” in Proc. 19th Annu. Wireless

Opt. Commun. Conf., 2010, pp. 1–5.

[11] M. Noshad and M. Brandt-Pearce, “Application of expurgated PPM to

indoor visible light communicationspart ii: Access networks,” J. Lightw.

Technol., vol. 32, no. 5, pp. 883–890, Mar. 2014.

[12] G. Ntogari, T. Kamalakis, J. Walewski, and T. Sphicopoulos, “Combining

illumination dimming based on pulse-width modulation with visible-light

communications based on discrete multitone,” J. Opt. Commun. Netw.,

vol. 3, no. 1, pp. 56–65, Jan. 2011.

[13] M. Z. Afgani, H. Haas, H. Elgala, and D. Knipp, “Visible light communi-

cation using OFDM,” in Proc. 2nd Int. Conf. Testbeds Res. Infrastructures

Develop. Netw. Communities, 2006, pp. 6–134.

[14] J. Lian and M. Brandt-Pearce, “Clipping-enhanced optical OFDM for

visible light communication systems,” J. Lightw. Technol., vol. 37, no. 13,

pp. 3324–3332, Jul. 2019.

[15] P. R. Ostergard, “Classiﬁcation of binary constant weight codes,” IEEE

Trans. Inf. Theory, vol. 56, no. 8, pp. 3779–3785, Aug. 2010.

[16] A. E. Brouwer, J. B. Shearer, N. J. Sloane, and W. D. Smith, “A new

table of constant weight codes,” IEEE Trans. Inf. Theory, vol. 36, no. 6,

pp. 1334–1380, Nov. 2006.

[17] P. Chvojka, S. Zvanovec, P. A. Haigh, and Z. Ghassemlooy, “Channel

characteristics of visible light communications within dynamic indoor en-

vironment,” J. Lightw. Technol., vol. 33, no. 9, pp. 1719–1725, May 2015.

[18] H. Lee, I. Lee, T. Q. Quek, and S. H. Lee, “Binary signaling design for

visible light communication: A deep learning framework,” Opt. Express,

vol. 26, no. 14, pp. 18 131–18 142, Jul. 2018.

[19] H. Lee, S. H. Lee, T. Q. Quek, and I. Lee, “Deep learning framework for

wireless systems: Applications to optical wireless communications,” IEEE

Commun. Mag., vol. 57, no. 3, pp. 35–41, Mar. 2019.

[20] H. Lee, T. Q. Quek, and S. H. Lee, “A deep learning approach to universal

binary visible light communication transceiver,” IEEE Trans. Wireless

Commun., vol. 19, no. 2, pp. 956–969, Feb. 2019.

[21] H. Lee, I. Lee, and S. H. Lee, “Deep learning based transceiver design for

multi-colored vlc systems,” Opt. Express, vol. 26, no. 5, pp. 6222–6238,

Mar. 2018.

[22] S. Zhao and X. Ma, “A spectral-efﬁcient transmission scheme for

dimmable visible light communication systems,” J. Lightw. Technol.,

vol. 35, no. 17, pp. 3801–3809, Sep. 2017.

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

5742 JOURNAL OF LIGHTWAVE TECHNOLOGY, VOL. 38, NO. 20, OCTOBER 15, 2020

[23] S. Ruder, “An overview of multi-task learning in deep neural networks,”

2017, arXiv:1706.05098.

[24] H. Wang and S. Kim, “Decoding of polar codes for intersymbol interfer-

ence in visible-light communication,”IEEE Photon. Technol. Lett., vol. 30,

no. 12, pp. 1111–1114, Jun. 2018.

[25] R. Hecht-Nielsen, “Theory of the backpropagation neural network,” Netw.

Perception, vol. 1, pp. 593–605, Feb. 1989.

[26] G. E. Hinton, S. Osindero, and Y.-W. Teh, “A fast learning algorithm for

deep belief nets,” Neural Computation, vol. 18, no. 7, pp. 1527–1554,

Jul. 2006.

[27] Z. Cao, M. Long, J. Wang, and P. S. Yu, “Hashnet: Deep learning to hash

by continuation,” in Proc. IEEE Int. Conf. Comput. Vision, Oct. 2017,

pp. 5609–5618.

[28] H. Lee, S. H. Lee, and T. Q. Quek, “Constrained deep learning for wireless

resource management,” in Proc. IEEE Int. Conf. Commun., May 2019,

pp. 1–6.

[29] S. Boyd, S. P. Boyd, and L. Vandenberghe, Convex Optimization.Cam-

bridge, U.K.: Cambridge Univ. Press, 2004.

[30] M. Abadi et al., “Tensorﬂow: A system for large-scale machine learning,”

in Proc. 12th USENIX Symp. Operating Syst. Des. Implementation, 2016,

pp. 265–283.

[31] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,”

2014, arXiv:1412.6980.

Cong Zou (Student Member, IEEE) is a Ph.D. student with the Department

of Electronic Engineering, Tsinghua University, Beijing, China. Her research

interests lie in the ﬁeld of visible light communication and deep learning for

communications.

Fang Yang (Senior Member, IEEE) received the B.S.E. and Ph.D. degrees in

electronic engineering from Tsinghua University, Beijing, China, in 2005 and

2009, respectively. He is currently working as an Associate Professor with the

Department of Electronics Engineering, Tsinghua University. He has published

over 120 peer-reviewed journal and conference papers. He holds over 40 chinese

patents and two PCT patents. His research interests lie in the ﬁelds of channel

coding, channel estimation, interference cancellation, and signal processing

techniques for communication system, especially in power line communication,

visible light communication, and digital television terrestrial broadcasting. Dr.

Yang received the IEEE Scott Helt Memorial Award (Best Paper Award in IEEE

TRANSACTIONS IN BROADCASTING) in 2015. He is the Secretary General of the

Sub-Committee 25 of the China National Information Technology Standardiza-

tion (SAC/TC28/SC25). He currently serves as an Associate Editor for the IEEE

ACCESS and is the fellow of IET.

Authorized licensed use limited to: Tsinghua University. Downloaded on October 08,2020 at 02:25:15 UTC from IEEE Xplore. Restrictions apply.

Channel Autoencoder for Wireless Communication: State of the Art, Challenges, and Trends

Article

Full-text available

May 2021

To tackle the sub-optimization problem of the conventional block structure communication systems, recently, a novel concept named end-to-end communication system that can optimize the whole system jointly has been proposed. A channel autoencoder (AE) is one of the methods, which regards the wireless communication system as an AE along with a channel model. In this article, we present a comprehensive overview of the recent advancements of channel AEs, whose practicability mainly depends on the robustness of the impairments in actual channels. Among existing works, assuming the imperfect channel models before training or constructing a communication system without channel models are both viable methods to deal with channel impairments. Therefore, we divide the channel AEs into two categories, model-assumed and model-free channel AEs, for each of which a universal structure is investigated, namely radio transformer network and gradient generation network, respectively. Then their performance is compared extensively, and the open research issues are discussed in the end to provide some directions for future study.

DNN-Based Physical-Layer Network Coding for Visible Light Communications

Article

Full-text available

Dec 2022

The key difference between visible light communication (VLC) and radio frequency (RF) communication is the former’s line-of-sight (LOS) transmission nature, and hence a relay node has to be adopted for VLC to extend its coverage. Physical-layer network coding (PNC) has the advantage of doubling the throughput of a two-way relay network (TWRN), where two end nodes exchange information via the help of a relay, compared with the conventional store-and-forward routing strategy. Although PNC has been studied for VLC in the literature, the state-of-the-art schemes are highly inefficient, requiring tight phase synchronization between the two end nodes, and hence difficult to realize. This paper proposes the application of a deep neural network (DNN) to a PNC VLC system, named DP-VLC, that enables misaligned phases and can deal with the light channel gains and noises in a satisfactory manner without introducing additional computation complexities. We implement DP-VLC using the universal software radio peripheral (USRP) software radio platform and a self-developed VLC optical front-end using commercial off-the-shelf (COTS) light-emitting diodes (LEDs) and photo-diodes (PDs). We find that irregular constellations generated by DP-PNC can be transmitted and recovered in a 1.5 m VLC link effectively. Experimental results show that our DP-PNC prototype performs better than conventional PNC VLC system when the signal-interference-to-noise ratio (SINR) of received optical signals is larger than 13.63 dB and can achieve a throughput of up to 77.38 Mbps in a 20 MHz channel under PNC scheme when the SINR is 22.86 dB. More importantly, we find that DP-VLC performs even better than fixed-constellation PNC system in the saturated SINR regime (e.g., 20–25 dB) where non-linear effects may happen compared with moderate SINR regimes (e.g., 10–20 dB), showing its adaptability to unpredictable impairments in optical links. Our first attempt at realizing DNN-based optical PNC in a TWRN has paved the way for future PNC-enhanced VLC systems.

Autoencoder for Optical Intelligent Reflecting Surface-Assisted VLC System: From Model and Data-Driven Perspectives

Article

Dec 2023

Due to the wide and license-free bandwidth, visible light communication (VLC) functions as a potential technology to meet the exponentially expanding traffic demands in wireless communications. However, the sensitivity to obstacles and high path loss are the key issues that practical VLC systems must carefully deal with. In this paper, the utilization of optical intelligent reflecting surface (OIRS) array in VLC is considered to create additional light propagation paths, thereby achieving a remarkable performance gain. In the OIRS-assisted VLC system, though the power of the signal at receiver can be increased, the resource allocation is relatively complex. Besides, the OIRS also causes time delays among signals received via various propagation paths, which is usually overlooked in existing works. To overcome these issues, the OIRS-assisted VLC system is interpreted as an autoencoder (AE), named OIRS-AE, whose architecture is enhanced according to both the model-driven and data-driven perspectives. By this way, the processing modules at the transmitter, OIRS, and receiver, including the corresponding encoding, resource management, and decoding schemes can be simultaneously optimized, which is expected to achieve more reliable communication. Moreover, the impact of the OIRS-induced time delay spread on system performance is explored under various situations. The simulation results show that the proposed OIRS-AE can outperform the traditional OIRS-assisted VLC systems in terms of bit error rate performance.

基于深度学习和分层角谱的三维纯相位全息显示

Article

Jan 2022

Bit Error Probability Performance of Binary Dimmable Visible Light Communication Systems

Article

Jul 2021

This paper evaluates the practically achievable best error performance obtained from a dimmable visible light communication (VLC) system that conveys information using binary modulations, such as on-off keying (OOK). As compared to previous studies that address the ideal performance of dimmable VLC systems only over reliable channels with arbitrarily small error probability, this paper presents the best achievable error probability of various configurations with the dimming targets, signal quality, and data rate. The numerical results establish the ultimate decoding error performance targeted for dimmable VLC systems and are expected to provide a practical guideline for the enhanced VLC system design.

Binary signaling design for visible light communication: a deep learning framework

Article

Full-text available

Jun 2018
OPT EXPRESS

This paper develops a deep learning framework for the design of on-off keying (OOK) based binary signaling transceiver in dimmable visible light communication (VLC) systems. The dimming support for the OOK optical signal is achieved by adjusting the number of ones in a binary codeword, which boils down to a combinatorial design problem for the codebook of a constant weight code (CWC) over signal-dependent noise channels. To tackle this challenge, we employ an autoencoder (AE) approach to learn a neural network of the encoder-decoder pair that reconstructs the output identical to an input. In addition, optical channel layers and binarization techniques are introduced to reflect the physical and discrete nature of the OOK-based VLC systems. The VLC transceiver is designed and optimized via the end-to-end training procedure for the AE. Numerical results verify that the proposed transceiver performs better than baseline CWC schemes.

Deep learning based transceiver design for multi-colored VLC systems

Article

Full-text available

Feb 2018
OPT EXPRESS

This paper presents a deep-learning (DL) based approach to the design of multi-colored visible light communication (VLC) systems where RGB light-emitting diode (LED) lamps accomplish multi-dimensional color modulation under color and illuminance requirements. It is aimed to identify a pair of multi-color modulation transmitter and receiver leading to efficient symbol recovery performance. To this end, an autoencoder (AE), an unsupervised deep learning technique, is adopted to train the end-to-end symbol recovery process that includes the VLC transceiver pair and a channel layer characterizing the optical channel along with additional LED intensity control features. As a result, the VLC transmitter and receiver are jointly designed and optimized. Intensive numerical results demonstrate that the learned VLC system outperforms existing techniques in terms of the average symbol error probability. This framework sheds light on the viability of DL techniques in the optical communication system design.

A Deep Learning Approach to Universal Binary Visible Light Communication Transceiver

Article

Nov 2019

This paper studies a deep learning (DL) framework for the design of binary modulated visible light communication (VLC) transceiver with universal dimming support. The dimming control for the optical binary signal boils down to a combinatorial codebook design so that the average Hamming weight of binary codewords matches with arbitrary dimming target. An unsupervised DL technique is employed for obtaining a neural network to replace the encoder-decoder pair that recovers the message from the optically transmitted signal. In such a task, a novel stochastic binarization method is developed to generate the set of binary codewords from continuous-valued neural network outputs. For universal support of arbitrary dimming target, the DL-based VLC transceiver is trained with multiple dimming constraints, which turns out to be a constrained training optimization that is very challenging to handle with existing DL methods. We develop a new training algorithm that addresses the dimming constraints through a dual formulation of the optimization. Based on the developed algorithm, the resulting VLC transceiver can be optimized via the end-to-end training procedure. Numerical results verify that the proposed codebook outperforms theoretically best constant weight codebooks under various VLC setups.

Constrained Deep Learning for Wireless Resource Management

Conference Paper

May 2019

Clipping-Enhanced Optical OFDM for Visible Light Communication Systems

Article

May 2019

Visible light communications (VLC), a new optical wireless communication technology that uses illumination light-emitting diodes (LEDs) as transmitters, requires a modulation scheme that is well suited to these devices' nonlinear response. Optical orthogonal frequency division multiplexing (OFDM) is a promising technique to provide high-speed data transmission for VLC. However, the peak transmitted power limitation and nonnegative transmitted signal constraint of the lighting sources can result in nonlinear signal distortion from clipping. In this paper, we propose a novel optical OFDM scheme for VLC systems called clipping-enhanced optical OFDM (CEO-OFDM) that transmits via extra time slots the information clipped by the peak power constraint. CEO-OFDM sacrifices bandwidth to allow a higher modulation index to improve the signal to noise ratio and reduce the clipping distortion caused by the peak power limitation. From analytical and numerical results, the proposed CEO-OFDM provides better bit error rate performance and higher data rate than DC-biased optical OFDM (DCO-OFDM), unipolar OFDM (U-OFDM) and asymmetrically clipped optical OFDM (ACO-OFDM). Furthermore, CEO-OFDM can provide a better illumination performance that supports light dimming.

Deep Learning Framework for Wireless Systems: Applications to Optical Wireless Communications

Article

Mar 2019

Optical wireless communication (OWC) is a promising technology for future wireless communications due to its potential for cost-effective network deployment and high data rate. There are several implementation issues in OWC that have not been encountered in radio frequency wireless communications. First, practical OWC transmitters need illumination control on color, intensity, luminance, and so on, which poses complicated modulation design challenges. Furthermore, signal-dependent properties of optical channels raise nontrivial challenges in both modulation and demodulation of the optical signals. To tackle such difficulties, deep learning (DL) technologies can be applied for optical wireless transceiver design. This article addresses recent efforts on DL-based OWC system designs. A DL framework for emerging image sensor communication is proposed, and its feasibility is verified by simulation. Finally, technical challenges and implementation issues for the DL-based optical wireless technology are discussed.

Decoding of Polar Codes for Intersymbol Interference in Visible Light Communication

Article

Apr 2018

In this paper, a modified likelihood ratio (LR) for decoding of polar codes is proposed for the intersymbol interference (ISI) environment, which is a major drawback in visible light communication. In general, the method to alleviate ISI include equalization and error correction code. Our proposed algorithm focuses on error correction code aspect, where we adopt polar codes to reduce the effect of ISI. In our proposed system, we analyze the detailed effect of ISI and derive that modified LR function, different from the conventional LR function without considering ISI, can efficiently reduce effect of ISI when it was used as input of decoding, such as list-CRC decoding of polar codes. Simulation results show that our proposed algorithm provides a better bit error rate (BER) performance than a referenced algorithm that uses the conventional LR function.

HashNet: Deep Learning to Hash by Continuation

Conference Paper

Oct 2017

An Overview of Multi-Task Learning in Deep Neural Networks

Article

Jun 2017

Sebastian Ruder

Multi-task learning (MTL) has led to successes in many applications of machine learning, from natural language processing and speech recognition to computer vision and drug discovery. This article aims to give a general overview of MTL, particularly in deep neural networks. It introduces the two most common methods for MTL in Deep Learning, gives an overview of the literature, and discusses recent advances. In particular, it seeks to help ML practitioners apply MTL by shedding light on how MTL works and providing guidelines for choosing appropriate auxiliary tasks.

A Spectral-Efficient Transmission Scheme for Dimmable Visible Light Communication Systems

Article

May 2017

Dimming control is of vital importance for visible light communication (VLC) systems. Conventional dimmable transmission schemes based on compensation are spectrally inefficient. In this paper, we propose the use of two techniques, i.e., time-sharing and superposition, to construct spectral-efficient dimmable transmission schemes. For VLC systems with on-off keying (OOK) signaling, we present a general framework to construct dimmable transmission scheme by time-sharing among different dimming control codes. Arbitrary dimming target is achieved by adjusting the proportion of each dimming control code. We then give a practical construction of dimming control codes with semi-constant weight codes. We obtain optimal proportions of the semi-constant weight codes which maximize the asymptotic spectral efficiency using linear programming. We also compute achievable rates of the proposed scheme under different dimming targets and different signal-intensity-to-noise-amplitude ratios. To achieve higher spectral efficiency(>1.0 bps/Hz), we present a dimmable multilevel transmission scheme based on superposition. In the proposed scheme, the first ℓ�1 levels adopt traditional OOK modulation, while the ℓ-th level adopts the dimmable transmission scheme for OOK modulation. The transmitted signal is formed by superimposing modulated signals of the ℓ levels. Analysis shows that the proposed scheme achieves a higher spectral efficiency than the state-of-the-art schemes. Hence, it provides an attractive candidate for dimmable VLC systems with demanding spectral efficiency.

Dimming-Aware Deep Learning Approach for OOK-Based Visible Light Communication

Abstract and Figures

Recommended publications

Binary signaling design for visible light communication: a deep learning framework

Deep Learning-Aided Binary Visible Light Communication Systems

A Deep Learning Approach to Universal Binary Visible Light Communication Transceiver

A Deep Learning Approach to Universal Binary Visible Light Communication Transceiver