ArticlePDF Available

A Deep Learning Approach for MIMO-NOMA Downlink Signal Detection

Authors:

Abstract and Figures

As a key candidate technique for fifth-generation (5G) mobile communication systems, non-orthogonal multiple access (NOMA) has attracted considerable attention in the field of wireless communication. Successive interference cancellation (SIC) is the main NOMA detection method applied at receivers for both uplink and downlink NOMA transmissions. However, SIC is limited by the receiver complex and error propagation problems. Toward this end, we explore a high-performance, high-efficiency tool—deep learning (DL). In this paper, we propose a learning method that automatically analyzes the channel state information (CSI) of the communication system and detects the original transmit sequences. In contrast to existing SIC schemes, which must search for the optimal order of the channel gain and remove the signal with higher power allocation factor while detecting a signal with a lower power allocation factor, the proposed deep learning method can combine the channel estimation process with recovery of the desired signal suffering from channel distortion and multiuser signal superposition. Extensive performance simulations were conducted for the proposed MIMO-NOMA-DL system, and the results were compared with those of the conventional SIC method. According to our simulation results, the deep learning method can successfully address channel impairment and achieve good detection performance. In contrast to implementing well-designed detection algorithms, MIMO-NOMA-DL searches for the optimal solution via a neural network (NN). Consequently, deep learning is a powerful and effective tool for NOMA signal detection.
Content may be subject to copyright.
sensors
Article
A Deep Learning Approach for MIMO-NOMA
Downlink Signal Detection
Chuan Lin 1ID , Qing Chang 1,* and Xianxu Li 2
1School of Electronic and Information Engineering, Beihang University, Beijing 100191, China;
lclkzjp@hotmail.com
2State Grid Information & Telecommunication Branch, Beijing 100761, China; lixianxu@buaa.edu.cn
*Correspondence: changq@263.net
Received: 7 April 2019; Accepted: 31 May 2019; Published: 2 June 2019


Abstract:
As a key candidate technique for fifth-generation (5G) mobile communication systems,
non-orthogonal multiple access (NOMA) has attracted considerable attention in the field of wireless
communication. Successive interference cancellation (SIC) is the main NOMA detection method applied
at receivers for both uplink and downlink NOMA transmissions. However, SIC is limited by the
receiver complex and error propagation problems. Toward this end, we explore a high-performance,
high-efficiency tool—deep learning (DL). In this paper, we propose a learning method that automatically
analyzes the channel state information (CSI) of the communication system and detects the original
transmit sequences. In contrast to existing SIC schemes, which must search for the optimal order of the
channel gain and remove the signal with higher power allocation factor while detecting a signal with a
lower power allocation factor, the proposed deep learning method can combine the channel estimation
process with recovery of the desired signal suffering from channel distortion and multiuser signal
superposition. Extensive performance simulations were conducted for the proposed MIMO-NOMA-DL
system, and the results were compared with those of the conventional SIC method. According to
our simulation results, the deep learning method can successfully address channel impairment and
achieve good detection performance. In contrast to implementing well-designed detection algorithms,
MIMO-NOMA-DL searches for the optimal solution via a neural network (NN). Consequently, deep
learning is a powerful and effective tool for NOMA signal detection.
Keywords:
5G; non-orthogonal multiple access (NOMA); multiple-input multiple-output (MIMO); deep
learning (DL); wireless communication
1. Introduction
Since the concept of non-orthogonal multiple access (NOMA) transmission was proposed, there have
been at least three categories of non-orthogonal multiple access: power-domain NOMA, code-domain
NOMA, and hybrid-domain NOMA. In this article, we focus on power-domain NOMA for downlink
multiuser multiple-input multiple-output (MIMO) systems on account of its representativeness and
typicality of the non-orthogonal multiple access technique. Results in this paper can offer a reference for
deep learning on other more complicated NOMA techniques. Meanwhile, with the benefit of the MIMO
technique, MIMO-NOMA can enhance the throughput of the communication system [1].
To date, the most universal method for MIMO-NOMA signal detection has been successive
interference cancellation (SIC) reception. In the downlink, a multiuser signal is first multiplexed by
Sensors 2019,19, 2526; doi:10.3390/s19112526 www.mdpi.com/journal/sensors
Sensors 2019,19, 2526 2 of 22
superposition coding with a total power constraint at the base station. At the receiver, a signal suffering
from channel impairment will invoke the SIC mechanism in order to decode the expected signal depending
on the order of the user equipment (UE) channel gains in a cluster. The UE signal with poorer channel
gain is allocated higher transmit power and is decoded first, while the UE signal with lower power
allocation is treated as interference. After the signal with higher power is detected and decoded
correctly, a modulation reconstruction is executed. Then, the reconstructed signal is subtracted from
the received signal. The process continues until the UE can decode its desired information. From an
information-theoretic perspective, SIC is an optimal multiple-access scheme in terms of the achievable
multiuser capacity region in both uplink and downlink [2].
Instead of the classic SIC method, we resort to the power of deep learning—multilayer neural
networks (NNs)—to achieve MIMO-NOMA signal detection. As one branch of machine learning, deep
learning has made substantial progress in recent years and has been applied to many fields. The major
evolution of deep learning mainly involves deep neural networks (DNNs) in pattern classification and
recognition, conventional neutral networks (CNNs) in image processing, and recurrent neural networks
(RNNs) in voice recognition for natural language processing (NLP). The powerful performance of deep
learning has considerably changed our daily lives and the cognition of artificial intelligence (AI).
Note that although Graphic Processing Unit (GPU) resources are still somewhat expensive for users,
the cost in recent years has been in steady decline according to Moore’s law. New products with higher
performance and lower cost continue to spring up. Tensor processing units (TPUs), a customized ASIC
chip for AI, were proposed by Google in 2016. It is reported that the cost of third-generation TPUs could
be lower by 80%. We firmly believe that the era of AI is coming.
In the field of wireless communication, although successful commercial applications related to deep
learning are relatively less abundant [
3
], many researchers are attempting to introduce machine learning
techniques into communication systems, especially for improving existing signal processing algorithms.
In channel decoding, deep learning can learn a form of decoding algorithm rather a simple classifier,
and successful examples have been reported, such as decoding linear code [
4
] and polar code [
5
]. Ref. [
6
]
improves the standard belief propagation (BP) decoder to a BP-CNN and achieves improved bit error
rate (BER) performance with low complexity. Compressed sensing (CS) based on deep learning theory
can outperform the state-of-the-art CS-based methods in terms of both recovery quality and computation
time [
7
]. Modulation classification and identification based on deep learning is also a popular area of
research [
8
]. In the realm of mobile networking, the schematic of the stacked auto-encoder (SAE) works
well in the applications of feature learning, protocol classification, and anomalous protocol detection [
9
].
Mobile traffic classifiers based DL have also been reported to handle encrypted traffic and reflect their
complex traffic patterns [
10
,
11
]. Achievements in blind detection for MIMO systems with deep learning
are also continuously reported [
12
,
13
]. Additionally, a scheme integrating DL into an orthogonal frequency
division multiplexing (OFDM) system has been put forward [
14
], and its numerical results revealed the
potential performance of DNNs. A fully connected end-to-end DL system including an encoding layer,
noise layer, and decoding layer was reported in [
15
] for MIMO, and [
16
] designed a long short term
memory (LSTM) network for uplink NOMA to realize the end-to-end transmission as well. Their results
showed the excellent performance of the autoencoder for jointly learning transmit and receive functions.
However, in most realistic scenarios, it is difficult to synthetically train and optimize both send and receive
functions of the DL system.
Traditionally, the SIC method is affected by the error propagation (EP) and receiver complexity
related to the number of UEs. We propose a learning method based on a DNN to efficiently estimate
the channel state and decode the expected signal. Its benefits not only lie in the performance gain in the
communication system, but in the reduction of reference signal overhead to increase the throughput in the
downlink system.
Sensors 2019,19, 2526 3 of 22
The main contributions of this paper are summarized as follows:
1.
To the best of our knowledge, we designed the first downlink MIMO-NOMA detecting system based
on DL methods. The proposed system can process the traditional MIMO-NOMA signal directly
instead of implementing an SIC receiver.
2.
We take full advantage of the ability of DNNs to process high-dimensional data. The detection
performance can be improved by the DL-based method.
3.
The MIMO-NOMA-DL system can estimate the characteristics of the MIMO Rayleigh fading channel
and then decode the signal. The process of channel estimation and signal detection can be considered
synthetically instead of separately.
4.
We evaluated the performance of the proposed system. Results comparable to traditional
MIMO-NOMA based on SIC were obtained.
5.
We provide relevant simulation results on the impact of some key parameters. These parameters
include the modulation type, power allocation, and mini-batch. The proposed system outperformed
the traditional SIC method in all testing situations.
The remainder of this paper is structured as follows: In Section 2, the necessary background on
MIMO-NOMA system and DL is briefly reviewed. Then, the proposed MIMO-NOMA-DL system
is depicted in detail, including the feasibility analysis, overall structure, and DNN design in Section
3, followed by numerical studies in Section 4. Finally, Section 5concludes the paper and discusses
future works.
Notations: Vectors are denoted by boldface small letters, while matrices are denoted by boldface
capital letters. Superscripts
(·)
,
(·)T
,
(·)H
,
(·)1
and
|| ·||
represent the conjugate, transpose, Hermitian
transpose, inverse, and Frobenius norm operators, respectively. Also,
RN×M×K
denotes the vector space of
all N×M×Kreal matrices and H(n)is the node-n matricization.
2. Fundamentals of MIMO-NOMA and the Deep Learning System
In this section, we describe the fundamental theory of NOMA transmission and the traditional SIC
algorithm. Then, we analyze the basic architecture of deep learning. For simplicity, we assumed that the
bandwidth is 1 Hz. The channel impairment includes Rayleigh fading channel and the additive white
Gaussian noise (AWGN) channel.
2.1. MIMO-NOMA Basics
NOMA can implement multiple-signal multiplexing by allocating unbalanced power within a
same time/frequency/code resource. In contrast to traditional orthogonal multiple access (OMA) using
orthogonal resources such as OFDM, NOMA utilizes power in a non-orthogonal form to substantially
enhance the spectrum efficiency at the expense of receiver complexity. Figure 1shows the overall
architecture of the basic NOMA system.
Sensors 2019,19, 2526 4 of 22
Figure 1.
Non-orthogonal multiple access (NOMA) system architecture. AWGN: additive white Gaussian
noise; SIC: successive interference cancellation.
In conventional downlink multiuser MIMO, UEs generally occupy orthogonal resources. Interbeam
interference can be completely eliminated when the number of base station (BS) transmit antennas is equal
to or greater than the number of receive antennas [
17
]. In the MIMO-NOMA system, however, the total
number of UE antennas is always greater than the number of transmit antennas, and these UEs have to
share a cluster. Thus, interference from other UEs in the same cluster is inevitable. Multiple antennas at
the base station (BS) can radiate multiple beams to form different directions via beamforming technology.
A schematic is shown in Figure 2. For simplicity, in this paper, we focus on signal detection from UEs in a
single cluster.
Figure 2.
System block diagram of the multibeam multiple-input multiple-output (MIMO)-NOMA.
BS: base station.
Supposing that the number of users is
K
, the
i
-th UE signal at the BS can be denoted as
si(t)
,
(i=
1, 2,
···
,
K)
. The power allocated to user
i
is denoted as
pi
, and the transmission power is limited by
the total power P, where P=p1+p2+· ·· +pK. The total transmission signal can be expressed as:
Sensors 2019,19, 2526 5 of 22
s(t)=
K
i=1
pisi(t). (1)
During power allocation, various strategies can be followed for different situations and destinations.
A detailed analysis of these strategies can be seen in [
18
]. The signal received through the fading channel
and AWGN channel can be denoted as:
y(t)=h(t)
K
i=1
pisi(t)+n(t). (2)
As mentioned previously, the detection of NOMA usually adopts the SIC method, and NOMA with
a SIC has shown to be an optimal multiple-access scheme in terms of the achievable multiuser capacity
region in both uplink and downlink [2]. The SIC processes are shown in Figure 3.
Figure 3. SIC receiver architecture. SNR: signal-to-noise ratio.
At the receiver, the SIC process is executed in descending order of signal-to-noise ratio (SNR).
For example, for
p1>p2>·· · >pK
, user 1 will be decoded directly while the other signals are viewed as
noise. The throughput can be expressed as:
R1=log2 1+p1|h1|2
K
i=2pi|h1|2+N0!. (3)
For user
k[
1,
K]
, assuming that the first
k
1 users are decoded perfectly, the throughput for user
kis:
Rk=log2 1+pk|hk|2
K
i=k+1pi|hk|2+N0!. (4)
Hereby, the throughput for user Kis:
RK=log2 1+pK|hK|2
N0!. (5)
Sensors 2019,19, 2526 6 of 22
Clearly, the decoding error of the higher-power signal is accumulated and affects the decoding
accuracy of the lower-power signal. This is a key problem of the SIC method that must be solved.
2.2. Deep Learning Basics
The main techniques of deep learning include DNN, CNN, and RNN. In this section, the basics of
these DL-based approaches are briefly introduced.
The DNN is a deeper version of the neural network that generally consists of three types of layers:
input, hidden, and output. The input layer and output layer are single layers, whereas the hidden layers
can be extended to multiple layers depending on the complexity of the signal-processing algorithm.
Each layer contains multiple nodes, and the effects are exerted only on adjacent layers. The construction of
DNN model can be seen in Figure 4.
Figure 4. Construction of the deep neural network (DNN) model.
There are two components of the relation between adjacent layers: linear and nonlinear. The linear
component is responsible for the linear relation between the input and output for each layer. It includes
two types of operation: multiplication, represented by the weight
w
, and addition, represented by the bias
b
. In most practical scenarios, however, we face nonlinear problems that cannot be solved by the linear
method. Therefore, the nonlinear component is addressed via the activation function f(·).
Suppose that the output of the
(n
1
)
th layer is
yn1
, the weight matrix of the
n
th layer is
wn
, the bias
vector is bn, and the output of the n-th layer yncan be denoted as:
yn=f(wn·yn1+bn). (6)
A classic DNN activation function is the sigmoid function (7). The range of the function is limited
to
[
0, 1
]
and can approximately represent the probability. The tanh function (8) is also a classic activation
function. The range of the tanh function is extended to
[
1, 1
]
, and the center of each layer’s output is 0,
which results in faster convergence via stochastic gradient descent (SGD). Another powerful activation
function is the rectified linear unit (ReLU) function (9). Instead of restricting the value to
[
0, 1
]
or
[
1, 1
]
,
the ReLU function increases linearly when
x
0 and is zero when
x<
0. The gradient does not disappear
after repeated nonlinear operations.
Sensors 2019,19, 2526 7 of 22
fsig (x)=1
1+expx(7)
tanh(x) = exp xexp x
exp x+exp x(8)
ReLU(x) = (x,x0
0 , x<0(9)
For the multiple hidden layers, assuming that the bias is 0 for simplicity, the transmission expression
can be defined as:
yn=f(wnf(wn1f·· ·w2f(w1y0))(10)
For the output layer, the most popular choices are the sigmoid function expressed in (7) and the
softmax function. The softmax function is used mainly for multiclass classification and can be defined as:
fso f t (x)i=exp (xi)
jexp xj. (11)
In deep learning algorithms, we often need to feed the system considerable data, called the training
set, so that the system can adjust itself adaptively to the optimal status offline. During the training process,
correct data should be used to rectify the output. Then, a connection between the input and the output can
be established in a supervised manner. Afterwards, the trained system can be applied to the test set to
assess the performance of the DNN.
In addition, CNNs also play an important role in DL, especially in the realm of computer vision.
In contrast to the traditional NNs, CNNs have a special structure. A classic CNN is LeNet-5, which can be
seen in Figure 5. Layer C1 is a convolution layer, S2 is a pooling layer, C3 is another convolution layer,
and next is a pooling layer again in S4. C5 can be identified as a dense layer together with the F6 layer.
Figure 5. Construction of the LeNet-5.
Obviously, before the fully connected layers, the input data are processed by multiple convolution
layers and pooling layers. The convolution layer, which is composed of multiple feature maps, can
dramatically reduce the number of connections via the convolution operation. The pooling layers,
also called down-sample layers, can further compress the local data and avoid the overfitting problem.
Max-pooling and mean-pooling are the most common choices to extract the maximum and mean values,
respectively, from the output of convolution data according to the pooling size.
The RNN is another research hotspot in the field of NLP for its ability to memory the data.
By establishing the relationship between current data and past data (and even future data), RNNs can deal
with the situation where sequences at different slots have relations to each other. The basic structure of an
Sensors 2019,19, 2526 8 of 22
RNN is shown in Figure 6. It can be seen that the information of previous data are summarized as state
W(t)
k
to solve the output
ˆ
y(t)
with the current input
x(t)
. The number of RNNs’ output can differ from the
input for various purposes, such as one input to many outputs for music generation, many inputs to one
output for sentiment classification, and many inputs to many outputs for machine translation.
Figure 6. Construction of the recurrent neural network (RNN).
3. Deep-Learning Scheme Based on the MIMO-NOMA System
In wireless communication, signal detection can be considered to be a classification process of
recovering a discrete sequence from an impaired signal. The DL technique has good ability to address this
problem. Therefore, in this section, we consider a novel detector that adopts a DNN in a MIMO-NOMA
system. In contrast to the traditional SIC block, which divides the process of detection into separate blocks,
including channel estimation, MMSE detection, demodulation, channel decoding and signal decision,
the deep learning method can perform all these procedures as a single process. The optimal parameters
can be acquired by continuous iteration to determine the rules relating the output and the label.
3.1. Feasibility Analysis of DL in a MIMO-NOMA System
First, it should be proved that it is realizable to use DL to detect the MIMO-NOMA signal instead of an
SIC receiver. We assume that the number of transmitting and receiving antennas is
M
and
N
. The number
of UEs is K. The MIMO-NOMA transmission signal in (1) can be expressed in matrix form as (12):
S=(S1,S2,·· · ,SM). (12)
Sm(m[1, M]) is the m-th transmission antenna and it can be expressed as:
Sm=
K
k=1pPkSk
m. (13)
Sensors 2019,19, 2526 9 of 22
For M-ary phase shift keying (MPSK) modulation,
Sk
mXi
is the kth UE transmission signal of mth
antenna, and Xiis the set of Mtransmission signals:
Xi=Aexp (jωt+ϕi)(i=1, 2, . . . , M). (14)
The MIMO-NOMA signal uses the power dimension to improve the channel capacity, so the channel
matrix can be doted by a three order tensor
HRN×M×K
.
hk
nm (k[
1,
K]
,
m[
1,
M]
and
n[
1,
N])
is
the channel gain of the kth UE from the mth transmitting antenna to the n-th receiving antenna.
(15)
Transforming the tensor
H
into a matrix
H(n)
is called the node-
n
matricization [
19
]. Here, a mode-3
matricization H(3)can be expressed as:
H(3)=
h1
11 ·· · h1
1N·· · hK
11 ·· · hK
1N
h1
21 ·· · h1
2N·· · hK
21 ·· · hK
2N
.
.
.....
.
..
.
..
.
.....
.
.
h1
M1·· · h1
MN · ·· hK
M1·· · hK
MN
. (16)
The mode-3 matricization of the received signal can be expressed as:
Y(3)=SH (3)+N(3). (17)
The channel gain and receiver signal of the kth UE are denoted by Hkand Yk, respectively:
Hk=
hk
11 hk
12 ·· · hk
1N
h1
21 hk
22 ·· · h1
2N
.
.
..
.
.....
.
.
hk
M1hk
M2·· · hk
MN
, (18)
Yk=yk
1yk
2·· · yk
M. (19)
The SIC detection needs a continuous process of decoding, reconstructing, and signal canceling.
Assume that the power allocated to the UEs decreases gradually
(P1>P2>··· >PK)
.
lk
q
is the estimation
output of the
q
th UE signal at the
k
-th UE receiver. The channel state information (CSI) is perfectly known.
For UE1, the MMSE detection process can be expressed as:
l1
1=arg min
ikY1˜
H1pP1Xik, (20)
˜
H1=H1HH1H1H+σ2
1I1
. (21)
Sensors 2019,19, 2526 10 of 22
For UE2, the information of UE1 should be extracted first, and the process is similar to (20):
l2
1=arg min
ikY2˜
H2pP1Xik, (22)
˜
H2=H2HH2H2H+σ2
2I1
. (23)
A reconstructed signal can be denoted as:
Sl2
1=Aexp (jωt+ϕl2
1). (24)
Then, the reconstructed signal is subtracted from the received signal:
ˆ
Y1=Y2, (25)
ˆ
Y2=ˆ
Y1pP1Sl2
1H2. (26)
Decoding the UE2 signal:
l2
2=arg min
ikˆ
Y2˜
H2pP2Xik. (27)
As the process mentioned above, for the K-th UE, the detection can be shown as:
ˆ
YK=ˆ
YK1pPK1SlK
K1HK, (28)
lK
K=arg min
ikˆ
YK˜
HKpP2Xik. (29)
From Equations (20), (27), (29), it can been seen that the final classification result can be expressed in
the form:
fK·· · f2(Y2f1(Y1A+b1)+b2)·· ·+bK, (30)
where Ykis a constant matrix, bkis a constant vector, and firepresents some form of nonlinear function.
Comparing with (10), it can be found that the DNN detector is capable of replacing the traditional
SIC method in a MIMO-NOMA system and is even more powerful due to its ability to find the optimal
solution with the data-driven mode.
3.2. MIMO-NOMA-DL System
In this subsection, we propose a novel DL detector for MIMO-NOMA signal detection. Without any
extra signal processing, the signal from the receiving antennas can be sent directly to the MIMO-NOMA-DL
detector. Therefore, MIMO-NOMA-DL is an easier and more efficient scheme to replace the SIC receiver.
Overall, the MIMO-NOMA-DL system is composed of three components: training block, testing block,
and DNN detecting block. The construction of the MIMO-NOMA-DL model is illustrated in Figure 7.
The training block is responsible for producing the MIMO-NOMA signal and providing the labels
to the DNN. In this block, to acquire the MIMO-NOMA signal for
Nt
antennas, we should produce two
training sequences of UE1 and UE2 for each antenna. They are then modulated by superposition coding
with different allocation power factors. After impairment by the fading channel and AWGN channel,
Sensors 2019,19, 2526 11 of 22
the receive signal is acquired at the receiver. Meanwhile, these sequences are known by the receiver as
labels, and it is similar to the pilot sequence.
The testing block is used to simulate real-time MIMO-NOMA transmission. In this block, we first
produce the MIMO-NOMA signal. Labels are not required in this part. The testing data are used to assess
the performance of the DNN detection. Notably, in order to avoid a perfect match, the channel models
and the generated data in training blocks and testing block are i.i.d. to ensure that the DNN performs well
in both the training and testing process. The SNR in training block is generated randomly with the data
time slot varying over the range of interest, whereas the SNR in the testing block is fixed so that the error
performance of the DNN can be evaluated in certain SNR conditions.
The DNN block is the main detection block for decoding the received signal. The channel
characteristics and MIMO-NOMA decoding algorithm can be studied by optimizing the hyperparameters
of the deep neural network. In this block, multiple parameters, including the number of layers, activation
function, loss function, and optimization criteria iteration algorithm, must be designed; the details are
discussed in the next subsection.
The first two blocks provide labels and signals polluted by channel, and the last block recovers the
original data. Accordingly, the detection process can be divided into two steps:
Figure 7. Construction of the MIMO-NOMA-DL model. DL: deep learning.
Step 1: Training mode
In training mode, the offline training block is active while the online deployment block remains
inactive. The input of the DNN training system includes two components: the received MIMO-NOMA
signal as the input layer of the DNN system, and the labels as supervised data to help the DNN to
optimize the parameters.
Step 2: Testing mode
The testing mode is activated after the DNN has been trained. In Step 2, the offline block is suspended,
and the online block accesses the DNN system. The system performance is evaluated in this step,
and the results of the simulation are presented in Section 4.
3.3. DNN Design
As mentioned above, a DNN is a deep version of a neural network. The adjustable hyperparameters
include the weights, bias, regularization parameter, learning rate, and drop-out. The DNN model we
designed for MIMO-NOMA detection comprises seven layers: one input layer, one output layer, and five
Sensors 2019,19, 2526 12 of 22
hidden layers. The input layer and hidden layer are fully connected, whereas the output layers are divided
into groups to decode the signals of multiple antennas in a slot.
The input layer is where the MIMO-NOMA signal is received. Suppose that the numbers of transmit
and receive antennas of the BS and UE are
Nt
and
Nr
, respectively. The complex receive signal is
decomposed into the real part and the imaginary part, so the number of input layer cells is 2
Nr
. The input
signal is a two-dimensional vector of a slot and multiple antennas. That is, the 2
Nr
data are sent to the
network in one slot as a column vector.
The hidden layers are composed of five fully connected layers. To avoid the vanishing gradient
problem of the sigmoid function, the ReLU function (9), an effective nonlinear function, is used to activate
the neurons after the linear operation.
The output layer is used to report the final detection results. The normal DNN output layer is usually
fully connected, and has one-hot encoding with the softmax function. In MIMO-NOMA signal detection,
however, signals from multiple antennas should be decoded in a single slot. So, the proposed output layer
was designed to form groups. The number of groups is equal to the number of transmitting antennas
Nt
, and the number of neurons in each group is equal to the number of one-hot encodings. For example,
the output layer structure of a 4
×
4 MIMO-NOMA system is shown in Figure 8. Because the label is not in
the traditional form, but the group one-hot encoding, the output data adopt the soft-decision form with
the sigmoid function (7).
Figure 8. Design of the output layer.
Additionally, the choice of loss function and optimization algorithm is another key point for the
MIMO-NOMA-DL network. The loss function measures the distance between the predictions and labels.
The classic loss function is the mean square error (MSE) function. In logistic regression, MSE performs in
terms of accuracy. However, in multiclass classification problems, MSE has a slow convergence speed.
Here, we consider the cross-entropy function. In Shannon’s information theory, the Kullback–Leibler
divergence (KLD) can be used to represent the difference between two probability distributions, and the
expression is written as:
DKL (P||Q)=
i
P(i)log P(i)
Q(i)(x). (31)
The process of minimizing the KLD is equivalent to minimizing the cross-entropy
H(P
,
Q)
, which is
defined as:
Sensors 2019,19, 2526 13 of 22
H(P,Q) = DKL (P||Q)+H(P). (32)
The cross-entropy function has fast convergence and low complexity in the iterative
optimization process.
Moreover, considering the self-adaptation of the learning rate and the robustness, the Adam [
20
]
method is used as the optimization algorithm. The Adam method improves upon the Momentum and
RMSPro algorithms, and is more robust in terms of the hyperparameters. Details of the optimization
algorithms and their comparison can be found in [21].
To avoid overfitting, we add the L2 regularization term
λ·ikωik2
to the loss function to effectively
decrease the sensitivity of the parameters to parametric variation so that the performance of the test result
is close to the training result.
The proposed DL algorithm based on the above DNN architecture design is summarized in
Algorithm 1.
4. Simulation and Analyses
In this section, we build the NOMA signal detection system based on the deep learning method
and present the numerical results for different parameters. First, we investigated the performance
of the proposed scheme in comparison with the traditional SIC method in a certain scenario. Then,
the influence of diverse types of MIMO-NOMA modulations on the symbol error rate (SER) performance
was studied. Next, we simulated the impact of the power allocation factor on the system performance.
Then, we explored the situation where the estimated CSI is deviated from the actual one. Additionally, to
achieve faster convergence of the MIMO-NOMA-DL algorithm, we conducted simulations with different
mini-batch sizes. Finally, useful recommendations to accelerate training are provided.
Many software and tools are available for machine learning. Considering their efficiency and usability,
two of the most popular tools—Python 3.6 and MATLAB—were used in our numerical analysis. As a
powerful open-source machine-learning framework from Google, TensorFlow with GPU acceleration was
also employed to implement the proposed deep learning algorithm. For simplicity, we concentrated on a
single cluster with two UEs. A 4
×
4 MIMO channel with a complex Rayleigh distribution was considered.
The total transmitted power for one antenna was set to 1 W. UE1 was allocated 80% of the power, and UE2,
20%. The activation function of the output layer was the sigmoid function (7) and that of the hidden
layers was the ReLU function (9). The total number of training samples was 409,600, in the form of the
n
-power of 2, so we could use a smaller data set—mini-batch—to accelerate the convergence. For the 4
×
4
NOMA-MIMO signal, the number of input layer cells in a slot was 8, and the input data were fed to the
DNN as a column vector. All the labels used in the supervised training were one-hot encoded. The key
parameters are summarized in Table 1, and the detailed algorithm is depicted in Algorithm 1.
Sensors 2019,19, 2526 14 of 22
Table 1. Parameter settings used in the simulation. ReLU: rectified linear unit.
Parameter Value
Operating system Windows 7
Framework TensorFlow
Programming language Python 3.6/MATLAB
Channel MIMO channel/AWGN
Fading Rayleigh distribution
Number of UEs per cluster 2
Number of transmit antennas 4
Number of receive antennas 4
Modulation PSK
Number of training samples 409,600
Total transmitted power per antenna 1 W
Power allocation factor 0.8
Hidden layer ReLU
Output later Sigmoid
Algorithm 1 MIMO-NOMA Based on the DL Training Algorithm.
1: Initialize the DNN model;
2:
Generate and adjust the format of the training data. Assuming that the number of slots is
N
, the input
data are denoted as x={x[1],x[2],·· · ,x[N]}. Each x[i]is a MIMO-NOMA column vector in slot i;
3:
Set the key parameters, including mini-batch, learning rate, and output functions of the hidden layer
and output layer, and initialize the weight and bias of each DNN layer;
4:
Implement the forward DNN process and obtain the results of the output layer’s data, denoted as
ˆyi={ˆy[1]
i, ˆy[2]
i,·· · , ˆy[N]
i};
5: Calculate the loss function, that is, the cross-entropy Loss(y, ˆy):
Loss(y, ˆy) =
i
yilog yi
ˆyi
+H(y) + λ·
ikωik2. (33)
6:
Calculate the corrective parameter with the Adam optimization algorithm. Update the parameters
with the algorithm to search for the optimal solution;
7:
Return to Step 4 if the loss function is not small enough, otherwise proceed to the next step. If the loss
function does not meet the requirement, the DNN with the updated parameters should be retrained;
8: Test the trained DNN with the test data and plot the SER–SNR curve.
To evaluate the performance gap, the proposed MIMO-NOMA-DL signal detection was compared
with the traditional MIMO-NOMA-SIC scheme. We assumed that the SIC had perfect knowledge
of channel parameters and that the modulation type of both UE superposition coding signals at the
transmitter was binary phase shift keying (BPSK). In the traditional MIMO-NOMA-SIC detection scheme,
the UE1 signal—which treats the UE2 signal as interference—should be demodulated first. Then, we can
demodulate the UE2 signal after the modulated UE1 signal is removed from the received NOMA-MIMO
signal. In the MIMO-NOMA-DL scheme, however, the received signal is sent to the DNN, and labels
are chosen only for the UE2 sequence during the training step. Figure 9shows the SER–SNR curve of
the numerical simulation. Taking 10
4
as the standard to measure performance gain, we can see that
the proposed MIMO-NOMA-DL reached 12.6 dB, whereas the traditional scheme reached 16.2 dB—a
Sensors 2019,19, 2526 15 of 22
difference of approximately 3.6 dB. Notably, no preprocessing nor postprocessing was performed. Instead
of the traditional complex signal processing for channel estimation and signal demodulation, we used
powerful deep learning tools to perform accurate signal detection. That is, we used a computer to
automatically search for the most rational schemes for channel estimation and signal demodulation to
avoid complicated human designs, which is the main reason we achieved performance gains.
Figure 9. Performance comparison of MIMO-NOMA-DL and MIMO-NOMA-SIC.
As indicated previously, the MIMO-NOMA system adopts superposition coding in which signals from
different UEs in a cluster are overlaid with the specific proportion of the power. Different modulation types
can be used for diverse UE signals. Because power-field NOMA is a type of non-orthogonal technology,
interference from other UEs—especially the type of their modulation—is a significant factor in determining
the demodulation performance. Here, we simulated several groups of different types of modulation
from the most common PSK modulation, and we assessed whether the NOMA-DL system had good
performance in these situations. Table 2shows three groups of simulation parameter settings, including
the situation where both UEs had BPSK or quadrature phase shift keying (QPSK) modulation and where
one UE used BPSK modulation while the other used QPSK. Considering that decoding the UE2 signal
requires the UE1 signal to be decoded first according to the SIC method, we considered only the UE2
signal detection in the DL method due to its higher complexity.
Table 2. Parameter settings used in the simulation.
Modulation Scheme UE1 UE2
1 BPSK BPSK
2 BPSK QPSK
3 QPSK QPSK
Figure 10 shows the modulated MIMO-NOMA signal constellation graph for all three cases. The initial
phases of BPSK and QPSK were 0
and 45
, respectively, and there were two UEs in a cluster. The allocation
Sensors 2019,19, 2526 16 of 22
of the transmit signal power affects the Euclidean distance between constellation points and the error
probability. Here, we were not concerned with the power distribution, and set the power allocation factor
to 0.8 for simplicity.
(a) (b) (c)
Figure 10.
Constellation of UE signal superposition coding: (
a
) BPSK and BPSK, (
b
) BPSK and QPSK,
and (c) QPSK and QPSK.
Figure 11 depicts the detection performance in these three cases. Clearly, the MIMO-NOMA signal
detection based on the DL system had good performance. Besides case 1 mentioned above, nearly 3.5 dB
performance gain was achieved in case 2, and the gain was 1 dB in case 3. These results indicate that both
the characteristics of the wireless MIMO channel with Rayleigh fading and the signal demodulation with
NOMA could be learned through the DNN.
Figure 11. Performance comparison of MIMO-NOMA-DL and MIMO-NOMA-SIC.
Sensors 2019,19, 2526 17 of 22
The allocation of the transmit power of the UEs at the BS is a crucial component of the conventional
NOMA scheme, and has a substantial impact on the NOMA throughput with Shannon’s information
theory. Several studies have provided optimal or suboptimal allocation methods. According to (5), the sum
throughput of two UEs can be denoted as:
Rsum(α) = log21+(1α)ρ|h2|21+ρ|h1|2
1+(1α)ρ|h1|2, (34)
where
ρ
is the transmit SNR.
hi(i=
1, 2
)
is the channel gain,
h1<h2
. The throughput can be shown
to be a monotonically decreasing function of
α
for
dRsum
dα<
0, where
α[
0, 1
]
. Figure 12 shows the
throughput of the NOMA system with power allocation factor fixed at 0.6, 0.7, 0.8, and 0.9. The throughput
increased as the transmit SNR increased. Moreover, the larger the power allocation factor was, the smaller
the throughput.
Figure 12. NOMA throughput with different αvalues.
Usually, the throughput of different UEs should satisfy the quality of service (QoS) requirements, or
the optimal power should be allocated according to the feedback of the channel state. Here, we considered
the performance of the proposed methods from the perspective of reliability. Figure 13 shows the
SER performance in MIMO-NOMA signal detection for different power allocation factors with BPSK
modulation. The proposed DL-based methods achieved good SER performance compared with the SIC
receiver in all cases of power allocation factors. Furthermore, a power allocation factor of 0.8 appeared to
be closest to the optimal solution in terms of the minimum symbol error probability. Here, we considered
some specific fixed power allocation factors on MIMO-NOMA signal detection. The optimal power
allocation factors to minimize the SER performance and sum throughput for DL methods in cases of
multiple users and clusters can be considered in our follow-up study.
Sensors 2019,19, 2526 18 of 22
Figure 13. SER performance with different power allocation factors.
The process of channel estimation and detection was finished at the training stage. By introducing
the error into the channel at the testing stage, we explored how the proposed DL approach behaved when
the estimated CSI deviated from the actual situation. The channel error model can be denoted as:
H=ˆ
H+β, (35)
where
H
is the actual channel matrix impaired by error.
ˆ
H
and
are the original Rayleigh channel matrix
and the channel error matrix, respectively, and they are i.i.d.
β
is the error factor. Figure 14 shows the
impact of channel estimation error on the DL-based approach. It can be seen that the SER deteriorated
gradually with the increase of
β
from 0 to 0.1. Compared with the SIC method with perfect CSI, the DL
approach still showed superiority when
β
was less than 0.08. This result explains that the performance
suffered losses when deviations appeared between the estimated and actual CSI, though the performance
of the DL-based approach could keep its predominance within a specified tolerance range.
Sensors 2019,19, 2526 19 of 22
Figure 14. SER performance with channel estimation error.
Restricted by the performance of the CPU/GPU/RAM, the convergence of deep learning algorithms
is often time consuming when addressing a large amount of data. Mini-batch gradient descent is a good
way to shorten the training run-time to obtain the results efficiently. Here, different mini-batch sizes
in the MIMO-NOMA-DL detection method were studied to provide a reference for related research.
Figure 15 shows the loss values for three mini-batch sizes. The loss value for a mini-batch size of 1024
reached 0.32 at approximately 360 epochs. A mini-batch size of 1024 resulted in the fastest convergence
but the worst loss value. By contrast, a mini-batch size of 102,400 produced the best loss performance
but the longest run-time: the loss value tended towards stability at 0.015 at approximately 8000 epochs.
The middle mini-batch size of 10,240 achieved moderate performance. After 6000 epochs, the loss value
approached 0.55. Hence, the choice of mini-batch size is a trade-off between convergence speed and error
precision. The most suitable mini-batch size can be identified to meet the specific requirements. In this
article, we recommend that a smaller mini-batch size be used first. When the loss tends toward stability,
the mini-batch size can be increased until the loss value reaches the required precision.
As mentioned above, the computational complexity of SIC is related to the number of UEs. Suppose
the number of UEs in a cluster is
L
and the computational complexity of SIC is
O(L)
. For a trained DNN
model, the signal detection is the process of forward propagation, and its computational complexity is
O(1). This means that the trained DNN model can realize efficient and real-time signal detection.
Sensors 2019,19, 2526 20 of 22
Figure 15. Loss function for different mini-batch sizes in the MIMO-NOMA-DL method.
5. Conclusions
The application of deep learning in MIMO-NOMA communication systems is a promising approach to
address the shortcomings of the SIC method. Instead of the complicated algorithm design and interference
cancellation process, the deep learning approach can search for the optimal solution of the hyperparameters
of the multilayer neural network with machine learning.
In this paper, we designed an MIMO-NOMA-DL signal-detection system to perform signal
recovery. The proposed technique can simultaneously complete the processes of channel estimation
and MIMO-NOMA signal detection. The detailed construction and learning algorithm have been
provided. We first compared the SER performance of the proposed method and the SIC algorithm
via simulations. The highest performance gain reached 3.6 dB. Then, the impact of the crucial parameters,
including the modulation type and power allocation, were studied. Numerical results showed that the
MIMO-NOMA-DL method had powerful detection performance. Finally, mini-batch gradient descent
simulations were conducted to accelerate the training step of the MIMO-NOMA-DL algorithm. The results
indicate that the mini-batch size is a key parameter for balancing the convergence speed and loss precision.
Future works will explore the DL-based approach to detect other types of NOMA signals, such as
the sparse code multiple access (SCMA), multi-user shared access (MUSA), and pattern-division multiple
access (PDMA). Moreover, we also consider an extension assessing the performance under different
channel situations and the multiple clusters situation. Additionally, detecting the communication signal
with memory using RNNs will be explored. CNNs, another advanced DL approach, could be deeply
developed in terms of their potential in signal detection as our following work.
Author Contributions:
Conceptualization, C.L. and Q.C.; methodology, C.L.; software, C.L.; validation, Q.C. and X.L;
formal analysis, C.L. and Q.C.; investigation, C.L.; resources, C.L. and Q.C.; data curation, C.L.; writing—original
draft preparation, C.L.; writing—review and editing, C.L. and Q.C.; visualization, C.L.; supervision, Q.C.; project
administration, Q.C.; funding acquisition, Q.C., C.L., and X.L.
Funding: This research was funded by the National Natural Science Foundation of China, grant number 61471021.
Sensors 2019,19, 2526 21 of 22
Acknowledgments:
We would like to thank Han Xiao for his significant discuss during this research. In addition,
we thank Kejia Hu for his direction on mathematical calculation.
Conflicts of Interest: The authors declare no conflict of interest.
References
1.
Ding, Z.G.; Adachi, F.; Poor, H.V. The Application of MIMO to Non-Orthogonal Multiple Access. IEEE Trans.
Wirel. Commun. 2015,15, 537–552. [CrossRef]
2.
Higuchi, K.; Benjebbour, A. Non-Orthogonal Multiple Access (NOMA) with Successive Interference Cancellation
for Future Radio Access. IEICE Trans. Commun. 2015,98, 403–414 [CrossRef]
3.
O’Shea, T.; Timothy, J.; Hoydis, J. An Introduction to Machine Learning Communications Systems. arXiv
2017
,
arXiv:1702.00832.
4.
Nachmani, E; Be’ert, Y; Burshtein, D. Learning to Decode Linear Codes Using Deep Learning. arXiv
2016
,
arXiv:1607.04793.
5.
Gruber, T.; Cammerer, S.; Hoydis, J.; Brink, S.T. On Deep Learning-Based Channel Decoding. arXiv
2017
,
arXiv:1701.07738.
6.
Liang, F.; Shen, C.; Wu, F. An Iterative BP-CNN Architecture for Channel Decoding. IEEE J. Sel. Top. Sign. Process.
2018,12, 144–159. [CrossRef]
7.
Sun, B.; Feng, H. Efficient Compressed Sensing for Wireless Neural Recording: A Deep Learning Approach.
IEEE Signal Process. Lett. 2017,24, 863-867. [CrossRef]
8.
Xu, Y.; Li, D.; Wang, Z. A Deep Learning Method Based on Convolutional Neural Network for Automatic
Modulation Classification of Wireless Signals. In Proceedings of the 2nd EAI International Conference on
Machine Learning & Intelligent Communications (MLICOM 2017), Weihai, China, 5–6 August 2017
9.
Wang, Z. The Applications of Deep Learning on Traffic Identification. Available online: https:
//www.blackhat.com/docs/us-15/materials/us-15-Wang-The-Applications-Of-Deep-Learning- On-Traffic-
Identification-wp.pdf (accessed on 1 July 2017).
10.
Aceto, G.; Ciuonzo, D.; Montieri, A.; Pescape, A. Mobile Encrypted Traffic Classification Using Deep Learning.
Available online: https://tma.ifip.org/2018/wp-content/uploads/sites/3/2018/06/tma2018_paper40.pdf
(accessed on 1 June 2019).
11.
Aceto, G.; Ciuonzo, D.; Montieri, A.; Pescape, A. Mobile Encrypted Traffic Classification Using Deep Learning:
Experimental Evaluation, Lessons Learned, and Challenges. IEEE Trans. Netw. Serv. Manag. 2019. [CrossRef]
12.
Jeon, Y.; Hong, S.N.; Lee, N. Blind Detection for MIMO Systems With Low-Resolution ADCs Using Supervised
Learning. In Proceedings of the IEEE International Conference on Communications (IEEE ICC 2017), Paris,
France, 21–25 May 2017
13.
Wang, C. Research and Application of Traffic Sign Detection and Recognition Based on Deep Learning.
In Proceedings of the International Conference on Robots & Intelligent System (ICRIS2018), Amsterdam,
The Netherlands, 21–23 February 2018
14.
Ye, H.; Li, G.Y.; Juang, B.H. Power of Deep Learning for Channel Estimation and Signal Detection in OFDM
Systems. IEEE Wirel. Commun. Lett. 2017,7, 114–117. [CrossRef]
15.
O’Shea, T.; Erpek, T.; Clancy, T.C. Deep Learning-Based MIMO Communications. arXiv
2017
, arXiv:1707.07980v1.
16.
Gui, G.; Huang, H.; Juang, B.H.; Song, Y.; Sari, H. Deep Learning for an Effective Nonorthogonal Multiple Access
Scheme. IEEE Trans. Veh. Technol. 2018,67, 8440–8450. [CrossRef]
17.
Ali, S.; Hossain, E.; Kim, D.I. Non-Orthogonal Multiple Access (NOMA) for Downlink Multiuser MIMO Systems:
User Clustering, Beamforming, and Power Allocation. IEEE Access 2017,5, 565–577. [CrossRef]
18.
Cai, Y.; Qin, Z.; Cui, F.; Li, G.Y.; McCann, J.A. Modulation and Multiple Access for 5G Networks. IEEE Commun.
Surv. Tutor. 2018,20, 629–646. [CrossRef]
19. Kolda, T.G.; Bader, B.W. Tensor decompositions and applications. SIAM Rev. 2009,51, 455–500. [CrossRef]
Sensors 2019,19, 2526 22 of 22
20.
Kingma, D.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International
Conference on Learning Representations (ICLR 2015), San Diego, CA, USA, 7–9 May 2015
21.
Schaul, T.; Antonoglou, I.; Silver, D. Unit Tests for Stochastic Optimization. In Proceedings of the International
Conference on Learning Representations (ICLR 2014), Banff, AB, Canada, 24–26 April 2014
©
2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article
distributed under the terms and conditions of the Creative Commons Attribution (CC BY)
license (http://creativecommons.org/licenses/by/4.0/).
... ln conclusion, NOMA performance can be improved by applying AI technologies in to channel estimation, interference mitigation, detection or modulation, resource optimization, and signal processing [50], [87], [93], [277]- [279]. For example, most AI solutions for various PD-NOMA designs include DL (e.g., [182]- [186], [251], [255], [260], [261]), DRL (e.g., [178], [179], [250], [252], [253], [257]- [259], [262], [263]), and FL (e.g., [192], [193]). In particular, DL is used in MIMO NOMA for CSI [182] and resource allocation [183], in RSMA for the precoder vector [184] and robustness [185], in HMAS for signal detection [186], in multi-carrier NOMA for power optimizing [255], in NOMA-VLC for signal demodulation [255], in FTN-NOMA for sliding window detection [260], and in cooperative NOMA for channel classification [261]. ...
... For example, most AI solutions for various PD-NOMA designs include DL (e.g., [182]- [186], [251], [255], [260], [261]), DRL (e.g., [178], [179], [250], [252], [253], [257]- [259], [262], [263]), and FL (e.g., [192], [193]). In particular, DL is used in MIMO NOMA for CSI [182] and resource allocation [183], in RSMA for the precoder vector [184] and robustness [185], in HMAS for signal detection [186], in multi-carrier NOMA for power optimizing [255], in NOMA-VLC for signal demodulation [255], in FTN-NOMA for sliding window detection [260], and in cooperative NOMA for channel classification [261]. DRL is used in RSMA and PDMA for resource allocation and optimization [178], [179], in up/downlink NOMA system for transmission scheduling [250], resource allocation [252], [257], [258], beamforming [253], channel estimation [259], [262], and in random NOMA for slot allocation [263]. ...
Preprint
Full-text available
With the rapidly increasing number of bandwidth-intensive terminals capable of intelligent computing and communication, such as smart devices equipped with shallow neural network models, the complexity of multiple access for these intelligent terminals is increasing due to the dynamic network environment and ubiquitous connectivity in 6G systems. Traditional multiple access (MA) design and optimization methods are gradually losing ground to artificial intelligence (AI) techniques that have proven their superiority in handling complexity. AI-empowered MA and its optimization strategies aimed at achieving high Quality-of-Service (QoS) are attracting more attention, especially in the area of latency-sensitive applications in 6G systems. In this work, we aim to: 1) present the development and comparative evaluation of AI-enabled MA; 2) provide a timely survey focusing on spectrum sensing, protocol design, and optimization for AI-empowered MA; and 3) explore the potential use cases of AI-empowered MA in the typical application scenarios within 6G systems. Specifically, we first present a unified framework of AI-empowered MA for 6G systems by incorporating various promising machine learning techniques in spectrum sensing, resource allocation, MA protocol design, and optimization. We then introduce AI-empowered MA spectrum sensing related to spectrum sharing and spectrum interference management. Next, we discuss the AI-empowered MA protocol designs and implementation methods by reviewing and comparing the state-of-the-art, and we further explore the optimization algorithms related to dynamic resource management, parameter adjustment, and access scheme switching. Finally, we discuss the current challenges, point out open issues, and outline potential future research directions in this field.
... Propagation error has an impact on the SIC system as well. Signal classification allows the deep neural network (DNN) to recover a discrete data sequence from a decaying signal [22]. Using DL, the SIC system might be much improved. ...
... The output of both the backward and forward layers is calculated in the same way as the unidirectional RNN. The output Y t of the Bi-LSTM method is given as [22], ...
Article
Full-text available
In this work, we investigate the end-to-end performance of the uplink (U/L) non-orthogonal multiple access (NOMA) system based on the novel Bi-directional long short-term memory (Bi-LSTM) algorithm over the frequency flat independent and identically distributed (i.i.d.) Rayleigh fading channel conditions. When compared to conventional successive interference cancellation (SIC) MIMO-NOMA-based detection systems, the suggested deep learning (DL) based technique integrates the conventional multiple-input multiple-output (MIMO) and power-domain NOMA schemes to improve the symbol error rate (SER) performance. To this end, an optimal power allocation problem for the MIMO-NOMA scheme has been developed that maximizes the data throughput of the end-to-end system. In this work, both imperfect and perfect SIC schemes are considered, and performance comparison is provided between the Bi-LSTM based MIMO-NOMA and LSTM MIMONOMA schemes. The SIC NOMA system achieves 15 dB for 10 6 iterations, but the DL-based MIMO-NOMA scheme achieves 15 dB for 100 iterations. By a factor of four, Bi-LSTM MIMO-NOMA schemes outperform SIC MIMO-NOMA methods. Rather than utilizing conventional SIC systems to determine fading channel coefficients and decode signals, the suggested scheme estimates the relevant data symbol using the more efficient Bi-LSTM algorithm. There is a 4 dB difference, indicating that DL-based MIMO-NOMA outperforms conventional SIC MIMO-NOMA approaches. Furthermore, when the channel estimation error is enhanced from 0 to 1, the performance of DL is considerably decreased. Even with perfect channel state information (CSI), the DL detector outperforms the SIC detector for channel estimate errors of less than 0.07. When differences between the real and predictable channel states occur, the DL detector's performance suffers significantly, yet it can still maintain its majority within a predetermined tolerance range Keywords: signal-to-interference-and-noise ratio (SINR), long short-term memory, millimeter wave (mmWave), signal-to-noise ratio (SNR), NOMA, DL, machine learning (ML)
... There are many benefits were introduced as a result of implementing NOMA technology in 5G networks such as; minimum latency communication with throughput improvement, good reliability with extremely massive connectivity [15]- [18]. NOMA technology was introduced with to approaches; power domain (PD) and coding domain (CD) [19]. In power domain NOMA system, multi-user communication is achieved using multiplexing in power domain unlike the second approach where multi-user communication is handled by implementing multiplexing in code domain such as in code division multiple access (CDMA) ...
... Many researchers discussed the applicant of channel coding receiver's design techniques for NOMA systems [19]- [23]. The tests are examined and evaluated as a BER performances in fading channels in addition to additive white gaussian noise (AWGN) channel. ...
... On the receiver side, signals from various User Equipment (UE) were decoded using a power-domain NOMA technique with SIC. In contrast, it was expected that adopting user-specific spreading sequences might enhance code-domain NOMA approaches to multiplex signals from numerous UEs [2]. Despite the fact that the received signal has severe channel degradation dependent on the UE in the cluster, the receiver decodes it using the SIC method to calculate the UE's channel gain. ...
Article
Full-text available
The present research initiatives are based on improved multiple access methodologies with a focus on future wireless communication technologies. A skilled competitor, Non-Orthogonal Multiple Access (NOMA), can be utilized to construct the next generation of wireless communications. When compared to other orthogonal resources, NOMA's main strength is its ability to handle many users. The major NOMA detection method used at receivers for downlink NOMA transmissions is Successive Interference Cancellation (SIC). The receiver complexity and concerns about error propagation are the key limitations of SIC. Deep Learning (DL) is used for downlink NOMA transmission, which is decoded using a Sparse Code Multiple Access (SCMA) decoder. SCMA is used in conjunction with DL to forecast the channel and decode it at the receiver. Two users are provided equal access to resources, notably power, based on their proximity to the base station (BS). With SCMA decoding at the receiver, simulations for AWGN, Rayleigh, and Rician channels were carried out while various constraints were taken into account. SCMA-DL surpasses the MMSE and SIC detection methods in terms of Bit Error Rate (BER) during the decoding phase.
Chapter
Emotion recognition is a challenging task, but it has many potential applications in areas such as human–computer interaction, healthcare, and marketing. The capacity to record in-the-moment electrical activity in the brain makes the use of EEG brainwave data as a possible signal for emotion identification extremely promising. This study examined the use of deep learning algorithms for EEG brainwave data-based emotion classification. We obtained the “EEG Brainwave Dataset: Feeling Emotions” dataset from Kaggle to carry out our study. This dataset consists of EEG recordings made across three-minute periods of pleasant, neutral, and negative emotional states from two persons (one male and one female). Using a Muse EEG headband, dry electrodes were placed on the TP9, AF7, AF8, and TP10 EEG locations to record the brainwave waves. Three deep learning algorithms were used throughout our training process: a deep neural network (DNN), a long short-term memory (LSTM) network, and a gated recurrent unit (GRU) network. The DNN achieved the highest accuracy of 98.44%, followed by the LSTM network with 97.5% accuracy and the GRU network with 97.18% accuracy. We also calculated other metrics such as precision, recall, and F1-score. The results showed that all three algorithms were able to achieve high accuracy in classifying emotions from EEG data. The LSTM network had the highest precision for the positive state (1.00), followed by the DNN (0.99), and the GRU (0.98). The GRU network had the highest recall for the negative state (0.96), followed by the DNN (0.97) and the LSTM (0.95). The DNN had the highest F1-score for all three states.
Article
This research study focuses on the impact of interference on different 5G system parameters such as modulation techniques (64-QAM, 256-QAM), channel models (AWGN, Rayleigh, and Rician), and MIMO (multiple input, multiple output) in terms of Bit Error Rate (BER). After the analysis of interference for each parameter, an analysis of interference by Short Range Devices (SRDs) on the 5G system network on the downlink channel is done. Finally, successive interference cancellation (SIC) for a 5G Non-Orthogonal Multiple Access (NOMA) system with Power Division Multiplexing (PDM) is applied. A 5G NOMA has higher spectral efficiency as compared to an Orthogonal Frequency Division Multiple Access (OFDMA), so it can fulfil the needs of the Internet of Things (IoT). SIC is implemented using PDM, where each user is assigned a certain power factor and generates a signal, the generated signal is superimposed and the New-Radio (NR) transmitter that transmits the superimposed signal. Finally, the signal is detected with and without SIC on the access side with SRDs as an interferer. After simulation and analysis of different 5G system parameters, it is found that the BER is decreased with an increase in the MIMO and is more applicable to the higher order modulation. With the SIC, the BER for the NOMA users decreased as compared to OFDMA users. The power coefficients for the stronger signal user (user 1) and weaker signal user (user 2) are 0.25 and 0.75 respectively.
Article
Full-text available
The massive adoption of hand-held devices has led to the explosion of mobile traffic volumes traversing home and enterprise networks, as well as the Internet. Traffic Classification (TC), i.e. the set of procedures for inferring (mobile) applications generating such traffic, has become nowadays the enabler for highly-valuable profiling information (with certain privacy downsides), other than being the workhorse for service differ-entiation/blocking. Nonetheless, the design of accurate classifiers is exacerbated by the raising adoption of encrypted protocols (such as TLS), hindering the suitability of (effective) deep packet inspection approaches. Also, the fast-expanding set of apps and the moving-target nature of mobile traffic makes design solutions with usual machine learning, based on manually-and expert-originated features, outdated and unable to keep the pace. For these reasons Deep Learning (DL) is here proposed, for the first time, as a viable strategy to design practical mobile traffic classi-fiers based on automatically-extracted features, able to cope with encrypted traffic, and reflecting their complex traffic patterns. To this end, different state-of-the-art DL techniques from (standard) TC are here reproduced, dissected (highlighting critical choices), and set into a systematic framework for comparison, including also a performance evaluation workbench. The latter outcome, although declined in the mobile context, has the applicability appeal to the wider umbrella of encrypted TC tasks. Finally, the performance of these DL classifiers is critically investigated based on an exhaustive experimental validation (based on three mobile datasets of real human users' activity), highlighting the related pitfalls, design guidelines, and challenges.
Conference Paper
Full-text available
The massive adoption of hand-held devices has led to the explosion of mobile traffic volumes traversing home and enterprise networks, as well as the Internet. Procedures for inferring (mobile) applications generating such traffic, known as Traffic Classification (TC), are the enabler for highly-valuable profiling information while certainly raise important privacy issues. The design of accurate classifiers is however exacerbated by the increasing adoption of encrypted protocols (such as TLS), hindering the applicability of highly-accurate approaches, such as deep packet inspection. Additionally, the (daily) expanding set of apps and the moving-target nature of mobile traffic makes design solutions with usual machine learning, based on manually-and expert-originated features, outdated. For these reasons, we suggest Deep Learning (DL) as a viable strategy to design traffic classifiers based on automatically-extracted features, reflecting the complex mobile-traffic patterns. To this end, different state-of-the-art DL techniques from TC are here reproduced, dissected, and set into a systematic framework for comparison, including also a performance evaluation workbench. Based on three datasets of real human users' activity, performance of these DL classifiers is critically investigated, highlighting pitfalls, design guidelines, and open issues of DL in mobile encrypted TC.
Article
Full-text available
Automatic modulation classification plays an important role in many fields to identify the modulation type of wireless signals in order to recover signals by demodulation. In this paper, we contribute to explore the suitable architecture of deep learning method in the domain of communication signal recognition. Based on architecture analysis of the convolutional neural network, we used real signal data generated by instrument as dataset, and achieved compatible recognition accuracy of modulation classification compared with several representative structure. We state that the deeper network architecture is not suitable for the signal recognition due to its different characteristic. In addition, we also discuss the difficult of training algorithm in deep learning methods and employ the transfer learning method in order to reap the benefits, which stabilize the training process and lift the performance. Finally, we adopt the denoising autoencoder to preprocess the received data and provide the ability to resist finite perturbations of the input. It contributes to a higher recognition accuracy and it also provide a new idea to design the denoising modulation recognition model.
Article
Full-text available
This article presents our initial results in deep learning for channel estimation and signal detection in orthogonal frequency-division multiplexing (OFDM). OFDM has been widely adopted in wireless broadband communications to combat frequency-selective fading in wireless channels. In this article, we take advantage of deep learning in handling wireless OFDM channels in an end-to-end approach. Different from existing OFDM receivers that first estimate CSI explicitly and then detect/recover the transmitted symbols with the estimated CSI, our deep learning based approach estimates CSI implicitly and recovers the transmitted symbols directly. To address channel distortion, a deep learning model is first trained offline using the data generated from the simulation based on the channel statistics and then used for recovering the online transmitted data directly. From our simulation results, the deep learning based approach has the ability to address channel distortions and detect the transmitted symbols with performance comparable to minimum mean-square error (MMSE) estimator. Furthermore, the deep learning based approach is more robust than conventional methods when fewer training pilots are used, the cyclic prefix (CP) is omitted, and nonlinear clipping noise is presented. In summary, deep learning is a promising tool for channel estimation and signal detection in wireless communications with complicated channel distortions and interferences.
Article
Nonorthogonal multiple access (NOMA) has been considered as an essential multiple access technique for enhancing system capacity and spectral efficiency in future communication scenarios. However, the existing NOMA systems have a fundamental limit: high computational complexity and a sharply changing wireless channel make exploiting the characteristics of the channel and deriving the ideal allocation methods very difficult tasks. To break this fundamental limit, in this paper, we propose a novel and effective deep learning (DL)-aided NOMA system, in which several NOMA users with random deployment are served by one base station. Since DL is advantageous in that it allows training the input signals and detecting sharply changing channel conditions, we exploit it to address wireless NOMA channels in an end-to-end manner. Specifically, it is employed in the proposed NOMA system to learn a completely unknown channel environment. A long short-term memory (LSTM) network based on DL is incorporated into a typical NOMA system, enabling the proposed scheme to detect the channel characteristics automatically. In the proposed strategy, the LSTM is first trained by simulated data under different channel conditions via offline learning, and then the corresponding output data can be obtained based on the current input data used during the online learning process. In general, we build, train and test the proposed cooperative framework to realize automatic encoding, decoding and channel detection in an additive white Gaussian noise channel. Furthermore, we regard one conventional user activity and data detection scheme as an unknown nonlinear mapping operation and use LSTM to approximate it to evaluate the data detection capacity of DL based on NOMA. Simulation results demonstrate that the proposed scheme is robust and efficient compared with conventional approaches. In addition, the accuracy of the LSTM -aided NOMA scheme is studied by introducing the well-known tenfold cross-validation procedure.
Conference Paper
Optimization by stochastic gradient descent is an important component of many large-scale machine learning algorithms. A wide variety of such optimization algorithms have been devised; however, it is unclear whether these algorithms are robust and widely applicable across many different optimization landscapes. In this paper we develop a collection of unit tests for stochastic optimization. Each unit test rapidly evaluates an optimization algorithm on a small-scale, isolated, and well-understood difficulty, rather than in real-world scenarios where many such issues are entangled. Passing these unit tests is not sufficient, but absolutely necessary for any algorithms with claims to generality or robustness. We give initial quantitative and qualitative results on numerous established algorithms. The testing framework is open-source, extensible, and easy to apply to new algorithms.
Article
Inspired by the recent advances in deep learning, we propose a novel iterative belief propagation - convolutional neural network (BP-CNN) architecture for channel decoding under correlated noise. This architecture concatenates a trained CNN with a standard BP decoder. The standard BP decoder is used to estimate the coded bits, followed by a CNN to remove the estimation errors of the BP decoder and obtain a more accurate estimation of the channel noise. Iterating between BP and CNN will gradually improve the decoding SNR and, hence, result in better decoding performance. To train a well-behaved CNN model, we define a new loss function that involves not only the accuracy of the noise estimation but also the normality test for the estimation errors, i.e., to measure how likely the estimation errors follow a Gaussian distribution. The introduction of the normality test to the CNN training shapes the residual noise distribution and further reduces the bit error rate of the iterative decoding, compared to using the standard quadratic loss function. We carry out extensive experiments to analyze and verify the proposed framework. <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">11</sup> Code is available at https://github.com/liangfei-info/Iterative-BP-CNN.
Article
Data compression is crucial for resource-constrained wireless neural recording applications with limited data bandwidth, and Compressed Sensing (CS) theory has successfully demonstrated its potential in neural recording applications. Based on deep learning theory, this paper presents a binarized autoencoder (BAE) scheme for CS, in which a binary sensing matrix and a non-iterative recovery solver are jointly optimized. Experimental results on synthetic dataset reveal that the proposed approach outperforms the state-of-the-art CS-based methods both in terms of recovery quality and computation time.