ArticlePDF Available

CNN-based Precoder and Combiner Design in mmWave MIMO Systems

Authors:

Abstract and Figures

Hybrid beamformer design is a crucial stage in millimeter wave (mmWave) MIMO systems. In this work, we propose a convolutional neural network (CNN) framework for the joint design of precoder and combiners. The proposed network accepts the input of channel matrix and gives the output of analog and baseband beamformers. Previous works are usually based on the knowledge of steering vectors of array responses which is not always accurately available in practice. The proposed CNN framework does not require such a knowledge and it provides higher performance in capacity as compared to the conventional greedy-and optimization-based algorithms.
Content may be subject to copyright.
1
CNN-based Precoder and Combiner Design in
mmWave MIMO Systems
Ahmet M. Elbir
Abstract—Hybrid beamformer design is a crucial stage in
millimeter wave (mmWave) MIMO systems. In this work, we
propose a convolutional neural network (CNN) framework for the
joint design of precoder and combiners. The proposed network
accepts the input of channel matrix and gives the output of analog
and baseband beamformers. Previous works are usually based
on the knowledge of steering vectors of array responses which is
not always accurately available in practice. The proposed CNN
framework does not require such a knowledge and it provides
higher performance in capacity as compared to the conventional
greedy- and optimization-based algorithms.
Index Terms—mmWave, MIMO, Hybrid beamforming, Deep
learning, Convolutional neural network.
I. INTRODUCTION
Hybrid beamforming is a promising architecture to be
used in next generation millimeter wave (mmWave) MIMO
(Multiple Input Multiple Output) systems where robust beam-
forming performance is provided with smaller cost and less
number of fully-digital beamformers [1]–[4]. Several methods
are proposed to design the hybrid beamformers [3]–[7]. In
[6], a greedy-based approach, orthogonal matching pursuit
(OMP), is proposed where the analog precoder and combiners
are selected from a dictionary of transmit and receive array
responses. This algorithm requires the knowledge of the user
direction-of-arrival/aperture (DOA/DOD) angles to construct
such a dictionary. Using the connection between the optimum
and the hybrid beamformers, [7] proposes an alternating
minimization approach to estimate the analog and baseband
beamformers based on phase extraction.
The above works provides optimization-based and greedy-
based solutions for hybrid beamforming problem. However
achieving the optimum solution and the computation time
are the main drawbacks of the above techniques. In order
to circumvent this issue, we consider deep learning (DL)-
based techniques for the hybrid beamforming problem [8]. DL
has several advantages such as low computational complex-
ity when solving optimization-based or combinatorial/greedy
search problems and the ability to extrapolate new features
from a limited set of features contained in a training set
[9]. A great deal of attention is received for DL-based tech-
niques in communications society for the problems such as
channel estimation [10]–[12] DOA estimation [12], antenna
selection [13], and analog beam selection [14]. An end-to-
end communication scenario is modeled in [15] and [16] by
using auto-encoders where single-input-single-output (SISO)
systems are considered. [17] also uses auto-encoders for the
A.M.E. is with the Department of Electrical and Electronics Engineering,
Duzce University, Duzce, Turkey. E-mail: ahmetmelbir@gmail.com.
channel state information (CSI) feedback problem. In [14], a
sub-optimum method is proposed based on the support vector
machines (SVMs) for analog beamforming vector selection.
Very recently, a DL based hybrid beamforming is considered
in [18] where only precoder design is considered whereas
joint precoder and combiner design is used in massive MIMO
system where the beamforming is required in both end of
the communication [6]. The proposed network architecture
in [18] is based on multi-layer perceptrons which do not
effectively extract the hidden features inherit in the input data
[9], [13]. In order to achieve feature extraction and obtain
better performance, we propose a convolutional neural network
(CNN) framework for mmWave massive MIMO systems.
In this study, we propose a CNN-based framework with
two CNNs, each of which is dedicated to estimate the analog
precoders and combiners respectively. The CNNs accept the
input of channel matrix and give the beamformer weights at
the output. In order to train the network we generated different
channel realizations with synthetic noise added to each input
data. A decoupled optimization problem is formulated and
solved to obtain the best beamformers providing the highest
spectral efficiency. Using the best analog beamformers, we
construct the input-output pairs of the network. Since the
best beamformers are the optimum solution of the problem,
the proposed CNN framwork enjoys better spectral efficiency
and less computation time. Furthermore, our CNN approach
does not require the knowledge of array responses of users’
DOA/DOD angles which are not always accurately available
in practical scenarios.
II. SI GNA L MOD EL A ND PROBLEM FORMULATION
In this work, we consider a single user mmWave MIMO
communication system with multiple antennas. Let NSbe
the number of data streams to be transmitted from the base
station (BS) with NTtransmit antennas to the user with NR
antennas. The BS is equipped with NRF
Tanalog phase shifters
with analog beamformer FRF CNT×NRF
Tand baseband
beamformer FBB CNRF
T×NS. Then the transmitted signal
becomes x=FRF FBB swhere sCNSis the symbol
vector desired to be transmitted and E{ssH}=INS/NS.
The analog beamformers are unitary matrices with equal-norm
elements, i.e., [[FRF ]:,i[FRF ]H
:,i]i,i = 1/NTand we have power
constraint on the transmitter as ||FRF FBB ||F=NS. We can
write the received signal at the NRantennas for a narrowband
block-fading channel as
y=ρHFRF FBB s+n,(1)
where yCNRand ρis the average received power.
nCNRdenotes the additive white Gaussian noise (AWGN)
2
with n∼ CN(0, σ2
nINR)and HCNR×NTis the channel
matrix with E{||H||F}=NRNT. In mmWave transmission,
the channel can be represented by Saleh-Valenzuela (SV)
model [19] where the clustered channel model is used as the
contribution of Ncclusters of Nray paths as
H=γ
Nc
X
i=1
Nray
X
j=1
αij gR(ij)
R)gT(ij)
T)aR(ij)
R)aH
T(ij)
T),
where Θ(ij)
R= (φ(ij)
R, θ(ij)
R)and Θ(ij)
T= (φ(ij)
T, θ(ij)
T)
respectively denote the angle of arrivals and angle of
departures. We denote the angular parameters φand θ
as the azimuth and the elevation angles respectively.
γ=pNTNR/(NcNray)is the normalization factor and αij
is the complex channel gain associated with the ith scattering
cluster and jth path for i= 1, . . . , Ncand j= 1, . . . , Nray.
gR(ij)
R)and gT(ij)
T)are the antenna element gains for
receive and transmit antennas respectively. aR(ij)
R)and
aT(ij)
T)are NR×1and NT×1steering vectors representing
the array responses at the receiver and transmitters
respectively. The nth element of the steering vector
aR(ij)
R)is given as [aR(ij)
R)]n= exp{−2π
λpT
nr(ij)
R)},
where pn= [xn, yn, zn]Tis the position of the nth receive
antenna in Cartesian coordinate system and r(ij)
R) =
[sin(φ(ij)
R) cos(θ(ij)
R),sin(φ(ij)
R) sin(θ(ij)
R),cos(θ(ij)
R)]T. The
transmit side steering vector aT(ij)
T)can be defined in a
similar way as for aR(ij)
R). In order to generate the labels
(precoder and combiners) the proposed CNN frameworks
requires the perfect CSI. However, we use imperfect channel
matrices both in the training and testing stages.
The transmitted signal is received and processed by analog
and baseband combiners as ˜
y=WH
BB WH
RF y, i.e.,
˜
y=ρWH
BB WH
RF HFRF FBB s+WH
BB WH
RF n,(2)
where WRF CNR×NRF
Ris the analog combiner with
the constrained [[WRF ]:,i[WRF ]H
:,i]i,i = 1/NRand WBB
CNRF
R×NSdenotes the baseband combiner matrix. By as-
suming that the Gaussian symbols are transmitted through
the mmWave channel, we can define the spectral efficiency
achieved by the hybrid beamforming [3]–[6] as
RHYB = log2
INS+ρ
NS
Λ1
nWH
BB WH
RF H
×FRF FBB FH
BB FH
RF HHWRF WBB
,(3)
where Λ=
nσ2
nWH
BB WH
RF WRF WBB CNS×NSis the covari-
ance matrix of the noise term in (2) after combining.
Hence the aim in this work is to estimate the hybrid
beamformers FRF ,WRF ,FBB and WRF that maximize the
spectral efficiency as in (3) given the channel matrix H.
III. HYBRID BEA MF OR ME R DESIGN
The optimization problem for joint estimation of hybrid
beamformers ˆ
FRF ,ˆ
FBB ,ˆ
WRF ,ˆ
WBB can be stated as follows
argmax
FRF ,FBB ,WRF ,WBB
RHYB
s.t.: FRF ∈ FRF ,||FRF FBB ||2
F=NS,WRF ∈ WRF ,(4)
where FRF and WRF denote the feasible sets of analog beam-
formers which obey the constraints defined for FRF and WRF .
Obtaining the real-time solution to the problem in (4) is im-
practical due to the complexity of several matrix variables. To
cast the problem in (4) more effectively, we first define the sets
FRF and WRF . Note that the analog beamformers FRF ,WRF
are related with the array responses aT(ij)
T),aR(ij)
R)
through linear transformation [6]. Hence the feasible RF
beamformer sets can be formed as FRF ={F(1)
RF ,...,F(QF)
RF }
where F(qF)
RF =aT(ij)
T), i = 1, . . . , Nc, j = 1, . . . , Nray for
qF= 1, . . . , QF.QF=Npath
NRF
Tis the number RF precoder
candidates and Npath =NcNray. The feasible set for RF
combiner is similarly defined as WRF ={W(1)
RF ,...,W(QW)
RF }
where W(qW)
RF =aR(ij)
R), i = 1, . . . , Nc, j = 1, . . . , Nray
and QW=Npath
NRF
R. Now we can present the joint precoder
and combiner design problem as follows
¯qF,¯qW= argmax
qF,qW
log2
INS+ρ
NS
Λ1
nWH
BB WH
RF
×HFRF FBB FH
BB FH
RF HHWRF WBB
,s.t.:
FRF =F(qF)
RF ,WRF =W(qW)
RF ,FBB = (FH
RF FRF )1FH
RF Fopt,
WBB = (WH
RF ΛWRF )1(WH
RF ΛWopt),(5)
where ¯qF,¯qWrepresent the selected elements in the feasible
sets. Λis the covariance of the array output in (1) which is
given by Λ=ρ
NSHFRF FBB FH
BB FH
RF HH+σ2
nINRS .Fopt and
Wopt are the optimum baseband beamformers which can be
obtained from the singular value decomposition (SVD) of the
channel matrix. Let UCNR×rank(H)and VCNT×rank(H)be
the left and the right singular value matrices of Hrespectively,
where the SVD of HCNR×NTis H=UΣVHwhere Σ
is rank(H)×rank(H)matrix composed of the singular values
of Hin descending order. By decomposing Σand Vas Σ=
diag{Σ(1),Σ(2) },V= [V(1),V(2) ]where V(1) CNT×NS
and V(2) CNT×NRNSrespectively, one can readily select
the unconstrained precoder as Fopt =V(1) [6]. Using the
unconstrained beamformer Fopt,Wopt can be computed as [20]
Wopt =1
ρFoptHHHHFopt +NSσ2
n
ρINS1FoptHHHH.
The solution of (5) requires to visit QFQWnodes which is
computationally prohibitive. In order to reduce the complexity,
(5) is decomposed into two different problems where precoders
(FRF and FBB ) and combiners (WRF and WB B ) are sepa-
rately estimated. By doing so, the complexity is reduced from
QFQWto QF+QW. In order to find the precoders we solve
the following problems, i.e.,
¯qF= argmax
qF
log2|INS+ρ
NSσ2
n
(WoptHWopt)1WoptH
×HFRF FBB FH
BB FH
RF HHWopt|,s.t.:
FRF =F(qF)
RF ,FBB = (FH
RF FRF )1FH
RF Fopt,(6)
3
Algorithm 1 Training data generation.
Input: L,N,NT,NR,NRF
T,NRF
R, SNRTRAIN.
Output: Training data DFand DW.
1: Generate {H(n)}N
n=1 with {F(n)
RF }N
n=1 and {W(n)
RF }N
n=1.
2: for 1nNand 1lLdo
3: [H(l,n)]i,j ∼ CN ([H(l)]i,j, σ2
TRAIN).
4: Find ¯qFby solving (6) for F(qF,l,n)
RF ,1qFQF.
5: Construct ˆ
F(l,n)
RF and F(l,n)
BB from F( ¯qF,l,n)
RF .
6: Find ¯qWby solving (7) for W(qW,l,n)
RF ,1qWQW.
7: Construct ˆ
W(l,n)
RF and W(l,n)
BB from W( ¯qW,l,n)
RF .
8: [[X(l,n)]:,:,1]i,j =|[H(l,n)]i,j |.
9: [[X(l,n)]:,:,2]i,j = Re{[H(l,n)]i,j }.
10: [[X(l,n)]:,:,3]i,j = Im{[H(l,n)]i,j } ∀ij.
11: z(l,n)
F=vec{ˆ
F(l,n)
RF },z(l,n)
W=vec{ˆ
W(l,n)
RF }.
12: end for n,l
13: Training data for CNNFand CNNWis obtained as
DF= ((X(1,1),z(1,1)
F),...,(X(L,N),z(L,N )
F)),
DW= ((X(1,1),z(1,1)
W),...,(X(L,N),z(L,N )
W)).
Fig. 1. The proposed CNN framework for precoder (CNNFat the top) and
combiner design (CNNWat the bottom).
¯qW= argmax
qW
log2|INS+ρ
NSσ2
n
(WH
BB WH
RF
×WRF WBB WH
RF WH
BB )1HFopt FoptHHHWRF WBB |,s.t.:
WRF =W(qW)
RF ,WBB = (WH
RF ΛWRF )1(WH
RF ΛWopt),
Λ=ρ
NS
HFoptFoptHHH+σ2
nINR.(7)
Once (6) and (7) are solved, the analog beamformers are
constructed as ˆ
FRF =FqF)
RF and ˆ
WRF =WqW)
RF . The
baseband beamformers can also be obtained accordingly.
IV. CNN-BASED APPROACH
In this section, we present our CNN framework for joint
precoder and combiner design which is shown in Fig. 1. The
proposed network is composed of two CNNs with 8 layers
which have identical structures except the last layer. The first
layer is the input layer of size NR×NT×3with c= 3
channels. The first channel of the input is the element-wise
absolute value of the channel matrix as [[X]:,:,1]i,j =|[H]i,j|.
The second and the third channels are defined as the real and
the imaginary parts of the channel matrix as [[X]:,:,2]i,j =
Re{[H]i,j }and [[X]:,:,3]i,j = Im{[H]i,j }. The second and third
layer are the convolutional layers with 32 filters of size 2×2.
The fourth and sixth layers are fully connected layers with
1024 units. There are dropout layers after each fully connected
layers (the fifth and seventh layers) with %50 probability. The
output layer of CNNFis of size NTNRF
T×1which is the
vectorized version of the phases of FRF . Similarly, the size of
the output layer of CNNWis NRNRF
R×1. The complexity of
a CNN is directly proportional with the number of parameters
which, in our case, calculated as C2(2Ncv(wh+1) + 2(Nf c +
1) ·50
100 )[21]. Here C= 3 is the number of channels, w=
h= 2 is the filter size, Ncv = 32 is the number of filters,
Nfc = 1024 is the number of units in the fully connected
layer for %50 dropout probability. Hence the CNN structure
in Fig. 1 has 12105 parameters.
In data generation, Ndifferent realizations of channel
matrices H(n)for different user locations are generated to-
gether with the corresponding sets F(n)
RF and W(n)
RF . Then
for each realization, Lnoisy channel matrices are obtained
where the added element-wise synthetic noise is defined by
SNRTRAIN = 20 log10(|[H]i,j |2
σ2
TRAIN
). To account for the changes
in the wireless environment, we use three different SNRTRAIN
levels. Hence the total size of the training input data becomes
NR×NT×3×3NL. In order to obtain the output data the
problems in (6) and (7) are solved n, l. Then the output data
of each network is obtained. We summarize the algorithmic
steps of the training data generation in Algorithm 1.
V. NUMERICAL SIMULATIONS
In this section, we evaluate the performance of our CNN
framework (referred to as HBDL, Hybrid Beamforming via
Deep Learning) and compare it with the state-of-the-art tech-
niques such as SOMP [6] and PE-Alt-Min [7]. Uniform
square arrays are considered with half wavelength spacing with
NR=NT= 36 antennas. The number of analog beamformers
are NRF
R=NRF
T= 4. The feasible sets FRF ,WRF are used
for training only, and the output from CNN can be directly
used for analog beamforming since the analog beamformer
does not have to lie in the set of array response vectors.
The CNNs are fed with the training data generated for N=
L= 100. For each channel matrix realization, the propagation
environment is modeled with Nc= 4 and Nray = 5 for each
clusters with σ2
Θ= 5for all transmit and receive azimuth
and elevation angles which are uniform randomly selected
from the interval [60,60]and [20,20]respectively.
The proposed network is realized in MATLAB on a PC
with 768-core GPU. Stochastic gradient decent algorithm is
used to update the network parameters with the learning rate
0.005 and mini-batch size 500 for 200 epochs. As a loss
function, we use the negative log-likelihood or cross-entropy
loss [9]. In the training process, 70% and 30% of all data
generated are selected as the training and validation datasets,
respectively. Validation aids in hyperparameter tuning during
the training phase to avoid the network simply memorizing the
training data rather than learning general features for accurate
prediction with new data. The validation data is used to test the
performance of the network in the simulations for JT= 100
Monte Carlo trials. In order to prevent the similarity between
the test data and the training data we also add synthetic
noise to the test data where the SNR in testing is defined
4
-20 -15 -10 -5 0 5 10 15 20
SNR, [dB]
2
4
6
8
10
12
14
16
18
Spectral Efficiency [bits/s/Hz]
OPT
Best
HBDL
PE-Alt-Min
SOMP
-0.5 0 0.5
9.6
9.7
9.8
9.9
(a)
-20 -15 -10 -5 0 5 10 15 20
SNR, [dB]
0
5
10
15
20
25
30
35
Spectral Efficiency [bits/s/Hz]
OPT
Best
HBDL
PE-Alt-Min
SOMP
-0.1 0 0.1
18
18.2
18.4
18.6
(b)
-20 -15 -10 -5 0 5 10 15 20
SNR, [dB]
0
5
10
15
20
25
30
35
40
45
Spectral Efficiency [bits/s/Hz]
OPT
Best
HBDL
PE-Alt-Min
SOMP
(c)
Fig. 2. Spectral efficiency versus SNR for (a) NR=NT= 25,NS= 1; (b) NR=NT= 36,NS= 2; (c) NR=NT= 36,NS= 3.
similar to SNRTRAIN as SNRTEST = 20 log10(|[H]i,j |2
σ2
TEST
)and
SNRTRAIN ∈ {10,15,20}dB is selected.
In Fig. 2, the spectral efficiency for different algorithms is
presented for NS={1,2,3}and SNRTEST = 10dB. As it
is seen, HBDL provides better performance as compared to
the optimization-based method PE-Alt-Min and greedy-based
algorithm SOMP. The performance plot ”Best” denotes the
performance of the test data without prediction. We observe
that HBDL is very close to the best performance as well as the
fully-digital beamformer. HBDL effectively selects the analog
beamformers from the feasible sets which maximizes the spec-
tral efficiency. The effectiveness of HBDL is attributed to the
best selection of analog beamformers which are the optimum
solution of (4) through the SVD of the channel matrix [6].
SOMP has poor performance due the the fact that it cannot
select the ”best” set of array responses from the dictionary.
While PE-Alt-Min has sufficiently good performance, HBDL
performs better even when the output of PE-Alt-Min is inserted
to the feasible sets used for HBDL.
To compare the computation time of the algorithms we
consider the same settings and observe that HBDL spends
about 0.020s to compute both precoder and combiners whereas
SOMP and PE-Alt-Min take about 0.450s and 1.200s respec-
tively.
VI. CONCLUSIONS
In this work, a CNN framework is proposed for the joint
estimation of precoder and combiners in hybrid beamform-
ing problem. We show that the proposed network archi-
tecture provides better spectral efficiency as compared to
the optimization-based and greedy-based algorithm. In future
work, we reserve the case when the training data is small
where transfer learning-like approaches can be developed.
REFERENCES
[1] J. G. Andrews, S. Buzzi, W. Choi, S. V. Hanly, A. Lozano, A. C. K.
Soong, and J. C. Zhang, “What Will 5G Be?,IEEE Journal on Selected
Areas in Communications, vol. 32, pp. 1065–1082, June 2014.
[2] F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta,
O. Edfors, and F. Tufvesson, “Scaling Up MIMO: Opportunities and
Challenges with Very Large Arrays,” IEEE Signal Processing Magazine,
vol. 30, pp. 40–60, Jan 2013.
[3] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Channel Estima-
tion and Hybrid Precoding for Millimeter Wave Cellular Systems,” IEEE
Journal of Selected Topics in Signal Processing, vol. 8, pp. 831–846,
Oct 2014.
[4] A. Alkhateeb, G. Leus, and R. W. Heath, “Limited Feedback Hybrid
Precoding for Multi-User Millimeter Wave Systems,” IEEE Transactions
on Wireless Communications, vol. 14, pp. 6481–6494, Nov 2015.
[5] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Hybrid
precoding for millimeter wave cellular systems with partial channel
knowledge,” in 2013 Information Theory and Applications Workshop
(ITA), pp. 1–5, Feb 2013.
[6] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath,
“Spatially Sparse Precoding in Millimeter Wave MIMO Systems,” IEEE
Transactions on Wireless Communications, vol. 13, pp. 1499–1513,
March 2014.
[7] X. Yu, J. Shen, J. Zhang, and K. B. Letaief, “Alternating Minimization
Algorithms for Hybrid Precoding in Millimeter Wave MIMO Systems,”
IEEE Journal of Selected Topics in Signal Processing, vol. 10, pp. 485–
500, April 2016.
[8] D. Yu and L. Deng, “Deep learning and its applications to signal
and information processing [exploratory dsp],” IEEE Signal Processing
Magazine, vol. 28, pp. 145–154, Jan 2011.
[9] Y. Lecun, Y. Bengio, and G. Hinton, “Deep learning,” Nature, vol. 521,
no. 7553, pp. 436–444, 2015.
[10] A. Alkhateeb, S. Alex, P. Varkey, Y. Li, Q. Qu, and D. Tujkovic, “Deep
Learning Coordinated Beamforming for Highly-Mobile Millimeter Wave
Systems,” CoRR, vol. abs/1804.10334, 2018.
[11] H. Ye, G. Y. Li, and B. Juang, “Power of Deep Learning for Channel
Estimation and Signal Detection in OFDM Systems,” IEEE Wireless
Communications Letters, vol. 7, pp. 114–117, Feb 2018.
[12] H. Huang, J. Yang, H. Huang, Y. Song, and G. Gui, “Deep Learning
for Super-Resolution Channel Estimation and DOA Estimation Based
Massive MIMO System,IEEE Transactions on Vehicular Technology,
vol. 67, pp. 8549–8560, Sep. 2018.
[13] A. M. Elbir, K. V. Mishra, and Y. C. Eldar, “Cognitive Radar Antenna
Selection via Deep Learning,” IET Radar, Sonar & Navigation, January
2019.
[14] Y. Long, Z. Chen, J. Fang, and C. Tellambura, “Data-Driven-Based
Analog Beam Selection for Hybrid Beamforming Under mm-Wave
Channels,” IEEE Journal of Selected Topics in Signal Processing,
vol. 12, pp. 340–352, May 2018.
[15] S. Drner, S. Cammerer, J. Hoydis, and S. t. Brink, “Deep Learning
Based Communication Over the Air,” IEEE Journal of Selected Topics
in Signal Processing, vol. 12, pp. 132–143, Feb 2018.
[16] V. Raj and S. Kalyani, “Backpropagating Through the Air: Deep Learn-
ing at Physical Layer Without Channel Models,IEEE Communications
Letters, vol. 22, pp. 2278–2281, Nov 2018.
[17] C. Wen, W. Shih, and S. Jin, “Deep Learning for Massive MIMO CSI
Feedback,” IEEE Wireless Communications Letters, vol. 7, pp. 748–751,
Oct 2018.
[18] H. Huang, Y. Song, J. Yang, G. Gui, and F. Adachi, “Deep-Learning-
based Millimeter-Wave Massive MIMO for Hybrid Precoding,IEEE
Transactions on Vehicular Technology, pp. 1–1, 2019.
[19] R. Mndez-Rial, C. Rusu, A. Alkhateeb, N. Gonzlez-Prelcic, and R. W.
Heath, “Channel estimation and hybrid combining for mmWave: Phase
shifters or switches?,” in 2015 Information Theory and Applications
Workshop (ITA), pp. 90–97, Feb 2015.
[20] T. Kailath, B. Hassibi, and A. H. Sayed, Linear estimation . Upper
Saddle River, NJ : Prentice-Hall ; London : Prentice-Hall International,
2000. Includes bibliographical references (p. 817-839) and indexes.
[21] K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks
for Large-Scale Image Recognition,” CoRR, vol. abs/1409.1556, 2015.
... MmWave increases the carrier frequency significantly, resulting in a wide range of usable spectrum bands. Moreover, using microstrip technology allows a large number of antenna components to be crammed into a small space since working in the high-frequency region reduces the size of the antenna [40]. The mmWave MIMO system has a high bit error rate and increased system complexity despite achieving enhanced spectral efficiency (SE) and efficient power consumption, particularly when including a large number of data streams or users. ...
... The mmWave MIMO system has a high bit error rate and increased system complexity despite achieving enhanced spectral efficiency (SE) and efficient power consumption, particularly when including a large number of data streams or users. In addition, the fully digital precoding in mmWave system is costly and consumes a lot of power [40]. As a result, Hybrid analog/digital beamforming is the optimal approach for overcoming the constraints of pure digital or analog beamforming in both single and multiuser scenarios. ...
... In this section, the digital precoding in mmWave system has high cost and power consumption [40] while analog precoding has bad performance. Consequently, Hybrid beamforming is the better choice for overcoming the constraints of pure digital or analog beamforming in communication systems. ...
Article
Full-text available
Pattern synthesis is widely used in many radar and communication systems and received great interest. So, this paper proposes a new beamforming strategy based on a hybrid combination between grey wolf optimizer (GWO) with L2-norm called proposed GWO. This approach is applied to synthesized uniform linear arrays (ULA), Chebyshav arrays, and shaped pattern arrays. Moreover, it is utilized for side lobe level (SLL) and size reduction of antenna elements. In this strategy, the GWO is utilized to optimize the element spacing to adjust the half-power beam-width (HPBW) to save it the same as desired pattern. Furthermore, the excitations of the antenna elements are optimized via the L2-norm minimization problem. The proposed GWO has low complexity (fewer iterations and computing time) compared to other algorithms. In addition, it has a very accurate approximation of the original radiation pattern. As well, the computer simulation technology (CST) microwave package is utilized to achieve the practical validation of the proposed methodologies. As an application of the proposed GWO, it is employed to create a proposed hybrid beamforming (PHB) structure for Multi-input Multi-output (MIMO) systems. Consequently, the BS transmitting antennas are synthesized for gain maximization while utilizing the current amount of antenna elements. This results in considerable savings in antenna components and associated radio frequency (RF) chains which reduces system complexity. Furthermore, array gain maximization will increase the received signal-to-noise ratio (SNR). In addition, the SLL reduction scenario will decrease the interference from undesired users which in turn will also increase SNR. Hence, the performance of the system in terms of spectral efficiency (SE) and power utilization will be improved.
... In this realm, deep learning (DL) models, like artificial neural networks (ANNs), are employed for precise computations. Such endeavors are exemplified in [124], which investigates decentralized hybrid beamforming in 5G next-generation node base stations (gNodeBs). ...
Article
Full-text available
Rapid advancements in the development of smart terminals and infrastructure, coupled with a wide range of applications with complex requirements, are creating traffic demands that current networks may not be able to fully handle. Accordingly, the study of 6G networks deserves attention from both industry and academia. Artificial intelligence (AI) has emerged for application in the optimization and design process of new 6G networks. The developmental trend of 6G is towards effective resource management, along with the architectural improvement of the current network and hardware specifications. Cloud RAN (CRAN) is considered one of the major concepts in sixth- and fifth-generation wireless networks, being able to improve latency, capacity, and connectivity to huge numbers of devices. Besides bettering the current set-up in terms of setting the carriers’ network architecture and hardware specifications, among other potential enablers, the developmental trend of 6G also means that there must be effective resource management. As a result, this study covers a thorough analysis of resource management plans in CRAN, optimization, and AI taxonomy, and how AI integration might enhance existing resource management.
... Some numerical algorithms are implemented in last years to obtain feasible hybrid pre-coding such as manifold optimization (MO) [7], Karush-Kuhn-Tucker (KKT) [8] and orthogonal matching pursuit (OMP) [5] based algorithms. To accomplish optimal performance of HBF and reduction of time consumption Deep Learning based algorithms were proposed [9,10]. For hybrid pre-coding matrices designing, we need a precise assessment of Millimeter Wave-channel and an exigent task due to the highest quantities of antenna ratios at both the transceiver ends. ...
Article
Full-text available
This work aims to provide an effective hybrid beam forming method with Dual-Deep-Network to overcome overhead for mm-wave massive MIMO systems. In this paper, a Dual-Deep-Network technique is described for the extraction of statistical structures from a hybrid beam forming model based on mmWave logics, as well as training logic for the network map functions. The proposed approach of DDN is trained with proper data sequences used for communication and the training phase is conducted with the norms of numerous channel variants. With the nature of diverse channel states, a Dual-Deep-Network is required to manipulate the level of presence and abilities even after training as well. The performance level improvements are practically summarized in both the transmission and reception entities with the help of the proposed hybrid network architecture and the associated Dual Deep Network algorithm. Specifically, the BER versus SNR and spectral efficiency versus SNR are evaluated as well as the resulting accuracy levels are cross validated with numerous classical communication techniques. This paper shows the processing difficulties of the proposed approach and typically cross-validates with other beam forming logics. The computational cost and performance estimations are improved, and the metrics are clearly visualized on this paper based on improved beamforming procedures as well as the proposed approach of DDN based Multi-Resolution Code Book performance metrics are estimated clearly with proper mathematical model investigations. With 7Kbits/s/Hz and 1e-1, respectively, the key metrics of spectral efficiency and BER are enhanced.
... The fully digital precoding in mmwave system with large antennas arrays required high cost and power consumption [2], also the pure analog solutions have severe performance limitations. So, the best solution is hybrid analog / digital beamforming to overcome the limitations of pure digital or analog beamforming for single user and multi user scenario. ...
Article
Full-text available
Multiple input multiple output (MIMO) system with Millimeter Wave spectrum is currently used in most wireless applications and all cellular system to provides high data rates. with using large antenna array which is possible by decrease the wavelength to achieve high beamforming gain and improve the spectral efficiency. in this paper, used low complexity with hybrid precoding at the transmitting side and combining at the receiver side with limited feedback system, by using the concept of orthogonal matching pursuit (OMP) in single and multi-user cases, and compared the results with analog only beasmstring. The results of simulation showed that when used Minimum Mean Square Error (MMSE) precoders performed better than other hybrid precoding approaches, in addition the MMSE hybrid precoding /combining technique offers higher spectral efficiency compared with analog only beamstring.
Article
Full-text available
Hybrid combiner and precoder architectures, radio frequency (RF) chain, analog phase shifters, digital-to-analog converter (DAC), and analog-to-digital converter (ADC) are components of a millimeter wave cellular system. Prior works in the area of millimeter wave cellular system design employ receiver with infinite bit and large amount of RF chain that scales linearly with the quantity of transmitting and receiving antennas. This mode of design no doubt increases power demand or requirement of a typical millimeter wave system. In this work, hybrid architecture with few RF chains and small number of ADC bits are proposed and are used as candidate for millimeter wave channel estimation and cellular communication. In that connection, least square (LS), orthogonal matching pursuit (OMP), compressed sampling matching pursuit (CoSAMP), and deep learning (DL) techniques are utilized for analytical investigation. Indeed, computational results reveal that, when ADC consisting of uniform mid- rise quantizer is employed, the performance of 4 and 6 bits at signal-to-noise ratio (SNR) values of − 10 dB and 20 dB is at par with infinite bit (unquantized case). As a validation, DL compares favorably well with adaptive compressed sensing (ACS) technique previously used in the literature for channel estimation, while OMP and CoSAMP show better performance than ACS.
Article
Full-text available
Supporting high mobility in millimeter wave (mmWave) systems enables a wide range of important applications such as vehicular communications and wireless virtual/augmented reality. Realizing this in practice, though, requires overcoming several challenges. First, the use of narrow beams and the sensitivity of mmWave signals to blockage greatly impact the coverage and reliability of highly-mobile links. Second, highly-mobile users in dense mmWave deployments need to frequently hand-off between base stations (BSs), which is associated with critical control and latency overhead. Further, identifying the optimal beamforming vectors in large antenna array mmWave systems requires considerable training overhead, which significantly affects the efficiency of these mobile systems. In this paper, a novel integrated machine learning and coordinated beamforming solution is developed to overcome these challenges and enable highly-mobile mmWave applications. In the proposed solution, a number of distributed yet coordinating BSs simultaneously serve a mobile user. This user ideally needs to transmit only one uplink training pilot sequence that will be jointly received at the coordinating BSs using omni or quasi-omni beam patterns. These received signals draw a defining signature not only for the user location, but also for its interaction with the surrounding environment. The developed solution then leverages a deep learning model that learns how to use these signatures to predict the beamforming vectors at the BSs. This renders a comprehensive solution that supports highly-mobile mmWave applications with reliable coverage, low latency, and negligible training overhead. Extensive simulation results, based on accurate ray-tracing, show that the proposed deep-learning coordinated beamforming strategy approaches the achievable rate of the genie-aided solution that knows the optimal beamforming vectors with no training overhead. Compared to traditional beamforming solutions, the results show that the proposed deep learning based strategy attains higher rates, especially in high-mobility large-array regimes.
Article
Full-text available
Hybrid beamforming is a promising low-cost solution for large multiple-input multiple-output (MIMO) systems, where the base station (BS) is equipped with fewer radio frequency chains. In these systems, the selection of codeword for analog beamforming is essential to optimize the sum-rate performance. In this paper, based on machine learning, we propose a data-driven method of analog beam selection to achieve a near-optimal sum-rate with low complexity, which is highly dependent on training data. To be more specific, we take the beam selection problem as a multiclass-classification problem, where a large number of samples of millimeter-wave channels are considered as training data. By using the training data, we exploit support vector machine (SVM) to obtain a statistical classification model in terms of maximizing sum-rate. For real-time transmissions, with the derived classification model, we are able to select the optimal analog beam for each user with low complexity. Besides, we propose a novel method to determine the optimal parameter of Gaussian kernel function by resorting to McLaughlin expansion. Analysis and simulation results reveal that, as long as the training data is sufficient, the proposed data-driven method is able to achieve a near-optimal sum-rate performance, while the complexity reduces by several orders of magnitude, compared with the conventional method.
Article
Full-text available
Direction of arrival (DoA) estimation of targets improves with the number of elements employed by a phased array radar antenna. Since larger arrays have high associated cost, area and computational load, there is recent interest in thinning the antenna arrays without loss of far-field DoA accuracy. In this context, a cognitive radar may deploy a full array and then select an optimal subarray to transmit and receive the signals in response to changes in the target environment. Prior works have used optimization and greedy search methods to pick the best subarrays cognitively. In this paper, we leverage deep learning to address the antenna selection problem. Specifically, we construct a convolutional neural network (CNN) as a multi-class classification framework where each class designates a different subarray. The proposed network determines a new array every time data is received by the radar, thereby making antenna selection a cognitive operation. Our numerical experiments show that the proposed CNN structure outperforms existing random thinning and other machine learning approaches.
Article
Full-text available
This article presents our initial results in deep learning for channel estimation and signal detection in orthogonal frequency-division multiplexing (OFDM). OFDM has been widely adopted in wireless broadband communications to combat frequency-selective fading in wireless channels. In this article, we take advantage of deep learning in handling wireless OFDM channels in an end-to-end approach. Different from existing OFDM receivers that first estimate CSI explicitly and then detect/recover the transmitted symbols with the estimated CSI, our deep learning based approach estimates CSI implicitly and recovers the transmitted symbols directly. To address channel distortion, a deep learning model is first trained offline using the data generated from the simulation based on the channel statistics and then used for recovering the online transmitted data directly. From our simulation results, the deep learning based approach has the ability to address channel distortions and detect the transmitted symbols with performance comparable to minimum mean-square error (MMSE) estimator. Furthermore, the deep learning based approach is more robust than conventional methods when fewer training pilots are used, the cyclic prefix (CP) is omitted, and nonlinear clipping noise is presented. In summary, deep learning is a promising tool for channel estimation and signal detection in wireless communications with complicated channel distortions and interferences.
Article
Full-text available
End-to-end learning of communications systems is a fascinating novel concept that has so far only been validated by simulations for block-based transmissions. It allows learning of transmitter and receiver implementations as deep neural networks (NNs) that are optimized for an arbitrary differentiable end-to-end performance metric, e.g., block error rate (BLER). In this paper, we demonstrate that over-the-air transmissions are possible: We build, train, and run a complete communications system solely composed of NNs using unsynchronized off-the-shelf software-defined radios (SDRs) and open-source deep learning (DL) software libraries. We extend the existing ideas towards continuous data transmission which eases their current restriction to short block lengths but also entails the issue of receiver synchronization. We overcome this problem by introducing a frame synchronization module based on another NN. A comparison of the BLER performance of the "learned" system with that of a practical baseline shows competitive performance close to 1 dB, even without extensive hyperparameter tuning. We identify several practical challenges of training such a system over actual channels, in particular the missing channel gradient, and propose a two-step learning procedure based on the idea of transfer learning that circumvents this issue.
Article
Millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) has been regarded to be an emerging solution for the next generation of communications, in which hybrid analog and digital precoding is an important method for reducing the hardware complexity and energy consumption associated with mixed signal components. However, the fundamental limitations of the existing hybrid precoding schemes is that they have high computational complexity and fail to fully exploit the spatial information. To overcome these limitations, this paper proposes a deep-learning-enabled mmWave massive MIMO framework for effective hybrid precoding, in which each selection of the precoders for obtaining the optimized decoder is regarded as a mapping relation in the deep neural network (DNN). Specifically, the hybrid precoder is selected through training based on the DNN for optimizing precoding process of the mmWave massive MIMO. Additionally, we present extensive simulation results to validate the excellent performance of the proposed scheme. The results exhibit that the DNN-based approach is capable of minimizing the bit error ratio (BER) and enhancing spectrum efficiency of the mmWave massive MIMO, which achieves better performance in hybrid precoding compared with conventional schemes while substantially reducing the required computational complexity.
Article
Recent developments in applying deep learning techniques to train end-to-end communication systems have shown great promise in improving the overall performance of the system. However, most of the current methods for applying deep learning to train physical layer characteristics assume the availability of explicit channel model. Training a neural network requires the availability of functional form all the layers in the network to calculate gradients for optimization. The unavailability of gradients in a physical channel forced previous works to adopt simulation based strategies to train the network and then fine tune only the receiver part with actual channel. In this paper, we present a practical method to train an end-to-end communication system without relying on explicit channel models. By utilizing stochastic perturbation techniques, we show that the proposed method can train a deep learning based communication system in real channel without any assumption on channel models.
Article
The recent concept of massive multiple input multiple output (MIMO) can significantly improve the capacity of the communication network and it is regarded as a promising technology for the next generation wireless communications. However, the fundamental challenge of existing massive MIMO systems is that high computational complexity and complicated spatial structures bring great difficulties to exploit the characteristics of the channel and sparsity of these multi-antennas systems. To address this problem, in this paper, we focus on channel estimation and direction of arrivals (DOA) estimation, and a novel framework that integrates the massive MIMO into deep learning is proposed. To realize end-to-end performance, a deep neural network (DNN) is employed to conduct offline learning and online learning procedures, which is effective to learn the statistics of the wireless channel and the spatial structures in angle domain. Concretely, the DNN is first trained by simulated data in different channel condition with the aids of the offline learning, and then corresponding output data can be obtained based on current input data during online learning process. In order to realize super-resolution channel estimation and DOA estimation, two algorithms based on the deep learning are developed, in which the DOA can be estimated in angle domain without additional complexity directly. Furthermore, simulation results corroborate that the proposed deep learning based scheme can achieve better performance in terms of the DOA estimation and the channel estimation compared with conventional methods, and the proposed scheme is well investigated by extensive simulation in various cases for testing its robustness. IEEE
Article
In frequency division duplex mode, the downlink channel state information (CSI) should be conveyed to the base station through feedback links so that the potential gains of a massive multiple-input multiple-output can be exhibited. However, the excessive feedback overhead remains a bottleneck in this regime. In this letter, we use beep learning technology to develop CsiNet, a novel CSI sensing and recovery network that learns to effectively use channel structure from training samples. In particular, CsiNet learns a transformation from CSI to a near-optimal number of representations (codewords) and an inverse transformation from codewords to CSI. Experiments demonstrate that CsiNet can recover CSI with significantly improved reconstruction quality compared with existing compressive sensing (CS)-based methods. Even at excessively low compression regions where CS-based methods cannot work, CsiNet retains effective beamforming gain.