CNN-based Precoder and Combiner Design in mmWave MIMO Systems

May 2019
IEEE Communications Letters PP(99):1-1

DOI:10.1109/LCOMM.2019.2915977

Authors:

University of Luxembourg

Hybrid beamformer design is a crucial stage in millimeter wave (mmWave) MIMO systems. In this work, we propose a convolutional neural network (CNN) framework for the joint design of precoders and combiners. The proposed network takes the channel matrix as input and outputs the analog and baseband beamformers. Previous works usually rely on knowledge of the steering vectors of the array responses, which is not always accurately available in practice. The proposed CNN framework does not require such knowledge and provides higher capacity than conventional greedy and optimization-based algorithms.


Ahmet M. Elbir

Index Terms—mmWave, MIMO, Hybrid beamforming, Deep

learning, Convolutional neural network.

I. INTRODUCTION

Hybrid beamforming is a promising architecture for next-generation millimeter wave (mmWave) MIMO (Multiple Input Multiple Output) systems, where robust beamforming performance is provided at lower cost and with fewer fully-digital beamformers [1]–[4]. Several methods have been proposed to design the hybrid beamformers [3]–[7]. In [6], a greedy approach, orthogonal matching pursuit (OMP), is proposed, where the analog precoders and combiners are selected from a dictionary of transmit and receive array responses. This algorithm requires knowledge of the user direction-of-arrival/departure (DOA/DOD) angles to construct such a dictionary. Using the connection between the optimum and the hybrid beamformers, [7] proposes an alternating minimization approach to estimate the analog and baseband beamformers based on phase extraction.

The above works provide optimization-based and greedy solutions to the hybrid beamforming problem. However, the difficulty of reaching the optimum solution and the computation time are the main drawbacks of these techniques. To circumvent these issues, we consider deep learning (DL)-based techniques for the hybrid beamforming problem [8]. DL has several advantages, such as low computational complexity when solving optimization-based or combinatorial/greedy search problems and the ability to extrapolate new features from a limited set of features contained in a training set [9]. DL-based techniques have received a great deal of attention in the communications community for problems such as channel estimation [10]–[12], DOA estimation [12], antenna selection [13], and analog beam selection [14]. An end-to-end communication scenario is modeled in [15] and [16] using auto-encoders, where single-input single-output (SISO) systems are considered. Auto-encoders are also used in [17] for the channel state information (CSI) feedback problem. In [14], a sub-optimum method based on support vector machines (SVMs) is proposed for analog beamforming vector selection. Very recently, DL-based hybrid beamforming was considered in [18], where only the precoder design is addressed, whereas joint precoder and combiner design is needed in massive MIMO systems, where beamforming is required at both ends of the communication link [6]. The network architecture proposed in [18] is based on multi-layer perceptrons, which do not effectively extract the hidden features inherent in the input data [9], [13]. To achieve feature extraction and obtain better performance, we propose a convolutional neural network (CNN) framework for mmWave massive MIMO systems.

A.M.E. is with the Department of Electrical and Electronics Engineering, Duzce University, Duzce, Turkey. E-mail: ahmetmelbir@gmail.com.

In this study, we propose a framework with two CNNs, which are dedicated to estimating the analog precoders and combiners, respectively. The CNNs accept the channel matrix as input and give the beamformer weights at the output. To train the networks, we generate different channel realizations with synthetic noise added to each input. A decoupled optimization problem is formulated and solved to obtain the best beamformers, i.e., those providing the highest spectral efficiency. Using the best analog beamformers, we construct the input-output pairs of the network. Since the best beamformers are the optimum solution of the problem, the proposed CNN framework enjoys better spectral efficiency and less computation time. Furthermore, our CNN approach does not require knowledge of the array responses at the users' DOA/DOD angles, which are not always accurately available in practical scenarios.

II. SIGNAL MODEL AND PROBLEM FORMULATION

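In this work, we consider a single-user mmWave MIMO communication system with multiple antennas. Let $N_S$ be the number of data streams to be transmitted from the base station (BS) with $N_T$ transmit antennas to the user with $N_R$ antennas. The BS is equipped with $N_T^{\mathrm{RF}}$ analog phase shifters with analog beamformer $\mathbf{F}_{\mathrm{RF}} \in \mathbb{C}^{N_T \times N_T^{\mathrm{RF}}}$ and baseband beamformer $\mathbf{F}_{\mathrm{BB}} \in \mathbb{C}^{N_T^{\mathrm{RF}} \times N_S}$. Then the transmitted signal becomes $\mathbf{x} = \mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}} \mathbf{s}$, where $\mathbf{s} \in \mathbb{C}^{N_S}$ is the symbol vector to be transmitted and $E\{\mathbf{s}\mathbf{s}^H\} = \mathbf{I}_{N_S}/N_S$. The analog beamformers are unitary matrices with equal-norm elements, i.e., $[[\mathbf{F}_{\mathrm{RF}}]_{:,i} [\mathbf{F}_{\mathrm{RF}}]_{:,i}^H]_{i,i} = 1/N_T$, and the power constraint on the transmitter is $\|\mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}}\|_F^2 = N_S$. We can write the received signal at the $N_R$ antennas for a narrowband block-fading channel as

$$\mathbf{y} = \sqrt{\rho}\, \mathbf{H} \mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}} \mathbf{s} + \mathbf{n}, \tag{1}$$

where $\mathbf{y} \in \mathbb{C}^{N_R}$ and $\rho$ is the average received power. $\mathbf{n} \in \mathbb{C}^{N_R}$ denotes the additive white Gaussian noise (AWGN) with $\mathbf{n} \sim \mathcal{CN}(\mathbf{0}, \sigma_n^2 \mathbf{I}_{N_R})$, and $\mathbf{H} \in \mathbb{C}^{N_R \times N_T}$ is the channel matrix with $E\{\|\mathbf{H}\|_F^2\} = N_R N_T$. In mmWave transmission, the channel can be represented by the Saleh-Valenzuela (SV) model [19], a clustered channel model that sums the contributions of $N_c$ clusters of $N_{\mathrm{ray}}$ paths each:

$$\mathbf{H} = \gamma \sum_{i=1}^{N_c} \sum_{j=1}^{N_{\mathrm{ray}}} \alpha_{ij}\, g_R(\Theta_R^{(ij)})\, g_T(\Theta_T^{(ij)})\, \mathbf{a}_R(\Theta_R^{(ij)})\, \mathbf{a}_T^H(\Theta_T^{(ij)}),$$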

where $\Theta_R^{(ij)} = (\phi_R^{(ij)}, \theta_R^{(ij)})$ and $\Theta_T^{(ij)} = (\phi_T^{(ij)}, \theta_T^{(ij)})$ denote the angles of arrival and departure, respectively. The angular parameters $\phi$ and $\theta$ are the azimuth and elevation angles, respectively. $\gamma = \sqrt{N_T N_R/(N_c N_{\mathrm{ray}})}$ is the normalization factor, and $\alpha_{ij}$ is the complex channel gain associated with the $i$th scattering cluster and $j$th path for $i = 1, \dots, N_c$ and $j = 1, \dots, N_{\mathrm{ray}}$. $g_R(\Theta_R^{(ij)})$ and $g_T(\Theta_T^{(ij)})$ are the antenna element gains for the receive and transmit antennas, respectively. $\mathbf{a}_R(\Theta_R^{(ij)})$ and $\mathbf{a}_T(\Theta_T^{(ij)})$ are the $N_R \times 1$ and $N_T \times 1$ steering vectors representing the array responses at the receiver and the transmitter, respectively. The $n$th element of the steering vector $\mathbf{a}_R(\Theta_R^{(ij)})$ is given as

$$[\mathbf{a}_R(\Theta_R^{(ij)})]_n = \exp\left\{ -\mathrm{j} \frac{2\pi}{\lambda} \mathbf{p}_n^T \mathbf{r}(\Theta_R^{(ij)}) \right\},$$

where $\mathbf{p}_n = [x_n, y_n, z_n]^T$ is the position of the $n$th receive antenna in the Cartesian coordinate system and $\mathbf{r}(\Theta_R^{(ij)}) = [\sin(\phi_R^{(ij)}) \cos(\theta_R^{(ij)}), \sin(\phi_R^{(ij)}) \sin(\theta_R^{(ij)}), \cos(\theta_R^{(ij)})]^T$. The transmit-side steering vector $\mathbf{a}_T(\Theta_T^{(ij)})$ is defined in the same way as $\mathbf{a}_R(\Theta_R^{(ij)})$. To generate the labels (precoders and combiners), the proposed CNN framework requires perfect CSI. However, we use imperfect channel matrices in both the training and testing stages.
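The clustered channel model above can be sketched in a few lines of NumPy. This is only an illustrative sketch under assumptions that are not taken from the paper: unit antenna element gains ($g_R = g_T = 1$), path gains drawn as $\mathcal{CN}(0, 1)$, angles drawn independently per path (no per-cluster spread), a square array lying in the x-y plane, and positions expressed in wavelengths so that $\lambda$ drops out. The function names are hypothetical.

```python
import numpy as np

def square_array_positions(n_side, spacing=0.5):
    """Element positions (in wavelengths) of an n_side x n_side array in the x-y plane."""
    idx = np.arange(n_side) * spacing
    x, y = np.meshgrid(idx, idx, indexing="ij")
    return np.stack([x.ravel(), y.ravel(), np.zeros(n_side ** 2)], axis=1)

def steering_vector(positions, phi, theta):
    """[a]_n = exp(-j * 2*pi * p_n^T r(phi, theta)), positions given in wavelengths."""
    r = np.array([np.sin(phi) * np.cos(theta),
                  np.sin(phi) * np.sin(theta),
                  np.cos(theta)])
    return np.exp(-1j * 2 * np.pi * (positions @ r))

def sv_channel(pos_r, pos_t, n_clusters, n_rays, rng):
    """Sum of n_clusters * n_rays rank-one paths, scaled by gamma (unit element gains assumed)."""
    n_r, n_t = len(pos_r), len(pos_t)
    gamma = np.sqrt(n_t * n_r / (n_clusters * n_rays))
    H = np.zeros((n_r, n_t), dtype=complex)
    for _ in range(n_clusters * n_rays):
        # Assumed CN(0, 1) complex path gain.
        alpha = (rng.standard_normal() + 1j * rng.standard_normal()) / np.sqrt(2)
        # Assumed angle ranges: azimuth in [-60, 60] deg, elevation in [-20, 20] deg.
        phi_r, phi_t = rng.uniform(-np.pi / 3, np.pi / 3, size=2)
        theta_r, theta_t = rng.uniform(-np.pi / 9, np.pi / 9, size=2)
        a_r = steering_vector(pos_r, phi_r, theta_r)
        a_t = steering_vector(pos_t, phi_t, theta_t)
        H += alpha * np.outer(a_r, a_t.conj())
    return gamma * H
```

For example, `sv_channel(square_array_positions(6), square_array_positions(6), 4, 5, np.random.default_rng(0))` returns a 36 × 36 complex matrix. The steering vectors here have unit-modulus entries, exactly as in the element-wise definition above; any additional scaling needed to meet a particular channel normalization is left out of the sketch.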

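The transmitted signal is received and processed by the analog and baseband combiners as $\tilde{\mathbf{y}} = \mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{y}$, i.e.,

$$\tilde{\mathbf{y}} = \sqrt{\rho}\, \mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{H} \mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}} \mathbf{s} + \mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{n}, \tag{2}$$

where $\mathbf{W}_{\mathrm{RF}} \in \mathbb{C}^{N_R \times N_R^{\mathrm{RF}}}$ is the analog combiner with the constraint $[[\mathbf{W}_{\mathrm{RF}}]_{:,i} [\mathbf{W}_{\mathrm{RF}}]_{:,i}^H]_{i,i} = 1/N_R$ and $\mathbf{W}_{\mathrm{BB}} \in \mathbb{C}^{N_R^{\mathrm{RF}} \times N_S}$ denotes the baseband combiner matrix. Assuming that Gaussian symbols are transmitted through the mmWave channel, we can define the spectral efficiency achieved by hybrid beamforming [3]–[6] as

$$R_{\mathrm{HYB}} = \log_2 \left| \mathbf{I}_{N_S} + \frac{\rho}{N_S} \boldsymbol{\Lambda}_n^{-1} \mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{H} \mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}} \mathbf{F}_{\mathrm{BB}}^H \mathbf{F}_{\mathrm{RF}}^H \mathbf{H}^H \mathbf{W}_{\mathrm{RF}} \mathbf{W}_{\mathrm{BB}} \right|, \tag{3}$$

where $\boldsymbol{\Lambda}_n = \sigma_n^2 \mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{W}_{\mathrm{RF}} \mathbf{W}_{\mathrm{BB}} \in \mathbb{C}^{N_S \times N_S}$ is the covariance matrix of the noise term in (2) after combining.

Hence, the aim of this work is to estimate the hybrid beamformers $\mathbf{F}_{\mathrm{RF}}$, $\mathbf{F}_{\mathrm{BB}}$, $\mathbf{W}_{\mathrm{RF}}$ and $\mathbf{W}_{\mathrm{BB}}$ that maximize the spectral efficiency in (3) given the channel matrix $\mathbf{H}$.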
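The spectral efficiency in (3) translates directly into a short numerical routine. The following is a minimal sketch, assuming the $\rho/N_S$ scaling implied by $E\{\mathbf{s}\mathbf{s}^H\} = \mathbf{I}_{N_S}/N_S$; the function name and argument order are hypothetical, and `sigma2` stands for $\sigma_n^2$.

```python
import numpy as np

def spectral_efficiency(H, F_rf, F_bb, W_rf, W_bb, rho, sigma2):
    """Evaluate R_HYB = log2 det(I + (rho/N_S) * Lambda_n^{-1} * G G^H), G = W^H H F."""
    n_s = F_bb.shape[1]
    F = F_rf @ F_bb                      # overall precoder, N_T x N_S
    W = W_rf @ W_bb                      # overall combiner, N_R x N_S
    noise_cov = sigma2 * (W.conj().T @ W)   # Lambda_n, N_S x N_S
    G = W.conj().T @ H @ F               # effective channel, N_S x N_S
    M = np.eye(n_s) + (rho / n_s) * np.linalg.solve(noise_cov, G @ G.conj().T)
    # slogdet is numerically safer than log(det(.)) for larger N_S.
    _, logabsdet = np.linalg.slogdet(M)
    return float(logabsdet / np.log(2))
```

As a sanity check, a single stream over an identity channel with $\rho = \sigma_n^2 = 1$ and beamformers that pick the first antenna gives $\log_2(1 + 1) = 1$ bit/s/Hz.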

III. HYBRID BEAMFORMER DESIGN

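The optimization problem for the joint estimation of the hybrid beamformers $\hat{\mathbf{F}}_{\mathrm{RF}}, \hat{\mathbf{F}}_{\mathrm{BB}}, \hat{\mathbf{W}}_{\mathrm{RF}}, \hat{\mathbf{W}}_{\mathrm{BB}}$ can be stated as follows:

$$\underset{\mathbf{F}_{\mathrm{RF}}, \mathbf{F}_{\mathrm{BB}}, \mathbf{W}_{\mathrm{RF}}, \mathbf{W}_{\mathrm{BB}}}{\arg\max}\; R_{\mathrm{HYB}} \quad \text{s.t.: } \mathbf{F}_{\mathrm{RF}} \in \mathcal{F}_{\mathrm{RF}},\; \|\mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}}\|_F^2 = N_S,\; \mathbf{W}_{\mathrm{RF}} \in \mathcal{W}_{\mathrm{RF}}, \tag{4}$$

where $\mathcal{F}_{\mathrm{RF}}$ and $\mathcal{W}_{\mathrm{RF}}$ denote the feasible sets of analog beamformers which obey the constraints defined for $\mathbf{F}_{\mathrm{RF}}$ and $\mathbf{W}_{\mathrm{RF}}$. Obtaining a real-time solution to the problem in (4) is impractical due to the complexity introduced by several matrix variables. To cast the problem in (4) more effectively, we first define the sets $\mathcal{F}_{\mathrm{RF}}$ and $\mathcal{W}_{\mathrm{RF}}$. Note that the analog beamformers $\mathbf{F}_{\mathrm{RF}}, \mathbf{W}_{\mathrm{RF}}$ are related to the array responses $\mathbf{a}_T(\Theta_T^{(ij)}), \mathbf{a}_R(\Theta_R^{(ij)})$ through a linear transformation [6]. Hence the feasible RF beamformer set can be formed as $\mathcal{F}_{\mathrm{RF}} = \{\mathbf{F}_{\mathrm{RF}}^{(1)}, \dots, \mathbf{F}_{\mathrm{RF}}^{(Q_F)}\}$, where $\mathbf{F}_{\mathrm{RF}}^{(q_F)} = \mathbf{a}_T(\Theta_T^{(ij)})$, $i = 1, \dots, N_c$, $j = 1, \dots, N_{\mathrm{ray}}$ for $q_F = 1, \dots, Q_F$. $Q_F = \binom{N_{\mathrm{path}}}{N_T^{\mathrm{RF}}}$ is the number of RF precoder candidates and $N_{\mathrm{path}} = N_c N_{\mathrm{ray}}$. The feasible set for the RF combiner is similarly defined as $\mathcal{W}_{\mathrm{RF}} = \{\mathbf{W}_{\mathrm{RF}}^{(1)}, \dots, \mathbf{W}_{\mathrm{RF}}^{(Q_W)}\}$, where $\mathbf{W}_{\mathrm{RF}}^{(q_W)} = \mathbf{a}_R(\Theta_R^{(ij)})$, $i = 1, \dots, N_c$, $j = 1, \dots, N_{\mathrm{ray}}$ and $Q_W = \binom{N_{\mathrm{path}}}{N_R^{\mathrm{RF}}}$. Now we can present the joint precoder and combiner design problem as follows: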

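$$\begin{aligned}
\bar{q}_F, \bar{q}_W = \underset{q_F, q_W}{\arg\max}\; & \log_2 \left| \mathbf{I}_{N_S} + \frac{\rho}{N_S} \boldsymbol{\Lambda}_n^{-1} \mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{H} \mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}} \mathbf{F}_{\mathrm{BB}}^H \mathbf{F}_{\mathrm{RF}}^H \mathbf{H}^H \mathbf{W}_{\mathrm{RF}} \mathbf{W}_{\mathrm{BB}} \right|, \\
\text{s.t.: } & \mathbf{F}_{\mathrm{RF}} = \mathbf{F}_{\mathrm{RF}}^{(q_F)},\; \mathbf{W}_{\mathrm{RF}} = \mathbf{W}_{\mathrm{RF}}^{(q_W)},\; \mathbf{F}_{\mathrm{BB}} = (\mathbf{F}_{\mathrm{RF}}^H \mathbf{F}_{\mathrm{RF}})^{-1} \mathbf{F}_{\mathrm{RF}}^H \mathbf{F}_{\mathrm{opt}}, \\
& \mathbf{W}_{\mathrm{BB}} = (\mathbf{W}_{\mathrm{RF}}^H \boldsymbol{\Lambda} \mathbf{W}_{\mathrm{RF}})^{-1} (\mathbf{W}_{\mathrm{RF}}^H \boldsymbol{\Lambda} \mathbf{W}_{\mathrm{opt}}),
\end{aligned} \tag{5}$$

where $\bar{q}_F, \bar{q}_W$ represent the selected elements in the feasible sets. $\boldsymbol{\Lambda}$ is the covariance of the array output in (1), which is given by $\boldsymbol{\Lambda} = \frac{\rho}{N_S} \mathbf{H} \mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}} \mathbf{F}_{\mathrm{BB}}^H \mathbf{F}_{\mathrm{RF}}^H \mathbf{H}^H + \sigma_n^2 \mathbf{I}_{N_R}$. $\mathbf{F}_{\mathrm{opt}}$ and $\mathbf{W}_{\mathrm{opt}}$ are the optimum (unconstrained) beamformers, which can be obtained from the singular value decomposition (SVD) of the channel matrix. Let $\mathbf{U} \in \mathbb{C}^{N_R \times \mathrm{rank}(\mathbf{H})}$ and $\mathbf{V} \in \mathbb{C}^{N_T \times \mathrm{rank}(\mathbf{H})}$ be the left and right singular vector matrices of $\mathbf{H}$, respectively, so that the SVD of $\mathbf{H} \in \mathbb{C}^{N_R \times N_T}$ is $\mathbf{H} = \mathbf{U} \boldsymbol{\Sigma} \mathbf{V}^H$, where $\boldsymbol{\Sigma}$ is a $\mathrm{rank}(\mathbf{H}) \times \mathrm{rank}(\mathbf{H})$ matrix composed of the singular values of $\mathbf{H}$ in descending order. By decomposing $\boldsymbol{\Sigma}$ and $\mathbf{V}$ as $\boldsymbol{\Sigma} = \mathrm{diag}\{\boldsymbol{\Sigma}^{(1)}, \boldsymbol{\Sigma}^{(2)}\}$ and $\mathbf{V} = [\mathbf{V}^{(1)}, \mathbf{V}^{(2)}]$, where $\mathbf{V}^{(1)} \in \mathbb{C}^{N_T \times N_S}$ and $\mathbf{V}^{(2)} \in \mathbb{C}^{N_T \times (N_R - N_S)}$, one can readily select the unconstrained precoder as $\mathbf{F}_{\mathrm{opt}} = \mathbf{V}^{(1)}$ [6]. Using the unconstrained beamformer $\mathbf{F}_{\mathrm{opt}}$, $\mathbf{W}_{\mathrm{opt}}$ can be computed as [20]

$$\mathbf{W}_{\mathrm{opt}} = \left( \frac{1}{\rho} \left( \mathbf{F}_{\mathrm{opt}}^H \mathbf{H}^H \mathbf{H} \mathbf{F}_{\mathrm{opt}} + \frac{N_S \sigma_n^2}{\rho} \mathbf{I}_{N_S} \right)^{-1} \mathbf{F}_{\mathrm{opt}}^H \mathbf{H}^H \right)^H.$$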
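The SVD-based construction of $\mathbf{F}_{\mathrm{opt}}$ and $\mathbf{W}_{\mathrm{opt}}$, together with the least-squares baseband precoder used in the constraints of (5), might be sketched as follows. The function names are hypothetical. The sketch does not rescale $\mathbf{F}_{\mathrm{BB}}$ to meet $\|\mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}}\|_F^2 = N_S$; that step would be an additional assumption.

```python
import numpy as np

def unconstrained_beamformers(H, n_s, rho, sigma2):
    """F_opt from the first n_s right singular vectors; W_opt from the formula in the text."""
    _, _, Vh = np.linalg.svd(H)          # singular values are returned in descending order
    F_opt = Vh.conj().T[:, :n_s]         # N_T x N_S
    G = H @ F_opt                        # effective channel, N_R x N_S
    A = G.conj().T @ G + (n_s * sigma2 / rho) * np.eye(n_s)
    W_opt = (np.linalg.solve(A, G.conj().T) / rho).conj().T   # N_R x N_S
    return F_opt, W_opt

def baseband_precoder(F_rf, F_opt):
    """Least-squares solution F_BB = (F_RF^H F_RF)^{-1} F_RF^H F_opt."""
    return np.linalg.lstsq(F_rf, F_opt, rcond=None)[0]
```

Using `lstsq` rather than forming $(\mathbf{F}_{\mathrm{RF}}^H \mathbf{F}_{\mathrm{RF}})^{-1}$ explicitly gives the same result when $\mathbf{F}_{\mathrm{RF}}$ has full column rank and behaves more gracefully when two candidate steering vectors are nearly collinear.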

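Solving (5) requires visiting $Q_F Q_W$ nodes, which is computationally prohibitive. To reduce the complexity, (5) is decomposed into two separate problems, where the precoders ($\mathbf{F}_{\mathrm{RF}}$ and $\mathbf{F}_{\mathrm{BB}}$) and the combiners ($\mathbf{W}_{\mathrm{RF}}$ and $\mathbf{W}_{\mathrm{BB}}$) are estimated separately. By doing so, the complexity is reduced from $Q_F Q_W$ to $Q_F + Q_W$. To find the precoders and the combiners, we solve the following problems:

$$\begin{aligned}
\bar{q}_F = \underset{q_F}{\arg\max}\; & \log_2 \left| \mathbf{I}_{N_S} + \frac{\rho}{N_S \sigma_n^2} (\mathbf{W}_{\mathrm{opt}}^H \mathbf{W}_{\mathrm{opt}})^{-1} \mathbf{W}_{\mathrm{opt}}^H \mathbf{H} \mathbf{F}_{\mathrm{RF}} \mathbf{F}_{\mathrm{BB}} \mathbf{F}_{\mathrm{BB}}^H \mathbf{F}_{\mathrm{RF}}^H \mathbf{H}^H \mathbf{W}_{\mathrm{opt}} \right|, \\
\text{s.t.: } & \mathbf{F}_{\mathrm{RF}} = \mathbf{F}_{\mathrm{RF}}^{(q_F)},\; \mathbf{F}_{\mathrm{BB}} = (\mathbf{F}_{\mathrm{RF}}^H \mathbf{F}_{\mathrm{RF}})^{-1} \mathbf{F}_{\mathrm{RF}}^H \mathbf{F}_{\mathrm{opt}},
\end{aligned} \tag{6}$$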

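Algorithm 1 Training data generation.
Input: $L$, $N$, $N_T$, $N_R$, $N_T^{\mathrm{RF}}$, $N_R^{\mathrm{RF}}$, $\mathrm{SNR}_{\mathrm{TRAIN}}$.
Output: Training data $\mathcal{D}_F$ and $\mathcal{D}_W$.
1: Generate $\{\mathbf{H}^{(n)}\}_{n=1}^{N}$ with $\{\mathcal{F}_{\mathrm{RF}}^{(n)}\}_{n=1}^{N}$ and $\{\mathcal{W}_{\mathrm{RF}}^{(n)}\}_{n=1}^{N}$.
2: for $1 \le n \le N$ and $1 \le l \le L$ do
3: $[\mathbf{H}^{(l,n)}]_{i,j} \sim \mathcal{CN}([\mathbf{H}^{(n)}]_{i,j}, \sigma_{\mathrm{TRAIN}}^2)$.
4: Find $\bar{q}_F$ by solving (6) for $\mathbf{F}_{\mathrm{RF}}^{(q_F,l,n)}$, $1 \le q_F \le Q_F$.
5: Construct $\hat{\mathbf{F}}_{\mathrm{RF}}^{(l,n)}$ and $\mathbf{F}_{\mathrm{BB}}^{(l,n)}$ from $\mathbf{F}_{\mathrm{RF}}^{(\bar{q}_F,l,n)}$.
6: Find $\bar{q}_W$ by solving (7) for $\mathbf{W}_{\mathrm{RF}}^{(q_W,l,n)}$, $1 \le q_W \le Q_W$.
7: Construct $\hat{\mathbf{W}}_{\mathrm{RF}}^{(l,n)}$ and $\mathbf{W}_{\mathrm{BB}}^{(l,n)}$ from $\mathbf{W}_{\mathrm{RF}}^{(\bar{q}_W,l,n)}$.
8: $[[\mathbf{X}^{(l,n)}]_{:,:,1}]_{i,j} = |[\mathbf{H}^{(l,n)}]_{i,j}|$.
9: $[[\mathbf{X}^{(l,n)}]_{:,:,2}]_{i,j} = \mathrm{Re}\{[\mathbf{H}^{(l,n)}]_{i,j}\}$.
10: $[[\mathbf{X}^{(l,n)}]_{:,:,3}]_{i,j} = \mathrm{Im}\{[\mathbf{H}^{(l,n)}]_{i,j}\}$ $\forall i, j$.
11: $\mathbf{z}_F^{(l,n)} = \angle \mathrm{vec}\{\hat{\mathbf{F}}_{\mathrm{RF}}^{(l,n)}\}$, $\mathbf{z}_W^{(l,n)} = \angle \mathrm{vec}\{\hat{\mathbf{W}}_{\mathrm{RF}}^{(l,n)}\}$.
12: end for
13: Training data for CNN$_F$ and CNN$_W$ is obtained as
$\mathcal{D}_F = ((\mathbf{X}^{(1,1)}, \mathbf{z}_F^{(1,1)}), \dots, (\mathbf{X}^{(L,N)}, \mathbf{z}_F^{(L,N)}))$,
$\mathcal{D}_W = ((\mathbf{X}^{(1,1)}, \mathbf{z}_W^{(1,1)}), \dots, (\mathbf{X}^{(L,N)}, \mathbf{z}_W^{(L,N)}))$.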
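Steps 8 to 11 of Algorithm 1 (building the three-channel input and the phase labels) can be sketched as below. The function names are hypothetical. The inverse mapping `phases_to_beamformer` is not stated in the paper; it is an assumption that the predicted phases are turned back into an analog beamformer with entries of modulus $1/\sqrt{N}$, in line with the constraint on $\mathbf{F}_{\mathrm{RF}}$.

```python
import numpy as np

def make_input(H):
    """Stack |H|, Re{H}, Im{H} into an N_R x N_T x 3 real tensor (steps 8-10)."""
    return np.stack([np.abs(H), H.real, H.imag], axis=-1)

def make_label(F_rf):
    """Phases of the column-stacked (vec) analog beamformer (step 11)."""
    return np.angle(F_rf.reshape(-1, order="F"))

def phases_to_beamformer(z, n_ant, n_rf):
    """Assumed inverse of make_label: unit-phase entries scaled by 1/sqrt(n_ant)."""
    return np.exp(1j * z).reshape((n_ant, n_rf), order="F") / np.sqrt(n_ant)
```

The `order="F"` argument makes the reshape column-major, which is what the $\mathrm{vec}\{\cdot\}$ operator denotes; using the NumPy default (row-major) in one direction and column-major in the other would silently permute the phase entries.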

Fig. 1. The proposed CNN framework for precoder (CNN$_F$ at the top) and combiner design (CNN$_W$ at the bottom).

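$$\begin{aligned}
\bar{q}_W = \underset{q_W}{\arg\max}\; & \log_2 \left| \mathbf{I}_{N_S} + \frac{\rho}{N_S \sigma_n^2} (\mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{W}_{\mathrm{RF}} \mathbf{W}_{\mathrm{BB}})^{-1} \mathbf{W}_{\mathrm{BB}}^H \mathbf{W}_{\mathrm{RF}}^H \mathbf{H} \mathbf{F}_{\mathrm{opt}} \mathbf{F}_{\mathrm{opt}}^H \mathbf{H}^H \mathbf{W}_{\mathrm{RF}} \mathbf{W}_{\mathrm{BB}} \right|, \\
\text{s.t.: } & \mathbf{W}_{\mathrm{RF}} = \mathbf{W}_{\mathrm{RF}}^{(q_W)},\; \mathbf{W}_{\mathrm{BB}} = (\mathbf{W}_{\mathrm{RF}}^H \boldsymbol{\Lambda} \mathbf{W}_{\mathrm{RF}})^{-1} (\mathbf{W}_{\mathrm{RF}}^H \boldsymbol{\Lambda} \mathbf{W}_{\mathrm{opt}}), \\
& \boldsymbol{\Lambda} = \frac{\rho}{N_S} \mathbf{H} \mathbf{F}_{\mathrm{opt}} \mathbf{F}_{\mathrm{opt}}^H \mathbf{H}^H + \sigma_n^2 \mathbf{I}_{N_R}.
\end{aligned} \tag{7}$$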

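Once (6) and (7) are solved, the analog beamformers are constructed as $\hat{\mathbf{F}}_{\mathrm{RF}} = \mathbf{F}_{\mathrm{RF}}^{(\bar{q}_F)}$ and $\hat{\mathbf{W}}_{\mathrm{RF}} = \mathbf{W}_{\mathrm{RF}}^{(\bar{q}_W)}$. The baseband beamformers can also be obtained accordingly.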
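The precoder search in (6) amounts to an exhaustive loop over all ways of picking $N_T^{\mathrm{RF}}$ steering vectors out of $N_{\mathrm{path}}$. A minimal sketch follows, assuming the candidates are supplied as the columns of a matrix `A_t` (a hypothetical name, as is `select_precoder`) and reading each candidate $\mathbf{F}_{\mathrm{RF}}^{(q_F)}$ as a matrix whose columns are a subset of those steering vectors. The combiner search in (7) would follow the same pattern.

```python
import itertools
import numpy as np

def select_precoder(H, A_t, n_rf, n_s, rho, sigma2):
    """Exhaustive search over column subsets of A_t, scoring each with the objective in (6)."""
    _, _, Vh = np.linalg.svd(H)
    F_opt = Vh.conj().T[:, :n_s]
    G = H @ F_opt
    A = G.conj().T @ G + (n_s * sigma2 / rho) * np.eye(n_s)
    W_opt = (np.linalg.solve(A, G.conj().T) / rho).conj().T
    W_pinv = np.linalg.pinv(W_opt)       # equals (W^H W)^{-1} W^H for full column rank
    best_rate, best_cols, best_F_rf = -np.inf, None, None
    for cols in itertools.combinations(range(A_t.shape[1]), n_rf):
        F_rf = A_t[:, list(cols)]
        F_bb = np.linalg.lstsq(F_rf, F_opt, rcond=None)[0]   # least-squares baseband precoder
        E = H @ F_rf @ F_bb
        M = np.eye(n_s) + rho / (n_s * sigma2) * (W_pinv @ E @ E.conj().T @ W_opt)
        rate = np.linalg.slogdet(M)[1] / np.log(2)
        if rate > best_rate:
            best_rate, best_cols, best_F_rf = rate, cols, F_rf
    return best_rate, best_cols, best_F_rf
```

The loop visits $\binom{N_{\mathrm{path}}}{N_T^{\mathrm{RF}}}$ candidates, so it is only practical offline for label generation, which is how the decoupled problems are used in this work.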

IV. CNN-BASED APPROACH

In this section, we present our CNN framework for joint precoder and combiner design, which is shown in Fig. 1. The proposed framework is composed of two CNNs with 8 layers each, which have identical structures except for the last layer. The first layer is the input layer of size $N_R \times N_T \times 3$ with $c = 3$ channels. The first channel of the input is the element-wise absolute value of the channel matrix, $[[\mathbf{X}]_{:,:,1}]_{i,j} = |[\mathbf{H}]_{i,j}|$. The second and third channels are the real and imaginary parts of the channel matrix, $[[\mathbf{X}]_{:,:,2}]_{i,j} = \mathrm{Re}\{[\mathbf{H}]_{i,j}\}$ and $[[\mathbf{X}]_{:,:,3}]_{i,j} = \mathrm{Im}\{[\mathbf{H}]_{i,j}\}$. The second and third layers are convolutional layers with 32 filters of size $2 \times 2$. The fourth and sixth layers are fully connected layers with 1024 units. Each fully connected layer is followed by a dropout layer (the fifth and seventh layers) with 50% probability. The output layer of CNN$_F$ is of size $N_T N_T^{\mathrm{RF}} \times 1$, which is the vectorized version of the phases of $\mathbf{F}_{\mathrm{RF}}$. Similarly, the size of the output layer of CNN$_W$ is $N_R N_R^{\mathrm{RF}} \times 1$. The complexity of a CNN is directly proportional to the number of parameters, which in our case is calculated as $C^2 \left( 2 N_{cv}(wh + 1) + 2(N_{fc} + 1) \cdot \frac{50}{100} \right)$ [21]. Here $C = 3$ is the number of channels, $w = h = 2$ is the filter size, $N_{cv} = 32$ is the number of filters, and $N_{fc} = 1024$ is the number of units in the fully connected layers, with the factor $\frac{50}{100}$ accounting for the 50% dropout probability. Hence the CNN structure in Fig. 1 has 12105 parameters.
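Substituting the stated values into the parameter-count expression reproduces the quoted figure:

$$C^2 \left( 2 N_{cv}(wh + 1) + 2(N_{fc} + 1) \cdot \tfrac{50}{100} \right) = 3^2 \left( 2 \cdot 32 \cdot (2 \cdot 2 + 1) + 2 \cdot 1025 \cdot 0.5 \right) = 9\,(320 + 1025) = 9 \cdot 1345 = 12105.$$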

In data generation, $N$ different realizations of the channel matrix $\mathbf{H}^{(n)}$ for different user locations are generated together with the corresponding sets $\mathcal{F}_{\mathrm{RF}}^{(n)}$ and $\mathcal{W}_{\mathrm{RF}}^{(n)}$. Then, for each realization, $L$ noisy channel matrices are obtained, where the added element-wise synthetic noise is defined by $\mathrm{SNR}_{\mathrm{TRAIN}} = 20 \log_{10}\left( \frac{|[\mathbf{H}]_{i,j}|^2}{\sigma_{\mathrm{TRAIN}}^2} \right)$. To account for changes in the wireless environment, we use three different $\mathrm{SNR}_{\mathrm{TRAIN}}$ levels. Hence the total size of the training input data becomes $N_R \times N_T \times 3 \times 3NL$. To obtain the output data, the problems in (6) and (7) are solved $\forall n, l$, which yields the output data of each network. We summarize the algorithmic steps of the training data generation in Algorithm 1.
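The noisy-copy generation can be sketched by inverting the $\mathrm{SNR}_{\mathrm{TRAIN}}$ definition for the per-element noise variance, $\sigma_{\mathrm{TRAIN}}^2 = |[\mathbf{H}]_{i,j}|^2 \cdot 10^{-\mathrm{SNR}_{\mathrm{TRAIN}}/20}$. This is an illustrative sketch with a hypothetical function name; it assumes circularly symmetric complex Gaussian noise, so the variance is split equally between the real and imaginary parts.

```python
import numpy as np

def noisy_realizations(H, snr_db, n_copies, rng):
    """Return n_copies noisy versions of H, shape (n_copies, N_R, N_T)."""
    # Per-element variance obtained from SNR = 20*log10(|H_ij|^2 / sigma^2).
    sigma2 = np.abs(H) ** 2 * 10.0 ** (-snr_db / 20.0)
    std = np.sqrt(sigma2 / 2.0)          # per real/imaginary component
    shape = (n_copies,) + H.shape
    noise = std * (rng.standard_normal(shape) + 1j * rng.standard_normal(shape))
    return H + noise
```

Calling this once per SNR level in {10, 15, 20} dB with `n_copies = L` and concatenating over the $N$ realizations gives the $3NL$ training inputs mentioned above. Because the variance scales with $|[\mathbf{H}]_{i,j}|^2$, entries of $\mathbf{H}$ that are exactly zero stay noise-free under this definition.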

V. NUMERICAL SIMULATIONS

In this section, we evaluate the performance of our CNN framework (referred to as HBDL, Hybrid Beamforming via Deep Learning) and compare it with state-of-the-art techniques such as SOMP [6] and PE-Alt-Min [7]. Uniform square arrays with half-wavelength spacing and $N_R = N_T = 36$ antennas are considered. The number of analog beamformers is $N_R^{\mathrm{RF}} = N_T^{\mathrm{RF}} = 4$. The feasible sets $\mathcal{F}_{\mathrm{RF}}, \mathcal{W}_{\mathrm{RF}}$ are used for training only, and the output of the CNN can be used directly for analog beamforming, since the analog beamformer does not have to lie in the set of array response vectors. The CNNs are fed with the training data generated for $N = L = 100$. For each channel matrix realization, the propagation environment is modeled with $N_c = 4$ clusters and $N_{\mathrm{ray}} = 5$ paths per cluster, with $\sigma_\Theta^2 = 5^\circ$ for all transmit and receive azimuth and elevation angles, which are selected uniformly at random from the intervals $[-60^\circ, 60^\circ]$ and $[-20^\circ, 20^\circ]$, respectively.
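With these settings, the size of the search in Section III and of the training set can be worked out directly:

$$N_{\mathrm{path}} = N_c N_{\mathrm{ray}} = 4 \cdot 5 = 20, \qquad Q_F = Q_W = \binom{20}{4} = \frac{20 \cdot 19 \cdot 18 \cdot 17}{4!} = 4845,$$

$$Q_F Q_W = 4845^2 = 23\,474\,025 \quad \text{versus} \quad Q_F + Q_W = 9690, \qquad 3NL = 3 \cdot 100 \cdot 100 = 30\,000 \text{ training inputs}.$$

The decoupling of (5) into (6) and (7) therefore cuts the number of candidates visited per channel realization by a factor of roughly 2400.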

The proposed network is realized in MATLAB on a PC with a 768-core GPU. The stochastic gradient descent algorithm is used to update the network parameters with a learning rate of 0.005 and a mini-batch size of 500 for 200 epochs. As a loss function, we use the negative log-likelihood or cross-entropy loss [9]. In the training process, 70% and 30% of all generated data are selected as the training and validation datasets, respectively. Validation aids in hyperparameter tuning during the training phase, to prevent the network from simply memorizing the training data rather than learning general features for accurate prediction with new data. The validation data is used to test the performance of the network in the simulations for $J_T = 100$ Monte Carlo trials. To prevent similarity between the test data and the training data, we also add synthetic noise to the test data, where the SNR in testing is defined similarly to $\mathrm{SNR}_{\mathrm{TRAIN}}$ as $\mathrm{SNR}_{\mathrm{TEST}} = 20 \log_{10}\left( \frac{|[\mathbf{H}]_{i,j}|^2}{\sigma_{\mathrm{TEST}}^2} \right)$, and $\mathrm{SNR}_{\mathrm{TRAIN}} \in \{10, 15, 20\}$ dB is selected.

Fig. 2. Spectral efficiency [bits/s/Hz] versus SNR [dB] for (a) $N_R = N_T = 25$, $N_S = 1$; (b) $N_R = N_T = 36$, $N_S = 2$; (c) $N_R = N_T = 36$, $N_S = 3$. Curves shown: OPT, Best, HBDL, PE-Alt-Min, SOMP.

In Fig. 2, the spectral efficiency of the different algorithms is presented for $N_S \in \{1, 2, 3\}$ and $\mathrm{SNR}_{\mathrm{TEST}} = 10$ dB. As can be seen, HBDL provides better performance than the optimization-based method PE-Alt-Min and the greedy algorithm SOMP. The curve labeled "Best" denotes the performance of the test data without prediction. We observe that HBDL is very close to the best performance as well as to the fully-digital beamformer. HBDL effectively selects the analog beamformers from the feasible sets that maximize the spectral efficiency. The effectiveness of HBDL is attributed to the best selection of analog beamformers, which are the optimum solution of (4) through the SVD of the channel matrix [6]. SOMP performs poorly because it cannot select the "best" set of array responses from the dictionary. While PE-Alt-Min performs sufficiently well, HBDL performs better even when the output of PE-Alt-Min is inserted into the feasible sets used for HBDL.

To compare the computation time of the algorithms, we consider the same settings and observe that HBDL spends about 0.020 s to compute both the precoder and the combiners, whereas SOMP and PE-Alt-Min take about 0.450 s and 1.200 s, respectively.

VI. CONCLUSIONS

In this work, a CNN framework is proposed for the joint estimation of precoders and combiners in the hybrid beamforming problem. We show that the proposed network architecture provides better spectral efficiency than the optimization-based and greedy algorithms. We leave for future work the case of limited training data, for which transfer-learning-like approaches can be developed.

REFERENCES

[1] J. G. Andrews, S. Buzzi, W. Choi, S. V. Hanly, A. Lozano, A. C. K. Soong, and J. C. Zhang, “What Will 5G Be?,” IEEE Journal on Selected Areas in Communications, vol. 32, pp. 1065–1082, June 2014.
[2] F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta, O. Edfors, and F. Tufvesson, “Scaling Up MIMO: Opportunities and Challenges with Very Large Arrays,” IEEE Signal Processing Magazine, vol. 30, pp. 40–60, Jan 2013.
[3] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Channel Estimation and Hybrid Precoding for Millimeter Wave Cellular Systems,” IEEE Journal of Selected Topics in Signal Processing, vol. 8, pp. 831–846, Oct 2014.
[4] A. Alkhateeb, G. Leus, and R. W. Heath, “Limited Feedback Hybrid Precoding for Multi-User Millimeter Wave Systems,” IEEE Transactions on Wireless Communications, vol. 14, pp. 6481–6494, Nov 2015.
[5] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Hybrid precoding for millimeter wave cellular systems with partial channel knowledge,” in 2013 Information Theory and Applications Workshop (ITA), pp. 1–5, Feb 2013.
[6] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially Sparse Precoding in Millimeter Wave MIMO Systems,” IEEE Transactions on Wireless Communications, vol. 13, pp. 1499–1513, March 2014.
[7] X. Yu, J. Shen, J. Zhang, and K. B. Letaief, “Alternating Minimization Algorithms for Hybrid Precoding in Millimeter Wave MIMO Systems,” IEEE Journal of Selected Topics in Signal Processing, vol. 10, pp. 485–500, April 2016.
[8] D. Yu and L. Deng, “Deep learning and its applications to signal and information processing [exploratory dsp],” IEEE Signal Processing Magazine, vol. 28, pp. 145–154, Jan 2011.
[9] Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature, vol. 521, no. 7553, pp. 436–444, 2015.
[10] A. Alkhateeb, S. Alex, P. Varkey, Y. Li, Q. Qu, and D. Tujkovic, “Deep Learning Coordinated Beamforming for Highly-Mobile Millimeter Wave Systems,” CoRR, vol. abs/1804.10334, 2018.
[11] H. Ye, G. Y. Li, and B. Juang, “Power of Deep Learning for Channel Estimation and Signal Detection in OFDM Systems,” IEEE Wireless Communications Letters, vol. 7, pp. 114–117, Feb 2018.
[12] H. Huang, J. Yang, H. Huang, Y. Song, and G. Gui, “Deep Learning for Super-Resolution Channel Estimation and DOA Estimation Based Massive MIMO System,” IEEE Transactions on Vehicular Technology, vol. 67, pp. 8549–8560, Sep. 2018.
[13] A. M. Elbir, K. V. Mishra, and Y. C. Eldar, “Cognitive Radar Antenna Selection via Deep Learning,” IET Radar, Sonar & Navigation, January 2019.
[14] Y. Long, Z. Chen, J. Fang, and C. Tellambura, “Data-Driven-Based Analog Beam Selection for Hybrid Beamforming Under mm-Wave Channels,” IEEE Journal of Selected Topics in Signal Processing, vol. 12, pp. 340–352, May 2018.
[15] S. Dörner, S. Cammerer, J. Hoydis, and S. ten Brink, “Deep Learning Based Communication Over the Air,” IEEE Journal of Selected Topics in Signal Processing, vol. 12, pp. 132–143, Feb 2018.
[16] V. Raj and S. Kalyani, “Backpropagating Through the Air: Deep Learning at Physical Layer Without Channel Models,” IEEE Communications Letters, vol. 22, pp. 2278–2281, Nov 2018.
[17] C. Wen, W. Shih, and S. Jin, “Deep Learning for Massive MIMO CSI Feedback,” IEEE Wireless Communications Letters, vol. 7, pp. 748–751, Oct 2018.
[18] H. Huang, Y. Song, J. Yang, G. Gui, and F. Adachi, “Deep-Learning-based Millimeter-Wave Massive MIMO for Hybrid Precoding,” IEEE Transactions on Vehicular Technology, pp. 1–1, 2019.
[19] R. Méndez-Rial, C. Rusu, A. Alkhateeb, N. González-Prelcic, and R. W. Heath, “Channel estimation and hybrid combining for mmWave: Phase shifters or switches?,” in 2015 Information Theory and Applications Workshop (ITA), pp. 90–97, Feb 2015.
[20] T. Kailath, B. Hassibi, and A. H. Sayed, Linear Estimation. Upper Saddle River, NJ: Prentice-Hall, 2000.
[21] K. Simonyan and A. Zisserman, “Very Deep Convolutional Networks for Large-Scale Image Recognition,” CoRR, vol. abs/1409.1556, 2015.

Beamforming of Transmit Antennas Using Grey Wolf Optimization and L2-Norm for Performance Enhancement of Beyond 5G Communications

Article

Full-text available

Jan 2024

Pattern synthesis is widely used in many radar and communication systems and received great interest. So, this paper proposes a new beamforming strategy based on a hybrid combination between grey wolf optimizer (GWO) with L2-norm called proposed GWO. This approach is applied to synthesized uniform linear arrays (ULA), Chebyshav arrays, and shaped pattern arrays. Moreover, it is utilized for side lobe level (SLL) and size reduction of antenna elements. In this strategy, the GWO is utilized to optimize the element spacing to adjust the half-power beam-width (HPBW) to save it the same as desired pattern. Furthermore, the excitations of the antenna elements are optimized via the L2-norm minimization problem. The proposed GWO has low complexity (fewer iterations and computing time) compared to other algorithms. In addition, it has a very accurate approximation of the original radiation pattern. As well, the computer simulation technology (CST) microwave package is utilized to achieve the practical validation of the proposed methodologies. As an application of the proposed GWO, it is employed to create a proposed hybrid beamforming (PHB) structure for Multi-input Multi-output (MIMO) systems. Consequently, the BS transmitting antennas are synthesized for gain maximization while utilizing the current amount of antenna elements. This results in considerable savings in antenna components and associated radio frequency (RF) chains which reduces system complexity. Furthermore, array gain maximization will increase the received signal-to-noise ratio (SNR). In addition, the SLL reduction scenario will decrease the interference from undesired users which in turn will also increase SNR. Hence, the performance of the system in terms of spectral efficiency (SE) and power utilization will be improved.

Towards 6G Technology: Insights into Resource Management for Cloud RAN Deployment

Article

Full-text available

Jun 2024

Rapid advancements in the development of smart terminals and infrastructure, coupled with a wide range of applications with complex requirements, are creating traffic demands that current networks may not be able to fully handle. Accordingly, the study of 6G networks deserves attention from both industry and academia. Artificial intelligence (AI) has emerged for application in the optimization and design process of new 6G networks. The developmental trend of 6G is towards effective resource management, along with the architectural improvement of the current network and hardware specifications. Cloud RAN (CRAN) is considered one of the major concepts in sixth- and fifth-generation wireless networks, being able to improve latency, capacity, and connectivity to huge numbers of devices. Besides bettering the current set-up in terms of setting the carriers’ network architecture and hardware specifications, among other potential enablers, the developmental trend of 6G also means that there must be effective resource management. As a result, this study covers a thorough analysis of resource management plans in CRAN, optimization, and AI taxonomy, and how AI integration might enhance existing resource management.

Spectral energy balancing system with massive MIMO based hybrid beam forming for wireless 6G communication using dual deep learning model

Article

Full-text available

Feb 2024

This work aims to provide an effective hybrid beam forming method with Dual-Deep-Network to overcome overhead for mm-wave massive MIMO systems. In this paper, a Dual-Deep-Network technique is described for the extraction of statistical structures from a hybrid beam forming model based on mmWave logics, as well as training logic for the network map functions. The proposed approach of DDN is trained with proper data sequences used for communication and the training phase is conducted with the norms of numerous channel variants. With the nature of diverse channel states, a Dual-Deep-Network is required to manipulate the level of presence and abilities even after training as well. The performance level improvements are practically summarized in both the transmission and reception entities with the help of the proposed hybrid network architecture and the associated Dual Deep Network algorithm. Specifically, the BER versus SNR and spectral efficiency versus SNR are evaluated as well as the resulting accuracy levels are cross validated with numerous classical communication techniques. This paper shows the processing difficulties of the proposed approach and typically cross-validates with other beam forming logics. The computational cost and performance estimations are improved, and the metrics are clearly visualized on this paper based on improved beamforming procedures as well as the proposed approach of DDN based Multi-Resolution Code Book performance metrics are estimated clearly with proper mathematical model investigations. With 7Kbits/s/Hz and 1e-1, respectively, the key metrics of spectral efficiency and BER are enhanced.

Performance Evaluation of Spectral Efficiency Hybrid Precoding and Combining Algorithm for Millimeter Wave -MIMO Systems

Article

Full-text available

Feb 2024
WIRELESS PERS COMMUN

Multiple input multiple output (MIMO) system with Millimeter Wave spectrum is currently used in most wireless applications and all cellular system to provides high data rates. with using large antenna array which is possible by decrease the wavelength to achieve high beamforming gain and improve the spectral efficiency. in this paper, used low complexity with hybrid precoding at the transmitting side and combining at the receiver side with limited feedback system, by using the concept of orthogonal matching pursuit (OMP) in single and multi-user cases, and compared the results with analog only beasmstring. The results of simulation showed that when used Minimum Mean Square Error (MMSE) precoders performed better than other hybrid precoding approaches, in addition the MMSE hybrid precoding /combining technique offers higher spectral efficiency compared with analog only beamstring.

An Enhancement Method with Autoencoder for Deep Learning Based Hybrid Beamforming

Conference Paper

Jan 2024

Federated learning based modulation classification for multipath channels

Article

Mar 2024
PARALLEL COMPUT

Joint Optimization Scheme of CSI Feedback and Hybrid Precoding

Conference Paper

Nov 2023

Hybrid Precoding for mmWave MU-MISO System with Deep Reinforcement Learning and Model-Driven Deep Learning

Conference Paper

Oct 2023

Channel estimation and MIMO combining architecture in millimeter wave cellular system with few ADC bits

Article

Full-text available

Feb 2024

Hybrid combiner and precoder architectures, radio frequency (RF) chain, analog phase shifters, digital-to-analog converter (DAC), and analog-to-digital converter (ADC) are components of a millimeter wave cellular system. Prior works in the area of millimeter wave cellular system design employ receiver with infinite bit and large amount of RF chain that scales linearly with the quantity of transmitting and receiving antennas. This mode of design no doubt increases power demand or requirement of a typical millimeter wave system. In this work, hybrid architecture with few RF chains and small number of ADC bits are proposed and are used as candidate for millimeter wave channel estimation and cellular communication. In that connection, least square (LS), orthogonal matching pursuit (OMP), compressed sampling matching pursuit (CoSAMP), and deep learning (DL) techniques are utilized for analytical investigation. Indeed, computational results reveal that, when ADC consisting of uniform mid- rise quantizer is employed, the performance of 4 and 6 bits at signal-to-noise ratio (SNR) values of − 10 dB and 20 dB is at par with infinite bit (unquantized case). As a validation, DL compares favorably well with adaptive compressed sensing (ACS) technique previously used in the literature for channel estimation, while OMP and CoSAMP show better performance than ACS.

Adaptive Massive MIMO Hybrid Precoding Based on Meta Learning

Conference Paper

Nov 2023

Deep Learning Coordinated Beamforming for Highly-Mobile Millimeter Wave Systems

Article

Jun 2018

Supporting high mobility in millimeter wave (mmWave) systems enables a wide range of important applications such as vehicular communications and wireless virtual/augmented reality. Realizing this in practice, though, requires overcoming several challenges. First, the use of narrow beams and the sensitivity of mmWave signals to blockage greatly impact the coverage and reliability of highly-mobile links. Second, highly-mobile users in dense mmWave deployments need to frequently hand-off between base stations (BSs), which is associated with critical control and latency overhead. Further, identifying the optimal beamforming vectors in large antenna array mmWave systems requires considerable training overhead, which significantly affects the efficiency of these mobile systems. In this paper, a novel integrated machine learning and coordinated beamforming solution is developed to overcome these challenges and enable highly-mobile mmWave applications. In the proposed solution, a number of distributed yet coordinating BSs simultaneously serve a mobile user. This user ideally needs to transmit only one uplink training pilot sequence that will be jointly received at the coordinating BSs using omni or quasi-omni beam patterns. These received signals draw a defining signature not only for the user location, but also for its interaction with the surrounding environment. The developed solution then leverages a deep learning model that learns how to use these signatures to predict the beamforming vectors at the BSs. This renders a comprehensive solution that supports highly-mobile mmWave applications with reliable coverage, low latency, and negligible training overhead. Extensive simulation results, based on accurate ray-tracing, show that the proposed deep-learning coordinated beamforming strategy approaches the achievable rate of the genie-aided solution that knows the optimal beamforming vectors with no training overhead. Compared to traditional beamforming solutions, the results show that the proposed deep learning based strategy attains higher rates, especially in high-mobility large-array regimes.

Data-Driven-Based Analog Beam Selection for Hybrid Beamforming Under mm-Wave Channels

Article

Mar 2018

Hybrid beamforming is a promising low-cost solution for large multiple-input multiple-output (MIMO) systems, where the base station (BS) is equipped with fewer radio frequency chains. In these systems, the selection of codeword for analog beamforming is essential to optimize the sum-rate performance. In this paper, based on machine learning, we propose a data-driven method of analog beam selection to achieve a near-optimal sum-rate with low complexity, which is highly dependent on training data. To be more specific, we take the beam selection problem as a multiclass-classification problem, where a large number of samples of millimeter-wave channels are considered as training data. By using the training data, we exploit support vector machine (SVM) to obtain a statistical classification model in terms of maximizing sum-rate. For real-time transmissions, with the derived classification model, we are able to select the optimal analog beam for each user with low complexity. Besides, we propose a novel method to determine the optimal parameter of the Gaussian kernel function by resorting to the Maclaurin expansion. Analysis and simulation results reveal that, as long as the training data is sufficient, the proposed data-driven method is able to achieve a near-optimal sum-rate performance, while the complexity reduces by several orders of magnitude, compared with the conventional method.

Cognitive Radar Antenna Selection via Deep Learning

Article

Jun 2019
IET RADAR SONAR NAV

Direction of arrival (DoA) estimation of targets improves with the number of elements employed by a phased array radar antenna. Since larger arrays have high associated cost, area and computational load, there is recent interest in thinning the antenna arrays without loss of far-field DoA accuracy. In this context, a cognitive radar may deploy a full array and then select an optimal subarray to transmit and receive the signals in response to changes in the target environment. Prior works have used optimization and greedy search methods to pick the best subarrays cognitively. In this paper, we leverage deep learning to address the antenna selection problem. Specifically, we construct a convolutional neural network (CNN) as a multi-class classification framework where each class designates a different subarray. The proposed network determines a new array every time data is received by the radar, thereby making antenna selection a cognitive operation. Our numerical experiments show that the proposed CNN structure outperforms existing random thinning and other machine learning approaches.

Power of Deep Learning for Channel Estimation and Signal Detection in OFDM Systems

Article

Aug 2017

This article presents our initial results in deep learning for channel estimation and signal detection in orthogonal frequency-division multiplexing (OFDM). OFDM has been widely adopted in wireless broadband communications to combat frequency-selective fading in wireless channels. In this article, we take advantage of deep learning in handling wireless OFDM channels in an end-to-end approach. Different from existing OFDM receivers that first estimate CSI explicitly and then detect/recover the transmitted symbols with the estimated CSI, our deep learning based approach estimates CSI implicitly and recovers the transmitted symbols directly. To address channel distortion, a deep learning model is first trained offline using the data generated from the simulation based on the channel statistics and then used for recovering the online transmitted data directly. From our simulation results, the deep learning based approach has the ability to address channel distortions and detect the transmitted symbols with performance comparable to the minimum mean-square error (MMSE) estimator. Furthermore, the deep learning based approach is more robust than conventional methods when fewer training pilots are used, the cyclic prefix (CP) is omitted, and nonlinear clipping noise is present. In summary, deep learning is a promising tool for channel estimation and signal detection in wireless communications with complicated channel distortions and interference.

Deep Learning-Based Communication Over the Air

Article

Jul 2017

End-to-end learning of communications systems is a fascinating novel concept that has so far only been validated by simulations for block-based transmissions. It allows learning of transmitter and receiver implementations as deep neural networks (NNs) that are optimized for an arbitrary differentiable end-to-end performance metric, e.g., block error rate (BLER). In this paper, we demonstrate that over-the-air transmissions are possible: We build, train, and run a complete communications system solely composed of NNs using unsynchronized off-the-shelf software-defined radios (SDRs) and open-source deep learning (DL) software libraries. We extend the existing ideas towards continuous data transmission which eases their current restriction to short block lengths but also entails the issue of receiver synchronization. We overcome this problem by introducing a frame synchronization module based on another NN. A comparison of the BLER performance of the "learned" system with that of a practical baseline shows competitive performance close to 1 dB, even without extensive hyperparameter tuning. We identify several practical challenges of training such a system over actual channels, in particular the missing channel gradient, and propose a two-step learning procedure based on the idea of transfer learning that circumvents this issue.

Deep-Learning-based Millimeter-Wave Massive MIMO for Hybrid Precoding

Article

Jan 2019

Millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) has been regarded as an emerging solution for the next generation of communications, in which hybrid analog and digital precoding is an important method for reducing the hardware complexity and energy consumption associated with mixed signal components. However, the fundamental limitation of the existing hybrid precoding schemes is that they have high computational complexity and fail to fully exploit the spatial information. To overcome these limitations, this paper proposes a deep-learning-enabled mmWave massive MIMO framework for effective hybrid precoding, in which each selection of the precoders for obtaining the optimized decoder is regarded as a mapping relation in the deep neural network (DNN). Specifically, the hybrid precoder is selected through training based on the DNN for optimizing the precoding process of the mmWave massive MIMO. Additionally, we present extensive simulation results to validate the excellent performance of the proposed scheme. The results show that the DNN-based approach is capable of minimizing the bit error ratio (BER) and enhancing the spectrum efficiency of the mmWave massive MIMO, which achieves better performance in hybrid precoding compared with conventional schemes while substantially reducing the required computational complexity.

Backpropagating Through the Air: Deep Learning at Physical Layer Without Channel Models

Article

Aug 2018

Recent developments in applying deep learning techniques to train end-to-end communication systems have shown great promise in improving the overall performance of the system. However, most of the current methods for applying deep learning to train physical layer characteristics assume the availability of an explicit channel model. Training a neural network requires the functional form of all the layers in the network to be available in order to calculate gradients for optimization. The unavailability of gradients in a physical channel forced previous works to adopt simulation-based strategies to train the network and then fine-tune only the receiver part with the actual channel. In this paper, we present a practical method to train an end-to-end communication system without relying on explicit channel models. By utilizing stochastic perturbation techniques, we show that the proposed method can train a deep learning based communication system over a real channel without any assumption on channel models.

Deep Learning for Super-Resolution Channel Estimation and DOA Estimation based Massive MIMO System

Article

Jun 2018

The recent concept of massive multiple-input multiple-output (MIMO) can significantly improve the capacity of the communication network and it is regarded as a promising technology for the next generation of wireless communications. However, the fundamental challenge of existing massive MIMO systems is that high computational complexity and complicated spatial structures make it difficult to exploit the characteristics of the channel and the sparsity of these multi-antenna systems. To address this problem, in this paper, we focus on channel estimation and direction of arrival (DOA) estimation, and a novel framework that integrates massive MIMO into deep learning is proposed. To realize end-to-end performance, a deep neural network (DNN) is employed to conduct offline learning and online learning procedures, which is effective in learning the statistics of the wireless channel and the spatial structures in the angle domain. Concretely, the DNN is first trained with simulated data under different channel conditions during offline learning, and then the corresponding output data can be obtained from the current input data during the online learning process. In order to realize super-resolution channel estimation and DOA estimation, two algorithms based on deep learning are developed, in which the DOA can be estimated directly in the angle domain without additional complexity. Furthermore, simulation results corroborate that the proposed deep learning based scheme can achieve better performance in terms of DOA estimation and channel estimation compared with conventional methods, and the proposed scheme is investigated by extensive simulation in various cases to test its robustness.

Deep Learning for Massive MIMO CSI Feedback

Article

Dec 2017

In frequency division duplex mode, the downlink channel state information (CSI) should be conveyed to the base station through feedback links so that the potential gains of massive multiple-input multiple-output can be exhibited. However, the excessive feedback overhead remains a bottleneck in this regime. In this letter, we use deep learning technology to develop CsiNet, a novel CSI sensing and recovery network that learns to effectively use channel structure from training samples. In particular, CsiNet learns a transformation from CSI to a near-optimal number of representations (codewords) and an inverse transformation from codewords to CSI. Experiments demonstrate that CsiNet can recover CSI with significantly improved reconstruction quality compared with existing compressive sensing (CS)-based methods. Even at excessively low compression regions where CS-based methods cannot work, CsiNet retains effective beamforming gain.

Channel estimation and hybrid combining for mmWave: Phase shifters or switches?

Conference Paper

Feb 2015
