Conference PaperPDF Available

Over-the-Air Computing under Adaptive Channel State Estimation

Authors:

Abstract and Figures

Over-the-air Computation (AirComp) has attracted significant attention as an efficient way of data fusion by integrating uncoded communication transmissions with computation thanks to the superposition offered by the multiple access channels. However, proper pre-processing and post-processing is required to neutralize the wireless channel effect, in order for AirComp to function successfully. Since, internet-of-things (IoT) type of devices with limited capabilities are the target demographic of AirComp, having perfect channel state information (CSI) available is not always a practical assumption. In this work, we examine the effect of imperfect CSI on the AirComp system and we design a general optimization framework that takes into account both magnitude and phase errors in CSI. On top of that, a pilot retransmission policy is designed that offers a trade-off between cost of retransmissions and gain in the accuracy of the computations. Simulation results show the deterioration caused by the imperfect CSI and also the value of the proposed policy under various system conditions.
Content may be subject to copyright.
Over-the-Air Computing under Adaptive Channel
State Estimation
Nikos G. Evgenidis, Vasilis K. Papanikolaou, Panagiotis D. Diamantoulakis, and George K. Karagiannidis
Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, GR-54124 Thessaloniki, Greece
e-mails: nevgenid@ece.auth.gr, vpapanikk@auth.gr, padiaman@auth.gr, geokarag@auth.gr
Abstract—Over-the-air Computation (AirComp) has attracted
significant attention as an efficient way of data fusion by inte-
grating uncoded communication transmissions with computation
thanks to the superposition offered by the multiple access
channels. However, proper pre-processing and post-processing
is required to neutralize the wireless channel effect, in order
for AirComp to function successfully. Since, internet-of-things
(IoT) type of devices with limited capabilities are the target de-
mographic of AirComp, having perfect channel state information
(CSI) available is not always a practical assumption. In this work,
we examine the effect of imperfect CSI on the AirComp system
and we design a general optimization framework that takes into
account both magnitude and phase errors in CSI. On top of that,
a pilot retransmission policy is designed that offers a trade-off
between cost of retransmissions and gain in the accuracy of the
computations. Simulation results show the deterioration caused
by the imperfect CSI and also the value of the proposed policy
under various system conditions.
Index Terms—over-the-air computing, AirComp, imperfect
CSI, optimization framework, MSE minimization
I. INTRODUCTION
In 5G and beyond networks, a paradigm shift has been
noticed from human type communications to machine type
communications. To deal with the massive amount of dis-
tributed data, particularly from internet-of-things type sensors,
data aggregation through over-the-air computing (AirComp)
has recently emerged as a very attractive technology [1], [2].
AirComp can accommodate various objectives in the network
by taking advantage of the superposition property of trans-
mitted waveforms over the multiple access channel (MAC),
and proper pre- and post-processing to facilitate a family of
functions, called nomographic such as mean, geometric mean,
etc [3]. This exploitation can lead to significant increases in
latency and computation load at the central processing node,
especially when the number of sensors becomes excessively
large. As proven in [4], the uncoded AirComp is optimal in
the presence of a Gaussian multiple access channel (MAC)
with independent data sources. This optimality is in terms of
the mean squared error (MSE), which is a basic measure of
performance for the discussed system.
AirComp systems face a particular set of challenges that
need to be resolved to guarantee their operation. The time
synchronization of the devices is one such challenge, where
each device has to account for its message’s propagation delay
to ensure service. Moreover, carrier frequency offset (CFO)
issues that are usually satisfied through the use of high quality
oscillators need to be addressed with different means, since
AirComp devices are usually low-end [5]. On top of that, the
transmission through the wireless channel is prone to channel
fading and noise. To account for that, a power allocation
strategy is required, which is presented in [1], [6], i.e., the
optimal power allocation for the transmitting devices in order
to achieve a minimum MSE.
The integration of AirComp with popular wireless tech-
nologies such as MIMO has gathered a lot of attention from
the research community [7]. Also, recently, AirComp has
been discussed as an appealing technology to be implemented
with federated learning in the network edge [5], [8]–[14].
On top of that, AirComp has shown promise in different
scenarios such as in combination with aerial networks with
unmanned aerial vehicles (UAVs). More specifically, in [15],
the authors optimize the trajectory of the UAV to minimize
the time-average MSE, while in [16] they take into account
CSI imperfections when data are aggregated into a UAV fusion
center. Finally, promising new technologies such as intelligent
reconfigurable surfaces (IRS) has been examined as a way to
facilitate AirComp with higher performance gains [17], [18].
However, most of these works are focused on the perfect
channel state information (CSI) scenario, which cannot easily
be guaranteed in practice, when IoT-type devices are con-
cerned. Imperfect CSI can highly deteriorate the AirComp
system’s performance, as channel information is vital in ob-
taining the correct message at the receiver. In literature, there
are some attempts studying the effect of imperfect CSI and
its performance [16], [19]. Nevertheless, none of these look
into the general case of imperfect CSI both in magnitude and
phase, which is the main motivation of this work. In our
paper, we will investigate the problem of the minimization
of the MSE and the power allocation that must be used in
the transmitting and receiving sides, so that the optimum
can be achieved. First, we provide some theoretical analysis
based on order statistics in order to get more insight on how
the CSI imperfections affect the magnitude and the phase of
every channel. The derived results can help us identify the
necessary communication conditions that are needed for an
AirComp system to work without suffering extensively from
a noise-infused CSI. Then, we propose an algorithm that uses
alternating optimization for the minimization problem, based
on an approximation of the MSE at a worst case scenario.
This approach differs from the one in [19] and also takes
into consideration the phase difference between the real and
the estimated channel, which is obviously the general case to
be considered. Furthermore, based on our theoretical analysis
we propose a second retransmission round for the weaker
estimated channels in an attempt to improve the system’s
performance.
II. SYSTEM MODEL
Let Kbe the number of transmitting devices in the AirComp
system that are such that all of them are independent from
one another. For our work we will assume that the receiver
and all transmitters have a single antenna. Also let bkC
symbolize the transmitting power at the k-th device and a
Csymbolize the receiver gain factor. For practical reasons we
will assume that all devices have a common maximum power
magnitude P,sothat|bk|≤Pfor all k∈{1,···,K}.
We wish to find the computation distortion of the ideal
signal K
k=1 xk, which will be given by
MSE = E|r
K
k=1
xk|2,(1)
where ris the received signal given by r=
aK
k=1 bkhkxk+n,sowehave
MSE = E
aK
k=1
bkhkxk+n
K
k=1
xk
2.(2)
All signals and noise are independent with one another and
we assume that all signals xk[u, u]have normalized
variance, thus E(|xk|2)=1,k. We will also assume that
the noise nfollows a complex Gaussian distribution with
E(n2)=σ2. Taking the expectation with respect to the signals
xkand noise ngives
MSE = E
K
k=1
(abkhk1)xk+an
2(3)
and equivalently from the above assumptions
MSE =
K
k=1
abkhk1
2
+σ2|a|2.(4)
In order to study possible imperfections in the CSI, we
will assume that the error in the estimation of the channel is
modeled as additive random variable nk. Hence, we assume
the following:
nkCN(0
2)=N(0,σ2
2)+jN(0,σ2
2),arethe
noise samples in the kth CSI estimation, uncorrelated and
independent between them.
hkCN(0
2
h)=N(0,σ2
h
2)+jN(0,σ2
h
2)because of
Rayleigh fading conditions in the channel of transmitter
k. Here we assume that σ2
h=2for all |hk|to follow a
Rayleigh distribution of mode σRayleigh =1as used in
the simulations of the paper.
For the estimation of hkwe use symbols of max power
Phence the receiver gets yk=Ph
k+nkand assumes
that yk=Ph
k. Consequently, the channel estimation will
be given by
Ph
k=Ph
k+nkh
k=hk+nk
P=hk+ek,(5)
where ekCN(0,σ2
P)=N(0,σ2
2P)+jN(0,σ2
2P). Conse-
quently, we can assume that the channel estimation h
kis
distributed as h
kCN(0,σ2+
2
h
P)=N(0,σ2+
2
h
2P)+
jN(0,σ2+
2
h
2P).
III. PROPOSED POLICY AND ANALYSIS
A. Theoretical Study on AirComp with Imperfect CSI
Assuming the receiver is unaware of the CSI imperfections,
the algorithm obtained for an AirComp system with perfect
CSI in [1], [6] would lead to a combination of full power and
channel inversion methods. Using this algorithm in (4), would
result in
MSE =
i
k=1 aPh
k
H
|h
k|hk1
2
+σ2|a|2
+
K
k=i+1 hk
h
k1
2
,(6)
where iis the critical number of transmitters that utilize their
full power as given in [1]. Moreover, the channel ordering has
been used |h
1|≤|h
2|···|h
K|. Since the order has been
taken with respect to the estimation of hk, it is clear that the
true order could be different, e.g., |h2|≤|h1|···|hK|,
which means that the proposed optimum solution can lead to
a different optimum number i
est of transmitters that use full
power, since
i=argmax
1iK{gi},(7)
where
gi=Pi
k=1 |hk|
σ2+Pi
k=1 |hk|2(8)
and the magnitudes affect the result. Though ican be
incorrectly estimated the main problem is that the incorrect
values of giwill also affect a, which is given as a=gi.
Lemma 1: The expected value of the magnitude of the
ordered channel gains is given by
E(Ur)=π
4σhK
rr
r1
k=0 r1
k(1)k
(Kr+1+k)3
2
.
(9)
Proof: The proof is presented in Appendix A.
Corollary 1: The expected value of the magnitude of the
error given by
E(|ek|)=
0
|ek|2
σ2
ek
exp |ek|2
2σ2
ekd|ek|
=σekπ
2=σ2
2Pπ
2=1
ρπ
4,(10)
where ρis the transmit SNR.
Proof: Following the proof of Lemma 1, (10) is easily
produced in the same way.
Using the above results, we can find statistically how many
channels are mostly affected by the CSI imperfection, which
is of great value when designing the AirComp system. The
main resulting issue is that the order of the estimated channels
is very likely to be different from the correct order, which
will result in an additional error in the minimization of MSE.
Ultimately, the most affected channels will also be more
susceptible to greater phase difference during CSI.
B. Optimization Framework
Due to AirComp’s operation, imperfections in CSI can
greatly hinder performance, even for small levels of noise.
Solutions that ignore the possible imperfections that result
in optimum controls for the perfect CSI case, by assuming
|h
k|=|hk|,k∈{1,···,K}, diverge remarkably from the
optimal strategy, which is defined by iand a, when imperfect
CSI is available.
In order to improve the overall performance, we propose
a new optimization framework that will take into account all
the extra error terms that occur. In order to do this we assume
that the statistical mean of the noise is known to the receiver.
First of all, since aCwe define a=apto symbolize its
complex phase. By setting Δhk=(hk,h
k), using the fact
that h
k=hk+ek, and (4), the optimization problem can be
expressed as
min
a,bMSE = K
k=1 |a||bk||hk|2
+1
2K
k=1 |a||bk||hk|cos (aphk)+σ2|a|2,
s.t. C1:|bk|≤Pk,k.
(11)
Theorem 1: The optimal power distribution is given by
|bk|=minP, cos Δhk
a|hk|.(12)
Proof: In order to use the phase factor for minimiza-
tion in (11) we will need to approximate its term and find
its extrema. However, from trigonometry, cos (aphk)=
cos apcos Δhksin apsin Δhkand any approach to approx-
imate this quantity without knowledge about the sign of the
phase difference Δhkcannot be made because the sign of
sin Δhkwill be affected. Since the sign of Δhkis affected by
the phase of the noise in the CSI estimation, we cannot make
any assumptions about it and thus we cannot further use ap
in minimization. Hence, from now on we will consider ato
be a real number and we rewrite the mean squared error as
MSE =
K
k=1 a|bk||hk|2
+1
2
K
k=1 a|bk||hk|cos Δhk+σ2a2.
(13)
O
hkB
h
k
|ek|
Fig. 1: Worst case approximation of phase difference Δhk.
In order to find the extrema for every variable |bk|we take
the first order partial derivative to be equal to zero, thus:
MSE
|bk|=0⇔|bk|=cos Δhk
a|hk|and since |bk|is the power
magnitude at device k,itmustalsobe0≤|bk|≤P.
Consequently, the best power distribution will be given by
(12) and, thus, the proof is completed.
We can observe that the real MSE is at this point related to
the estimations h
konly through the terms cos Δhk. Assuming
the imperfections of CSI due to the noise have a constant
magnitude of |ek|, then from Figure 1, the worst case scenario
for these terms arise when the phase of the imperfection
is such that h
kbecomes tangent to the circle of constant
radius |ek|. This approximation will be quite accurate when
the conditions are such that the mean estimated error can be
assumed less than the mean of the estimated channels, i.e.
E(|hk|)>E(|ek|), and using it we obtain
cos Δhmax
k=|h2
k|−|e2
k|
|hk|=|hworst
k|
|hk|.(14)
Hence, if we consider the worst case scenario we can make
the following approximations |h
k|2+|ek|2≈|hk|2and
cos Δhk|h
k|
|hk|. We can also assume that |ek|2E(|ek|2)=
E(n2)
P=σ2
P=cfor all k. While this approach contains its own
errors, mostly for large scale fading or great levels of noise, it
provides a tight approximation of the sinusoidal term for the
greater majority of the involved channels, except for those that
are greatly affected by noise. Also, since it covers the worst
case in an almost optimal way it will partly compensate for
its near optimum behavior in more favorable scenarios.
Using the above mentioned approximation combined with
the attainable values of |bk|given by Theorem 1, we can now
express (13) as
MSEi=
i
k=1 aP|h
k|−1
2
+a2(σ2+iP c)
+c
K
k=i+1
1
|h
k|2+c.
(15)
Lemma 2: The optimal aito minimize (15) is given by
ai=Pi
k=1 |h
k|
(σ2+iP c)+Pi
k=1 |h
k|2.(16)
Proof: Considering (15) as a quadratic polynomial in
terms of aand using a well known property for the global
extremum of this function, we obtain that the global minimum
for every iis given by differentiating (15) in terms of aito
finally obtain (16), and thus, the proof is completed.
At this point, it is observed that (16) will be precise
mostly at conditions like relatively high SNR or good channel
conditions. This is so, because, due to our approximation
the power magnitudes, we will obtain |bk|opt =cos Δhk
a|hk|
|h
k|
a|hk|2|h
k|
a(|h
k|2+c), are points of the function f(x)= x
a(x2+c)
which is an increasing and then decreasing function in terms
of xthat achieves its maximum at x=c. Hence, for these
to be in descending order we need |h
1|≥c.Otherwise,we
observe that the receiver coefficient athat will be calculated
can be such that some of the weaker estimated channels will
use the inverse channel method. Finally, expression (16) is
quite similar to the optimum value of afor the perfect CSI
case, but will always be less than gi. In other words, the
uncertainty caused by the imperfect CSI will force the system
to use more transmitting power in an attempt to counter these
imperfections.
Corollary 2: At least one device must use its full power
during transmission.
Proof: The proof is given in Appendix B.
So, it suffices to solve Ksubproblems for i1, ..., K and,
then, compare the minimum values MSEmin
i=MSE
i(ai).
In order to get a feasible solution for a,itmustbeai>
|h
i+1|
(|h
i+1|2+c)P.Ifai>|h
i+1|
(|h
i+1|2+c)Pwe can always find a
better solution to minimize the overall MSE. Thus, we only
need to check the values MSEi(ai)when aiis feasible. This
way we can calculate iandthenestimateaiand bkfor all k.
It is important to note that this approach differs from the
one followed in [19] in its mathematical derivation, but also
in the fact that our approximation covers the more general
case of phase misalignment as opposed to the phase alignment
considered in [19].
C. Pilot Retransmission Policy
From our theoretical analysis it is clear that statistically
there will be a number of channels whose estimations will
be greatly affected by the noise-induced error during the CSI
procedure. In order to limit the effect of this in the MSE of the
system we need to have better estimations for the channels. As
a result we can consider the possibility of making a second
CSI round at least for some channels. Apparently, the most
obvious choice is for the weaker channels to retransmit pilot
symbols and then use the average of the two estimations as the
new correct estimated channel. For this approach we propose
the following heuristic algorithm in order to nd the number of
channels that will need to re-estimate their channels. In order
to track the trade-off between the extra resources required for
the retransmissions and the resulting MSE, we propose and
define the following cost function, called the Retransmission
Policy Cost (RPC) as
RPC = (Power costk)b·E(MSEk)
Kd
,(17)
where Power costksymbolizes the power resources needed for
kretransmissions, E(MSEk)
Kdenotes the MSE for kretransmis-
sions and are our primary concern over the available resources
is taken to be their power. Moreover parameters band dare
considered to be weights for power and MSE, respectively.
Any combination of parameters can be used to give emphasis
to either the cost in terms of resources for the retransmissions
or its maximum error tolerance. Withous loss of generality, the
time penalty required for the retransmission can be included
in the RPC metric, but given that the decrease of E(MSE) is
vital for the correct interpretation of the superimposed signals
and the fact that the AirComp scheme needs less time than
other traditional schemes like NOMA due to the superposition
property of MAC, we suppose that the small time latency
created by a few retransmission rounds does not affect our
system.
IV. SIMULATION RESULTS
In this section we present the simulation results of our work
for an AirComp system. Fading channels have been simu-
lated with Circular-Symmetric Complex Normal distributed
variables CSCN(0,1) to simulate Rayleigh fading channel
conditions. Unless otherwise stated, the transmit SNR is set
as ρ= 10dB. We apply Monte Carlo analysis averaging over
104channel realizations (snapshots). Finally, we define the
average per user MSE as AMSE = E(MSE)/K .
In Fig. 2 we look at the performance of the perfect
CSI algorithm and the proposed optimization technique for
imperfect CSI under imperfect channel estimation. As we
can see the perfect CSI algorithm not only achieves worse
E(MSE)/K values, but also fails to converge as the number
of devices in the system increases. In contrast to this, the
proposed technique both achieves better performance and has
a diminishing behavior for increasing number of devices.
In Fig. 3 we look at the performance of the retransmission
policy discussed in section III. For comparison reasons, the
retransmissions have also been simulated for random selec-
tions of channels instead of the weaker ones as proposed. As
expected both policies achieve better MSE values than the no-
retransmission policy due to the better channel estimations for
their corresponding re-estimated channels. We can see that the
proposed policy achieves a much better improvement rate over
the random channel selection policy. It is important to point
out that this improvement is greater mainly for the first few
weaker channels, which confirms the idea that a re-estimation
of these channels can be used to decrease the MSE of the
system. We can also observe that though a bigger number of
retransmissions can achieve further performance improvement,
both policies tend to converge on the same MSE, because a
lot of channel re-estimations will now be common.
In order to evaluate in terms of RPC the proposed policy,
we design a simple example of an RPC function as follows.
To obtain the first, necessary round of channel estimates
exactly Kpower resources are needed. Assuming that each
power resource is equal to 1, then, for every retransmission,
1additional power resource is needed from the corresponding
device. Thus, for kretransmissions, Power costk=(K+k)P
and according to (17) the minimum of RPC is studied. RPC
in this case is expressed as
RPC = [(K+k)P]b·E(MSEk)
Kd
.(18)
Using Monte Carlo analysis we see that such a minimum
exists, it is global and it appears that for the statistically weaker
channels all retransmission choices achieve a better trade-off
between power and MSE than the original proposed scheme
without any retransmissions.
Fig. 2: MSE vs the number of devices in the AirComp system
accounting or not for imperfect CSI (Transmit SNR = 10dB).
V. C ONCLUSIONS
In this paper, an AirComp system under imperfect CSI
assumptions was considered. The detrimental effect of im-
perfect CSI is presented for the MSE, especially when the
optimization framework does not account for the imperfections
in the channel estimations. We presented a comprehensive
analysis on how channel estimation errors affect the AirComp
system and a novel optimization framework to minimize MSE
under those conditions. In order to counter the effect of the
imperfections, an adaptive policy based on pilot retransmission
was presented, where the proposed policy shows the potential
to greatly improve the performance. Moreover, a new metric
was presented alongside the retransmission policy to showcase
Fig. 3: MSE vs the number of retransmission with various
policies (Transmit SNR = 10 dB).
Fig. 4: Retransmission Policy Cost vs the Number of Retrans-
missions for the proposed policy, the random policy, and the
original scheme without retransmission for b=d=1and
Pk=1,k.
the efficiency of the approach. Finally, simulation results were
presented to validate the effectiveness of the proposed analysis,
showcasing that it can offer great insights into the design of
AirComp systems.
ACKNOWLEDGMENT
This paper was supported by the European Union’s Horizon
2020 Research and Innovation Program under Agreement
957406. The implementation of the doctoral thesis of Vasilis
K. Papanikolaou was co-financed by Greece and the European
Union (European Social Fund-ESF) through the Operational
Programme “Human Resources Development, Education and
Lifelong Learning” in the context of the Act “Enhancing Hu-
man Resources Research Potential by undertaking a Doctoral
Research” Sub-action 2: IKY Scholarship Programme for PhD
candidates in the Greek Universities.
APPENDIX A
PROOF OF LEMMA 1
We start the proof by presenting an analysis based on order
statistics, in order to relate the mean estimation error with the
mean of the magnitude of the ordered channels. We denote the
correct channel gain ordering in the following way |hcor
1|≤
|hcor
2|···|hcor
K|. From the order statistics we can find that
every ordered sample Ur=|hcor
r|from a total of Ksamples
has the following probability distribution function (PDF)
fUr(ur)= K!
(Kr)!(r1)!
2ur
σ2
h
×exp u2
r
σ2
h
(Kr+1)
1exp u2
r
σ2
hr1
.
(19)
Hence, the expected value E(Ur)can be calculated by
E(Ur)=
K
rr
0
2u2
r
σ2
h
exp u2
r
σ2
h
(Kr+1)
×1exp u2
r
σ2
hr1
dur.(20)
Setting t=ur
σhwe obtain
E(Ur)=2σhK
rr
0
t2exp t2(Kr+1)
×1exp (t2)r1
dt
=2σhK
rr
0
t2eKt2[et21]r1dt. (21)
Using binomial expansion and that
0exp (mt2)=
π
4mand after some algebraic manipulations, after the in-
tegration, we get
0
t2eKt2(et21)ndt =
=1
2π
4
n
k=0 n
k(1)k1
(Kn+k)3
2
.(22)
Then, we can easily get (9) from (21) and (22), and, thus, the
proof is concluded.
APPENDIX B
PROOF OF COROLLARY 2
We begin with a proof by contradiction, assuming that a full
inverse channel method is utilized. Then, by (16) the overall
MSE of the system will be
MSE0(a0)=σ2a2
0+c
K
k=1
1
|h
k|2+c,(23)
where a0>|h
1|
P(|h
1|2+c)for feasibility reasons. Then, taking
a1=|h
1|
P(|h
1|2+c)observe that
MSE1(a1)=a1P|h
1|−1
2
+a2
1(σ2+Pc)
+c
K
k=2
1
|h
k|2+c=a2
1σ2+c
K
k=1
1
|h
k|2+c
=MSE
0(a1)(24)
and since a0>a
1we see that MSE1(a1)<MSE0(a0)which
means that we get that the performance of the system is better
than the full inverse channel method, and, thus the proof is
completed. REFERENCES
[1] W. Liu, X. Zang, Y. Li, and B. Vucetic, “Over-the-Air Computation Sys-
tems: Optimization, Analysis and Scaling Laws,” IEEE Trans. Wireless
Commun., vol. 19, no. 8, pp. 5488–5502, Aug. 2020.
[2] G. Zhu, J. Xu, K. Huang, and S. Cui, “Over-the-Air Computing for
Wireless Data Aggregation in Massive IoT,” IEEE Wireless Commun.,
vol. 28, no. 4, pp. 57–65, Aug. 2021.
[3] M. Goldenbaum, H. Boche, and S. Stanczak, “Nomographic Functions:
Efficient Computation in Clustered Gaussian Sensor Networks, IEEE
Trans. Wireless Commun., vol. 14, pp. 2093–2105, 2015.
[4] M. Gastpar, “Uncoded Transmission Is Exactly Optimal for a Simple
Gaussian Sensor Network,” IEEE Trans. Inform. Theory, vol. 54, pp.
5247–5251, 2008.
[5] Y. Shao, D. Gunduz, and S. C. Liew, “Federated Edge Learning With
Misaligned Over-the-Air Computation,” IEEE Trans. Wireless Commun.,
vol. 21, no. 6, pp. 3951–3964, Jun. 2022.
[6] X. Cao, G. Zhu, J. Xu, and K. Huang, “Optimized Power Control
for Over-the-Air Computation in Fading Channels,” Jul. 2020, number:
arXiv:1906.06858 arXiv:1906.06858 [cs, eess, math].
[7] X. Zhai, X. Chen, J. Xu, and D. W. Kwan Ng, “Hybrid Beamforming
for Massive MIMO Over-the-Air Computation, IEEE Trans. Commun.,
vol. 69, no. 4, pp. 2737–2751, Apr. 2021.
[8] Y. Sun, S. Zhou, Z. Niu, and D. Gunduz, “Dynamic Scheduling for
Over-the-Air Federated Edge Learning With Energy Constraints,” IEEE
J. Select. Areas Commun., vol. 40, no. 1, pp. 227–242, Jan. 2022.
[9] K. Yang, T. Jiang, Y. Shi, and Z. Ding, “Federated Learning via Over-
the-Air Computation,” IEEE Trans. Wireless Commun., vol. 19, no. 3,
pp. 2022–2035, Mar. 2020.
[10] N. Zhang and M. Tao, “Gradient Statistics Aware Power Control for
Over-the-Air Federated Learning,” IEEE Trans. Wireless Commun.,
vol. 20, no. 8, pp. 5115–5128, Aug. 2021.
[11] X. Fan, Y. Wang, Y. Huo, and Z. Tian, “Joint Optimization of Commu-
nications and Federated Learning Over the Air,” IEEE Trans. Wireless
Commun., vol. 21, no. 6, pp. 4434–4449, Jun. 2022.
[12] C. Xu, S. Liu, Z. Yang, Y. Huang, and K.-K. Wong, “Learning Rate
Optimization for Federated Learning Exploiting Over-the-Air Computa-
tion,” IEEE J. Select. Areas Commun., vol. 39, no. 12, pp. 3742–3756,
Dec. 2021.
[13] X. Cao, G. Zhu, J. Xu, Z. Wang, and S. Cui, “Optimized Power Control
Design for Over-the-Air Federated Edge Learning,” IEEE J. Select.
Areas Commun., vol. 40, no. 1, pp. 342–358, Jan. 2022.
[14] T. Sery, N. Shlezinger, K. Cohen, and Y. Eldar, “Over-the-Air Federated
Learning From Heterogeneous Data,” IEEE Trans. Signal Process.,
vol. 69, pp. 3796–3811, 2021.
[15] M. Fu, Y. Zhou, Y. Shi, W. Chen, and R. Zhang, “UAV Aided Over-
the-Air Computation,” IEEE Trans. Wireless Commun., pp. 1–1, 2021.
[16] H. Jung and S.-W. Ko, “Performance Analysis of UAV-Enabled Over-
the-Air Computation Under Imperfect Channel Estimation,” IEEE Wire-
less Commun. Lett., vol. 11, no. 3, pp. 438–442, Mar. 2022.
[17] T. Jiang and Y. Shi, “Over-the-Air Computation via Intelligent Reflecting
Surfaces,” in Proc. 2019 IEEE Global Commun. Conf. (GLOBECOM),
2019, pp. 1–6.
[18] W. Fang, Y. Jiang, Y. Shi, Y. Zhou, W. Chen, and K. B. Letaief, “Over-
the-Air Computation via Reconfigurable Intelligent Surface,” IEEE
Trans. Commun., vol. 69, no. 12, pp. 8612–8626, Dec. 2021.
[19] Y. Chen, G. Zhu, and J. Xu, “Over-the-Air Computation with Imper-
fect Channel State Information,” Feb. 2022, number: arXiv:2202.13666
arXiv:2202.13666 [cs, math].
Article
Over-the-air computing (AirComp) has recently attracted considerable attention as an efficient method of data fusion by integrating uncoded communication transmissions with computation thanks to the signal superposition offered by the multiple access channels. However, appropriate processing is required to neutralize the wireless channel effect. As, internet-of-things (IoT) applications through low-cost devices is the main target of AirComp, perfect availability of channel state information (CSI) is not always practical, there is the need to investigate the effect of imperfect CSI on AirComp. Specifically, we present novel closed-form expressions for tight approximations that can be used to design and evaluate AirComp systems. Furthermore, we design a general optimization framework that takes into account both magnitude and phase errors in the CSI. Finally, a pilot retransmission policy is designed, that offers trade-off between resources cost and the gain in the accuracy of the computations. In order to validate its application, a utility function of the cost of retransmission is introduced, namely, Retransmission Policy Cost (RPC) , which can incorporate the power or throughput cost opposing to the expected gain of the selected policy. Simulations show the deterioration caused by the imperfect CSI and highlight the added value of the proposed policy under various system conditions.
Article
Full-text available
Over-the-air computation (AirComp) is a disruptive technique for fast wireless data aggregation in Internet of Things (IoT) networks via exploiting the waveform superposition property of multiple-access channels. However, the performance of AirComp is bottlenecked by the worst channel condition among all links between the IoT devices and the access point. In this paper, a reconfigurable intelligent surface (RIS) assisted AirComp system is proposed to boost the received signal power and thus mitigate the performance bottleneck by reconfiguring the propagation channels. With an objective to minimize the AirComp distortion, we propose a joint design of AirComp transceivers and RIS phase-shifts, which however turns out to be a highly intractable non-convex programming problem. To this end, we develop a novel alternating minimization framework in conjunction with the successive convex approximation technique, which is proved to converge monotonically. To reduce the computational complexity, we transform the subproblem in each alternation as a smooth convex-concave saddle point problem, which is then tackled by proposing a Mirror-Prox method that only involves a sequence of closed-form updates. Simulations show that the computation time of the proposed algorithm can be two orders of magnitude smaller than that of the state-of-the-art algorithms, while achieving a similar distortion performance.
Article
Full-text available
Federated learning (FL) as a promising edge-learning framework can effectively address the latency and privacy issues by featuring distributed learning at the devices and model aggregation in the central server. In order to enable efficient wireless data aggregation, over-the-air computation (AirComp) has recently been proposed and attracted immediate attention. However, fading of wireless channels can produce aggregate distortions in an AirComp-based FL scheme. To combat this effect, the concept of dynamic learning rate (DLR) is proposed in this work. We begin our discussion by considering multiple-input-single-output (MISO) scenario, since the underlying optimization problem is convex and has closed-form solution. We then extend our studies to more general multiple-input-multiple-output (MIMO) case and an iterative method is derived. Extensive simulation results demonstrate the effectiveness of the proposed scheme in reducing the aggregate distortion and guaranteeing the testing accuracy using the MNIST and CIFAR10 datasets. In addition, we present the asymptotic analysis and give a near-optimal receive beamforming design solution in closed form, which is verified by numerical simulations.
Article
Different from the existing works that focus on transceiver design of over-the-air computation (AirComp) over static networks, we in this paper consider an unmanned aerial vehicle (UAV) aided AirComp system, where the UAV as a flying base station aggregates data from mobile sensors. The trajectory design of the UAV provides an additional degree of freedom to improve the performance of AirComp. We aim to minimize the time-averaged mean-squared error (MSE) of AirComp by jointly optimizing the UAV trajectory, receive normalizing factors, and sensors’ transmit power. To this end, we first propose a novel and equivalent problem transformation by introducing intermediate variables. This reformulation leads to a convex subproblem when fixing any other two blocks of variables, thereby enabling efficient algorithm design based on the principle of block coordinate descent and alternating direction method of multipliers (ADMM) techniques. In particular, we derive the optimal closed-form solutions for normalizing factors and intermediate variables optimization subproblems. We also recast the convex trajectory design subproblem into an ADMM form and obtain the closed-form expressions for each variable updating. Simulation results show that the proposed algorithm achieves a smaller time-averaged MSE while reducing the simulation time by orders of magnitude compared to state-of-the-art algorithms.
Article
Federated learning (FL) is an attractive paradigm for making use of rich distributed data while protecting data privacy. Nonetheless, non-ideal communication links and limited transmission resources may hinder the implementation of fast and accurate FL. In this paper, we study joint optimization of communications and FL based on analog aggregation transmission in realistic wireless networks. We first derive closed-form expressions for the expected convergence rate of FL over the air, which theoretically quantify the impact of analog aggregation on FL. Based on the analytical results, we develop a joint optimization model for accurate FL implementation, which allows a parameter server to select a subset of workers and determine an appropriate power scaling factor. Since the practical setting of FL over the air encounters unobservable parameters, we reformulate the joint optimization of worker selection and power allocation using controlled approximation. Finally, we efficiently solve the resulting mixed-integer programming problem via a simple yet optimal finite-set search method by reducing the search space. Simulation results show that the proposed solutions developed for realistic wireless analog channels outperform a benchmark method, and achieve comparable performance of the ideal case where FL is implemented over error-free wireless channels.
Article
Over-the-air computation (AirComp) is a powerful technique to combine data collected from an enormous number of nodes with limited bandwidth availability. When unmanned aerial vehicles are involved in AirComp as flying fusion centers (FFCs), ultrafast data aggregation can be achieved for ubiquitous sensing and environmental surveillance applications. However, one of the main challenges in AirComp is channel estimation. Whereas most of existing studies assume perfect channel estimation, in this paper, considering the common channel estimation approach using reference signals sent by the FFC, we investigate how imperfect channel state information (CSI) impacts the mean square error (MSE) performance of AirComp. We derive a closed-form expression of MSE, in which the degradation due to the amplitude and phase errors can be identified. Both analytical and simulation results show that the MSE floors are incurred by the imperfect CSI even in the high SNR regime. We observe that the average MSE becomes about 100 times worse compared to that with perfect CSI, which exhibits the significance of accurate channel estimation.
Article
Over-the-air computation (OAC) is a promising technique to realize fast model aggregation in the uplink of federated edge learning (FEEL). OAC, however, hinges on accurate channel-gain precoding and strict synchronization among edge devices, which are challenging in practice. As such, how to design the maximum likelihood (ML) estimator in the presence of residual channel-gain mismatch and asynchronies is an open problem. To fill this gap, this paper formulates the problem of misaligned OAC for FEEL and puts forth a whitened matched filtering and sampling scheme to obtain oversampled, but independent samples from the misaligned and overlapped signals. Given the whitened samples, a sum-product ML (SP-ML) estimator and an aligned-sample estimator are devised to estimate the arithmetic sum of the transmitted symbols. In particular, the computational complexity of our SP-ML estimator is linear in the packet length, and hence is significantly lower than the conventional ML estimator. Extensive simulations on the test accuracy versus the average received energy per symbol to noise power spectral density ratio (EsN0) yield two main results: 1) In the low EsN0 regime, the aligned-sample estimator can achieve superior test accuracy provided that the phase misalignment is not severe. In contrast, the ML estimator does not work well due to the error propagation and noise enhancement in the estimation process. 2) In the high EsN0 regime, the ML estimator attains the optimal learning performance regardless of the severity of phase misalignment. On the other hand, the aligned-sample estimator suffers from a test-accuracy loss caused by phase misalignment.
Article
Machine learning and wireless communication technologies are jointly facilitating an intelligent edge, where federated edge learning (FEEL) is emerging as a promising training framework. As wireless devices involved in FEEL are resource limited in terms of communication bandwidth, computing power and battery capacity, it is important to carefully schedule them to optimize the training performance. In this work, we consider an over-the-air FEEL system with analog gradient aggregation, and propose an energy-aware dynamic device scheduling algorithm to optimize the training performance within the energy constraints of devices, where both communication energy for gradient aggregation and computation energy for local training are considered. The consideration of computation energy makes dynamic scheduling challenging, as devices are scheduled before local training, but the communication energy for over-the-air aggregation depends on the $l_{2}$ -norm of local gradient, which is known only after local training. We thus incorporate estimation methods into scheduling to predict the gradient norm. Taking the estimation error into account, we characterize the performance gap between the proposed algorithm and its offline counterpart. Experimental results show that, under a highly unbalanced local data distribution, the proposed algorithm can increase the accuracy by 4.9% on CIFAR-10 dataset compared with the myopic benchmark, while satisfying the energy constraints.
Article
italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Over-the-air federated edge learning (Air-FEEL) has emerged as a communication-efficient solution to enable distributed machine learning over edge devices by using their data locally to preserve the privacy. By exploiting the waveform superposition property of wireless channels, Air-FEEL allows the “one-shot” over-the-air aggregation of gradient-updates to enhance the communication efficiency, but at the cost of a compromised learning performance due to the aggregation errors caused by channel fading and noise. This paper investigates the transmission power control to combat against such aggregation errors in Air-FEEL. Different from conventional power control designs (e.g., to minimize the individual mean squared error (MSE) of the over-the-air aggregation at each round), we consider a new power control design aiming at directly maximizing the convergence speed. Towards this end, we first analyze the convergence behavior of Air-FEEL (in terms of the optimality gap) subject to aggregation errors at different communication rounds. It is revealed that if the aggregation estimates are unbiased, then the training algorithm would converge exactly to the optimal point with mild conditions; while if they are biased, then the algorithm would converge with an error floor determined by the accumulated estimate bias over communication rounds. Next, building upon the convergence results, we optimize the power control to directly minimize the derived optimality gaps under the cases without and with unbiased aggregation constraints, subject to a set of average and maximum power constraints at individual edge devices. We transform both problems into convex forms, and obtain their structured optimal solutions, both appearing in a form of regularized channel inversion, by using the Lagrangian duality method. Finally, numerical results show that the proposed power control policies achieve significantly faster convergence for Air-FEEL, as compared with benchmark policies with fixed power transmission or conventional MSE minimization.
Article
Wireless data aggregation (WDA), referring to aggregating data distributed at devices (e.g., sensors and smartphones), is a common operation in 5G-and-beyond machine-type communications to support the Internet of Things (IoT), which lays the foundation for diversified applications such as distributed sensing, learning, and control. Conventional WDA techniques, which are designed in a task-agnostic manner separating communication and computation, encounter difficulty in accommodating the massive access under the limited radio resource and stringent latency constraints imposed by emerging applications or tasks (e.g, auto-driving). To address this issue, over-the-air computation (AirComp) is being developed as a new task-oriented solution for WDA by seamlessly integrating communication and computation. By exploiting the waveform superposition property of multiple access channels, AirComp turns the air into a computer for computing and communicating functions of distributed data at many devices, thereby allowing low-latency WDA over massive IoT networks. In view of growing interest in Air-Comp, this article provides a timely overview of the technology by introducing basic principles, discussing advanced techniques and applications, and identifying promising research opportunities.