ArticlePDF Available

Adjusting forward error correction with quality scaling for streaming MPEG

Authors:

Abstract

Packet loss can severely impact streaming video quality. Repair techniques protect streaming video from packet loss but at the price of a reduced effective transmission rate when streaming a flow in a capacity constrained situation. This paper proposes an algorithm that optimizes the choice of Forward Error Correction (FEC) to repair packet loss for streaming MPEG videos under a capacity constraint with quality scaling. An analytic model is developed to estimate the video quality of streaming MPEG given a quality scaling level and a specific FEC strength. Given network conditions in terms of packet loss rate, the model searches the total variable space to find the combination of FEC and scaling that yields the optimal quality under the capacity constraint. Analysis over a range of network conditions indicates that adjusting FEC with quality scaling provides significant performance improvement.
Adjusting Forward Error Correction with Quality Scaling
for Streaming MPEG
Huahui Wu, Mark Claypool, and Robert Kinicki
Computer Science Department
Worcester Polytechnic Institute
100 Institute Rd, Worcester, MA 01609, USA
{flashine|claypool|rek}@cs.wpi.edu
ABSTRACT
Packet loss can severely impact streaming video quality. Re-
pair techniques protect streaming video from packet loss but
at the price of a reduced effective transmission rate when
streaming a flow in a capacity constrained situation. This
paper proposes an algorithm that optimizes the choice of
Forward Error Correction (FEC) to repair packet loss for
streaming MPEG videos under a capacity constraint with
quality scaling. An analytic model is developed to estimate
the video quality of streaming MPEG given a quality scal-
ing level and a specific FEC strength. Given network con-
ditions in terms of packet loss rate, the model searches the
total variable space to find the combination of FEC and
scaling that yields the optimal quality under the capacity
constraint. Analysis over a range of network conditions in-
dicates that adjusting FEC with quality scaling provides
significant performance improvement.
Categories and Subject Descriptors: C.2.m Computer-
Communication Networks: Miscellaneous
General Terms: Performance, Design
Keywords: Streaming MPEG, Quality Scaling, Forward
Error Correction
1. INTRODUCTION
The growth in the power and connectivity of the Internet
has sparked an even larger growth in streaming media. Al-
though many compression techniques have been introduced
to reduce the streaming bitrate [11], video streaming over
the Internet is often still limited by the network capacity.
For example, one capacity constraint comes from the Inter-
net Service Provider’s (ISP) negotiated rate that restricts
capacity on the home user’s last mile. Another capacity
constraint comes from the growing consensus that all Inter-
net traffic must be TCP-Friendly. A flow is TCP-Friendly
if its data rate does not exceed the maximum data rate of a
conformant TCP connection under equivalent network con-
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies are
not made or distributed for profit or commercial advantage and that copies
bear this notice and the full citation on the first page. Tocopy otherwise, to
republish, to post on servers or to redistribute to lists, requires prior specific
permission and/or a fee.
NOSSDAV’05, June 13–14, 2005, Stevenson, Washington, USA.
Copyright 2005 ACM 1-58113-987-X/05/0006 ...$5.00.
ditions. There are proposed approaches to detect and re-
strict the capacity of non-TCP friendly flows [9]. Streaming
media applications that ignore these constraints suffer data
loss in the form of dropped packets.
While multimedia applications can tolerate some data loss,
excessive packet loss yields unacceptably low quality. Since
video encoding relies upon frame dependencies to achieve
high compression rates, the random dropping of packets by
routers can seriously degrade video quality. For example, as
little as 3% MPEG packet loss can cause 30% of the frames
to be undecodable.
To avoid the latency and variance in latency caused by re-
transmission of lost packets, streaming media flows often use
Forward Error Correction (FEC) to reconstruct lost stream-
ing media packets. However, FEC adds redundant repair
data to the original video stream. Current approaches use
either apriori, static FEC choices or FEC that adapts to
perceived packet loss on the network. However none of the
published adaptive FEC schemes account for the additional
FEC overhead against a capacity constraint. With a capac-
ity constraint, adding FEC reduces the effective transmis-
sion rate of the original video content [10].
To preserve real-time playout, multimedia servers use me-
dia scaling to reduce the streaming data rate to match the
capacity constraint. With quality scaling, the multimedia
server adjusts the quantization level before transmission to
reduce the streaming bitrate. When used in conjunction
with FEC, a multimedia application can increase the quan-
tization level even higher to save capacity for the FEC over-
head. Hence, selecting the appropriate amount of FEC and
the corresponding quantization level can be cast as a con-
strained optimization problem that attempts to optimize the
video streaming quality.
Previously, we derived a model of playable frame rate for
streaming MPEG [21] that did not account for media scal-
ing. This paper extends the analytic model to characterize
quality-scaled MPEG video performance with FEC in the
presence of packet loss. For a given loss rate and MPEG
characteristics, the new model utilizes a specified quality
scaling level and number of FEC packets for each MPEG
frame type to estimate video distortion. Based on the model,
the optimization algorithm exhaustively searches all pos-
sible combinations of FEC and scaling level to determine
the configuration that yields the best capacity-constrained
video quality. Experimental results show that adjusting
FEC with quality scaling (QAFEC) provides significant ben-
efits to video quality.
The remainder of the paper is organized as follows: Sec-
tion 2 provides background for the work in this paper; Sec-
tion 3 introduces the analytical quality model; Section 4
presents the optimization algorithm; Section 5 discusses ex-
periments, including a preliminary user study, and analyzes
the results; and Section 6 summarizes the paper and presents
possible future work.
2. BACKGROUND
2.1 MPEG and Quality Scaling
There are four frame types defined in MPEG-1 [11]: I
(intra-coded) frames, P (predictive-coded) frames, B (bi-
directionally predictive-coded) frames and D (DC-coefficients
only) frames. However, the seldom-used D frame is excluded
from our model of MPEG. MPEG video typically repeats a
pattern of I, P, and B frames (known as a Group of Pictures
or GOP) for the duration of a video stream. Figure 1 shows
the typical GOP used in this paper.
I0 B00 B01 P1 B10 P2 I0
Figure 1: A typical MPEG Group Of Pictures
In MPEG, the DCT coefficient of the video signal is quan-
tized by dividing by an integer (the quantization value), and
rounding to the nearest integer. When using higher quan-
tization values, each MPEG frame is encoded with lower
precision, and transmitted with fewer bits. Thus, this qual-
ity scaling technique reduces the bitrate of the streaming
video.
Proposed mechanisms for quality scaling include: adap-
tive quantization values [20], which adapts the encoding
quantization value to network capacity; signal-to-noise ratio
(SNR) scalability [16], which encodes a video clip into multi-
ple layers and streams as many layers as possible; MPEG-4
fine granularity scalability (FGS) [7], which is a special case
of SNR scalability that provides continuous scalability us-
ing partial enhancement; and scalable MPEG (SPEG) [6],
which transcodes MPEG’s DCT coefficients to a base level
plus three enhanced levels and transmits different numbers
of levels.
While this report focuses on using adaptive quantization
values, the model developed (Section 3) is independent of
the scaling technique and only requires the relationships be-
tween scaling level, encoding bitrate and video quality.
2.2 Forward Error Correction(FEC)
Reed-Solomon (R-S) code [15] is a media-independent FEC
technique that can be applied at the packet level. As shown
in Figure 2, an application level video frame is modeled as
being transmitted in Kpackets where Kvaries with frame
type, encoding method, and media content. R-S code adds
(NK) redundant packets to the Koriginal packets and
sends the Npackets over the network. Although some pack-
ets may be lost, e.g. packet 2 in Figure 2, the frame still
can be completely reconstructed if any Kor more packets
are successfully received.
1 2 K Original Video Frame
1 2 K K+1 N After Adding FEC Packets
1 K K+1 N
1 2 K
After Network Transmission
(some packets are lost)
After Reconstruction
(with any K correct packets)
Figure 2: Reed-Solomon code
To analyze the effects of FEC on application layer frames,
the sending of packets is modeled as a series of independent
Bernoulli trials. Thus, the probability q(N, K, p) that a K-
packet video frame is successfully transmitted with NK
redundant FEC packets along a network path with packet
loss probability pis:
q(N, K, p) =
N
X
i=K»„ N
i«(1 p)ipNi(1)
3. MODEL
3.1 Parameters and Variables
System Layer Parameters
Application SI, SP, SB, NP, NB, RF
Presentation SIF , SP F , SB F , l
Network p, tRTT , s, T
Table 1: System Layers and Parameters
The system layers and parameters for our analytic model
are indicated in Table 1, where the parameters are:
RF: the maximum playable frame rate achieved when
there is enough capacity and no loss (While typical frame
rates are 25 fps (PAL) or 29.97 fps (NTSC), this paper uses
the NTSC frame rate and rounds it up to RF= 30f ps).
SI,SP,SB: the size of I, P or B frames respectively, in
fixed size packets.
NP,NB: the number of P or B frames in one GOP, re-
spectively.
SIF ,SP F ,SBF : the number of FEC packets added to
each I, P or B frame, respectively.
lthe quality scaling level, assuming I, P and B frames
have the same level.1
s: the packet size (in bytes).
p: the packet loss probability.
tRT T : the round-trip time (in milliseconds).
T: the capacity constraint, limited by the ISP or by a
TCP-Friendly rate [13].
3.2 Distortion from Quality Scaling
When a video is streamed over an unreliable network un-
der a capacity constraint, its perceptual quality is degraded
1While using different quality levels for different frame types
is possible, capturing the quality dependencies between dif-
ferent frames is difficult. This remains as future work.
by two factors: quality scaling and frame loss. Scaling dis-
tortion, caused by a high quantization value, appears visu-
ally as coarse granularity in every frame. Frame loss, due to
network packet loss, yields jerkiness in the video playout.
This paper uses the Video Quality Model (VQM) [14], an
objective video quality measurement, to approximate the
distortion due to quality scaling. Section 3.3 uses playable
frame rate to estimate the distortion from frame loss. Sec-
tion 3.4 presents a new measurement, namely, distorted play-
able frame rate that combines these two factors.
VQM takes an original video and a distorted video as
input and returns a distortion value Dbetween 0 (no dis-
tortion) and 1 (maximum distortion). Previous research [3]
implies that perceptual video distortion varies exponentially
with the quantization value. Employing VQM to measure
Din videos encoded with varying quantization levels, our
preliminary studies show that Dcan be approximated as an
exponential function of the quantization value l:
D=ˆ
D·lλD(2)
where lis the quantization level, ˆ
Dis the VQM distortion
when l= 1, and λDis the exponential coefficient. Table 3 in
Section 5.2 provides an example that shows how accurately
this function fits real video data.
3.3 Playable Frame Rate
3.3.1 Frame Size
Frame sizes change with the quality scaling level. Pre-
vious researches [3, 19] demonstrate that MPEG streaming
bitrate can be approximated by an exponential function of
the quality scaling level l. Our preliminary experiments sug-
gest frame size can be estimated by exponential functions of
quantization value lgiven as:
8
<
:
SI=ˆ
SI·lλI
SP=ˆ
SP·lλP
SB=ˆ
SB·lλB
(3)
where lis the quantization level, ˆ
Sis the frame size when
l= 1, and λis the exponential coefficient. Note, all the
results Sare rounded up to the nearest integer dSesince
video frames must be sent over the network in an integer
number of packets. Table 3 in Section 5.2 shows how accu-
rately these functions fit real video frame sizes.
3.3.2 Successful Frame Transmission Probability
Given I, P, and B frame sizes, and the distribution of re-
dundant FEC packets added to each frame type, Equation 1
provides the probability of successful transmission for each
frame type, knowing the amount of redundancy added by
media-independent FEC:
8
<
:
qI=q(SI+SIF , SI, p)
qP=q(SP+SP F , SP, p)
qB=q(SB+SBF , SB, p)
(4)
3.3.3 Playable Frame Rate
Our previous work [21] derived a model to estimate total
playable frame rate for streaming MPEG:
R=G·qI·(1 + qPqNP+1
P
1qP+NBP ·qB
·(qPqNP+1
P
1qP+qI·qNP
P))
(5)
where Gis the constant GOP rate, NPis the number of P
frames, NBP is the number of B frames between two refer-
ence frames (I or P frame), and qis the successful trans-
mission probability of I, P or B frames determined from
Equation 4.
Treating I, P and B frame sizes as functions of the quality
scaling level lgiven in Equation 3, the model can be ex-
tended to estimate the playable frame rate and subsequently
used to estimate the quality distortion due to frame loss.
3.4 Distorted Playable Frame Rate
Quality scaling uses a higher quantization value to encode
the video, causing intra-frame quality distortion. Frame loss
lowers the playable frame rate and is referred to as inter-
frame quality distortion.
Since the inter-frame and intra-frame distortion compo-
nents are independent, it is assumed they contribute inde-
pendently to the overall distortion. Hence, quality distortion
can be represented by a function of these two factors. To
stream the highest quality video possible, the media server
needs to use the best quality scaling level and the media
client needs to receive all the frames. Thus, these two fac-
tors are combined into a multiplicative function, referred to
as the distorted frame rate, RD,:
RD= (1 D)·R(6)
where Dis the quality distortion from Equation 2 and Ris
the playable frame rate from Equation 5.
The motivation behind RDis as follows. If a video is
streamed with the best quantization value, its quality scaling
distortion is 0 and video quality is determined only by the
playable frame rate R. With any other quantization value,
every frame carries less visual detail and its contribution to
the video quality (measured by frame rate) is reduced by
the quality distortion D. A preliminary user study (shown
in Section 5.3) shows a correlation between user perceptual
quality and distorted playable frame rate RD. This suggests
that RDmay be a reasonable representation of overall video
quality.
4. OPTIMIZATION ALGORITHM
For given network conditions and MPEG video parame-
ters, the total distorted playable frame rate RDvaries with
quality scaling level and the amount of FEC for each frame
type as a function RD(l, (SI F , SPF , SBF )) where streaming
bitrate is limited by a capacity constraint, T. Thus, this
model can optimize the distorted playable frame rate, RD,
using the following operation research equation:
8
>
>
>
<
>
>
>
:
Maximize :
RD= (1 D(l)) ·R(l, (SI F , SPF , SBF ))
Subject to :
G·((SI(l) + SIF ) + NP·(SP(l) + SP F )
+NB·(SB(l) + SBF )) T
(7)
Unfortunately, finding a closed form solution for the non-
linear function RDis difficult since there are many saddle
points. However, given that the optimization problem is
expressed in terms of integer variables over a restricted do-
main, an exhaustive search of the discrete space is feasible.
With fixed input values for (p, RT T, s), (G, NP, NB) and
functions of (SI(l),SP(l),SB(l)), each set of values of land
(SIF ,SP F ,SBF ) determine the distorted playable frame
rate RDusing the following steps:
1. Approximate the video frame sizes (SI,SPand SB)
using lin Equation 3.
2. Estimate total video streaming bitrate using the video
frame sizes and the FEC frame sizes. If the estimated
bitrate is larger than the capacity constraint T, the set
of values of land (SIF ,SP F ,SBF ) are invalid and RD
is returned as 0.
3. Otherwise, use the video frame sizes and the FEC sizes
to determine the successful transmission probabilities
(qI,qPand qB) from Equation 4.
4. Estimate playable frame rate Rby inputting (qI,qP
and qB) into Equation 5.
5. Use lin Equation 2 to approximate D.
6. Employ Rand Din Equation 6 to estimate the dis-
torted playable frame rate, RD.
With these steps for each set of values, the space of possi-
ble values for land (SIF ,SP F ,SBF ) is exhaustively searched
to determine the quality level and the amount of FEC pack-
ets for each frame type that yields the maximum distorted
playable frame rate under the capacity constraint. Since the
search can be done in real-time 2, the determination of opti-
mal choices for adaptive FEC and quality scaling is feasible
for streaming MPEG.
While the quality adjusted FEC (QAFEC) model assumes
MPEG frame sizes are constant for the entire video, in prac-
tice MPEG frame sizes change from one GOP to the next.
To assess the impact of changing frame sizes on the model,
we used the QAFEC model to determined the adjusted FEC
and scaling assuming an average fixed frame size. Then, a
simulation using trace-based frame sizes [18] was run and ac-
tual playable frame rate at the receiver was measured. The
frame rate predicted by the model was slightly higher than
the actual frame rate achieved. However, the difference was
negligible for loss rates around 1% and was less than 10%
for higher loss rates. Details can be found in [22].
5. EXPERIMENTS
5.1 Methodology
Using the optimization algorithm, the distorted playable
frame rates over a range of network and application settings
are explored. For each set of network and application pa-
rameters, the playable frame rates are compared for MPEG
streaming with quality adjusted FEC, MPEG streaming with
two types of fixed FEC, and MPEG streaming without FEC:
1. Quality Adjusted FEC: Before transmission, the server
employs the optimization algorithm based on the QA-
FEC model to determine the FEC and quality scaling
levels that produce the maximum distorted playable
frame rate RDand uses these for the entire video trans-
mission.
2It takes about 100 milliseconds to find the best FEC and
scaling pattern using our approach on a Pentium-3 800 MHz
PC. Optimizations of the code and a faster machine will
allow searching to be done even faster.
2. Large Fixed FEC: The server protects each frame with
15% FEC packets (moved up to the nearest integer).
This FEC pattern provides strong protection to each
frame and roughly represents the relative importance
of the I, P and B frames [4, 8].
3. Small Fixed FEC: Each I frame receives 1 FEC packet.
This simple FEC pattern protects the most important
frame, the I frame. Protecting the I frame is a scheme
used by other researchers [2, 17].
4. Non-FEC: No FEC is added to the video.
The total bitrate used by the MPEG video and FEC is
scaled to meet a TCP-Friendly capacity constraint [13] us-
ing quality scaling. While other published approaches dy-
namically adapt FEC to current packet loss conditions, the
resultant total bitrates do not satisfy a capacity constraint.
Thus, there is no simple way to compare QAFEC to other
dynamic FEC approaches.
5.2 System Settings
Network Layer Application Layer
tRT T 50 ms NP4 frames per GOP
s1 Kbyte NB10 frames per GOP
p0.01 to 0.04 RF30 frames per second
Table 2: System Parameter Settings
Table 2 presents the system parameter settings for the
network and application layers. A commonly-used MPEG
GOP pattern, ‘IBBPBBPBBPBBPBB’, and a typical full motion
frame rate RFof 30 frames per second (fps) are used. The
packet size s, round-trip time tRT T and packet loss prob-
ability pare chosen based on the characteristics of many
network connections [1, 5]. For all experiments, the param-
eters are fixed, except for packet loss probability p, which
was varied from 0.01 to 0.04 in steps of 0.002.
Two picture sequences are used. The first video, Paris,
from PictureTel, shows two people sitting at a table and
talking while making high-motion gestures. It has 900 raw
images and lasts for 30 seconds, providing a frame rate
of 30 fps. The image size is 352x288 pixels (CIF). The
Berkeley MPEG encoder mpeg encode [12] is used to en-
code the images with different quantization values as one-
minute long videos. From the output videos, the frame
sizes are extracted with the Berkeley MPEG statistics tool
mpeg stat [12] and the quality distortion is extracted with
VQM. Statistical analysis software SPSS 3is used to fit the
relation between quality distortion and lto a function as in
Equation 2, and the relation between frame sizes and lin
Equation 3. The equations then become:
8
>
>
<
>
>
:
D= 0.025 ·l0.87
SI= 81.51 ·l0.70
SP= 52.94 ·l1.21
SB= 15.47 ·l0.79
(8)
Table 3 shows how these analytical functions fit the Paris
data with some representative quantization values. In the
table, lis the quantization value, D
0,S
0
I,S
0
P,S
0
Bare esti-
mated by the analytical functions, and D,SI,SP,SBare
3http://www.spss.com
the real values from the video analysis. Overall, the func-
tions fit the data well.
l D
0
D S
0
ISIS
0
PSPS
0
BSB
5 0.10 0.09 26.4 26.5 7.5 7.8 4.3 4.5
8 0.15 0.15 19.0 19.5 4.2 4.6 2.9 2.8
12 0.21 0.22 14.3 14.5 2.6 2.8 2.1 2.0
18 0.31 0.33 10.7 10.7 1.6 1.6 1.5 1.4
24 0.39 0.38 8.8 8.7 1.1 1.1 1.2 1.2
31 0.49 0.48 7.3 7.2 0.8 0.7 1.0 1.1
Table 3: Estimated Value by Equation 8 versus Real
Video Data
The second video sequence is Tennis, which comes with
the Berkeley tools. Tennis, a short clip of two men play-
ing ping-pong, has 150 raw images and lasts for 5 seconds,
providing a frame rate of 30 fps. The size of each frame
is 352x240 pixels. Again, the Berkeley MPEG tools, VQM
and SPSS are used to approximate the quality distortion
and frame sizes to functions of quality scaling level:
8
>
>
<
>
>
:
D= 0.041 ·l0.69
SI= 74.55 ·l0.86
SP= 96.22 ·l1.31
SB= 33.27 ·l1.01
(9)
Due to space constraints, the functional fit for Tennis is
not shown, but the approximation errors are less than 5%.
Note, analyzing videos and fitting the results to these ex-
ponential functions is a time-intensive operation. It may be
possible to analyze a large variety of video types and find
parameters for the exponential functions, based either on
the frames sizes, frame rates and video content, that are
generally effective. This analysis is left as future work.
5.3 Analysis of Results
The Paris and Tennis videos are used to approximate
their quality scaling distortion Dand frame sizes to func-
tions as Equation 2 and Equation 3 and instantiate them to
Equation 8 and Equation 9. These functions and the opti-
mization algorithm are used to search the FEC and quality
scaling values to produce the maximum distorted playable
frame rate for the four approaches in Section 5.1.
Figure 3 graphs the distorted playable frame rates for the
four FEC choices for the Paris and Tennis videos. The x-
axes are the packet loss probabilities, and the y-axes are the
distorted playable frame rates. From the data in these fig-
ures, quality adjusted FEC provides the best quality under
all network and video conditions. The benefits of quality
adjusted FEC over non-FEC are substantial, with quality
adjusted FEC providing 5-10 more frames per second for
all rates. The small fixed FEC approaches usually improves
playable frame rates over non-FEC video, especially when
loss rates are high. However, the small fixed FEC frame
rates are still much lower than the frame rates with quality
adjusted FEC. Large FEC achieves the playable frame rate
provided by quality adjusted FEC for low loss rates because
the TCP-Friendly rate is relatively high. With limited ca-
pacity (at high loss rates), the quality scaling level is high
(>16), and the large FEC overhead becomes significant.
There is little reduction in the frame sizes as the scaling
level increases (to a maximum of 31) and quality scaling is
0
5
10
15
20
25
30
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 0.045
Distorted Playable Frame Rate
p
Quality Adjusted FEC
Large Fixed FEC
Small Fixed FEC
Non-FEC
a. Paris Video
0
5
10
15
20
25
30
0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04
Distorted Playable Frame Rate
p
Quality Adjusted FEC
Large Fixed FEC
Small Fixed FEC
Non-FEC
b. Tennis Video
Figure 3: Distorted Playable Frame Rates
unable to conserve enough bitrate to overcome the effect of
the large FEC overhead. The result is that none of the orig-
inal video data is sent. These trends hold for both videos
despite differences in the content between the two clips.
A preliminary user study was conducted to compare the
impact of FEC choices on actual users and begin to ascertain
the efficacy of the distorted playable frame rate measure,
RD, as a measure of perceived visual quality. Four versions
of the Paris video clips were generated, simulating the Qual-
ity Adjusted FEC approach, the two fixed FEC strategies
and the Non-FEC approach on a network with 0.02 packet
loss and a TCP-Friendly constraint of 1.17 Mbps. No local
concealment technique is used; if a frame was not playable,
the preceding playable frame was repeated. Ten undergrad-
uate students were asked to rate the quality from 0 (worst)
to 10 (best) of the four videos using the original videos with
no packet loss as a reference. The ratings were provided
after viewing each clip twice in a different random order by
each student.
Repair method D R RDQ
Quality Adjusted FEC 0.17 28.55 23.78 6.89
Small Fixed FEC 0.20 23.58 18.90 4.44
Large Fixed FEC 0.44 30.00 16.93 3.89
Non-FEC 0.28 20.17 14.61 3.50
Table 4: Preliminary User Study
Table 4 displays the average user quality rating Qcom-
pared with the VQM distortion D, the playable frame rate
R, and the distorted playable frame rate RDfor the four
videos. The results indicate that the perceived qualities of
the videos with FEC are significantly higher than the videos
without FEC. Additionally, QAFEC videos appear notice-
ably better under all the conditions than videos with fixed
FEC. Perhaps most importantly, the correlation between Q
and RDsuggests that the distorted playable frame rate RD
appropriately represents perceived quality, accounting for
both the temporal aspects of the video that influence per-
ceived quality (R) and the quality aspects of the video that
influence perceived quality (D).
It may be surprising that non-FEC has a higher VQM dis-
tortion Dsince non-FEC has a higher encoding bitrate with-
out any FEC overhead. The reason is, if non-FEC chooses
a better (lower) quantization level lunder the capacity con-
straint, it will increase the frame sizes, reducing the suc-
cessful transmission probability for each frame and getting
a much lower playable frame rate. So, non-FEC selects a
conservative quantization level to get better overall quality.
6. SUMMARY
This investigation studies adjusting FEC with quality scal-
ing for streaming MPEG. An analytic model (QAFEC) is
proposed that captures the quality distortion of streaming
MPEG in the presence of quality scaling and frame loss.
Using this model, an optimization algorithm determines the
optimal adjustment of FEC and quality scaling under a ca-
pacity bound, accounting for both the network conditions
and video parameters.
The analytic experiments show that QAFEC has signif-
icant advantages. Quality adjusted FEC always achieves
higher quality than MPEG video without FEC and provides
higher video quality than fixed FEC approaches when taken
over a range of MPEG encoding and network conditions. A
preliminary user study confirms the experiments in demon-
strating that the perceived quality of the videos with quality
adjusting FEC are significantly better than the videos with
fixed FEC or without FEC. The user study also suggests
that the proposed distorted playable frame rate, RD, has
the appropriate trend in capturing the distortion to video
quality from quality scaling and frame loss.
Our ongoing work includes: implementing a streaming
MPEG system that includes the model and the optimiza-
tion algorithm; adding other quality scaling techniques, such
as MPEG FGS to the model; and extending the model to
compare and combine quality scaling and temporal scaling.
7. REFERENCES
[1] J. Chung, M. Claypool, and Y. Zhu. Measurement of the
Congestion Responsiveness of RealPlayer Streaming Video
Over UDP. In Proceedings of the Packet Video Workshop
(PV), Nantes, France, Apr. 2003.
[2] N. Feamster and H. Balakrishnan. Packet Loss Recovery for
Streaming Video. In Proceedings of 12th International
Packet Video (PV) Workshop, Pittsburgh, PA, USA, Apr.
2002.
[3] P. Frossard and O. Verscheure. Joint Source/FEC Rate
Selection for Quality-Optimal MPEG-2 Video Delivery.
IEEE Transactions on Image Processing,
10(12):1815–1825, Dec. 2001.
[4] F. Hartanto and H. R. Sirisena. Hybrid Error Control
Mechanism for Video Transmission in the Wireless IP
Networks. In Proceedings of LANMAN, Nov. 1999.
[5] S. Jaiswal, G. Iannaccone, C. Diot, J. Kurose, and
D. Towsley. Inferring TCP Connection Characteristics
Through Passive Measurements. In Proceedings of IEEE
Infocom, Hongkong, China, Apr. 2004.
[6] C. Krasic, J. Walpole, and W.-C. Feng. Quality-adaptive
Media Streaming by Priority Drop. In Proceedings of
NOSSDAV, pages 112–121, Monterey, CA, USA, June
2003. ACM Press.
[7] W. Li. Overview of Fine Granularity Scalability in
MPEG-4 Video Standard. IEEE Transaction on Circuits
and System for Video Technology, 11(3), Mar. 2001.
[8] Y. Liu and M. Claypool. Using Redundancy to Repair
Video Damaged by Network Data Loss. In Proceedings of
ACM MMCN, Jan. 2000.
[9] R. Maha jan, S. Floyd, and D. Wetherall. Controlling
High-Bandwidth Flows at the Congested Router. In
Proceedings of the 9th International Conference on
Network Protocols (ICNP), Nov. 2001.
[10] K. Mayer-Patel, L. Le, and G. Carle. An MPEG
Performance Model and Its Application To Adaptive
Forward Error Correction. In Proceedings of ACM
Multimedia, December 2002.
[11] J. Mitchell and W. Pennebaker. MPEG Video:
Compression Standard. Chapman and Hall, 1996. ISBN
0412087715.
[12] U. of California Berkeley. Berkeley MPEG-2 Encoder and
Player. Internent site
http://bmrc.berkeley.edu/frame/research/mpeg/.
[13] J. Padhye, V. Firoiu, D. Towsley, and J. Kurose. Modeling
TCP Throughput: A Simple Model and Its Empirical
Validation. In Proceedings of ACM SIGCOMM, Vancouver,
Brisish Columbia, Candada, 1998.
[14] M. Pinson and S. Wolf. A New Standardized Method for
Objectively Measuring Video Quality. IEEE Transactions
on Broadcasting, 50(3):312–322, Sept. 2004.
[15] I. S. Reed and G. Solomon. Polynomial Codes Over Certain
Finite Fields. Journal of the Society of Industrial and
Applied Mathematics (SIAM), 8(2):300–304, June 1960.
[16] R. Rejaie, M. Handley, and D. Estrin. Quality Adaptation
for Congestion Controlled Video Playback overthe Internet.
In Proceedings of SIGCOMM, pages 189–200, Cambridge,
MA, Aug. 1999.
[17] I. Rhee. Error Control Techniques for Interactive Low-Bit
Rate Video Transmission over the Internet. In Proceedings
of ACM SIGCOMM Conference, Sept. 1998.
[18] O. Rose. Statistical Properties of MPEG Video Traffic and
their Impact on Traffic Modeling in ATM Systems. In
Proceedings of 20th Conference of Local Computer
Networks, pages 397–406, Oct 1995.
[19] S. Sakazawa, Y. Takishima, M. Wada, and Y. Hatori.
Coding Control Scheme for a Multi-Encoder System. In
Proceedings of 7th International Workshop on Packet
Video (PV), Mar. 1996.
[20] W.-T. Tan and A. Zakhor. Real-Time Internet Video Using
Error Resilient Scalable Compression and TCP-Friendly
Transport Protocol. IEEE Transactions on Multimedia,
1(2):172–186, 1999.
[21] H. Wu, M. Claypool, and R. Kinicki. A Model for MPEG
with Forward Error Correction and TCP-Friendly
Bandwidth. In Proceedings of NOSSDAV, Monterey, CA,
USA, June 2003.
[22] H. Wu, M. Claypool, and R. Kinicki. Adjusting Forward
Error Correction with Temporal Scaling for TCP-Friendly
Streaming MPEG. Technical Report WPI-CS-TR-03-10, CS
Department, Worcester Polytechnic Institute, Apr. 2003.
... Since video coding involves interframe dependencies to achieve high compression rates, the random dropping of packets by routers and/or random bit errors due to a highly additive Whit Gaussian noise over wireless can both seriously degrade video quality. Hence in wired MPEG transmission [8], for example, dropping packets from an independently encoded I frame causes the following dependent P and B frames to be fully undecodable. In practice, interframe dependencies have been shown to cause a 3% packet loss rate to result in a 30 % frame loss rate. ...
... To address the above interaction, we should provide a high quality of service (QoS) for video applications, by meaning high video play-out quality, at high loss rates over wireless link; whilst several studies [3][4][5][6][7][8][9][10][11][12] have pursued both error-control techniques of media adaptation, as well as network-adaptation. The network-adaptation can be efficiently employed by adapting the end-system to the changing network conditions, whereas adaptation in general meaning represents the ability of network protocols and applications to observe and respond to the channel variations. ...
... The loss of one P frame can make some of other P and B frames undecodable, and the loss of one I frame can result in the loss of the whole GoP. This implies that I frames are more important than P frames, and P frames are more important than B frames [8][9]. A GoP structure is expressed as ( N , M ), where M corresponds to the number of P frames in a GoP and M corresponds to the number of B frames between I and P frames. ...
Conference Paper
Full-text available
Wireless video transmission suffers errors from the dynamic wireless environment. Due to errors, the discarded link layer packets impose a serious limitation on the maximum achievable throughput over wireless channel. To face this challenge and to enhance the overall TCP-Friendly video throughput, this paper proposes an MPEG packet loss model which is based on Forward Error Correction (FEC) over wired-to-wireless channel. Within this model, a FEC packet level scheme is used to act as an inter-protection control, which is based on Reed-Solomon (RS) code with the aim of bringing a robust transmission against frequent packet loss. Further, a BPSK scheme is applied for an Additive White Gaussian Noise (AWGN) wireless channel. By using this model, the predicted frame rate for MPEG video streaming can be estimated. Quality of Service (QoS) in terms of frame rate and the quality factor are evaluated for the predicted quantizer scale. The numerical results demonstrate that the proposed scheme improves the QoS of video transmissions in the presence of high wireless channel bit errors. Keywords— Wireless video, Video quality, TCP-Friendly, Quality of Service (QoS), Forward Error Correction (FEC), Reed-Solomon (RS) code.
... Since video coding involves interframe dependencies to achieve high compression rates, the random dropping of packets by routers and/or random bit errors due to a highly additive Whit Gaussian noise over wireless can both seriously degrade video quality. Hence in wired MPEG transmission [8], for example, dropping packets from an independently encoded I frame causes the following dependent P and B frames to be fully undecodable. In practice, interframe dependencies have been shown to cause a 3% packet loss rate to result in a 30 % frame loss rate. ...
... To address the above interaction, we should provide a high quality of service (QoS) for video applications, by meaning high video play-out quality, at high loss rates over wireless link; whilst several studies [3][4][5][6][7][8][9][10][11][12] have pursued both error-control techniques of media adaptation, as well as network-adaptation. The network-adaptation can be efficiently employed by adapting the end-system to the changing network conditions, whereas adaptation in general meaning represents the ability of network protocols and applications to observe and respond to the channel variations. ...
... The loss of one P frame can make some of other P and B frames undecodable, and the loss of one I frame can result in the loss of the whole GoP. This implies that I frames are more important than P frames, and P frames are more important than B frames [8][9]. A GoP structure is expressed as ( N , M ), where M corresponds to the number of P frames in a GoP and M corresponds to the number of B frames between I and P frames. ...
Article
Full-text available
Wireless video transmission suffers errors from the dynamic wireless environment. Due to errors, the discarded link layer packets impose a serious limitation on the maximum achievable throughput over wireless channel. To face this challenge and to enhance the overall TCP-Friendly video throughput, this paper proposes an MPEG packet loss model which is based on Forward Error Correction (FEC) over wired-to-wireless channel. Within this model, a FEC packet level scheme is used to act as an inter-protection control, which is based on Reed-Solomon (RS) code with the aim of bringing a robust transmission against frequent packet loss. Further, a BPSK scheme is applied for an Additive White Gaussian Noise (AWGN) wireless channel. By using this model, the predicted frame rate for MPEG video streaming can be estimated. Quality of Service (QoS) in terms of frame rate and the quality factor are evaluated for the predicted quantizer scale. The numerical results demonstrate that the proposed scheme improves the QoS of video transmissions in the presence of high wireless channel bit errors. Keywords— Wireless video, Video quality, TCP-Friendly, Quality of Service (QoS), Forward Error Correction (FEC), Reed-Solomon (RS) code.
... To improve the video quality over wireless networks at high loss rates, there are many analytical approaches which can be pursued such as adaptive rate control [4], passive error recovery (retransmission) [3], frame-interleaving, [5], error-concealment [6], adaptive modulation [7], forward-error-correction (FEC) at packet-level [8][9][10] and/or channel bit-level [11]. Effectively, FEC adds redundancy codes to the original information via either convolutional codes, like RCPC [1]) or block codes [1][2][3][4][5], like CRC, RS, and BCH codes. ...
... The total effective playable frame rate (PFR) can be derived as in [8], ...
... As a result, a QoS in terms of temporal scalability (frame per sec) at the client has been improved when a maximum channel coding throughput is achieved. The results demonstrate that a proposed scheme introduces a good predicted video quality at high channel bit-errors under various error-correction conditions as compared to other models [8][9][10] ...
Article
Full-text available
In this paper, new analysis and performance of robust error-model are presented for MPEG-4 video stream over wireless point-to-point network. Analytical expressions assume a noisy wireless environment causing frequent and random bit errors associated with packets. By this model, the temporal video scalability can be evaluated under TCP-Friendly Rate Control (TFRC) transmission when the Bose-Chaudhuri-Hochquenghem (BCH) channel coding is employed as a forward-error-correction (FEC) at a radio link layer. A FEC provides an efficient throughput access on wireless network. The numerical results clearly indicate that a quality of service (QoS) can be improved at low channel SNR region when the maximum channel coding throughput is achieved.
... I frames contain the bulk of the audio and video data and are larger than P frames, which in turn are larger than B frames. When a video sequence is compressed, a typical MPEG encoder uses a (Wu et al, 2005). Figure 1 shows the video multicast system considered in this paper for both wired and wireless clients. ...
... Using a TCP-adaptive FEC scenario, the predicted optimal playable frame rate (PFR) based on maximum effective TCP throughput in (12) can be evaluated as (Wu et al, 2005; AL- ...
Chapter
Full-text available
In this article, we present a simple cross-layer model that leads to the optimal throughput of multiple users for multicasting MPEG-4 video over a heterogeneous network. For heterogeneous wired-to-wireless network, at the last wireless hop there are bit errors associated with the link-layer packets that are arising in the wireless channel, in addition of overflow packet dropping over wired links. We employ a heuristic TCP function to optimize the cross-layer model of data link and physical (radio-link) layer. An adaptive Forward-Error-Correction (FEC) scheme is applied at the byte-level as well as at the packet-level. The corresponding optimal video quality can be evaluated at each client end. The results show that a server can significantly adapt to the bandwidth and FEC codes to maximize the video quality of service (QoS) in terms of temporal scaling when a maximum network throughput for each client is reached.
... Since MPEG video coding involves interframe dependencies to achieve high compression rates, the random dropping of packets by routers and/or random bit errors over a noisy wireless networks can both seriously degrade video quality. In wired Internet [Wu 2005], the dropping packets from an independently encoded I frame causes the following dependent P and B frames to be fully undecodable. In practice, interframe dependencies have been shown to cause a 3% packet loss rate to result in a 30 % frame loss rate. ...
... Since MPEG video coding involves interframe dependencies to achieve high compression rates, the random dropping of packets by routers and/or random bit errors over a noisy wireless networks can both seriously degrade video quality. In wired Internet [Wu 2005], the dropping packets from an independently encoded I frame causes the following dependent P and B frames to be fully undecodable. In practice, interframe dependencies have been shown to cause a 3% packet loss rate to result in a 30 % frame loss rate. ...
Article
Full-text available
Wireless video transmission is often vulnerable to errors from the noisy wireless environment. Due to errors, the discarded link layer packets impose a serious limitation on the maximum achievable throughput over wireless channel. To face this challenge and to enhance the overall TCP-Friendly video throughput, this paper proposes an MPEG packet loss model which is based on Forward Error Correction (FEC) over wired-to-wireless channel. A FEC packet level scheme is used to act as an inter-protection control based on Reed-Solomon (RS) code providing a robust transmission against random packet loss. Hence, the predicted frame rate for MPEG video streaming can be estimated. Quality of service (QoS) in terms of frame rate and quality factor (Quantizer Scale) is also evaluated under various FEC code conditions. The new results demonstrate that the proposed scheme improves a video quality performance in high wireless channel bit errors in spite of an increment in the overhead packets due to FEC code.
Article
Full-text available
Wireless Touch Networks (WTN) have become increasingly important with the emergence of the Internet of Things (IoT) and are regarded as a class of self-organizing networks. This article presents an overview of the construction principles, routing protocols, quality of service parameters, traffic models, and characteristics of WTN. The article also explores the application of dynamic routing protocols for constructing a self-organizing network of autonomous IoT systems. Known dynamic routing protocols for mobile radio networks are reviewed and the advantages and disadvantages of proactive and reactive approaches are discussed.
Chapter
Full-text available
All-IP networks are under development with multimedia services in mind. Video multicast is an efficient way to deliver one video simultaneously to many users over such heterogeneous wired-to-wireless networks, such as in wireless IP applications where a mobile terminal communicates with an IP server through a wired IP network in tandem with a wireless network. Unicast video streaming is also an attractive way to deliver time-shifted TV to mobile devices. This Chapter presents a simple cross-layer model that leads to the optimal throughput to multiple users for multicasting video over a heterogeneous network. An adaptive forward-error-correction scheme is applied at the byte-level as well as at the packet-level to reduce channel errors. The results show that a server can significantly adapt to the bandwidth and FEC codes to maximize the video quality of service. For unicast streaming, the Chapter presents a single negative acknowledgment scheme in which a video stream is transmitted over a heterogeneous network from a streaming server to a mobile device in a WiMAX network. The broadband streaming system is compared to several candidate solutions based on originally wired network congestion controllers. Multi-connection streaming is also investigated.
Conference Paper
With the development of computer network technology and multimedia technology, real-time multimedia transmission will become increasingly important. Due to the current network bandwidth, multimedia data compression technology, and other restrictions, such as distance learning, audio and video conferencing systems, remote medical diagnosis is now basically have not been fully realized. In this paper, a scheme using Forward Error Correction(FEC) and Quality of Service(Qos) is proposed to improving the Packet Loss in realtime video transmission. On one hand, the scheme uses improved Vandermonde matrices to avoid the issue that the erasure codes built by classical Vandermonde matrices may not be RS codes. On other hand, this paper design a Qos in receiving end to solve problems of disordered packets and repeated packets. The simulations indicate that the packet loss ratio and transmission efficiency of the proposed scheme are superior to conventional method.
Chapter
Full-text available
All-IP networks are under development with multimedia services in mind. Video multicast is an efficient way to deliver one video simultaneously to many users over such heterogeneous wired-to-wireless networks, such as in wireless IP applications where a mobile terminal communicates with an IP server through a wired IP network in tandem with a wireless network. Unicast video streaming is also an attractive way to deliver time-shifted TV to mobile devices. This Chapter presents a simple cross-layer model that leads to the optimal throughput to multiple users for multicasting video over a heterogeneous network. An adaptive forward-error-correction scheme is applied at the byte-level as well as at the packet-level to reduce channel errors. The results show that a server can significantly adapt to the bandwidth and FEC codes to maximize the video quality of service. For unicast streaming, the Chapter presents a single negative acknowledgment scheme in which a video stream is transmitted over a heterogeneous network from a streaming server to a mobile device in a WiMAX network. The broadband streaming system is compared to several candidate solutions based on originally wired network congestion controllers. Multi-connection streaming is also investigated.
Article
Full-text available
We present a general analytical model for predicting the re-constructed frame rate of an MPEG stream. Our model captures the temporal relationships between I-, P, and B-frames but is independent of the channel and media char-acteristics. We derive an adaptive FEC scheme from the general model and verify it by comparing it to the results of a simulation. The prediction error of the model compared to the simulation for a wide array of parameter values is less than 5%. We then use the derived adaptive FEC scheme to study the optimal rate allocation (i.e., between generat-ing a higher frame rate or increasing the protection for a lower frame rate) when equation-based TCP rate control is used to couple packet rates to channel characteristics such as round trip time and packet loss probabilities. Surprisingly, we find that optimal protection levels for I-and P-frames are relatively static as loss rates increase from 1% to 4% while changes in the frame type pattern are used to ameliorate the effects of the increased loss. The study demonstrates how our model can be used to reveal joint source/channel coding tradeoffs and how they relate to encoding and transmission parameters.
Article
This paper describes a control scheme for the multi-encoder system that transmits multiple video signals under the unified control of each encoder. First, the rate-distortion characteristics of the MC+DCT (MPEG) encoder are analyzed. We show that the rate-distortion characteristics can be represented by a simple power function. Based on this function, a bit-rate distribution rule for multiple encoders and a method of applying this to moving pictures is proposed. A computer simulation using real moving pictures shows that the proposed scheme reduces the total distortion of the system as well as probability of severe distortion which occurs in the case of a 'difficult' picture, as compared with the non bit-rate distribution method.
Article
In this paper we develop a simple analytic characterization of the steady state throughput, as a func-tion of loss rate and round trip time for a bulk transfer TCP flow, i. e., a flow with an unlimited amount of data to send. Unlike the models in [6, 7, 10], our model captures not only the behavior of TCP's fast retransmit mechanism (which is also considered in [6, 7, 10]) but also the effect of TCP's timeout mech-anism on throughput. Our measurements suggest that this latter behavior is important from a modeling perspective, as almost all of our TCP traces contained more timeout events than fast retransmit events. Our measurements demonstrate that our model is able to more accurately predict TCP throughput and is accurate over a wider range of loss rates.
Article
The growth in power and connectivity of today' s PCs promises a continued increase in the growth of streaming media over the Internet. Hand-in-hand with the increase in streaming media comes the impending threat of unrespon- sive UDP traffic, often cited as the major threat to the sta- bility of the Internet. The responsiveness of commercial streaming media applications, such as RealNetworks' Re- alPlayer, will play an important role in the network impact of streaming media. Unfortunately, there are few empirical studies that analyze the responsiveness, or lack of it, of com- mercial streaming media. In this work, we measure the re- sponsiveness of RealVideo over UDP by measuring the per- formance of numerous streaming video clips selected from a variety of RealServers on the Internet. By varying the bot- tleneck bandwidth to the player, we are able to analyze the TCP-Friendliness of RealVideo over UDP and correlate the results with network and application layer statistics. We find that most streaming RealVideo clips are not bandwidth con- strained for typical broadband connections. In times of con- gestion, most RealVideo UDP streams respond to Internet congestion by reducing the application layer encoding rate, and streams with a minimum encoding rate less than the fair bandwidth share usually achieve a TCP-Friendly rate. In addition, our analysis suggests that a reason streaming ap- plications choose not to use TCP is that the TCP API hides network information, such as loss rate and round-trip time, making it difficult to estimate the available bandwidth for effective media scaling.
Conference Paper
In this paper we develop a simple analytic characterization of the steady state throughput, as a function of loss rate and round trip time for a bulk transfer TCP flow, i.e., a flow with an unlimited amount of data to send. Unlike the models in [6, 7, 10], our model captures not only the behavior of TCP's fast retransmit mechanism (which is also considered in [6, 7, 10]) but also the effect of TCP's timeout mechanism on throughput. Our measurements suggest that this latter behavior is important from a modeling perspective, as almost all of our TCP traces contained more time-out events than fast retransmit events. Our measurements demonstrate that our model is able to more accurately predict TCP throughput and is accurate over a wider range of loss rates.