Conference PaperPDF Available

A pragmatic approach for service provisioning based on a small set of per-hop behaviors

Authors:

Abstract

In this paper we describe the implementation of a network providing advanced services such as a Premium service that aims at providing low loss, low delay, and low delay jitter and an Olympic service that allows for a service differentiation in terms of delay within three additional classes. Our implementation of this network is based on the Differentiated Services Architecture, which is the most recent approach of the Internet Engineering Task Force towards quality of service. Access to service classes is controlled by a bandwidth broker, which can perform traffic engineering by means of multiprotocol label switching. The Premium service is implemented as expedited forwarding and the Olympic service as a group of assured forwarding per-hop-behavior. We present a thorough evaluation of the proposed services implemented by the careful assignment of micro-flows to a small set of per-hop behaviors.
1
A Pragmatic Approach for Service Provisioning
Based on a Small Set of Per-Hop Behaviors
Volker Sander
Central Institute for Applied Mathematics,
Forschungszentrum J¨
ulich GmbH, 52425 J¨
ulich, Germany
Email: sander@fz-juelich.de
Markus Fidler
Department of Computer Science, Aachen University
Ahornstr. 55, 52074 Aachen, Germany
Email: fidler@informatik.rwth-aachen.de
Abstract In this paper we describe the implementation of a
network providing advanced services such as a Premium service
that aims at providing low loss, low delay, and low delay jitter
and an Olympic service that allows for a service differentiation
in terms of delay within three additional classes. Our implemen-
tation of this network is based on the Differentiated Services Ar-
chitecture, which is the most recent approach of the Internet En-
gineering Task Force towards Quality of Service. Access to service
classes is controlled by a Bandwidth Broker, which can perform
Traffic Engineering by means of Multiprotocol Label Switching.
The Premium service is implemented as Expedited Forwarding
and the Olympic service as a group of Assured Forwarding Per-
Hop-Behavior. We present a thorough evaluation of the proposed
services implemented by the careful assignment of micro-flows to
a small set of Per-Hop Behaviors.
I. INTRODUCTION
Emerging Grid applications consist of complex mixes of
flows, with a variety of data rates and widely differing latency
requirements. Applications with these characteristics arise in
areas as remote visualization, analysis of scientific databases,
and teleimmersion. Table I lists the various flows of a future
teleimmersion application together with their networking de-
mand [6]. Characteristics such as these place substantial de-
mands on networks which cannot be fulfilled by today’s Best-
Effort (BE) Internet.
TABLE I
FLOWS AND REQUIRE ME NT S OF TELEIMMERSION APPLIC ATIONS
Latency Bandwidth Reliability Dynamic QoS
Control <30 ms 64 Kb/s Yes Low
Text <100 ms 64 Kb/s Yes Low
Audio <30 ms 128 Kb/s No Medium
Video <100 ms 5 Mb/s No Medium
Tracking <10 ms 128 Kb/s No Medium
Database <100 ms >1 Gb/s Yes High
Simulation <30 ms >1 Gb/s Mixed High
Haptic <10 ms >1 Mb/s Mixed High
Rendering <30 ms >1 Gb/s No Medium
The Differentiated Services (DS) [2] framework defines an
architecture for implementing scalable service differentiation
in the existing Internet by an aggregation of flows to a small
number of different traffic classes. DS can be complemented
by Traffic Engineering (TE), which is of special interest, if not
This work was supported in part by the Path Allocation in Backbone networks
(PAB) project funded by the German Research Network (DFN) and the Federal
Ministry of Education and Research (BMBF).
only a relative differentiation between services shall be imple-
mented, but, if absolute service guarantees need to be given.
We apply Multi-Protocol Label Switching (MPLS) [13] for this
purpose. MPLS is based on a functional decomposition of the
network layer into a control component and a forwarding com-
ponent. This distinction gives a number of options for the im-
plementation of the control component. In [9], [15] we pre-
sented the General-purpose Architecture for Reservation and
Allocation (GARA), which implements an advance reservation
framework using heterogeneous resource ensembles. GARA
includes a DS reservation manager, which can be used as a
Bandwidth Broker (BB) for an automated DS network manage-
ment and, which allows to apply DS TE by means of MPLS.
In this paper, we describe an implementation of a Premium
service based on the Expedited Forwarding (EF) Per-Hop Be-
havior (PHB) [5] and an Olympic service based on the Assured
Forwarding (AF) PHB group [10]. We perform a careful eval-
uation of the services by applying both Transport Control Pro-
tocol (TCP) and User Datagram Protocol (UDP) flows, and ad-
dress the question about the impact on the achievable service
when these heterogeneous applications share a single PHB. The
remainder of the paper is organized as follows: Sections II de-
scribes the DS architecure in detail. In section III we show re-
sults obtained from measurements in the implemented network.
Section IV concludes the paper.
II. DIFFERENTIATED SERVICES ARCHITECTURE
The DS architecture [2] addresses the scalability problems
of Integrated Services by defining the behavior of aggregates.
Packets are identified by simple markings that indicate accord-
ing to which aggregate behavior they should be treated. In the
core of the network, routers need not determine to which flow a
packet belongs, only which aggregate behavior should be used.
Edge routers mark packets and indicate whether they are within
profile or if they are out of profile, in which case they might
even be discarded by a dropper at the edge router. A particular
marking on a packet indicates a PHB that has to be applied for
forwarding of the packet. Currently, the EF PHB [5] and the AF
PHB group [10] are specified. Though the architecture allows
for the definition of additional PHBs, by setting the 6-bit Dif-
ferentiated Services Code Point (DSCP), end-to-end guarantees
require the support of a certain PHB in all pertaining domains
thus making an end-to-end deployment with only a few PHBs
more feasible.
2
The EF PHB is intended for building a service that offers low
loss, low delay, and low delay jitter, namely a Premium service.
The specification of the EF PHB was recently redefined to al-
low for a more exact and quantifiable definition. Besides the
Premium service a so called Olympic service [10] is proposed
by the IETF to be based on the AF PHB group by extending
it by means of a class based over-provisioning. Three of the
four currently defined classes of the AF PHB group are used
for such an Olympic service. The service differentiation be-
tween the three classes Gold, Silver, and Bronze is proposed to
be performed by the means of admission control, i.e. assigning
only a light load to the Gold class, a medium load to the Silver
class and a high load to the Bronze class.
The Differentiated Services architecture pushes complexity
to the edges of the network: packets are marked to belong to
an aggregate behavior either by applications or by edge routers.
If edge routers mark packets, which is the more general solu-
tion, they may choose to do so on a per-flow basis or on any
other criteria. In this scenario, of course, the question arises
which packets will get marked. This is especially the case when
the environment is dynamic, i.e. when varying flows should be
able to use the available services. Here, a particular resource
manager called a Bandwidth Broker (BB) comes into place. A
BB is a middleware service which controls and facilitates the
dynamic access to network services of a particular administra-
tive domain. BBs are also viewed as the Policy Decision Point
(PDP) of the controlled domain. The concept of a BB is typ-
ically associated with the Differentiated Services architecture.
In this context, the task of a BB is to control the configuration of
the edge routers of a single DS domain. By performing a care-
ful admission control BBs are a fundamental building block for
the provision of network services on top of DS aggregates. The
GARA research prototype is a BB which addresses these issues.
This paper presents a thorough evaluation of the achievable ser-
vice classes provided by the admission control of a BB in a DS
environment which does not rely on the assumption of a broad
variance of aggregates. Instead, heterogenous aggregates con-
sisting of elastic and non-elastic flows are explicitly addressed.
III. EXPERIMENTAL STUDIES
We report on experiments designed to examine a DS imple-
mentation based on commodity products. In the following sub-
sections we first give the experimental configuration and then
address the implementation and evaluation of the different traf-
fic classes. We show problems observed when using the BE
service and address these with similar measurements using the
Olympic or the Premium service.
A. Experimental Configuration
Our experimental configuration comprises a laboratory
testbed at the Research Centre J¨
ulich, donated by Cisco Sys-
tems. The testbed allows controlled experimentation with basic
DS mechanisms. Four Cisco Systems 7200 series routers were
used for all experiments. These are either connected by OC3
ATM connections, by Fast Ethernet, or by Gigabit Ethernet
connections. End-system computers are connected to routers
by switched Fast Ethernet connections. Hence, the minimum
MTU size of our testbed is that of the end-systems: 1500 B. To
create a point of congestion, we configured an ATM Permanent
Virtual Circuit (PVC) between an ingress and an interior router
to 60 Mb/s.
We performed several experiments demonstrating the perfor-
mance of high-end TCP applications [15] like a Guaranteed
Rate (GR) file transfer with deadline and typical UDP appli-
cations like video streaming [8] or videoconferencing. The fol-
lowing tools have been used for traffic generation:
gen send/gen recv BE UDP traffic generator. This traf-
fic generator was applied to generate BE UDP traffic with
a mean rate of 50 Mb/s and different burst charateristics.
These UDP flows do not aim to model any specific appli-
cation, but we assume that the applied burst characteristics
reflect effects that occur in today’s and in the future Inter-
net. TCP streams are initially bursty, UDP based real-time
applications are emerging, which create bursts, for exam-
ple by intra-coded frames in a video sequence. Further on
burst sizes increase in the network, due to aggregation and
multiplexing [4].
rude/crude Delay-sensitive UDP traffic generator. This
traffic generator allows to measure the one-way delay and
delay jitter. In our experiments we used real-time traf-
fic patterns from script files, which we created from pub-
licly available video traces [8]. We applied IP fragmenta-
tion for the transmission of frames that exceed the MTU,
which we consider as being allowed here, since we config-
ured the DS classes to prevent from dropping fragments.
The sequence, which we applied for the experimental re-
sults shown in this paper, is a television news sequence
produced by the ARD. The sequence is MPEG-4 encoded
with a minimum frame size of 123 B, a maximum frame
size of 17.055 KB, a mean rate of 0.722 Mb/s and a peak
rate of 3.411 Mb/s. The Hurst parameter is about 0.5 and
decays with an increasing aggregation level. Figure 1 il-
lustrates a part of the traffic profile of the sequence.
ttcp TCP stream generator. We used the widely known
TCP benchmark ttcp to generate TCP load. In the experi-
ments reported on in this paper we selected an end-system
which was not capable of generating a rate of more than
1.8 MB/s and if not stated otherwise we applied a socket
buffer corresponding to a maximum window size of about
15 MTU.
B. Implementation and Evaluation of the Best Effort Service
Applying the plain BE service to the video test application
used throughout this paper we generate the baseline for our
evaluation. Our configuration allocates the remaining capac-
ity of the ATM bottleneck link, which is not used by any other
class, to the BE class. In the following experiments no other
class than BE is used, resulting in an assignment of 60 Mb/s
of the bottleneck ATM link to the BE class. The tx-ring-limit
parameter on the ATM interface card that specifies the queue
size, which is assigned to the applied ATM PVC, was set to
16 particles each of 512 B allowing to store upto four MTU on
the ATM interface. This value is by far smaller than the de-
fault value, but it has to be applied to allow for an efficient QoS
3
implementation [7]. The BE layer 3 queue was configured to
hold at most 256 packets. We consider this queue size, which
is a trade off between delay and loss rate, as being feasible for
BE TCP traffic, which is rather sensitiv to packet drops than to
queuing delay in a range of a few tens of milliseconds.
In figure 2 the delay measured when transmitting the news
sequence in the BE class is shown. Congestion is generated by
applying an UDP stream with two bursts, each of ten seconds
duration. As can be seen from figure 2, the delay is bounded
to about 42 ms, showing some minor effects on the measure-
ments due to tail-drop in the router. The delay corresponds to
an effective data rate on the ATM interface of about 48 Mb/s
after subtracting the ATM induced overhead. While this delay
is acceptable for streaming video applications, it can be critical
for real-time video applications like video conferencing.
0
1000
2000
3000
4000
5000
6000
7000
0 10 20 30 40 50 60
Rate (Kb/s)
Receive Time (s)
Fig. 1. Data Rate UDP News Sequence.
0
0.005
0.01
0.015
0.02
0.025
0.03
0.035
0.04
0.045
0 10 20 30 40 50 60
Delay (s)
Receive Time (s)
Fig. 2. Delay BE UDP News Sequence.
Taking the negative effects of the BE class a not guaranteed
transmission rate, possible packet loss and a chance of high de-
lay and delay jitter we argue that certain applications exist,
like a file transfer with deadline or videoconferencing, which
require services that are better than BE.
C. Implementation and Evaluation of an Olympic Service
A Weighted Fair Queuing (WFQ) environment is used for the
implementation of the Olympic service based on three AF PHB
classes. Within these classes GARA is capable of managing
the allocated resources and the relative load in order to allow
for a service differentiation in terms of delay. The Olympic ser-
vice [10] proposed by the IETF is realised by admission con-
trol and a class based over-provisioning. We carried out ex-
periments with the transmission of the news sequence in each
of the Olympic classes, with the classes configured according
to Table II. Within each of the Olympic classes a differentia-
TABLE II
CORE CONFIGU RATION O F TH E OLYMPIC CLASSE S
Class Percent Gross Capacity Net Capacity Over-Provision
Bronze 5 % 3 Mb/s 2.4 Mb/s 1×
Silver 10 % 6 Mb/s 4.8 Mb/s 2×
Gold 15 % 9 Mb/s 7.2 Mb/s 3×
tion of the drop probability for differently marked excess traf-
fic can be performed by applying Multiple Random Early De-
tection (M-RED). Nevertheless, we consider excess traffic in
an over-provisioned class as harmful for the BE class. There-
fore we mark the conforming traffic and drop excess traffic in
the over-provisioned classes. The layer 3 queue size of each
of the three Olympic classes was configured to 128 packets in
the WFQ environment. Consequently, the ingress meter and
marker are based on a token bucket with a confirmed informa-
tion rate of 2.4 Mbit/s for all Olympic classes, which leads to
the over-provisioning factors given in Table II. A confirmed
burst size of 32 MTU is used at the ingress. This value is inten-
tionally smaller than the queue size that is applied in the core, to
avoid packet drops in the Olympic classes within the network,
to avoid a high utilization of the queuing space, and thus to re-
duce queuing delays. Besides it has to be noted that the WFQ
queue size is configured in packets, which can be smaller than
the MTU, whereas the confirmed burst size that is used by the
meter and marker is configured in bytes.
Figure 3 shows the measured delay for the news sequence
in the Bronze Class and the impacts of congestion in the BE
class on the Bronze class. Compared to the transmission of the
sequence within the BE class, which is shown in Figure 2, the
delay is reduced significantly. Furthermore, packet drops did
not occur in the Bronze class. Thereby AF based services can
be applied as GR service without packet loss for conforming
traffic. The delay and delay jitter differentiation, which can
be achieved in addition by the Olympic service, is shown in
Figure 4 and 5 for the Silver and the Gold class respectively,
compared to the Bronze class in Figure 3.
Additionally, we present experiments with TCP in the Bronze
class and demonstrate how TCP can be configured in a GR en-
vironment to achieve the desired throughput. We show that, if
the pertaining class is configured properly, packet drops do not
occur, which prevents from halving the TCP congestion win-
dow. The data rate instead corresponds to the capacity allo-
cated for the flow. To avoid effects on the RTT by upstream
congestion, the acknowledgements are also transmitted in the
Bronze class. The maximum window size is in our experiments
controlled by setting the socket buffer size. The resulting RTT
can be computed according to the bandwidth-delay product:
W=R·RT T , with Wdenoting the maximum window size
and Rdenoting the configured GR capacity. The RTT adjusts
to the available or configured capacity and to the configured
maximum window size.
4
0
0.005
0.01
0.015
0.02
0.025
0.03
0 10 20 30 40 50 60
Delay (s)
Receive Time (s)
Fig. 3. Delay Bronze UDP News Sequence.
0
0.005
0.01
0.015
0.02
0.025
0.03
0 10 20 30 40 50 60
Delay (s)
Receive Time (s)
Fig. 4. Delay Silver UDP News Sequence.
0
0.005
0.01
0.015
0.02
0.025
0.03
0 10 20 30 40 50 60
Delay (s)
Receive Time (s)
Fig. 5. Delay Gold UDP News Sequence.
0
0.005
0.01
0.015
0.02
0.025
0.03
0 10 20 30 40 50 60
Delay (s)
Receive Time (s)
Fig. 6. Delay Premium UDP News Sequence.
For these experiments we configured the Bronze class to
25 % of the bottleneck link capacity, corresponding to a net data
rate of about 1.6 MB/s. Figure 7 shows the RTT for a configured
socket buffer of 15 MTU. Congestion in the BE class starts af-
ter 10 s and leads to an increase in the RTT, which corresponds
to the queuing delay added by queuing the data of a complete
TCP window. Figure 8 shows the corresponding throughput.
At the beginning the application limits the data rate to about
1.8 MB/s and after the BE downstream congestion started, the
limitation is given by the configured capacity for the Bronze
class at about 1.6 MB/s and from the TCP point of view leads
to a limitation of the sending rate by the offered window. The
same effect on the throughput can be observed, if the maximum
window is increased by configuring a socket buffer of 32 MTU.
Figure 9 and figure 10 show the resulting RTT and throughput
for this configuration. Again the RTT by increased queuing de-
lay is adjusted to the available capacity and the window size,
being about twice as high as in the previous experiment.
From these TCP experiments it can be seen that during peri-
ods of BE congestion WFQ acts as an aggregate traffic shaper
with a rate corresponding to the configured WFQ weight. The
achieved TCP throughput is independent of the TCP window
size, as shown in Figure 8 and 10.
D. Implementation and Evaluation of a Premium Service
In a first experiment the Premium service was implemented
based on EF using Priority Queuing (PQ). The ingress router
was configured to apply a meter and marker with a confirmed
information rate of 4.8 Mb/s and a burst size of 32 MTU. Ex-
cess traffic is dropped. The parameters that were applied at the
ingress router were reflected by the core configuration. The PQ
scheduler was bound to 10 % of the bottleneck link capacity,
corresponding to about 4.8 Mb/s. Bursts of up to 48 KB are per-
mitted in the core. Figure 6 shows the results of a transmission
of the news sequence. A reduction of the transmission delay
and delay jitter especially for big video frames, which lead to
packet bursts, becomes obvious for PQ compared to the WFQ
settings in Table II. Here the tx-ring-limit parameter, which is
used to configure the outgoing non-preemptive interface queu-
ing capacity, is of major importance [7].
The following series of experiments were applied to an im-
plementation of the EF PHB with the goal to analyze the be-
havior of a heterogeneous aggregate. It uses WFQ to emulate
strict PQ by provisioning 99 % of the available capacity to the
EF aggregate. This is to ensure that the queue of the EF ag-
gregate caused by possible bursts is minimized to reduce the
queuing delay. Note that the BB prototype GARA performs a
careful admission control and is thus preventing the starvation
of the BE traffic. The particular challenge is caused by apply-
ing elastic and non-elastic traffic in a single EF aggregate. The
setup consisted of following three flows, which passed an ATM
bottleneck link of 100 Mb/s capacity:
The first flow entering the testbed was a delay-sensitive
Premium UDP flow. It ran from the beginning to the end
of the measurement. GARA acted as a BB to associate
the flow to the EF PHB. The related UDP traffic generator
was configured to achieve a rate of 40 Mpbs by constantly
5
0
5
10
15
20
25
30
0 5 10 15 20 25 30 35 40
RTT (ms)
Time (s)
Fig. 7. TCP Round-Trip-Time Bronze 15 MTU Socket Buffer
0
2000
4000
6000
8000
10000
12000
14000
16000
18000
0 5 10 15 20 25 30 35 40
Link Data Rate (Kb/s)
Time (s)
Fig. 8. TCP Throughput Bronze 15 MTU Socket Buffer
0
5
10
15
20
25
30
0 5 10 15 20 25 30 35 40
RTT (ms)
Time (s)
Fig. 9. TCP Round-Trip-Time Bronze 32 MTU Socket Buffer
0
2000
4000
6000
8000
10000
12000
14000
16000
18000
0 5 10 15 20 25 30 35 40
Link Data Rate (Kb/s)
Time (s)
Fig. 10. TCP Throughput Bronze 32 MTU Socket Buffer
submitting 1 KB packets every 0.2 milliseconds. The re-
ceiver continuously reported the delay calculated from the
time-stamps in the packets.
The second flow in the experiment was a GR TCP flow
which was roughly active in the time interval between 20
and 60 seconds. Emulating a distributed supercomputing
application, we created a bursty TCP stream which was
injecting data in chunks of 256 KB into the network, us-
ing the EF PHB. Every 8th message contained two chunks,
i.e. 512 KB. The average rate of the flow was 16 Mb/s. Us-
ing GARA, we claimed a slightly higher guaranteed band-
width reservation, allowing bursts of up to one full chunk.
The third flow started during the experiment was a BE
UDP flow which was roughly active in the time interval
between 40 and 80 seconds. Our main intention was to
create a heavy congestion by submitting 750 byte packets
at a frequency of 10000 Hz, to achieve a rate of 60 Mb/s.
To demonstrate the impact of a single Premium flow un-
der congestion, the competing UDP flow was still active
after the TCP flow ends. The BE flow thus consumed a
significant amount of the available capacity.
Figure 11 shows that the selected single-aggregate imple-
mentation is not appropriate for providing delay-sensitive ser-
vices when bursty TCP flows use the same aggregate in paral-
lel. If a burst introduced by the TCP flow exceeds the available
output link capacity, packets get queued on the IP-layer queue.
Because packets of the Premium UDP flow are also queued,
the delay variation increases significantly. We can easily re-
validate the result illustrated in Figure 11 by some simple cal-
culations. The Premium service is used by a UDP application
which is transmitting data at a rate of 40 Mb/s. This application
shares the aggregate with a TCP flow which is injecting bursts
of 256 KB at the link speed of its Fast Ethernet interface, i.e.
at a rate of 100 Mb/s. Hence, the 256 KB of data enter the EF
aggregate within 20 milliseconds. The total amount of data en-
tering the EF aggregate in this time interval is thus 356 KB. As-
suming an ATM overhead of 20 %, the EF aggregate is served at
a rate of 80 Mb/s. We thus know that 200 KB of EF data leave
the router during this 20 ms interval. Consequently, at the end
of the interval there exists an EF queue of 156 KB that leads to
an upper delay boundary of 15 ms.
In order to inject a traffic profile which is conforming to the
Service Level Agreement with the peered downstream domain,
the egress router of a DS domain might be enforced to shape
out the traffic of a whole aggregate. When this is applied to the
scenario illustrated above, the impact caused by the bursts of
the TCP stream might be amplified by the queuing introduced
by traffic shaping. Figure 12 illustrates the impact of traffic
shaping when it is performed for an aggregate. Traffic shap-
ing can be viewed as an additional constraint which limits the
EF capacity of the output link by shaping the rate to the given
traffic profile. Hence, EF packets get queued whenever TCP
bursts cause the shaper to become active. In the illustrated sce-
nario, the shaper became active whenever the TCP application
produced a burst of two data chunks every 2 s.
As traffic shaping over an aggregate has a negative impact on
the delay variation of a Premium flow, the EF implementation
proposed here uses a flow-based service differentiation on the
6
0
5
10
15
20
25
30
35
40
45
50
0 10 20 30 40 50 60 70 80 90
Type-P One-way Delay (ms)
Receive Time (s)
Fig. 11. No Shaping
0
5
10
15
20
25
30
35
40
45
50
0 10 20 30 40 50 60 70 80 90
Type-P One-way Delay (ms)
Receive Time (s)
Fig. 12. Aggregate Shaping
0
5
10
15
20
25
30
35
40
45
50
0 10 20 30 40 50 60 70 80 90
Type-P One-way Delay (ms)
Receive Time (s)
Fig. 13. Per-Flow Shaping
output interface of the egress router. In detail, GARA applied
an additional router internal packet marking mechanism, called
“qos-group” which facilitates an efficient packet classification.
The router configuration propagated by GARA extended the ba-
sic DSCP marking by also assigning the “qos-group” for the
related flow. This additional classification was then used to
update the configuration of the output interface of the ingress
router to shape out the related GR flow. Figure 13 demonstrates
the EF PHB implementation providing a Premium service and
a GR service using a single aggregate. The remaining impact
is caused by the device queues which are under the control of
the network administrator. Figure 13 also illustrates a side im-
pact of the implementation. The jitter of the delay-sensitive
flow sharing the aggregate with a bursty TCP flow is decreased
when congestion occurs. This result is caused by the fact that
the TCP packets are served by their assigned rate and the active
shaping configuration. As illustrated in Figure 8 and 10, WFQ
operates in periods of congestion as an additional shaper. In that
case, less TCP packets get served when they compete with the
smaller BE UDP packets. Hence, the interference of the larger
packets is reduced, which reduces the maximum experienced
delay.
IV. CONCLUSIONS AND FUTURE WORK
We have presented a quantitative evaluation of a DS imple-
mentation providing a Premium service and an Olympic service
automatically configured by GARA. Our evaluation addressed
the QoS demand of heterogeneous types of flows sharing a
single PHB. The experiments presented used commodity hard-
ware. This demonstrates that real application can actually use
DS, especially if access to services is automated by a BB such
as GARA. The pragmatic approach of limiting the assumptions
made about the underlying PHBs addresses potential deploy-
ment constraints and facilitates the negotiation of traffic trunk
encoding between peered domains.
Our future work will focus on larger scenarios with several
possible bottleneck links. These require complex TE mecha-
nisms and an advanced resource management, which we aim at
addressing with GARA and MPLS.
REFERENCES
[1] Allman, M., Paxson, V., and Stevens, W.: TCP Congestion Control. RFC
2581, (1997)
[2] Blake, S. et al.: An Architecture for Differentiated Services. RFC 2475,
(1998)
[3] Cardwell, N., Savage, S., and Anderson, T.: Modeling TCP Latency. Pro-
ceedings of IEEE Infocom, (2000)
[4] Charny, A., and Le Boudec, J.-Y.: Delay Bounds in a Network With Ag-
gregate Scheduling. Proceedings of QofIS, (2000)
[5] Davie, B. et al.: An Expedited Forwarding PHB. RFC 3246, (2002)
[6] DeFanti, T., and Stevens, R.: Teleimmersion. In Foster, I., and Kesselman,
C.: The Grid: Blueprint for a Future Computing Infrastructure. Morgan-
Kaufmann, (1998)
[7] Ferrari, T., Pau, G., and Raffaelli, C.: Priority Queuing Applied to Ex-
pedited Forwarding: A Measurement-Based Analysis. Proceedings of
QofIS, (2000)
[8] Fitzek, F., and Reisslein, M.: MPEG–4 and H.263 Video Traces for Net-
work Performance Evaluation. IEEE Network, (2001), 15(6):40-54
[9] Foster, I., Roy, A., Sander, V., and Winkler, L.: End-
to-End Quality of Service for High-End Applications.
http://www.mcs.anl.gov/qos/qos papers.htm, (1999)
[10] Heinanen, J. et al.: Assured Forwarding PHB Group”. RFC 2597, (1999)
[11] Mathis, M., Semke, J., Mahdavi, J., and Ott, T.: The Macroscopic Behav-
ior of the TCP Congestion Avoidance Algorithm. Proceedings of ACM
SIGCOMM, (1997)
[12] Padhye, J., Firoiu, V., Towsley, D., and Kurose, J.: Modeling TCP
Troughput: A simple model and its empirical validation. Proceedings of
ACM SIGCOMM, (1998)
[13] Rosen, E., Viswanathan, A., and Callon, R.: Multiprotocol Label Switch-
ing Architecture. RFC 3031, (2001)
[14] Sander, V., Adamson, W., Foster, I, and Roy, A: End-to-End Provision of
Policy Information for Network QoS. IEEE Symposium on High Perfor-
mance Distributed Computing, (2001)
[15] Sander, V., Foster, I., Roy, A., and Winkler, L.: A Differentiated Services
Implementation for High-Performance TCP Flows. Terena Networking
Conference, (2000)
[16] Wu, Q., and Williamson, C.: Improving Ensemble-TCP Performance on
Asymmetric Networks. Proceedings of MASCOTS, (2001)
... In addition, the EF PHB can be used to implement a Guaranteed Rate Service. However, if flows with different delay requirements and especially, if bursty flows share the EF PHB, traffic shaping of these bursts at the edge of the DS domain is likely to be required [25]. To avoid a starvation of other traffic classes, EF traffic has to be strictly policed at the ingress router of a DS domain and excess EF traffic has to be discarded [9]. ...
... An advantage of an AF based Guaranteed Rate Service is the isolation of the pertaining traffic from the EF based Premium Service. Doing so significantly reduces the interdependencies of the two aggregate classes, however this distinction is not necessarily required [25]. Since DS offers better than BE services, an admission control is required, which controls incoming traffic and which further on allows for billing. ...
... A Guaranteed Rate Service can be implemented in a DS network either by applying the EF PHB or the AF PHB [25]. Since we implement both PHBs in our testbed, either of these two can be chosen. ...
Conference Paper
Full-text available
Grid computing requires network services beyond what is currently provided by the Best-Effort Internet. Among the different approaches towards network Quality of Service, aggregate scheduling, which maps micro-flows to a small number of different service classes, offers sufficient scalability up to the size of the Internet. The Differentiated Services architecture of the Internet Engineering Task Force is such an approach. Recently further Best-Effort like services have been proposed that are based on aggregate scheduling, like the less than Best-Effort Scavenger Service and the Alternative Best-Effort Service. In this paper we introduce the existing aggregate based approaches to Quality of Service. We take a heterogeneous mix of Transmission Control Protocol and User Datagram Protocol flows into account and complement services by adding transport protocol specific elements, if appropriate. In particular we show multi-class applications that are designed to apply a Guaranteed Rate Service and a Scavenger Service in parallel. Doing so we can show a performance gain and achieve a more economical use of the available resources without impacting responsive Best-Effort flows.
... By incorporating this with the relaxed configuration of the policing function, TCPapplications can effectively paced to use their reserved bandwidth. Details of work we have done with this can be found in (Sander and Fidler, 2002). ...
Article
Many Grid applications, such as interactive and collaborative environments, can benefit from guarantees for resource performance or quality of service (QoS). Although QoS mechanisms have been developed for different types of resources, they are often difficult to use together because they have different semantics and interfaces. Moreover, many of them do not allow QoS requests to be made in advance of when they are needed. In this paper, we describe GARA, which is a modular and extensible QoS architecture that allows users to make advance reservations for different types of QoS. We also describe our implementation of network QoS in detail.
Article
Full-text available
This document was written during the process of clarification of RFC2598 "An Expedited Forwarding PHB" that led to the publication of revised specification of EF "An Expedited Forwarding PHB". Its primary motivation is providing additional explanation to the revised EF definition and its properties. The document also provides additional implementation examples and gives some guidance for computation of the numerical parameters of the new definition for several well known schedulers and router architectures.
Conference Paper
Full-text available
A large number of products implementing aggregate buffering and scheduling mechanisms have been developed and deployed, and still more are under development. With the rapid increase in the demand for reliable end-to-end QoS solutions, it becomes increasingly important to understand the implications of aggregate scheduling on the resulting QoS capabilities. This paper studies the bounds on the worst case delay in a network implementing aggregate scheduling. We derive an upper bound on the queuing delay as a function of priority traffic utilization and the maximum hop count of any flow, and the shaping parameters at the network ingress. Our bound explodes at a certain utilization level which is a function of the hop count. We show that for a general network configuration and larger utilization utilization an upper bound on delay, if it exists, must be a function of the number of nodes and/or the number of flows in the network.
Article
In this paper we develop a simple analytic characterization of the steady state throughput, as a func-tion of loss rate and round trip time for a bulk transfer TCP flow, i. e., a flow with an unlimited amount of data to send. Unlike the models in [6, 7, 10], our model captures not only the behavior of TCP's fast retransmit mechanism (which is also considered in [6, 7, 10]) but also the effect of TCP's timeout mech-anism on throughput. Our measurements suggest that this latter behavior is important from a modeling perspective, as almost all of our TCP traces contained more timeout events than fast retransmit events. Our measurements demonstrate that our model is able to more accurately predict TCP throughput and is accurate over a wider range of loss rates.
Article
This document defines TCP's four intertwined congestion control algorithms: slow start, congestion avoidance, fast retransmit, and fast recovery. In addition, the document specifies how TCP should begin transmission after a relatively long idle period, as well as discussing various acknowledgment generation methods.
Article
This document defines a general use Differentiated Services (DS) [Blake] Per-Hop-Behavior (PHB) Group called Assured Forwarding (AF). The AF PHB group provides delivery of IP packets in four independently forwarded AF classes. Within each AF class, an IP packet can be assigned one of three different levels of drop precedence. A DS node does not reorder IP packets of the same microflow if they belong to the same AF class.
Article
The IETF’s recent differentiated services (DS) architecture, which specifies a scalable mechanism for treating packets differently, offers new opportunities for building end-to-end quality of service (QoS) systems. However, it also introduces new challenges. In particular, it is not clear whether TCP’s flow and congestion control mechanisms work well with the mechanisms used for end-to-end QoS. For that reason it is essential to analyze whether the existing DS mechanisms can be used with standard TCP implementations or whether it is necessary to wait for upcoming features introduced in future modified versions of TCP. The general-purpose architecture for reservation and allocation (GARA) supports flow-specific QoS specification, immediate and advance reservation, and online monitoring and control of both individual resources and heterogeneous resource ensembles. Using GARA, we evaluated actual DS mechanisms provided by Cisco routers. We present the results of this evaluation and discuss their impact on the performance of popular TCP implementations.