Conference PaperPDF Available

A pragmatic approach for service provisioning based on a small set of per-hop behaviors

November 2002

November 2002

DOI:10.1109/ICCCN.2002.1043061

Source
IEEE Xplore

Conference: Computer Communications and Networks, 2002. Proceedings. Eleventh International Conference on

Authors:

Volker Sander

FH Aachen University of Applied Sciences

In this paper we describe the implementation of a network providing advanced services such as a Premium service that aims at providing low loss, low delay, and low delay jitter and an Olympic service that allows for a service differentiation in terms of delay within three additional classes. Our implementation of this network is based on the Differentiated Services Architecture, which is the most recent approach of the Internet Engineering Task Force towards quality of service. Access to service classes is controlled by a bandwidth broker, which can perform traffic engineering by means of multiprotocol label switching. The Premium service is implemented as expedited forwarding and the Olympic service as a group of assured forwarding per-hop-behavior. We present a thorough evaluation of the proposed services implemented by the careful assignment of micro-flows to a small set of per-hop behaviors.

Content uploaded by Volker Sander

Content may be subject to copyright.

A Pragmatic Approach for Service Provisioning

Based on a Small Set of Per-Hop Behaviors

Volker Sander

Central Institute for Applied Mathematics,

Forschungszentrum J¨

ulich GmbH, 52425 J¨

ulich, Germany

Email: sander@fz-juelich.de

Markus Fidler

Department of Computer Science, Aachen University

Ahornstr. 55, 52074 Aachen, Germany

Email: ﬁdler@informatik.rwth-aachen.de

Abstract— In this paper we describe the implementation of a

network providing advanced services such as a Premium service

that aims at providing low loss, low delay, and low delay jitter

and an Olympic service that allows for a service differentiation

in terms of delay within three additional classes. Our implemen-

tation of this network is based on the Differentiated Services Ar-

chitecture, which is the most recent approach of the Internet En-

gineering Task Force towards Quality of Service. Access to service

classes is controlled by a Bandwidth Broker, which can perform

Trafﬁc Engineering by means of Multiprotocol Label Switching.

The Premium service is implemented as Expedited Forwarding

and the Olympic service as a group of Assured Forwarding Per-

Hop-Behavior. We present a thorough evaluation of the proposed

services implemented by the careful assignment of micro-ﬂows to

a small set of Per-Hop Behaviors.

I. INTRODUCTION

Emerging Grid applications consist of complex mixes of

ﬂows, with a variety of data rates and widely differing latency

requirements. Applications with these characteristics arise in

areas as remote visualization, analysis of scientiﬁc databases,

and teleimmersion. Table I lists the various ﬂows of a future

teleimmersion application together with their networking de-

mand [6]. Characteristics such as these place substantial de-

mands on networks which cannot be fulﬁlled by today’s Best-

Effort (BE) Internet.

TABLE I

FLOWS AND REQUIRE ME NT S OF TELEIMMERSION APPLIC ATIONS

Latency Bandwidth Reliability Dynamic QoS

Control <30 ms 64 Kb/s Yes Low

Text <100 ms 64 Kb/s Yes Low

Audio <30 ms 128 Kb/s No Medium

Video <100 ms 5 Mb/s No Medium

Tracking <10 ms 128 Kb/s No Medium

Database <100 ms >1 Gb/s Yes High

Simulation <30 ms >1 Gb/s Mixed High

Haptic <10 ms >1 Mb/s Mixed High

Rendering <30 ms >1 Gb/s No Medium

The Differentiated Services (DS) [2] framework deﬁnes an

architecture for implementing scalable service differentiation

in the existing Internet by an aggregation of ﬂows to a small

number of different trafﬁc classes. DS can be complemented

by Trafﬁc Engineering (TE), which is of special interest, if not

This work was supported in part by the Path Allocation in Backbone networks

(PAB) project funded by the German Research Network (DFN) and the Federal

Ministry of Education and Research (BMBF).

only a relative differentiation between services shall be imple-

mented, but, if absolute service guarantees need to be given.

We apply Multi-Protocol Label Switching (MPLS) [13] for this

purpose. MPLS is based on a functional decomposition of the

network layer into a control component and a forwarding com-

ponent. This distinction gives a number of options for the im-

plementation of the control component. In [9], [15] we pre-

sented the General-purpose Architecture for Reservation and

Allocation (GARA), which implements an advance reservation

framework using heterogeneous resource ensembles. GARA

includes a DS reservation manager, which can be used as a

Bandwidth Broker (BB) for an automated DS network manage-

ment and, which allows to apply DS TE by means of MPLS.

In this paper, we describe an implementation of a Premium

service based on the Expedited Forwarding (EF) Per-Hop Be-

havior (PHB) [5] and an Olympic service based on the Assured

Forwarding (AF) PHB group [10]. We perform a careful eval-

uation of the services by applying both Transport Control Pro-

tocol (TCP) and User Datagram Protocol (UDP) ﬂows, and ad-

dress the question about the impact on the achievable service

when these heterogeneous applications share a single PHB. The

remainder of the paper is organized as follows: Sections II de-

scribes the DS architecure in detail. In section III we show re-

sults obtained from measurements in the implemented network.

Section IV concludes the paper.

II. DIFFERENTIATED SERVICES ARCHITECTURE

The DS architecture [2] addresses the scalability problems

of Integrated Services by deﬁning the behavior of aggregates.

Packets are identiﬁed by simple markings that indicate accord-

ing to which aggregate behavior they should be treated. In the

core of the network, routers need not determine to which ﬂow a

packet belongs, only which aggregate behavior should be used.

Edge routers mark packets and indicate whether they are within

proﬁle or if they are out of proﬁle, in which case they might

even be discarded by a dropper at the edge router. A particular

marking on a packet indicates a PHB that has to be applied for

forwarding of the packet. Currently, the EF PHB [5] and the AF

PHB group [10] are speciﬁed. Though the architecture allows

for the deﬁnition of additional PHBs, by setting the 6-bit Dif-

ferentiated Services Code Point (DSCP), end-to-end guarantees

require the support of a certain PHB in all pertaining domains

thus making an end-to-end deployment with only a few PHBs

more feasible.

The EF PHB is intended for building a service that offers low

loss, low delay, and low delay jitter, namely a Premium service.

The speciﬁcation of the EF PHB was recently redeﬁned to al-

low for a more exact and quantiﬁable deﬁnition. Besides the

Premium service a so called Olympic service [10] is proposed

by the IETF to be based on the AF PHB group by extending

it by means of a class based over-provisioning. Three of the

four currently deﬁned classes of the AF PHB group are used

for such an Olympic service. The service differentiation be-

tween the three classes Gold, Silver, and Bronze is proposed to

be performed by the means of admission control, i.e. assigning

only a light load to the Gold class, a medium load to the Silver

class and a high load to the Bronze class.

The Differentiated Services architecture pushes complexity

to the edges of the network: packets are marked to belong to

an aggregate behavior either by applications or by edge routers.

If edge routers mark packets, which is the more general solu-

tion, they may choose to do so on a per-ﬂow basis or on any

other criteria. In this scenario, of course, the question arises

which packets will get marked. This is especially the case when

the environment is dynamic, i.e. when varying ﬂows should be

able to use the available services. Here, a particular resource

manager called a Bandwidth Broker (BB) comes into place. A

BB is a middleware service which controls and facilitates the

dynamic access to network services of a particular administra-

tive domain. BBs are also viewed as the Policy Decision Point

(PDP) of the controlled domain. The concept of a BB is typ-

ically associated with the Differentiated Services architecture.

In this context, the task of a BB is to control the conﬁguration of

the edge routers of a single DS domain. By performing a care-

ful admission control BBs are a fundamental building block for

the provision of network services on top of DS aggregates. The

GARA research prototype is a BB which addresses these issues.

This paper presents a thorough evaluation of the achievable ser-

vice classes provided by the admission control of a BB in a DS

environment which does not rely on the assumption of a broad

variance of aggregates. Instead, heterogenous aggregates con-

sisting of elastic and non-elastic ﬂows are explicitly addressed.

III. EXPERIMENTAL STUDIES

We report on experiments designed to examine a DS imple-

mentation based on commodity products. In the following sub-

sections we ﬁrst give the experimental conﬁguration and then

address the implementation and evaluation of the different traf-

ﬁc classes. We show problems observed when using the BE

service and address these with similar measurements using the

Olympic or the Premium service.

A. Experimental Conﬁguration

Our experimental conﬁguration comprises a laboratory

testbed at the Research Centre J¨

ulich, donated by Cisco Sys-

tems. The testbed allows controlled experimentation with basic

DS mechanisms. Four Cisco Systems 7200 series routers were

used for all experiments. These are either connected by OC3

ATM connections, by Fast Ethernet, or by Gigabit Ethernet

connections. End-system computers are connected to routers

by switched Fast Ethernet connections. Hence, the minimum

MTU size of our testbed is that of the end-systems: 1500 B. To

create a point of congestion, we conﬁgured an ATM Permanent

Virtual Circuit (PVC) between an ingress and an interior router

to 60 Mb/s.

We performed several experiments demonstrating the perfor-

mance of high-end TCP applications [15] like a Guaranteed

Rate (GR) ﬁle transfer with deadline and typical UDP appli-

cations like video streaming [8] or videoconferencing. The fol-

lowing tools have been used for trafﬁc generation:

•gen send/gen recv – BE UDP trafﬁc generator. This traf-

ﬁc generator was applied to generate BE UDP trafﬁc with

a mean rate of 50 Mb/s and different burst charateristics.

These UDP ﬂows do not aim to model any speciﬁc appli-

cation, but we assume that the applied burst characteristics

reﬂect effects that occur in today’s and in the future Inter-

net. TCP streams are initially bursty, UDP based real-time

applications are emerging, which create bursts, for exam-

ple by intra-coded frames in a video sequence. Further on

burst sizes increase in the network, due to aggregation and

multiplexing [4].

•rude/crude – Delay-sensitive UDP trafﬁc generator. This

trafﬁc generator allows to measure the one-way delay and

delay jitter. In our experiments we used real-time traf-

ﬁc patterns from script ﬁles, which we created from pub-

licly available video traces [8]. We applied IP fragmenta-

tion for the transmission of frames that exceed the MTU,

which we consider as being allowed here, since we conﬁg-

ured the DS classes to prevent from dropping fragments.

The sequence, which we applied for the experimental re-

sults shown in this paper, is a television news sequence

produced by the ARD. The sequence is MPEG-4 encoded

with a minimum frame size of 123 B, a maximum frame

size of 17.055 KB, a mean rate of 0.722 Mb/s and a peak

rate of 3.411 Mb/s. The Hurst parameter is about 0.5 and

decays with an increasing aggregation level. Figure 1 il-

lustrates a part of the trafﬁc proﬁle of the sequence.

•ttcp – TCP stream generator. We used the widely known

TCP benchmark ttcp to generate TCP load. In the experi-

ments reported on in this paper we selected an end-system

which was not capable of generating a rate of more than

1.8 MB/s and if not stated otherwise we applied a socket

buffer corresponding to a maximum window size of about

15 MTU.

B. Implementation and Evaluation of the Best Effort Service

Applying the plain BE service to the video test application

used throughout this paper we generate the baseline for our

evaluation. Our conﬁguration allocates the remaining capac-

ity of the ATM bottleneck link, which is not used by any other

class, to the BE class. In the following experiments no other

class than BE is used, resulting in an assignment of 60 Mb/s

of the bottleneck ATM link to the BE class. The tx-ring-limit

parameter on the ATM interface card that speciﬁes the queue

size, which is assigned to the applied ATM PVC, was set to

16 particles each of 512 B allowing to store upto four MTU on

the ATM interface. This value is by far smaller than the de-

fault value, but it has to be applied to allow for an efﬁcient QoS

implementation [7]. The BE layer 3 queue was conﬁgured to

hold at most 256 packets. We consider this queue size, which

is a trade off between delay and loss rate, as being feasible for

BE TCP trafﬁc, which is rather sensitiv to packet drops than to

queuing delay in a range of a few tens of milliseconds.

In ﬁgure 2 the delay measured when transmitting the news

sequence in the BE class is shown. Congestion is generated by

applying an UDP stream with two bursts, each of ten seconds

duration. As can be seen from ﬁgure 2, the delay is bounded

to about 42 ms, showing some minor effects on the measure-

ments due to tail-drop in the router. The delay corresponds to

an effective data rate on the ATM interface of about 48 Mb/s

after subtracting the ATM induced overhead. While this delay

is acceptable for streaming video applications, it can be critical

for real-time video applications like video conferencing.

1000

2000

3000

4000

5000

6000

7000

0 10 20 30 40 50 60

Rate (Kb/s)

Receive Time (s)

Fig. 1. Data Rate UDP News Sequence.

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

0 10 20 30 40 50 60

Delay (s)

Receive Time (s)

Fig. 2. Delay BE UDP News Sequence.

Taking the negative effects of the BE class – a not guaranteed

transmission rate, possible packet loss and a chance of high de-

lay and delay jitter – we argue that certain applications exist,

like a ﬁle transfer with deadline or videoconferencing, which

require services that are better than BE.

C. Implementation and Evaluation of an Olympic Service

A Weighted Fair Queuing (WFQ) environment is used for the

implementation of the Olympic service based on three AF PHB

classes. Within these classes GARA is capable of managing

the allocated resources and the relative load in order to allow

for a service differentiation in terms of delay. The Olympic ser-

vice [10] proposed by the IETF is realised by admission con-

trol and a class based over-provisioning. We carried out ex-

periments with the transmission of the news sequence in each

of the Olympic classes, with the classes conﬁgured according

to Table II. Within each of the Olympic classes a differentia-

TABLE II

CORE CONFIGU RATION O F TH E OLYMPIC CLASSE S

Class Percent Gross Capacity Net Capacity Over-Provision

Bronze 5 % 3 Mb/s 2.4 Mb/s ≥1×

Silver 10 % 6 Mb/s 4.8 Mb/s ≥2×

Gold 15 % 9 Mb/s 7.2 Mb/s ≥3×

tion of the drop probability for differently marked excess traf-

ﬁc can be performed by applying Multiple Random Early De-

tection (M-RED). Nevertheless, we consider excess trafﬁc in

an over-provisioned class as harmful for the BE class. There-

fore we mark the conforming trafﬁc and drop excess trafﬁc in

the over-provisioned classes. The layer 3 queue size of each

of the three Olympic classes was conﬁgured to 128 packets in

the WFQ environment. Consequently, the ingress meter and

marker are based on a token bucket with a conﬁrmed informa-

tion rate of 2.4 Mbit/s for all Olympic classes, which leads to

the over-provisioning factors given in Table II. A conﬁrmed

burst size of 32 MTU is used at the ingress. This value is inten-

tionally smaller than the queue size that is applied in the core, to

avoid packet drops in the Olympic classes within the network,

to avoid a high utilization of the queuing space, and thus to re-

duce queuing delays. Besides it has to be noted that the WFQ

queue size is conﬁgured in packets, which can be smaller than

the MTU, whereas the conﬁrmed burst size that is used by the

meter and marker is conﬁgured in bytes.

Figure 3 shows the measured delay for the news sequence

in the Bronze Class and the impacts of congestion in the BE

class on the Bronze class. Compared to the transmission of the

sequence within the BE class, which is shown in Figure 2, the

delay is reduced signiﬁcantly. Furthermore, packet drops did

not occur in the Bronze class. Thereby AF based services can

be applied as GR service without packet loss for conforming

trafﬁc. The delay and delay jitter differentiation, which can

be achieved in addition by the Olympic service, is shown in

Figure 4 and 5 for the Silver and the Gold class respectively,

compared to the Bronze class in Figure 3.

Additionally, we present experiments with TCP in the Bronze

class and demonstrate how TCP can be conﬁgured in a GR en-

vironment to achieve the desired throughput. We show that, if

the pertaining class is conﬁgured properly, packet drops do not

occur, which prevents from halving the TCP congestion win-

dow. The data rate instead corresponds to the capacity allo-

cated for the ﬂow. To avoid effects on the RTT by upstream

congestion, the acknowledgements are also transmitted in the

Bronze class. The maximum window size is in our experiments

controlled by setting the socket buffer size. The resulting RTT

can be computed according to the bandwidth-delay product:

W=R·RT T , with Wdenoting the maximum window size

and Rdenoting the conﬁgured GR capacity. The RTT adjusts

to the available or conﬁgured capacity and to the conﬁgured

maximum window size.

0.005

0.01

0.015

0.02

0.025

0.03

0 10 20 30 40 50 60

Delay (s)

Receive Time (s)

Fig. 3. Delay Bronze UDP News Sequence.

0.005

0.01

0.015

0.02

0.025

0.03

0 10 20 30 40 50 60

Delay (s)

Receive Time (s)

Fig. 4. Delay Silver UDP News Sequence.

0.005

0.01

0.015

0.02

0.025

0.03

0 10 20 30 40 50 60

Delay (s)

Receive Time (s)

Fig. 5. Delay Gold UDP News Sequence.

0.005

0.01

0.015

0.02

0.025

0.03

0 10 20 30 40 50 60

Delay (s)

Receive Time (s)

Fig. 6. Delay Premium UDP News Sequence.

For these experiments we conﬁgured the Bronze class to

25 % of the bottleneck link capacity, corresponding to a net data

rate of about 1.6 MB/s. Figure 7 shows the RTT for a conﬁgured

socket buffer of 15 MTU. Congestion in the BE class starts af-

ter 10 s and leads to an increase in the RTT, which corresponds

to the queuing delay added by queuing the data of a complete

TCP window. Figure 8 shows the corresponding throughput.

At the beginning the application limits the data rate to about

1.8 MB/s and after the BE downstream congestion started, the

limitation is given by the conﬁgured capacity for the Bronze

class at about 1.6 MB/s and from the TCP point of view leads

to a limitation of the sending rate by the offered window. The

same effect on the throughput can be observed, if the maximum

window is increased by conﬁguring a socket buffer of 32 MTU.

Figure 9 and ﬁgure 10 show the resulting RTT and throughput

for this conﬁguration. Again the RTT by increased queuing de-

lay is adjusted to the available capacity and the window size,

being about twice as high as in the previous experiment.

From these TCP experiments it can be seen that during peri-

ods of BE congestion WFQ acts as an aggregate trafﬁc shaper

with a rate corresponding to the conﬁgured WFQ weight. The

achieved TCP throughput is independent of the TCP window

size, as shown in Figure 8 and 10.

D. Implementation and Evaluation of a Premium Service

In a ﬁrst experiment the Premium service was implemented

based on EF using Priority Queuing (PQ). The ingress router

was conﬁgured to apply a meter and marker with a conﬁrmed

information rate of 4.8 Mb/s and a burst size of 32 MTU. Ex-

cess trafﬁc is dropped. The parameters that were applied at the

ingress router were reﬂected by the core conﬁguration. The PQ

scheduler was bound to 10 % of the bottleneck link capacity,

corresponding to about 4.8 Mb/s. Bursts of up to 48 KB are per-

mitted in the core. Figure 6 shows the results of a transmission

of the news sequence. A reduction of the transmission delay

and delay jitter especially for big video frames, which lead to

packet bursts, becomes obvious for PQ compared to the WFQ

settings in Table II. Here the tx-ring-limit parameter, which is

used to conﬁgure the outgoing non-preemptive interface queu-

ing capacity, is of major importance [7].

The following series of experiments were applied to an im-

plementation of the EF PHB with the goal to analyze the be-

havior of a heterogeneous aggregate. It uses WFQ to emulate

strict PQ by provisioning 99 % of the available capacity to the

EF aggregate. This is to ensure that the queue of the EF ag-

gregate caused by possible bursts is minimized to reduce the

queuing delay. Note that the BB prototype GARA performs a

careful admission control and is thus preventing the starvation

of the BE trafﬁc. The particular challenge is caused by apply-

ing elastic and non-elastic trafﬁc in a single EF aggregate. The

setup consisted of following three ﬂows, which passed an ATM

bottleneck link of 100 Mb/s capacity:

•The ﬁrst ﬂow entering the testbed was a delay-sensitive

Premium UDP ﬂow. It ran from the beginning to the end

of the measurement. GARA acted as a BB to associate

the ﬂow to the EF PHB. The related UDP trafﬁc generator

was conﬁgured to achieve a rate of 40 Mpbs by constantly

0 5 10 15 20 25 30 35 40

RTT (ms)

Time (s)

Fig. 7. TCP Round-Trip-Time Bronze 15 MTU Socket Buffer

2000

4000

6000

8000

10000

12000

14000

16000

18000

0 5 10 15 20 25 30 35 40

Link Data Rate (Kb/s)

Time (s)

Fig. 8. TCP Throughput Bronze 15 MTU Socket Buffer

0 5 10 15 20 25 30 35 40

RTT (ms)

Time (s)

Fig. 9. TCP Round-Trip-Time Bronze 32 MTU Socket Buffer

2000

4000

6000

8000

10000

12000

14000

16000

18000

0 5 10 15 20 25 30 35 40

Link Data Rate (Kb/s)

Time (s)

Fig. 10. TCP Throughput Bronze 32 MTU Socket Buffer

submitting 1 KB packets every 0.2 milliseconds. The re-

ceiver continuously reported the delay calculated from the

time-stamps in the packets.

•The second ﬂow in the experiment was a GR TCP ﬂow

which was roughly active in the time interval between 20

and 60 seconds. Emulating a distributed supercomputing

application, we created a bursty TCP stream which was

injecting data in chunks of 256 KB into the network, us-

ing the EF PHB. Every 8th message contained two chunks,

i.e. 512 KB. The average rate of the ﬂow was 16 Mb/s. Us-

ing GARA, we claimed a slightly higher guaranteed band-

width reservation, allowing bursts of up to one full chunk.

•The third ﬂow started during the experiment was a BE

UDP ﬂow which was roughly active in the time interval

between 40 and 80 seconds. Our main intention was to

create a heavy congestion by submitting 750 byte packets

at a frequency of 10000 Hz, to achieve a rate of 60 Mb/s.

To demonstrate the impact of a single Premium ﬂow un-

der congestion, the competing UDP ﬂow was still active

after the TCP ﬂow ends. The BE ﬂow thus consumed a

signiﬁcant amount of the available capacity.

Figure 11 shows that the selected single-aggregate imple-

mentation is not appropriate for providing delay-sensitive ser-

vices when bursty TCP ﬂows use the same aggregate in paral-

lel. If a burst introduced by the TCP ﬂow exceeds the available

output link capacity, packets get queued on the IP-layer queue.

Because packets of the Premium UDP ﬂow are also queued,

the delay variation increases signiﬁcantly. We can easily re-

validate the result illustrated in Figure 11 by some simple cal-

culations. The Premium service is used by a UDP application

which is transmitting data at a rate of 40 Mb/s. This application

shares the aggregate with a TCP ﬂow which is injecting bursts

of 256 KB at the link speed of its Fast Ethernet interface, i.e.

at a rate of 100 Mb/s. Hence, the 256 KB of data enter the EF

aggregate within 20 milliseconds. The total amount of data en-

tering the EF aggregate in this time interval is thus 356 KB. As-

suming an ATM overhead of 20 %, the EF aggregate is served at

a rate of 80 Mb/s. We thus know that 200 KB of EF data leave

the router during this 20 ms interval. Consequently, at the end

of the interval there exists an EF queue of 156 KB that leads to

an upper delay boundary of 15 ms.

In order to inject a trafﬁc proﬁle which is conforming to the

Service Level Agreement with the peered downstream domain,

the egress router of a DS domain might be enforced to shape

out the trafﬁc of a whole aggregate. When this is applied to the

scenario illustrated above, the impact caused by the bursts of

the TCP stream might be ampliﬁed by the queuing introduced

by trafﬁc shaping. Figure 12 illustrates the impact of trafﬁc

shaping when it is performed for an aggregate. Trafﬁc shap-

ing can be viewed as an additional constraint which limits the

EF capacity of the output link by shaping the rate to the given

trafﬁc proﬁle. Hence, EF packets get queued whenever TCP

bursts cause the shaper to become active. In the illustrated sce-

nario, the shaper became active whenever the TCP application

produced a burst of two data chunks every 2 s.

As trafﬁc shaping over an aggregate has a negative impact on

the delay variation of a Premium ﬂow, the EF implementation

proposed here uses a ﬂow-based service differentiation on the

0 10 20 30 40 50 60 70 80 90

Type-P One-way Delay (ms)

Receive Time (s)

Fig. 11. No Shaping

0 10 20 30 40 50 60 70 80 90

Type-P One-way Delay (ms)

Receive Time (s)

Fig. 12. Aggregate Shaping

0 10 20 30 40 50 60 70 80 90

Type-P One-way Delay (ms)

Receive Time (s)

Fig. 13. Per-Flow Shaping

output interface of the egress router. In detail, GARA applied

an additional router internal packet marking mechanism, called

“qos-group” which facilitates an efﬁcient packet classiﬁcation.

The router conﬁguration propagated by GARA extended the ba-

sic DSCP marking by also assigning the “qos-group” for the

related ﬂow. This additional classiﬁcation was then used to

update the conﬁguration of the output interface of the ingress

router to shape out the related GR ﬂow. Figure 13 demonstrates

the EF PHB implementation providing a Premium service and

a GR service using a single aggregate. The remaining impact

is caused by the device queues which are under the control of

the network administrator. Figure 13 also illustrates a side im-

pact of the implementation. The jitter of the delay-sensitive

ﬂow sharing the aggregate with a bursty TCP ﬂow is decreased

when congestion occurs. This result is caused by the fact that

the TCP packets are served by their assigned rate and the active

shaping conﬁguration. As illustrated in Figure 8 and 10, WFQ

operates in periods of congestion as an additional shaper. In that

case, less TCP packets get served when they compete with the

smaller BE UDP packets. Hence, the interference of the larger

packets is reduced, which reduces the maximum experienced

delay.

IV. CONCLUSIONS AND FUTURE WORK

We have presented a quantitative evaluation of a DS imple-

mentation providing a Premium service and an Olympic service

automatically conﬁgured by GARA. Our evaluation addressed

the QoS demand of heterogeneous types of ﬂows sharing a

single PHB. The experiments presented used commodity hard-

ware. This demonstrates that real application can actually use

DS, especially if access to services is automated by a BB such

as GARA. The pragmatic approach of limiting the assumptions

made about the underlying PHBs addresses potential deploy-

ment constraints and facilitates the negotiation of trafﬁc trunk

encoding between peered domains.

Our future work will focus on larger scenarios with several

possible bottleneck links. These require complex TE mecha-

nisms and an advanced resource management, which we aim at

addressing with GARA and MPLS.

REFERENCES

[1] Allman, M., Paxson, V., and Stevens, W.: TCP Congestion Control. RFC

2581, (1997)

[2] Blake, S. et al.: An Architecture for Differentiated Services. RFC 2475,

(1998)

[3] Cardwell, N., Savage, S., and Anderson, T.: Modeling TCP Latency. Pro-

ceedings of IEEE Infocom, (2000)

[4] Charny, A., and Le Boudec, J.-Y.: Delay Bounds in a Network With Ag-

gregate Scheduling. Proceedings of QofIS, (2000)

[5] Davie, B. et al.: An Expedited Forwarding PHB. RFC 3246, (2002)

[6] DeFanti, T., and Stevens, R.: Teleimmersion. In Foster, I., and Kesselman,

C.: The Grid: Blueprint for a Future Computing Infrastructure. Morgan-

Kaufmann, (1998)

[7] Ferrari, T., Pau, G., and Raffaelli, C.: Priority Queuing Applied to Ex-

pedited Forwarding: A Measurement-Based Analysis. Proceedings of

QofIS, (2000)

[8] Fitzek, F., and Reisslein, M.: MPEG–4 and H.263 Video Traces for Net-

work Performance Evaluation. IEEE Network, (2001), 15(6):40-54

[9] Foster, I., Roy, A., Sander, V., and Winkler, L.: End-

to-End Quality of Service for High-End Applications.

http://www.mcs.anl.gov/qos/qos papers.htm, (1999)

[10] Heinanen, J. et al.: Assured Forwarding PHB Group”. RFC 2597, (1999)

[11] Mathis, M., Semke, J., Mahdavi, J., and Ott, T.: The Macroscopic Behav-

ior of the TCP Congestion Avoidance Algorithm. Proceedings of ACM

SIGCOMM, (1997)

[12] Padhye, J., Firoiu, V., Towsley, D., and Kurose, J.: Modeling TCP

Troughput: A simple model and its empirical validation. Proceedings of

ACM SIGCOMM, (1998)

[13] Rosen, E., Viswanathan, A., and Callon, R.: Multiprotocol Label Switch-

ing Architecture. RFC 3031, (2001)

[14] Sander, V., Adamson, W., Foster, I, and Roy, A: End-to-End Provision of

Policy Information for Network QoS. IEEE Symposium on High Perfor-

mance Distributed Computing, (2001)

[15] Sander, V., Foster, I., Roy, A., and Winkler, L.: A Differentiated Services

Implementation for High-Performance TCP Flows. Terena Networking

Conference, (2000)

[16] Wu, Q., and Williamson, C.: Improving Ensemble-TCP Performance on

Asymmetric Networks. Proceedings of MASCOTS, (2001)

Multi-class applications for parallel usage of a guaranteed rate and a scavenger service

Conference Paper

Full-text available

Jun 2003

Grid computing requires network services beyond what is currently provided by the Best-Effort Internet. Among the different approaches towards network Quality of Service, aggregate scheduling, which maps micro-flows to a small number of different service classes, offers sufficient scalability up to the size of the Internet. The Differentiated Services architecture of the Internet Engineering Task Force is such an approach. Recently further Best-Effort like services have been proposed that are based on aggregate scheduling, like the less than Best-Effort Scavenger Service and the Alternative Best-Effort Service. In this paper we introduce the existing aggregate based approaches to Quality of Service. We take a heterogeneous mix of Transmission Control Protocol and User Datagram Protocol flows into account and complement services by adding transport protocol specific elements, if appropriate. In particular we show multi-class applications that are designed to apply a Guaranteed Rate Service and a Scavenger Service in parallel. Doing so we can show a performance gain and achieve a more economical use of the available resources without impacting responsive Best-Effort flows.

Gara: A Uniform Quality of Service Architecture

Article

Jan 2004

Many Grid applications, such as interactive and collaborative environments, can benefit from guarantees for resource performance or quality of service (QoS). Although QoS mechanisms have been developed for different types of resources, they are often difficult to use together because they have different semantics and interfaces. Moreover, many of them do not allow QoS requests to be made in advance of when they are needed. In this paper, we describe GARA, which is a modular and extensible QoS architecture that allows users to make advance reservations for different types of QoS. We also describe our implementation of network QoS in detail.

An expedited forwarding PHB (Per-Hop Behavior)

Article

Full-text available

Mar 2002

This document was written during the process of clarification of RFC2598 "An Expedited Forwarding PHB" that led to the publication of revised specification of EF "An Expedited Forwarding PHB". Its primary motivation is providing additional explanation to the revised EF definition and its properties. The document also provides additional implementation examples and gives some guidance for computation of the numerical parameters of the new definition for several well known schedulers and router architectures.

Delay Bounds in a Network with Aggregate Scheduling

Conference Paper

Full-text available

Jan 2000
Lect Notes Comput Sci

A large number of products implementing aggregate buffering and scheduling mechanisms have been developed and deployed, and still more are under development. With the rapid increase in the demand for reliable end-to-end QoS solutions, it becomes increasingly important to understand the implications of aggregate scheduling on the resulting QoS capabilities. This paper studies the bounds on the worst case delay in a network implementing aggregate scheduling. We derive an upper bound on the queuing delay as a function of priority traffic utilization and the maximum hop count of any flow, and the shaping parameters at the network ingress. Our bound explodes at a certain utilization level which is a function of the hop count. We show that for a general network configuration and larger utilization utilization an upper bound on delay, if it exists, must be a function of the number of nodes and/or the number of flows in the network.

Modeling TCP through-put: A simple model and its empirical validation

Article

Jan 2000
COMPUT COMMUN REV

In this paper we develop a simple analytic characterization of the steady state throughput, as a func-tion of loss rate and round trip time for a bulk transfer TCP flow, i. e., a flow with an unlimited amount of data to send. Unlike the models in [6, 7, 10], our model captures not only the behavior of TCP's fast retransmit mechanism (which is also considered in [6, 7, 10]) but also the effect of TCP's timeout mech-anism on throughput. Our measurements suggest that this latter behavior is important from a modeling perspective, as almost all of our TCP traces contained more timeout events than fast retransmit events. Our measurements demonstrate that our model is able to more accurately predict TCP throughput and is accurate over a wider range of loss rates.

An Architecture for Differentiated Services

Article

Rfc Ietf

The grid: blueprint for a future computing infrastructure

Article

Jan 1999

Multiprotocol Label Switching Architecture" RFC 3031

Article

Jan 2001

An Architecture for Differentiated Services. RFC 2475

Article

Jan 1998

TCP congestion control

Article

Apr 1999

This document defines TCP's four intertwined congestion control algorithms: slow start, congestion avoidance, fast retransmit, and fast recovery. In addition, the document specifies how TCP should begin transmission after a relatively long idle period, as well as discussing various acknowledgment generation methods.

Assured forwarding PHB group

Article

Jun 1999

This document defines a general use Differentiated Services (DS) [Blake] Per-Hop-Behavior (PHB) Group called Assured Forwarding (AF). The AF PHB group provides delivery of IP packets in four independently forwarded AF classes. Within each AF class, an IP packet can be assigned one of three different levels of drop precedence. A DS node does not reorder IP packets of the same microflow if they belong to the same AF class.

Differentiated services implementation for high-performance TCP flows

Article

Mar 2000
COMPUT NETW

The IETF’s recent differentiated services (DS) architecture, which specifies a scalable mechanism for treating packets differently, offers new opportunities for building end-to-end quality of service (QoS) systems. However, it also introduces new challenges. In particular, it is not clear whether TCP’s flow and congestion control mechanisms work well with the mechanisms used for end-to-end QoS. For that reason it is essential to analyze whether the existing DS mechanisms can be used with standard TCP implementations or whether it is necessary to wait for upcoming features introduced in future modified versions of TCP. The general-purpose architecture for reservation and allocation (GARA) supports flow-specific QoS specification, immediate and advance reservation, and online monitoring and control of both individual resources and heterogeneous resource ensembles. Using GARA, we evaluated actual DS mechanisms provided by Cisco routers. We present the results of this evaluation and discuss their impact on the performance of popular TCP implementations.

A pragmatic approach for service provisioning based on a small set of per-hop behaviors

Abstract

Recommended publications

Performance evaluation of per-hop forwarding behavior in the Diffserv Internet

Multi-class applications for parallel usage of a guaranteed rate and a scavenger service

Evaluation of a Differentiated Services Based Implementation of a Premium and an Olympic Service

Statistical End-to-end Performance Bounds for Networks under Long Memory FBM Cross Traffic

Traffic shaping in aggregate-based networks: Implementation and analysis