ArticlePDF Available

Extending the Birkhoff-von Neumann Switching Strategy for Multicast - On the use of Optical Splitting in Switches

September 2007
IEEE Journal on Selected Areas in Communications 25(6):36-50

September 2007
25(6):36-50

DOI:10.1109/JSAC-OCN.2007.026006

Source
IEEE Xplore

Authors:

Supratim Deb

Muriel Médard

Massachusetts Institute of Technology

The Birkhoff-von Neumann (BVN) strategy for single-stage input-queued crossbar switches does not support multicast, as it considers only permutation-based switch configurations. This paper extends the BVN strategy to multicast switching, where an input can simultaneously transmit to multiple outputs. Knowledge of the average rates of flows is used to compute an offline schedule. We begin by considering a system in which the fanout of each flow is split in a predecided manner. We call this static splitting (as opposed to dynamic splitting where no such constraint is imposed), and we study the rate region of the switch under this restriction. We provide a graph-theoretic formulation of the rate region.

Switch connection states : Permutation and Direct Multicast

…

(a) Copying: Cell sent to 3 outputs in 3 time slots (b) No-splitting: Cell sent to all 3 outputs in same time slot (c) Partial Fanout Splitting: Cell sent to 2 outputs in one slot, and to third output in another slot

…

How should a router exploit the optical switching fabric?

…

Viewing the entire network as a crossbar switch

…

The BVN switching algorithm

…

Figures - uploaded by Muriel Médard

Content may be subject to copyright.

Content uploaded by Muriel Médard

Content may be subject to copyright.

Content uploaded by Muriel Médard

Content may be subject to copyright.

A preview of the PDF is not available

Optimizing multicast flows in high-bandwidth reconfigurable datacenter networks

Article

Apr 2022

Modern cloud applications has led to a huge increase in multicast flows, which is becoming one of the primary communication patterns in nowadays datacenter networks. Emerging datacenter technologies enable interesting new opportunities to support such multicast traffic more effectively and flexibly in the physical layer: novel circuit switches offer high-bandwidth and reconfigurable inter-rack multicasting capabilities. However, not much is known today about the algorithmic challenges introduced by this new technology, especially in optimizing the completion times for multicast flows. This paper presents SplitCast, a preemptive multicast scheduling approach that fully exploits emerging high-bandwidth physical-layer multicasting capabilities to reduce flow times. SplitCast dynamically reconfigures the circuit switches to adapt to the multicast traffic, accounting for reconfiguration delays. In particular, SplitCast relies on simple single-hop routing and leverages transfer flexibilities by supporting splittable multicast so that a transfer can already be delivered to just a subset of receivers when the circuit capacity is insufficient. Moreover, SplitCast supports two common forwarding models, the all-stop and the not-all-stop, during circuit reconfiguration. We conduct extensive simulation to evaluate the performance of SplitCast, and the results show that SplitCast can cut down flow times significantly compared to state-of-the-art solutions.

Network Coding in a Multicast Switch

Conference Paper

Full-text available

Jun 2007

We consider the problem of serving multicast flows in a crossbar switch. We show that linear network coding across packets of a flow can sustain traffic patterns that cannot be served if network coding were not allowed. Thus, network coding leads to a larger rate region in a multicast crossbar switch. We demonstrate a traffic pattern which requires a switch speedup if coding is not allowed, whereas, with coding the speedup requirement is eliminated completely. In addition to throughput benefits, coding simplifies the characterization of the rate region. We give a graph-theoretic characterization of the rate region with fanout splitting and intra-flow coding, in terms of the stable set polytope of the "enhanced conflict graph" of the traffic pattern. Such a formulation is not known in the case of fanout splitting without coding. We show that computing the offline schedule (i.e. using prior knowledge of the flow arrival rates) can be reduced to certain graph coloring problems. Finally, we propose online algorithms (i.e. using only the current queue occupancy information) for multicast scheduling based on our graph-theoretic formulation. In particular, we show that a maximum weighted stable set algorithm stabilizes the queues for all rates within the rate region.

Network Coding in a Multicast Switch

Article

Full-text available

Feb 2011

The problem of serving multicast flows in a crossbar switch is considered. Intraflow linear network coding is shown to achieve a larger rate region than the case without coding. A traffic pattern is presented which is achievable with coding but requires a switch speedup when coding is not allowed. The rate region with coding can be characterized in a simple graph-theoretic manner, in terms of the stable set polytope of the "enhanced conflict graph". No such graph-theoretic characterization is known for the case of fanout-splitting without coding. The minimum speedup needed to achieve 100% throughput with coding is shown to be upper bounded by the imperfection ratio of the enhanced conflict graph, where the imperfection ratio measures a certain graph theoretic property of the given graph. When applied to K × N switches with unicasts and broadcasts only, this gives a bound of min(2K-1/K, 2N/N+1) on the speedup. This shows that speedup, which is usually implemented in hardware, can often be substituted by network coding, which can be done in software. Computing an offline schedule (using prior knowledge of the flow rates) is reduced to fractional weighted graph coloring. A graph-theoretic online scheduling algorithm (using only queue occupancy information) is also proposed, that stabilizes the queues for all rates within the rate region.

Queued cross-bar network models for replication and coded storage systems

Article

Full-text available

Jun 2014

Coding techniques may be useful for data center data survivability as well as for reducing traffic congestion. We present a queued cross-bar network (QCN) method that can be used for traffic analysis of both replication/uncoded and coded storage systems. We develop a framework for generating QCN rate regions (RRs) by analyzing their conflict graph stable set polytopes (SSPs). In doing so, we apply recent results from graph theory on the characterization of particular graph SSPs. We characterize the SSP of QCN conflict graphs under a variety of traffic patterns, allowing for their efficient RR computation. For uncoded systems, we show how to compute RRs and find rate optimal scheduling algorithms. For coded storage, we develop a RR upper bound, for which we provide an intuitive interpretation. We show that the coded storage RR upper bound is achievable in certain coded systems in which drives store sufficient coded information, as well in certain dynamic coding systems. Numerical illustrations show that coded storage can result in gains in RR volume of approximately 50%, averaged across traffic patterns.

Scheduling Advantages of Network Coded Storage in Point-to-Multipoint Networks

Article

Full-text available

Feb 2014

We consider scheduling strategies for point-to-multipoint (PMP) storage area networks (SANs) that use network coded storage (NCS). In particular, we present a simple SAN system model, two server scheduling algorithms for PMP networks, and analytical expressions for internal and external blocking probability. We point to select scheduling advantages in NCS systems under normal operating conditions, where content requests can be temporarily denied owing to finite system capacity from drive I/O access or storage redundancy limitations. NCS can lead to improvements in throughput and blocking probability due to increased immediate scheduling options, and complements other well documented NCS advantages such as regeneration, and can be used as a guide for future storage system design.

Distributed Computing of Functions of Structured Sources with Helper Side Information

Conference Paper

Sep 2023

Derya Malak

SplitCast: Optimizing Multicast Flows in Reconfigurable Datacenter Networks

Conference Paper

Apr 2020

Scheduling Dependent Coflows to Minimize the Total Weighted Job Completion Time in Datacenters

Article

May 2019
COMPUT NETW

Datacenter networks are critical to cloud computing. The coflow abstraction is a major leap forward of application-aware network scheduling. In the context of multi-stage jobs, there are dependencies among coflows. As a result, there is a large divergence between coflow-completion-time (CCT) and job-completion-time (JCT). To our best knowledge, this is the first work that systematically studies: how to schedule dependent coflows of multi-stage jobs, so that the total weighted job completion time can be minimized. We present a formal mathematical formulation. Inspired by the optimal solution of the relaxed linear programming, we design an algorithm that runs in polynomial time to solve this problem with an approximation ratio of (2M+1) in general case, and 3 in special case, where M is the number of hosts. Evaluation results demonstrate that, the largest gap between our algorithm and the lower bound is only 9.14%. In testbeds, we reduce the JCT by up to 81.65% comparing with pure DCTCP. In simulations, we reduce the average JCT by up to 33.48% comparing with Aalo, a heuristic multi-stage coflow scheduler; we reduce the total weighted JCT by up to 83.58% comparing with LP-OV-LS, the state-of-the-art approximation algorithm of coflow scheduling.

Rate Quantization and the Speedup Required to Achieve 100% Throughput for Multicast Over Crossbar Switches

Article

Aug 2010

C. Emre Koksal

The problem of providing quality-of-service (QoS) guarantees for multicast traffic over crossbar switches has received limited attention despite the popularity of its counterpart for unicast traffic. Providing a 100% throughput to all admissible multicast traffic has been shown to be a very difficult task, and it requires a very high speedup in the switching fabric. In this paper, we introduce the concept of rate quantization and use rate quantization to show an analogy between packet scheduling in crossbar switches and circuit switching in three-stage Clos networks. We exploit the analogy to adopt circuit-switching algorithms in wide-sense and strict-sense nonblocking Clos networks in order to construct nonblocking packet schedulers for unicast and multicast traffic. We illustrate a simple multicast nonblocking packet scheduler, for which a speedup of 6logn/loglogn is sufficient to support 100% throughput for any admissible multicast traffic in an n×n crossbar switch. Moreover, we revisit some problems in unicast switch scheduling. We illustrate that the analogy provides useful perspectives, and we give a simple proof for a well-known result.

Academic and Research Staff

Article

Full-text available

Broadband Packet Switching Technologies: A Practical Guide to ATM Switches and IP Routers

Book

Feb 2002

Algorithms and Combinatorics

Book

Jan 1993

A certain zero-sum two-person game equivalent to the optimal assignment problem

Article

Jan 1953

J. Von Neumann

Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks

Article

Jan 1992

Design Aspects of optical communication networks

Chapter

Apr 2002

Birkhoff-von Neumann input-buffered crossbar switches for guaranteed-rate services

Conference Paper

Jul 2001

Based on a decomposition result by Birkhoff and von Neumann for a doubly substochastic matrix, in this letter we propose a scheduling algorithm that is capable of providing guaranteed-rate services for input-buffered crossbar switches. Our guarantees are uniformly good for all nonuniform traffic. The computational complexity to identify the scheduling algorithm is O(N-4.5) for an N x N switch. Once the algorithm is identified, its on-line computational complexity is O(log N) and its on-line memory complexity is O(N-3 log N).

Shared‐Memory Switches

Chapter

Feb 2002

Chapter 4 details the operation principles in different design approaches of shared-memory switches, including linked list, content-addressable memory (CAM), space-time-space, multistage. It also covers multicasting methods in shared-memory switches, including multicast logic queue, cell copy circuit, and address copy circuit.

Très observaciones sobre el algebra linear

Article

Jan 1946

G. Birkhoff

Reducibility among combinatorial problems

Article

Jan 1975

A generalized processor sharing approach to flow control in integrated services networks: The single-node case. IEEE/ACM Transactions on Networking (TON), 1(3), 344-357

Article

Jun 1993

The problem of allocating network resources to the users of an integrated services network is investigated in the context of rate-based flow control. The network is assumed to be a virtual circuit, connection-based packet network. It is shown that the use of generalized processor sharing (GPS), when combined with leaky bucket admission control, allows the network to make a wide range of worst-case performance guarantees on throughput and delay. The scheme is flexible in that different users may be given widely different performance guarantees and is efficient in that each of the servers is work conserving. The authors present a practical packet-by-packet service discipline, PGPS that closely approximates GPS. This allows them to relate results for GPS to the packet-by-packet scheme in a precise manner. The performance of a single-server GPS system is analyzed exactly from the standpoint of worst-case packet delay and burstiness when the sources are constrained by leaky buckets. The worst-case session backlogs are also determined.< ></ETX

Extending the Birkhoff-von Neumann Switching Strategy for Multicast - On the use of Optical Splitting in Switches

Abstract and Figures

Recommended publications

4430 9 99 !" %$')(+*,-/.0 99 9999 '12 $3 547698:!";69<= >?a@ B8c/ !";69<=

Scheduling Multirate Periodic Traffic In A Packet Switch

Line switching for alleviating overloads under line outage condition taking bus voltage limits into...

Schedulability analysis of a graph-based task model for mixed-criticality systems