Enhanced Overloaded CDMA Interconnect (OCI) Bus Architecture for on-Chip Communication

August 2015

DOI:10.1109/HOTI.2015.12

Conference: 2015 IEEE 23rd Annual Symposium on High-Performance Interconnects (HOTI).
At: Santa Clara, CA, USA

Authors:

Khaled Ahmed

Alexandria University

Mohammed Morsy Farag

Alexandria University

SoC CDMA XOR encoder and accumulator decoder

Pipelined Difference Overloaded CDMA bus system containing the hybrid encoder, and both the orthogonal and the PD overloaded codes decoders.

Synthesis and implementation results of the overload CDMA bus for code length N = {8, 16, 32, 64}.

Khaled E. Ahmed, Mohammed M. Farag

Electrical Engineering Department, Faculty of Engineering, Alexandria University, Alexandria, Egypt

Email: k.e.elsayed@ieee.org, mmorsy@alexu.edu.eg

Abstract—On-chip interconnect is a major building block and a main performance bottleneck in modern complex System-on-Chips (SoCs). The bus topology and its derivatives are the most deployed communication architectures in contemporary SoCs. Space switching, exemplified by crossbars and multiplexers, and time sharing are the key enablers of various bus architectures. The crossbar has quadratic complexity, while resource sharing significantly degrades the overall system’s performance. In this work we motivate using Code Division Multiple Access (CDMA) as a bus sharing strategy, which offers many advantages over other topologies. Our work seeks to complement the conventional CDMA bus features by applying overloaded CDMA practices to increase the bus utilization efficiency. We propose the Difference-Overloaded CDMA Interconnect (D-OCI) bus that leverages the balancing property of the Walsh codes to increase the number of interconnected elements by 50%. Two implementations of the D-OCI bus optimized for both speed and resource utilization are presented. The bus operation is validated on a Xilinx Artix-7 AC701 FPGA kit and the bus performance is evaluated and compared to other existing bus topologies. We also present the synthesis results for the UMC-0.13 μm design kit to give an idea of the maximum achievable bus frequency on ASIC platforms. Moreover, we advance a proof-of-concept HLS implementation of the D-OCI bus on a Xilinx Zynq-7000 SoC and compare its performance, latency, and resource utilization to the ARM AXI bus. The performance evaluation demonstrates the superiority of the D-OCI bus.

Keywords—SoC, CDMA, Bus Architecture, On-Chip Interconnect, CDMA Bus, Multiple Access Interference, Overloaded CDMA.

I. INTRODUCTION

System-on-Chips (SoCs) are becoming increasingly complex as the transistor feature size scales down. More IP cores can fit on the same die, which causes an exponential increase in the interconnection complexity [1]. The performance of individual IP cores used in SoCs is typically optimized by the vendor, leaving the task of implementing the on-chip interconnection architecture to the system designer. This task is not trivial since the wiring density directly impacts the system’s performance, resources, and power consumption. In some applications, on-chip interconnects can be the system’s performance bottleneck, which necessitates optimizing the interconnect logical topology. Buses and Networks-on-Chips (NoCs) are the most deployed topologies for on-chip interconnect in SoCs [2].

The straightforward approach to realize on-chip communication is space switching, exemplified by crossbar switches, where every IP core is physically wired to every other element by a dedicated link, providing the best achievable connectivity. The interconnect complexity of the crossbar scales quadratically with the number of on-chip cores [3], rendering it a feasible solution only for a small number of cores. Another common approach to realize on-chip communication is the bus topology, which prevails in contemporary SoC designs. In the bus topology, Time Division Multiple Access (TDMA) is adopted, where all cores are connected to the same bus and bus access is time shared between the interconnected elements according to the bus arbitration rules. As the number of on-chip components increases, the efficiency of the TDMA bus decreases due to bus contention and increased sharing overheads on the bus [4]. Many SoC designs attempt to overcome this problem by employing hierarchical bus topologies at the expense of increasing the interconnect complexity, overhead, and power consumption [5].
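The scaling argument above can be made concrete with a minimal sketch. It assumes that crossbar links are counted as one-way, point-to-point connections and that the TDMA bus uses an idealized round-robin schedule with no arbitration overhead; both are illustrative assumptions, and the helper names `crossbar_links` and `round_robin_share` are hypothetical.

```python
def crossbar_links(cores: int) -> int:
    """One-way dedicated links when every core is wired to every other core.

    Assumption: links are counted per direction, giving cores * (cores - 1).
    """
    return cores * (cores - 1)


def round_robin_share(cores: int) -> float:
    """Fraction of bus time slots each core gets on a shared bus.

    Assumption: idealized round-robin arbitration with zero overhead.
    """
    return 1.0 / cores


if __name__ == "__main__":
    for m in (4, 8, 16, 32):
        print(f"cores={m:2d}  crossbar links={crossbar_links(m):4d}  "
              f"per-core bus share={round_robin_share(m):.4f}")
```

Under these assumptions, doubling the number of cores roughly quadruples the crossbar link count while halving each core's share of the shared bus.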

The Code Division Multiple Access (CDMA) bus architecture has been proposed as an alternative to the TDMA-based bus topology to overcome the bus contention problem [6]. Direct sequence CDMA (DS-CDMA) is a well-known approach for medium sharing in wireless communication systems, where the channel is shared by assigning orthogonal spreading codes, called signatures, to all transmit-receive pairs sharing the communication channel. Code orthogonality enables channel sharing and is measured in terms of the cross-correlation between spreading codes, which equals zero for orthogonal spreading codes. In a CDMA bus, data from each transmit element is spread by XORing it with a unique spreading code or signature. Data spread by different elements are summed together and sent over the bus. All receiver elements simultaneously access the bus and receive the spread data sum. Despreading is achieved by applying correlation operations to the received sum, where each receiver extracts its data by correlating the sum with the unique signature assigned to its transmit-receive pair. Other advantages of using CDMA for on-chip interconnect include reduced power consumption, fixed communication latency, and reduced system complexity [7].
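The spreading, summing, and despreading steps described above can be sketched in software. This is a minimal illustration, not the authors' hardware design: it assumes Walsh codes built by the Sylvester construction, one data bit per sender per code period, and a correlator that maps bits to bipolar values (0 to +1, 1 to -1) and knows how many senders are active. The function names are hypothetical.

```python
def walsh_codes(n):
    """Return n Walsh codes of length n over {0, 1} (n must be a power of two)."""
    if n < 1 or n & (n - 1):
        raise ValueError("code length must be a power of two")
    codes = [[0]]
    while len(codes) < n:
        # Sylvester step: [H H; H ~H]
        codes = ([row + row for row in codes]
                 + [row + [1 - c for c in row] for row in codes])
    return codes


def cross_correlation(a, b):
    """Cross-correlation of two binary codes after mapping 0 -> +1, 1 -> -1."""
    return sum((1 - 2 * x) * (1 - 2 * y) for x, y in zip(a, b))


def encode(bits, codes):
    """Spread each bit by XOR with its code, then add the chips per position."""
    length = len(codes[0])
    return [sum(bit ^ code[k] for bit, code in zip(bits, codes))
            for k in range(length)]


def decode(bus, code, senders):
    """Recover one sender's bit by correlating the bus sum with its code.

    Assumption: `senders` is the number of elements that drove the bus.
    """
    # senders - 2*s converts the count of 1-chips into a bipolar sum.
    corr = sum((senders - 2 * s) * (1 - 2 * c) for s, c in zip(bus, code))
    return 1 if corr < 0 else 0


if __name__ == "__main__":
    codes = walsh_codes(8)
    data = [1, 0, 1, 1, 0, 0, 1, 0]          # one bit per sender
    bus = encode(data, codes)                 # value on the bus per chip slot
    recovered = [decode(bus, c, len(codes)) for c in codes]
    print("bus sums :", bus)
    print("recovered:", recovered)
```

Because distinct Walsh codes have zero cross-correlation, each correlator output depends only on the bit spread with the matching code, which is why all senders can use the bus in the same code period.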

Table I shows a brief comparison between the basic crossbar, time-shared, and CDMA buses in terms of the wiring complexity, bus throughput, and arbitration overheads [8] [9] for M × M interconnected elements. The CDMA bus has less wiring complexity than the crossbar and less arbitration overhead than the TDMA bus, and thus provides a good compromise between the two. Furthermore, the capacity of the CDMA bus can be increased by increasing the number of usable spreading codes, as this work suggests, thus increasing the bus throughput compared to the time-shared bus.

The set of spreading codes used in a CDMA system must be orthogonal to each other and any extra codes added to this set induce Multiple Access Interference (MAI) which