Implementation of the swap operation (M = 4). Left: swap box. Right: complete swap unit. N is the bus width.

Source publication

Bus-switch coding for reducing power dissipation in off-hip buses

Article

Full-text available

Jan 2005

We present a novel coding scheme for reducing bus power dissipation. The presented approach is well suited to driving off-chip buses, where the line capacitance is a dominant factor. A distinctive feature of the technique is the dynamic reordering of bus line positions, in order to minimize the toggling activity on physical bus wires. The effective...

Context 1

... swapping patterns can be sequentially generated by a finite state machine (FSM) very similar to a binary counter. The direct binary representation of a swapping pattern is a vector of M binary numbers each ranging from 0 to M .. 1, therefore, requiring M · log2(M ) bits. The swap operation is performed by a set of multiplexers as in Fig. 2. Referring to M = 4, the 8-bit pattern is par- titioned into four 2-bit numbers, namely A, B, C, and D in the left part of Fig. 2. In practice, the extra lines to transmit the pattern are drastically reduced by means of a combinational pattern encoder, exploiting the fact that the allowed individual patterns are at most M!. The pro- posed coding function (Definition 2) is implemented by a twin swap unit, illustrated in Fig. 3; the conversion from a swapping pattern to its inverse is directly implemented by a dedicated two-level combinational logic unit PConv. In order to perform the M! attempts to find the best pat- tern, a partially or fully parallel implementation of BS de- coder can be pursued, employing L units, each perform- ing M !/L attempts. We refer to such solution as an L-way parallel architecture. The architecture of the single unit is shown in Fig. 4. PatGen is the FSM that generates the set of allowed patterns to be tried, H produces the Hamming distance between two words by performing a population count after XORing. The Cmp unit compares the actual Hamming distance with the temporary minimum. When all the patterns have been tried and the minimum distance found, the threshold unit stores the pattern, the encoded word and the distance value on output registers. Fig. 5 shows the top view of the encoder ...

View in full-text

Context 2

View in full-text

A Self-checking CMOS Full adder in Double Pass Transistor Logic

Conference Paper

Full-text available

Jan 2012

This paper presents a self-checking implementation for adder schemes using the dual duplication code. To prove the efficiency of the proposed method, the circuit is simulated in double pass transistor CMOS at 32nm technology and some transient faults are voluntarily injected in the layout of the circuit. This fully differential implementation requi...

Si 1-x Ge x -Channel PFETs: Scalability, Layout Considerations and Compatibility with Other Stress Techniques

Article

Full-text available

Apr 2011

Si1-xGex-channel pFETs can combine enhanced intrinsic performance with a threshold voltage shift, therefore this technology possibly facilitates the use of high-k/metal gate stacks in high-performance applications. This review presents imec's work on a new device concept using Si1-xGex-channels, the implant-free quantum well transistor, that can ad...

Reconfiguration Techniques of Partial Shaded PV Systems for the Maximization of Electrical Energy Production

Conference Paper

Full-text available

Jun 2007

In this paper, the research of the optimal layout of photovoltaic (PV) modules in a PV array giving the maximum output power under different shaded working conditions is carried out. The particular condition of non uniform solar exposition of the modules is analyzed. The study of the different configurations has been carried out starting from a cir...

A Programmable CMOS Delay Line for Wide Delay Range Generation and Duty-Cycle Adjustability

Article

Full-text available

Jun 2017

A programmable CMOS delay line circuit with microsecond delay range and adjustable duty cycle is proposed. Through circuit simulation, approximately 2μs delay range can be achieved using 10-bit counter operating at a clock frequency of 500MHz. Utilising synchronous counters instead of synchronous latches has significantly reduced the large occupied...

Layout-Aware Yield Prediction of Photonic Circuits

Conference Paper

Full-text available

Sep 2018

We demonstrate yield prediction of silicon wavelength filters using layout-aware Monte-Carlo circuit simulations. Maps of wafer and die-level variability of width and thickness are projected onto circuit layout and translated into circuit model parameters. We apply this onto Mach-Zehnder lattice filters with different filter orders.

Fractional-social ski driver optimization-driven routing protocol for routing electric vehicle under server hosted VANET

Article

Full-text available

May 2022
MULTIMED TOOLS APPL

The modernization in Electric Vehicles (EVs) has acquired immense interest amongst several researchers as the EV is termed a supreme mode of transportation. In addition, EV is imperative to preserve classical fuel, but EV poses short driving that are restricted by insufficient batteries that obstruct reliability, and there exist lesser charging applications, which are irregularly dispersed. A new model is devised for optimal routing to charge EV using server-hosted VANET. The goal is to discover optimal routes to charge EV with Vehicular Adhoc Network (VANET). The server-hosted VANET contains roadside and vehicle units such that roadside and vehicle units are operated with a cloud server. Here, optimal routes for attaining charging stations are discovered using the proposed Fractional-Social Ski Driver (Fractional-SSD), which is obtained by integrating Fractional calculus (FC) and Social Ski Driver optimization (SSD). In addition, the fitness function is newly developed using battery power, traffic density and distance. Thus, routing decisions are made to route the EV for charging the battery by adapting multi-objective factors. Hence, the proposed Fractional-SSD is employed to choose the optimal route for charging EV. As a result, the proposed Fractional-SSD acquired improved performance with the maximal battery power of 13,884.19 J, smallest traffic density of 6.5, delay of 10.973 min, and fitness of 24.800, respectively.

Low-Energy and Secure Aggregation of Uncorrelated Data in Clustered Sensor Network

Article

Sep 2017
J Low Power Electron

Giuseppe Visalli

In this work, we propose a novel data-Aggregation system for gathering heterogeneous and nonsparse signals from a cluster-based sensor network. The aggregation algorithm uses an ultra-low energy binary operator that performs the bit line permutation of the source data. The data detection introduces a binary noise whose reduction is by probabilistic process profiling and a further low-pass filtering. The proposed aggregation system compresses sensors data and it enables the secure (from passive attacks) transmission toward the base station. Single user binary data permutation dissipates 2.64 fJoule/cycle dynamic energy in 32 nm CMOS technology; instead, noise profiling dissipates an average 117.16 Pico Joule/cycle total energy in the same technology. Static power in both scenarios represents the most important source when data rate is 1 MHz.

A Low Power L1 Cache Design Based on Data and Tag Re-Mapping

Article

Full-text available

Dec 2013
J Low Power Electron

Giuseppe Visalli

In this work, we propose an architecture-level power optimization technique for L1 caches. The idea is to unify the DATA and TAG fields in a unique embedded static RAM and an intelligent cache controller to minimize the latency penalty. Moreover, an intermediate high-speed pre-fetch buffer optimizes the whole system. We apply this approach to direct-mapped instruction cache and set-associative data cache. Experimental results indicate the power saving by 20% with latency overhead by 12%.

A Bus Switch Coding System with Minimal Hardware Demand

Article

Full-text available

Aug 2012
J Low Power Electron

Giuseppe Visalli

This paper introduces the best architecture for a novel low-power encoding system suitable for high bandwidth off-chip data buses. The technique, known as Bus Switch, reorders dynamically the lines of a bus in agreement to a permutation scheme such to minimize the total bus switching activity, responsible for the consumption of dynamic energy. The idea was to reduce the area, power and latency of the permutation circuits using fixed-scheme scrambling units. Moreover, I replaced the toggle count calculation and evaluation circuits with a hierarchical arrangement of analog comparators, representing the bus toggle binary string as a voltage value. I designed the Bus Switch encoder and decoder in semiconductor technologies at 90, 65 and 45 nanometers. The results confirmed that the proposed Bus Switch minimized the required transistors number and the related area and energy consumptions, extending the Bus Switch's field of application.

Fuzzy Control of Coding Schemes for Reducing Energy Dissipation in Off-Chip Buses

Article

Full-text available

Aug 2008
J Low Power Electron

Giuseppe Visalli

In this paper, we proposed an high-speed and low-power off-chip data bus interface based on the best coding schemes in this hard operative condition. We analyzed the clustered bus invert method and the bus switch coding, a newly proposed approach based on bus lines logically re-ordered. We proposed an high speed and low-power bus interface based on the combined employment of these two approaches controlled by a 9-rules Takagi-Sugeno analog fuzzy controller. The controller analyzes the binary traffic statistical property changing on the fly the used coding scheme. The fuzzy controller has been designed taking care of total energy dissipation such to do not compromise the benefit of coding approaches. The controller is able also to re-configure the bus switch sub-section in an operative condition where original approach introduces strong power losses. We demonstrated the effectiveness of the approach designing at transistor level the analog fuzzy controller and the digital part of the bus interface. Simulation conducted with H-SPICE and NANOSIM confirmed the bus interface is the optimal trade-off for reducing dynamic energy in off-chip buses.

An Ultra-Low Power Data Aggregation System for Wireless Micro Sensor Networks

Article

Aug 2007
J Low Power Electron

The paper introduced a novel methodology, for reducing energetic consumption, during data compression in homogenous sensor nodes organized in a cluster based network. Our approach employed a bit-wise operator previously used in the context of the reduction of dynamic energy in external buses. The document defined the compression and decompression laws based on this operator, in a conceptual way much similar to the code division multiple access (CDMA) systems, used in the telecommunication scenario. Each sensor has internally associated a digital signature, used in the compression stage. The host computer tries to recover the original waveform executing the cited operator and applying the inverse signature. The original data has been corrupted by an interference process, which depends on the presence of the other users in the same cluster. The host computer is able to select the best signatures, mostly reducing the energy of the interfering process. Simulations conducted with Matlab and Simple Power indicated our approach gains an 85% in energy consumption compared to the simpler algorithm up to now known (Least Mean Squares). Moreover, simulations verified the host has the capability to recover the transmitted waveforms in their fundamental harmonic members.

Performance - Timing overhead Trade-off Analysis for a low-power data bus encoding based on input lines reordering

Conference Paper

Full-text available

Dec 2005

This paper analyzes the performance and timing overhead trade-off for a recently proposed data bus encoding scheme for low-power based on data lines reordering. The bus switch (BS) mechanism introduces greater activity savings than previous approaches; the hardware complexity of the encoder suggests to apply BS in off-chip buses, where the parasitic capacitance makes dynamic power dissipation in the bus lines the dominant contribution to power consumption. In the basic BS implementation, the encoding circuits included extra bus lines which degrade the energy saving. This paper illustrates and analyzes a circuit implementation with only one extra line, at the cost of a small time overhead. This solution strongly enhances the advantage in off-chip communications, where the available number of pads represents a key resource in low-cost packages. Our results indicate that the effectiveness of the approach strongly depend on an a-priori traffic analysis.

Encoding circuits for low power optical on-chip communications

Conference Paper

Full-text available

Jun 2005

The increased demands of high data-rate communications could be satisfied by optical semiconductor elements. Actually, these devices represent an important role in the total energy budget available for the chip. This work presents a low-power encoding technique which optimizes the statistical distribution so as to reduce the energy dissipated in optical communications. We evaluated the encoding circuits referring to 180 nm, 130 nm and 90 nm CMOS technologies. Our results show an up to 12% electrical current reduction in the on-chip light emitter.

Bus-Switch Coding, for Dynamic Power Management in off-chip communication channels

Article

Full-text available

Jan 2005

The dynamic power management (DPM) represents an important challenge for extending the battery lifetime in a portable system. The power management, based on static and off-line approaches, does not consider the basic property of a modern battery, which recovers a fraction of its charge during the idle time. The DPM approach profiles a complex system in different power figures depending on a reduced set of macro-states. The DPM problem gives a sequence of macro-states which increases the battery lifetime. The dynamic power management is also required in complex systems where the power dissipation in communication channels represent a dominant factor. The modern communication arrangements operate at rate of some G Bit/sec, which implicated high transition activities, responsible of the dynamic power consumption. Moreover, the signal level involved in the output pads has a quadratic contribution in the dynamic power. The problem of low-power bus encoding has been extensively tackled in the past. The basic approach minimizes the transition density, directly related to the lines switching activities, responsible of the load/un- load of the parasitic capacities. The current literature on low-power bus encoding provides solutions, which do not guarantee a good activity saving increasing the bus lines; this issue represents a huge limitation in the modern communication channels, which require high transmission bandwidth. The paper introduced a novel low-power bus encoding approach, based on tentatively encoding, clustering and re-ordering the lines of a wide system data bus used in multi-processor scenario. The "Bus-Switch" mechanism, as novel bus encoding approach, drastically reduces the transition activity, preserving the required bandwidth for high data-rate communications. Since the optimal bus switch encoder complexity grows significantly decreasing the level of clustering, a sub-optimal approach requires power management policy in order to effectively control the battery life. The paperwork presents an overview of the bus-switch mechanism, including the required architecture for encoding/decoding the input lines. The RTL-model has been translated in a modern technology library at 90nm low-leakage using the Synopsys Tool for placement and CTS

A statistical analysis, for reducing the energy dissipation in a bus-switch encoder

Conference Paper

Full-text available

Jan 2005

The Bus Switch mechanism is a recently proposed bus encoding technique for low-power off-chip data buses. The approach is based on clustering, reordering and encoding the bus input lines according to a reordering pattern and a fixed coding function. This work presents a statistical approach for reducing the hardware overhead of the bus switch technique, by operating with a sub-set of the possible reordering patterns. We demonstrate the effectiveness and robustness of the proposed approach by ANSI C simulations, measuring the average switching activity savings. Our results show a modest switching activity degradation while saving 90% computation time, thus obtaining a sub-optimal encoder configuration satisfactory for a large variety of benchmarks.

Implementation of the swap operation (M = 4). Left: swap box. Right: complete swap unit. N is the bus width.

Contexts in source publication

Similar publications

Citations