Gate level representation of half-adder and full-adder

Source publication

Transition-activity aware design of reduction-stages for parallel multipliers

Conference Paper

Mar 2007

We propose an interconnect reorganization algorithm for reduction stages in parallel multipliers. It aims at minimizing power consumption for given static probabilities at the primary inputs. In typical signal processing applications the transition probability varies between the most and least significant bits. The same is the case for individu...

Context 1

... and full-adders are basic elements which are frequently used in parallel multipliers, especially in the partial products reduction stage (Fig. 2). Functional representation of CARRY and SUM is shown in Table 1 where ⊕ represents a boolean XOR, + represents a boolean OR and · represents a boolean AND function. The probability of one at the output of these blocks is a function of the probability of one at the inputs [13] [14]. Static probabilities given in Table 1 can be ...
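
As an illustration of the excerpt above, the following minimal sketch evaluates SUM and CARRY for a half-adder and a full-adder and computes the probability of a one at each output from the probability of a one at each input. Table 1 of the source is not reproduced on this page, so the standard textbook equations are used here (SUM = A ⊕ B and CARRY = A · B for the half-adder; SUM = A ⊕ B ⊕ C and CARRY = A · B + C · (A ⊕ B) for the full-adder). The inputs are assumed to be statistically independent, and the function names are hypothetical.

```python
from itertools import product


def half_adder(a, b):
    """Return (SUM, CARRY) for one-bit inputs a and b."""
    return a ^ b, a & b


def full_adder(a, b, c):
    """Return (SUM, CARRY) for one-bit inputs a, b and carry-in c."""
    s = a ^ b ^ c
    carry = (a & b) | (c & (a ^ b))
    return s, carry


def output_one_probabilities(adder, input_probs):
    """Return (P(SUM=1), P(CARRY=1)) by enumerating all input patterns.

    input_probs[i] is the probability that input i is one.
    Assumption: the inputs are statistically independent.
    """
    p_sum = 0.0
    p_carry = 0.0
    for bits in product((0, 1), repeat=len(input_probs)):
        # Probability of this particular input pattern.
        weight = 1.0
        for bit, p in zip(bits, input_probs):
            weight *= p if bit else 1.0 - p
        s, carry = adder(*bits)
        p_sum += weight * s
        p_carry += weight * carry
    return p_sum, p_carry


if __name__ == "__main__":
    print(output_one_probabilities(half_adder, [0.5, 0.5]))
    print(output_one_probabilities(full_adder, [0.2, 0.3, 0.4]))
```

Enumeration is used instead of closed-form expressions so that the same helper works for both cells; with only two or three inputs the cost is negligible.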

Context 2

... path of the Carry-in input of a full-adder (C in Fig. 2) is shorter than the other two inputs. A transition on this input will therefore result in less activity. The input with the highest transition activity among three inputs of the full-adder should therefore be connected to the Carry-in input. For a full-adder with inputs (A, B, C), max(α_A, α_B) < α_C ...
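
The connection rule in this excerpt can be sketched as a small helper that takes three signals with known transition activities and routes the most active one to the Carry-in port. This is only an illustration of the stated rule, not the authors' algorithm: the function name and the dictionary layout are hypothetical, and the order of the two remaining signals on ports A and B is an arbitrary choice because the excerpt does not specify it.

```python
def assign_full_adder_inputs(activities):
    """Map three signals onto full-adder ports A, B and C (Carry-in).

    activities: dict of signal name -> transition activity (alpha).
    The signal with the highest activity is connected to C; the other
    two go to A and B in ascending order of activity (an assumption).
    """
    if len(activities) != 3:
        raise ValueError("a full-adder takes exactly three input signals")
    # Sort signal names from lowest to highest transition activity.
    ordered = sorted(activities, key=activities.get)
    return {"A": ordered[0], "B": ordered[1], "C": ordered[2]}


if __name__ == "__main__":
    ports = assign_full_adder_inputs({"x": 0.10, "y": 0.40, "z": 0.25})
    print(ports)
```

If two signals tie for the highest activity, the strict inequality in the excerpt cannot hold and either of them may take the Carry-in port.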

Fig. 1: On request, our algorithm determines the number of consumers...

Fig. 2: Representation of data production and consumption domains...

Kafka Consumer Group Autoscaler

Preprint

Jun 2022

Message brokers enable asynchronous communication between data producers and consumers in distributed environments by assigning messages to ordered queues. Message broker systems often provide mechanisms to parallelize tasks between consumers to increase the rate at which data is consumed. The consumption rate must exceed the production rate o...

An efficient multiplier by pass transistor logic partial product and a modified hybrid full adder for image processing applications

Article

Oct 2021
MICROELECTRON J

Different digital multipliers have resulted from various algorithms and hardware designs. This article presents a high-performance multiplier based on a novel AND gate and a modified hybrid full adder (FA) cell. The AND gate is designed using the pass transistor logic (PTL) technique and a speed-up transistor, while the FA is based on the transmission gate (TG). The low power, high speed, and low power-delay product (PDP) of both circuits, and their suitability for use in sophisticated structures like multipliers, are confirmed by mathematical relations. The proposed 4-bit array multiplier circuit, along with the pad, has a total area of 2.87 mm² and is investigated under different conditions, including VDD, frequency, load capacitance, and process-voltage-temperature (PVT) variations, using the Monte Carlo method (MCM) with the HSPICE tool in 90 nm technology. The efficiency of the multiplier in image processing applications is demonstrated by average improvements of 12.61% and 32.045% in peak signal-to-noise ratio (PSNR) and PDP, respectively, compared to state-of-the-art designs. The overall results confirm the multiplier's suitability for digital signal processors (DSPs).

Low Power Multiplier by Effective Capacitance Reduction

Article

Jan 2017

In this study we present an energy-efficient multiplier design based on effective capacitance minimization. Only the partial product reduction stage of the multiplier is considered in this research. The effective capacitance at a node is defined as the product of the capacitance and the switching activity at that node. Hence, to minimize the effective capacitance, we ensure that the switching activity of nodes with higher capacitance is kept to a minimum. This is achieved by wiring the signals with higher switching activity to nodes with lower capacitance and vice versa, for the 4:2 compressor and adder cells. This reduces the overall switching capacitance, thereby reducing the total power consumption of the multiplier. Power analysis was done by synthesizing our design on a Spartan-3E FPGA. The dynamic power for our 16×16 multiplier was measured as 360.74 mW and the total power as 443.31 mW. This is 17.4% less than the most recent design. We also noticed that our design has the lowest power-delay product among the multipliers presented in the literature.
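
The wiring idea in this abstract can be illustrated for a single cell. Effective capacitance is the product of node capacitance and switching activity, so pairing the most active signal with the lowest-capacitance input minimizes the summed product over that cell. The sketch below is an illustration under simplifying assumptions, not the authors' implementation: it treats one cell in isolation, ignores the effect of the wiring on downstream activity, and uses hypothetical signal names, node names and capacitance values.

```python
def wire_by_activity(signal_activity, node_capacitance):
    """Pair the most active signal with the lowest-capacitance node.

    signal_activity: dict of signal name -> switching activity.
    node_capacitance: dict of input node name -> capacitance.
    Returns a dict of signal name -> node name.
    """
    if len(signal_activity) != len(node_capacitance):
        raise ValueError("need exactly one input node per signal")
    # Highest activity first, lowest capacitance first.
    signals = sorted(signal_activity, key=signal_activity.get, reverse=True)
    nodes = sorted(node_capacitance, key=node_capacitance.get)
    return dict(zip(signals, nodes))


def total_effective_capacitance(wiring, signal_activity, node_capacitance):
    """Sum of capacitance times switching activity over all wired inputs."""
    return sum(
        signal_activity[sig] * node_capacitance[node]
        for sig, node in wiring.items()
    )


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    activity = {"p0": 0.5, "p1": 0.1, "p2": 0.3}
    capacitance = {"in_a": 2.0, "in_b": 1.0, "in_c": 3.0}
    wiring = wire_by_activity(activity, capacitance)
    print(wiring)
    print(total_effective_capacitance(wiring, activity, capacitance))
```

Sorting one list in descending order and the other in ascending order gives the minimum sum of pairwise products, which is why no search over permutations is needed for a single cell.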

Exploiting asymmetry in Booth-encoded multipliers for reduced energy multiplication

Conference Paper

Nov 2015

Booth Encoding is a common technique utilized in the design of high-speed multipliers. These multipliers typically encode just one operand of the multiplier, and this asymmetry results in different power characteristics as each input transitions to the next value in a pipelined design. Relative to the non-encoded input, changes on the Booth-encoded input induce more signal transitions requiring ∼73% more multiplier array energy. This paper proposes low-overhead approaches to take advantage of this asymmetric behavior to reduce the energy of multiplication operations in pipelined SIMD architectures like GPUs. Compiler-based approaches that apply constant or uniform inputs to the Booth-encoded input of the multiplier can save 4.8% of multiplier energy on average. An additional 1.5% savings can be achieved with dynamic detection and steering of uniform inputs.

Analysis of switching activity in DSP signals in the presence of noise

Conference Paper

Jun 2009

Input switching activity is one of the deciding factors for power consumption in digital signal processing components. For accurate power estimation, it is essential to have knowledge about the switching activity in the input signal, including how this activity changes in different environments, e.g., in the presence of noise. The dual bit type (DBT) method aims at characterizing the bit-level switching activity in a signal, using signal statistics. However, the DBT method requires that the correlation coefficient and switching activity for the most significant bit of the signal are available. In this paper we give an expression for direct calculation of the correlation coefficient for the most significant bit in a signal, using the word-level correlation coefficient. Using simulation results we examine the accuracy of the given method to calculate the switching activity and correlation coefficient for the most significant bit. Furthermore, we derive expressions for accurately calculating the variance and word-level correlation coefficient for a correlated signal, when an additional noise of a given variance is added to the signal. This can be used to estimate the bit-level switching activity in a signal in the presence of noise. Finally, based on this we study the impact the additional noise has on the switching activity of the resulting signal.

Increasing the Spurious-Free Dynamic Range of an Integrated Spectrum Analyzer

Thesis

Nov 2008

M.S. Oude Alink

Spectrum Analyzers (SAs) are measurement instruments able to decompose a time signal into its frequency components. Due to non-idealities, SAs add noise and distort the signal to be measured. The ratio between the largest signal and the noise floor level in a measured spectrum, without any distortion components rising above the noise floor, is called the Spurious-Free Dynamic Range (SFDR). In a CMOS-integrated SA the SFDR is limited to around 60 dB by technology, while it needs to be 70 dB (at a frequency resolution of 1 MHz) to be competitive with commercial SAs. A method called crosscorrelation is introduced to lower the noise floor at the cost of measurement time. It relies on two equivalent measurement paths in which the noise produced in one path is uncorrelated with the noise produced in the other path, such that the noise in the final spectrum tends to cancel out. Although the noise level is only lowered by 1.5 dB if measurement time is doubled, it allows the SA to be designed for high linearity. This design involves the use of digital hardware to compute the crosscorrelation. Consequently Analog-to-Digital Converters (ADCs) are required, but they also limit the SFDR due to the non-linear effect of quantization. New approximations to the relation between the number of quantization levels and the SFDR are found. These approximations show that every additional bit improves the SFDR by 8 dB. A simulator of a concept architecture from Recore Systems is used to implement the digital correlation. It achieves an SFDR of 87 dB. An RF-frontend with a frequency range of 0 GHz to 6 GHz is designed for maximum linearity by moving amplification to IF. It provides impedance matching, variable attenuation and mixing. Its performance figures are a Noise Figure (NF) of 14 dB and a Third Order Input-referred Intermodulation Intercept Point (IP3) of +23 dBm, which gives a theoretical SFDR of 82 dB.
In order to obtain estimates on the feasibility of an integrated SA, other parts, such as the IF-circuitry and local oscillators, are briefly reviewed. The estimated power consumption of the entire correlation SA is 0.5 W at a sample rate of 200 MS/s, and the estimated chip area is 6.5 mm². The largest power consumers are the VCO (0.2 W), followed by the IF-circuitry (0.1 W) and the ADCs and digital correlator (each 0.08 W). Chip area is dominated by SRAM-memory (36%), ADCs (25%) and the VCO (20%).

Power optimization of weighted bit-product summation tree for elementary function generator

Conference Paper

May 2008

In this paper we propose a method for lowering the power consumption in our previously proposed method for approximating elementary functions. By rearranging the interconnect ordering in the summation tree we show that it is possible to lower the power consumption in the range of 5.4% to 25.6% compared to a random ordering. The reduction tree is progressively designed and the interconnect ordering is decided based on the transition activities of the partial products. The reduction in power consumption comes with no overhead in performance or area compared to the random ordering.

Power optimized partial product reduction interconnect ordering in parallel multipliers

Conference Paper

Dec 2007

When designing the reduction tree of a parallel multiplier, we can exploit a large intrinsic freedom for the interconnection order of partial products. The transition activities vary significantly for different internal partial products. In this work we propose a method for generation of power-efficient parallel multipliers in such a way that its partial products are connected to minimize activity. The reduction tree is designed progressively. A simulated annealing optimizer uses power cost numbers from a specially implemented probabilistic gate-level power estimator and selects a power-efficient solution for each stage of the reduction tree. VHDL simulation using ModelSim shows a significant reduction in the overall number of transitions. This reduction ranges from 15% up to 32% compared to randomly generated reduction trees and is achieved without any noticeable area or performance overhead.

Switching Activity Reduction of MAC-Based FIR Filters with Correlated Input Data

Conference Paper

Aug 2007

In this work we consider coefficient reordering for low power realization of FIR filters on fixed-point multiply-accumulate (MAC) based architectures, such as DSP processors. Compared to previous work we consider the input data correlation in the ordering optimization. For this we model the input data using the dual bit type approach. Results show that compared with just optimizing the number of switches between coefficients, the proposed method works better when the input data is correlated, which can be assumed for most applications.

Design of Low-Power Reduction-Trees in Parallel Multipliers

Article

Saeeid Tahmasbi Oskuii

Multipliers in FIR Using VHDL Sunil Nitin

Research

Jan 2022

A multiplier is one of the key hardware blocks in most digital and high-performance systems, such as FIR filters, digital signal processors and microprocessors. This project presents an efficient implementation of high-speed multipliers using the shift-and-add technique and the Radix_2 and Radix_4 modified Booth multiplier algorithms. We compare the operation of the three multipliers by implementing each of them separately in an FIR filter. Parallel multipliers such as the radix 2 and radix 4 modified Booth multipliers perform the computation using fewer adders and fewer iterative steps, so they occupy less area than the serial multiplier. This is an important criterion because the fabrication of chips and high-performance systems requires components that are as small as possible. When we compare the power consumption of all the multipliers, we find that serial multipliers consume more power. So where power is an important criterion, parallel multipliers such as Booth multipliers should be preferred to serial multipliers. The low power consumption of the Booth multiplier makes it a preferred choice in designing different circuits. In this project we first designed three different types of multipliers using the shift-and-add technique and the radix 2 and radix 4 modified Booth multiplier algorithms. We used different types of adders, such as a sixteen-bit full adder, in designing the multipliers. We then designed a 4-tap delay FIR filter and, in place of the multiplications and additions, used the components of the different multipliers and adders. We then compared the operation of the different multipliers by comparing the power consumption of each. The result of our project helps in choosing the better option between serial and parallel multipliers when fabricating different systems.
Multipliers form one of the most important parts of many systems, so analyzing the operation of different multipliers helps in designing a better system with lower power consumption and smaller area. The result of our project helps in making a proper choice among different multipliers when building different arithmetic units, and among different adders in different digital applications, according to requirements. Index Terms: Finite Impulse Response, radix_2 and radix_4 Booth multiplier, shift and add multiplier.
