ArticlePDF Available

Design Of Low Power Approximate Mirror Adder

May 2018

May 2018

Authors:

PRASANNA VENKATESAN G K D

Karpagam Academy of Higher Education

N-bit ripple carry adder.

…

Figures - uploaded by PRASANNA VENKATESAN G K D

Content may be subject to copyright.

Content uploaded by PRASANNA VENKATESAN G K D

Content may be subject to copyright.

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 2, Issue 2, Apr-May, 2014

ISSN: 2320 – 8791(Impact Factor: 1.479)

www.ijreat.org

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org)

Design Of Low Power Approximate Mirror Adder

Sasikala.M1, Dr.G.K.D.Prasanna Venkatesan2

ME VLSI student

, Vice Principal, Professor and Head/ECE

PGP college of Engineering and Technology Nammakkal, Tamilnadu,India.

ABSTRACT: Addition is a fundamental

operation for any digital system, digital

signal processing and control system. A

quick and accurate operation of a digital

system is greatly influenced by the

performance of the resident adders. Low

power is an imperative requirement for

portable multimedia devices employing

various signal processing algorithms and

architectures. In most of the multimedia

applications human beings can gather

useful information from slightly erroneous

outputs. Therefore, we cannot need to

produce exactly correct numerical outputs.

Preceding research in this context exploits

error resiliency primarily through voltage

over scaling, make use of algorithmic and

architectural techniques to mitigate the

resulting errors. In this paper, we propose

logic difficulty reduction at the transistor

level as an alternative approach to take

advantage of the relaxation of numerical

accuracy. We demonstrate this idea by

proposing various imprecise or

approximate full adder cells with reduced

complexity at the transistor level, and make

use of them to design approximate multi-

bit adders. In addition to the inherent

decrease in switched capacitance, our

techniques result in significantly smaller

difficult paths, enabling voltage scaling.

We design architectures for video and

image compression algorithms using the

proposed approximate arithmetic units and

evaluate them to demonstrate the efficacy

of our approach. We also derive clear

mathematical models for error and power

consumption of these approximate adders.

Index Terms: Approximate Mirror Adder,

low power

1. INTRODUCTION

The adder is one of the most critical

components of a processor, as it is

profitable in the Arithmetic Logic Unit

(ALU), in the floating-point unit and for

address generation in case of cache or

memory access. More demand for mobile

electronic devices such as cellular phones

and laptop computers requires the use of

power efficient VLSI circuits. With

exponential growth of portable electronic

devices like laptops, multimedia and

cellular device, research efforts in the field

of low power VLSI (very large-scale

integration) systems have increased many

folds. Wile rise in chip density, power

consumption of VLSI systems is also

increasing and this further, adds to

reliability and packaging impact.

Packaging and cooling cost of VLSI

systems also goes up with high power

dissipation. Now a day’s low power

consumption along with minimum delay

and area requirements is one of important

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 2, Issue 2, Apr-May, 2014

ISSN: 2320 – 8791(Impact Factor: 1.479)

www.ijreat.org

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org)

design consideration for IC designers.

Recent trends in micro-electronics

technology have gradually changed the

Strategies used in VLSI circuits.

Establishing an efficient methodology is

one of the key to design VLSI chip

successfully. The design of

microelectronics system is strongly

influenced by the fact that transistor and

featured size have continuously influenced,

while density and frequency have increased

.VLSI will undoubtly play a key role in a

technical revolution which yields a great

benefit through application in

communications, leisure and education.

The major advantages of VLSI technology

might be as follows: development of new

functions and application, low cost, light

weight, and low power dissipation,

improvement in reliability and safe,

possibility of being used to highly

sophisticated control system and more

advanced service function through

systemization.

Addition is a fundamental

operation for

any digital system, digital signal processing

and control system. A quick and accurate

operation of a digital system is greatly

influenced by the performance of the

resident adders. Adders are also act as very

important component in digital systems

because of their extensive use in other basic

digital operations such as subtraction,

multiplication and division. Hence, to make

Performance of the digital adder would

greatly advance the execution of binary

operations inside a circuit compromised of

such blocks. The action of a digital circuit

block is gauged by analyzing its power

dissipation, layout region and its operating

speed.

Fig.1 N-bit ripple carry adder.

Fig .2 full adder (FA)

Ripple-Carry Adder (RCA):

The n-bit

adder built from n one-bit full adders is

know as a ripple carry adder, because of

the way the carry is computed. Each full

adder inputs a Cin, which is the Cout of the

preceding adder. This kind of adder is

called a ripple carry adder, since each carry

bit “ripples” to the next full adder. Block

diagram of Ripple Carry Adder is as in Fig.

1.The layout of ripple carry adder is not so

difficult, which allows for fast design time;

however, the ripple carry adder is relatively

not fast, since each full adder must wait for

the carry bit to be calculated from the

previous full adder. The gate delay cannot

difficult be calculated by inspection of the

full adder circuit. Each full adder requires

A,B,Cin levels of logic. In a 32-bit (ripple

carry) adder, there are 32 full adders, so the

difficult path (worst case) delay is 31 *

2(for carry propagation) +3(for sum) = 65

gate delays.

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 2, Issue 2, Apr-May, 2014

ISSN: 2320 – 8791(Impact Factor: 1.479)

www.ijreat.org

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org)

11.APPROXIMATE ADDERS

In this section, we discuss different

methodologies for designing approximate

adders. We use RCAs and CSAs

throughout our subsequent discussions in

all sections of this paper. Since the Mirror

Adder is one of the widely used

economical implementations of an FA, we

use it as our basis for proposing different

approximations of an FA cell.

A. Approximation Strategies for the MA

In this section, we explain step-by-step

procedures for coming up with various

approximate Mirror adder cells with fewer

transistors. Removal of some series

connected transistors will facilitate faster

charging/discharging of node capacitances.

Moreover, complexity decrease by removal

of transistors also aids in reducing the

αC

term (switched capacitance) in the dynamic

power expression

dynamic =

αCV

2DD

where

is the switching activity or average

number of switching

Transitions per unit time and

is the load

capacitance being charged/discharged. This

directly results in lower power dissipation.

Area reduction is also achieved by this

process. Now, let us discuss the

conventional adder implementation

followed by the proposed approximations.

1)Conventional MA:

The Gate-level of a

conventional adder, which is a popular way

of implementing an FA.Since this

implementation is not based on

complementary cmos,it provides a good

opportunity to design an approximate

version with removal of selected

transistors.

2)Approximation 1:

In order to get an

approximate Adder with fewer input gates,

we start to remove c

from the

conventional adder one by one. However,

we cannot do this in an arbitrary fashion.

The input combination of

A, B

and

does not result in short circuits or open

circuits in the simplified schematic.

Another important criterion is that the

resulting simplification should introduce

minimal errors in the FA truth table.

3)Approximation 2:

In order to get an

approximate Adder with fewer input gates,

we start to remove input “B” form the

conventional adder one by one. However,

we cannot do this in an arbitrary fashion.

The input A, B

and

in does not result in

short circuits or open circuits in the

simplified schematic. Another important

criterion is that the resulting simplification

should introduce minimal errors in the FA

truth table.

4)Approximation 3:

In order to get an

approximate Adder with fewer input gates,

we start to remove input “A” from the

conventional adder one by one. However,

we cannot do this in an arbitrary fashion.

The input

A, B

and

in does not result in

short circuits or open circuits in the

simplified schematic. Another important

criterion is that the resulting simplification

should introduce minimal errors in the FA

truth table.

In above all these cases, there is one error

out and four errors in Sum.

RESULT IN DELAY:

A. CONVENTIONAL ADDER(16 BIT)

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 2, Issue 2, Apr-May, 2014

ISSN: 2320 – 8791(Impact Factor: 1.479)

www.ijreat.org

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org)

B. APPROXIMATION ADDER(16 BIT)

C. CONVENTIONAL ADDER(32 BIT)

D. APPROXIMATION ADDER(32 BIT)

E.CONVENTIONAL ADDER(64 BIT)

F. APPROXIMATION ADDER(64 BIT)

COMPARISON:

DELAY

No.

bits

Existing

RCA

(ns)

Proposed Output

A1 A2 A3

8 19.776 14.715

15.8 14.27

16 31.7 23.233

21.97 19.02

32 55.68 37.435

35.776

29.9

64 102 68.8 64.887

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 2, Issue 2, Apr-May, 2014

ISSN: 2320 – 8791(Impact Factor: 1.479)

www.ijreat.org

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org)

KEYWORDS:

A1- Approximation I

A2- Approximation II

A3- Approximation III

SIMULATION RESULT

PERFORMANCE ANALYSIS

111.CONCLUSION:

The ‘APPROXIMATE ADDER’ was thus

designed with an idea to minimize the

delay and power consumption The

‘APPROXIMATE ADDER’ was tested

using the Xilinx ISE and was compared

with the other conventional adders such as

the Ripple carry adder. The power

consumption of the ‘Approximate adder’

was calculated using the Micro

wind/DSCH tool. Extensive comparisons

with conventional digital adders showed

that the proposed ‘APPROXIMATE

ADDER’ outperformed the conventional

adders in both power consumption and

speed performance. The potential

applications of the ‘APPROXIMATE

ADDER’ fall mainly in areas where there

is no strict requirement on accuracy or

where super low power consumption and

high-speed performance are more

important than accuracy. In future we can

implement the approximation technique in

CARRY LOOK AHEAD ADDER(CLA)

and it can be used in the DSP application

for portable devices such as cell phones,

laptops and medical imaging.

1V.References:

[1]Vaibhav Gupta, DebabrataMohapatra,

Anand Ragunathan”Low power Digital

signal processing using approximate

adder”

IEEE transactions on computer-

aided design of integrated circuits and

systems, VOL. 32, NO. 1, JANUARY 2013.

[2] P. Kulkarni, P. Gupta, and M.

Ercegovac, “Trading accuracy for power

with an under designed multiplier

architecture,” in

Proc. 24th IEEE Int.Conf.

VLSI Design

, Jan. 2011, pp. 346–351.

[4] R. Hegde and N. R. Shanbhag, “Soft

digital signal processing,”

IEEE Trans.

Very Large Scale Integr.Syst.

, vol. 9, no. 6,

pp. 813–823, Jun.2001.[5] B. Shim, S.

Sridhara, and N. Shanbhag, “Reliable low-

100

120

rca 16

rca 32

rca 64

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 2, Issue 2, Apr-May, 2014

ISSN: 2320 – 8791(Impact Factor: 1.479)

www.ijreat.org

Published by: PIONEER RESEARCH & DEVELOPMENT GROUP (www.prdg.org)

power digital signal processing via reduced

precision redundancy,”

IEEE Trans. Very

Large Scale Integr. Syst.

, vol. 12, no. 5, pp.

497–510, May 2004.

[6] G. Varatkar and N. Shanbhag, “Energy-

efficient motion estimation using error-

tolerance,” in

Proc. IEEE/ACM Int.

Symp.Low Power Electron.

Design

, Oct. 2006, pp. 113–118.

[7] D. Mohapatra, G. Karakonstantis, and

K. Roy, “Significance driven computation:

A voltage-scalable, variation-aware,

quality-tuning motion

estimator,” in

Proc. IEEE/ACM Int. Symp.

Low Power Electron. Design

Aug. 2009, pp. 195–200.

[8] N. Banerjee, G. Karakonstantis, and K.

Roy, “Process variation tolerant low power

DCT architecture,” in

Proc. Design,

Automat. Test Eur.

, 2007,pp. 1–6.

[9] G. Karakonstantis, D. Mohapatra, and

K. Roy, “System level DSP synthesis using

voltage overscaling, unequal error

protection and adaptive

quality tuning,” in

Proc. IEEE Workshop

Signal Processing Systems

,Oct. 2009, pp.

133–138.

[10] L. N. Chakrapani, K. K.

Muntimadugu, L. Avinash, J. George, and

K. V.Palem, “Highly energy and

performance efficient embedded

computing through approximately correct

arithmetic: A mathematical foundation and

preliminary experimental validation,” in

Proc. CASES

, 2008, pp.187–196.

A Novel Highly-Efficient Inexact Full Adder Cell for Motion and Edge Detection Systems of Image Processing in CNFET Technology

Article

Mar 2022

In this paper, a novel and highly efficient inexact Full Adder cell by exploiting two logic styles including conventional CMOS (C-COMS) and pass transistor logic (PTL) are presented. The so-called carbon nanotube field-effect transistor (CNFET) technology is used to implement circuits at the transistor level. To justify the efficiency of our design, extensive simulations are performed at the transistor level as well as application level. Transistor-level simulations which are carried out by the HSPICE 2008 tool, demonstrate at least 12% higher performance in terms of power-delay-area product (PDAP) of the proposed circuit compared to the latest designs. At the application level, by using the MATLAB tool, inexact Full Adders are employed in the structure of the ripple carry adder (RCA) that is applied in motion and edge detection algorithms. Computer simulation results confirm the appropriate quality of the output images in terms of the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) criteria. At last, to make a compromise between hardware and application level parameters, the power-delay-area-1/PSNR product (PDAPP) and power-delay-area-1/SSIM product (PDASP) are considered as figures of merit. The proposed circuit shows remarkable improvement from the PDAPP and PDASP points of view compared to its counterparts.

A 1.17ns FO4(1V) Low Energy Cost 4-Bit Absolute-Value Detector

Conference Paper

Sep 2021

System level DSP synthesis using voltage overscaling, unequal error protection & adaptive quality tuning

Conference Paper

Full-text available

Nov 2009

In this paper, we propose a system level design approach considering voltage over-scaling (VOS) that achieves error resiliency using unequal error protection of different computation elements, while incurring minor quality degradation. Depending on user specifications and severity of process variations/channel noise, the degree of VOS in each block of the system is adaptively tuned to ensure minimum system power while providing ldquojust-the-rightrdquo amount of quality and robustness. This is achieved, by taking into consideration system level interactions and ensuring that under any change of operating conditions only the ldquoless-crucialrdquo computations, that contribute less to block/system output quality, are affected. The design methodology applied to a DCT/IDCT system shows large power benefits (up to 69%) at reasonable image quality while tolerating errors induced by varying operating conditions (VOS, process variations, channel noise). Interestingly, the proposed IDCT scheme conceals channel noise at scaled voltages.

Significance driven computation: a voltage-scalable, variation-aware, quality-tuning motion estimator

Conference Paper

Full-text available

Aug 2009

In this paper we present a design methodology for algorithm/architecture co-design of a voltage-scalable, process variation aware motion estimator based on significance driven computation. The fundamental premise of our approach lies in the fact that all computations are not equally significant in shaping the output response of video systems. We use a statistical technique to intelligently identify these significant/not-so-significant computations at the algorithmic level and subsequently change the underlying architecture such that the significant computations are computed in an error free manner under voltage over-scaling. Furthermore, our design includes an adaptive quality compensation (AQC) block which "tunes" the algorithm and architecture depending on the magnitude of voltage over-scaling and severity of process variations. Simulation results show average power savings of ~ 33% for the proposed architecture when compared to conventional implementation in the 90 nm CMOS technology. The maximum output quality loss in terms of Peak Signal to Noise Ratio (PSNR) was ~ 1 dB without incurring any throughput penalty.

Highly energy and performance efficient embedded computing through approximately correct arithmetic

Conference Paper

Oct 2008

We develop a theoretical foundation to characterize a novel methodology for low energy and high performance dsp for embedded computing. Computing elements are operated at a frequency higher than that permitted by a conventionally correct circuit design, enabling a trade-off between error that is deliberately introduced, and the energy consumed. Simi- lar techniques considered previously were relevant to deeply scaled future technology generations. Our work extends this idea to be applicable to current-day designs through: (i) a mathematically rigorous foundation characterizing a trade- off between energy consumed and the quality of solution, and (ii) a means of achieving this trade off through very aggres- sive voltage scaling beyond that of a conventionally designed circuit. Through our "cmos inspired" mathematical model, we show that our approach is better (by an exponential fac- tor) than the conventional uniform voltage scaling approach for comparable computational speed or performance .W e fur- ther establish through experimental study that a similar im- provement by a factor of 3.4x to the snr over conventional voltage-scaled approaches can be achieved in the context of the ubiquitous discrete Fourier transform.

Energy-efficient Motion Estimation using Error-Tolerance

Conference Paper

Nov 2006

Presented is an energy-efficient motion estimation architecture using error-tolerance. The technique employs overscaling of the supply voltage (voltage overscaling (VOS)) to reduce power at the expense of timing errors, which are then corrected using algorithmic noise-tolerance (ANT) techniques. Referred to as input subsampled replica ANT (ISR-ANT), the proposed technique incorporates an input subsampled replica of the main sum of absolute difference (MSAD) block for obtaining the motion vectors in the presence of errors induced by VOS. Simulations show that the proposed technique can save up to 60% power over an optimal error-free present day system in a 130nm CMOS technology. Power savings increase to 79% in a 45nm predictive process technology.

digital signal processing via reduced precision redundancy

May 2004
497-510

digital signal processing via reduced precision redundancy," IEEE Trans. Very Large Scale Integr. Syst., vol. 12, no. 5, pp. 497-510, May 2004.

Process variation tolerant low power DCT architecture

Jan 2007
1-6

Roy, "Process variation tolerant low power DCT architecture," in Proc. Design, Automat. Test Eur., 2007,pp. 1-6.

Design Of Low Power Approximate Mirror Adder

Figures

Recommended publications

Design of A High Speed And Low Power 4 Bit Carry Skip Adder

Design of low power 12-bit magnitude comparator

An IC Analog Modulator

Design of a Bandgap Current Reference with Wide Range of Output Voltage in a 0.18.