Bypassing Technique Based Braun Multiplier for Low Power Consumption with Improved Speed

Kusum Sahu1*, Rahul Sinha2*

1Electronics and Telecommunication, CSVTU
CSIT Durg (C.G.), India
kusum.sahu13@gmail.com

2Electronics and Telecommunication, CSVTU
CSIT Durg (C.G.), India
rsinha.vlsieng@gmail.com

Abstract— A multiplier is one of the key hardware blocks in most digital and high performance systems such as FIR filters, digital signal processors and microprocessors etc. If the power consumption of the multiplier block reduced, then reduction of the power consumption of various digital signal processing chips and communication systems could be possible. In this paper 4/8/16 bit Braun multiplier has been designed using bypassing technique and a comparative study of all bypassing techniques on the basis of their power consumption is done. At the last stage of the multiplier KSA (Kogge Stone Adder) and RCA (Ripple Carry Adder) is used for high speed and the low power consumption. The implementation of Braun multipliers and its bypassing techniques is done using Verilog HDL Xilinx 14.4 ISE and it is verified using the Spartan6, device xc6slx45. After comparing, it is concluded that mixed bypass multiplier is best suited for situations where both delay and power are low. The number of slices & LUT used is also less when compared to other techniques. For 16 bit Row and Column Bypassing Technique using KSA the delay and power found to be 28.118 ns and 0.184 Watt From other two methods (Row Bypass and Column Bypass) it is found to be (28.679ns & 0.190 Watt) and (28.518 ns and 0.184 Watt) respectively.

Keywords— Column Bypass, KSA, Mixed Bypass, RCA, Row Bypass, Verilog HDL.

I. Introduction

Multiplication is one of the essential operations in Digital Signal Processing (DSP) applications like Fast Fourier Transform (FFT), Digital filters etc. [1]. In DSP applications, most of the power is consumed by the multipliers, hence low power multipliers must be designed in order to reduce the power dissipation in DSP applications. With advancement in the technology, many investigators have examined and are trying to design multipliers which will offer the following design characteristic – Accuracy, low power consumption, high speed, regularity of layout, less complexity and less area or the combination of all these characteristic in one multiplier thus making them appropriate for low power consumption, high speed, less area in compact VLSI implementation. The multiplication is an arithmetical operation for common DSP application, such as digital filter, digital communication system, microprocessor, Fast Fourier Transform (FFT), portable and battery operated devices. In order to achieve the high speed and low power demand in DSP applications, parallel array multipliers are generally used. One such commonly used parallel array multiplier is the Braun’s multiplier. The Braun’s multiplier is generally called as the CSA (Carry Save Array Multiplier). The architecture of a Braun’s multiplier consists of AND gates and full adders. Power consumption is mainly due to the Static and Dynamic power consumption. Static power consumption is very small so our major concern is the Dynamic power Consumption which have to be reduced using Braun Multiplier so that our portable devices will have a long life. When Braun multiplier is used without Bypassing technique then power consumption can be increased this disadvantage can be removed by using Braun multiplier using bypassing technique and at the last stage of the bypassing based multiplier KSA and RCA can be used to improve the switching speed and for the low power consumption.

II. The Adders

Ripple Carry Adder Ripple carry adder can be designed by cascading full adder in series i.e. carry from previous full adder is connected as input carry for the next stage [1]. Multiple full adder circuits can be cascaded in parallel to add an N-bit number. For a 4-bit parallel adder, there must be 4 number of full adder circuits. A Ripple Carry Adder is a logic circuit in which the carry-out of each full adder is the carry-in of the succeeding next most significant full adder. It is called a Ripple Carry Adder because each carry bit gets rippled into the next stage. In a Ripple Carry Adder the sum and carry out bits of any Half Adder stage is not valid until the carry in of that stage occurs.

Fig1. 4-bit Ripple Carry Adder

Kogge Stone Adder The Kogge-Stone Adder concept was given by Peter M. Kogge and Harold S. Stone and was published in 1973. In Ripple Carry Adder the carry input of Full Adder is dependent on the carry output of the previous Full Adder and present Full Adder wait until the previous Full Adder has generated output. Hence delay is more for the Ripple Carry Adder if no. of bit increases, then delays also

© 2016 IJAIR. All Rights Reserved

| 239 |
increases, so for reducing this delay Kogge Stone Adder (KSA) is used. Three computational stages are used in Kogge Stone Adder these are Preprocessing, Carry generation network, Post processing.

Fig2. 4 bit Kogge Stone Adder

III. Bypassing Technique

Bypassing means to turn off some row or column when their outputs are known or in other words, we can say that Row or Column may be turn off when either multiplicand or multiplier or both bits have zero value. To reduce the switching activities three types of bypassing techniques are used in this project these are:

- Row bypassing multiplier
- Column bypassing multiplier
- Row and Column bypassing multiplier

The Basic building blocks used for the construction of all these Bypassing based multiplier is FA cell (Full adder cell). This Full adder cell is constructed by using one Full Adder, Multiplexers and Buffers.

IV. Row Bypassing Multiplier

In Row bypassing multiplier if multiplier bit b₂ is zero in below figure then partial products output is directly given to the third row from the first row and the second row is disabled. But when the rightmost full adder in the second row is disabled then addition is not executed and it would not give a correct answer so some extra circuitry is required which is shown in the triangle area. This is the drawback of Row Bypassing Multiplier.

Fig3. A 4*4 Braun Multiplier with Row Bypassing

One of the drawbacks of this Row bypassing Braun multiplier is that if the rightmost FA cell in the second row is disabled then it would not give a correct result, so to obtain an accurate result in some extra circuitry is required which is shown in the above figure enclosed by the blue triangle area. This drawback can be reduced by using a Column bypassing multiplier.

V. Column Bypassing Multiplier

The drawback of the Row Bypassing technique can be minimized by using column bypassing multiplier and it is used when multiplicand bit is zero. To design 4*4, 8*8 and 16*16 bit Braun multiplier with Column bypassing, the basic building block is FACELL (Full adder cell). This FACELL can be designed by using full adder, multiplexer, buffer. A 4*4 column bypassing multiplier can be implemented by using (n-1)*(n-1) full adders, (n-1)*(n-1) multiplexers and 2*(n-1)*(n-1) buffers.
The circuitry of this Row and Column bypassing multiplier is complex from other two bypassing techniques. In this, in each stage of CSA array AND gate and OR gate is used which increases the hardware complexity.

VII. Synthesis and Simulation Result
The synthesis of Braun multiplier has been performed by using Xilinx ISE 14.4 tool & simulation has been done using Isim 14.4 simulation tool on Spartan 6 family, device Xc6slx45, package as csg324 with speed grade -3. The RTL schematic of 16 bit Braun multiplier with Row Bypassing using RCA and KSA is shown in below fig. 8, 9 and the simulation view of 16 bit Braun multiplier with Row Bypassing is shown in below fig 10.

VI. Row and Column Bypassing
In mixed bypassing multiplier the addition operations in the i-th column or the j-th row can be bypassed if the ai bit in the multiplicand is 0 or the bj bit in the multiplier is 0. The basic building block for the designing a 4x4 Row and Column bypass multiplier is FA cell and it can be implemented by using (n-1)* (n-1) full adders, 2*(n-1)* (n-1) multiplexers and 2*(n-1)* (n-1) buffers.
The RTL schematic of 16 bit Braun multiplier with Row and Column (mixed) bypassing using RCA and KSA is shown in below fig. 14, 15 and the simulation view of 16 bit Braun multiplier with Row and Column (mixed) bypassing using RCA and KSA is shown in below fig. 16.
Column bypassing and mixed bypassing Braun multiplier less power is consumed using KSA when compared from RCA and hence speed can be improved. For 16 bit Row and Column Bypassing Technique using KSA the consumed power is found to be 0.184 Watt. From other two methods (Row Bypass and Column Bypass) it is found to be 0.190 Watt and 0.184 Watt respectively.

X. Future Work
In this paper 4*8, 8*8 and 16*16 bit multipliers are designed for unsignalled bit multipliers. The bypassing techniques can also be applied to signed array multiplier architectures. It can also be implemented for the higher bit length like 32*32 and 64*64 bit in future. The design of multipliers considered here involved the use of RCA and KSA. The value of delay and power can also be reduced by using some other PPAs like BKA, SKA, LDA, and HCA etc.

IX. Conclusion
4/8/16 bit Row Bypassing, Column Bypassing and mixed (Row and Column) bypassing is implemented using RCA and KSA in Xilinx ISE 14.4 tool and it is simulated by using ISIM 14.4 simulator and both are compared. From the above comparison table it is concluded that for 4/8/16 bit Row, Column bypassing and mixed bypassing Braun multiplier less power is consumed using KSA when compared from RCA and hence speed can be improved. For 16 bit Row and Column Bypassing Technique using KSA the consumed power is found to 0.184 Watt. From other two methods (Row Bypass and Column Bypass) it is found to be 0.190 Watt and 0.184 Watt respectively.

VIII. Comparison Table for All Bypassing Technique using RCA & KSA

<table>
<thead>
<tr>
<th>Bit Length</th>
<th>Temp. Adder</th>
<th>Power Consumption (in Watt)</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td>10&lt;sup&gt;-6&lt;/sup&gt;</td>
</tr>
<tr>
<td>4 bit row Bypass</td>
<td>RCA</td>
<td>0.039</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.029</td>
</tr>
<tr>
<td>8 bit row Bypass</td>
<td>RCA</td>
<td>0.075</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.029</td>
</tr>
<tr>
<td>16 bit row Bypass</td>
<td>RCA</td>
<td>0.087</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.080</td>
</tr>
<tr>
<td>4 bit column Bypass</td>
<td>RCA</td>
<td>0.035</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.028</td>
</tr>
<tr>
<td>8 bit column Bypass</td>
<td>RCA</td>
<td>0.069</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.056</td>
</tr>
<tr>
<td>16 bit column Bypass</td>
<td>RCA</td>
<td>0.085</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.075</td>
</tr>
<tr>
<td>4 bit mixed Bypass</td>
<td>RCA</td>
<td>0.028</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.026</td>
</tr>
<tr>
<td>8 bit mixed Bypass</td>
<td>RCA</td>
<td>0.045</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.029</td>
</tr>
<tr>
<td>16 bit mixed Bypass</td>
<td>RCA</td>
<td>0.080</td>
</tr>
<tr>
<td></td>
<td>KSA</td>
<td>0.073</td>
</tr>
</tbody>
</table>

IX. Conclusion
4/8/16 bit Row Bypassing, Column Bypassing and mixed (Row and Column) bypassing is implemented using RCA and KSA in Xilinx ISE 14.4 tool and it is simulated by using ISIM 14.4 simulator and both are compared. From the above comparison table it is concluded that for 4/8/16 bit Row, Column bypassing and mixed bypassing Braun multiplier less power is consumed using KSA when compared from RCA and hence speed can be improved. For 16 bit Row and Column Bypassing Technique using KSA the consumed power is found to be 0.184 Watt. From other two methods (Row Bypass and Column Bypass) it is found to be 0.190 Watt and 0.184 Watt respectively.

X. Future Work
In this paper 4*8, 8*8 and 16*16 bit multipliers are designed for unsignalled bit multipliers. The bypassing techniques can also be applied to signed array multiplier architectures. It can also be implemented for the higher bit length like 32*32 and 64*64 bit in future. The design of multipliers considered here involved the use of RCA and KSA. The value of delay and power can also be reduced by using some other PPAs like BKA, SKA, LDA, and HCA etc.

References
[12] Soniya, Suresh Kumar (July – August 2013), “A Review of Different Type of Multipliers and Multiplier-Accumulator Unit”,...

