4.1 Unsigned Binary Multiplication: Digital Computer Arithmetic Datapath Design
4.1 Unsigned Binary Multiplication: Digital Computer Arithmetic Datapath Design
P = A·B
n−1
m−1
= ( ai · 2i ) · ( bj · 2j )
i=0 j=0
m−1
n−1
= ai · bj · 2i+j
i=0 j=0
a3 a2 a1 a0
x b3 b2 b1 b0
a3 b0 a2 b0 a1 b0 a0 b0
a3 b1 a2 b1 a1 b1 a0 b1
a3 b2 a2 b2 a1 b2 a0 b2
a3 b3 a2 b3 a2 b3 a0 b3
p7 p6 p5 p4 p3 p2 p1 p0
multiplication is concerned with not just two operands, but many of them it is
imperative to organize the hardware to mitigate the carry path or chain. There-
fore, many implementations consider adders according to two principles:
Carry-Save Addition (CSA) - idea of utilizing addition without carries con-
nected in series but just to count.
Carry-Propagate Addition (CPA) - idea of utilizing addition with the carries
connected in series to produce a result in either conventional or redundant
notation.
Each adder is the same as the full adder discussed in Chapter 3, however, the
view in which each connection is made from adder to adder is where the main
difference lies. Because each adder is really trying to compute both carry and
save information, sometimes VLSI designers refer to it as a carry-save adder
or CSA. As mentioned previously, because each adder attempts to count the
number of inputs that are 1, it is sometimes also called a counter. A (c, d) is an
adder where c refers to the column height and d is the number of bits to display
at its output. For example, a (3, 2) counter counts the 3 inputs all with the same
weight and displays two outputs. A (3, 2) counter is shown in Figure 4.2.
Therefore, an n-bit CSA can take three n-bit operands and generate an n-bit
partial sum and n-bit carry. Large operand sizes would require more CSAs to
produce a result. However, a CPA would be required to produce the correct
result. For example, in Table 4.1 an example is shown that adds together A +
B + D + E with the values 10 + 6 + 11 + 12. The implementation utilizing
the carry-save concept for this example is shown in Figure 4.4. As seen in
58 DIGITAL COMPUTER ARITHMETIC DATAPATH DESIGN
A B D
CSA
C S
Table 4.1, the partial sum is 31 and the carry is 8 that produces the correct result
of 39. This process of performing addition on a given array that produces an
output array with a smaller number of bits is called reduction. The Verilog
code for this implementation is shown in Figure 4.3.
The CSA utilizes many topologies of adder so that the carry-out from one
adder is not connected to the carry-in of the next adder. Eventually, a CPA
could be utilized to form the true result. This organization of utilizing m-word
by n-bit multi-operand adders together to add m-operands or words each of
which is n-bits long is called a multi-operand adder (MOA). A m-word by n-
bit multi-operand adder can be implemented using (m − 2) n-bit CSA’s and 1
CPA. Unfortunately, because the number of bits added together increases the
result, the partial sum and carry must grow as well. Therefore, the result will
contain n + log2 (m) bits. In our example above, this means a 4 + log2 (4) =
6-bit result is produced.
Higher order counters can be created by putting together various sized coun-
ters. A higher order counter (p, q) takes p input bits and produces q output bits.
Multiplication 59
input [3:0] A, B, D, E;
input Cin;
output [4:0] S, C;
endmodule // moa4x4
A3 B3 D3 A2 B2 D2 A1 B1 D1 A0 B0 D0 C in
E3 E2 E1 E0
S4 C3 S3 C2 S2 C1 S1 C0 S0
Since q bits can represent a number between 0 and 2q − 1, the following re-
striction is required p ≤ 2q − 1. In general a (2q − 1, q) higher order counter
requires (2q − 1 − q) (3, 2) counters. The increase in complexity that occurs
in higher-order counters and multi-operand adders can make an implementa-
tion complex as seen by the Verilog code in Figure 4.3. Consequently, some
designers use programs that generate RTL code automatically. Another use-
ful technique is to utilize careful naming methodologies for each temporary
60 DIGITAL COMPUTER ARITHMETIC DATAPATH DESIGN
variable and declaration. For example, in Figure 4.3, the temporary variables
utilize s 0 2 to represent the sum from the first carry-save adder in the second
column.