Freescale Signal Processing Extension 2 ISA Reference Manual
Freescale Signal Processing Extension 2 ISA Reference Manual
Freescale Signal Processing Extension 2 ISA Reference Manual
f
Assignment in which the data may be reformatted in the target location
NOT logical operator (ones complement)
+ Twos complement addition
Twos complement subtraction, unary minus
Multiplication
Division (yielding quotient)
Table 3-4. Instruction Field Descriptions (continued)
Field Description
Description of Instruction Operation
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-11
+
dp
Floating-point addition, result rounded to double-precision
dp
Floating-point subtraction, result rounded to double-precision
dp
Floating-point multiplication, product rounded to double-precision
dp
Floating-point division quotient, rounded to double-precision
+
sp
Floating-point addition, result rounded to single-precision
sp
Floating-point subtraction, result rounded to single-precision
sf
Signed fractional multiplication
si
Signed integer multiplication
sp
Floating-point multiplication, result rounded to single-precision
sp
Floating-point division, result rounded to single-precision
fp
Floating-point multiplication to infinite precision (no rounding)
ui
Unsigned integer multiplication
FPSquareRoot-Double(x) Floating-point , result rounded to double-precision
FPSquareRoot-Single(x) Floating-point , result rounded to single-precision
FPReciprocal-Estimate(x) Floating-point estimate of
FPReciprocal-SquareRoot-
Estimate(x)
Floating-point estimate of
Allocate-DataCache-
Block(x)
If the block containing the byte addressed by x does not exist in the data cache, allocate a block
in the data cache and set the contents of the block to 0.
Flush-DataCache-Block(x) If the block containing the byte addressed by x exists in the data cache and is dirty, the block is
written to main memory and is removed from the data cache.
Invalidate-DataCache-
Block(x)
If the block containing the byte addressed by x exists in the data cache, the block is removed from
the data cache.
Store-DataCache-Block(x) If the block containing the byte addressed by x exists the data cache and is dirty, the block is
written to main memory but may remain in the data cache.
Prefetch-DataCache-
Block(x,y)
If the block containing the byte addressed by x does not exist in the portion of the data cache
specified by y, the block in memory is copied into the data cache.
Prefetch-ForStore-
DataCache-Block(x,y)
If the block containing the byte addressed by x does not exist in the portion of the data cache
specified by y, the block in memory is copied into the data cache and made exclusive to the
processor that is executing the instruction.
ZeroDataCache-Block(x) The contents of the block containing the byte addressed by x in the data cache is cleared.
Invalidate-Instruction-
CacheBlock(x)
If the block containing the byte addressed by x is in the instruction cache, the block is removed
from the instruction cache.
Prefetch-Instruction-
CacheBlock(x,y)
If the block containing the byte addressed by x does not exist in the portion of the instruction cache
specified by y, the block in memory is copied into the instruction cache.
=, Equal to, Not Equal to relations
Table 3-5. RTL Notation (continued)
Notation Meaning
x
x
1
x
---
1
x
-------
SPE2 Programming Interface Manual, Rev. 1.0-2
3-12 Freescale Semiconductor
SPE2 Operations
<, , >, Signed comparison relations
<
u
, >
u
Unsigned comparison relations
? Unordered comparison relation
&, | AND, OR logical operators
, Exclusive OR, Equivalence logical operators ((ab) = (ab))
ABS(x) Absolute value of x
APID(x) Returns an implementation-dependent information on the presence and status of the auxiliary
processing extensions specified by x
CEIL(x) Least integer x
CnvtFP32ToI32Sat(fp,
signed,upper_lower,round,
fractional)
Converts a 32 bit floating point number to a 32 bit integer if possible, otherwise it saturates.
CnvtI32ToFP32Sat
(v,signed,upper_lower,
fractional)
Converts a 32 bit integer to a 32 bit floating point number if possible, otherwise it saturates.
EXTS(x) Result of extending x on the left with signed bits
EXTZ(x) Result of extending x on the left with zeros
GPR(x) General purpose register x
MASK(x, y) Mask that has ones in bit positions x through y (wrapping if x>y) and zeros elsewhere
MEM(x,1) Contents of the byte of memory located at address x
MEM(x,y)(for y={2,4,8}) Contents of y bytes of memory starting at address x.
If big-endian memory, the byte at address x is the MSB and the byte at address x+y1 is the
LSB of the value being accessed.
If little-endian memory, the byte at address x is the LSB and the byte at address x+y 1 is the
MSB of the value being accessed.
MOD(x,y) Modulo y of x (remainder of x divided by y)
ROTL32(x, y) Result of rotating the value x||x left y positions, where x is 32 bits long
SINGLE(x) Result of converting x from floating-point double format to floating-point single format
SPREG(x) Special-purpose register x
TRAP Invoke a trap-type program interrupt
characterization Reference to setting status bits in a standard way that is explained in the text
undefined Undefined value that may vary between implementations and between different executions on the
same implementation
CIA Current instruction address, which is the address of the instruction that is described in RTL. Used
by relative branches to set the next instruction address (NIA) and by branch instructions with LK=1
to set the LR. CIA does not correspond to any architected register.
Table 3-5. RTL Notation (continued)
Notation Meaning
Overview of Intrinsic Definition
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-13
Table 3-6 summarizes precedence rules for RTL operators. Operators that are higher in the table
are applied before those that are lower in the table. Operators at the same level in the table associate
from left to right, from right to left, or not at all, as shown. (For example, the operator associates
from left to right, so abc = (ab)c.) Using parentheses can increase clarity or override the
evaluation order that the table implies; parenthesized expressions are evaluated before serving as
parameters.
3.6 Overview of Intrinsic Definition
The rest of this chapter describes individual instructions, which are listed in alphabetical order by
mnemonic. Figure 3-3 shows the format for instruction description pages.
NIA Next instruction address, and the address of the next instruction to be executed. For a successful
branch, the next instruction address is the branch target address: in RTL, indicated by assigning
a value to NIA. For other instructions that cause non-sequential instruction fetching, the RTL is
similar. For instructions that do not branch, and do not otherwise cause instruction fetching to be
non-sequential, the next instruction address is CIA+4. NIA does not correspond to any architected
register.
if then else Conditional execution indenting shows range; else is optional.
do Do loop, indenting shows range. To and/or by clauses specify incrementing an iteration variable,
and a while clause gives termination conditions.
leave Leave innermost do loop, or do loop described in leave statement
calc_update Load and Store Indexed with-Update parameter a Calculation
Table 3-6. Operator Precedence
Operators Associativity
Subscript, function evaluation Left to right
Pre-superscript (replication), post-superscript
(exponentiation)
Right to left
unary , Right to left
, Left to right
+, Left to right
|| Left to right
=, , <, , >, , <
u
, >
u
, ? Left to right
&, , Left to right
| Left to right
: (range) None
None
Table 3-5. RTL Notation (continued)
Notation Meaning
SPE2 Programming Interface Manual, Rev. 1.0-2
3-14 Freescale Semiconductor
SPE2 Operations
Figure 3-3. Instruction Description
3.6.1 Saturation, Shift, and Bit Reverse Models
For saturation, left shifts, and bit reversal, the pseudo RTL is provided here to more accurately
describe those functions that are referenced in the intrinsic pseudo RTL.
3.6.1.1 Saturation
SATURATE(overflow, carry, saturated_underflow, saturated_overflow, value)
if overflow then
if carry then
return saturated_underflow
else
return saturated_overflow
else
return value
3.6.1.2 Shift
SL(value, cnt)
if cnt > 31 then
return 0
else
Instruction mnemonic
Instruction name
Instruction syntax
Instruction encoding
RTL description of
Text description of
Registers altered by instruction
instruction operation
instruction operation
User/Supervisor access
Architecture
Key:
ev_addsmiaaw __ev_addsmiaaw
Vector Add Signed, Modulo, Integer to Accumulator Word
d = __ev_addsmiaaw (a)
d0:31 ACC0:31 + A0:31
d32:63 ACC32:63 + A32:63
// update accumulator
ACC0:63 d0:63
Each word element in A is added to the corresponding element in the
accumulator. The sum is placed into the corresponding parameter d word and
into the accumulator.
Other registers altered: ACC
d a Maps to
__ev64_opaque __ev64_opaque evaddsmiaaw d,a
0 31 32 63
A
Accumulator
+ +
d and accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-15
return (value << cnt)
3.6.1.3 Bit Reverse
BITREVERSE(value)
result 0
mask 1
shift 31
cnt 32
while cnt > 0 then do
t data & mask
if shift >= 0 then
result (t << shift) | result
else
result (t >> -shift) | result
cnt cnt - 1
shift shift - 2
mask mask << 1
return result
3.6.1.4 Load and Store Indexed with-Update Calculations
NOTE: THIS IS STILL IN DEFINITION and is NOT FINALIZED
Calc_update(a,b)
mode b
0:2
if(mode=000) then
if 64-bit mode then a
0:63
a
0:63
+ EXTS(b
3:63
)
else a
32:63
a
32:63
+ b
32:63
elseif (mode=001) then
Offset b
8:15
;// signed byte offset
Length b
16:31
;// unsigned buffer length-1 in bytes. Length is byte index of last
// byte in buffer. buffer must be aligned on a power of 2 byte boundary.
// Length must be >7.
Mod CEIL(log
2
(Length+1)); // power of 2 byte boundary of 1st buffer element.
// Mod must be >3.
a
(64-Mod):63
(a
48:63
+ exts(Offset))
(64-Mod):63
;
elseif (mode=010) then
Mask b
16:31
// mask value is log2(#points)1, zero extended, then left shifted log2(element size
// in bytes). e.g., a 16 point FFT on halfwords has a mask of 16b0000000000011110
tmpa a
48:63
// up to 64Kbytes in a single FFT
tmpd bitreverse(1 + bitreverse(tmpa | ~Mask)))
a
32:63
(a
32:63
& ~Mask) || (tmpd & Mask)
else program illegal exception
3.7 Intrinsic Definitions
The intrinsic definitions are split into the integer operations defined by the Enhanced Signal
Processing Engine (SPE2) and the Embedded Floating-Point Version 2 (EFP2) APU.
SPE2 Programming Interface Manual, Rev. 1.0-2
3-16 Freescale Semiconductor
SPE2 Operations
3.7.1 SPE2 Intrinsic Definitions
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-17
__brinc __brinc
Bit Reversed Increment
d = __brinc (a,b)
n MASKBITS // Imp dependent # of mask bits
mask b
64-n:63
// Least sig. n bits of register
temp0 a
64-n:63
temp1 bitreverse(1 + bitreverse(a | ( mask)))
d a
0:63-n
|| (temp1 & mask)
brinc provides a way for software to access FFT data in a bit-reversed manner. Parameter a
contains the index into a buffer that contains data on which FFT is to be performed. Parameter b
contains a mask that allows the index to be updated with bit-reversed addressing. Typically this
instruction precedes a load with index instruction; for example,
brinc r2, r3, r4
lhax r8, r5, r2
Parameter b contains a bit-mask that is based on the number of points in an FFT. To access a buffer
containing n byte sized data that is to be accessed with bit-reversed addressing, the mask has log
2
n
1s in the least significant bit positions and 0s in the remaining most significant bit positions. If,
however, the data size is a multiple of a half word or a word, the mask is constructed so that the 1s
are shifted left by log
2
(size of the data) and 0s are placed in the least significant bit positions.
Table 3-7 shows example values of masks for different data sizes and number of data.
Architecture Note: An implementation can restrict the number of bits specified in a mask. The
number of bits in a mask may not exceed 32.
Architecture Note: This instruction only modifies the lower 32 bits of the destination register in
32-bit implementations. For 64-bit implementations in 32-bit mode, the contents of the upper
32 bits of the destination register are undefined.
Architecture Note: Execution of brinc does not cause SPE2 Unavailable exceptions, regardless of
the state of MSRSPE.
Table 3-7. Data Samples and Sizes
Number of Data
Samples
Byte Half Word Word Double Word
8 000...00000111 000...00001110 000...000011100 000...0000111000
16 000...00001111 000...00011110 000...000111100 000...0001111000
32 000...00011111 000...00111110 000...001111100 000...0011111000
64 000...00111111 000...01111110 000...011111100 000...0111111000
d a b Maps to
uint32_t uint32_t uint32_t brinc d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-18 Freescale Semiconductor
SPE2 Operations
__ev_circinc __ev_circinc
Circular Increment
d = __ev_circinc (a,b)
Offset
0:15
b
48:63
; // signed byte offset in b
Length
0:15
a
16:31
; // unsigned buffer length-1 in bytes. Length is index of
// last byte in buffer.
Index
0:31
a
32:63
; // index into buffer, must be positive.
if ((Offset
0
= 0) & ((EXTS
32
(Offset
0:15
) + Index
0:31
) >
ui
EXTZ
32
(Length
0:15
))) then
d
32:63
Index
0:31
+ EXTS
32
(Offset
0:15
) - EXTZ
32
(Length
0:15
) - 1; // wrap at end
elseif (Offset
0
= 1) & ((EXTS
32
(Offset
0:15
) + Index
0:31
) <
si
0)) then
d
32:63
Index
0:31
+ EXTS
32
(Offset
0:15
) + EXTZ
32
(Length
0:15
) + 1; // wrap at start
else d
32:63
Index
0:31
+ EXTS
32
(Offset
0:15
);
Note: Length+1 must be a multiple of 8 bytes, Index must be within buffer, and abs(Offset)
must be less or equal to Length+1 for the calculation to be used properly in a future buffer
access in all implementations.
circinc provides a way for software to modify a circular buffer index value. Parameter a contains
the index value into a circular buffer that contains data accessed by a load or store instruction
using circular addressing. Parameter b contains an offset value that allows the index to be
updated with circular addressing. Typically this instruction is used to update a new buffer pointer
value following a sequence of buffer accesses, to allow the pointer into the buffer to be
moved to the next starting element for a future calculation. For forward compatibility, the
Offset value in parameter b should be a 32-bit value with a magnitude <= 16-bits.
Figure 3-4.
Architecture Note: This instruction only modifies the lower 32 bits of the destination register in
32-bit implementations, or in 64-bit implementations in both 32-bit and 64-bit mode, and the
contents of the upper 32-bits of the destination register are unaffected.
0 15 16 .... 31 32 ... 63
- Length
(unsigned)
Index
(always positive)
a
0 31 32 47 48 63
- EXTS(Offset) Offset
(signed)
b
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ circinc d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-19
Architecture Note: An implementation can restrict the allowable values of the Length, Offset, and
Index. Unallowed values may generate a boundedly undefined result. Implementations must
support a set of values that allow for calculations supporting circular buffers that are a multiple of
8 bytes, where the index points to an element of the buffer, and the magnitude of the offset is <=
the size of the buffer.
SPE2 Programming Interface Manual, Rev. 1.0-2
3-20 Freescale Semiconductor
SPE2 Operations
__ev_abs __ev_abs
Vector Absolute Value
d = __ev_abs (a)
d
0:31
ABS(a
0:31
)
d
32:63
ABS(a
32:63
)
The absolute value of each element of parameter a is placed in the corresponding elements of
parameter d. An absolute value of 0x8000_0000 (most negative number) returns 0x8000_0000.
No overflow is detected.
Figure 3-5. Vector Absolute Value (__ev_abs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabs d,a
0 31 32 63
a
d
ABS ABS
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-21
__ev_absb __ev_absb
Vector Absolute Value Byte
d = __ev_absb (a)
d
0:7
ABS(a
0:7
)
d
8:15
ABS(a
8:15
)
d
16:23
ABS(a
16:23
)
d
24:31
ABS(a
24:31
)
d
32:39
ABS(a
32:39
)
d
40:47
ABS(a
40:47
)
d
48:55
ABS(a
48:55
)
d
56:63
ABS(a
56:63
)
The absolute value of each byte element of parameter a is placed in the corresponding elements of
parameter d. An absolute value of 0x80 (most negative number) returns 0x80. No overflow is
detected.
Figure 3-6. Vector Absolute Value Byte (__ev_absb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabsb d,a
0 40 32 63
a
d
ABS ABS ABS ABS ABS ABS ABS ABS
8 16 24 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-22 Freescale Semiconductor
SPE2 Operations
__ev_absbs __ev_absbs
Vector Absolute Value Byte and Saturate
d = __ev_absbs (a)
The absolute value of each signed byte element of parameter a is placed into parameter d. The
absolute value of 0x80 (most negative number) returns 0x7F. Any overflow is reported in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-7. Vector Absolute Value Byte and Saturate (__ev_absbs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabsbs d,a
if (a
0:7
= 0x80) then
d
0:7
0x7F
ovh 1
else
d
0:7
ABS(a
0:7
)
ovh 0
if (a
8:15
= 0x80) then
d
8:15
0x7F
ovh 1
else d
8:15
ABS(a
8:15
)
if (a
16:23
= 0x80) then
d
16:23
0x7F
ovh 1
else d
16:23
ABS(a
16:23
)
if (a
24:31
= 0x80) then
d
24:31
0x7F
ovh 1
else d
24:31
ABS(a
24:31
)
if (a
32:39
= 0x80) then
d
32:39
0x7F
ovl 1
else
d
32:39
ABS(a
32:39
)
ovl 0
if (a
40:47
= 0x80) then
d
40:47
0x7F
ovl 1
else d
40:47
ABS(a
40:47
)
if (a
48:55
= 0x80) then
d
48:55
0x7F
ovl 1
else d
48:55
ABS(a
48:55
)
if (a
56:63
= 0x80) then
d
56:63
0x7F
ovl 1
else d
56:63
ABS(a
56:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
0 40 32 63
a
d
ABS
8 16 24 48 56
&SAT
ABS
&SAT
ABS
&SAT
ABS
&SAT
ABS
&SAT
ABS
&SAT
ABS
&SAT
ABS
&SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-23
__ev_absd __ev_absd
Vector Absolute Value Doubleword
d = __ev_absd (a)
d
0:63
ABS(a
0:63
)
The absolute value of the doubleword in parameter a is placed into parameter d. The absolute value
of 0x8000_0000_0000_0000 (most negative number) returns 0x8000_0000_0000_0000. No
overflow is detected.
Figure 3-8. Vector Absolute Value Doubleword (__ev_absd)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabsd d,a
0 63
a
ABS
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-24 Freescale Semiconductor
SPE2 Operations
__ev_absdifsb __ev_absdifsb
Vector Absolute Difference of Signed Bytes
d = __ev_absdifsb (a,b)
d
0:7
ABS(b
0:7
- a
0:7
)
d
8:15
ABS(b
8:15 -
a
8:15
)
d
16:23
ABS(b
16:23
- a
16:23
)
d
24:31
ABS(b
24:31
- a
24:31
)
d
32:39
ABS(b
32:39
- a
32:39
)
d
40:47
ABS(b
40:47
- a
40:47
)
d
48:55
ABS(b
48:55
- a
48:55
)
d
56:63
ABS(b
56:63
- a
56:63
)
The eight signed byte elements of parameter a are subtracted from the corresponding elements of
parameter b and the absolute values of the differences are placed into parameter d.
Figure 3-9. Vector Absolute Difference of Signed Bytes (__ev_absdifsb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evabsdifsb d,a,b
0 40 32 63
a
d
ABS ABS ABS ABS ABS ABS ABS ABS
8 16 24 48 56
b
- - - - - - - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-25
__ev_absdifsh __ev_absdifsh
Vector Absolute Difference of Signed Half Words
d = __ev_absdifsh (a,b)
d
0:15
ABS(b
0:15
- a
0:15
)
d
16:31
ABS(b
16:31
- a
16:31
)
d
32:47
ABS(b
32:47
- a
32:47
)
d
48:63
ABS(b
48:63
- a
48:63
)
The four signed half word elements of parameter a are subtracted from the corresponding elements
of parameter b and the absolute values of the differences are placed into parameter d.
Figure 3-10. Vector Absolute Difference of Signed Half Words (__ev_absdifsh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evabsdifsh d,a,b
b
0 32 63
a
16 48
-
- - -
d
ABS ABS ABS ABS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-26 Freescale Semiconductor
SPE2 Operations
__ev_absdifsw __ev_absdifsw
Vector Absolute Difference of Signed Words
d = __ev_absdifsw (a,b)
d
0:31
ABS(b
0:31
- a
0:31
)
d
32:63
ABS(b
32:63
- a
32:63
)
The signed word elements of parameter a are subtracted from the corresponding elements of
parameter b and the absolute values of the differences are placed into parameter d.
Figure 3-11. Vector Absolute Difference of Signed Words (__ev_absdifsw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evabsdifsw d,a,b
d
b
0 32 63
a
-
-
ABS ABS
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-27
__ev_absdifub __ev_absdifub
Vector Absolute Difference of Unsigned Bytes
d = __ev_absdifub (a,b)
d
0:7
ABS(b
0:7
- a
0:7
)
d
8:15
ABS(b
8:15 -
a
8:15
)
d
16:23
ABS(b
16:23
- a
16:23
)
d
24:31
ABS(b
24:31
- a
24:31
)
d
32:39
ABS(b
32:39
- a
32:39
)
d
40:47
ABS(b
40:47
- a
40:47
)
d
48:55
ABS(b
48:55
- a
48:55
)
d
56:63
ABS(b
56:63
- a
56:63
)
The eight unsigned byte elements of parameter a are subtracted from the corresponding elements
of parameter b and the absolute values of the differences are placed into parameter d.
Figure 3-12. Vector Absolute Difference of Unsigned Bytes (__ev_absdifub)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evabsdifub d,a,b
0 40 32 63
a
d
ABS ABS ABS ABS ABS ABS ABS ABS
8 16 24 48 56
b
- - - - - - - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-28 Freescale Semiconductor
SPE2 Operations
__ev_absdifuh __ev_absdifuh
Vector Absolute Difference of Unsigned Half Words
d = __ev_absdifuh (a,b)
d
0:15
ABS(b
0:15
- a
0:15
)
d
16:31
ABS(b
16:31
- a
16:31
)
d
32:47
ABS(b
32:47
- a
32:47
)
d
48:63
ABS(b
48:63
- a
48:63
)
The four unsigned half word elements of parameter a are subtracted from the corresponding
elements of parameter b and the absolute values of the differences are placed into parameter d.
Figure 3-13. Vector Absolute Difference of Unsigned Half Words (__ev_absdifuh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evabsdifuh d,a,b
b
0 32 63
a
16 48
-
- - -
d
ABS ABS ABS ABS
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-29
__ev_absdifuw __ev_absdifuw
Vector Absolute Difference of Unsigned Words
d = __ev_absdifuw (a,b)
d
0:31
ABS(b
0:31
- a
0:31
)
d
32:63
ABS(b
32:63
- a
32:63
)
The unsigned word elements of parameter a are subtracted from the corresponding elements of
parameter b and the absolute values of the differences are placed into parameter d.
Figure 3-14. Vector Absolute Difference of Unsigned Words (__ev_absdifuw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evabsdifuw d,a,b
d
b
0 32 63
a
-
-
ABS ABS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-30 Freescale Semiconductor
SPE2 Operations
__ev_absds __ev_absds
Vector Absolute Value Doubleword and Saturate
d = __ev_absds (a)
if (a
0:63
= 0x8000_0000_0000_0000) then
d
0:63
0x7FFF_FFFF_FFFF_FFFF
ov 1
else
d
0:63
ABS(a
0:63
)
ov 0
endif
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The absolute value of the doubleword in parameter a is placed into parameter d. The absolute value
of 0x8000_0000_0000_0000 (most negative number) returns 0x7FFF_FFFF_FFFF_FFFF. Any
overflow is reported in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-15. Vector Absolute Value Doubleword and Saturate (__ev_absds)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabsds d,a
0 63
a
ABS&SAT
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-31
__ev_absh __ev_absh
Vector Absolute Value Half Word
d = __ev_absh (a)
d
0:15
ABS(a
0:15
)
d
16:31
ABS(a
16:31
)
d
32:47
ABS(a
32:47
)
d
48:63
ABS(a
48:63
)
The absolute value of each half word element of parameter a is placed in the corresponding
elements of parameter d. An absolute value of 0x8000 (most negative number) returns 0x8000. No
overflow is detected.
Figure 3-16. Vector Absolute Value Half Word (__ev_absh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabsh d,a
0 31 32 63 15 16 48 47
a
d
ABS ABS ABS ABS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-32 Freescale Semiconductor
SPE2 Operations
__ev_abshs __ev_abshs
Vector Absolute Value Half Word and Saturate
d = __ev_abshs (a)
The absolute value of each half word element of parameter a is placed into parameter d. The
absolute value of 0x8000 (most negative number) returns 0x7FFF. Any overflow is reported in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-17. Vector Absolute Value Half Word and Saturate (__ev_abshs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabshs d,a
if (a
0:15
= 0x8000) then
d
0:15
0x7FFF
ovh 1
else
d
0:15
ABS(a
0:15
)
ovh 0
endif
if (a
16:31
= 0x8000) then
d
16:31
0x7FFF
ovh 1
else d
16:31
ABS(a
16:31
)
if (a
32:47
= 0x8000) then
d
32:47
0x7FFF
ovl 1
else
d
32:47
ABS(a
32:47
)
ovl 0
endif
if (a
48:63
= 0x8000) then
d
48:63
0x7FFF
ovl 1
else d
48:63
ABS(a
48:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
d
0 31 32 63
ABS&SAT ABS&SAT ABS&SAT ABS&SAT
15 16 48 47
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-33
__ev_abss __ev_abss
Vector Absolute Value (Word) and Saturate
d = __ev_abss (a)
The absolute value of each word element of parameter a is placed into parameter d. The absolute
value of 0x8000_0000 (most negative number) returns 0x7FFF_FFFF. Any overflow is reported
in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-18. Vector Absolute Value (Word) and Saturate (__ev_abss)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evabss d,a
if (a
0:31
= 0x8000_0000) then
d
0:31
0x7FFF_FFFF
ovh 1
else
d
0:31
ABS(a
0:31
)
ovh 0
endif
if (a
32:63
= 0x8000_0000) then
d
32:63
0x7FFF_FFFF
ovl 1
else
d
32:63
ABS(a
32:63
)
ovl 0
endif
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
0 31 32 63
a
d
ABS&SAT ABS&SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-34 Freescale Semiconductor
SPE2 Operations
__ev_add2subf2h __ev_add2subf2h
Vector Add Upper 2 and Subtract from Lower 2 Half Words
d = __ev_add2subf2h (a,b)
d
0:15
b
0:15
+ a
0:15
// Modulo sum
d
16:31
b
16:31
+ a
16:31
// Modulo sum
d
32:47
b
32:47
- a
32:47
// Modulo difference, b - a
d
48:63
b
48:63
- a
48:63
// Modulo difference, b - a
The upper two half word elements of parameter a are added to the upper 2 half word elements of
parameter b, the lower two half word elements of parameter a are subtracted from the lower two
half word elements of parameter b, and the results are placed in parmaeter d. The sum and
difference are modulo.
Figure 3-19. Vector Add upper 2 and Subtract from lower 2 Half Words (__ev_add2subf2h)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evadd2subf2h d,a,b
a
0 31 32 63
b
15 16 47 48
-
d
- + +
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-35
__ev_add2subf2hss __ev_add2subf2hss
Vector Add Upper 2 and Subtract from Lower 2 Half Words Signed and Saturate
d = __ev_add2subf2hss (a,b)
The upper two signed half word elements of parameter a are added to the upper two signed half
word elements of parameter b, the lower two signed half word elements of parameter a are
subtracted from the lower two signed half word elements of parameter b, saturating if overflow or
underflow occurs, and the results are placed in parameter d. Any overflow or underflow is recorded
in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-20. Vector Add upper 2 and Subtract from lower 2 Half Words Signed and
Saturate (__ev_add2subf2hss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evadd2subf2hss d,a,b
// h0
temp
0:31
EXTS(b
0:15
) + EXTS(a
0:15
)
ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000,
0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) + EXTS(a
16:31
)
ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000,
0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) - EXTS(a
32:47
)
ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000,
0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) - EXTS(a
48:63
)
ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000,
0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
a
0 31 32 63
b
15 16 47 48
-
d
- + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-36 Freescale Semiconductor
SPE2 Operations
__ev_addsmiaa __ev_addsmiaa
Vector Add to Accumulator
d = __ev_addsmiaa (a)
d
0:63
ACC
0:63
+ a
0:63
ACC
0:63
d
0:63
The 64-bit value in parameter a is added to the accumulator and the results are placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-21. Vector Add to Accumulator (__ev_addsmiaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evaddsmiaa d,a
0 63
a
Accumulator
+
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-37
__ev_addb __ev_addb
Vector Add Bytes
d = __ev_addb (a,b)
d
0:7
a
0:7
+ b
0:7
// Modulo sum
d
8:15
a
8:15 +
b
8:15
// Modulo sum
d
16:23
a
16:23
+ b
16:23
// Modulo sum
d
24:31
a
24:31
+ b
24:31
// Modulo sum
d
32:39
a
32:39
+ b
32:39
// Modulo sum
d
40:47
a
40:47
+ b
40:47
// Modulo sum
d
48:55
a
48:55
+ b
48:55
// Modulo sum
d
56:63
a
56:63
+ b
56:63
// Modulo sum
The eight byte elements of parameter a are added to the corresponding elements of parameter b
and the results are placed in parameter d. The sum is a modulo sum.
Figure 3-22. Vector Add Bytes (__ev_addb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddb d,a,b
0 40 32 63
a
d
8 16 24 48 56
b
+ + + + + + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-38 Freescale Semiconductor
SPE2 Operations
__ev_addbss __ev_addbss
Vector Add Bytes Signed and Saturate
d = __ev_addbss (a,b)
The eight signed byte elements of parameter a are added to the corresponding signed elements of
parameter b, saturating if overflow or underflow occurs, and the results are placed in parameter d.
Any overflow or underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-23. Vector Add Bytes Signed and Saturate (__ev_addbss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddbss d,a,b
temp
0:8
EXTS(a
0:7
) + EXTS(b
0:7
)
ovb0 temp
0
temp
1
d
0:7
SATURATE(ovb0,temp
0
,0x80,0x7f,temp
1:8
)
temp
0:8
EXTS(a
8:15
) + EXTS(b
8:15
)
ovb1 temp
0
temp
1
d
8:15
SATURATE(ovb1,temp
0
,0x80,0x7f,temp
1:8
)
temp
0:8
EXTS(a
16:23
) + EXTS(b
16:23
)
ovb2 temp
0
temp
1
d
16:23
SATURATE(ovb2,temp
0
,0x80,0x7f,temp
1:8
)
temp
0:8
EXTS(a
24:31
) + EXTS(b
24:31
)
ovb3 temp
0
temp
1
d
24:31
SATURATE(ovb3,temp
0
,0x80,0x7f,temp
1:8
)
temp
0:8
EXTS(a
32:39
) + EXTS(b
32:39
)
ovb4 temp
0
temp
1
d
32:39
SATURATE(ovb4,temp
0
,0x80,0x7f,temp
1:8
)
temp
0:8
EXTS(a
40:47
) + EXTS(b
40:47
)
ovb5 temp
0
temp
1
d
40:47
SATURATE(ovb5,temp
0
,0x80,0x7f,temp
1:8
temp
0:8
EXTS(a
48:55
) + EXTS(b
48:55
)
ovb6 temp
0
temp
1
d48:55 SATURATE(ovb6,temp
0
,0x80,0x7f,temp
1:
temp
0:8
EXTS(a
56:63
) + EXTS(b
56:63
)
ovb7 temp
0
temp
1
d
56:63
SATURATE(ovb7,temp
0
,0x80,0x7f,temp
1:8
ovh ovb0 | ovb1 | ovb2 | ovb3
ovl ovh4 | ovh5 | ovb6 | ovb7
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
0 40 32 63
a
d
8 16 24 48 56
b
+ + + + + + + +
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-39
__ev_addbus __ev_addbus
Vector Add Bytes Unsigned and Saturate
d = __ev_addbus (a,b)
The eight unsigned byte elements of parameter a are added to the corresponding unsigned
elements of parameter b, saturating if overflow occurs, and the results are placed in parameter d.
Any overflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-24. Vector Add Bytes Unsigned and Saturate (__ev_addbus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddbus d,a,b
temp
0:8
EXTS(a
0:7
) + EXTS(b
0:7
)
ovb0 temp
0
temp
1
d
0:7
SATURATE(ovb0,temp
0
,0x80,0x7f,temp
1:8
)
temp
0:8
EXTZ(a
0:7
) + EXTZ(b
0:7
)
ovb0 temp
0
d
0:7
SATURATE(ovb0,temp
0
,0xff,0xff,temp
1:8
)
temp
0:8
EXTZ(a
8:15
) + EXTZ(b
8:15
)
ovb1 temp
0
d
8:15
SATURATE(ovb1,temp
0
,0xff,0xff,temp
1:8
)
temp
0:8
EXTZ(a
16:23
) + EXTZ(b
16:23
)
ovb2 temp
0
d
16:23
SATURATE(ovb2,temp
0
,0xff,0xff,temp
1:8
)
temp
0:8
EXTZ(a
24:31
) + EXTZ(b
24:31
)
ovb3 temp
0
d
24:31
SATURATE(ovb3,temp
0
,0xff,0xff,temp
1:8
)
temp
0:8
EXTZ(a
32:39
) + EXTZ(b
32:39
)
ovb4 temp
0
d
32:39
SATURATE(ovb4,temp
0
,0xff,0xff,temp
1:8
temp
0:8
EXTZ(a
40:47
) + EXTZ(b
40:47
)
ovb5 temp
0
d
40:47
SATURATE(ovb5,temp
0
,0xff,0xff,temp
1:8
temp
0:8
EXTZ(a
48:55
) + EXTZ(b
48:55
)
ovb6 temp
0
d
48:55
SATURATE(ovb6,temp
0
,0xff,0xff,temp
1:8
temp
0:8
EXTZ(a
56:63
) + EXTZ(b
56:63
)
ovb7 temp
0
d
56:63
SATURATE(ovb7,temp
0
,0xff,0xff,temp
1:8
ovh ovb0 | ovb1 | ovb2 | ovb3
ovl ovh4 | ovh5 | ovb6 | ovb7
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
0 40 32 63
a
d
8 16 24 48 56
b
+ + + + + + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-40 Freescale Semiconductor
SPE2 Operations
__ev_addd __ev_addd
Vector Add Doubleword
d = __ev_addd (a,b)
d
0:63
a
0:63
+ b
0:63
The 64-bit value in parameter a is added to the 64-bit value in parameter b and the results are
placed into parameter d.
Figure 3-25. Vector Add Doubleword (__ev_addd)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddd d,a,b
0 63
a
b
+
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-41
__ev_adddss __ev_adddss
Vector Add Doubleword Signed and Saturate
d = __ev_adddss (a,b)
temp
0:64
EXTS
65
(b
0:63
) + EXTS
65
(a
0:63
)
ov temp
0
temp
1
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7fff_ffff_ffff_ffff, temp
1:64
)
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov.
The signed doubleword in parameter a is added to the signed doubleword in parameter b,
saturating if overflow or underflow occurs, and the result is placed into parameter d. Any overflow
or underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-26. Vector Add Doubleword Signed and Saturate (__ev_adddss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evadddss d,a,b
0 63
a
b
+
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-42 Freescale Semiconductor
SPE2 Operations
__ev_adddus __ev_adddus
Vector Add Doubleword Unsigned and Saturate
d = __ev_adddus (a,b)
temp
0:64
EXTZ
65
(b
0:63
) + EXTZ
65
(a
0:63
)
ov temp
0
d
0:63
SATURATE(ov, temp
0
, 0xffff_ffff_ffff_ffff, 0xffff_ffff_ffff_ffff, temp
1:64
)
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov.
The unsigned doubleword in parameter a is added to the unsigned doubleword in parameter b,
saturating if overflow or underflow occurs, and the result is placed into parameter d. Any overflow
or underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-27. Vector Add Doubleword Unsigned and Saturate (__ev_adddus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evadddus d,a,b
0 63
a
b
+
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-43
__ev_addh __ev_addh
Vector Add Half Words
d = __ev_addh (a,b)
d
0:15
a
0:15
+ b
0:15
// Modulo sum
d
16:31
a
16:31
+ b
16:31
// Modulo sum
d
32:47
a
32:47
+ b
32:47
// Modulo sum
d
48:63
a
48:63
+ b
48:63
// Modulo sum
The four half word elements of parameter a are added to the four half word elements of parameter
b and the results are placed in parameter d. The sum is a modulo sum.
Figure 3-28. Vector Add Half Words (__ev_addh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddh d,a,b
b
0 32 63
a
16 48
+
+ + +
d
15 31 47
SPE2 Programming Interface Manual, Rev. 1.0-2
3-44 Freescale Semiconductor
SPE2 Operations
__ev_addhhisw __ev_addhhisw
Vector Add Half Words High Signed to Words
d = __ev_addhhisw (a,b)
d
0:31
EXTS
32
(a
0:15
) + EXTS
32
(b
0:15
)// Modulo
d
32:63
EXTS
32
(a
16:31
) + EXTS
32
(b
16:31
) // Modulo
The high halfword elements of parameter a are sign-extended to 32 bits and added to the respective
sign-extended high halfword elements of parameter b and the 32-bit results are placed into
parameter d. The sum is a modulo sum.
Figure 3-29. Vector Add Halfwords High Signed to Words (__ev_addhhisw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhhisw d,a,b
b
0 31 32 63
a
15 16 47 48
d
+ +
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-45
__ev_addhhiuw __ev_addhhiuw
Vector Add Half Words High Unsigned to Words
d = __ev_addhhiuw (a,b)
d
0:31
EXTZ
32
(a
0:15
) + EXTZ
32
(b
0:15
)// Modulo
d
32:63
EXTZ
32
(a
16:31
) + EXTZ
32
(b
16:31
) // Modulo
The high halfword elements of parameter a are zero-extended to 32 bits and added to the respective
zero-extended high halfword elements of parameter b and the 32-bit results are placed into
parameter d. The sum is a modulo sum.
Figure 3-30. Vector Add Halfwords High Unsigned to Words (__ev_addhhiuw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhhiuw d,a,b
b
0 31 32 63
a
15 16 47 48
d
+ +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-46 Freescale Semiconductor
SPE2 Operations
__ev_addhlosw __ev_addhlosw
Vector Add Half Words Low Signed to Words
d = __ev_addhlosw (a,b)
d
0:31
EXTS
32
(a
32:47
) + EXTS
32
(b
32:47
)// Modulo
d
32:63
EXTS
32
(a
48:63
) + EXTS
32
(b
48:63
) // Modulo
The low halfword elements of parameter a are sign-extended to 32 bits and added to the respective
sign-extended low halfword elements of parameter b and the 32-bit results are placed into
parameter d. The sum is a modulo sum.
Figure 3-31. Vector Add Halfwords Low Signed to Words (__ev_addhlosw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhlosw d,a,b
b
0 31 32 63
a
15 16 47 48
d
+ +
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-47
__ev_addhlouw __ev_addhlouw
Vector Add Half Words Low Unsigned to Words
d = __ev_addhlouw (a,b)
d
0:31
EXTZ
32
(a
32:47
) + EXTZ
32
(b
32:47
)// Modulo
d
32:63
EXTZ
32
(a
48:63
) + EXTZ
32
(b
48:63
) // Modulo
The low halfword elements of parameter a are zero-extended to 32 bits and added to the respective
zero-extended low halfword elements of parameter b and the 32-bit results are placed into
parameter d. The sum is a modulo sum.
Figure 3-32. Vector Add Halfwords Low Unsigned to Words (__ev_addhlouw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhlouw d,a,b
b
0 31 32 63
a
15 16 47 48
d
+ +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-48 Freescale Semiconductor
SPE2 Operations
__ev_addhss __ev_addhss
Vector Add Half Words Signed and Saturate
d = __ev_addhss (a,b)
// h0
temp
0:31
EXTS(a
0:15
) + EXTS(b
0:15
); ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0,temp
15
,0x8000,0x7fff,temp
16:31
)
// h1
temp
0:31
EXTS(a
16:31
) + EXTS(b
16:31
); ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(a
32:47
) + EXTS(b
32:47
); ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(a
48:63
) + EXTS(b
48:63
); ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The four signed half word elements of parameter a are added to the corresponding four signed half
word elements of parameter b, saturating if overflow or underflow occurs, and the results are
placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-33. Vector Add Half Words Signed and Saturate (__ev_addhss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhss d,a,b
b
0 32 63
a
16 48
+
+ + +
d
15 31 47
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-49
__ev_addhus __ev_addhus
Vector Add Half Words Unsigned and Saturate
d = __ev_addhus (a,b)
// h0
temp
0:31
EXTZ(a
0:15
) + EXTZ(b
0:15
); ovh0 temp
15
d
0:15
SATURATE(ovh0, temp
15
, 0xffff, 0xffff, temp
16:31
)
// h1
temp
0:31
EXTZ(a
16:31
) + EXTZ(b
16:31
); ovh1 temp
15
d
16:31
SATURATE(ovh1, temp
15
, 0xffff, 0xffff, temp
16:31
)
// h2
temp
0:31
EXTZ(a
32:47
) + EXTZ(b
32:47
); ovh2 temp
15
d
32:47
SATURATE(ovh2, temp
15
, 0xffff, 0xffff, temp
16:31
)
// h3
temp
0:31
EXTZ(a
48:63
) + EXTZ(b
48:63
); ovh3 temp
15
d
48:63
SATURATE(ovh3, temp
15
, 0xffff, 0xffff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The four unsigned half word elements of parameter a are added to four unsigned half word
elements of parameter b, saturating if overflow occurs, and the results are placed in parameter d.
Any overflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-34. Vector Add Half Words Unsigned and Saturate (__ev_addhus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhus d,a,b
b
0 32 63
a
16 48
+
+ + +
d
15 31 47
SPE2 Programming Interface Manual, Rev. 1.0-2
3-50 Freescale Semiconductor
SPE2 Operations
__ev_addhx __ev_addhx
Vector Add Half Words Exchanged
d = __ev_addhx (a,b)
// h0
d
0:15
a
0:15
+ b
16:31
// modulo sum
// h1
d
16:31
a
16:31
+ b
0:15
// modulo sum
// h2
d
32:47
a
32:47
+ b
48:63
// modulo sum
// h3
d
48:63
a
48:63
+ b
32:47
// modulo sum
The four half word elements of parameter a are added to four exchanged half word elements of
parameter b and the results are placed in parameter d. The sum is a modulo sum.
Figure 3-35. Vector Add Half Words Exchanged (__ev_addhx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhx d,a,b
b
0 32 63
a
16 48
+
+ + +
d
15 31 47
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-51
__ev_addhxss __ev_addhxss
Vector Add Half Words Exchanged, Signed and Saturate
d = __ev_addhxss (a,b)
// h1
temp
0:31
EXTS(a
16:31
) + EXTS(b
0:15
)
ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(a
32:47
) + EXTS(b
48:63
)
ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(a
48:63
) + EXTS(b
32:47
)
ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The four signed half word elements of parameter a are added to four exchanged signed half word
elements of parameter b, saturating if overflow or underflow occurs, and the results are placed in
parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-36. Vector Add Half Words Exchanged, Signed and Saturate (__ev_addhxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhxss d,a,b
b
0 32 63
a
16 48
+
+ + +
d
15 31 47
SPE2 Programming Interface Manual, Rev. 1.0-2
3-52 Freescale Semiconductor
SPE2 Operations
__ev_addhxus __ev_addhxus
Vector Add Half Words Exchanged, Unsigned and Saturate
d = __ev_addhxus (a,b)
// h0
temp
0:31
EXTZ(a
0:15
) + EXTZ(b
16:31
); ovh0 temp
15
d
0:15
SATURATE(ovh0, temp
15
, 0xffff, 0xffff, temp
16:31
)
// h1
temp
0:31
EXTZ(a
16:31
) + EXTZ(b
0:15
); ovh1 temp
15
d
16:31
SATURATE(ovh1, temp
15
, 0xffff, 0xffff, temp
16:31
)
// h2
temp
0:31
EXTZ(a
32:47
) + EXTZ(b
48:63
); ovh2 temp
15
d
32:47
SATURATE(ovh2, temp
15
, 0xffff, 0xffff, temp
16:31
)
// h3
temp
0:31
EXTZ(a
48:63
) + EXTZ(b
32:47
); ovh3 temp
15
d
48:63
SATURATE(ovh3, temp
15
, 0xffff, 0xffff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The four unsigned half word elements of parameter a are added to four exchanged unsigned half
word elements of parameter b, saturating if overflow occurs, and the results are placed in
parameter d. Any overflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-37. Vector Add Half Words Exchanged, Unsigned and Saturate (__ev_addhxus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddhxus d,a,b
b
0 32 63
a
16 48
+
+ + +
d
15 31 47
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-53
__ev_addib __ev_addib
Vector Add Immediate Byte
d = __ev_addib (a,b)
d
0:7
a
0:7
+ EXTZ(b) // Modulo sum
d
8:15
a
8:15
+ EXTZ(b) // Modulo sum
d
16:23
a
16:23
+ EXTZ(b) // Modulo sum
d
24:31
a
24:31
+ EXTZ(b) // Modulo sum
d
32:39
a
32:39
+ EXTZ(b) // Modulo sum
d
40:47
a
40:47
+ EXTZ(b) // Modulo sum
d
48:55
a
48:55
+ EXTZ(b) // Modulo sum
d
56:63
a
56:63
+ EXTZ(b) // Modulo sum
Parameter b is zero-extended and added to the eight byte elements of parameter a and the results
are placed in parameter d.
Figure 3-38. Vector Add Immediate Byte (__ev_addib)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evaddib d,a,b
a
d
EXTZ(b)
0 32 63 24 16 8 40 48 56
0 32 63 24 16 8 40 48 56
EXTZ(b) EXTZ(b) EXTZ(b) EXTZ(b) EXTZ(b) EXTZ(b) EXTZ(b)
+ + + + + + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-54 Freescale Semiconductor
SPE2 Operations
__ev_addih __ev_addih
Vector Add Immediate Half Word
d = __ev_addih (a,b)
d
0:15
a
0:15
+ EXTZ(b)// Modulo sum
d
16:31
a
16:31
+ EXTZ(b)// Modulo sum
d
32:47
a
32:47
+ EXTZ(b)// Modulo sum
d
48:63
a
48:63
+ EXTZ(b)// Modulo sum
Parameter b is zero-extended and added to the four half word elements of parameter a and the
results are placed in parameter d.
NOTE
The same value is added to all elements of parameter a.
Figure 3-39. Vector Add Immediate Half Word (__ev_addih)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evaddih d,a,b
0 32 63
a
16 48
+
+ + +
d
15 31 47
EXTZ(b) EXTZ(b) EXTZ(b) EXTZ(b)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-55
__ev_addiw __ev_addiw
Vector Add Immediate Word
d = __ev_addiw (a,b)
d
0:31
a
0:31
+ EXTZ(b)// Modulo sum
d
32:63
a
32:63
+ EXTZ(b)// Modulo sum
Parameter b is zero-extended and added to both the high and low elements of parameter a and the
results are placed in the parameter d.
NOTE
The same value is added to both elements of parameter a.
Figure 3-40. Vector Add Immediate Word (__ev_addiw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evaddiw d,a,b
0 31 32 63
a
EXTZ(b)
+ +
d
EXTZ(b)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-56 Freescale Semiconductor
SPE2 Operations
__ev_addsmiaaw __ev_addsmiaaw
Vector Add Signed, Modulo, Integer to Accumulator Word
d = __ev_addsmiaaw (a)
d
0:31
ACC
0:31
+ a
0:31//
low
d
32:63
ACC
32:63
+ a
32:63
// high
// update accumulator
ACC
0:63
d
0:63
Each word element in parameter a is added to the corresponding element in the accumulator and
the results are placed in parameter d and into the accumulator.
Other registers altered: ACC
Figure 3-41. Vector Add Signed, Modulo, Integer to Accumulator Word (__ev_addsmiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evaddsmiaaw d,a
0 31 32 63
a
Accumulator
+ +
d and accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-57
__ev_addssiaa __ev_addssiaa
Vector Add Signed, Saturate, Integer to Accumulator
d = __ev_addssiaa (a)
//
temp
0:64
EXTS(ACC
0:63
) + EXTS(a
0:63
)
ov temp
0
temp
1
d
0:31
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7fff_ffff_ffff_ffff, temp
1:64
)
ACC
0:63
d
0:6
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The signed 64-bit value in parameter a is added to the accumulator saturating if overflow or
underflow occurs, and the results are placed into parameter d and the accumulator. Any overflow
or underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-42. Vector Add Signed, Saturate, Integer to Accumulator (__ev_addssiaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evaddssiaa d,a
0 63
a
Accumulator
+
SSAT
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-58 Freescale Semiconductor
SPE2 Operations
__ev_addssiaaw __ev_addssiaaw
Vector Add Signed, Saturate, Integer to Accumulator Word
d = __ev_addssiaaw (a)
// high
temp
0:63
EXTS(ACC
0:31
) + EXTS(a
0:31
)
ovh temp
31
temp
32
d
0:31
SATURATE(ovh, temp
31
, 0x80000000, 0x7fffffff, temp
32:63
)
// low
temp
0:63
EXTS(ACC
32:63
) + EXTS(a
32:63
)
ovl temp
31
temp
32
d
32:63
SATURATE(ovl, temp
31
, 0x80000000, 0x7fffffff, temp
32:63
)
ACC
0:63
d
0:63
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
Each signed integer word element in parameter a is sign-extended and added to the corresponding
sign-extended element in the accumulator, saturating if overflow or underflow occurs, and the
results are placed in parameter d and the accumulator. Any overflow or underflow is recorded in
the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-43. Vector Add Signed, Saturate, Integer to Accumulator Word (__ev_addssiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evaddssiaaw d,a
0 31 32 63
a
Accumulator
+ +
d and accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-59
__ev_addsubfh __ev_addsubfh
Vector Add / Subtract from Half Word
d = __ev_addsubfh (a,b)
d
0:15
b
0:15
+ a
0:15
// Modulo sum
d
16:31
b
16:31
- a
16:31
// Modulo difference, b - a
d
32:47
b
32:47
+ a
32:47
// Modulo sum
d
48:63
b
48:63
- a
48:63
// Modulo difference, b - a
The even half word elements of parameter a are added to the even half word elements of parameter
b, the odd half word elements of parameter a are subtracted from the odd half word elements of
parameter b, and the results are placed in parameter d. The sum and difference are modulo.
Figure 3-44. Vector Add / Subtract from Half Word (__ev_addsubfh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfh d,a,b
a
0 32 63
b
16 48
+
- + -
d
15 31 47
SPE2 Programming Interface Manual, Rev. 1.0-2
3-60 Freescale Semiconductor
SPE2 Operations
__ev_addsubfhss __ev_addsubfhss
Vector Add / Subtract from Half Word Signed and Saturate
d = __ev_addsubfhss (a,b)
// h0
temp
0:31
EXTS(b
0:15
) + EXTS(a
0:15
); ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) - EXTS(a
16:31
); ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) + EXTS(a
32:47
); ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) - EXTS(a
48:63
); ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The even signed half word elements of parameter a are added to the even signed half word
elements of parameter b, the odd signed half word elements of parameter a are subtracted from the
odd signed half word elements of parameter b, saturating if overflow or underflow occurs, and the
results are placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-45. Vector Add / Subtract from Half Word Signed and Saturate (__ev_addsubfhss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfhss d,a,b
a
0 32 63
b
16 48
+
- + -
d
15 31 47
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-61
__ev_addsubfhx __ev_addsubfhx
Vector Add / Subtract from Half Word Exchanged
d = __ev_addsubfhx (a,b)
// h0
d
0:15
b
0:15
+ a
16:31
// modulo
// h1
d
16:31
b
16:31
- a
0:15
// modulo
// h2
d
32:47
b
32:47
+ a
48:63
// modulo
// h3
d
48:63
b
48:63
- a
32:47
// modulo
The odd exchanged half word elements of parameter a are added to the even half word elements
of parameter b, the even exchanged half words of parameter a are subtracted from the odd half
words of parameter b, and the results are placed in parameter d. The sum and differences are
modulo.
Figure 3-46. Vector Add / Subtract from Half Word Exchanged (__ev_addsubfhx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfhx d,a,b
a
0 32 63
b
16 48
+
- + -
d
15 31 47
SPE2 Programming Interface Manual, Rev. 1.0-2
3-62 Freescale Semiconductor
SPE2 Operations
__ev_addsubfhxss __ev_addsubfhxss
Vector Add / Subtract from Half Word Exchanged, Signed and Saturate
d = __ev_addsubfhxss (a,b)
// h0
temp
0:31
EXTS(b
0:15
) + EXTS(a
16:31
); ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) - EXTS(a
0:15
); ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) + EXTS(a
48:63
); ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) - EXTS(a
32:47
); ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The odd exchanged signed half word elements of parameter a are added to the even signed half
word elements of parameter b, the even exchanged signed half words of parameter a are subtracted
from the odd signed half words of parameter b, saturating if overflow or underflow occurs, and the
results are placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-47. Vector Add / Subtract from Half Word Exchanged, Signed and Saturate
(__ev_addsubfhxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfhxss d,a,b
a
0 32 63
b
16 48
+
- + -
d
15 31 47
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-63
__ev_addsubfw __ev_addsubfw
Vector Add / Subtract from Word
d = __ev_addsubfw (a,b)
d
0:31
a
0:31
+ b
0:31
// Modulo
d
32:63
a
32:63
- b
32:63
// Modulo
The high word element of parameter a is added to the high word element of parameter b, the low
word element of parameter a is subtracted from the low word element of parameter b, and the
results are placed in parameter d.
Figure 3-48. Vector Add / Subtract from Word (__ev_addsubfw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfw d,a,b
0 31 32 63
b
a
+ -
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-64 Freescale Semiconductor
SPE2 Operations
__ev_addsubfwss __ev_addsubfwss
Vector Add / Subtract from Word Signed and Saturate
d = __ev_addsubfwss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) + EXTS(a
0:31
)
ovh temp
0
temp
1
d
0:15
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:31
EXTS(b
32:63
) - EXTS(a
32:63
)
ovl temp
0
temp
1
d
16:31
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The high word element of parameter a is added to the high word element of parameter b, saturating
if overflow or underflow occurs, the low word element of parameter a is subtracted from the low
word element of parameter b, saturating if overflow or underflow occurs, and the results are placed
in parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-49. Vector Add / Subtract from Word Signed and Saturate (__ev_addsubfwss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfwss d,a,b
0 31 32 63
b
a
+ -
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-65
__ev_addsubfwx __ev_addsubfwx
Vector Add / Subtract from Word Exchanged
d = __ev_addsubfwx (a,b)
d
0:31
b
0:31
+ a
32:63
// Modulo
d
32:63
b
32:63
- a
0:31
// Modulo
The low word element of parameter a is added to the high word element of parameter b, the high
word element of parameter a is subtracted from the low word element of parameter b, and the
results are placed in parameter d.
Figure 3-50. Vector Add / Subtract from Word Exchanged (__ev_addsubfwx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfwx d,a,b
0 31 32 63
b
a
+ -
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-66 Freescale Semiconductor
SPE2 Operations
__ev_addsubfwxss __ev_addsubfwxss
Vector Add / Subtract from Word Exchanged Signed and Saturate
d = __ev_addsubfwxss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) + EXTS(a
32:63
)
ovh temp
0
temp
11
d
0:15
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:31
EXTS(b
32:63
) - EXTS(a
0:31
)
ovl temp
0
temp
1
rD
16:31
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The low word element of parameter a is added to the high word element of parameter b, saturating
if overflow or underflow occurs, the high word element of parameter a is subtracted from the low
word element of parameter b, saturating if overflow or underflow occurs, and the results are placed
in parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-51. Vector Add / Subtract from Word Exchanged Signed and Saturate
(__ev_addsubfwxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfwxss d,a,b
0 31 32 63
b
a
+ -
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-67
__ev_addumiaaw __ev_addumiaaw
Vector Add Unsigned, Modulo, Integer to Accumulator Word
d = __ev_addumiaaw (a)
d
0:31
ACC
0:31
+ a
0:31
d
32:63
ACC
32:63
+ a
32:63
ACC
0:63
d
0:63
Each unsigned integer word element in the parameter a is added to the corresponding element in
the accumulator and the results are placed in the parameter d and the accumulator.
Other registers altered: ACC
Figure 3-52. Vector Add Unsigned, Modulo, Integer to Accumulator Word
(__ev_addumiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evaddumiaaw d,a
0 31 32 63
a
Accumulator
+ +
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-68 Freescale Semiconductor
SPE2 Operations
__ev_addusiaa __ev_addusiaa
Vector Add Unsigned, Saturate, Integer to Accumulator
d = __ev_addusiaa (a)
temp
0:64
EXTZ(ACC
0:63
) + EXTZ(a
0:63
)
ov temp
0
d
0:63
SATURATE(ov, temp
0
, 0xffff_ffff_ffff_ffff, 0xffff_ffff_ffff_ffff, temp
1:64
)
ACC
0:63
d
0:63
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The unsigned 64-bit value in parameter a is added to the unsigned value in the accumulator
saturating if overflow occurs, and the results are placed into parameter d and the accumulator. Any
overflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-53. Vector Add Unsigned, Saturate, Integer to Accumulator (__ev_addusiaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evaddusiaa d,a
0 63
a
Accumulator
+
USAT
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-69
__ev_addusiaaw __ev_addusiaaw
Vector Add Unsigned, Saturate, Integer to Accumulator Word
d = __ev_addusiaaw (a)
// high
temp
0:63
EXTZ(ACC
0:31
) + EXTZ(a
0:31
)
ovh temp
31
d
0:31
SATURATE(ovh, temp
31
, 0xffffffff, 0xffffffff, temp
32:63
)
// low
temp
0:63
EXTZ(ACC
32:63
) + EXTZ(a
32:63
)
ovl temp
31
d
32:63
SATURATE(ovl, temp
31
, 0xffffffff, 0xffffffff, temp
32:63
)
ACC
0:63
d
0:63
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
Each unsigned integer word element in parameter a is zero-extended and added to the
corresponding zero-extended element in the accumulator, saturating if overflow occurs, and the
results are placed in parameter d and the accumulator. Any overflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-54. Vector Add Unsigned, Saturate, Integer to Accumulator Word
(__ev_addusiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evaddusiaaw d,a
0 31 32 63
a
Accumulator
+ +
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-70 Freescale Semiconductor
SPE2 Operations
__ev_addw __ev_addw
Vector Add Word
d = __ev_addw (a,b)
d
0:31
a
0:31
+ b
0:31
// Modulo sum
d
32:63
a
32:63
+ b
32:63
// Modulo sum
The corresponding elements of parameters a and b are added, and the results are placed in
parameter d. The sum is a modulo sum.
Figure 3-55. Vector Add Word (__ev_addw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddw d,a,b
0 31 32 63
a
b
+ +
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-71
__ev_addwegsf __ev_addwegsf
Vector Add Word Even Guarded Signed Fraction
d = __ev_addwegsf (a,b)
d
0:63
(EXTS
48
(b
0:31
) ||
16
0) + (EXTS
48
(a
0:31
) ||
16
0)
The even word elements of parameter a and parameter b are sign-extended with 16 guard bits and
padded with 16 0s, and then added together to produce a 64-bit sum, and the result is placed into
parameter d.
Note: __ev_addwegsf is used to add 1.31 fractions to produce a 17.47 fractional sum.
Figure 3-56. Vector Add Word Even Guarded Signed Fraction (__ev_addwegsf)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwegsf d,a,b
0 31 32 63
b
a
0 63
d
15 16 17
.
+
SPE2 Programming Interface Manual, Rev. 1.0-2
3-72 Freescale Semiconductor
SPE2 Operations
__ev_addwegsi __ev_addwegsi
Vector Add Word Even Guarded Signed Integer
d = __ev_addwegsi (a,b)
d
0:63
EXTS
64
(b
0:31
) + EXTS
64
(a
0:31
)
The even word elements of parameter a and parameter b are sign-extended to 64 bits and added
together to produce a 64-bit sum, and the result is placed into parameter d.
Note: __ev_addwegsi can also be used to add 1.31 fractions to produce a 33.31 fractional sum.
Figure 3-57. Vector Add Word (__ev_addwegsi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwegsi d,a,b
0 31 32 63
b
a
+
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-73
__ev_addwogsf __ev_addwogsf
Vector Add Word Odd Guarded Signed Fraction
d = __ev_addwogsf (a,b)
d
0:63
(EXTS
48
(b
32:63
) ||
16
0) + (EXTS
48
(a
32:63
) ||
16
0)
The odd word elements of parameter a and parameter b are sign-extended with 16 guard bits and
padded with 16 0s, and then added together to produce a 64-bit sum, and the result is placed into
parameter d.
Note: __ev_addwogsf is used to add 1.31 fractions to produce a 17.47 fractional sum.
Figure 3-58. Vector Add Word Odd Guarded Signed Fraction (__ev_addwogsf)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwogsf d,a,b
0 31 32 63
b
a
0 63
d
15 16 17
.
+
SPE2 Programming Interface Manual, Rev. 1.0-2
3-74 Freescale Semiconductor
SPE2 Operations
__ev_addwogsi __ev_addwogsi
Vector Add Word Odd Guarded Signed Integer
d = __ev_addwogsi (a,b)
d
0:63
EXTS
64
(b
32:63
) + EXTS
64
(a
32:63
)
The odd word elements of parameter a and parameter b are sign-extended to 64 bits and added
together to produce a 64-bit sum, and the result is placed into parameter d.
Note: __ev_addwogsi can also be used to add 1.31 fractions to produce a 33.31 fractional sum.
Figure 3-59. Vector Add Word Odd Guarded Signed Integer (__ev_addwogsi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwogsi d,a,b
0 31 32 63
b
a
+
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-75
__ev_addwss __ev_addwss
Vector Add Word Signed and Saturate
d = __ev_addwss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) + EXTS(a
0:31
)
ovh temp
0
temp
1
d
0:15
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:31
EXTS(b
32:63
) + EXTS(a
32:63
)
ovl temp
0
temp
1
d
16:31
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The high word element of parameter a is added to the high word element of parameter b, saturating
if overflow or underflow occurs, the low word element of parameter a is added to the low word
element of parameter b, saturating if overflow or underflow occurs, and the results are placed in
parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-60. Vector Add Word Signed and Saturate (__ev_addwss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwss d,a,b
0 31 32 63
b
a
+ +
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-76 Freescale Semiconductor
SPE2 Operations
__ev_addwus __ev_addwus
Vector Add Word Unsigned and Saturate
d = __ev_addwus (a,b)
// h0
temp
0:32
EXTZ(b
0:31
) + EXTZ(a
0:31
)
ovh temp
0
d
0:15
SATURATE(ovh, temp
0
, 0xffff_ffff, 0xffff_ffff, temp
1:32
)
// h1
temp
0:31
EXTZ(b
32:63
) + EXTZ(a
32:63
)
ovl temp
0
d
16:31
SATURATE(ovl, temp
0
, 0xffff_ffff, 0xffff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The high word element of parameter a is added to the high word element of parameter b, saturating
if overflow occurs, the low word element of parameter a is added to the low word element of
parameter b, saturating if overflow occurs, and the results are placed in parameter d. Any overflow
is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-61. Vector Add Word Unsigned and Saturate (__ev_addwus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwus d,a,b
0 31 32 63
b
a
+ +
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-77
__ev_addwx __ev_addwx
Vector Add Word Exchanged
d = __ev_addwx (a,b)
d
0:31
a
32:63
+ b
0:31
// Modulo sum
d
32:63
a
0:31
+ b
32:63
// Modulo sum
The exchanged word elements of parameter a are added to the elements of parameter b and the
results are placed in parameter d. The sum is a modulo sum.
Figure 3-62. Vector Add Word Exchanged (__ev_addwx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwx d,a,b
0 31 32 63
b
a
+ +
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-78 Freescale Semiconductor
SPE2 Operations
__ev_addwxss __ev_addwxss
Vector Add Word Exchanged Signed and Saturate
d = __ev_addwxss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) + EXTS(a
32:63
)
ovh temp
0
temp
1
d
0:15
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:31
EXTS(b
32:63
) + EXTS(a
0:31
)
ovl temp
0
temp
1
d
16:31
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The low word element of parameter a is added to the high word element of parameter b, saturating
if overflow or underflow occurs, the high word element of parameter a is added to the low word
element of parameter b, saturating if overflow or underflow occurs, and the results are placed in
parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-63. Vector Add Word Exchanged Signed and Saturate (__ev_addwxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwxss d,a,b
0 31 32 63
b
a
+ +
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-79
__ev_addwxus __ev_addwxus
Vector Add Word Exchanged Unsigned and Saturate
d = __ev_addwxus (a,b)
// h0
temp
0:32
EXTZ(b
0:31
) + EXTZ(a
32:63
)
ovh temp
0
d
0:15
SATURATE(ovh, temp
0
, 0xffff_ffff, 0xffff_ffff, temp
1:32
)
// h1
temp
0:31
EXTZ(b
32:63
) + EXTZ(a
0:31
)
ovl temp
0
d
16:31
SATURATE(ovl, temp
0
, 0xffff_ffff, 0xffff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The low word element of parameter a is added to the high word element of parameter b, saturating
if overflow occurs, the high word element of parameter a is added to the low word element of
parameter b, saturating if overflow occurs, and the results are placed in parameter d. Any overflow
is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-64. Vector Add Word Exchanged Unsigned and Saturate (__ev_addwxus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddwxus d,a,b
0 31 32 63
b
a
+ +
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-80 Freescale Semiconductor
SPE2 Operations
__ev_all_eq __ev_all_eq
Vector All Equal
d = __ev_all_eq (a,b)
if ( a
0:31
= b
0:31
) & (a
32:63
= b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are equal to the upper 32 bits of
parameter b and the lower 32 bits of parameter a are equal to the lower 32 bits of parameter b.
Figure 3-65. Vector All Equal (__ev_all_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpeq x,a,b
0 31 32 63
a
b
=
=
OR
AND
x
&
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-81
__ev_all_gts __ev_all_gts
Vector All Greater Than Signed
d = __ev_all_gts (a,b)
if ( (a
0:31
>
signed
b
0:31
) & (a
32:63
>
signed
b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are greater than the upper 32 bits
of parameter b and the lower 32 bits of parameter a are greater than the lower 32 bits of parameter
b.
Figure 3-66. Vector All Greater Than Signed (__ev_all_gts)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpgts x,a,b
0 31 32 63
a
b
>
>
x
&
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-82 Freescale Semiconductor
SPE2 Operations
__ev_all_gtu __ev_all_gtu
Vector All Elements Greater Than Unsigned
d = __ev_all_gtu (a,b)
if ( (a
0:31
>
unsigned
b
0:31
) & (a
32:63
>
unsigned
b
32:63
)) then d true
else a false
This intrinsic returns true if both the upper 32 bits of parameter a are greater than the upper 32 bits
of parameter b and the lower 32 bits of parameter a are greater than the lower 32 bits of parameter
b.
Figure 3-67. Vector All Greater Than Unsigned (__ev_all_gtu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpgtu x,a,b
0 31 32 63
a
b
> >
x
&
d
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-83
__ev_all_lts __ev_all_lts
Vector All Elements Less Than Signed
d = __ev_all_lts (a,b)
if ( (a
0:31
<
signed
b
0:31
) & (a
32:63
<
signed
b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b and the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
Figure 3-68. Vector All Less Than Signed (__ev_all_lts)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmplts x,a,b
0 31 32 63
a
b
< <
x
&
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-84 Freescale Semiconductor
SPE2 Operations
__ev_all_Itu __ev_all_Itu
Vector All Elements Less Than Unsigned
d = __ev_all_ltu (a,b)
if ( (a
0:31
<
unsigned
b
0:31
) & (a
32:63
<
unsigned
b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b and the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
Figure 3-69. Vector All Less Than Unsigned (__ev_all_ltu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpltu x,a,b
0 31 32 63
a
b
< <
x
&
d
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-85
__ev_and __ev_and
Vector AND
d = __ev_and (a,b)
d
0:31
a
0:31
& b
0:31
// Bitwise AND
d
32:63
a
32:63
& b
32:63
// Bitwise AND
The corresponding elements of parameters a and b are ANDed bitwise, and the results are placed
in the corresponding element of parameter d.
Figure 3-70. Vector AND (__ev_and)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evand d,a,b
0 31 32 63
a
b
& &
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-86 Freescale Semiconductor
SPE2 Operations
__ev_andc __ev_andc
Vector AND with Complement
d = __ev_andc (a,b)
d
0:31
a
0:31
& (b
0:31
) // Bitwise ANDC
d
32:63
a
32:63
& (b
32:63
) // Bitwise ANDC
The word elements of parameter a and are ANDed bitwise with the complement of the
corresponding elements of parameter b. The results are placed in the corresponding element of
parameter d.
Figure 3-71. Vector AND with Complement (__ev_andc)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evandc d,a,b
0 31 32 63
a
b
AND AND
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-87
__ev_avgbs __ev_avgbs
Vector Average Byte Signed
d = __ev_avgbs (a,b)
The signed byte elements of parameter a are added to the corresponding signed elements of
parameter b, producing 9-bit signed integer sums. The high-order 8 bits of the results are placed
into the corresponding byte elements of parameter d.
Figure 3-72. Vector Average Byte Signed (__ev_avgbs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgbs d,a,b
temp
0:8
EXTS(a
0:7
) + EXTS(b
0:7
)
d
0:7
temp
0:7
temp
0:8
EXTS(a
8:15
) + EXTS(b
8:15
)
d
8:15
temp
0:7
temp
0:8
EXTS(a
16:23
) + EXTS(b
16:23
)
d
16:23
temp
0:7
temp
0:8
EXTS(a
24:31
) + EXTS(b
24:31
)
d
24:31
temp
0:7
temp
0:8
EXTS(a
32:39
) + EXTS(b
32:39
)
d
32:39
temp
0:7
temp
0:8
EXTS(a
40:47
) + EXTS(b
40:47
)
d
40:47
temp
0:7
temp
0:8
EXTS(a
48:55
) + EXTS(b
48:55
)
d
48:55
temp
0:7
temp
0:8
EXTS(a
56:63
) + EXTS(b
56:63
)
d
56:63
temp
0:7
0 32 63
b
a
24 16 8 40 48 56
+
0 32 63
d
24 16 8 40 48 56
+ + + + + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-88 Freescale Semiconductor
SPE2 Operations
__ev_avgbsr __ev_avgbsr
Vector Average Byte Signed with Round
d = __ev_avgbsr (a,b)
The signed byte elements of parameter a are added to the corresponding signed elements of
parameter b, producing 9-bit signed integer sums. The sums are incremented by 1, and the
high-order 8 bits of the results are placed into the corresponding byte elements of parameter d.
Figure 3-73. Vector Average Byte Signed with Round (__ev_avgbsr)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgbsr d,a,b
temp
0:8
EXTS(a
0:7
) + EXTS(b
0:7
) + 1
d
0:7
temp
0:7
temp
0:8
EXTS(a
8:15
) + EXTS(b
8:15
) + 1
d
8:15
temp
0:7
temp
0:8
EXTS(a
16:23
) + EXTS(b
16:23
) + 1
d
16:23
temp
0:7
temp
0:8
EXTS(a
24:31
) + EXTS(b
24:31
) + 1
d
24:31
temp
0:7
temp
0:8
EXTS(a
32:39
) + EXTS(b
32:39
) + 1
d
32:39
temp
0:7
temp
0:8
EXTS(a
40:47
) + EXTS(b
40:47
) + 1
d
40:47
temp
0:7
temp
0:8
EXTS(a
48:55
) + EXTS(b
48:55
) + 1
d
48:55
temp
0:7
temp
0:8
EXTS(a
56:63
) + EXTS(b
56:63
) + 1
d
56:63
temp
0:7
0 32 63
b
a
24 16 8 40 48 56
+
0 32 63
d
24 16 8 40 48 56
+ + + + + + +
+1 +1 +1 +1 +1 +1 +1 +1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-89
__ev_avgbu __ev_avgbu
Vector Average Byte Unsigned
d = __ev_avgbu (a,b)
The unsigned byte elements of parameter a are added to the corresponding unsigned elements of
parameter b, producing 9-bit unsigned integer sums. The high-order 8 bits of the sums are placed
into the corresponding byte elements of parameter d.
Figure 3-74. Vector Average Byte Unsigned (__ev_avgbu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgbu d,a,b
temp
0:8
EXTZ(a
0:7
) + EXTZ(b
0:7
)
d
0:7
temp
0:7
temp
0:8
EXTZ(a
8:15
) + EXTZ(b
8:15
)
d
8:15
temp
0:7
temp
0:8
EXTZ(a
16:23
) + EXTZ(b
16:23
)
d
16:23
temp
0:7
temp
0:8
EXTZ(a
24:31
) + EXTZ(b
24:31
)
d
24:31
temp
0:7
temp
0:8
EXTZ(a
32:39
) + EXTZ(b
32:39
)
d
32:39
temp
0:7
temp
0:8
EXTZ(a
40:47
) + EXTZ(b
40:47
)
d
40:47
temp
0:7
temp
0:8
EXTZ(a
48:55
) + EXTZ(b
48:55
)
d
48:55
temp
0:7
temp
0:8
EXTZ(a
56:63
) + EXTZ(b
56:63
)
d
56:63
temp
0:7
0 32 63
b
a
24 16 8 40 48 56
+
0 32 63
d
24 16 8 40 48 56
+ + + + + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-90 Freescale Semiconductor
SPE2 Operations
__ev_avgbur __ev_avgbur
Vector Average Byte Unsigned with Round
d = __ev_avgbur (a,b)
The unsigned byte elements of parameter a are added to the corresponding unsigned elements of
parameter b, producing 9-bit unsigned integer sums. The sums are incremented by 1 and the
high-order 8 bits of the results are placed into the corresponding byte elements of parameter d.
Figure 3-75. Vector Average Byte Unsigned with Round (__ev_avgbur)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgbur d,a,b
temp
0:8
EXTZ(a
0:7
) + EXTZ(b
0:7
) + 1
d
0:7
temp
0:7
temp
0:8
EXTZ(a
8:15
) + EXTZ(b
8:15
) + 1
d
8:15
temp
0:7
temp
0:8
EXTZ(a
16:23
) + EXTZ(b
16:23
) + 1
d
16:23
temp
0:7
temp
0:8
EXTZ(a
24:31
) + EXTZ(b
24:31
) + 1
d
24:31
temp
0:7
temp
0:8
EXTZ(a
32:39
) + EXTZ(b
32:39
) + 1
d
32:39
temp
0:7
temp
0:8
EXTZ(a
40:47
) + EXTZ(b
40:47
) + 1
d
40:47
temp
0:7
temp
0:8
EXTZ(a
48:55
) + EXTZ(b
48:55
) + 1
d
48:55
temp
0:7
temp
0:8
EXTZ(a
56:63
) + EXTZ(b
56:63
) + 1
d
56:63
temp
0:7
0 32 63
b
a
24 16 8 40 48 56
+
0 32 63
d
24 16 8 40 48 56
+ + + + + + +
+1 +1 +1 +1 +1 +1 +1 +1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-91
__ev_avgds __ev_avgds
Vector Average Doubleword Signed
d = __ev_avgds (a,b)
temp
0:64
EXTS
65
(a
0:63
) + EXTS
65
(b
0:63
)
d
0:63
temp
0:63
The signed doubleword in parameter a is added to the signed doubleword in parameter b,
producing 65-bit signed integer sum. The high-order 64 bits of the result is placed into parameter
d.
Figure 3-76. Vector Average Doubleword Signed (__ev_avgds)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgds d,a,b
a
0 63
b
+
d
temp
SPE2 Programming Interface Manual, Rev. 1.0-2
3-92 Freescale Semiconductor
SPE2 Operations
__ev_avgdsr __ev_avgdsr
Vector Average Doubleword Signed with Round
d = __ev_avgdsr (a,b)
temp
0:64
EXTS
65
(a
0:63
) + EXTS
65
(b
0:63
) + 1
d
0:63
temp
0:63
The signed doubleword in parameter a is added to the signed doubleword in parameter b,
producing 65-bit signed integer sum. The sum is incremented by 1 and the high-order 64 bits of
the result is placed into parameter d.
Figure 3-77. Vector Average Doubleword Signed with Round (__ev_avgdsr)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgdsr d,a,b
a
0 63
b
+
d
temp
+1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-93
__ev_avgdu __ev_avgdu
Vector Average Doubleword Unsigned
d = __ev_avgdu (a,b)
temp
0:64
EXTZ
65
(a
0:63
) + EXTZ
65
(b
0:63
)
d
0:63
temp
0:63
The unsigned doubleword in parameter a is added to the unsigned doubleword in parameter b,
producing 65-bit unsigned integer sum. The high-order 64 bits of the result is placed into
parameter d.
Figure 3-78. Vector Average Doubleword Unsigned (__ev_avgdu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgdu d,a,b
a
0 63
b
+
d
temp
SPE2 Programming Interface Manual, Rev. 1.0-2
3-94 Freescale Semiconductor
SPE2 Operations
__ev_avgdur __ev_avgdur
Vector Average Doubleword Unsigned with Round
d = __ev_avgdur (a,b)
temp
0:64
EXTZ
65
(a
0:63
) + EXTZ
65
(b
0:63
) + 1
d
0:63
temp
0:63
The unsigned doubleword in parameter a is added to the unsigned doubleword in parameter b,
producing a 65-bit unsigned integer sum. The sum is incremented by 1 and the high-order 64 bits
of the result is placed into parameter d.
Figure 3-79. Vector Average Doubleword Unsigned with Round (__ev_avgdur)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgdur d,a,b
a
0 63
b
+
d
temp
+1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-95
__ev_avghs __ev_avghs
Vector Average Half Word Signed
d = __ev_avghs (a,b)
temp
0:16
EXTS(a
0:15
) + EXTS(b
0:15
)
d
0:15
temp
0:15
temp
0:16
EXTS(a
16:31
) + EXTS(b
16:31
)
d
16:31
temp
0:15
temp
0:16
EXTS(a
32:47
) + EXTS(b
32:47
)
d
32:47
temp
0:15
temp
0:16
EXTS(a
48:63
) + EXTS(b
48:63
)
d
48:63
temp
0:15
The signed half word elements of parameter a are added to the signed half word elements of
parameter b, producing 17-bit signed integer sums. The high-order 16 bits of the results are placed
into the corresponding half word elements of parameter d.
Figure 3-80. Vector Average Half Word Signed (__ev_avghs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavghs d,a,b
a
0 31 32 63
b
15 16 47 48
+ + + +
d
temp
SPE2 Programming Interface Manual, Rev. 1.0-2
3-96 Freescale Semiconductor
SPE2 Operations
__ev_avghsr __ev_avghsr
Vector Average Half Word Signed with Round
d = __ev_avghsr (a,b)
temp
0:16
EXTS(a
0:15
) + EXTS(b
0:15
) + 1
d
0:15
temp
0:15
temp
0:16
EXTS(a
16:31
) + EXTS(b
16:31
) + 1
d
16:31
temp
0:15
temp
0:16
EXTS(a
32:47
) + EXTS(b
32:47
) + 1
d
32:47
temp
0:15
temp
0:16
EXTS(a
48:63
) + EXTS(b
48:63
) + 1
d
48:63
temp
0:15
The signed half word elements of parameter a are added to the signed half word elements of
parameter b, producing 17-bit signed integer sums. The sums are incremented by 1 and the
high-order 16 bits of the results are placed into the corresponding half word elements of parameter
d.
Figure 3-81. Vector Average Half Word Signed with Round (__ev_avghsr)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavghsr d,a,b
a
0 31 32 63
b
15 16 47 48
+ + + +
+1
d
+1 +1 +1
temp
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-97
__ev_avghu __ev_avghu
Vector Average Half Word Unsigned
d = __ev_avghu (a,b)
temp
0:16
EXTZ(a
0:15
) + EXTZ(b
0:15
)
d
0:15
temp
0:15
temp
0:16
EXTZ(a
16:31
) + EXTZ(b
16:31
)
d
16:31
temp
0:15
temp
0:16
EXTZ(a
32:47
) + EXTZ(b
32:47
)
d
32:47
temp
0:15
temp
0:16
EXTZ(a
48:63
) + EXTZ(b
48:63
)
d
48:63
temp
0:15
The unsigned half word elements of parameter a are added to the unsigned half word elements of
parameter b, producing 17-bit unsigned integer sums. The high-order 16 bits of the results are
placed into the corresponding half word elements of parameter d.
Figure 3-82. Vector Average Half Word Unsigned (__ev_avghu)
__ev64_opaque__
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavghu d,a,b
a
0 31 32 63
b
15 16 47 48
+ + + +
d
temp
SPE2 Programming Interface Manual, Rev. 1.0-2
3-98 Freescale Semiconductor
SPE2 Operations
__ev_avghur __ev_avghur
Vector Average Half Word Unsigned with Round
d = __ev_avghur (a,b)
temp
0:16
EXTZ(a
0:15
) + EXTZ(b
0:15
) + 1
d
0:15
temp
0:15
temp
0:16
EXTZ(a
16:31
) + EXTZ(b
16:31
) + 1
d
16:31
temp
0:15
temp
0:16
EXTZ(a
32:47
) + EXTZ(b
32:47
) + 1
d
32:47
temp
0:15
temp
0:16
EXTZ(a
48:63
) + EXTZ(b
48:63
) + 1
d
48:63
temp
0:15
The unsigned half word elements of parameter a are added to the unsigned half word elements of
parameter b, producing 17-bit unsigned integer sums. The sums are incremented by 1 and the
high-order 16 bits of the results are placed into the corresponding half word elements of parameter
d.
Figure 3-83. Vector Average Half Word Unsigned with Round (__ev_avghur)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavghur d,a,b
a
0 31 32 63
b
15 16 47 48
+ + + +
+1
d
+1 +1 +1
temp
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-99
__ev_avgws __ev_avgws
Vector Average Word Signed
d = __ev_avgws (a,b)
temp
0:32
EXTS(a
0:31
) + EXTS(b
0:31
)
d
0:31
temp
0:31
temp
0:32
EXTS(a
32:63
) + EXTS(b
32:63
)
d
32:63
temp
0:31
The signed word elements of parameter a are added to the signed word elements of parameter b,
producing 33-bit signed integer sums. The high-order 32 bits of the results are placed into the
corresponding word elements of parameter d.
Figure 3-84. Vector Average Word Signed (__ev_avgws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgws d,a,b
a
0 31 32 63
b
15 16 47 48
+ +
d
temp
SPE2 Programming Interface Manual, Rev. 1.0-2
3-100 Freescale Semiconductor
SPE2 Operations
__ev_avgwsr __ev_avgwsr
Vector Average Word Signed with Round
d = __ev_avgwsr (a,b)
temp
0:32
EXTS(a
0:31
) + EXTS(b
0:31
) + 1
d
0:31
temp
0:31
temp
0:32
EXTS(a
32:63
) + EXTS(b
32:63
) + 1
d
32:63
temp
0:31
The signed word elements of parameter a are added to the signed word elements of parameter b,
producing 33-bit signed integer sums. The sums are incremented by 1 and the high-order 32 bits
of the results are placed into the corresponding word elements of parameter d.
Figure 3-85. Vector Average Word Signed with Round (__ev_avgwsr)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgwsr d,a,b
a
0 31 32 63
b
15 16 47 48
+ +
+1
d
+1
temp
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-101
__ev_avgwu __ev_avgwu
Vector Average Word Unsigned
d = __ev_avgwu (a,b)
temp
0:32
EXTZ(a
0:31
) + EXTS(b
0:31
)
d
0:31
temp
0:31
temp
0:32
EXTZ(a
32:63
) + EXTS(b
32:63
)
d
32:63
temp
0:31
The unsigned word elements of parameter a are added to the unsigned word elements of parameter
b, producing 33-bit unsigned integer sums. The high-order 32 bits of the results are placed into the
corresponding word elements of parameter d.
Figure 3-86. Vector Average Word Unsigned (__ev_avgwu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgwu d,a,b
a
0 31 32 63
b
15 16 47 48
+ +
d
temp
SPE2 Programming Interface Manual, Rev. 1.0-2
3-102 Freescale Semiconductor
SPE2 Operations
__ev_avgwur __ev_avgwur
Vector Average Word Unsigned with Round
d = __ev_avgwur (a,b)
temp
0:32
EXTZ(a
0:31
) + EXTS(b
0:31
) + 1
d
0:31
temp
0:31
temp
0:32
EXTZ(a
32:63
) + EXTS(b
32:63
) + 1
d
32:63
temp
0:31
The unsigned word elements of parameter a are added to the unsigned word elements of parameter
b, producing 33-bit unsigned integer sums. The sums are incremented by 1 and the high-order 32
bits of the results are placed into the corresponding word elements of parameter d.
Figure 3-87. Vector Average Word Unsigned with Round (__ev_avgwur)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evavgwur d,a,b
a
0 31 32 63
b
15 16 47 48
+ +
+1
d
+1
temp
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-103
__ev_clrbe __ev_clrbe
Vector Clear Bytes Even
d = __ev_clrbe (a,b)
d
0:15
a
0:15
& (
8
(mask
0
) ||
8
1)
d
16:31
a
16:31
& (
8
(mask
1
) ||
8
1)
d
32:47
a
32:47
& (
8
(mask
2
) ||
8
1)
d
48:63
a
48:63
& (
8
(mask
3
) ||
8
1)
Each even byte element in parameter a is logically and-ed with the associated mask bit specified
by parameter b and is placed into the corresponding byte element of parameter d, as shown in
Figure 3-88. Odd byte elements are placed into the corresponding byte element of parameter d. A
mask value of 0 causes the corresponding even byte element of parameter d to be cleared.
NOTE
Mask bit mask
3
is the least significant bit of parameter b. A value of
0x8 for parameter b sets mask
0
to a one and the remaining mask bits
to zero.
Figure 3-88. Vector Clear Bytes Even (__ev_clrbe)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 4-bit unsigned literal evclrbe d,a,b
0 31 32 63
a
15 16 47 48 39 40 55 56 23 24 7 8
0 31 32 63
d
15 16 47 48 39 40 55 56 23 24 7 8
mask
3
AND AND
mask
1
AND
mask
0
AND
mask
2
SPE2 Programming Interface Manual, Rev. 1.0-2
3-104 Freescale Semiconductor
SPE2 Operations
__ev_clrbo __ev_clrbo
Vector Clear Bytes Odd
d = __ev_clrbo (a,b)
d
0:15
a
0:15
& (
8
1 ||
8
(mask
0
))
d
16:31
a
16:31
& (
8
1 ||
8
(mask
1
))
d
32:47
a
32:47
&(
8
1 ||
8
(mask
2
))
d
48:63
a
48:63
& (
8
1 ||
8
(mask
3
))
Each odd byte element in parameter a is logically and-ed with the associated mask bit specified by
parameter b and is placed into the corresponding byte element of parameter d, as shown in
Figure 3-89. Even byte elements are placed into the corresponding byte element of parameter d.
A mask value of 0 causes the corresponding odd byte element of parameter d to be cleared.
NOTE
Mask bit mask
3
is the least significant bit of parameter b. A value of
0x8 for parameter b sets mask
0
to a one and the remaining mask bits
to zero.
Figure 3-89. Vector Clear Bytes Odd (__ev_clrbo)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 4-bit unsigned literal evclrbo d,a,b
0 31 32 63
a
15 16 47 48 39 40 55 56 23 24 7 8
0 31 32 63
d
15 16 47 48 39 40 55 56 23 24 7 8
mask
3
AND AND
mask
1
AND
mask
0
AND
mask
2
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-105
__ev_clrh __ev_clrh
Vector Clear Half Words
d = __ev_clrh (a,b)
d
0:15
a
0:15
& (
16
(mask
0
))
d
16:31
a
16:31
& (
16
(mask
1
))
d
32:47
a
32:47
& (
16
(mask
2
))
d
48:63
a
48:63
& (
16
(mask
3
))
Each half word element in parameter a is logically and-ed with the associated mask bit specified
by parameter b and is placed into the corresponding half word element of parameter d, as shown
in Figure 3-90. A mask value of 0 causes the corresponding half word element of parameter d to
be cleared.
NOTE
Mask bit mask
3
is the least significant bit of parameter b. A value of
0x8 for parameter b sets mask
0
to a one and the remaining mask bits
to zero.
Figure 3-90. Vector Clear Half Words (__ev_clrh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 4-bit unsigned literal evclrh d,a,b
15 16 47 48 0 31 32 63
d
15 16 47 48 0 31 32 63
a
mask
3
AND
mask
2
AND
mask
1
AND
mask
0
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-106 Freescale Semiconductor
SPE2 Operations
__ev_any_eq __ev_any_eq
Vector Any Equal
d = __ev_any_eq (a,b)
if ( (a
0:31
= b
0:31
) | (a
32:63
= b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are equal to the upper 32 bits
of parameter b or the lower 32 bits of parameter a are equal to the lower 32 bits of parameter b.
Figure 3-91. Vector Any Equal (__ev_any_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpeq x,a,b
0 31 32 63
a
b
x
|
d
=
=
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-107
__ev_any_gts __ev_any_gts
Vector AND with Complement
d = __ev_any_gts (a,b)
if ((a
0:31
>
signed
b
0:31
)|(a
32:63
>
signed
b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are greater than the upper 32
bits of parameter b or the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b.
Figure 3-92. Vector Any Greater Than Signed (__ev_any_gts)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpgts x,a,b
0 31 32 63
a
b
> >
x
|
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-108 Freescale Semiconductor
SPE2 Operations
__ev_any_gtu __ev_any_gtu
Vector Any Element Greater Than Unsigned
d = __ev_any_gtu (a,b)
if ( (a
0:31
>
unsigned
b
0:31
) | (a
32:63
>
unsigned
b
32:63
)) then d true
else dfalse
This intrinsic returns true if either the upper 32 bits of parameter a are greater than the upper 32
bits of parameter b or the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b.
Figure 3-93. Vector Any Greater Than Unsigned (__ev_any_gtu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpgtu x,a,b
0 31 32 63
a
b
> >
x
|
d
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-109
__ev_any_lts __ev_any_lts
Vector Any Element Less Than Signed
d = __ev_any_lts (a,b)
if ( (a
0:31
<
signed
b
0:31
) | (a
32:63
<
signed
b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are less than the upper 32 bits
of parameter b or the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
Figure 3-94. Vector Any Less Than Signed(__ev_any_lts)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmplts x,a,b
0 31 32 63
a
b
< <
x
|
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-110 Freescale Semiconductor
SPE2 Operations
__ev_any_ltu __ev_any_ltu
Vector Any Element Less Than Unsigned
d = __ev_any_ltu (a,b)
if ( (a
0:31
<
unsigned
b
0:31
) | (a
32:63
<
unsigned
b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are less than the upper 32 bits
of parameter b or the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
Figure 3-95. Vector Any Less Than Unsigned (__ev_any_ltu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpltu x,a,b
0 31 32 63
a
b
< <
x
|
D
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-111
__ev_cntlsh __ev_cntlsh
Vector Count Leading Signed Bits Half Word
d = __ev_cntlsh (a)
The leading signed bits in each half word element of parameter a are counted, and the respective
count is placed into each half word element of parameter d.
evcntlzh is used for unsigned parameters; evcntlsh is used for signed parameters.
Figure 3-96. Vector Count Leading Signed Bits Half Word (__ev_cntlsh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evcntlsh d,a
0 31 32 63
a
15 16 47 48
d
sssss
count of leading
signed bits
ssssssss
count of leading
signed bits
sss
count of leading
signed bits
ssssss
count of leading
signed bits
SPE2 Programming Interface Manual, Rev. 1.0-2
3-112 Freescale Semiconductor
SPE2 Operations
__ev_cntlsw __ev_cntlsw
Vector Count Leading Signed Bits Word
d = __ev_cntlsw (a)
The leading signed bits in each element of parameter a are counted, and the count is placed into
each element of parameter d.
evcntlzw is used for unsigned parameters; evcntlsw is used for signed parameters.
Figure 3-97. Vector Count Leading Signed Bits Word (__ev_cntlsw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evcntlsw d,a
0 31 32 63
a
d
ssss_sss... ssss_sss...
Count of leading signed bits Count of leading signed bits
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-113
__ev_cntlzh __ev_cntlzh
Vector Count Leading Zeros Half Word
d = __ev_cntlzh (a)
The leading zero bits in each half word element of parameter a are counted, and the respective
count is placed into each half word element of parameter d.
Figure 3-98. Vector Count Leading Zeros Half Word (__ev_cntlzh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evcntlzh d,a
0 31 32 63
a
15 16 47 48
d
0000
count of leading
zeros
00000000
count of leading
zeros
00
count of leading
zeros
000000
count of leading
zeros
SPE2 Programming Interface Manual, Rev. 1.0-2
3-114 Freescale Semiconductor
SPE2 Operations
__ev_cntlzw __ev_cntlzw
Vector Count Leading Zeros Word
d = __ev_cntlzw (a)
The leading zero bits in each element of parameter a are counted, and the respective count is placed
into each element of parameter d.
Figure 3-99. Vector Count Leading Zeros Word (__ev_cntlzw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evcntlzw d,a
0 31 32 63
a
d
0000_000... 0000_000...
count of leading zeros count of leading zeros
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-115
__ev_diff2his[a] __ev_diff2his[a]
Vector Difference of 2 Halfwords Interleaved Signed (to Accumulator)
d = __ev_diff2his (a) (A = 0)
d = __ev_diff2hisa (a) (A = 1)
d
0:31
EXTS
32
(a
0:15
) - EXTS
32
(a
32:47
)
d
32:63
EXTS
32
(a
16:31
) - EXTS
32
(a
48:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of two interleaved signed halfword elements of parameter a are subtracted, and the results
are placed into the word elements of parameter d. If A = 1, the result in parameter d is also placed
into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-100. Vector Difference of 2 Halfwords Interleaved Signed (to Accumulator)
(__ev_diff2his[a])
d a Maps to
__ev64_opaque__ __ev64_opaque__ evdiff2his d,a
__ev64_opaque__ __ev64_opaque__ evdiff2hisa d,a
0 32 63
a
16 48
0 63
d (and Accumulator if A=1)
_
_
_
SPE2 Programming Interface Manual, Rev. 1.0-2
3-116 Freescale Semiconductor
SPE2 Operations
__ev_diff2hisaaw __ev_diff2hisaaw
Vector Difference of 2 Halfwords Interleaved Signed and Accumulate into Words
d = __ev_diff2hisaaw (a)
d
0:31
ACC
0:31
+ (EXTS
32
(a
0:15
) - EXTS
32
(a
32:47
))
d
32:63
ACC
32:63
+ (EXTS
32
(a
16:31
) - EXTS
32
(a
48:63
))
// update accumulator
ACC
0:63
d
0:63
Groups of two interleaved signed halfword elements of parameter a are subtracted, the differences
are added to the corresponding word elements in the accumulator, and the results are placed into
the word elements of parameter d and the accumulator.
Other registers altered: ACC
Figure 3-101. Vector Difference of 2 Halfwords Interleaved Signed and Accumulate
(__ev_diff2hisaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evdiff2hisaaw d,a
+ +
0 32 63
a
16 48
0 63
d and Accumulator
0 63
Accumulator
_ _
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-117
__ev_divs __ev_divs
Vector Divide Signed
d = __ev_divs (a,b)
dividend a
0:63
divisor b
0:63
d
0:63
dividend divisor
ov0
if ((dividend < 0) & (divisor = 0)) then
d
0:63
0x8000_0000_0000_0000
ov 1
else if ((dividend >= 0) & (divisor = 0)) then
d
0:63
0x7FFF_FFFF_FFFF_FFFF
ov 1
else if ((dividend = 0x8000_0000_0000_0000) & (divisor = 0xFFFF_FFFF_FFFF_FFFF))
then
d
0:63
0x7FFF_FFFF_FFFF_FFFF
ov 1
endif
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The dividend in parameter a is divided by the divisor in parameter b. The resulting 64-bit quotient
is placed into parameter d. The remainder is not supplied. The operands and quotient are
interpreted as signed integers. If overflow, underflow, or divide by zero occurs, the overflow and
summary overflow SPEFSCR bits are set. Note that any overflow indication is always set as a side
effect of this instruction. No form is defined that disables the setting of the overflow bits. In case
of overflow, a saturated value is delivered into parameter d.
Figure 3-102. Vector Divide Signed (__ev_divs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdivs d,a,b
0 63
a (dividend)
b (divisor)
a / b
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-118 Freescale Semiconductor
SPE2 Operations
__ev_divu __ev_divu
Vector Divide Unsigned
d = __ev_divu (a,b)
dividend a
0:63
divisor b
0:63
d
0:63
dividend divisor
ov0
if (divisor = 0) then
d
0:63
0xFFFF_FFFF_FFFF_FFFF
ov 1
endif
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The dividend in parameter a is divided by the divisor in parameter b. The resulting 64-bit quotient
is placed into parameter d. The remainder is not supplied. The operands and quotient are
interpreted as unsigned integers. If divide by zero occurs, the overflow and summary overflow
SPEFSCR bits are set. Note that any overflow indication is always set as a side effect of this
instruction. No form is defined that disables the setting of the overflow bits. In case of overflow, a
saturated value is delivered into the parameter d.
Figure 3-103. Vector Divide Signed (__ev_divs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdivu d,a,b
0 63
a (dividend)
b (divisor)
a / b
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-119
__ev_divws __ev_divws
Vector Divide Word Signed
d = __ev_divws (a,b)
dividendh a
0:31
dividendl a
32:63
divisorh b
0:31
divisorl b
32:63
d
0:31
dividendh divisorh
d
32:63
dividendl divisorl
ovh 0
ovl 0
if ((dividendh < 0) & (divisorh = 0)) then
d
0:31
0x80000000
ovh 1
else if ((dividendh >= 0) & (divisorh = 0)) then
d
0:31
0x7FFFFFFF
ovh 1
else if ((dividendh = 0x80000000) & (divisorh = 0xFFFF_FFFF)) then
d
0:31
0x7FFFFFFF
ovh 1
if ((dividendl < 0) & (divisorl = 0)) then
d
32:63
0x80000000
ovl 1
else if ((dividendl >= 0) & (divisorl = 0)) then
d
32:63
0x7FFFFFFF
ovl 1
else if ((dividendl = 0x80000000) & (divisorl = 0xFFFF_FFFF)) then
d
32:63
0x7FFFFFFF
ovl 1
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The two dividends are the two elements of the contents of parameter a. The two divisors are the
two elements of the contents of parameter b. The resulting two 32-bit quotients on each element
are placed into parameter d. The remainders are not supplied. Parameters and quotients are
interpreted as signed integers. If overflow, underflow, or divide by zero occurs, the overflow and
summary overflow SPEFSCR bits are set. Note that any overflow indication is always set as a side
effect of this instruction. No form is defined that disables the setting of the overflow bits. In case
of overflow, a saturated value is delivered into the destination register.
Figure 3-104. Vector Divide Word Signed (__ev_divws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdivws d,a,b
0 31 32 63
a (dividends)
b (divisors)
a/b a/b
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-120 Freescale Semiconductor
SPE2 Operations
__ev_divwsf __ev_divwsf
Vector Divide Word Signed Fractional
d = __ev_divwsf (a,b)
dividendh a
0:31
dividendl a
32:63
divisorh b
0:31
divisorl b
32:63
d
0:31
dividendh
sf
divisorh
d
32:63
dividendl
sf
divisorl
ovh 0
ovl 0
if (((dividendh
0
^ divisorh
0
) = 1) & ((|dividendh| > |divisorh|) | (divisorh = 0))) then
d
0:31
0x80000000
ovh 1
else if ((dividendh
0
^ divisorh
0
) = 0) & ((|dividendh| >= |divisorh|) | (divisorh = 0))) then
d
0:31
0x7FFFFFFF
ovh 1
if (((dividendl
0
^ divisorl
0
) = 1) & ((|dividendl| > |divisorl|) | (divisorl = 0))) then
d
32:63
0x80000000
ovl 1
else if (((dividendl
0
^ divisorl
0
) 0) & & ((|dividendl| >= |divisorl|) | (divisorl = 0))) then
d
32:63
0x7FFFFFFF
ovl 1
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The two dividends are the two elements contained in parameter a. The two divisors are the two
elements contained in parameter b. The resulting two 32-bit quotients of each element are placed
into parameter d. The remainders are not supplied. The operands and quotients are interpreted as
signed fractions. Each quotient satisfies the equation dividend = (quotient x divisor) + remainder,
where the sign of the remainder (if non-zero) is the same as the sign of the dividend. The
magnitude of the remainder is less than the magnitude of the divisor. If overflow, underflow, or
divide by zero occurs, the overflow and summary overflow SPEFSCR bits are set. Note that any
overflow indication is always set as a side effect of this instruction. No form is defined that disables
the setting of the overflow bits. In case of overflow, a saturated value is delivered into the
parameter d.
Figure 3-105. Vector Divide Word Signed Fraction (__ev_divwsf)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdivwsf d,a,b
0 31 32 63
a (dividends)
b (divisors)
a/b a/b
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-121
__ev_divwu __ev_divwu
Vector Divide Word Unsigned
d = __ev_divwu (a,b)
dividendh a
0:31
dividendl a
32:63
divisorh b
0:31
divisorl b
32:63
d
0:31
dividendh divisorh
d
32:63
dividendl divisorl
ovh 0
ovl 0
if (divisorh = 0) then
d
0:31
= 0xFFFFFFFF
ovh 1
if (divisorl = 0) then
d
32:63
0xFFFFFFFF
ovl 1
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The two dividends are the two elements of the contents of parameter a. The two divisors are the
two elements of the contents of parameter b. Two 32-bit quotients are formed as a result of the
division on each of the high and low elements and the quotients are placed into parameter d.
Remainders are not supplied. Parameters and quotients are interpreted as unsigned integers. If a
divide by zero occurs, the overflow and summary overflow SPEFSCR bits are set. Note that any
overflow indication is always set as a side effect of this instruction. No form is defined that disables
the setting of the overflow bits. In case of overflow, a saturated value is delivered into the
destination register.
Figure 3-106. Vector Divide Word Unsigned (__ev_divwu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdivwu d,a,b
0 31 32 63
a (dividends)
b (divisors)
A/ B A/ B
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-122 Freescale Semiconductor
SPE2 Operations
__ev_divwuf __ev_divwuf
Vector Divide Word Unsigned Fractional
d = __ev_divwuf (a,b)
dividendh a
0:31
dividendl a
32:63
divisorh b
0:31
divisorl b
32:63
d
0:31
dividendh
uf
divisorh
uf
d
32:63
dividendl
uf
divisorl
ovh 0
ovl 0
if (divisorh = 0) |(dividendh >= divisorh) then
d
0:31
= 0xFFFFFFFF
ovh 1
if (divisorl = 0) |(dividendl >= divisorl) then
d
32:63
0xFFFFFFFF
ovl 1
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The two dividends are the two elements contained in parameter a. The two divisors are the two
elements contained in parameter b. Two 32-bit quotients are formed as a result of the division on
each of the high and low elements and the quotients are placed into parameter d. Remainders are
not supplied. Operands and quotients are interpreted as unsigned fractions. Each quotient satisfies
the equation dividend = (quotient x divisor) + remainder, where the remainder is non-negative and
the magnitude of the remainder is less than the magnitude of the divisor. If an overflow (dividend
>= divisor) or a divide by zero occurs, the overflow and summary overflow SPEFSCR bits are set.
Note that any overflow indication is always set as a side effect of this instruction. No form is
defined that disables the setting of the overflow bits. In case of overflow, a saturated value is
delivered into parameter d.
Figure 3-107. Vector Divide Word Unsigned Fractional (__ev_divwuf)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdivwuf d,a,b
0 31 32 63
a (dividends)
b (divisors)
a/ b a/ b
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-123
__ev_dlveb __ev_dlveb
Vector De-interleave Even Bytes
d = __ev_dlveb (a,b)
d
0:63
a
0:7
|| a
16:23
|| a
32:39
|| a
48:55
|| b
0:7
|| b
16:23
|| b
32:39
|| b
48:55
The even byte elements in parameters a and b are de-interleaved and placed into parameter d.
Figure 3-108. Vector De-interleave Even Bytes (__ev_dlveb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlveb d,a,b
0 32 63
a
24 16 8 40 48 56
0 32 63
d
24 16 8 40 48 56
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-124 Freescale Semiconductor
SPE2 Operations
__ev_dlveh __ev_dlveh
Vector De-interleave Even Half Words
d = __ev_dlveh (a,b)
d
0:63
a
0:15
|| a
32:47
|| b
0:15
|| b
32:47
The even half word elements in parameters a and b are de-interleaved and placed into parameter d.
Figure 3-109. Vector De-interleave Even Half Words (__ev_dlveh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlveh d,a,b
15 16 47 48 0 31 32 63
a
d
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-125
__ev_dlveob __ev_dlveob
Vector De-interleave Even/Odd Bytes
d = __ev_dlveob (a,b)
d
0:63
a
0:7
|| a
16:23
|| a
32:39
|| a
48:55
|| b
8:15
|| b
24:31
|| b
40:47
|| b
56:63
The even byte elements in parameter a and the odd byte elements in parameter b are de-interleaved
and placed into parameter d.
Figure 3-110. Vector De-interleave Even/Odd Bytes (__ev_dlveob)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlveob d,a,b
0 32 63
a
24 16 8 40 48 56
0 32 63
d
24 16 8 40 48 56
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-126 Freescale Semiconductor
SPE2 Operations
__ev_dlveoh __ev_dlveoh
Vector De-interleave Even/Odd Half Words
d = __ev_dlveoh (a,b)
d
0:63
a
0:15
|| a
32:47
|| b
16:31
|| b
48:63
The even half word elements in parameter a and the odd half word elements of parameter b are
de-interleaved and placed into parameter d.
Figure 3-111. Vector De-interleave Even Half Words (__ev_dlveoh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlveoh d,a,b
15 16 47 48 0 31 32 63
a
d
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-127
__ev_dlvob __ev_dlvob
Vector De-interleave Odd Bytes
d = __ev_dlvob (a,b)
d
0:63
a
8:15
|| a
24:31
|| a
40:47
|| a
56:63
|| b
8:15
|| b
24:31
|| b
40:47
|| b
56:63
The odd byte elements in parameter a and the odd byte elements in parameter b are de-interleaved
and placed into parameter d.
Figure 3-112. Vector De-interleave Odd Bytes (__ev_dlvob)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlvob d,a,b
0 32 63
a
24 16 8 40 48 56
0 32 63
d
24 16 8 40 48 56
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-128 Freescale Semiconductor
SPE2 Operations
__ev_dlvoeb __ev_dlvoeb
Vector De-interleave Odd/Even Bytes
d = __ev_dlvoeb (a,b)
d
0:63
a
8:15
|| a
24:31
|| a
40:47
|| a
56:63
|| b
0:7
|| b
16:23
|| b
32:39
|| b
48:55
The odd byte elements in parameter a and the even byte elements in parameter b are de-interleaved
and placed into parameter d.
Figure 3-113. Vector De-interleave Odd/Even Bytes (__ev_dlvoeb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlvoeb d,a,b
0 32 63
a
24 16 8 40 48 56
0 32 63
d
24 16 8 40 48 56
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-129
__ev_dlvoeh __ev_dlvoeh
Vector De-interleave Odd/Even Half Words
d = __ev_dlvoeh (a,b)
d
0:63
a
16:31
|| a
48:63
|| b
0:15
|| b
32:47
The odd half word elements in parameter a and the even half word elements in parameter b are
de-interleaved and placed into parameter d.
Figure 3-114. Vector De-interleave Odd/Even Half Words (__ev_dlvoeh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlvoeh d,a,b
15 16 47 48 0 31 32 63
a
d
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-130 Freescale Semiconductor
SPE2 Operations
__ev_dlvoh __ev_dlvoh
Vector De-interleave Odd Half Words
d = __ev_dlvoh (a,b)
d
0:63
a
16:31
|| a
48:63
|| b
16:31
|| b
48:63
The odd half word elements in parameters a and b are de-interleaved and placed into parameter d.
Figure 3-115. Vector De-interleave Odd Half Words (__ev_dlvoh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdlvoh d,a,b
15 16 47 48 0 31 32 63
a
d
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-131
__ev_dotp4hgasmf[a] __ev_dotp4hgasmf[a]
Vector Dot Product of Four Half Words, Guarded, Add, Signed, Modulo, Fractional (to
Accumulator)
d = __ev_dotp4hgasmf (a,b) (A = 0)
d = __ev_dotp4hgasmfa (a,b) (A = 1)
temp0
0:32
a
0:15
sf
b
0:15
temp1
0:32
a
16:31
sf
b
16:31
temp2
0:32
a
32:47
sf
b
32:47
temp3
0:32
a
48:63
sf
b
48:63
temp
0:63
EXTS(temp1
0:31
) + EXTS(temp1
0:31
) + EXTS(temp2
0:31
) + EXTS(temp3
0:31
) //modulo
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding half word pairs of signed fractional elements in parameters a and b are multiplied
producing four 33-bit intermediate products. The intermediate 33-bit products are sign-extended
and added together to produce a 64-bit result and the sum is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
Note: If the two input operands to a multiply are both -1.0, the intermediate product is represented
as +1.0, thus the need for a 33-bit intermediate product prior to sign extension.
Other registers altered: ACC (if A=1)
Figure 3-116. Vector Dot Product of Four Half Words, Guarded, Add, Signed, Modulo,
Fractional (to Accumulator) (__ev_dotp4hgasmf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasmfa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 33-bit)
x x
d (and Accumulator if A=1)
+
EXTS
EXTS
EXTS
EXTS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-132 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgasmfaa __ev_dotp4hgasmfaa
Vector Dot Product of Four Half Words, Guarded, Add, Signed, Modulo, Fractional and
Accumulate
d = __ev_dotp4hgasmfaa (a,b)
temp0
0:32
a
0:15
sf
b
0:15
temp1
0:32
a
16:31
sf
b
16:31
temp2
0:32
a
32:47
sf
b
32:47
temp3
0:32
a
48:63
sf
b
48:63
temp
0:63
EXTS(temp1
0:31
) + EXTS(temp1
0:31
) + EXTS(temp2
0:31
) + EXTS(temp3
0:31
) + ACC
0:63
d
0:63
temp
0:63
ACC
0:63
d
0:63
Corresponding half word pairs of signed fractional elements in parameters a and b are multiplied
producing four 33-bit intermediate products. The 33-bit intermediate products are sign-extended
and added together with the contents of the accumulator to produce a 64-bit result, and the sum is
placed into parameter d and the accumulator.
Note: If the two input operands to a multiply are both -1.0, the intermediate product is represented
as +1.0, thus the need for a 33-bit intermediate product prior to sign extension.
Other registers altered: ACC
Figure 3-117. Vector Dot Product of Four Half Words, Guarded, Add, Signed, Modulo,
Fractional and Accumulate (__ev_dotp4hgasmfaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasmfaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 33-bit)
x x
Accumulator
+
d and Accumulator
+
EXTS
EXTS
EXTS
EXTS
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-133
__ev_dotp4hgasmfaa3 __ev_dotp4hgasmfaa3
Vector Dot Product of Four Halfwords, Guarded, Add, Signed, Modulo, Fractional and
Accumulate, 3 operand
d = __ev_dotp4hgasmfaa3 (a,b,c)
temp0
0:32
b
0:15
sf
c
0:15
temp1
0:32
b
16:31
sf
c
16:31
temp2
0:32
b
32:47
sf
c
32:47
temp3
0:32
b
48:63
sf
c
48:63
temp
0:63
EXTS
64
(temp1
0:32
) + EXTS
64
(temp1
0:32
) + EXTS
64
(temp2
0:32
) + EXTS
64
(temp3
0:32
) + a
0:63
d
0:63
temp
0:63
ACC
0:63
d
0:63
Corresponding halfword pairs of signed fractional elements in parameters b and c are multiplied
producing four 33-bit intermediate products. The 33-bit intermediate products are sign-extended
and added together with the contents of parameter a to produce a 64-bit result, and the sum is
placed into parameter d and the accumulator.
Note: If the two input operands to a multiply are both -1.0, the intermediate product is represented
as +1.0, thus the need for a 33-bit intermediate product prior to sign extension.
Other registers altered: ACC
Figure 3-118. Vector Dot Product of Four Halfwords, Guarded, Add, Signed, Modulo,
Fractional and Accumulate 3 op (__ev_dotp4hgasmfaa3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 33-bit)
x x
a
+
d and Accumulator
+
EXTS
EXTS
EXTS
EXTS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-134 Freescale Semiconductor
SPE2 Operations
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hgasmfaa3 d,b,c
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-135
__ev_dotp4hgasmi[a] __ev_dotp4hgasmi[a]
Vector Dot Product of Four Half Words, Guarded, Add, Signed, Modulo, Integer (to Accumulator)
d = __ev_dotp4hgasmi (a,b) (A = 0)
d = __ev_dotp4hgasmia (a,b) (A = 1)
temp0
0:31
a
0:15
si
b
0:15
temp1
0:31
a
16:31
si
b
16:31
temp2
0:31
a
32:47
si
b
32:47
temp3
0:31
a
48:63
si
b
48:63
temp
0:63
EXTS(temp1
0:31
) + EXTS(temp1
0:31
) + EXTS(temp2
0:31
) + EXTS(temp3
0:31
) //modulo
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding half word pairs of signed integer elements in parameters a and b are multiplied
producing four 32-bit intermediate products. The intermediate 32-bit products are sign-extended
and added together to produce a 64-bit result and the sum is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-119. Vector Dot Product of Four Half Words, Guarded, Add, Signed, Modulo,
Integer (to Accumulator) (__ev_dotp4hgasmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasmia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
d (and Accumulator if A=1)
+
EXTS
EXTS
EXTS
EXTS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-136 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgasmiaa __ev_dotp4hgasmiaa
Vector Dot Product of Four Half Words, Guarded, Add, Signed, Modulo, Integer and Accumulate
d = __ev_dotp4hgasmiaa (a,b)
temp0
0:31
a
0:15
si
b
0:15
temp1
0:31
a
16:31
si
b
16:31
temp2
0:31
a
32:47
si
b
32:47
temp3
0:31
a
48:63
si
b
48:63
temp
0:63
EXTS(temp1
0:31
) + EXTS(temp1
0:31
) + EXTS(temp2
0:31
) + EXTS(temp3
0:31
) + ACC
0:63
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding half word pairs of signed integer elements in paraameter a and parameter b are
multiplied producing four 32-bit intermediate products. The intermediate 32-bit products are
sign-extended and added together with the contents of the accumulator to produce a 64-bit result,
and the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-120. Vector Dot Product of Half Words, Guarded, Add, Signed, Modulo, Integer
and Accumulate (__ev_dotp4hgasmiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasmiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
Accumulator
+
d and Accumulator
+
EXTS
EXTS
EXTS
EXTS
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-137
__ev_dotp4hgasmiaa3 __ev_dotp4hgasmiaa3
Vector Dot Product of Four Halfwords, Guarded, Add, Signed, Modulo, Integer and Accumulate,
3 operand
d = __ev_dotp4hgasmiaa3 (a,b,c)
temp0
0:31
b
0:15
si
c
0:15
temp1
0:31
b
16:31
si
c
16:31
temp2
0:31
b
32:47
si
c
32:47
temp3
0:31
b
48:63
si
c
48:63
temp
0:63
EXTS
64
(temp1
0:31
) + EXTS
64
(temp1
0:31
) + EXTS
64
(temp2
0:31
) + EXTS
64
(temp3
0:31
) + a
0:63
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding halfword pairs of signed integer elements in parameters b and c are multiplied
producing four 32-bit intermediate products. The intermediate 32-bit products are sign-extended
and added together with the contents of parameter a to produce a 64-bit result, and the sum is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-121. Vector Dot Product of Halfwords, Guarded, Add, Signed, Modulo, Integer
and Accumulate 3 op (__ev_dotp4hgasmiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hgasmiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
a
+
d and Accumulator
+
EXTS
EXTS
EXTS
EXTS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-138 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgasumi[a] __ev_dotp4hgasumi[a]
Vector Dot Product of Four Half Words, Guarded, Add, Signed by Unsigned, Modulo, Integer (to
Accumulator)
d = __ev_dotp4hgasumi (a,b) (A = 0)
d = __ev_dotp4hgasumia (a,b) (A = 1)
temp0
0:31
a
0:15
sui
b
0:15
temp1
0:31
a
16:31
sui
b
16:31
temp2
0:31
a
32:47
sui
b
32:47
temp3
0:31
a
48:63
sui
b
48:63
temp
0:63
EXTS(temp1
0:31
) + EXTS(temp1
0:31
) + EXTS(temp2
0:31
) + EXTS(temp3
0:31
) //modulo
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding half word pairs of signed integer elements in parameter a and unsigned integer
elements in parameter b are multiplied producing four 32-bit intermediate products. The
intermediate 32-bit products are sign-extended and added together to produce a 64-bit result and
the sum is placed into parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-122. Vector Dot Product of Four Half Words, Guarded, Add, Signed by Unsigned,
Modulo, Integer (to Accumulator) (__ev_dotp4hgasumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasumia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(4 32-bit)
x x
d (and Accumulator if A=1)
+
EXTS
EXTS
EXTS
EXTS
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-139
__ev_dotp4hgasumiaa __ev_dotp4hgasumiaa
Vector Dot Product of Four Half Words, Guarded, Add, Signed by Unsigned, Modulo, Integer and
Accumulate
d = __ev_dotp4hgasumiaa (a,b)
temp0
0:31
a
0:15
sui
b
0:15
temp1
0:31
a
16:31
sui
b
16:31
temp2
0:31
a
32:47
sui
b
32:47
temp3
0:31
a
48:63
sui
b
48:63
temp
0:63
EXTS(temp1
0:31
) + EXTS(temp1
0:31
) + EXTS(temp2
0:31
) + EXTS(temp3
0:31
) + ACC
0:63
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding half word pairs of signed integer elements in parameter a and unsigned integer
elements in parameter b are multiplied producing four 32-bit intermediate products. The
intermediate 32-bit products are sign-extended and added together with the contents of the
accumulator to produce a 64-bit result, and the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-123. Vector Dot Product of Half Words, Guarded, Add, Signed by Unsigned,
Modulo, Integer and Accumulate (__ev_dotp4hgasumiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgasumiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(4 32-bit)
x x
Accumulator
+
d and Accumulator
+
EXTS
EXTS
EXTS
EXTS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-140 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgasumiaa3 __ev_dotp4hgasumiaa3
Vector Dot Product of Four Halfwords, Guarded, Add, Signed by Unsigned, Modulo, Integer and
Accumulate, 3 operand
d = __ev_dotp4hgasumiaa3 (a,b,c)
temp0
0:31
b
0:15
sui
c
0:15
temp1
0:31
b
16:31
sui
c
16:31
temp2
0:31
b
32:47
sui
c
32:47
temp3
0:31
b
48:63
sui
c
48:63
temp
0:63
EXTS
64
(temp1
0:31
) + EXTS
64
(temp1
0:31
) + EXTS
64
(temp2
0:31
) + EXTS
64
(temp3
0:31
) + a
0:63
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding halfword pairs of signed integer elements in parameter b and unsigned integer
elements in parameter c are multiplied producing four 32-bit intermediate products. The
intermediate 32-bit products are sign-extended and added together with the contents of parameter
a to produce a 64-bit result, and the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-124. Vector Dot Product of Halfwords, Guarded, Add, Signed by Unsigned,
Modulo, Integer and Accumulate 3 op (__ev_dotp4hgasumiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hgasumiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c (unsigned)
x x
b (signed)
products
(4 32-bit)
x x
a
+
d and Accumulator
+
EXTS
EXTS
EXTS
EXTS
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-141
__ev_dotp4hgaumi[a] __ev_dotp4hgaumi[a]
Vector Dot Product of Four Half Words, Guarded, Add, Unsigned, Modulo, Integer (to
Accumulator)
d = __ev_dotp4hgaumi (a,b) (A = 0)
d = __ev_dotp4hgaumia (a,b) (A = 1)
temp0
0:31
a
0:15
ui
b
0:15
temp1
0:31
a
16:31
ui
b
16:31
temp2
0:31
a
32:47
ui
b
32:47
temp3
0:31
a
48:63
ui
b
48:63
temp
0:63
EXTZ(temp1
0:31
) + EXTZ(temp1
0:31
) + EXTZ(temp2
0:31
) + EXTZ(temp3
0:31
) //modulo
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding half word pairs of unsigned integer elements in parameters a and b are multiplied
producing four 32-bit intermediate products. The intermediate 32-bit products are zero-extended
and added together to produce a 64-bit result and the sum is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-125. Vector Dot Product of Four Half Words, Guarded, Add, Unsigned, Modulo,
Integer (to Accumulator) (__ev_dotp4hgaumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgaumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgaumia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
d (and Accumulator if A=1)
+
EXTZ
EXTZ
EXTZ
EXTZ
SPE2 Programming Interface Manual, Rev. 1.0-2
3-142 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgaumiaa __ev_dotp4hgaumiaa
Vector Dot Product of Four Half Words, Guarded, Add, Unsigned, Modulo, Integer and
Accumulate
d = __ev_dotp4hgaumiaa (a,b)
temp0
0:31
a
0:15
ui
b
0:15
temp1
0:31
a
16:31
ui
b
16:31
temp2
0:31
a
32:47
ui
b
32:47
temp3
0:31
a
48:63
ui
b
48:63
temp
0:63
EXTZ(temp1
0:31
) + EXTZ(temp1
0:31
) + EXTZ(temp2
0:31
) + EXTZ(temp3
0:31
) + ACC
0:63
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding half word pairs of unsigned integer elements in parameters a and b are multiplied
producing four 32-bit intermediate products. The intermediate 32-bit products are zero-extended
and added together with the contents of the accumulator to produce a 64-bit result, and the sum is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-126. Vector Dot Product of Half Words, Guarded, Add, Unsigned, Modulo, Integer
and Accumulate (__ev_dotp4hgaumiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgaumiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
Accumulator
+
d and Accumulator
+
EXTZ
EXTZ
EXTZ
EXTZ
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-143
__ev_dotp4hgaumiaa3 __ev_dotp4hgaumiaa3
Vector Dot Product of Four Halfwords, Guarded, Add, Unsigned, Modulo, Integer and
Accumulate, 3 operand
d = __ev_dotp4hgaumiaa3 (a,b,c)
temp0
0:31
b
0:15
ui
c
0:15
temp1
0:31
b
16:31
ui
c
16:31
temp2
0:31
b
32:47
ui
c
32:47
temp3
0:31
b
48:63
ui
c
48:63
temp
0:63
EXTZ
64
(temp1
0:31
) + EXTZ
64
(temp1
0:31
) + EXTZ
64
(temp2
0:31
) + EXTZ
64
(temp3
0:31
)
+ a
0:63
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding halfword pairs of unsigned integer elements in parameters b and c are multiplied
producing four 32-bit intermediate products. The intermediate 32-bit products are zero-extended
and added together with the contents of parameter a to produce a 64-bit result, and the sum is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-127. Vector Dot Product of Halfwords, Guarded, Add, Unsigned, Modulo, Integer
and Accumulate 3 op (__ev_dotp4hgaumiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hgaumiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
a
+
d and Accumulator
+
EXTZ
EXTZ
EXTZ
EXTZ
SPE2 Programming Interface Manual, Rev. 1.0-2
3-144 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgssmf[a] __ev_dotp4hgssmf[a]
Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo, Fractional (to
Accumulator)
d = __ev_dotp4hgssmf (a,b) (A = 0)
d = __ev_dotp4hgssmfa (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:32
a
0:15
sf
b
0:15
temph2
0:32
a
16:31
sf
b
16:31
temph
0:63
EXTS
64
(temph1
0:32
) - EXTS
64
(temph2
0:32
)
//low dot - calculate real part of complex product
templ1
0:32
a
32:47
si
b
32:47
templ2
0:32
a
48:63
si
b
48:63
templ
0:63
EXTS
64
(templ1
0:32
) - EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Halfword pairs of signed fractional elements in the high halfwords of parameter a are multiplied
with the corresponding high halfwords of parameter b, producing a pair of 33-bit products which
are sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended 33-bit products of the halfword pairs of signed
fractional elements from the low halfwords of parameter a and the corresponding low halfwords
of parameter b, and the sum is placed into parameter d. If A = 1, the result in parameter d is also
placed into the accumulator. This instruction is used to produce the real portion of a guarded
complex dot product.
Other registers altered: ACC (if A=1)
Figure 3-128. Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo,
Fractional (to Accumulator) (__ev_dotp4hgssmf[a])
15 16 47 48 0 31 32 63
Intermediate
b
x
a
products
(4 33-bit)
x
-
d (and Accumulator if A=1)
-
Re Img Re Img
Re Img Re Img
Re
x
x
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-145
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgssmf d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgssmfa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-146 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgssmfaa __ev_dotp4hgssmfaa
Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo, Fractional and
Accumulate
d = __ev_dotp4hgssmfaa (a,b)
// high dot - calculate real part of complex product
temph1
0:32
a
0:15
sf
b
0:15
temph2
0:32
a
16:31
sf
b
16:31
temph
0:63
EXTS
64
(temph1
0:32
) - EXTS
64
(temph2
0:32
)
//low dot - calculate real part of complex product
templ1
0:32
a
32:47
si
b
32:47
templ2
0:32
a
48:63
si
b
48:63
templ
0:63
EXTS
64
(templ1
0:32
) - EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
ACC
0:63
d
0:63
Halfword pairs of signed fractional elements in the high halfwords of parameter a are multiplied
with the corresponding high halfwords of parameter b, producing a pair of 33-bit products which
are sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended 33-bit products of the halfword pairs of signed
fractional elements from the low halfwords of parameter a and the corresponding low halfwords
of parameter b, and then added together with the contents of the accumulator, and the sum is placed
into parameter d and the accumulator. This instruction is used to produce the real portion of a
guarded complex dot product.
Other registers altered: ACC
Figure 3-129. Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo,
Fractional and Accumulate (__ev_dotp4hgssmfaa)
15 16 47 48 0 31 32 63
Intermediate
b
x
a
products
(4 33-bit)
x
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Re
x
x
+
Accumulator
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-147
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev_dotp4hgssmfaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-148 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgssmfaa3 __ev_dotp4hgssmfaa3
Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo, Fractional and
Accumulate 3 operand
d = __ev_dotp4hgssmfaa3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:32
b
0:15
sf
c
0:15
temph2
0:32
b
16:31
sf
c
16:31
temph
0:63
EXTS
64
(temph1
0:32
) - EXTS
64
(temph2
0:32
)
//low dot - calculate real part of complex product
templ1
0:32
b
32:47
si
c
32:47
templ2
0:32
b
48:63
si
c
48:63
templ
0:63
EXTS
64
(templ1
0:32
) - EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
+ a
0:63
ACC
0:63
d
0:63
Halfword pairs of signed fractional elements in the high halfwords of parameter b are multiplied
with the corresponding high halfwords of parameter c, producing a pair of 33-bit products which
are sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended 33-bit products of the halfword pairs of signed
fractional elements from the low halfwords of parameter b and the corresponding low halfwords
of parameter c, then added together with the contents of parameter a to produce a 64-bit result, and
the sum is placed into parameter d and the accumulator. This instruction is used to produce the real
portion of a guarded complex dot product.
Other registers altered: ACC
Figure 3-130. Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo,
Fractional and Accumulate 3op (__ev_dotp4hgssmfaa3)
15 16 47 48 0 31 32 63
Intermediate
c
x
b
products
(4 33-bit)
x
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Re
x
x
+
a
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-149
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hgssmfaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-150 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgssmi[a] __ev_dotp4hgssmi[a]
Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo, Integer (to
Accumulator)
d = __ev_dotp4hgssmi (a,b) (A = 0)
d = __ev_dotp4hgssmia (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
//low dot - calculate real part of complex product
templ1
0:31
a
32:47
si
b
32:47
templ2
0:31
a
48:63
si
b
48:63
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter a are multiplied with
the corresponding high halfwords of parameter b, producing a pair of 32-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended products of the halfword pairs of signed integer
elements from the low halfwords of parameter a and the corresponding low halfwords of
parameter b, and the sum is placed into parameter d. If A = 1, the result in parameter d is also
placed into the accumulator. This instruction is used to produce the real portion of a guarded
complex dot product.
Other registers altered: ACC (if A=1)
Figure 3-131. Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo,
Integer (to Accumulator) (__ev_dotp4hgssmi[a])
15 16 47 48 0 31 32 63
Intermediate
b
x
a
products
(4 32-bit)
x
-
d (and Accumulator if A=1)
-
Re Img Re Img
Re Img Re Img
Re
x
x
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-151
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgssmi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgssmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-152 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgssmiaa __ev_dotp4hgssmiaa
Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo, Integer and
Accumulate
d = __ev_dotp4hgssmiaa (a,b)
// high dot - calculate real part of complex product
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
//low dot - calculate real part of complex product
templ1
0:31
a
32:47
si
b
32:47
templ2
0:31
a
48:63
si
b
48:63
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter a are multiplied with
the corresponding high halfwords of parameter b, producing a pair of 32-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended products of the halfword pairs of signed integer
elements from the low halfwords of parameter a and the corresponding low halfwords of
parameter b, and then added together with the contents of the accumulator, and the sum is placed
into parameter d and the accumulator. This instruction is used to produce the real portion of a
guarded complex dot product.
Other registers altered: ACC
Figure 3-132. Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo,
Integer and Accumulate (__ev_dotp4hgssmiaa)
15 16 47 48 0 31 32 63
Intermediate
b
x
a
products
(4 32-bit)
x
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Re
x
x
+
Accumulator
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-153
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hgssmiaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-154 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hgssmiaa3 __ev_dotp4hgssmiaa3
Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo, Integer and
Accumulate 3 operand
d = __ev_dotp4hgssmiaa3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:31
b
0:15
si
c
0:15
temph2
0:31
b
16:31
si
c
16:31
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
//low dot - calculate real part of complex product
templ1
0:31
b
32:47
si
c
32:47
templ2
0:31
b
48:63
si
c
48:63
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
+ a
0:63
ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter b are multiplied with
the corresponding high halfwords of parameter c, producing a pair of 32-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended products of the halfword pairs of signed integer
elements from the low halfwords of parameter b and the corresponding low halfwords of
parameter c, then added together with the contents of parameter a to produce a 64-bit result, and
the sum is placed into parameter d and the accumulator. This instruction is used to produce the real
portion of a guarded complex dot product.
Other registers altered: ACC
Figure 3-133. Vector Dot Product of Four Halfwords, Guarded, Subtract, Signed, Modulo,
Integer and Accumulate 3op (__ev_dotp4hgssmiaa3)
15 16 47 48 0 31 32 63
Intermediate
c
x
b
products
(4 32-bit)
x
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Re
x
x
+
a
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-155
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hgssmiaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-156 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgasmf[a] __ev_dotp4hxgasmf[a]
Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed, Modulo, Fractional (to
Accumulator)
d = __ev_dotp4hxgasmf (a,b) (A = 0)
d = __ev_dotp4hxgasmfa (a,b) (A = 1)
// high dot - calculate imag part of complex product
temph1
0:32
b
0:15
sf
a
16:31
temph2
0:32
b
16:31
sf
a
0:15
temph
0:63
EXTS
64
(temph1
0:32
) + EXTS
64
(temph2
0:32
)
//low dot - calculate imag part of complex product
templ1
0:32
b
32:47
sf
a
48:63
templ2
0:32
b
48:63
sf
a
32:47
templ
0:63
EXTS
64
(templ1
0:32
) + EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Pairs of signed fractional elements in the high halfwords of parameter b are multiplied with the
exchanged high halfwords of parameter a, producing a pair of 33-bit products which are
sign-extended to 64 bits. The sum of this pair of intermediate products is added to the sum of the
corresponding sign-extended 33-bit products of the halfword pairs of signed fractional elements
from the low halfwords of parameter b and the exchanged low halfwords of parameter a to produce
a 64-bit result, and the sum is placed into parameter d. If A = 1, the result in parameter d is also
placed into the accumulator. This instruction is used to produce the imaginary portion of a guarded
complex dot product. Note: If the two input operands to a multiply are both -1.0, the intermediate
product is represented as +1.0 (
1
0 || 0x8000_0000).
Other registers altered: ACC (if A=1)
Figure 3-134. Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed,
Modulo, Fractional (to Accumulator) (__ev_dotp4hxgasmf[a])
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 33-bit)
+
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Imag
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-157
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgasmf d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgasmfa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-158 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgasmfaa __ev_dotp4hxgasmfaa
Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed, Modulo, Fractional
and Accumulate
d = __ev_dotp4hxgasmfaa (a,b)
// high dot - calculate imag part of complex product
temph1
0:32
b
0:15
sf
a
16:31
temph2
0:32
b
16:31
sf
a
0:15
temph
0:63
EXTS
64
(temph1
0:32
) + EXTS
64
(temph2
0:32
)
//low dot - calculate imag part of complex product
templ1
0:32
b
32:47
sf
a
48:63
templ2
0:32
b
48:63
sf
a
32:47
templ
0:63
EXTS
64
(templ1
0:32
) + EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
ACC
0:63
d
0:63
Pairs of signed fractional elements in the high halfwords of parameter b are multiplied with the
exchanged high halfwords of parameter a, producing a pair of 33-bit products which are
sign-extended to 64 bits. The sum of this pair of intermediate products is added to the sum of the
corresponding sign-extended 33-bit products of the halfword pairs of signed fractional elements
from the low halfwords of parameter b and the exchanged low halfwords of parameter a, then
added together with the contents of the accumulator to produce a 64-bit result, and the sum is
placed into parameter d and the accumulator. This instruction is used to produce the imaginary
portion of a guarded complex dot product. Note: If the two input operands to a multiply are both
-1.0, the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000).
Other registers altered: ACC
Figure 3-135. Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed,
Modulo, Fractional and Accumulate (__ev_dotp4hxgasmfaa)
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 33-bit)
+
d and Accumulator
+
Re Img Re Img
Re Img Re Img
Imag
+
Accumulator
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-159
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgasmfaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-160 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgasmfaa3 __ev_dotp4hxgasmfaa3
Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed, Modulo, Fractional
and Accumulate 3 operand
d = __ev_dotp4hxgasmfaa3 (a,b,c)
// high dot - calculate imag part of complex product
temph1
0:32
c
0:15
sf
b
16:31
temph2
0:32
c
16:31
sf
b
0:15
temph
0:63
EXTS
64
(temph1
0:32
) + EXTS
64
(temph2
0:32
)
//low dot - calculate imag part of complex product
templ1
0:32
c
32:47
sf
b
48:63
templ2
0:32
c
48:63
sf
b
32:47
templ
0:63
EXTS
64
(templ1
0:32
) + EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
+ a
0:63
ACC
0:63
d
0:63
Pairs of signed fractional elements in the high halfwords of parameter c are multiplied with the
exchanged high halfwords of parameter b, producing a pair of 33-bit products which are
sign-extended to 64 bits. The sum of this pair of intermediate products is added to the sum of the
corresponding sign-extended 33-bit products of the halfword pairs of signed fractional elements
from the low halfwords of parameter c and the exchanged low halfwords of parameter b, then
added together with the contents of parameter a to produce a 64-bit result, and the sum is placed
into parameter d and the accumulator. This instruction is used to produce the imaginary portion of
a guarded complex dot product. Note: If the two input operands to a multiply are both -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000).
Other registers altered: ACC
Figure 3-136. Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed,
Modulo, Fractional and Accumulate 3 op (__ev_dotp4hxgasmfaa3)
15 16 47 48 0 31 32 63
Intermediate
c
b
products
(4 33-bit)
+
d and Accumulator
+
Re Img Re Img
Re Img Re Img
Imag
+
a
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-161
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hxgasmfaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-162 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgasmi[a] __ev_dotp4hxgasmi[a]
Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed, Modulo, Integer (to
Accumulator)
d = __ev_dotp4hxgasmi (a,b) (A = 0)
d = __ev_dotp4hxgasmia (a,b) (A = 1)
// high dot - calculate imag part of complex product
temph1
0:31
b
0:15
si
a
16:31
temph2
0:31
b
16:31
si
a
0:15
temph
0:63
EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
)
//low dot - calculate imag part of complex product
templ1
0:31
b
32:47
si
a
48:63
templ2
0:31
b
48:63
si
a
32:47
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter b are multiplied with
the exchanged high halfwords of parameter a, producing a pair of 32-bit products which are
sign-extended to 64 bits. The sum of this pair of intermediate products is added to the sum of the
corresponding sign-extended products of the halfword pairs of signed integer elements from the
low halfwords of parameter b and the exchanged low halfwords of parameter a to produce a 64-bit
result, and the sum is placed into parameter d. If A = 1, the result in parameter d is also placed into
the accumulator. This instruction is used to produce the imaginary portion of a guarded complex
dot product.
Other registers altered: ACC (if A=1)
Figure 3-137. Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed,
Modulo, Integer (to Accumulator) (__ev_dotp4hxgasmi[a])
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 32-bit)
+
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Imag
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-163
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgasmi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgasmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-164 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgasmiaa __ev_dotp4hxgasmiaa
Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed, Modulo, Integer and
Accumulate
d = __ev_dotp4hxgasmiaa (a,b)
// high dot - calculate imag part of complex product
temph1
0:31
b
0:15
si
a
16:31
temph2
0:31
b
16:31
si
a
0:15
temph
0:63
EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
)
//low dot - calculate imag part of complex product
templ1
0:31
b
32:47
si
a
48:63
templ2
0:31
b
48:63
si
a
32:47
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter b are multiplied with
the exchanged high halfwords of parameter a, producing a pair of 32-bit products which are
sign-extended to 64 bits. The sum of this pair of intermediate products is added to the sum of the
corresponding sign-extended products of the halfword pairs of signed integer elements from the
low halfwords of parameter b and the exchanged low halfwords of parameter a, then added
together with the contents of the accumulator to produce a 64-bit result, and the sum is placed into
parameter d and the accumulator. This instruction is used to produce the real portion of a guarded
complex dot product.
Other registers altered: ACC
Figure 3-138. Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed,
Modulo, Integer and Accumulate (__ev_dotp4hxgasmiaa)
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 32-bit)
+
d and Accumulator
+
Re Img Re Img
Re Img Re Img
Imag
+
Accumulator
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-165
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgasmiaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-166 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgasmiaa3 __ev_dotp4hxgasmiaa3
Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed, Modulo, Integer and
Accumulate 3 operand
d = __ev_dotp4hxgasmiaa3 (a,b,c)
// high dot - calculate imag part of complex product
temph1
0:31
c
0:15
si
b
16:31
temph2
0:31
c
16:31
si
b
0:15
temph
0:63
EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
)
//low dot - calculate imag part of complex product
templ1
0:31
c
32:47
si
b
48:63
templ2
0:31
c
48:63
si
b
32:47
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
+ a
0:63
ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter c are multiplied with
the exchanged high halfwords of parameter b, producing a pair of 32-bit products which are
sign-extended to 64 bits. The sum of this pair of intermediate products is added to the sum of the
corresponding sign-extended products of the halfword pairs of signed integer elements from the
low halfwords of parameter c and the exchanged low halfwords of parameter b, then added
together with the contents of parameter a to produce a 64-bit result, and the sum is placed into
parameter d and the accumulator. This instruction is used to produce the real portion of a guarded
complex dot product.
Other registers altered: ACC
Figure 3-139. Vector Dot Product of Four Halfwords Exchanged, Guarded, Add, Signed,
Modulo, Integer and Accumulate 3 op (__ev_dotp4hxgasmiaa3)
15 16 47 48 0 31 32 63
Intermediate
c
b
products
(4 32-bit)
+
d and Accumulator
+
Re Img Re Img
Re Img Re Img
Imag
+
a
+
x
x
exts exts exts exts
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-167
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hxgasmiaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-168 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgssmf[a] __ev_dotp4hxgssmf[a]
Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract, Signed, Modulo,
Fractional (to Accumulator)
d = __ev_dotp4hxgssmf (a,b) (A = 0)
d = __ev_dotp4hxgssmfa (a,b) (A = 1)
// high dot - calculate imag part of complex x complex conjugate product
temph1
0:32
b
0:15
sf
a
16:31
temph2
0:32
b
16:31
sf
a
0:15
temph
0:63
EXTS
64
(temph1
0:32
) - EXTS
64
(temph2
0:32
)
//low dot - calculate imag part of complex x complex conjugate product
templ1
0:32
b
32:47
sf
a
48:63
templ2
0:32
b
48:63
sf
a
32:47
templ
0:63
EXTS
64
(templ1
0:32
) - EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Pairs of signed fractional elements in the high halfwords of parameter b are multiplied with the
exchanged high halfwords of parameter a, producing a pair of 33-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended 33-bit products of the halfword pairs of signed
fractional elements from the low halfwords of parameter b and the exchanged low halfwords of
parameter a to produce a 64-bit result, and the sum is placed into parameter d. If A = 1, the result
in parameter d is also placed into the accumulator. This instruction is used to produce the
imaginary portion of a guarded complex x complex conjugate dot product. If the two input
operands to a multiply are both -1.0, the intermediate product is represented as +1.0 (
1
0 ||
0x8000_0000).
Other registers altered: ACC (if A=1)
Figure 3-140. Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract,
Signed, Modulo, Fractional (to Accumulator) (__ev_dotp4hxgssmf[a])
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 33-bit)
-
d (and Accumulator if A=1)
-
Re Img Re Img
Re Img Re Img
Imag
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-169
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgssmf d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgssmfa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-170 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgssmfaa __ev_dotp4hxgssmfaa
Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract, Signed, Modulo,
Fractional and Accumulate
d = __ev_dotp4hxgssmfaa (a,b)
// high dot - calculate imag part of complex x complex conjugate product
temph1
0:32
b
0:15
sf
a
16:31
temph2
0:32
b
16:31
sf
a
0:15
temph
0:63
EXTS
64
(temph1
0:32
) - EXTS
64
(temph2
0:32
)
//low dot - calculate imag part of complex x complex conjugate product
templ1
0:32
b
32:47
sf
a
48:63
templ2
0:32
b
48:63
sf
a
32:47
templ
0:63
EXTS
64
(templ1
0:32
) - EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
ACC
0:63
d
0:63
Pairs of signed fractional elements in the high halfwords of parameter b are multiplied with the
exchanged high halfwords of parameter a, producing a pair of 33-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended 33-bit products of the halfword pairs of signed
fractional elements from the low halfwords of parameter b and the exchanged low halfwords of
parameter a, then added together with the contents of the accumulator to produce a 64-bit result,
and the sum is placed into parameter d and the accumulator. This instruction is used to produce
the imaginary portion of a guarded complex x complex conjugate dot product. If the two input
operands to a multiply are both -1.0, the intermediate product is represented as +1.0 (
1
0 ||
0x8000_0000).
Other registers altered: ACC
Figure 3-141. Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract,
Signed, Modulo, Fractional and Accumulate (__ev_dotp4hxgssmfaa)
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 33-bit)
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Imag
+
Accumulator
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-171
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgssmfaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-172 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgssmfaa3 __ev_dotp4hxgssmfaa3
Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract, Signed, Modulo,
Fractional and Accumulate 3 operand
d = __ev_dotp4hxgssmfaa3 (a,b,c)
// high dot - calculate imag part of complex x complex conjugate product
temph1
0:32
c
0:15
sf
b
16:31
temph2
0:32
c
16:31
sf
b
0:15
temph
0:63
EXTS
64
(temph1
0:32
) - EXTS
64
(temph2
0:32
)
//low dot - calculate imag part of complex x complex conjugate product
templ1
0:32
c
32:47
sf
b
48:63
templ2
0:32
c
48:63
sf
b
32:47
templ
0:63
EXTS
64
(templ1
0:32
) - EXTS
64
(templ2
0:32
)
d
0:63
temph
0:63
+ templ
0:63
+ a
0:63
ACC
0:63
d
0:63
Pairs of signed fractional elements in the high halfwords of parameter c are multiplied with the
exchanged high halfwords of parameter b, producing a pair of 33-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended 33-bit products of the halfword pairs of signed
fractional elements from the low halfwords of parameter c and the exchanged low halfwords of
parameter b, then added together with the contents of parameter a to produce a 64-bit result, and
the sum is placed into parameter d and the accumulator. This instruction is used to produce the
imaginary portion of a guarded complex x complex conjugate dot product. Note: If the two input
operands to a multiply are both -1.0, the intermediate product is represented as +1.0 (
1
0 ||
0x8000_0000).
Other registers altered: ACC
Figure 3-142. Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract,
Signed, Modulo, Fractional and Accumulate 3 op (__ev_dotp4hxgssmfaa3)
15 16 47 48 0 31 32 63
Intermediate
c
b
products
(4 33-bit)
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Imag
+
a
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-173
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hxgssmfaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-174 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgssmi[a] __ev_dotp4hxgssmi[a]
Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract, Signed, Modulo, Integer
(to Accumulator)
d = __ev_dotp4hxgssmi (a,b) (A = 0)
d = __ev_dotp4hxgssmia (a,b) (A = 1)
// high dot - calculate imag part of complex x complex conjugate product
temph1
0:31
b
0:15
si
a
16:31
temph2
0:31
b
16:31
si
a
0:15
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
//low dot - calculate imag part of complex x complex conjugate product
templ1
0:31
b
32:47
si
a
48:63
templ2
0:31
b
48:63
si
a
32:47
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter b are multiplied with
the exchanged high halfwords of parameter a, producing a pair of 32-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended products of the halfword pairs of signed integer
elements from the low halfwords of parameter b and the exchanged low halfwords of parameter a
to produce a 64-bit result, and the sum is placed into parameter d. If A = 1, the result in parameter
d is also placed into the accumulator. This instruction is used to produce the imaginary portion of
a guarded complex x complex conjugate dot product.
Other registers altered: ACC (if A=1)
Figure 3-143. Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract,
Signed, Modulo, Integer (to Accumulator) (__ev_dotp4hxgssmi[a])
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 32-bit)
-
d (and Accumulator if A=1)
-
Re Img Re Img
Re Img Re Img
Imag
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-175
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgssmi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgssmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-176 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgssmiaa __ev_dotp4hxgssmiaa
Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract, Signed, Modulo, Integer
and Accumulate
d = __ev_dotp4hxgssmiaa (a,b)
// high dot - calculate imag part of complex x complex conjugate product
temph1
0:31
b
0:15
si
a
16:31
temph2
0:31
b
16:31
si
a
0:15
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
//low dot - calculate imag part of complex x complex conjugate product
templ1
0:31
b
32:47
si
a
48:63
templ2
0:31
b
48:63
si
a
32:47
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter b are multiplied with
the exchanged high halfwords of parameter a, producing a pair of 32-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended products of the halfword pairs of signed integer
elements from the low halfwords of parameter b and the exchanged low halfwords of parameter a,
then added together with the contents of the accumulator to produce a 64-bit result, and the sum
is placed into parameter d and the accumulator. This instruction is used to produce the real portion
of a guarded complex x complex conjugate dot product.
Other registers altered: ACC
Figure 3-144. Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract,
Signed, Modulo, Integer and Accumulate (__ev_dotp4hxgssmiaa)
15 16 47 48 0 31 32 63
Intermediate
b
a
products
(4 32-bit)
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Imag
+
Accumulator
+
exts exts exts exts
x
x
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-177
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotp4hxgssmiaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-178 Freescale Semiconductor
SPE2 Operations
__ev_dotp4hxgssmiaa3 __ev_dotp4hxgssmiaa3
Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract, Signed, Modulo, Integer
and Accumulate 3 operand
d = __ev_dotp4hxgssmiaa3 (a,b,c)
// high dot - calculate imag part of complex x complex conjugate product
temph1
0:31
c
0:15
si
b
16:31
temph2
0:31
c
16:31
si
b
0:15
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
//low dot - calculate imag part of complex x complex conjugate product
templ1
0:31
c
32:47
si
b
48:63
templ2
0:31
c
48:63
si
b
32:47
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)
d
0:63
temph
0:63
+ templ
0:63
+ a
0:63
ACC
0:63
d
0:63
Halfword pairs of signed integer elements in the high halfwords of parameter c are multiplied with
the exchanged high halfwords of parameter b, producing a pair of 32-bit products which are
sign-extended to 64 bits. The difference of this pair of intermediate products is added to the
difference of the corresponding sign-extended products of the halfword pairs of signed integer
elements from the low halfwords of parameter c and the exchanged low halfwords of parameter b,
then added together with the contents of parameter a to produce a 64-bit result, and the sum is
placed into parameter d and the accumulator. This instruction is used to produce the real portion
of a guarded complex x complex conjugate dot product.
Other registers altered: ACC
Figure 3-145. Vector Dot Product of Four Halfwords Exchanged, Guarded, Subtract,
Signed, Modulo, Integer and Accumulate 3 op (__ev_dotp4hxgssmiaa3)
15 16 47 48 0 31 32 63
Intermediate
c
b
products
(4 32-bit)
-
d and Accumulator
-
Re Img Re Img
Re Img Re Img
Imag
+
a
+
x
x
exts exts exts exts
x
x
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-179
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotp4hxgssmiaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-180 Freescale Semiconductor
SPE2 Operations
__ev_dotpbasmi[a] __ev_dotpbasmi[a]
Vector Dot Product of Bytes, Add, Signed, Modulo, Integer (to Accumulator)
d = __ev_dotpbasmi (a,b) (A = 0)
d = __ev_dotpbasmia (a,b) (A = 1)
// high dot
temph
0:31
EXTS(a
0:7
si
b
0:7
) + EXTS(a
8:15
si
b
8:15
) +
EXTS(a
16:23
si
b
16:23
) +
EXTS(a
24:31
si
b
24:31
)
d
0:31
temph
0:31
//low
templ
0:31
EXTS(a
32:39
si
b
32:39
) + EXTS(a
40:47
si
b
40:47
) +
EXTS(a
48:55
si
b
48:55
)
+ EXTS(a
56:63
si
b
56:63
)
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of signed integer elements in
parameters a and b are multiplied producing four 16-bit products. This quad of intermediate
products are sign-extended to 32 bits and added together and the sum is placed into the
corresponding parameter d word. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-146. Vector Dot Product of Bytes, Add, Signed, Modulo, Integer (to Accumulator)
(__ev_dotpbasmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbasmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbasmia d,a,b
0 32 63
Intermediate
b
a
products
d (and Accumulator if A=1)
+
24 16 8 40 48 56
x x x x x x x x
exts exts exts exts
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-181
__ev_dotpbasmiaaw __ev_dotpbasmiaaw
Vector Dot Product of Bytes, Add, Signed, Modulo, Integer and Accumulate into Words
d = __ev_dotpbasmiaaw (a,b)
// high dot
temph
0:31
EXTS(a
0:7
si
b
0:7
) + EXTS(a
8:15
si
b
8:15
) +
EXTS(a
16:23
si
b
16:23
) +
EXTS(a
24:31
si
b
24:31
) + ACC
0:31
d
0:31
temph
0:31
//low
templ
0:31
EXTS(a
32:39
si
b
32:39
) + EXTS(a
40:47
si
b
40:47
) +
EXTS(a
48:55
si
b
48:55
)
+ EXTS(a
56:63
si
b
56:63
) + ACC
32:63
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of signed integer elements in
parameters a and b are multiplied producing four 16-bit products. This quad of intermediate
products are sign-extended to 32 bits and added together with the contents of the corresponding
accumulator word and the sum is placed into the corresponding parameter d and accumulator
word.
Other registers altered: ACC
Figure 3-147. Vector Dot Product of Bytes, Add, Signed, Modulo, Integer and Accumulate
Words (__ev_dotpbasmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbasmiaaw d,a,b
0 32 63
Intermediate
b
a
products
Accumulator
+
24 16 8 40 48 56
x x x x x x x x
exts exts exts exts
+
exts exts exts exts
+ +
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-182 Freescale Semiconductor
SPE2 Operations
__ev_dotpbasmiaaw3 __ev_dotpbasmiaaw3
Vector Dot Product of Bytes, Add, Signed, Modulo, Integer and Accumulate into Words, 3
operand
d = __ev_dotpbasmiaaw3 (a,b,c)
// high dot
temph
0:31
EXTS
32
(b
0:7
si
c
0:7
) + EXTS
32
(b
8:15
si
c
8:15
) +
EXTS
32
(b
16:23
si
c
16:23
)
+ EXTS
32
(b
24:31
si
c
24:31
) + a
0:31
d
0:31
temph
0:31
//low
templ
0:31
EXTS
32
(b
32:39
si
c
32:39
) + EXTS
32
(b
40:47
si
c
40:47
)
+
EXTS
32
(b
48:55
si
c
48:55
) + EXTS
32
(b
56:63
si
c
56:63
) + a
32:63
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of signed integer elements in
parameters b and c are multiplied producing four 16-bit products. This quad of intermediate
products are sign-extended to 32 bits and added together with the contents of the corresponding
parameter a word and the sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-148. Vector Dot Product of Bytes, Add, Signed, Modulo, Integer and Accumulate
Words 3 op (__ev_dotpbasmiaaw3)
0 32 63
Intermediate
c
b
products
a
+
24 16 8 40 48 56
x x x x x x x x
exts exts exts exts
+
exts exts exts exts
+ +
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-183
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpbasmiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-184 Freescale Semiconductor
SPE2 Operations
__ev_dotpbasumi[a] __ev_dotpbasumi[a]
Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer (to Accumulator)
d = __ev_dotpbasumi (a,b) (A = 0)
d = __ev_dotpbasumia (a,b) (A = 1)
// high dot
temph
0:31
EXTS(a
0:7
sui
b
0:7
) + EXTS(a
8:15
sui
b
8:15
) +
EXTS(a
16:23
sui
b
16:23
) +
EXTS(a
24:31
sui
b
24:31
)
d
0:31
temph
0:31
//low
templ
0:31
EXTS(a
32:39
sui
b
32:39
) + EXTS(a
40:47
sui
b
40:47
) +
EXTS(a
48:55
sui
b
48:55
) + EXTS(a
56:63
sui
b
56:63
)
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of signed integer elements in
parameter a and unsigned integer elements in parameter b are multiplied producing four 16-bit
products. This quad of intermediate products are sign-extended to 32 bits and added together and
the sum is placed into the corresponding parameter d word. If A = 1, the result in parameter d is
also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-149. Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer (to
Accumulator) (__ev_dotpbasumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbasumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbasumia d,a,b
0 32 63
Intermediate
b
a
products
d (and Accumulator if A=1)
+
24 16 8 40 48 56
x x x x x x x x
exts exts exts exts
+
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-185
__ev_dotpbasumiaaw __ev_dotpbasumiaaw
Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer and Accumulate into
Words
d = __ev_dotpbasumiaaw (a,b)
// high dot
temph
0:31
EXTS(a
0:7
sui
b
0:7
) + EXTS(a
8:15
sui
b
8:15
) +
EXTS(a
16:23
sui
b
16:23
) +
EXTS(a
24:31
sui
b
24:31
) + ACC
0:31
d
0:31
temph
0:31
//low
templ
0:31
EXTS(a
32:39
sui
b
32:39
) + EXTS(a
40:47
sui
b
40:47
) +
EXTS(a
48:55
sui
b
48:55
) + EXTS(a
56:63
sui
b
56:63
) + ACC
32:63
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of signed integer elements in
parameter a and unsigned integer elements in parameter b are multiplied producing four 16-bit
products. This quad of intermediate products are sign-extended to 32 bits and added together with
the contents of the corresponding accumulator word and the sum is placed into the corresponding
parameter d and accumulator word.
Other registers altered: ACC
Figure 3-150. Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer and
Accumulate Words (__ev_dotpbasumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbasumiaaw d,a,b
0 32 63
Intermediate
b
a
products
Accumulator
+
24 16 8 40 48 56
x x x x x x x x
exts exts exts exts
+
exts exts exts exts
+ +
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-186 Freescale Semiconductor
SPE2 Operations
__ev_dotpbasumiaaw3 __ev_dotpbasumiaaw3
Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer and Accumulate into
Words, 3 operand
d = __ev_dotpbasumiaaw3 (a,b,c)
// high dot
temph
0:31
EXTS
32
(b
0:7
sui
c
0:7
) + EXTS
32
(b
8:15
sui
c
8:15
)
+
EXTS
32
(b
16:23
sui
c
16:23
) + EXTS
32
(b
24:31
sui
c
24:31
) + a
0:31
d
0:31
temph
0:31
//low
templ
0:31
EXTS
32
(b
32:39
sui
a
32:39
) + EXTS
32
(b
40:47
sui
c
40:47
)
+
EXTS
32
(b
48:55
sui
c
48:55
) + EXTS
32
(b
56:63
sui
c
56:63
) + a
32:63
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of signed integer elements in
b and unsigned integer elements in parameter c are multiplied producing four 16-bit products. This
quad of intermediate products are sign-extended to 32 bits and added together with the contents of
the corresponding parameter a word and the sum is placed into the corresponding parameter d and
accumulator word.
Other registers altered: ACC
Figure 3-151. Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer and
Accumulate Words 3 op (__ev_dotpbasumiaaw3)
0 32 63
Intermediate
c
b
products
a
+
24 16 8 40 48 56
x x x x x x x x
exts exts exts exts
+
exts exts exts exts
+ +
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-187
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpbasumiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-188 Freescale Semiconductor
SPE2 Operations
__ev_dotpbaumi[a] __ev_dotpbaumi[a]
Vector Dot Product of Bytes, Add, Unsigned, Modulo, Integer (to Accumulator)
d = __ev_dotpbaumi (a,b) (A = 0)
d = __ev_dotpbaumia (a,b) (A = 1)
// high dot
temph
0:31
EXTZ(a
0:7
ui
b
0:7
) + EXTZ(a
8:15
ui
b
8:15
) +
EXTZ(a
16:23
ui
b
16:23
) +
EXTZ(a
24:31
ui
b
24:31
)
d
0:31
temph
0:31
//low
templ
0:31
EXTZ(a
32:39
ui
b
32:39
) + EXTZ(a
40:47
ui
b
40:47
) +
EXTZ(a
48:55
ui
b
48:55
) + EXTZ(a
56:63
ui
b
56:63
)
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of unsigned integer elements
in parameters a and b are multiplied producing four 16-bit products. This quad of intermediate
products are zero-extended to 32 bits and added together and the sum is placed into the
corresponding parameter d word. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-152. Vector Dot Product of Bytes, Add, Unsigned, Modulo, Integer (to
Accumulator) (__ev_dotpbaumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbaumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbaumia d,a,b
0 32 63
Intermediate
b
a
products
d (and Accumulator if A=1)
+
24 16 8 40 48 56
x x x x x x x x
extz extz extz extz
+
extz extz extz extz
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-189
__ev_dotpbaumiaaw __ev_dotpbaumiaaw
Vector Dot Product of Bytes, Add, Unsigned, Modulo, Integer and Accumulate into Words
d = __ev_dotpbaumiaaw (a,b)
// high dot
temph
0:31
EXTZ(a
0:7
ui
a
0:7
) + EXTZ(a
8:15
ui
b
8:15
) +
EXTZ(a
16:23
ui
b
16:23
) +
EXTZ(a
24:31
ui
b
24:31
) + ACC
0:31
d
0:31
temph
0:31
//low
templ
0:31
EXTZ(a
32:39
ui
b
32:39
) + EXTZ(a
40:47
ui
b
40:47
) +
EXTZ(a
48:55
ui
b
48:55
) + EXTZ(a
56:63
ui
b
56:63
) + ACC
32:63
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of unsigned integer elements
in parameter a and parameter b are multiplied producing four 16-bit products. This quad of
intermediate products are zero-extended to 32 bits and added together with the contents of the
corresponding accumulator word and the sum is placed into the corresponding parameter d and
accumulator word.
Other registers altered: ACC
Figure 3-153. Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer and
Accumulate Words (__ev_dotpbaumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpbaumiaaw d,a,b
0 32 63
Intermediate
b
a
products
Accumulator
+
24 16 8 40 48 56
x x x x x x x x
extz extz extz extz
+
extz extz extz extz
+ +
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-190 Freescale Semiconductor
SPE2 Operations
__ev_dotpbaumiaaw3 __ev_dotpbaumiaaw3
Vector Dot Product of Bytes, Add, Unsigned, Modulo, Integer and Accumulate into Words, 3
operand
d = __ev_dotpbaumiaaw3 (a,b,c)
// high dot
temph
0:31
EXTZ
32
(b
0:7
ui
c
0:7
) + EXTZ
32
(b
8:15
ui
c
8:15
) +
EXTZ
32
(b
16:23
ui
c
16:23
)
+ EXTZ
32
(b
24:31
ui
c
24:31
) + a
0:31
d
0:31
temph
0:31
//low
templ
0:31
EXTZ
32
(b
32:39
ui
c
32:39
) + EXTZ
32
(b
40:47
ui
c
40:47
)
+
EXTZ
32
(b
48:55
ui
c
48:55
) + EXTZ
32
(b
56:63
ui
c
56:63
) + a
32:63
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding byte pairs of unsigned integer elements
in parameters b and c are multiplied producing four 16-bit products. This quad of intermediate
products are zero-extended to 32 bits and added together with the contents of the corresponding
parameter a word and the sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-154. Vector Dot Product of Bytes, Add, Signed by Unsigned, Modulo, Integer and
Accumulate Words 3 op (__ev_dotpbaumiaaw3)
0 32 63
Intermediate
c
b
products
a
+
24 16 8 40 48 56
x x x x x x x x
extz extz extz extz
+
extz extz extz extz
+ +
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-191
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpbaumiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-192 Freescale Semiconductor
SPE2 Operations
__ev_dotphasmi[a] __ev_dotphasmi[a]
Vector Dot Product of Half Words, Add, Signed, Modulo, Integer (to Accumulator)
d = __ev_dotphasmi (a,b) (A = 0)
d = __ev_dotphasmia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
si
b
32:47
templ2
0:31
a
48:63
si
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
// modulo sum
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 32-bit products. This pair of intermediate
32-bit products is added together and the sum is placed into the corresponding parameter d word.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-155. Vector Dot Product of Half Words, Add, Signed, Modulo, Integer (to
Accumulator) (__ev_dotphasmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasmia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+
d (and Accumulator if A=1)
+
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-193
__ev_dotphasmiaaw __ev_dotphasmiaaw
Vector Dot Product of Half Words, Add, Signed, Modulo, Integer and Accumulate into Words
d = __ev_dotphasmiaaw (a,b)
// high dot
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
+ ACC
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
si
b
32:47
templ2
0:31
a
48:63
si
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ ACC
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 32-bit products. This pair of intermediate
32-bit products is added together with the contents of the corresponding accumulator word and the
sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-156. Vector Dot Product of Half Words, Add, Signed, Modulo, Integer and
Accumulate Words (__ev_dotphasmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasmiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-194 Freescale Semiconductor
SPE2 Operations
__ev_dotphasmiaaw3 __ev_dotphasmiaaw3
Vector Dot Product of Halfwords, Add, Signed, Modulo, Integer and Accumulate into Words, 3
operand
d = __ev_dotphasmiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
si
c
0:15
temph2
0:31
b
16:31
si
c
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
+ a
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
b
32:47
si
c
32:47
templ2
0:31
b
48:63
si
c
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ a
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameters b and c are multiplied producing a pair of 32-bit products. This pair of intermediate
32-bit products is added together with the contents of the corresponding parameter a word and the
sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-157. Vector Dot Product of Halfwords, Add, Signed, Modulo, Integer and
Accumulate Words 3 op (__ev_dotphasmiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
+ +
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-195
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphasmiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-196 Freescale Semiconductor
SPE2 Operations
__ev_dotphassf[a] __ev_dotphassf[a]
Vector Dot Product of Half Words, Add, Signed, Saturate, Fractional (to Accumulator)
d = __ev_dotphassf (a,b) (A = 0)
d = __ev_dotphassfa (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS(temph1
0:31
) + EXTS(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS(templ1
0:31
) + EXTS(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding half word pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. This pair of intermediate 32-bit
products is added together, saturating if overflow or underflow occurs, and the sum is placed into
the corresponding parameter d word. If A = 1, the result in parameter d is also placed into the
accumulator
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-197
Figure 3-158. Vector Dot Product of Half Words, Add, Signed, Saturate, Fractional (to
Accumulator) (__ev_dotphassf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassfa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-198 Freescale Semiconductor
SPE2 Operations
__ev_dotphassfaaw __ev_dotphassfaaw
Vector Dot Product of Half Words, Add, Signed, Saturate, Fractional and Accumulate into Words
d = __ev_dotphassfaaw (a,b)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS(temph1
0:31
) + EXTS(temph2
0:31
) + EXTS(ACC
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS(templ1
0:31
) + EXTS(templ2
0:31
) + EXTS(ACC
32:63
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding half word pairs of signed fractional
elements in parameter a and parameter b are multiplied producing a pair of 32-bit products. If both
inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. This pair of intermediate 32-bit
products is added together with the contents of the corresponding accumulator word, saturating if
overflow or underflow occurs, and the sum is placed into the corresponding parameter d and
accumulator word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-199
Figure 3-159. Vector Dot Product of Half Words, Add, Signed, Saturate, Fractional and
Accumulate Words (__ev_dotphassfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassfaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-200 Freescale Semiconductor
SPE2 Operations
__ev_dotphassfaaw3 __ev_dotphassfaaw3
Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional and Accumulate into Words,
3 operand
d = __ev_dotphassfaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sf
c
0:15
if (b
0:15
= 0x8000) & (c
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
16:31
sf
c
16:31
if (b
16:31
= 0x8000) & (c
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
32:47
sf
c
32:47
if (b
32:47
= 0x8000) & (c
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
48:63
sf
c
48:63
if (b
48:63
= 0x8000) & (c
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters b and c are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. This pair of intermediate 32-bit
products is added together with the contents of the corresponding parameter a word, saturating if
overflow or underflow occurs, and the sum is placed into the corresponding parameter d and
accumulator word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-201
Figure 3-160. Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional and
Accumulate Words 3 op (__ev_dotphassfaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphassfaaw3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
+ +
+ +
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-202 Freescale Semiconductor
SPE2 Operations
__ev_dotphassfr[a] __ev_dotphassfr[a]
Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional, Round (to Accumulator)
d = __ev_dotphassfr (a,b) (A = 0)
d = __ev_dotphassfra (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. This pair of intermediate 32-bit
products is added together and rounded to 16 bits using the current fractional rounding mode in
SPEFCSR, saturating if overflow or underflow occurs, and placed into the corresponding
parameter d word. If A = 1, the result in parameter d is also placed into the accumulator
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-203
Figure 3-161. Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional, Round
(to Accumulator) (__ev_dotphassfr[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassfra d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ w/round + w/round
d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-204 Freescale Semiconductor
SPE2 Operations
__ev_dotphassfraaw __ev_dotphassfraaw
Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional, Round and Accumulate into
Words
d = __ev_dotphassfraaw (a,b)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in a and b are multiplied producing a pair of 32-bit products. If both inputs of a multiply
are -1.0, the result saturates to 0x7FFF_FFFF. This pair of intermediate 32-bit products is added
together with the contents of the corresponding accumulator word and rounded to 16 bits using the
current fractional rounding mode in SPEFCSR, saturating if overflow or underflow occurs, and
placed into the corresponding d and accumulator word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation and round. It is implementation dependent
whether the saturation is detected before or after rounding for the accumulate. The pseudocode
above assumes detection after rounding.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-205
Figure 3-162. Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional, Round
and Accumulate Words (__ev_dotphassfraaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassfraaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
+w/round +w/round
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-206 Freescale Semiconductor
SPE2 Operations
__ev_dotphassfraaw3 __ev_dotphassfraaw3
Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional, Round and Accumulate into
Words, 3 operand
d = __ev_dotphassfraaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sf
c
0:15
if (b
0:15
= 0x8000) & (c
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
16:31
sf
c
16:31
if (b
16:31
= 0x8000) & (c
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
b
32:47
sf
c
32:47
if (b
32:47
= 0x8000) & (c
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
48:63
sf
c
48:63
if (b
48:63
= 0x8000) & (c
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters b and c are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. This pair of intermediate 32-bit
products is added together with the contents of the corresponding parameter a word and rounded
to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if overflow or
underflow occurs, and placed into the corresponding parameter d and accumulator word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation and round. It is implementation dependent
whether the saturation is detected before or after rounding for the accumulate. The pseudocode
above assumes detection after rounding.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-207
Figure 3-163. Vector Dot Product of Halfwords, Add, Signed, Saturate, Fractional, Round
and Accumulate Words 3 op (__ev_dotphassfraaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphassfraaw3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
+ +
+w/round +w/round
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-208 Freescale Semiconductor
SPE2 Operations
__ev_dotphassi[a] __ev_dotphassi[a]
Vector Dot Product of Half Words, Add, Signed, Saturate, Integer (to Accumulator)
d = __ev_dotphassi (a,b) (A = 0)
d = __ev_dotphassia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
si
b
0:15
; temph2
0:31
a
16:31
si
b
16:31
temph
0:63
EXTS(temph1
0:31
) + EXTS(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
si
b
32:47
; templ2
0:31
a
48:63
si
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ
0:63
EXTS(templ1
0:31
) + EXTS(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 32-bit products. This pair of intermediate
32-bit products is added together, saturating if overflow or underflow occurs, and the sum is placed
into the corresponding parameter d word. If A = 1, the result in parameter d is also placed into the
accumulator
Other registers altered: SPEFSCR, ACC (if A=1)
Figure 3-164. Vector Dot Product of Half Words, Add, Signed, Saturate, Integer (to
Accumulator) (__ev_dotphassi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
d (and Accumulator if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-209
__ev_dotphassiaaw __ev_dotphassiaaw
Vector Dot Product of Half Words, Add, Signed, Saturate, Integer and Accumulate into Words
d = __ev_dotphassiaaw (a,b)
// high dot
temph1
0:31
a
0:15
si
b
0:15
; temph2
0:31
a
16:31
si
b
16:31
temph
0:63
EXTS(temph1
0:31
) + EXTS(temph2
0:31
) + EXTS(ACC
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
si
b
32:47
; templ2
0:31
a
48:63
si
b
48:63
templ
0:63
EXTS(templ1
0:31
) + EXTS(templ2
0:31
) + EXTS(ACC
32:63
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 32-bit products. This pair of intermediate
32-bit products is added together with the contents of the corresponding accumulator word,
saturating if overflow or underflow occurs, and the sum is placed into the corresponding parameter
d and accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-165. Vector Dot Product of Half Words, Add, Signed, Saturate, Integer and
Accumulate (__ev_dotphassiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphassiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-210 Freescale Semiconductor
SPE2 Operations
__ev_dotphassiaaw3 __ev_dotphassiaaw3
Vector Dot Product of Halfwords, Add, Signed, Saturate, Integer and Accumulate into Words, 3
operand
d = __ev_dotphassiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
si
c
0:15
; temph2
0:31
b
16:31
si
c
16:31
temph
0:63
EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
32:47
si
c
32:47
; templ2
0:31
b
48:63
si
c
48:63
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameters b and c are multiplied producing a pair of 32-bit products. This pair of intermediate
32-bit products is added together with the contents of the corresponding parameter a word,
saturating if overflow or underflow occurs, and the sum is placed into the corresponding parameter
d and accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-166. Vector Dot Product of Halfwords, Add, Signed, Saturate, Integer and
Accumulate 3 op (__ev_dotphassiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
+ +
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-211
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphassiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-212 Freescale Semiconductor
SPE2 Operations
__ev_dotphasumi[a] __ev_dotphasumi[a]
Vector Dot Product of Half Words, Add, Signed by Unsigned, Modulo, Integer (to Accumulator)
d = __ev_dotphasumi (a,b) (A = 0)
d = __ev_dotphasumia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sui
b
0:15
temph2
0:31
a
16:31
sui
b
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
sui
b
32:47
templ2
0:31
a
48:63
siu
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
// modulo sum
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameter a and unsigned integer elements in parameter b are multiplied producing a pair of
32-bit products. This pair of intermediate 32-bit products is added together and the sum is placed
into the corresponding parameter d word. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-167. Vector Dot Product of Half Words, Add, Signed by Unsigned, Modulo, Integer
(to Accumulator) (__ev_dotphasumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasumia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(4 32-bit)
x x
+
d (and Accumulator if A=1)
+
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-213
__ev_dotphasumiaaw __ev_dotphasumiaaw
Vector Dot Product of Half Words, Add, Signed by Unsigned, Modulo, Integer and Accumulate
into Words
d = __ev_dotphasumiaaw (a,b)
// high dot
temph1
0:31
a
0:15
sui
b
0:15
temph2
0:31
a
16:31
sui
b
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
+ ACC
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
sui
b
32:47
templ2
0:31
a
48:63
sui
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ ACC
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameter a and unsigned integer elements in parameter b are multiplied producing a pair of
32-bit products. This pair of intermediate 32-bit products is added together with the contents of
the corresponding accumulator word and the sum is placed into the corresponding parameter d and
accumulator word.
Other registers altered: ACC
Figure 3-168. Vector Dot Product of Half Words, Add, Signed by Unsigned, Modulo, Integer
and Accumulate Words (__ev_dotphasumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasumiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(4 32-bit)
x x
+ +
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-214 Freescale Semiconductor
SPE2 Operations
__ev_dotphasumiaaw3 __ev_dotphasumiaaw3
Vector Dot Product of Halfwords, Add, Signed by Unsigned, Modulo, Integer and Accumulate
into Words, 3 operand
d = __ev_dotphasumiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sui
c
0:15
temph2
0:31
b
16:31
sui
c
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
+ a
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
b
32:47
sui
c
32:47
templ2
0:31
b
48:63
sui
c
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ a
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameter b and unsigned integer elements in parameter c are multiplied producing a pair of
32-bit products. This pair of intermediate 32-bit products is added together with the contents of
the corresponding parameter a word and the sum is placed into the corresponding parameter d and
accumulator word.
Other registers altered: ACC
Figure 3-169. Vector Dot Product of Halfwords, Add, Signed by Unsigned, Modulo, Integer
and Accumulate Words 3 op (__ev_dotphasumiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c (unsigned)
x x
b (signed)
products
(4 32-bit)
x x
+ +
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-215
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphasumiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-216 Freescale Semiconductor
SPE2 Operations
__ev_dotphasusi[a] __ev_dotphasusi[a]
Vector Dot Product of Half Words, Add, Signed by Unsigned, Saturate, Integer (to Accumulator)
d = __ev_dotphasusi (a,b) (A = 0)
d = __ev_dotphasusia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sui
b
0:15
; temph2
0:31
a
16:31
sui
b
16:31
temph
0:63
EXTS(temph1
0:31
) + EXTS(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
sui
b
32:47
; templ2
0:31
a
48:63
sui
b
48:63
templ
0:63
EXTS(templ1
0:31
) + EXTS(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameter a and unsigned integer elements in parameter b are multiplied producing a pair of
32-bit products. This pair of intermediate 32-bit products is added together, saturating if overflow
or underflow occurs, and the sum is placed into the corresponding parameter d word. If A = 1, the
result in parameter d is also placed into the accumulator
Other registers altered: SPEFSCR, ACC (if A=1)
Figure 3-170. Vector Dot Product of Half Words, Add, Signed by Unsigned, Saturate,
Integer (to Accumulator) (__ev_dotphasusi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasusi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasusia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(4 32-bit)
x x
+ +
d (and Accumulator if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-217
__ev_dotphasusiaaw __ev_dotphasusiaaw
Vector Dot Product of Half Words, Add, Singed by Unsigned, Saturate, Integer and Accumulate
into Words
d = __ev_dotphasusiaaw (a,b)
// high dot
temph1
0:31
a
0:15
sui
b
0:15
; temph2
0:31
a
16:31
sui
b
16:31
temph
0:63
EXTS(temph1
0:31
) + EXTS(temph2
0:31
) + EXTS(ACC
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
sui
b
32:47
; templ2
0:31
a
48:63
sui
b
48:63
templ
0:63
EXTS(templ1
0:31
) + EXTS(templ2
0:31
) + EXTS(ACC
32:63
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameter a and unsigned integer elements in parameter b are multiplied producing a pair of
32-bit products. This pair of intermediate 32-bit products is added together with the contents of
the corresponding accumulator word, saturating if overflow or underflow occurs, and the sum is
placed into the corresponding parameter d and accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-171. Vector Dot Product of Half Words, Add, Signed by Unsigned, Saturate,
Integer and Accumulate (__ev_dotphasusiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphasusiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(4 32-bit)
x x
+ +
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-218 Freescale Semiconductor
SPE2 Operations
__ev_dotphasusiaaw3 __ev_dotphasusiaaw3
Vector Dot Product of Halfwords, Add, Signed by Unsigned, Saturate, Integer and Accumulate
into Words, 3 operand
d = __ev_dotphasusiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sui
c
0:15
; temph2
0:31
b
16:31
sui
c
16:31
temph
0:63
EXTS
64
(temph1
0:31
) + EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
32:47
sui
c
32:47
; templ2
0:31
b
48:63
sui
c
48:63
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameter b and unsigned integer elements in parameter c are multiplied producing a pair of
32-bit products. This pair of intermediate 32-bit products is added together with the contents of
the corresponding parameter a word, saturating if overflow or underflow occurs, and the sum is
placed into the corresponding parameter d and accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-172. Vector Dot Product of Halfwords, Add, Signed by Unsigned, Saturate,
Integer and Accumulate 3 op (__ev_dotphasusiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c (unsigned)
x x
b (signed)
products
(4 32-bit)
x x
+ +
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-219
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphasusiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-220 Freescale Semiconductor
SPE2 Operations
__ev_dotphaumi[a] __ev_dotphaumi[a]
Vector Dot Product of Half Words, Add, Unsigned, Modulo, Integer (to Accumulator)
d = __ev_dotphaumi (a,b) (A = 0)
d = __ev_dotphaumia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
ui
b
0:15
temph2
0:31
a
16:31
ui
b
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
ui
b
32:47
templ2
0:31
a
48:63
ui
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
// modulo sum
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For each word element in the destination, corresponding half word pairs of unsigned integer
elements in parameters a and b are multiplied producing a pair of 32-bit products. This pair of
intermediate 32-bit products is added together with a modulo sum, and the result is placed in
parameter d. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-173. Vector Dot Product of Half Words, Add, Unsigned, Modulo, Integer (to
Accumulator) (__ev_dotphaumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphaumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphaumia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+
d (and Accumulator if A=1)
+
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-221
__ev_dotphaumiaaw __ev_dotphaumiaaw
Vector Dot Product of Half Words, Add, Unsigned, Modulo, Integer and Accumulate into Words
d = __ev_dotphaumiaaw (a,b)
// high dot
temph1
0:31
a
0:15
ui
b
0:15
temph2
0:31
a
16:31
ui
b
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
+ ACC
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
ui
b
32:47
templ2
0:31
a
48:63
ui
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ ACC
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding half word pairs of unsigned integer
elements in parameters a and b are multiplied producing a pair of 32-bit products. This pair of
intermediate 32-bit products is added together with the contents of the corresponding accumulator
word and the sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-174. Vector Dot Product of Half Words, Add, Unsigned, Modulo, Integer and
Accumulate Words (__ev_dotphaumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphaumiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-222 Freescale Semiconductor
SPE2 Operations
__ev_dotphaumiaaw3 __ev_dotphaumiaaw3
Vector Dot Product of Halfwords, Add, Unsigned, Modulo, Integer and Accumulate into Words,
3 operand
d = __ev_dotphaumiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
ui
c
0:15
temph2
0:31
b
16:31
ui
c
16:31
temph
0:31
temph1
0:31
+ temph2
0:31
+ a
0:31
// modulo sum
d
0:31
temph
0:31
//low
templ1
0:31
b
32:47
ui
c
32:47
templ2
0:31
b
48:63
ui
c
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ a
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding halfword pairs of unsigned integer
elements in parameters b and c are multiplied producing a pair of 32-bit products. This pair of
intermediate 32-bit products is added together with the contents of the corresponding parameter a
word and the sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-175. Vector Dot Product of Halfwords, Add, Unsigned, Modulo, Integer and
Accumulate Words 3 op (__ev_dotphaumiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
+ +
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-223
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphaumiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-224 Freescale Semiconductor
SPE2 Operations
__ev_dotphausi[a] __ev_dotphausi[a]
Vector Dot Product of Half Words, Add, Unsigned, Saturate, Integer (to Accumulator)
d = __ev_dotphausi (a,b) (A = 0)
d = __ev_dotphausia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
ui
b
0:15
; temph2
0:31
a
16:31
ui
b
16:31
temph
0:63
EXTZ(temph1
0:31
) + EXTZ(temph2
0:31
)
ovh temph
30
| temph
31
d
0:31
SATURATE(ovh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
ui
b
32:47
; templ2
0:31
a
48:63
ui
b
48:63
templ
0:63
EXTZ(templ1
0:31
) + EXTZ(templ2
0:31
)
ovl templ
30
| templ
31
d
32:63
SATURATE(ovl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding half word pairs of unsigned integer
elements in parameter a and parameter b are multiplied producing a pair of 32-bit products. This
pair of intermediate 32-bit products is added together, saturating if overflow occurs, and the sum
is placed into the corresponding parameter d word. If A = 1, the result in parameter d is also placed
into the accumulator.
Other registers altered: SPEFSCR, ACC (if A = 1)
Figure 3-176. Vector Dot Product of Half Words, Add, Unsigned, Saturate, Integer [to
Accumulator] (__ev_dotphausi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphausi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphausia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
d and Accumulator (if A = 1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-225
__ev_dotphausiaaw __ev_dotphausiaaw
Vector Dot Product of Half Words, Add, Unsigned, Saturate, Integer and Accumulate into Words
d = __ev_dotphausiaaw (a,b)
// high dot
temph1
0:31
a
0:15
ui
b
0:15
; temph2
0:31
a
16:31
ui
b
16:31
temph
0:63
EXTZ(temph1
0:31
) + EXTZ(temph2
0:31
) + EXTZ(ACC
0:31
)
ovh temph
30
| temph
31
d
0:31
SATURATE(ovh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
ui
b
32:47
; templ2
0:31
a
48:63
ui
b
48:63
templ
0:63
EXTZ(templ1
0:31
) + EXTZ(templ2
0:31
) + EXTZ(ACC
32:63
)
ovl templ
30
| templ
31
d
32:63
SATURATE(ovl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding half word pairs of unsigned integer
elements in parameters a and b are multiplied producing a pair of 32-bit products. This pair of
intermediate 32-bit products is added together with the contents of the corresponding accumulator
word, saturating if overflow occurs, and the sum is placed into the corresponding parameter d and
accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-177. Vector Dot Product of Half Words, Add, Unsigned, Saturate, Integer and
Accumulate Words (__ev_dotphausiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphausiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ +
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-226 Freescale Semiconductor
SPE2 Operations
__ev_dotphausiaaw3 __ev_dotphausiaaw3
Vector Dot Product of Halfwords, Add, Unsigned, Saturate, Integer and Accumulate into Words,
3 operand
d = __ev_dotphausiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
ui
c
0:15
; temph2
0:31
b
16:31
ui
c
16:31
temph
0:63
EXTZ
64
(temph1
0:31
) + EXTZ
64
(temph2
0:31
) + EXTZ
64
(a
0:31
)
ovh temph
30
| temph
31
d
0:31
SATURATE(ovh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
32:47
ui
c
32:47
; templ2
0:31
b
48:63
ui
c
48:63
templ
0:63
EXTZ
64
(templ1
0:31
) + EXTZ
64
(templ2
0:31
) + EXTZ
64
(a
32:63
)
ovl templ
30
| templ
31
d
32:63
SATURATE(ovl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding halfword pairs of unsigned integer
elements in parameters b and c are multiplied producing a pair of 32-bit products. This pair of
intermediate 32-bit products is added together with the contents of the corresponding parameter a
word, saturating if overflow occurs, and the sum is placed into the corresponding parameter d and
accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-178. Vector Dot Product of Halfwords, Add, Unsigned, Saturate, Integer and
Accumulate Words 3 op (__ev_dotphausiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
+ +
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-227
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphausiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-228 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcsmi[a] __ev_dotphihcsmi[a]
Vector Dot Product of High Halfwords, Complex, Signed, Modulo, Integer (to Accumulator)
d = __ev_dotphihcsmi (a,b) (A = 0)
d = __ev_dotphihcsmia (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:31
temph1
0:31
- temph2
0:31
// modulo difference
d
0:31
temph
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
b
0:15
si
a
16:31
templ2
0:31
a
0:15
si
b
16:31
templ
0:31
templ1
0:31
+ templ2
0:31
// modulo sum
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is placed into the high word
of parameter d. For the low word element in the destination, halfword pairs of signed integer
elements from the high halfwords of parameters a and b are multiplied after exchanging the high
halfwords of parameter a, producing a pair of 32-bit products. The sum of this pair of intermediate
32-bit products is placed into the low word of parameter d. If A = 1, the result in parameter d is
also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-179. Vector Dot Product of High Halfwords, Complex, Signed, Modulo, Integer (to
Accumulator) (__ev_dotphihcsmi[a])
15 16 15 16 0 31 0 31
Intermediate
b
x x
a
products
(4 32-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-229
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcsmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcsmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-230 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcsmiaaw __ev_dotphihcsmiaaw
Vector Dot Product of High Halfwords, Complex, Signed, Modulo, Integer and Accumulate into
Words
d = __ev_dotphihcsmiaaw (a,b)
// high dot - calculate real part of complex product
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:31
temph1
0:31
- temph2
0:31
+ ACC
0:31
// modulo sum
d
0:31
temph
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
b
0:15
si
a
16:31
templ2
0:31
a
0:15
si
b
16:31
templ
0:31
templ1
0:31
+ templ2
0:31
+ ACC
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added together with the
contents of the upper accumulator word and the sum is placed into the high word of parameter d.
For the low word element in the destination, halfword pairs of signed integer elements from the
high halfwords of parameters a and b are multiplied after exchanging the high halfwords of
parameter a, producing a pair of 32-bit products. The sum of this pair of intermediate 32-bit
products is added together with the contents of the low accumulator word and the sum is placed
into the low word of parameter d and the accumulator.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-231
Figure 3-180. Vector Dot Product of High Halfwords, Complex, Signed, Modulo, Integer
and Accumulate Words (__ev_dotphihcsmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcsmiaaw d,a,b
+ +
Accumulator
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-232 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcsmiaaw3 __ev_dotphihcsmiaaw3
Vector Dot Product of High Halfwords, Complex, Signed, Modulo, Integer and Accumulate into
Words, 3 operand
d = __ev_dotphihcsmiaaw3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:31
b
0:15
si
c
0:15
temph2
0:31
b
16:31
si
c
16:31
temph
0:31
temph1
0:31
- temph2
0:31
+ a
0:31
// modulo sum
d
0:31
temph
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
c
0:15
si
b
16:31
templ2
0:31
b
0:15
si
c
16:31
templ
0:31
templ1
0:31
+ templ2
0:31
+ a
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the high halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added together with the
contents of the upper parameter a word and the sum is placed into the high word of parameter a.
For the low word element in the destination, halfword pairs of signed integer elements from the
high halfwords of parameters b and c are multiplied after exchanging the high halfwords of
parameter b, producing a pair of 32-bit products. The sum of this pair of intermediate 32-bit
products is added together with the contents of the lower parameter a word and the sum is placed
into the low word of parameter d and the accumulator.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-233
Figure 3-181. Vector Dot Product of High Halfwords, Complex, Signed, Modulo, Integer
and Accumulate Words 3 op (__ev_dotphihcsmiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphihcsmiaaw3 d,b,c
+ +
a
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-234 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssf[a] __ev_dotphihcssf[a]
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Fractional (to Accumulator)
d = __ev_dotphihcssf (a,b) (A = 0)
d = __ev_dotphihcssfa (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
16:31
sf
b
0:15
if (a
16:31
= 0x8000) & (b
0:15
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
0:15
sf
b
16:31
if (a
0:15
= 0x8000) & (b
16:31
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is placed into the high word of parameter d, saturating
if overflow or underflow occurs. For the low word element in the destination, halfword pairs of
signed fractional elements from the high halfwords of parameters a and b are multiplied after
exchanging the high halfwords of parameter a, producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate
32-bit products is placed into the low word of parameter d, saturating if overflow or underflow
occurs. If A = 1, the result in parameter d is also placed into the accumulator
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-235
Figure 3-182. Vector Dot Product of High Halfwords, Complex, Signed, Saturate,
Fractional (to Accumulator) (__ev_dotphihcssf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssfa d,a,b
15 16 15 16 0 31 0 31
Intermediate
b
x x
a
products
(4 32-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-236 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssfaaw __ev_dotphihcssfaaw
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Fractional and Accumulate
into Words
d = __ev_dotphihcssfaaw (a,b)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
16:31
sf
b
0:15
if (a
16:31
= 0x8000) & (b
0:15
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
0:15
sf
b
16:31
if (a
0:15
= 0x8000) & (b
16:31
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of the accumulator, and the
result is placed into the high word of parameter d and the accumulator, saturating if overflow or
underflow occurs. For the low word element in the destination, halfword pairs of signed fractional
elements from the high halfwords of parameters a and b are multiplied after exchanging the high
halfwords of parameter a, producing a pair of 32-bit products. If both inputs of a multiply are -1.0,
the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the
low word of the accumulator is placed into the low word of parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-237
Figure 3-183. Vector Dot Product of High Halfwords, Complex, Signed, Saturate,
Fractional and Accumulate Words (__ev_dotphihcssfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssfaaw d,a,b
+ +
Accumulator
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-238 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssfaaw3 __ev_dotphihcssfaaw3
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Fractional and Accumulate
into Words, 3 operand
d = __ev_dotphihcssfaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sf
c
0:15
if (b
0:15
= 0x8000) & (c
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
16:31
sf
c
16:31
if (b
16:31
= 0x8000) & (c
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
16:31
sf
c
0:15
if (b
16:31
= 0x8000) & (c
0:15
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
0:15
sf
c
16:31
if (b
0:15
= 0x8000) & (c
16:31
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of parameter a, and the result
is placed into the high word of parameter d and the accumulator, saturating if overflow or
underflow occurs. For the low word element in the destination, halfword pairs of signed fractional
elements from the high halfwords of parameters b and c are multiplied after exchanging the high
halfwords of parameter b, producing a pair of 32-bit products. If both inputs of a multiply are -1.0,
the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the
low word of parameter a is placed into the low word of parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-239
Figure 3-184. Vector Dot Product of High Halfwords, Complex, Signed, Saturate,
Fractional and Accumulate Words 3 op (__ev_dotphihcssfaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphihcssfaaw3 d,b,c
+ +
a
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-240 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssfr[a] __ev_dotphihcssfr[a]
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Fractional, Round (to
Accumulator)
d = __ev_dotphihcssfr (a,b) (A = 0)
d = __ev_dotphihcssfra (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
16:31
sf
b
0:15
if (a
16:31
= 0x8000) & (b
0:15
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
0:15
sf
b
16:31
if (a
0:15
= 0x8000) & (b
16:31
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is rounded to 16 bits using the current fractional
rounding mode in SPEFCSR, saturating if overflow or underflow occurs, and placed into the high
word of parameter d. For the low word element in the destination, halfword pairs of signed
fractional elements from the high halfwords of parameters a and b are multiplied after exchanging
the high halfwords of parameter a, producing a pair of 32-bit products. If both inputs of a multiply
are -1.0, the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products
is rounded to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if
overflow or underflow occurs, and placed into the low word of parameter d. If A = 1, the result in
parameter d is also placed into the accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-241
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Figure 3-185. Vector Dot Product of High Halfwords, Complex, Signed, Saturate,
Fractional, Round (to Accumulator) (__ev_dotphihcssfr[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssfra d,a,b
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- w/round
d (and Accumulator if A=1)
+ w/round
Re Img Re Img
Re Img Re Img
Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-242 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssfraaw __ev_dotphihcssfraaw
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Fractional, Round and
Accumulate into Words
d = __ev_dotphihcssfraaw (a,b)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
16:31
sf
b
0:15
if (a
16:31
= 0x8000) & (b
0:15
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
0:15
sf
b
16:31
if (a
0:15
= 0x8000) & (b
16:31
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of the accumulator and
rounded to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if overflow
or underflow occurs, and placed into the high word of parameter d and the accumulator. For the
low word element in the destination, halfword pairs of signed fractional elements from the high
halfwords of parameters a and b are multiplied after exchanging the high halfwords of parameter
a, producing a pair of 32-bit products. If both inputs of a multiply are -1.0, the result saturates to
0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the low word of the
accumulator is rounded to 16 bits using the current fractional rounding mode in SPEFCSR,
saturating if overflow or underflow occurs, and placed into the low word of parameter d and the
accumulator.
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-243
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Figure 3-186. Vector Dot Product of High Halfwords, Complex, Signed, Saturate,
Fractional, Round and Accumulate Words (__ev_dotphihcssfraaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssfraaw d,a,b
+ w/round + w/round
Accumulator
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-244 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssfraaw3 __ev_dotphihcssfraaw3
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Fractional, Round and
Accumulate into Words, 3 operand
d = __ev_dotphihcssfraaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sf
c
0:15
if (b
0:15
= 0x8000) & (c
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
16:31
sf
c
16:31
if (b
16:31
= 0x8000) & (c
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
b
16:31
sf
c
0:15
if (b
16:31
= 0x8000) & (c
0:15
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
0:15
sf
c
16:31
if (b
0:15
= 0x8000) & (c
16:31
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)),16)
ovl chk_ovf(temph
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of parameter a and rounded
to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if overflow or
underflow occurs, and placed into the high word of parameter d and the accumulator. For the low
word element in the destination, halfword pairs of signed fractional elements from the high
halfwords of parameters b and c are multiplied after exchanging the high halfwords of parameter
b, producing a pair of 32-bit products. If both inputs of a multiply are -1.0, the result saturates to
0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the low word of parameter
a is rounded to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if
overflow or underflow occurs, and placed into the low word of parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-245
Other registers altered: SPEFSCR, ACC
Figure 3-187. Vector Dot Product of High Halfwords, Complex, Signed, Saturate,
Fractional, Round and Accumulate Words 3 op (__ev_dotphihcssfraaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphihcssfraaw3 d,b,c
+ w/round + w/round
a
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-246 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssi[a] __ev_dotphihcssi[a]
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Integer (to Accumulator)
d = __ev_dotphihcssi (a,b) (A = 0)
d = __ev_dotphihcssia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
si
b
0:15
; temph2
0:31
a
16:31
si
b
16:31
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
16:31
si
b
0:15
; templ2
0:31
a
0:15
si
b
16:31
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is placed into the high word
of parameter d, saturating if overflow or underflow occurs. For the low word element in the
destination, halfword pairs of signed integer elements from the high halfwords of parameters a and
b are multiplied after exchanging the high halfwords of parameter a, producing a pair of 32-bit
products. The sum of this pair of intermediate 32-bit products is placed into the low word of
parameter d, saturating if overflow or underflow occurs. If A = 1, the result in d is also placed into
the accumulator.
Other registers altered: SPEFSCR, ACC (if A=1)
Figure 3-188. Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Integer
(to Accumulator) (__ev_dotphihcssi[a])
15 16 15 16 0 31 0 31
Intermediate
b
x x
a
products
(4 32-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-247
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-248 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssiaaw __ev_dotphihcssiaaw
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Integer and Accumulate into
Words
d = __ev_dotphihcssiaaw (a,b)
// high dot
temph1
0:31
a
0:15
si
b
0:15
; temph2
0:31
a
16:31
si
b
16:31
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
16:31
si
b
0:15
; templ2
0:31
a
0:15
si
b
16:31
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added to the high word of
the accumulator, and the result is placed into the high word of parameter d and the accumulator,
saturating if overflow or underflow occurs. For the low word element in the destination, halfword
pairs of signed fractional elements from the high halfwords of parameters a and b are multiplied
after exchanging the high halfwords of parameter a, producing a pair of 32-bit products. The sum
of this pair of intermediate 32-bit products and the low word of the accumulator is placed into the
low word of parameter d and the accumulator, saturating if overflow or underflow occurs.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-249
Figure 3-189. Vector Dot Product of Half Words, Subtract, Signed, Saturate, Integer and
Accumulate Words (__ev_dotphihcssiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphihcssiaaw d,a,b
+ +
Accumulator
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-250 Freescale Semiconductor
SPE2 Operations
__ev_dotphihcssiaaw3 __ev_dotphihcssiaaw3
Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Integer and Accumulate into
Words, 3 operand
d = __ev_dotphihcssiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
si
c
0:15
; temph2
0:31
b
16:31
si
c
16:31
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
16:31
si
c
0:15
; templ2
0:31
b
0:15
si
c
16:31
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the high halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added to the high word of
parameter a, and the result is placed into the high word of parameter d and the accumulator,
saturating if overflow or underflow occurs. For the low word element in the destination, halfword
pairs of signed fractional elements from the high halfwords of parameters b and c are multiplied
after exchanging the high halfwords of parameter b, producing a pair of 32-bit products. The sum
of this pair of intermediate 32-bit products and the low word of parameter a is placed into the low
word of parameter d and the accumulator, saturating if overflow or underflow occurs.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-251
Figure 3-190. Vector Dot Product of High Halfwords, Complex, Signed, Saturate, Integer
and Accumulate Words (__ev_dotphihcssiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphihcssiaaw3 d,b,c
+ +
a
d and Accumulator
Re Img
15 16 15 16 0 31 0 31
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-252 Freescale Semiconductor
SPE2 Operations
__ev_dotphssmi[a] __ev_dotphssmi[a]
Vector Dot Product of Halfwords, Subtract, Signed, Modulo, Integer (to Accumulator)
d = __ev_dotphssmi (a,b) (A = 0)
d = __ev_dotphssmia (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:31
temph1
0:31
- temph2
0:31
// modulo difference
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
si
b
32:47
templ2
0:31
a
48:63
si
b
48:63
templ
0:31
templ1
0:31
- templ2
0:31
// modulo difference
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 32-bit products. The intermediate 32-bit
product of the odd halfword is subtracted from the intermediate product of the even halfword with
a modulo difference, and the result is placed into parameter d. If A = 1, the result in parameter d
is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-191. Vector Dot Product of Halfwords, Subtract, Signed, Modulo, Integer (to
Accumulator) (__ev_dotphssmi[a])
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
-
d (and Accumulator if A=1)
-
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-253
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphssmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphssmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-254 Freescale Semiconductor
SPE2 Operations
__ev_dotphssmiaaw __ev_dotphssmiaaw
Vector Dot Product of Halfwords, Subtract, Signed, Modulo, Integer and Accumulate into Words
d = __ev_dotphssmiaaw (a,b)
// high dot
temph1
0:31
a
0:15
si
b
0:15
temph2
0:31
a
16:31
si
b
16:31
temph
0:31
temph1
0:31
- temph2
0:31
+ ACC
0:31
// modulo difference
d
0:31
temph
0:31
//low
templ1
0:31
a
32:47
si
b
32:47
templ2
0:31
a
48:63
si
b
48:63
templ
0:31
templ1
0:31
- templ2
0:31
+ ACC
32:63
// modulo difference
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 32-bit products. The intermediate 32-bit
product of the odd halfword is subtracted from the intermediate product of the even halfword, and
the difference is added together with the contents of the corresponding accumulator word and the
sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-192. Vector Dot Product of Halfwords, Subtract, Signed, Modulo, Integer and
Accumulate Words (__ev_dotphssmiaaw)
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- -
+ +
Accumulator
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-255
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphssmiaaw d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-256 Freescale Semiconductor
SPE2 Operations
__ev_dotphssmiaaw3 __ev_dotphssmiaaw3
Vector Dot Product of Halfwords, Subtract, Signed, Modulo, Integer and Accumulate into Words,
3 operand
d = __ev_dotphssmiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
si
c
0:15
temph2
0:31
b
16:31
si
c
16:31
temph
0:31
temph1
0:31
- temph2
0:31
+ a
0:31
// modulo difference
d
0:31
temph
0:31
//low
templ1
0:31
b
32:47
si
c
32:47
templ2
0:31
b
48:63
si
c
48:63
templ
0:31
templ1
0:31
- templ2
0:31
+ a
32:63
// modulo difference
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameters b and c are multiplied producing a pair of 32-bit products. The intermediate 32-bit
product of the odd halfword is subtracted from the intermediate product of the even halfword, and
the difference is added together with the contents of the corresponding parameter a word and the
sum is placed into the corresponding parameter d and accumulator word.
Other registers altered: ACC
Figure 3-193. Vector Dot Product of Halfwords, Subtract, Signed, Modulo, Integer and
Accumulate Words 3 op (__ev_dotphssmiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- -
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-257
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphssmiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-258 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssf[a] __ev_dotphsssf[a]
Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional (to Accumulator)
d = __ev_dotphsssf (a,b) (A = 0)
d = __ev_dotphsssfa (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The intermediate 32-bit product of the
odd halfword is subtracted from the intermediate product of the even halfword, saturating if
overflow or underflow occurs, and the difference is placed into the corresponding parameter d
word. If A = 1, the result in parameter d is also placed into the accumulator
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-259
Figure 3-194. Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional (to
Accumulator) (__ev_dotphsssf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphsssf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphsssfa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- -
d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-260 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssfaaw __ev_dotphsssfaaw
Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional and Accumulate into
Words
d = __ev_dotphsssfaaw (a,b)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The intermediate 32-bit product of the
odd halfword is subtracted from the intermediate product of the even halfword, and the difference
is added together with the contents of the corresponding accumulator word, saturating if overflow
or underflow occurs, and the sum is placed into the corresponding parameter d and accumulator
word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-261
Figure 3-195. Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional and
Accumulate Words (__ev_dotphsssfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphsssfaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- -
+ +
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-262 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssfaaw3 __ev_dotphsssfaaw3
Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional and Accumulate into
Words, 3 operand
d = __ev_dotphsssfaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sf
c
0:15
if (b
0:15
= 0x8000) & (c
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
16:31
sf
c
16:31
if (b
16:31
= 0x8000) & (c
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
32:47
sf
c
32:47
if (b
32:47
= 0x8000) & (c
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
48:63
sf
c
48:63
if (b
48:63
= 0x8000) & (c
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters b and c are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The intermediate 32-bit product of the
odd halfword is subtracted from the intermediate product of the even halfword, and the difference
is added together with the contents of the corresponding parameter a word, saturating if overflow
or underflow occurs, and the sum is placed into the corresponding parameter d and accumulator
word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-263
Figure 3-196. Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional and
Accumulate Words 3 op (__ev_dotphsssfaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphsssfaaw3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- -
+ +
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-264 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssfr[a] __ev_dotphsssfr[a]
Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional, Round (to Accumulator)
d = __ev_dotphsssfr (a,b) (A = 0)
d = __ev_dotphsssfra (a,b) (A = 1)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The intermediate 32-bit product of the
odd halfword is subtracted from the intermediate product of the even halfword, and the difference
is and rounded to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if
overflow or underflow occurs, and placed into the corresponding parameter d word. If A = 1, the
result in parameter d is also placed into the accumulator
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-265
Figure 3-197. Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional,
Round (to Accumulator) (__ev_dotphsssfr[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphsssfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphsssfra d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- w/round - w/round
d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-266 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssfraaw __ev_dotphsssfraaw
Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional, Round and Accumulate
into Words
d = __ev_dotphsssfraaw (a,b)
// high dot
temph1
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The intermediate 32-bit product of the
odd halfword is subtracted from the intermediate product of the even halfword, and the difference
is added together with the contents of the corresponding accumulator word and rounded to 16 bits
using the current fractional rounding mode in SPEFCSR, saturating if overflow or underflow
occurs, and placed into the corresponding parameter d and accumulator word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-267
Figure 3-198. Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional,
Round and Accumulate Words (__ev_dotphsssfraaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphsssfraaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- -
+ w/round + w/round
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-268 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssfraaw3 __ev_dotphsssfraaw3
Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional, Round and Accumulate
into Words, 3 operand
d = __ev_dotphsssfraaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
sf
c
0:15
if (b
0:15
= 0x8000) & (c
0:15
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
16:31
sf
c
16:31
if (b
16:31
= 0x8000) & (c
16:31
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
b
32:47
sf
c
32:47
if (b
32:47
= 0x8000) & (c
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
48:63
sf
c
48:63
if (b
48:63
= 0x8000) & (c
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For each word element in the destination, corresponding halfword pairs of signed fractional
elements in parameters b and c are multiplied producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The intermediate 32-bit product of the
odd halfword is subtracted from the intermediate product of the even halfword, and the difference
is added together with the contents of the corresponding parameter a word and rounded to 16 bits
using the current fractional rounding mode in SPEFCSR, saturating if overflow or underflow
occurs, and placed into the corresponding parameter d and accumulator word.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-269
Figure 3-199. Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Fractional,
Round and Accumulate Words 3 op (__ev_dotphsssfraaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphsssfraaw3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- -
+ w/round + w/round
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-270 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssi[a] __ev_dotphsssi[a]
Vector Dot Product of Half Words, Subtract, Signed, Saturate, Integer (to Accumulator)
Maps to evotphssmi[a] since no overflow can occur
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphssmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphssmia d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-271
__ev_dotphsssiaaw __ev_dotphsssiaaw
Vector Dot Product of Half Words, Subtract, Signed, Saturate, Integer and Accumulate into
Words
d = __ev_dotphsssiaaw (a,b)
// high dot
temph1
0:31
a
0:15
si
b
0:15
; temph2
0:31
a
16:31
si
b
16:31
temph
0:63
EXTS(temph1
0:31
) - EXTS(temph2
0:31
) + EXTS(ACC
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0xFFFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
32:47
si
b
32:47
; templ2
0:31
a
48:63
si
b
48:63
templ
0:63
EXTS(templ1
0:31
) - EXTS(templ2
0:31
) + EXTS(ACC
32:63
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0xFFFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl;
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl;
For each word element in the destination, corresponding half word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 32-bit products. The intermediate 32-bit
product of the odd half word is subtracted from the intermediate product of the even half word, and
the difference is added together with the contents of the corresponding accumulator word,
saturating if overflow or underflow occurs, and the sum is placed into the corresponding parameter
d and accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-200. Vector Dot Product of Half Words, Subtract, Signed, Saturate, Integer and
Accumulate Words (__ev_dotphsssiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotphsssiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- -
+ +
d and Accumulator
Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-272 Freescale Semiconductor
SPE2 Operations
__ev_dotphsssiaaw3 __ev_dotphsssiaaw3
Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Integer and Accumulate into
Words, 3 operand
d = __ev_dotphsssiaaw3 (a,b,c)
// high dot
temph1
0:31
b
0:15
si
c
0:15
; temph2
0:31
b
16:31
si
c
16:31
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
32:47
si
c
32:47
; templ2
0:31
b
48:63
si
c
48:63
templ
0:63
EXTS
64
(templ1
0:31
) - EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the destination, corresponding halfword pairs of signed integer elements
in parameters b and c are multiplied producing a pair of 32-bit products. The intermediate 32-bit
product of the odd halfword is subtracted from the intermediate product of the even halfword, and
the difference is added together with the contents of the corresponding parameter a word,
saturating if overflow or underflow occurs, and the sum is placed into the corresponding parameter
d and accumulator word.
Other registers altered: SPEFSCR, ACC
Figure 3-201. Vector Dot Product of Halfwords, Subtract, Signed, Saturate, Integer and
Accumulate Words 3 op (__ev_dotphsssiaaw3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- -
+ +
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-273
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotphsssiaaw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-274 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcsmi[a] __ev_dotplohcsmi[a]
Vector Dot Product of Low Halfwords, Complex, Signed, Modulo, Integer (to Accumulator)
d = __ev_dotplohcsmi (a,b) (A = 0)
d = __ev_dotplohcsmia (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:31
a
32:47
si
b
32:47
temph2
0:31
a
48:63
si
b
48:63
temph
0:31
temph1
0:31
- temph2
0:31
// modulo difference
d
0:31
temph
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
b
32:47
si
a
48:63
templ2
0:31
a
32:47
si
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
// modulo sum
d
32:63
templ
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is placed into the high word
of parameter d. For the low word element in the destination, halfword pairs of signed integer
elements from the low halfwords of parameters a and b are multiplied after exchanging the low
halfwords of parameter a, producing a pair of 32-bit products. The sum of this pair of intermediate
32-bit products is placed into the low word of parameter d. If A = 1, the result in parameter d is
also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-202. Vector Dot Product of Low Halfwords, Complex, Signed, Modulo, Integer (to
Accumulator) (__ev_dotplohcsmi[a])
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-275
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcsmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcsmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-276 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcsmiaaw __ev_dotplohcsmiaaw
Vector Dot Product of Low Halfwords, Complex, Signed, Modulo, Integer and Accumulate into
Words
d = __ev_dotplohcsmiaaw (a,b)
// high dot - calculate real part of complex product
temph1
0:31
a
32:47
si
b
32:47
temph2
0:31
a
48:63
si
b
48:63
temph
0:31
temph1
0:31
- temph2
0:31
+ ACC
0:31
// modulo sum
d
0:31
temph
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
b
32:47
si
a
48:63
templ2
0:31
a
32:47
si
b
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ ACC
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added together with the
contents of the upper accumulator word and the sum is placed into the high word of parameter d.
For the low word element in the destination, halfword pairs of signed integer elements from the
high halfwords of parameters a and b are multiplied after exchanging the high halfwords of
parameter a, producing a pair of 32-bit products. The sum of this pair of intermediate 32-bit
products is added together with the contents of the low accumulator word and the sum is placed
into the low word of parameter d and the accumulator.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-277
Figure 3-203. Vector Dot Product of Low Halfwords, Complex, Signed, Modulo, Integer and
Accumulate Words (__ev_dotplohcsmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcsmiaaw d,a,b
+ +
Accumulator
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-278 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcsmiaaw3 __ev_dotplohcsmiaaw3
Vector Dot Product of Low Halfwords, Complex, Signed, Modulo, Integer and Accumulate into
Words, 3 operand
d = __ev_dotplohcsmiaaw3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:31
b
32:47
si
c
32:47
temph2
0:31
b
48:63
si
c
48:63
temph
0:31
temph1
0:31
- temph2
0:31
+ a
0:31
// modulo sum
d
0:31
temph
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
c
32:47
si
b
48:63
templ2
0:31
b
32:47
si
c
48:63
templ
0:31
templ1
0:31
+ templ2
0:31
+ a
32:63
// modulo sum
d
32:63
templ
0:31
// update accumulator
ACC
0:63
d
0:63
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the low halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added together with the
contents of the upper parameter a word and the sum is placed into the high word of parameter d.
For the low word element in the destination, halfword pairs of signed integer elements from the
low halfwords of parameters b and c are multiplied after exchanging the low halfwords of
parameter b, producing a pair of 32-bit products. The sum of this pair of intermediate 32-bit
products is added together with the contents of the lower parameter a word and the sum is placed
into the low word of parameter d and the accumulator.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-279
Figure 3-204. Vector Dot Product of Low Halfwords, Complex, Signed, Modulo, Integer and
Accumulate Words 3 op (__ev_dotplohcsmiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotplohcsmiaaw3 d,b,c
+ +
a
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-280 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssf[a] __ev_dotplohcssf[a]
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional (to Accumulator)
d = __ev_dotplohcssf (a,b) (A = 0)
d = __ev_dotplohcssfa (a,b) (A = 1)
// high dot
temph1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
48:63
sf
b
32:47
if (a
48:63
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
32:47
sf
b
48:63
if (a
32:47
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is placed into the high word of parameter d, saturating
if overflow or underflow occurs. For the low word element in the destination, halfword pairs of
signed fractional elements from the low halfwords of parameters a and b are multiplied after
exchanging the low halfwords of parameter a, producing a pair of 32-bit products. If both inputs
of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate
32-bit products is placed into the low word of parameter d, saturating if overflow or underflow
occurs. If A = 1, the result in parameter d is also placed into the accumulator
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-281
Figure 3-205. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional
(to Accumulator) (__ev_dotplohcssf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssfa d,a,b
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-282 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssfaaw __ev_dotplohcssfaaw
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional and Accumulate
into Words
d = __ev_dotplohcssfaaw (a,b)
// high dot
temph1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
48:63
sf
b
32:47
if (a
48:63
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
32:47
sf
b
48:63
if (a
32:47
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of the accumulator, and the
result is placed into the high word of parameter d and the accumulator, saturating if overflow or
underflow occurs. For the low word element in the destination, halfword pairs of signed fractional
elements from the low halfwords of parameters a and b are multiplied after exchanging the low
halfwords of parameter a, producing a pair of 32-bit products. If both inputs of a multiply are -1.0,
the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the
low word of the accumulator is placed into the low word of parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-283
Figure 3-206. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional
and Accumulate Words (__ev_dotplohcssfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssfaaw d,a,b
+ +
Accumulator
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-284 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssfaaw3 __ev_dotplohcssfaaw3
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional and Accumulate
into Words, 3 operand
d = __ev_dotplohcssfaaw3 (a,b,c)
// high dot
temph1
0:31
b
32:47
sf
c
32:47
if (b
32:47
= 0x8000) & (c
32:47
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
48:63
sf
c
48:63
if (b
48:63
= 0x8000) & (c
48:63
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
48:63
sf
c
32:47
if (b
48:63
= 0x8000) & (c
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
32:47
sf
c
48:63
if (b
32:47
= 0x8000) & (c
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of parameter a, and the result
is placed into the high word of parameter d and the accumulator, saturating if overflow or
underflow occurs. For the low word element in the destination, halfword pairs of signed fractional
elements from the low halfwords of parameters b and c are multiplied after exchanging the low
halfwords of parameter b, producing a pair of 32-bit products. If both inputs of a multiply are -1.0,
the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the
low word of parameter a is placed into the low word of parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-285
Figure 3-207. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional
and Accumulate Words 3 op (__ev_dotplohcssfaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotplohcssfaaw3 d,b,c
+ +
a
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-286 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssfr[a] __ev_dotplohcssfr[a]
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional, Round (to
Accumulator)
d = __ev_dotplohcssfr (a,b) (A = 0)
d = __ev_dotplohcssfra (a,b) (A = 1)
// high dot
temph1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
48:63
sf
b
32:47
if (a
48:63
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
32:47
sf
b
48:63
if (a
32:47
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is rounded to 16 bits using the current fractional
rounding mode in SPEFCSR, saturating if overflow or underflow occurs, and placed into the high
word of parameter d. For the low word element in the destination, halfword pairs of signed
fractional elements from the low halfwords of parameters a and b are multiplied after exchanging
the low halfwords of parameter a, producing a pair of 32-bit products. If both inputs of a multiply
are -1.0, the result saturates to 0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products
is rounded to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if
overflow or underflow occurs, and placed into the low word of parameter d. If A = 1, the result in
parameter d is also placed into the accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-287
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC (if A=1)
Figure 3-208. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate,
Fractional, Round (to Accumulator) (__ev_dotplohcssfr[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssfra d,a,b
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- w/round
d (and Accumulator if A=1)
+ w/round
Re Img Re Img
Re Img Re Img
Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-288 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssfraaw __ev_dotplohcssfraaw
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional, Round and
Accumulate into Words
d = __ev_dotplohcssfraaw (a,b)
// high dot
temph1
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
a
48:63
sf
b
32:47
if (a
48:63
= 0x8000) & (b
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
a
32:47
sf
b
48:63
if (a
32:47
= 0x8000) & (b
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)),16)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of the accumulator and
rounded to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if overflow
or underflow occurs, and placed into the high word of parameter d and the accumulator. For the
low word element in the destination, halfword pairs of signed fractional elements from the low
halfwords of parameters a and b are multiplied after exchanging the low halfwords of parameter
a, producing a pair of 32-bit products. If both inputs of a multiply are -1.0, the result saturates to
0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the low word of the
accumulator is rounded to 16 bits using the current fractional rounding mode in SPEFCSR,
saturating if overflow or underflow occurs, and placed into the low word of parameter d and the
accumulator.
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-289
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Other registers altered: SPEFSCR, ACC
Figure 3-209. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate,
Fractional, Round and Accumulate Words (__ev_dotplohcssfraaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssfraaw d,a,b
+ w/round + w/round
Accumulator
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-290 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssfraaw3 __ev_dotplohcssfraaw3
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Fractional, Round and
Accumulate into Words, 3 operand
d = __ev_dotplohcssfraaw3 (a,b,c)
// high dot
temph1
0:31
b
32:47
sf
c
32:47
if (b
32:47
= 0x8000) & (c
32:47
= 0x8000) then
temph1
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:31
b
48:63
sf
c
48:63
if (b
48:63
= 0x8000) & (c
48:63
= 0x8000) then
temph2
0:31
0x7FFF_FFFF //saturate
movh 1
endif
temph
0:63
ROUND((EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)),16)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_0000, temph
32:63
)
//low
templ1
0:31
b
48:63
sf
c
32:47
if (b
48:63
= 0x8000) & (c
32:47
= 0x8000) then
templ1
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:31
b
32:47
sf
c
48:63
if (b
32:47
= 0x8000) & (c
48:63
= 0x8000) then
templ2
0:31
0x7FFF_FFFF //saturate
movl 1
endif
templ
0:63
ROUND((EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)),16)
ovl chk_ovf(temph
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_0000, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF_FFFF. The difference
of this pair of intermediate 32-bit products is added to the high word of parameter a and rounded
to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if overflow or
underflow occurs, and placed into the high word of parameter d and the accumulator. For the low
word element in the destination, halfword pairs of signed fractional elements from the low
halfwords of parameters b and c are multiplied after exchanging the low halfwords of parameter
a, producing a pair of 32-bit products. If both inputs of a multiply are -1.0, the result saturates to
0x7FFF_FFFF. The sum of this pair of intermediate 32-bit products and the low word of parameter
a is rounded to 16 bits using the current fractional rounding mode in SPEFCSR, saturating if
overflow or underflow occurs, and placed into the low word of parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from either the multiply or the accumulation.
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-291
Other registers altered: SPEFSCR, ACC
Figure 3-210. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate,
Fractional, Round and Accumulate Words 3 op (__ev_dotplohcssfraaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotplohcssfraaw3 d,b,c
+ w/round + w/round
a
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-292 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssi[a] __ev_dotplohcssi[a]
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Integer (to Accumulator)
d = __ev_dotplohcssi (a,b) (A = 0)
d = __ev_dotplohcssia (a,b) (A = 1)
// high dot
temph1
0:31
a
32:47
si
b
32:47
; temph2
0:31
a
48:63
si
b
48:63
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
)
ovh temph
31
temph
32
d
0:31
SATURATE(ovh, temph
31
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
48:63
si
b
32:47
; templ2
0:31
a
32:47
si
b
48:63
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
)
ovl templ
31
templ
32
d
32:63
SATURATE(ovl, templ
31
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding halfword pairs of signed integer
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is placed into the high word
of parameter d, saturating if overflow or underflow occurs. For the low word element in the
destination, halfword pairs of signed integer elements from the low halfwords of parameters a and
b are multiplied after exchanging the low halfwords of parameter a, producing a pair of 32-bit
products. The sum of this pair of intermediate 32-bit products is placed into the low word of
parameter d, saturating if overflow or underflow occurs. If A = 1, the result in parameter d is also
placed into the accumulator.
Other registers altered: SPEFSCR, ACC (if A=1)
Figure 3-211. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Integer (to
Accumulator) (__ev_dotplohcssi[a])
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-293
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-294 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssiaaw __ev_dotplohcssiaaw
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Integer and Accumulate into
Words
d = __ev_dotplohcssiaaw (a,b)
// high dot
temph1
0:31
a
32:47
si
b
32:47
; temph2
0:31
a
48:63
si
b
48:63
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(ACC
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
a
48:63
si
b
32:47
; templ2
0:31
a
32:47
si
b
48:63
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(ACC
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters a and b are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added to the high word of
the accumulator, and the result is placed into the high word of parameter d and the accumulator,
saturating if overflow or underflow occurs. For the low word element in the destination, halfword
pairs of signed fractional elements from the low halfwords of parameters a and b are multiplied
after exchanging the low halfwords of parameter a, producing a pair of 32-bit products. The sum
of this pair of intermediate 32-bit products and the low word of the accumulator is placed into the
low word of parameter d and the accumulator, saturating if overflow or underflow occurs.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-295
Figure 3-212. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Integer
and Accumulate Words (__ev_dotplohcssiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotplohcssiaaw d,a,b
+ +
Accumulator
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-296 Freescale Semiconductor
SPE2 Operations
__ev_dotplohcssiaaw3 __ev_dotplohcssiaaw3
Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Integer and Accumulate into
Words, 3 operand
d = __ev_dotplohcssiaaw3 (a,b,c)
// high dot
temph1
0:31
b
32:47
si
c
32:47
; temph2
0:31
b
48:63
si
c
48:63
temph
0:63
EXTS
64
(temph1
0:31
) - EXTS
64
(temph2
0:31
) + EXTS
64
(a
0:31
)
ovh chk_ovf(temph
30:32
)
d
0:31
SATURATE(ovh, temph
30
, 0x8000_0000, 0x7FFF_FFFF, temph
32:63
)
//low
templ1
0:31
b
48:63
si
c
32:47
; templ2
0:31
b
32:47
si
c
48:63
templ
0:63
EXTS
64
(templ1
0:31
) + EXTS
64
(templ2
0:31
) + EXTS
64
(a
32:63
)
ovl chk_ovf(templ
30:32
)
d
32:63
SATURATE(ovl, templ
30
, 0x8000_0000, 0x7FFF_FFFF, templ
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding halfword pairs of signed fractional
elements in the low halfwords of parameters b and c are multiplied producing a pair of 32-bit
products. The difference of this pair of intermediate 32-bit products is added to the high word of
parameter a, and the result is placed into the high word of parameter d and the accumulator,
saturating if overflow or underflow occurs. For the low word element in the destination, halfword
pairs of signed fractional elements from the low halfwords of parameters b and c are multiplied
after exchanging the low halfwords of parameter b, producing a pair of 32-bit products. The sum
of this pair of intermediate 32-bit products and the low word of parameter a is placed into the low
word of parameter d and the accumulator, saturating if overflow or underflow occurs.
Other registers altered: SPEFSCR, ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-297
Figure 3-213. Vector Dot Product of Low Halfwords, Complex, Signed, Saturate, Integer
and Accumulate Words (__ev_dotplohcssiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotplohcssiaaw3 d,b,c
+ +
a
d and Accumulator
Re Img
47 48 47 48 32 63 32 63
Intermediate
c
x x
b
products
(4 32-bit)
x x
- +
Re Img Re Img
Re Img Re Img
SPE2 Programming Interface Manual, Rev. 1.0-2
3-298 Freescale Semiconductor
SPE2 Operations
__ev_dotpwasmi[a] __ev_dotpwasmi[a]
Vector Dot Product of Words, Add, Signed, Modulo, integer (to Accumulator)
d = __ev_dotpwasmi (a,b) (A = 0)
d = __ev_dotpwasmia (a,b) (A = 1)
temph
0:63
a
0:31
si
b
0:31
templ
0:63
a
32:63
si
b
32:63
temp
0:31
temph
0:63
+ templ
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate products is added together and the
sum is placed into parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-214. Vector Dot Product of Words, Add, Signed, Modulo, Integer (to Accumulator)
(__ev_dotpwasmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasmia d,a,b
0 31 32 63
Intermediate
b
x
a
products
(2 64-bit)
x
+
d (and Accumulator if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-299
__ev_dotpwasmiaa __ev_dotpwasmiaa
Vector Dot Product of Words, Add, Signed, Modulo, Integer and Accumulate
d = __ev_dotpwasmiaa (a,b)
temph
0:63
a
0:31
si
b
0:31
templ
0:63
a
32:63
si
b
32:63
temp
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate products is added together with the
contents of the accumulator and the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-215. Vector Dot Product of Words, Add, Signed, Modulo, Integer and Accumulate
(__ev_dotpwasmiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasmiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(2 64-bit)
+
+
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-300 Freescale Semiconductor
SPE2 Operations
__ev_dotpwasmiaa3 __ev_dotpwasmiaa3
Vector Dot Product of Words, Add, Signed, Modulo, Integer and Accumulate, 3 operand
d = __ev_dotpwasmiaa3 (a,b,c)
temph
0:63
b
0:31
si
c
0:31
templ
0:63
b
32:63
si
c
32:63
temp
0:63
temph
0:63
+ templ
0:63
+ a
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameters b and c are multiplied
producing a pair of 64-bit products. This pair of intermediate products is added together with the
contents of parameter a and the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-216. Vector Dot Product of Words, Add, Signed, Modulo, Integer and Accumulate
3 op (__ev_dotpwasmiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwasmiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(2 64-bit)
+
+
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-301
__ev_dotpwassi[a] __ev_dotpwassi[a]
Vector Dot Product of Words, Add, Signed, Saturate, integer (to Accumulator)
d = __ev_dotpwassi (a,b) (A = 0)
d = __ev_dotpwassia (a,b) (A = 1)
temph
0:63
a
0:31
si
b
0:31
templ
0:63
a
32:63
si
b
32:63
temp
0:64
EXTS(temph
0:63
) + EXTS(templ
0:63
)
ov (temp
0
temp
1
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
1:64
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of signed integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate 64-bit products is added together,
saturating if overflow or underflow occurs, and the sum is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC (if A=1)
Figure 3-217. Vector Dot Product of Words, Add, Signed, Saturate, Integer [to
Accumulator] (__ev_dotpwassi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwassi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwassia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(2 64-bit)
+
d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-302 Freescale Semiconductor
SPE2 Operations
__ev_dotpwassiaa __ev_dotpwassiaa
Vector Dot Product of Words, Add, Signed, Saturate, Integer and Accumulate
d = __ev_dotpwassiaa (a,b)
temph
0:63
a
0:31
si
b
0:31
templ
0:63
a
32:63
si
b
32:63
temp
0:64
EXTS(ACC
0:63
) + EXTS(temph
0:63
) + EXTS(templ
0:63
)
ov (temp
0
temp
1
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
1:64
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
mov | ov
SPEFSCR
SOV
SPEFSCR
SOV
| mov | ov
Corresponding pairs of signed integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate 64-bit products is added together
with the contents of the accumulator, saturating if overflow or underflow occurs in the final sum,
and the sum is placed into parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-218. Vector Dot Product of Words, Add, Signed, Saturate, Integer and
Accumulate (__ev_dotpwassiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwassiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(2 64-bit)
+
+
Accumulator
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-303
__ev_dotpwassiaa3 __ev_dotpwassiaa3
Vector Dot Product of Words, Add, Signed, Saturate, Integer and Accumulate, 3 operand
d = __ev_dotpwassiaa3 (a,b,c)
temph
0:63
b
0:31
si
c
0:31
templ
0:63
b
32:63
si
c
32:63
temp
0:66
EXTS
67
(a
0:63
) + EXTS
67
(temph
0:63
) + EXTS
67
(templ
0:63
)
ov chk_ovf(temp
0:3
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
3:66
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
mov | ov
SPEFSCR
SOV
SPEFSCR
SOV
| mov | ov
Corresponding pairs of signed integer word elements in parameters b and c are multiplied
producing a pair of 64-bit products. This pair of intermediate 64-bit products is added together
with the contents of parameter a, saturating if overflow or underflow occurs in the final sum, and
the sum is placed into parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-219. Vector Dot Product of Words, Add, Signed, Saturate, Integer and
Accumulate 3 op (__ev_dotpwassiaa3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(2 64-bit)
+
+
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-304 Freescale Semiconductor
SPE2 Operations
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwassiaa3 d,b,c
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-305
__ev_dotpwasumi[a] __ev_dotpwasumi[a]
Vector Dot Product of Words, Add, Signed by Unsigned, Modulo, integer (to Accumulator)
d = __ev_dotpwasumi (a,b) (A = 0)
d = __ev_dotpwasumia (a,b) (A = 1)
temph
0:63
a
0:31
sui
b
0:31
templ
0:63
a
32:63
sui
b
32:63
temp
0:63
temph
0:63
+ templ
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameter a and unsigned integer word
elements in parameter b are multiplied producing a pair of 64-bit products. This pair of
intermediate products is added together and the sum is placed into parameter d. If A = 1, the result
in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-220. Vector Dot Product of Words, Add, Signed by Unsigned, Modulo, Integer (to
Accumulator) (__ev_dotpwasumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasumia d,a,b
0 31 32 63
Intermediate
b (unsigned)
x
a (signed)
products
(2 64-bit)
x
+
d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-306 Freescale Semiconductor
SPE2 Operations
__ev_dotpwasumiaa __ev_dotpwasumiaa
Vector Dot Product of Words, Add, Signed by Unsigned, Modulo, Integer and Accumulate
d = __ev_dotpwasumiaa (a,b)
temph
0:63
a
0:31
sui
b
0:31
templ
0:63
a
32:63
sui
b
32:63
temp
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameter a and unsigned integer word
elements in parameter b are multiplied producing a pair of 64-bit products. This pair of
intermediate products is added together with the contents of the accumulator and the sum is placed
into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-221. Vector Dot Product of Words, Add, Signed by Unsigned, Modulo, Integer and
Accumulate (__ev_dotpwasumiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasumiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(2 64-bit)
+
+
Accumulator
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-307
__ev_dotpwasumiaa3 __ev_dotpwasumiaa3
Vector Dot Product of Words, Add, Signed by Unsigned, Modulo, Integer and Accumulate, 3
operand
d = __ev_dotpwasumiaa3 (a,b,c)
temph
0:63
b
0:31
sui
c
0:31
templ
0:63
b
32:63
sui
c
32:63
temp
0:63
temph
0:63
+ templ
0:63
+ a
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameter b and unsigned integer word
elements in parameter c are multiplied producing a pair of 64-bit products. This pair of
intermediate products is added together with the contents of parameter a and the sum is placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-222. Vector Dot Product of Words, Add, Signed by Unsigned, Modulo, Integer and
Accumulate 3 op (__ev_dotpwasumiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwasumiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c (unsigned)
x x
b (signed)
products
(2 64-bit)
+
+
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-308 Freescale Semiconductor
SPE2 Operations
__ev_dotpwasusi[a] __ev_dotpwasusi[a]
Vector Dot Product of Words, Add, Signed by Unsigned, Saturate, integer (to Accumulator)
d = __ev_dotpwasusi (a,b) (A = 0)
d = __ev_dotpwasusia (a,b) (A = 1)
temph
0:63
a
0:31
sui
b
0:31
templ
0:63
a
32:63
sui
b
32:63
temp
0:64
EXTS(temph
0:63
) + EXTS(templ
0:63
)
ov (temp
0
temp
1
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
1:64
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of signed integer word elements in parameter a and unsigned integer word
elements in parameter b are multiplied producing a pair of 64-bit products. This pair of
intermediate 64-bit products is added together, saturating if overflow or underflow occurs, and the
sum is placed into parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC (if A=1)
Figure 3-223. Vector Dot Product of Words, Add, Signed by Unsigned, Saturate, Integer [to
Accumulator] (__ev_dotpwasusi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasusi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasusia d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(2 64-bit)
+
d (and Accumulator if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-309
__ev_dotpwasusiaa __ev_dotpwasusiaa
Vector Dot Product of Words, Add, Signed by Unsigned, Saturate, Integer and Accumulate
d = __ev_dotpwasusiaa (a,b)
temph
0:63
a
0:31
sui
b
0:31
templ
0:63
a
32:63
sui
b
32:63
temp
0:64
EXTS(ACC
0:63
) + EXTS(temph
0:63
) + EXTS(templ
0:63
)
ov (temp
0
temp
1
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of signed integer word elements in parameter a and unsigned integer word
elements in parameter b are multiplied producing a pair of 64-bit products. This pair of
intermediate 64-bit products is added together with the contents of the accumulator, saturating if
overflow or underflow occurs in the final sum, and the sum is placed into parameter d and the
accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-224. Vector Dot Product of Words, Add, Signed by Unsigned, Saturate, Integer
and Accumulate (__ev_dotpwasusiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwasusiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x x
a (signed)
products
(2 64-bit)
+
+
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-310 Freescale Semiconductor
SPE2 Operations
__ev_dotpwasusiaa3 __ev_dotpwasusiaa3
Vector Dot Product of Words, Add, Signed by Unsigned, Saturate, Integer and Accumulate, 3
operand
d = __ev_dotpwasusiaa3 (a,b,c)
temph
0:63
b
0:31
sui
c
0:31
templ
0:63
b
32:63
sui
c
32:63
temp
0:66
EXTS
67
(a
0:63
) + EXTS
67
(temph
0:63
) + EXTS
67
(templ
0:63
)
ov chk_ovf(temp
0:3
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
3:66
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of signed integer word elements in parameter b and unsigned integer word
elements in parameter c are multiplied producing a pair of 64-bit products. This pair of
intermediate 64-bit products is added together with the contents of parameter a, saturating if
overflow or underflow occurs in the final sum, and the sum is placed into parameter d and the
accumulator. The overflow and summary overflow bits are recorded in the SPEFSCR based on an
overflow or underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-225. Vector Dot Product of Words, Add, Signed by Unsigned, Saturate, Integer
and Accumulate 3 op (__ev_dotpwasusiaa3)
15 16 47 48 0 31 32 63
Intermediate
c (unsigned)
x x
b (signed)
products
(2 64-bit)
+
+
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-311
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwasusiaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-312 Freescale Semiconductor
SPE2 Operations
__ev_dotpwaumi[a] __ev_dotpwaumi[a]
Vector Dot Product of Words, Add, Unsigned, Modulo, integer (to Accumulator)
d = __ev_dotpwaumi (a,b) (A = 0)
d = __ev_dotpwaumia (a,b) (A = 1)
temph
0:63
a
0:31
ui
b
0:31
templ
0:63
a
32:63
ui
b
32:63
temp
0:31
temph
0:63
+ templ
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding pairs of unsigned integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate products is added together and the
sum is placed into parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-226. Vector Dot Product of Words, Add, Unsigned, Modulo, Integer (to
Accumulator) (__ev_dotpwaumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwaumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwaumia d,a,b
0 31 32 63
Intermediate
b
x
a
products
(2 64-bit)
x
+
d (and Accumulator if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-313
__ev_dotpwaumiaa __ev_dotpwaumiaa
Vector Dot Product of Words, Add, Unsigned, Modulo, Integer and Accumulate
d = __ev_dotpwaumiaa (a,b)
temph
0:63
a
0:31
ui
b
0:31
templ
0:63
a
32:63
ui
b
32:63
temp
0:63
temph
0:63
+ templ
0:63
+ ACC
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of unsigned integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate products is added together with the
contents of the accumulator and the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-227. Vector Dot Product of Words, Add, Unsigned, Modulo, Integer and
Accumulate (__ev_dotpwaumiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwaumiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(2 64-bit)
+
+
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-314 Freescale Semiconductor
SPE2 Operations
__ev_dotpwaumiaa3 __ev_dotpwaumiaa3
Vector Dot Product of Words, Add, Unsigned, Modulo, Integer and Accumulate, 3 operand
d = __ev_dotpwaumiaa3 (a,b,c)
temph
0:63
b
0:31
ui
c
0:31
templ
0:63
b
32:63
ui
c
32:63
temp
0:63
temph
0:63
+ templ
0:63
+ a
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of unsigned integer word elements in parameters b and c are multiplied
producing a pair of 64-bit products. This pair of intermediate products is added together with the
contents of parameter a and the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-228. Vector Dot Product of Words, Add, Unsigned, Modulo, Integer and
Accumulate 3 op (__ev_dotpwaumiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwaumiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(2 64-bit)
+
+
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-315
__ev_dotpwausi[a] __ev_dotpwausi[a]
Vector Dot Product of Words, Add, Unsigned, Saturate, integer (to Accumulator)
d = __ev_dotpwausi (a,b) (A = 0)
d = __ev_dotpwausia (a,b) (A = 1)
temph
0:63
a
0:31
ui
b
0:31
templ
0:63
a
32:63
ui
b
32:63
temp
0:64
EXTZ(temph
0:63
) + EXTZ(templ
0:63
)
ov temp
0
d
0:63
SATURATE(ov, 0, 0xFFFF_FFFF_FFFF_FFFF, 0xFFFF_FFFF_FFFF_FFFF, temp
1:64
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of unsigned integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate 64-bit products is added together
with the contents of the accumulator, saturating if overflow occurs, and the sum is placed into
parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow from
the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-229. Vector Dot Product of Words, Add, Unsigned, Saturate, Integer (to
Accumulator) (__ev_dotpwausi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwausi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwausia d,a,b
0 31 32 63
Intermediate
b
x
a
products
(2 64-bit)
x
+
d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-316 Freescale Semiconductor
SPE2 Operations
__ev_dotpwausiaa __ev_dotpwausiaa
Vector Dot Product of Words, Add, Unsigned, Saturate, Integer and Accumulate
d = __ev_dotpwausiaa (a,b)
temph
0:63
a
0:31
ui
b
0:31
templ
0:63
a
32:63
ui
b
2:63
temp
0:64
EXTZ(ACC
0:63
) + EXTZ(temph
0:63
) + EXTZ(templ
0:63
)
ov temp
0
d
0:63
SATURATE(ov, 0, 0xFFFF_FFFF_FFFF_FFFF, 0xFFFF_FFFF_FFFF_FFFF, temp
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of unsigned integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. This pair of intermediate 64-bit products is added together
with the contents of the accumulator, saturating if overflow occurs in the final sum, and the sum is
placed into parameter d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow from
the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-230. Vector Dot Product of Words, Add, Unsigned, Saturate, Integer and
Accumulate (__ev_dotpwausiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwausiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(2 64-bit)
+
+
Accumulator
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-317
__ev_dotpwausiaa3 __ev_dotpwausiaa3
Vector Dot Product of Words, Add, Unsigned, Saturate, Integer and Accumulate, 3 operand
d = __ev_dotpwausiaa3 (a,b,c)
temph
0:63
b
0:31
ui
c
0:31
templ
0:63
b
32:63
ui
c
32:63
temp
0:66
EXTZ
67
(a
0:63
) + EXTZ
67
(temph
0:63
) + EXTZ
67
(templ
0:63
)
ov temp
0:2
!=
3
0
d
0:63
SATURATE(ov, 0, 0xFFFF_FFFF_FFFF_FFFF, 0xFFFF_FFFF_FFFF_FFFF, temp
3:66
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of unsigned integer word elements in parameters b and c are multiplied
producing a pair of 64-bit products. This pair of intermediate 64-bit products is added together
with the contents of parameter a, saturating if overflow occurs in the final sum, and the sum is
placed into parameter d and the accumulator. The overflow and summary overflow bits are
recorded in the SPEFSCR based on an overflow from the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-231. Vector Dot Product of Words, Add, Unsigned, Saturate, Integer and
Accumulate 3 op (__ev_dotpwausiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwausiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(2 64-bit)
+
+
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-318 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcsmi[a] __ev_dotpwcsmi[a]
Vector Dot Product of Words, Complex, Signed, Modulo, Integer (to Accumulator)
d = __ev_dotpwcsmi (a,b) (A = 0)
d = __ev_dotpwcsmia (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
si
b
0:31
temph2
0:63
a
32:63
si
b
32:63
temph
0:63
temph1
0:63
- temph2
0:63
// modulo difference
d
0:31
temph
32:63
//low dot - calculate imaginary part of complex product
templ1
0:31
a
32:63
si
b
0:31
templ2
0:31
a
0:31
si
b
32:63
templ
0:63
templ1
0:63
+ templ2
0:63
// modulo sum
d
32:63
templ
32:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
For the high word element in the destination, corresponding word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 64-bit products. The low-order 32-bits of
the difference of this pair of intermediate 64-bit products is placed into the high word of parameter
d. For the low word element in the destination, word pairs of signed integer elements from the
words of parameters a and b are multiplied after exchanging the words of parameter a, producing
a pair of 64-bit products. The low-order 32-bits of the sum of this pair of intermediate 64-bit
products is placed into the low word of parameter d. If A = 1, the result in parameter d is also
placed into the accumulator. This instruction can be used to perform a complex multiply or dot
product by placing {real,img} pairs into the word pairs of parameters a and b.
Other registers altered: ACC (if A=1)
Figure 3-232. Vector Dot Product of Words, Complex, Signed, Modulo, Integer (to
Accumulator) (__ev_dotpwcsmi[a])
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-319
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcsmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcsmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-320 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcsmiaaw __ev_dotpwcsmiaaw
Vector Dot Product of Words, Complex, Signed, Modulo, Integer and Accumulate into Words
d = __ev_dotpwcsmiaaw (a,b)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
si
b
0:31
temph2
0:63
a
32:63
si
b
32:63
temph
0:63
temph1
0:63
- temph2
0:63
// modulo difference
d
0:31
temph
32:63
+ ACC
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
a
32:63
si
b
0:31
templ2
0:31
a
0:31
si
b
32:63
templ
0:63
templ1
0:63
+ templ2
0:63
// modulo sum
d
32:63
templ
32:63
+ ACC
32:63
// update accumulator
ACC
0:63
d
0:63
For the high word element in the destination, corresponding word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 64-bit products. The low-order 32-bits of
the difference of this pair of intermediate 64-bit products is added together with the contents of the
upper accumulator word and the sum is placed into the high words of parameter d and the
accumulator. For the low word element in the destination, word pairs of signed integer elements
from the words of parameters a and b are multiplied after exchanging the words of parameter a,
producing a pair of 64-bit products. The low-order 32-bits of the sum of this pair of intermediate
64-bit products is added together with the contents of the low accumulator word and the sum is
placed into the low words of parameter d and the accumulator. This instruction can be used to
perform a complex multiply or dot product by placing {real,img} pairs into the word pairs of
parameters a and b.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-321
Figure 3-233. Vector Dot Product of Words, Complex, Signed, Modulo, Integer and
Accumulate Words (__ev_dotpwcsmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcsmiaaw d,a,b
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
- +
+ +
Accumulator
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
31 32 0 63 31 32 0 63
(not to scale)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-322 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcsmiaaw3__ev_dotpwcsmiaaw3
Vector Dot Product of Words, Complex, Signed, Modulo, Integer and Accumulate into Words 3
operand
d = __ev_dotpwcsmiaaw3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:63
b
0:31
si
c
0:31
temph2
0:63
b
32:63
si
c
32:63
temph
0:63
temph1
0:63
- temph2
0:63
// modulo difference
d
0:31
temph
32:63
+ a
0:31
//low dot - calculate imaginary part of complex product
templ1
0:31
b
32:63
si
c
0:31
templ2
0:31
b
0:31
si
c
32:63
templ
0:63
templ1
0:63
+ templ2
0:63
// modulo sum
d
32:63
templ
32:63
+ a
32:63
// update accumulator
ACC
0:63
d
0:63
For the high word element in the destination, corresponding word pairs of signed integer elements
in parameters b and c are multiplied producing a pair of 64-bit products. The low-order 32-bits of
the difference of this pair of intermediate 64-bit products is added together with the contents of the
upper parameter a word and the sum is placed into the high words of parameter d and the
accumulator. For the low word element in the destination, word pairs of signed integer elements
from the words of parameters b and c are multiplied after exchanging the words of parameter b,
producing a pair of 64-bit products. The low-order 32-bits of the sum of this pair of intermediate
64-bit products is added together with the contents of the low parameter a word and the sum is
placed into the low words of parameter d and the accumulator. This instruction can be used to
perform a complex multiply or dot product by placing {real,img} pairs into the word pairs of
parameters b and c.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-323
Figure 3-234. Vector Dot Product of Words, Complex, Signed, Modulo, Integer and
Accumulate Words 3 op (__ev_dotpwcsmiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwcsmiaaw3 d,b,c
31 32 31 32 0 63 0 63
Intermediate
c
x x
b
products
(4 64-bit)
x x
- +
+ +
a
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
31 32 0 63 31 32 0 63
(not to scale)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-324 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssf[a] __ev_dotpwcssf[a]
Vector Dot Product of Words, Complex, Signed, Saturate, Fractional (to Accumulator)
d = __ev_dotpwcssf (a,b) (A = 0)
d = __ev_dotpwcssfa (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
temph1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
temph2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
temph
0:64
EXTS
65
(temph1
0:63
) - EXTS
65
(temph2
0:63
)
ovh temph
0
temph
1
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
1:32
)
//low dot - calculate imaginary part of complex product
templ1
0:63
a
32:63
sf
b
0:31
if (a
32:63
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
templ1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:63
a
0:31
sf
b
32:63
if (a
0:31
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
templ2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
templ
0:64
EXTS
65
(templ1
0:63
) + EXTS
65
(templ2
0:63
)
ovl templ
0
templ
1
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
1:32
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding word pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 64-bit products. The high-order
32-bit difference of this pair of intermediate 64-bit products is placed into the high word of
parameter d, saturating if overflow or underflow occurs. For the low word element in the
destination, word pairs of signed fractional elements from the words of parameters a and b are
multiplied after exchanging the words of parameter a, producing a pair of 64-bit products. The
high-order 32-bit sum of this pair of intermediate 64-bit products is placed into the low word of
parameter d, saturating if overflow or underflow occurs. If both elements of a multiply are -1.0,
the product is saturated. If A = 1, the result in parameter d is also placed into the accumulator. This
instruction can be used to perform a complex multiply or dot product by placing {real,img} pairs
into the word pairs of parameters a and b.
Other registers altered: ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-325
Figure 3-235. Vector Dot Product of Words, Complex, Signed, Saturate, Fractional (to
Accumulator) (__ev_dotpwcssf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssfa d,a,b
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
SAT SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-326 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssfaaw __ev_dotpwcssfaaw
Vector Dot Product of Words, Complex, Signed, Saturate, Fractional and Accumulate into Words
d = __ev_dotpwcssfaaw (a,b)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
temph1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
temph2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
temph
0:66
EXTS
67
(ACC
0:31
||
32
0) + EXTS
67
(temph1
0:63
) - EXTS
67
(temph2
0:63
)
ovh chk_ovf(temph
0:3
)
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
3:34
)
//low dot - calculate imaginary part of complex product
templ1
0:63
a
32:63
sf
b
0:31
if (a
32:63
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
templ1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:63
a
0:31
sf
b
32:63
if (a
0:31
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
templ2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
templ
0:66
EXTS
67
(ACC
32:63
||
32
0) + EXTS
67
(templ1
0:63
) + EXTS
67
(templ2
0:63
)
ovl chk_ovf(templ
0:3
)
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
3:34
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding word pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 64-bit products. The high-order
32-bit difference of this pair of intermediate 64-bit products added to the upper word of the
accumulator, saturating if overflow or underflow occurs. The 32-bit sum is placed into the high
word of parameter d and the accumulator. For the low word element in the destination, word pairs
of signed fractional elements from the words of parameters a and b are multiplied after exchanging
the words of parameter a, producing a pair of 64-bit products. The high-order 32-bit sum of this
pair of intermediate 64-bit products is added to the lower word of the accumulator, saturating if
overflow or underflow occurs. The 32-bit sum is placed into the low word of parameter d and the
accumulator. If both elements of a multiply are -1.0, the product is saturated. This instruction can
be used to perform a complex multiply or dot product by placing {real,img} pairs into the word
pairs of parameters a and b.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-327
Figure 3-236. Vector Dot Product of Words, Complex, Signed, Saturate, Fractional and
Accumulate Words (__ev_dotpwcssfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssfaaw d,a,b
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
- +
+ +
Accumulator
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
SAT
SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-328 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssfaaw3 __ev_dotpwcssfaaw3
Vector Dot Product of Words, Complex, Signed, Saturate, Fractional and Accumulate into Words
3 operand
d = __ev_dotpwcssfaaw3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:63
b
0:31
sf
c
0:31
if (b
0:31
= 0x8000_0000) & (c
0:31
= 0x8000_0000) then
temph1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:63
b
32:63
sf
c
32:63
if (b
32:63
= 0x8000_0000) & (c
32:63
= 0x8000_0000) then
temph2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
temph
0:66
EXTS
67
(a
0:31
||
32
0) + EXTS
67
(temph1
0:63
) - EXTS
67
(temph2
0:63
)
ovh chk_ovf(temph
0:3
)
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
3:34
)
//low dot - calculate imaginary part of complex product
templ1
0:63
b
32:63
sf
c
0:31
if (b
32:63
= 0x8000_0000) & (c
0:31
= 0x8000_0000) then
templ1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:63
b
0:31
sf
c
32:63
if (b
0:31
= 0x8000_0000) & (c
32:63
= 0x8000_0000) then
templ2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
templ
0:66
EXTS
67
(a
32:63
||
32
0) + EXTS
67
(templ1
0:63
) + EXTS
67
(templ2
0:63
)
ovl chk_ovf(templ
0:3
)
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
3:34
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding word pairs of signed fractional
elements in parameters b and c are multiplied producing a pair of 64-bit products. The high-order
32-bit difference of this pair of intermediate 64-bit products added to the upper word of parameter
a, saturating if overflow or underflow occurs. The 32-bit sum is placed into the high word of
parameter d and the accumulator. For the low word element in the destination, word pairs of signed
fractional elements from the words of parameters b and c are multiplied after exchanging the
words of parameter b, producing a pair of 64-bit products. The high-order 32-bit sum of this pair
of intermediate 64-bit products is added to the lower word of parameter a, saturating if overflow
or underflow occurs. The 32-bit sum is placed into the low word of parameter d and the
accumulator. If both elements of a multiply are -1.0, the product is saturated. This instruction can
be used to perform a complex multiply or dot product by placing {real,img} pairs into the word
pairs of parameters b and c.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-329
Figure 3-237. Vector Dot Product of Words, Complex, Signed, Saturate, Fractional and
Accumulate Words 3 op (__ev_dotpwcssfaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwcssfaaw3 d,b,c
31 32 31 32 0 63 0 63
Intermediate
c
x x
b
products
(4 64-bit)
x x
- +
+ +
a
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
SAT
SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-330 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssfr[a] __ev_dotpwcssfr[a]
Vector Dot Product of Words, Complex, Signed, Saturate, Fractional, Round (to Accumulator)
d = __ev_dotpwcssfr (a,b) (A = 0)
d = __ev_dotpwcssfra (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
temph1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
temph2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
temph
0:66
ROUND((EXTS
67
(temph1
0:63
) - EXTS
67
(temph2
0:63
)), 32)
ovh chk_ovf(temph
0:3
)
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
3:34
)
//low dot - calculate imaginary part of complex product
templ1
0:63
a
32:63
sf
b
0:31
if (a
32:63
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
templ1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:63
a
0:31
sf
b
32:63
if (a
0:31
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
templ2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
templ
0:66
ROUND((EXTS
67
(templ1
0:63
) + EXTS
67
(templ2
0:63
)),32)
ovl chk_ovf(templ
0:3
)
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
3:34
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding word pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 64-bit products. The difference
of this pair of intermediate products is rounded to 32 bits using the current fractional rounding
mode in SPEFCSR, saturating if overflow or underflow occurs, and placed into the high word of
parameter d. For the low word element in the destination, word pairs of signed fractional elements
from the words of parameters a and b are multiplied after exchanging the words of parameter a,
producing a pair of 64-bit products. The sum of this pair of intermediate products is rounded to 32
bits using the current fractional rounding mode in SPEFCSR, saturating if overflow or underflow
occurs, and placed into the high word of parameter d. If both elements of a multiply are -1.0, the
product is saturated. If A = 1, the result in parameter d is also placed into the accumulator. This
instruction can be used to perform a complex multiply or dot product by placing {real,img} pairs
into the word pairs of parameters a and b.
Other registers altered: ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-331
Figure 3-238. Vector Dot Product of Words, Complex, Signed, Saturate, Fractional, Round
(to Accumulator) (__ev_dotpwcssfr[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssfra d,a,b
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
ROUND & SAT
ROUND & SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-332 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssfraaw __ev_dotpwcssfraaw
Vector Dot Product of Words, Complex, Signed, Saturate, Fractional, Round and Accumulate
into Words
d = __ev_dotpwcssfraaw (a,b)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
temph1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
temph2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
temph
0:66
ROUND((EXTS
67
(ACC
0:31
||
32
0) + EXTS
67
(temph1
0:63
) - EXTS
67
(temph2
0:63
)),32)
ovl chk_ovf(temph
0:3
)
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
3:34
)
//low dot - calculate imaginary part of complex product
templ1
0:63
a
32:63
sf
b
0:31
if (a
32:63
= 0x8000_0000) & (b
0:31
= 0x8000_0000) then
templ1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:63
a
0:31
sf
b
32:63
if (a
0:31
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
templ2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
templ
0:66
ROUND((EXTS
67
(ACC
32:63
||
32
0) + EXTS
67
(templ1
0:63
) + EXTS
67
(templ2
0:63
)),32)
ovl chk_ovf(templ
0:3
)
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
3:34
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding word pairs of signed fractional
elements in parameters a and b are multiplied producing a pair of 64-bit products. The difference
of this pair of intermediate 64-bit products is added to the zero-padded upper word of the
accumulator, rounded to 32 bits using the current fractional rounding mode in SPEFCSR,
saturating if overflow or underflow occurs, and placed into the high word of parameter d and the
accumulator. For the low word element in the destination, word pairs of signed fractional elements
from the words of parameters a and b are multiplied after exchanging the words of parameter a,
producing a pair of 64-bit products. The sum of this pair of intermediate 64-bit products is added
to the zero-padded lower word of the accumulator, rounded to 32 bits using the current fractional
rounding mode in SPEFCSR, saturating if overflow or underflow occurs, and placed into the low
word of parameter d and the accumulator. If both elements of a multiply are -1.0, the product is
saturated. This instruction can be used to perform a complex multiply or dot product by placing
{real,img} pairs into the word pairs of parameters a and b.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-333
Figure 3-239. Vector Dot Product of Words, Complex, Signed, Saturate, Fractional, Round
and Accumulate Words (__ev_dotpwcssfraaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssfraaw d,a,b
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
- +
+ +
Accumulator
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
ROUND & SAT
ROUND & SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-334 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssfraaw3 __ev_dotpwcssfraaw3
Vector Dot Product of Words, Complex, Signed, Saturate, Fractional, Round and Accumulate
into Words 3 operand
d = __ev_dotpwcssfraaw3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:63
b
0:31
sf
c
0:31
if (b
0:31
= 0x8000_0000) & (c
0:31
= 0x8000_0000) then
temph1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
movh 0
temph2
0:63
b
32:63
sf
c
32:63
if (b
32:63
= 0x8000_0000) & (c
32:63
= 0x8000_0000) then
temph2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
temph
0:66
ROUND((EXTS
67
(a
0:31
||
32
0) + EXTS
67
(temph1
0:63
) - EXTS
67
(temph2
0:63
)),32)
ovl chk_ovf(temph
0:3
)
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
3:34
)
//low dot - calculate imaginary part of complex product
templ1
0:63
b
32:63
sf
a
0:31
if (b
32:63
= 0x8000_0000) & (c
0:31
= 0x8000_0000) then
templ1
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
movl 0
templ2
0:63
b
0:31
sf
c
32:63
if (b
0:31
= 0x8000_0000) & (c
32:63
= 0x8000_0000) then
templ2
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
templ
0:66
ROUND((EXTS
67
(a
32:63
||
32
0) + EXTS
67
(templ1
0:63
) + EXTS
67
(templ2
0:63
)),32)
ovl chk_ovf(templ
0:3
)
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
3:34
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh | ovh; SPEFSCR
OV
movl | ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh | ovh; SPEFSCR
SOV
SPEFSCR
SOV
| movl | ovl
For the high word element in the destination, corresponding word pairs of signed fractional
elements in parameters b and c are multiplied producing a pair of 64-bit products. The difference
of this pair of intermediate 64-bit products is added to the zero-padded upper word of parameter
a, rounded to 32 bits using the current fractional rounding mode in SPEFCSR, saturating if
overflow or underflow occurs, and placed into the high word of parameter d and the accumulator.
For the low word element in the destination, word pairs of signed fractional elements from the
words of parameters b and c are multiplied after exchanging the words of parameter b, producing
a pair of 64-bit products. The sum of this pair of intermediate 64-bit products is added to the
zero-padded lower word of the parameter a, rounded to 32 bits using the current fractional
rounding mode in SPEFCSR, saturating if overflow or underflow occurs, and placed into the low
word of parameter d and the accumulator. If both elements of a multiply are -1.0, the product is
saturated. This instruction can be used to perform a complex multiply or dot product by placing
{real,img} pairs into the word pairs of parameters b and c.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-335
Figure 3-240. Vector Dot Product of Words, Complex, Signed, Saturate, Fractional, Round
and Accumulate Words 3 op (__ev_dotpwcssfraaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwcssfraaw3 d,b,c
31 32 31 32 0 63 0 63
Intermediate
c
x x
b
products
(4 64-bit)
x x
- +
+ +
a
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
ROUND & SAT
ROUND & SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-336 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssi[a] __ev_dotpwcssi[a]
Vector Dot Product of Words, Complex, Signed, Saturate, Integer (to Accumulator)
d = __ev_dotpwcssi (a,b) (A = 0)
d = __ev_dotpwcssia (a,b) (A = 1)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
si
b
0:31
temph2
0:63
a
32:63
si
b
32:63
temph
0:64
EXTS
65
(temph1
0:63
) - EXTS
65
(temph2
0:63
)
if (temph
0:64
>
si
EXTS
65
(0x7FFF_FFFF)) | (temph
0:64
<
si
EXTS
65
(0x8000_0000)) then
ovh 1
else
ovh 0
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
33:64
)
//low dot - calculate imaginary part of complex product
templ1
0:31
a
32:63
si
b
0:31
templ2
0:31
a
0:31
si
b
32:63
templ
0:64
EXTS
65
(templ1
0:63
) + EXTS
65
(templ2
0:63
)
if (templ
0:64
>
si
EXTS
65
(0x7FFF_FFFF)) | (templ
0:64
<
si
EXTS
65
(0x8000_0000)) then
ovl 1
else
ovl 0
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
33:64
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 64-bit products. The difference of this pair
of intermediate 64-bit products is placed into the high word of parameter d, saturating if the
difference cannot be represented in 32 bits. For the low word element in the destination, word pairs
of signed integer elements from the words of parameters a and b are multiplied after exchanging
the words of parameter a, producing a pair of 64-bit products. The low-order 32-bits of the sum of
this pair of intermediate 64-bit products is placed into the low word of parameter d, saturating if
the sum cannot be represented in 32 bits. If A = 1, the result in parameter d is also placed into the
accumulator. This instruction can be used to perform a complex multiply or dot product by placing
{real,img} pairs into the word pairs of parameters a and b.
Other registers altered: ACC (if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-337
Figure 3-241. Vector Dot Product of Words, Complex, Signed, Saturate, Integer (to
Accumulator) (__ev_dotpwcssi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssia d,a,b
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
-
d (and Accumulator if A=1)
+
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
SAT SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-338 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssiaaw __ev_dotpwcssiaaw
Vector Dot Product of Words, Complex, Signed, Saturate, Integer and Accumulate into Words
d = __ev_dotpwcssiaaw (a,b)
// high dot - calculate real part of complex product
temph1
0:63
a
0:31
si
b
0:31
temph2
0:63
a
32:63
si
b
32:63
temph
0:66
EXTS
67
(ACC
0:31
) + EXTS
67
(temph1
0:63
) - EXTS
67
(temph2
0:63
)
if (temph
0:66
>
si
EXTS
67
(0x7FFF_FFFF)) | (temph
0:66
<
si
EXTS
67
(0x8000_0000)) then
ovh 1
else
ovh 0
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
35:66
)
//low dot - calculate imaginary part of complex product
templ1
0:31
a
32:63
si
b
0:31
templ2
0:31
a
0:31
si
b
32:63
templ
0:66
EXTS
67
(ACC
32:63
) + EXTS
67
(templ1
0:63
) + EXTS
67
(templ2
0:63
)
if (templ
0:66
>
si
EXTS
67
(0x7FFF_FFFF)) | (templ
0:66
<
si
EXTS
67
(0x8000_0000)) then
ovl 1
else
ovl 0
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
35:66
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding word pairs of signed integer elements
in parameters a and b are multiplied producing a pair of 64-bit products. The difference of this pair
of intermediate 64-bit products is added together with the contents of the sign-extended upper
accumulator word, and the sum is placed into the high word of parameter d and the accumulator,
saturating if the sum cannot be represented in 32 bits. For the low word element in the destination,
word pairs of signed integer elements from the words of parameters a and b are multiplied after
exchanging the words of parameter a, producing a pair of 64-bit products. The sum of this pair of
intermediate 64-bit products is added together with the contents of the sign-extended lower
accumulator word and the sum is placed into the low word of parameter d and the accumulator,
saturating if the sum cannot be represented in 32 bits. This instruction can be used to perform a
complex multiply or dot product by placing {real,img} pairs into the word pairs of parameters a
and b.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-339
Figure 3-242. Vector Dot Product of Words, Complex, Signed, Saturate, Integer and
Accumulate Words (__ev_dotpwcssiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwcssiaaw d,a,b
31 32 31 32 0 63 0 63
Intermediate
b
x x
a
products
(4 64-bit)
x x
- +
+ +
Accumulator
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
SAT
SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-340 Freescale Semiconductor
SPE2 Operations
__ev_dotpwcssiaaw3 __ev_dotpwcssiaaw3
Vector Dot Product of Words, Complex, Signed, Saturate, Integer and Accumulate into Words 3
operand
d = __ev_dotpwcssiaaw3 (a,b,c)
// high dot - calculate real part of complex product
temph1
0:63
b
0:31
si
c
0:31
temph2
0:63
b
32:63
si
c
32:63
temph
0:66
EXTS
67
(a
0:31
) + EXTS
67
(temph1
0:63
) - EXTS
67
(temph2
0:63
)
if (temph
0:66
>
si
EXTS
67
(0x7FFF_FFFF)) | (temph
0:66
<
si
EXTS
67
(0x8000_0000)) then
ovh 1
else
ovh 0
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
35:66
)
//low dot - calculate imaginary part of complex product
templ1
0:31
b
32:63
si
c
0:31
templ2
0:31
b
0:31
si
c
32:63
templ
0:66
EXTS
67
(a
32:63
) + EXTS
67
(templ1
0:63
) + EXTS
67
(templ2
0:63
)
if (templ
0:66
>
si
EXTS
67
(0x7FFF_FFFF)) | (templ
0:66
<
si
EXTS
67
(0x8000_0000)) then
ovl 1
else
ovl 0
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
35:66
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For the high word element in the destination, corresponding word pairs of signed integer elements
in parameters b and c are multiplied producing a pair of 64-bit products. The difference of this pair
of intermediate 64-bit products is added together with the contents of the sign-extended upper
parameter a word, and the sum is placed into the high word of parameter d and the accumulator,
saturating if the sum cannot be represented in 32 bits. For the low word element in the destination,
word pairs of signed integer elements from the words of parameters b and c are multiplied after
exchanging the words of parameter b, producing a pair of 64-bit products. The sum of this pair of
intermediate 64-bit products is added together with the contents of the sign-extended low
parameter a word and the sum is placed into the low word of parameter d and the accumulator,
saturating if the sum cannot be represented in 32 bits. This instruction can be used to perform a
complex multiply or dot product by placing {real,img} pairs into the word pairs of parameters b
and c.
Other registers altered: ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-341
Figure 3-243. Vector Dot Product of Words, Complex, Signed, Saturate, Integer and
Accumulate Words 3 op (__ev_dotpwcssiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwcssiaaw3 d,b,c
31 32 31 32 0 63 0 63
Intermediate
c
x x
b
products
(4 64-bit)
x x
- +
+ +
a
d and Accumulator
Re Img Re Img
Re Img Re Img
Re Img
(not to scale)
SAT
SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-342 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgasmf[a] __ev_dotpwgasmf[a]
Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional (to Accumulator)
d = __ev_dotpwgasmf (a,b) (A = 0)
d = __ev_dotpwgasmfa (a,b) (A = 1)
temph
0:64
b
0:31
sf
a
0:31
templ
0:64
b
32:63
sf
a
32:63
temp
0:65
EXTS
66
(temph
0:64
) + EXTS
66
(templ
0:64
)
d
0:63
EXTS
64
(temp
0:49
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the corresponding words
in parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, added
together, and the high-order 50 bits of the 66-bit sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format, and the result is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-244. Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional (to
Accumulator) (__ev_dotpwgasmf[a])
0 31 32 63
b
X
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
65
+
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-343
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgasmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgasmfa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-344 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgasmfaa __ev_dotpwgasmfaa
Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional and Accumulate
d = __ev_dotpwgasmfaa (a,b)
temph
0:64
b
0:31
sf
a
0:31
templ
0:64
b
32:63
sf
a
32:63
temp
0:65
EXTS
66
(temph
0:64
) + EXTS
66
(templ
0:64
)
d
0:63
ACC
0:63
+ EXTS
64
(temp
0:49
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the corresponding words
in parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, added
together, and the high-order 50 bits of the 66-bit sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format which is added to the contents of the accumulator, and
the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-245. Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional and
Accumulate (__ev_dotpwgasmfaa)
0 31 32 63
b
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
65
X X
+
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-345
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgasmfaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-346 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgasmfaa3 __ev_dotpwgasmfaa3
Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional and Accumulate 3
operand
d = __ev_dotpwgasmfaa3 (a,b,c)
temph
0:64
c
0:31
sf
b
0:31
templ
0:64
c
32:63
sf
b
32:63
temp
0:65
EXTS
66
(temph
0:64
) + EXTS
66
(templ
0:64
)
d
0:63
a
0:63
+ EXTS
64
(temp
0:49
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter c are multiplied with the corresponding words
in parameter b to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, added
together, and the high-order 50 bits of the 66-bit sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format which is added to the contents of parameter a, and the
result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-246. Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional and
Accumulate 3 op (__ev_dotpwgasmfaa3)
0 31 32 63
c
b
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
65
X X
+
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-347
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwgasmfaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-348 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgasmfr[a] __ev_dotpwgasmfr[a]
Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional, Round (to Accumulator)
d = __ev_dotpwgasmfr (a,b) (A = 0)
d = __ev_dotpwgasmfra (a,b) (A = 1)
temph
0:64
b
0:31
sf
a
0:31
templ
0:64
b
32:63
sf
a
32:63
temp
0:66
EXTS
67
(temph
0:64
) + EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
EXTS
64
(tempr
0:50
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the corresponding words
in parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, added
together, and rounded using the current rounding mode in SPEFSCR to discard 16 low order bits.
The high order 51 bits of the 67-bit rounded sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format, and the result is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-247. Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional,
Round (to Accumulator) (__ev_dotpwgasmfr[a])
0 31 32 63
b
X
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
66
+
ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-349
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgasmfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgasmfra d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-350 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgasmfraa __ev_dotpwgasmfraa
Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional, Round and Accumulate
d = __ev_dotpwgasmfraa (a,b)
temph
0:64
b
0:31
sf
a
0:31
; templ
0:64
b
32:63
sf
a
32:63
temp
0:66
EXTS
67
(temph
0:64
) + EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
ACC
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the corresponding words
in parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, added
together, and rounded using the current rounding mode in SPEFSCR to discard 16 low order bits.
The high order 51 bits of the 67-bit rounded sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format. The intermediate sum is added to the contents of the
accumulator, and the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-248. Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional,
Round and Accumulate (__ev_dotpwgasmfraa)
0 31 32 63
b
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
66
X X
+
ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-351
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgasmfraa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-352 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgasmfraa3 __ev_dotpwgasmfraa3
Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional, Round and Accumulate
3 operand
d = __ev_dotpwgasmfraa3 (a,b,c)
temph
0:64
c
0:31
sf
b
0:31
; templ
0:64
c
32:63
sf
b
32:63
temp
0:66
EXTS
67
(temph
0:64
) + EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
a
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter c are multiplied with the corresponding words
in parameter b to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, added
together, and rounded using the current rounding mode in SPEFSCR to discard 16 low order bits.
The high order 51 bits of the 67-bit rounded sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format. The intermediate sum is added to the contents of
parameter a, and the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-249. Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional,
Round and Accumulate 3 op (__ev_dotpwgasmfraa3)
0 31 32 63
c
b
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
66
X X
+
ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-353
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwgasmfraa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-354 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgssmf[a] __ev_dotpwgssmf[a]
Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional (to Accumulator)
d = __ev_dotpwgssmf (a,b) (A = 0)
d = __ev_dotpwgssmfa (a,b) (A = 1)
temph
0:64
a
0:31
sf
b
0:31
templ
0:64
a
32:63
sf
b
32:63
temp
0:65
EXTS
66
(temph
0:64
) - EXTS
66
(templ
0:64
)
d
0:63
EXTS
64
(temp
0:49
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameters a and b are multiplied to produce a pair of
65-bit intermediate fractional products. If both inputs are -1.0, the intermediate product is
represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two high-order product bits are
the same. The intermediate products are sign-extended to 66 bits, the low-order product is
subtracted from the high-order product, the high-order 50 bits of the 66-bit result are sign-extended
to 64-bits to produce an intermediate difference in 17.47 fractional format, and the result is placed
into parameter d. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-250. Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional
(to Accumulator) (__ev_dotpwgssmf[a])
0 31 32 63
b
X
a
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
65
-
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-355
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgssmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgssmfa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-356 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgssmfaa __ev_dotpwgssmfaa
Vector Multiply Word Complex, Real, Guarded, Signed, Modulo, Fractional and Accumulate
d = __ev_dotpwgssmfaa (a,b)
temph
0:64
a
0:31
sf
b
0:31
templ
0:64
a
32:63
sf
b
32:63
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
d
0:63
ACC
0:63
+ EXTS
64
(temp
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameters a and b are multiplied to produce a pair of
65-bit intermediate fractional products. If both inputs are -1.0, the intermediate product is
represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two high-order product bits are
the same. The intermediate products are sign-extended to 67 bits, the low-order product is
subtracted from the high-order product, the high-order 51 bits of the 67-bit result are sign-extended
to 64-bits to produce an intermediate difference in 17.47 fractional format which is added to the
contents of the accumulator, and the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-251. Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional
and Accumulate (__ev_dotpwgssmfaa)
0 31 32 63
b
X
a
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
X
66
-
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-357
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgssmfaa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-358 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgssmfaa3 __ev_dotpwgssmfaa3
Vector Multiply Word Complex, Real, Guarded, Signed, Modulo, Fractional and Accumulate 3
operand
d = __ev_dotpwgssmfaa3 (a,b,c)
temph
0:64
b
0:31
sf
c
0:31
templ
0:64
b
32:63
sf
c
32:63
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
d
0:63
a
0:63
+ EXTS
64
(temp
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameters b and c are multiplied to produce a pair of
65-bit intermediate fractional products. If both inputs are -1.0, the intermediate product is
represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two high-order product bits are
the same. The intermediate products are sign-extended to 67 bits, the low-order product is
subtracted from the high-order product, the high-order 51 bits of the 67-bit result are sign-extended
to 64-bits to produce an intermediate difference in 17.47 fractional format which is added to the
contents of parameter a, and the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-252. Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional
and Accumulate 3 op (__ev_dotpwgssmfaa3)
0 31 32 63
c
X
b
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
X
66
-
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-359
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwgssmfaa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-360 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgssmfr[a] __ev_dotpwgssmfr[a]
Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional, Round (to
Accumulator)
d = __ev_dotpwgssmfr (a,b) (A = 0)
d = __ev_dotpwgssmfra (a,b) (A = 1)
temph
0:64
b
0:31
sf
a
0:31
templ
0:64
b
32:63
sf
a
32:63
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
EXTS
64
(tempr
0:50
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the corresponding words
in parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, the
low-order product is subtracted from the high-order product and rounded using the current
rounding mode in SPEFSCR to discard 16 low order bits. The high order 51 bits of the 67-bit
rounded result are sign-extended to 64-bits to produce an intermediate difference in 17.47
fractional format, and the result is placed into parameter d. If A = 1, the result in parameter d is
also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-253. Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional,
Round (to Accumulator) (__ev_dotpwgssmfr[a])
0 31 32 63
b
X
a
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
66
-
ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-361
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgssmfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgssmfra d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-362 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgssmfraa __ev_dotpwgssmfraa
Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional, Round and
Accumulate
d = __ev_dotpwgssmfraa (a,b)
temph
0:64
b
0:31
sf
a
0:31
; templ
0:64
b
32:63
sf
a
32:63
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
ACC
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the corresponding words
in parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, the
low-order product is subtracted from the high-order product and rounded using the current
rounding mode in SPEFSCR to discard 16 low order bits. The high order 51 bits of the 67-bit
rounded result are sign-extended to 64-bits to produce an intermediate difference in 17.47
fractional format which is added to the contents of the accumulator, and the result is placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-254. Vector Dot Product of Words Guarded, Add, Signed, Modulo, Fractional,
Round and Accumulate (__ev_dotpwgssmfraa)
0 31 32 63
b
a
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
66
X X
-
ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-363
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwgssmfraa d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-364 Freescale Semiconductor
SPE2 Operations
__ev_dotpwgssmfraa3 __ev_dotpwgssmfraa3
Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional, Round and
Accumulate 3 operand
d = __ev_dotpwgssmfraa3 (a,b,c)
temph
0:64
c
0:31
sf
b
0:31
; templ
0:64
c
32:63
sf
b
32:63
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
a
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter c are multiplied with the corresponding words
in parameter b to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0,
the intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, the
low-order product is subtracted from the high-order product and rounded using the current
rounding mode in SPEFSCR to discard 16 low order bits. The high order 51 bits of the 67-bit
rounded result are sign-extended to 64-bits to produce an intermediate difference in 17.47
fractional format which is added to the contents of parameter a, and the result is placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-255. Vector Dot Product of Words Guarded, Subtract, Signed, Modulo, Fractional,
Round and Accumulate 3 op (__ev_dotpwgssmfraa3)
0 31 32 63
c
b
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
66
X X
-
ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-365
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwgssmfraa3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-366 Freescale Semiconductor
SPE2 Operations
__ev_dotpwssmi[a] __ev_dotpwssmi[a]
Vector Dot Product of Words, Add, Unsigned, Saturate, integer (to Accumulator)
d = __ev_dotpwssmi (a,b) (A = 0)
d = __ev_dotpwssmia (a,b) (A = 1)
temph
0:63
a
0:31
si
b
0:31
templ
0:63
a
32:63
si
b
32:63
temp
0:31
temph
0:63
- templ
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. The product of the low word elements is subtracted from the
product of the high word elements, and the difference is placed into parameter d. If A = 1, the result
in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-256. Vector Dot Product of Words, Subtract, Signed, Modulo, Integer (to
Accumulator) (__ev_dotpwssmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwssmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwssmia d,a,b
0 31 32 63
Intermediate
b
x
a
products
(2 64-bit)
x
-
d (and Accumulator if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-367
__ev_dotpwssmiaa __ev_dotpwssmiaa
Vector Dot Product of Words, Subtract, Signed, Modulo, Integer and Accumulate
d = __ev_dotpwssmiaa (a,b)
temph
0:63
a
0:31
si
b
0:31
templ
0:63
a
32:63
si
b
32:63
temp
0:63
temph
0:63
- templ
0:63
+ ACC
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. The product of the low word elements is subtracted from the
product of the high word elements, and the result is added to the contents of the accumulator and
the sum is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-257. Vector Dot Product of Words, Subtract, Signed, Modulo, Integer and
Accumulate (__ev_dotpwssmiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwssmiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(2 64-bit)
-
+
Accumulator
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-368 Freescale Semiconductor
SPE2 Operations
__ev_dotpwssmiaa3 __ev_dotpwssmiaa3
Vector Dot Product of Words, Subtract, Signed, Modulo, Integer and Accumulate, 3 operand
d = __ev_dotpwssmiaa3 (a,b,c)
temph
0:63
b
0:31
si
c
0:31
templ
0:63
b
32:63
si
c
32:63
temp
0:63
temph
0:63
- templ
0:63
+ a
0:63
// modulo sum
d
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
Corresponding pairs of signed integer word elements in parameters b and c are multiplied
producing a pair of 64-bit products. The product of the low word elements is subtracted from the
product of the high word elements, the result is added to the contents of parameter a, and the sum
is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-258. Vector Dot Product of Words, Subtract, Signed, Modulo, Integer and
Accumulate 3op (__ev_dotpwssmiaa3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwssmiaa3 d,b,c
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(2 64-bit)
-
+
a
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-369
__ev_dotpwsssi[a] __ev_dotpwsssi[a]
Vector Dot Product of Words, Subtract, Signed, Saturate, integer (to Accumulator)
d = __ev_dotpwsssi (a,b) (A = 0)
d = __ev_dotpwsssia (a,b) (A = 1)
Maps to evdotpwssmi since no overflow can occur
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwssmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwssmia d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-370 Freescale Semiconductor
SPE2 Operations
__ev_dotpwsssiaa __ev_dotpwsssiaa
Vector Dot Product of Words, Subtract, Signed, Saturate, Integer and Accumulate
d = __ev_dotpwsssiaa (a,b)
temph
0:63
a
0:31
si
b
0:31
templ
0:63
a
32:63
si
b
32:63
temp
0:64
EXTS(ACC
0:63
) + EXTS(temph
0:63
) - EXTS(templ
0:63
)
ov (temp
0
temp
1
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
1:64
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
mov | ov
SPEFSCR
SOV
SPEFSCR
SOV
| mov | ov
Corresponding pairs of signed integer word elements in parameters a and b are multiplied
producing a pair of 64-bit products. The product of the low word elements is subtracted from the
product of the high word elements, then added together with the contents of the accumulator,
saturating if overflow or underflow occurs in the final sum, and the sum is placed into parameter
d and the accumulator.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow or
underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-259. Vector Dot Product of Words, Add, Signed, Saturate, Integer and
Accumulate (__ev_dotpwsssiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwsssiaa d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(2 64-bit)
-
+
Accumulator
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-371
__ev_dotpwsssiaa3 __ev_dotpwsssiaa3
Vector Dot Product of Words, Subtract, Signed, Saturate, Integer and Accumulate, 3 operand
d = __ev_dotpwsssiaa3 (a,b,c)
temph
0:63
b
0:31
si
c
0:31
templ
0:63
b
32:63
si
c
32:63
temp
0:66
EXTS
67
(a
0:63
) + EXTS
67
(temph
0:63
) - EXTS
67
(templ
0:63
)
ov chk_ovf(temp
0:3
)
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7FFF_FFFF_FFFF_FFFF, temp
3:66
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
Corresponding pairs of signed integer word elements in parameters b and c are multiplied
producing a pair of 64-bit products. The product of the low word elements is subtracted from the
product of the high word elements, then added together with the contents of parameter a, saturating
if overflow or underflow occurs in the final sum, and the sum is placed into parameter d and the
accumulator. The overflow and summary overflow bits are recorded in the SPEFSCR based on an
overflow or underflow from the accumulation operations.
Other registers altered: SPEFSCR ACC
Figure 3-260. Vector Dot Product of Words, Subtract, Signed, Saturate, Integer and
Accumulate 3 op (__ev_dotpwsssiaa3)
15 16 47 48 0 31 32 63
Intermediate
c
x x
b
products
(2 64-bit)
-
+
a
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-372 Freescale Semiconductor
SPE2 Operations
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwsssiaa3 d,b,c
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-373
__ev_dotpwxgasmf[a] __ev_dotpwxgasmf[a]
Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo, Fractional (to
Accumulator)
d = __ev_dotpwxgasmf (a,b) (A = 0)
d = __ev_dotpwxgasmfa (a,b) (A = 1)
temph
0:64
b
0:31
sf
a
32:63
templ
0:64
b
32:63
sf
a
0:31
temp
0:65
EXTS
66
(temph
0:64
) + EXTS
66
(templ
0:64
)
d
0:63
EXTS
64
(temp
0:49
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, added
together, and the high-order 50 bits of the 66-bit sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format, and the result is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-261. Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo,
Fractional (to Accumulator) (__ev_dotpwxgasmf[a])
0 31 32 63
b
X
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
65
+
SPE2 Programming Interface Manual, Rev. 1.0-2
3-374 Freescale Semiconductor
SPE2 Operations
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgasmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgasmfa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-375
__ev_dotpwxgasmfaa __ev_dotpwxgasmfaa
Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo, Fractional and
Accumulate
d = __ev_dotpwxgasmfaa (a,b)
temph
0:64
b
0:31
sf
a
32:63
templ
0:64
b
32:63
sf
a
0:31
temp
0:65
EXTS
66
(temph
0:64
) + EXTS
66
(templ
0:64
)
d
0:63
ACC
0:63
+ EXTS
64
(temp
0:49
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, added
together, and the high-order 50 bits of the 66-bit sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format which is added to the contents of the accumulator, and
the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-262. Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo,
Fractional and Accumulate (__ev_dotpwxgasmfaa)
0 31 32 63
b
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
65
X X
+
SPE2 Programming Interface Manual, Rev. 1.0-2
3-376 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgasmfaa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-377
__ev_dotpwxgasmfaa3 __ev_dotpwxgasmfaa3
Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo, Fractional and
Accumulate 3 operand
d = __ev_dotpwxgasmfaa3 (a,b,c)
temph
0:64
c
0:31
sf
b
32:63
templ
0:64
c
32:63
sf
b
0:31
temp
0:65
EXTS
66
(temph
0:64
) + EXTS
66
(templ
0:64
)
d
0:63
a
0:63
+ EXTS
64
(temp
0:49
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter c are multiplied with the exchanged words in
parameter b to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, added
together, and the high-order 50 bits of the 66-bit sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format which is added to the contents of parameter a, and the
result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-263. Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo,
Fractional and Accumulate 3op (__ev_dotpwxgasmfaa3)
0 31 32 63
c
b
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
65
X X
+
SPE2 Programming Interface Manual, Rev. 1.0-2
3-378 Freescale Semiconductor
SPE2 Operations
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwxgasmfaa3 d,b,c
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-379
__ev_dotpwxgasmfr[a] __ev_dotpwxgasmfr[a]
Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo, Fractional, Round (to
Accumulator)
d = __ev_dotpwxgasmfr (a,b) (A = 0)
d = __ev_dotpwxgasmfra (a,b) (A = 1)
temph
0:64
b
0:31
sf
a
32:63
templ
0:64
b
32:63
sf
a
0:31
temp
0:66
EXTS
67
(temph
0:64
) + EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
EXTS
64
(tempr
0:50
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, added
together, and rounded using the current rounding mode in SPEFSCR to discard 16 low order bits.
The high order 51 bits of the 67-bit rounded sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format, and the result is placed into parameter d. If A = 1, the
result in parameter d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-264. Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo,
Fractional, Round (to Accumulator) (__ev_dotpwxgasmfr[a])
0 31 32 63
b
X
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
66
+
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-380 Freescale Semiconductor
SPE2 Operations
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgasmfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgasmfra d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-381
__ev_dotpwxgasmfraa __ev_dotpwxgasmfraa
Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo, Fractional, Round and
Accumulate
d = __ev_dotpwxgasmfraa (a,b)
temph
0:64
b
0:31
sf
a
32:63
; templ
0:64
b
32:63
sf
a
0:31
temp
0:66
EXTS
67
(temph
0:64
) + EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
ACC
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, added
together, and rounded using the current rounding mode in SPEFSCR to discard 16 low order bits.
The high order 51 bits of the 67-bit rounded sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format. The intermediate sum is added to the contents of the
accumulator, and the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-265. Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo,
Fractional, Round and Accumulate (__ev_dotpwxgasmfraa)
0 31 32 63
b
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
66
X X
+
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-382 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgasmfraa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-383
__ev_dotpwxgasmfraa3 __ev_dotpwxgasmfraa3
Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo, Fractional, Round and
Accumulate 3 operand
d = __ev_dotpwxgasmfraa3 (a,b,c)
temph
0:64
c
0:31
sf
b
32:63
; templ
0:64
c
32:63
sf
b
0:31
temp
0:66
EXTS
67
(temph
0:64
) + EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
a
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter c are multiplied with the exchanged words in
parameter b to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, added
together, and rounded using the current rounding mode in SPEFSCR to discard 16 low order bits.
The high order 51 bits of the 67-bit rounded sum are sign-extended to 64-bits to produce an
intermediate sum in 17.47 fractional format. The intermediate sum is added to the contents of
parameter a, and the result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-266. Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo,
Fractional, Round and Accumulate 3op (__ev_dotpwxgasmfraa3)
0 31 32 63
c
b
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
66
X X
+
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-384 Freescale Semiconductor
SPE2 Operations
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwxgasmfraa3 d,b,c
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-385
__ev_dotpwxgssmf[a] __ev_dotpwxgssmf[a]
Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed, Modulo, Fractional (to
Accumulator)
d = __ev_dotpwxgssmf (a,b) (A = 0)
d = __ev_dotpwxgssmfa (a,b) (A = 1)
temph
0:64
b
0:31
sf
a
32:63
templ
0:64
b
32:63
sf
a
0:31
temp
0:65
EXTS
66
(temph
0:64
) - EXTS
66
(templ
0:64
)
d
0:63
EXTS
64
(temp
0:49
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, the
low product is subtracted from the high product, and the high-order 50 bits of the 66-bit difference
are sign-extended to 64-bits to produce an intermediate difference in 17.47 fractional format, and
the result is placed into parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-267. Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed,
Modulo, Fractional (to Accumulator) (__ev_dotpwxgssmf[a])
0 31 32 63
b
X
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
65
-
SPE2 Programming Interface Manual, Rev. 1.0-2
3-386 Freescale Semiconductor
SPE2 Operations
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgssmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgssmfa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-387
__ev_dotpwxgssmfaa __ev_dotpwxgssmfaa
Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed, Modulo, Fractional and
Accumulate
d = __ev_dotpwxgssmfaa (a,b)
temph
0:64
b
0:31
sf
a
32:63
templ
0:64
b
32:63
sf
a
0:31
temp
0:65
EXTS
66
(temph
0:64
) - EXTS
66
(templ
0:64
)
d
0:63
ACC
0:63
+ EXTS
64
(temp
0:49
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, the
low product is subtracted from the high product, and the high-order 50 bits of the 66-bit difference
are sign-extended to 64-bits to produce an intermediate difference in 17.47 fractional format which
is added to the contents of the accumulator, and the result is placed into parameter d and the
accumulator.
Other registers altered: ACC
Figure 3-268. Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed,
Modulo, Fractional and Accumulate (__ev_dotpwxgssmfaa)
0 31 32 63
b
a
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
65
X X
-
SPE2 Programming Interface Manual, Rev. 1.0-2
3-388 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgssmfaa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-389
__ev_dotpwxgssmfaa3 __ev_dotpwxgssmfaa3
Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed, Modulo, Fractional and
Accumulate 3 operand
d = __ev_dotpwxgssmfaa3 (a,b,c)
temph
0:64
c
0:31
sf
b
32:63
templ
0:64
c
32:63
sf
b
0:31
temp
0:65
EXTS
66
(temph
0:64
) + EXTS
66
(templ
0:64
)
d
0:63
a
0:63
+ EXTS
64
(temp
0:49
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter c are multiplied with the exchanged words in
parameter b to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 66 bits, the
low product is subtracted from the high product, and the high-order 50 bits of the 66-bit difference
are sign-extended to 64-bits to produce an intermediate difference in 17.47 fractional format which
is added to the contents of parameter a, and the result is placed into parameter d and the
accumulator.
Other registers altered: ACC
Figure 3-269. Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed,
Modulo, Fractional and Accumulate 3op (__ev_dotpwxgssmfaa3)
0 31 32 63
c
b
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
65
X X
-
SPE2 Programming Interface Manual, Rev. 1.0-2
3-390 Freescale Semiconductor
SPE2 Operations
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwxgssmfaa3 d,b,c
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-391
__ev_dotpwxgssmfr[a] __ev_dotpwxgssmfr[a]
Vector Dot Product of Words Exchanged, Guarded, Add, Signed, Modulo, Fractional, Round (to
Accumulator)
d = __ev_dotpwxgssmfr (a,b) (A = 0)
d = __ev_dotpwxgssmfra (a,b) (A = 1)
temph
0:64
b
0:31
sf
a
32:63
templ
0:64
b
32:63
sf
a
0:31
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
EXTS
64
(tempr
0:50
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, the
low product is subtracted from the high product, and the difference is rounded using the current
rounding mode in SPEFSCR to discard 16 low order bits. The high order 51 bits of the 67-bit
rounded difference are sign-extended to 64-bits to produce an intermediate difference in 17.47
fractional format, and the result is placed into parameter d. If A = 1, the result in parameter d is
also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-270. Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed,
Modulo, Fractional, Round (to Accumulator) (__ev_dotpwxgssmfr[a])
0 31 32 63
b
X
a
0 63
intermediate sum
15 16 17
.
S.....................S X
S
48 0
exts
d (and Accumulator if A=1)
X
66
-
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-392 Freescale Semiconductor
SPE2 Operations
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgssmfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgssmfra d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-393
__ev_dotpwxgssmfraa __ev_dotpwxgssmfraa
Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed, Modulo, Fractional, Round
and Accumulate
d = __ev_dotpwxgssmfraa (a,b)
temph
0:64
b
0:31
sf
a
32:63
; templ
0:64
b
32:63
sf
a
0:31
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
ACC
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter b are multiplied with the exchanged words in
parameter a to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, the
low product is subtracted from the high product, and the difference is rounded using the current
rounding mode in SPEFSCR to discard 16 low order bits. The high order 51 bits of the 67-bit
rounded difference are sign-extended to 64-bits to produce an intermediate difference in 17.47
fractional format. The intermediate difference is added to the contents of the accumulator, and the
result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-271. Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed,
Modulo, Fractional, Round and Accumulate (__ev_dotpwxgssmfraa)
0 31 32 63
b
a
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
Accumulator
+
d and Accumulator
66
X X
-
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-394 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evdotpwxgssmfraa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-395
__ev_dotpwxgssmfraa3 __ev_dotpwxgssmfraa3
Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed, Modulo, Fractional, Round
and Accumulate 3 operand
d = __ev_dotpwxgssmfraa3 (a,b,c)
temph
0:64
c
0:31
sf
b
32:63
; templ
0:64
c
32:63
sf
b
0:31
temp
0:66
EXTS
67
(temph
0:64
) - EXTS
67
(templ
0:64
)
tempr
0:66
ROUND(temp
0:66
,16)
d
0:63
a
0:63
+ EXTS
64
(tempr
0:50
)
ACC
0:63
d
0:63
The signed fractional word elements in parameter c are multiplied with the exchanged words in
parameter b to produce a pair of 65-bit intermediate fractional products. If both inputs are -1.0, the
intermediate product is represented as +1.0 (
1
0 || 0x8000_0000_0000_0000), otherwise the two
high-order product bits are the same. The intermediate products are sign-extended to 67 bits, the
low product is subtracted from the high product, and the difference is rounded using the current
rounding mode in SPEFSCR to discard 16 low order bits. The high order 51 bits of the 67-bit
rounded difference are sign-extended to 64-bits to produce an intermediate difference in 17.47
fractional format. The intermediate difference is added to the contents of parameter a, and the
result is placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-272. Vector Dot Product of Words Exchanged, Guarded, Subtract, Signed,
Modulo, Fractional, Round and Accumulate 3op (__ev_dotpwxgssmfraa3)
0 31 32 63
c
b
0 63
intermediate difference
15 16 17
.
S.....................S X
S
48 0
exts
a
+
d and Accumulator
66
X X
-
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-396 Freescale Semiconductor
SPE2 Operations
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evdotpwxgssmfraa3 d,b,c
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-397
__ev_eqv __ev_eqv
Vector Equivalent
d = __ev_eqv (a,b)
d
0:31
a
0:31
b
0:31
// Bitwise XNOR
d
32:63
a
32:63
b
32:63
// Bitwise XNOR
The corresponding elements of parameters a and b are XNORed bitwise, and the results are placed
in the parameter d.
Figure 3-273. Vector Equivalent (__ev_eqv)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ eveqv d,a,b
0 31 32 63
a
b
XNOR XNOR
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-398 Freescale Semiconductor
SPE2 Operations
__ev_extsb __ev_extsb
Vector Extend Sign Byte
d = __ev_extsb (a)
d
0:31
EXTS(a
24:31
)
d
32:63
EXTS(a
56:63
)
The signs of the byte in each of the elements in parameter a are extended, and the results are placed
in the parameter d.
Figure 3-274. Vector Extend Sign Byte (__ev_extsb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evextsb d,a
0 31 32 63
a
d
s
23 24 55 56
s
s s ssss_ssss_ssss_ssss_ssss_ssss ssss_ssss_ssss_ssss_ssss_ssss
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-399
__ev_extsbh __ev_extsbh
Vector Extend Sign of Bytes (Odd) to Half Words
d = __ev_extsbh (a)
d
0:15
EXTS(a
8:15
)
d
16:31
EXTS(a
24:31
)
d
32:47
EXTS(a
40:47
)
d
48:63
EXTS(a
56:63
)
Each of the odd byte elements of parameter a are sign-extended to half words and placed into
corresponding half word elements of parameter d.
Figure 3-275. Vector Extend Sign Bytes (Odd) to Half Words (__ev_extsbh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evextsbh d,a
d
0 31 32 63
a
15 16 47 48
0 31 32 63 15 16 47 48
sssssssss sssssssss sssssssss sssssssss
s s s s
39 40 55 56 23 24 7 8
SPE2 Programming Interface Manual, Rev. 1.0-2
3-400 Freescale Semiconductor
SPE2 Operations
__ev_extsh __ev_extsh
Vector Extend Sign Half Word
d = __ev_extsh (a)
d
0:31
EXTS(a
16:31
)
d
32:63
EXTS(a
48:63
)
The odd half word elements in parameter a are sign-extended, and the results are placed in
parameter d.
Figure 3-276. Vector Extend Sign Half Word (__ev_extsh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evextsh d,a
0 31 32 63
a
d
s s
s s ssss_ssss_ssss_ssss ssss_ssss_ssss_ssss
15 16 17 47 48
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-401
__ev_extsw __ev_extsw
Vector Extend Sign Word
d = __ev_extsw (a)
d
0:63
EXTS(a
32:63
)
The low word element in parameter a is sign-extended, and the result is placed in parameter d.
Figure 3-277. Vector Extend Sign Word (__ev_extsw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evextsw d,a
0 31 32 63
a
d
s
ssss_ssss_ssss_ssss_ssss_ssss_ssss_ssss
47 48
s
SPE2 Programming Interface Manual, Rev. 1.0-2
3-402 Freescale Semiconductor
SPE2 Operations
__ev_extzb __ev_extzb
Vector Extend Zero Byte
d = __ev_extzb (a)
d
0:31
EXTZ(a
24:31
)
d
32:63
EXTZ(a
56:63
)
The low-order byte in each of the elements in parameter a is zero-extended, and the results are
placed in parameter d.
Figure 3-278. Vector Extend Zero Byte (__ev_extzb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evextzb d,a
0 31 32 63
a
d
23 24 55 56
0000_0000_0000_0000_0000_0000 0000_0000_0000_0000_0000_0000
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-403
__ev_insb __ev_insb
Vector Insert Byte
d = __ev_insb (a,b,c,s)
temp
0:7
b
0+(c*8):7+(c*8)
if (s=0) then d
0:7
temp
0:7
else d
0:7
a
0:7
if (s=1) then d
8:15
temp
0:7
else d
8:15
a
8:15
if (s=2) then d
16:23
temp
0:7
else d
16:23
a
16:23
if (s=3) then d
24:31
temp
0:7
else d
24:31
a
24:31
if (s=4) then d
32:39
temp
0:7
else d
32:39
a
32:39
if (s=5) then d
40:47
temp
0:7
else d
40:47
a
40:47
if (s=6) then d
48:55
temp
0:7
else d
48:55
a
48:55
if (s=7) then d
56:63
temp
0:7
else d
56:63
a
56:63
The byte element of parameter b specified by parameter c is placed into the byte element of
parameter d specified by parameter s. The remaining bytes of parameter d are copied from
parameter a. Byte 0 is the most-significant byte.
Figure 3-279. Vector Insert Byte (__ev_insb)
d a b c s Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ 3-bit unsigned literal 3-bit unsigned literal d a
evinsb d,b,s,c
0 32 63
a
24 16 8 40 48 56
c
0 1 2 3
0 32 63
b
24 16 8 40 48 56
4 5 6 7
s
=
7
0 32 63
d
24 16 8 40 48 56
s
=
6
s
=
5
s
=
4
s
=
3
s
=
2
s
=
1
s
=
0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-404 Freescale Semiconductor
SPE2 Operations
__ev_insh __ev_insh
Vector Insert Half Word
d = __ev_insh (a,b,c,s)
temp
0:15
b
0+(c*16):15+(c*16)
if (s=0) then d
0:15
temp
0:15
else d
0:15
a
0:15
if (s=1) then d
16:31
temp
0:15
else d
16:31
a
16:31
if (s=2) then d
32:47
temp
0:15
else d
32:47
a
32:47
if (s=3) then d
48:63
temp
0:15
else d
48:63
a
48:63
The half word element of parameter b specified by parameter c is placed into the half word element
of parameter d specified by parameter s. The remaining half words of parameter d are copied from
parameter a. Half word 0 is the most-significant half word.
Figure 3-280. Vector Insert Half Word (__ev_insh)
d a b c s Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ 2-bit unsigned literal 2-bit unsigned literal d a
evinsh d,b,s,c
15 16 47 48 0 31 32 63
a
15 16 47 48 0 31 32 63
d
15 16 47 48 0 31 32 63
b
c
0 1 2 3
0
s=3 s=2 s=1 s=0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-405
__ev_ilveh __ev_ilveh
Vector Interleave Even Half Words
d = __ev_ilveh (a,b)
d
0:63
a
0:15
|| b
0:15
|| a
32:47
|| b
32:47
The even half word elements in parameter a are interleaved with the even half word elements in
parameter b and placed into parameter d.
Figure 3-281. Vector Interleave Even Half Words (__ev_ilveh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilveh d,a,b
15 16 47 48 0 31 32 63
a
d
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-406 Freescale Semiconductor
SPE2 Operations
__ev_ilveoh __ev_ilveoh
Vector Interleave Even/Odd Half Words
d = __ev_ilveoh (a,b)
d
0:63
a
0:15
|| b
0:15
|| a
32:47
|| b
32:47
The even half word elements in parameter a are interleaved with the odd half word elements in
parameter b and placed into parameter d.
Figure 3-282. Vector Interleave Even/Odd Half Words (__ev_ilveoh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilveoh d,a,b
15 16 47 48 0 31 32 63
a
d
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-407
__ev_ilvhih __ev_ilvhih
Vector Interleave High Half Words
d = __ev_ilvhih (a,b)
d
0:63
a
0:15
|| b
0:15
|| a
16:31
|| b
16:31
The most significant two half word elements in parameter a are interleaved with the most
significant two half word elements in parameter b and placed into parameter d.
Figure 3-283. Vector Interleave High Half Words (__ev_ilvhih)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilvhih d,a,b
15 16 47 48 0 31 32 63
a
d
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-408 Freescale Semiconductor
SPE2 Operations
__ev_ilvhiloh __ev_ilvhiloh
Vector Interleave High/Low Half Words
d = __ev_ilvhiloh (a,b)
d
0:63
a
0:15
|| b
32:47
|| a
16:31
|| b
48:63
The most significant two half word elements in parameter a are interleaved with the least
significant two half word elements in parameter b and placed into parameter d.
Figure 3-284. Vector Interleave High/Low Half Words (__ev_ilvhiloh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilvhiloh d,a,b
15 16 47 48 0 31 32 63
a
d
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-409
__ev_ilvloh __ev_ilvloh
Vector Interleave Low Half Words
d = __ev_ilvloh (a,b)
d
0:63
a
32:47
|| b
32:47
|| a
48:63
|| b
48:63
The least significant two half word elements in parameter a are interleaved with the least
significant two half word elements in parameter b and placed into parameter d.
Figure 3-285. Vector Interleave Low Half Words (__ev_ilvloh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilvloh d,a,b
15 16 47 48 0 31 32 63
a
d
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-410 Freescale Semiconductor
SPE2 Operations
__ev_ilvlohih __ev_ilvlohih
Vector Interleave Low/High Half Words
d = __ev_ilvlohih (a,b)
d
0:63
a
32:47
|| b
0:15
|| a
48:63
|| b
16:31
The least significant two half word elements in parameter a are interleaved with the most
significant two half word elements in parameter b and placed into parameter d.
Figure 3-286. Vector Interleave Low/High Half Words (__ev_ilvlohih)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilvlohih d,a,b
15 16 47 48 0 31 32 63
a
d
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-411
__ev_ilvoeh __ev_ilvoeh
Vector Interleave Odd/Even Half Words
d = __ev_ilvoeh (a,b)
d
0:63
a
16:31
|| b
0:15
|| a
48:63
|| b
32:47
The odd half word elements in parameter b are interleaved with the even half word elements in
parameter b and placed into parameter d.
Figure 3-287. Vector Interleave Odd/Even Half Words (__ev_ilvoeh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilvoeh d,a,b
15 16 47 48 0 31 32 63
a
d
b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-412 Freescale Semiconductor
SPE2 Operations
__ev_ilvoh __ev_ilvoh
Vector Interleave Odd Half Words
d = __ev_ilvoh (a,b)
d
0:63
a
16:31
|| b
16:31
|| a
48:63
|| b
48:63
The odd half word elements in parameter a are interleaved with the odd half word elements in
parameter b and placed into parameter d.
Figure 3-288. Vector Interleave Odd Half Words (__ev_ilvoh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evilvoh d,a,b
15 16 47 48 0 31 32 63
a
d
b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-413
__ev_lbbsplatb[u] __ev_lbbsplatb[u]
Vector Load Byte into Byte and Splat Bytes [with Update]
d = __ev_lbbsplatb (a,b) (U = 0)
d = __ev_lbbsplatbu (a,b) (U = 1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b)
d
0:7
MEM(EA,1)
d
8:15
MEM(EA,1)
d
16:23
MEM(EA,1)
d
23:31
MEM(EA,1)
d
32:39
MEM(EA,1)
d
40:47
MEM(EA,1)
d
48:55
MEM(EA,1)
d
56:63
MEM(EA,1)
if (U=1) then a EA
The byte addressed by the EA is loaded from memory and placed into each byte element of
parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-289 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-289. __ev_lbbsplatb[u] Results in Big- and Little-Endian Modes
NOTE
If machine register r0 is used to pass parameter a, then a zero value is
used as the base value when calculating EA. For the with update
form (U=1), parameter a is both an input and output operand. Also, for
the with update form (U=1), parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint8_t * 5-bit unsigned literal evlbbsplatb d,b(a)
U = 1 __ev64_opaque__ uint8_t *& 5-bit unsigned literal evlbbsplatbu d,b(a)
a a
0
a
a a a a a a
Memory
GPR in big or little
Byte address
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-414 Freescale Semiconductor
SPE2 Operations
__ev_lbbsplatb[m]x __ev_lbbsplatb[m]x
Vector Load Byte into Byte and Splat Bytes [with Modify] Indexed
d = __ev_lbbsplatbx (a,b) (M = 0)
d = __ev_lbbsplatbmx (a,b) (M = 1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d
0:7
MEM(EA,1)
d
8:15
MEM(EA,1)
d
16:23
MEM(EA,1)
d
23:31
MEM(EA,1)
d
32:39
MEM(EA,1)
d
40:47
MEM(EA,1)
d
48:55
MEM(EA,1)
d
56:63
MEM(EA,1)
if (M=1) then a
32:63
calc_a_update(a,b)
The byte addressed by EA is loaded from memory and placed into each byte element of parameter
d.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-290 shows how bytes are loaded into d as determined by the endian mode.
Figure 3-290. __ev_lbbsplatb[m]x Results in Big- and Little-Endian Modes
NOTE
If machine register r0 is used to pass parameter a, then a zero value is
used as the base value when calculating EA. Also, for the with
modify form (M=1), parameter a is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ uint8_t * int32_t evlbbsplatbx d,a,b
M = 1 __ev64_opaque__ uint8_t *& int32_t evlbbsplatbmx d,a,b
a a
0
a
a a a a a a
Memory
GPR in big or little
Byte address
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-415
__ev_ldb[u] __ev_ldb[u]
Vector Load Double into Eight Bytes [with Update]
d = __ev_ldb (a,b) (U = 0)
d = __ev_ldbu (a,b) (U = 1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*8)
d
0:7
MEM(EA, 1)
d
8:15
MEM(EA+1,1)
d
16:23
MEM(EA+2,1)
d
24:31
MEM(EA+3,1)
d
32:39
MEM(EA+4, 1)
d
40:47
MEM(EA+5,1)
d
48:55
MEM(EA+6,1)
d
56:63
MEM(EA+7,1)
if (U=1) then a EA
The double word addressed by EA is loaded from memory and placed in parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-291 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-291. __ev_ldb[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evldb d,b(a)
U = 1 __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evldbu d,b(a)
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
c d e f h a b g
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-416 Freescale Semiconductor
SPE2 Operations
__ev_ldb[m]x __ev_ldb[m]x
Vector Load Double into Eight Bytes [with Modify] Indexed
d = __ev_ldbx (a,b) (M = 0)
d = __ev_ldbmx (a,b) (M = 1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d
0:7
MEM(EA,1)
d
8:15
MEM(EA+1,1)
*
d
16:23
MEM(EA+2,1)
*
d
24:31
MEM(EA+3,1)
*
d
32:39
MEM(EA+4,1)
*
d
40:47
MEM(EA+5,1)
*
d
48:55
MEM(EA+6,1)
*
d
56:63
MEM(EA+7,1)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The doubleword addressed by EA is loaded from memory and placed into parameter d.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-292 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-292. __ev_ldb[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ __ev64_opaque__ * int32_t evldbx d,a,b
M = 1 __ev64_opaque__ __ev64_opaque__ *& int32_t evldbmx d,a,b
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
c d e f h a b g
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-417
__ev_ldd[u] __ev_ldd[u]
Vector Load Double Word into Double Word [with Update]
d = __ev_ldd (a,b) (U = 0)
d = __ev_lddu (a,b) (U = 1)
if (a = r0) then temp 0
else temp (a)
UIMM b
EA temp + EXTZ(UIMM*8)
d MEM(EA, 8)
if (U=1) then a EA
The double word addressed by the EA is loaded from memory and placed in parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-293 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-293. __ev_ldd[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evldd d,b(a)
U = 1 __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evlddu d,b(a)
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
f e d c a h g b
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-418 Freescale Semiconductor
SPE2 Operations
__ev_ldd[m]x __ev_ldd[m]x
Vector Load Double Word into Double Word [with Modify] Indexed
d = __ev_lddx (a,b) (M = 0)
d = __ev_lddmx (a,b) (M = 1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d MEM(EA,8)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The doubleword addressed by EA is loaded from memory and placed into parameter d.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-294 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-294. __ev_ldd[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ __ev64_opaque__* int32_t evlddx d,a,b
M = 1 __ev64_opaque__ __ev64_opaque__*& int32_t evlddmx d,a,b
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
f e d c a h g b
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-419
__ev_ldh[u] __ev_ldh[u]
Vector Load Double into Four Half Words [with Update]
d = __ev_ldh (a,b) (U=0)
d = __ev_ldhu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
UIMM b
EA temp + EXTZ(UIMM*8)
d
0:15
MEM(EA, 2)
d
16:31
MEM(EA+2,2)
d
32:47
MEM(EA+4,2)
d
48:63
MEM(EA+6,4)
if (U=1) then a EA
The double word addressed by EA is loaded from memory and placed in parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-295 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-295. __ev_ldh[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evldh d,b(a)
U = 1 __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evldhu d,b(a)
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
d c f e g b a h
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-420 Freescale Semiconductor
SPE2 Operations
__ev_ldh[m]x __ev_ldh[m]x
Vector Load Double into Four Halfwords [with Modify] Indexed
d = __ev_ldhx (a,b) (M=0)
d = __ev_ldhmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (a)
d
0:15
MEM(EA,2)
*
d
16:31
MEM(EA+2,2)
*
d
32:47
MEM(EA+4,2)
*
d
48:63
MEM(EA+6,2)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The doubleword addressed by EA is loaded from memory and placed into parameter d.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-296 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-296. __ev_ldhx Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ __ev64_opaque__ * int32_t evldhx d,a,b
M = 1 __ev64_opaque__ __ev64_opaque__ *& int32_t evldhmx d,a,b
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
d c f e g b a h
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-421
__ev_ldw[u] __ev_ldw[u]
Vector Load Double into Two Words [with Update]
d = __ev_ldw (a,b) (U=0)
d = __ev_ldwu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
UIMM b
EA temp + EXTZ(UIMM*8)
d
0:31
MEM(EA, 4)
d
32:63
MEM(EA+4, 4)
if (U=1) then a EA
The double word addressed by EA is loaded from memory and placed in parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-297 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-297. __ev_ldw[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evldw d,b(a)
U = 1 __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evldwu d,b(a)
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
b a h g e d c f
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-422 Freescale Semiconductor
SPE2 Operations
__ev_ldw[m]x __ev_ldw[m]x
Vector Load Double into Two Words [with Modify] Indexed
d = __ev_ldwx (a,b) (M=0)
d = __ev_ldwmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d
0:31
MEM(EA,4)
*
d
32:63
MEM(EA+4,4)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The doubleword addressed by EA is loaded from memory and placed into parameter d.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-298 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-298. __ev_ldwx Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ __ev64_opaque__ * int32_t evldwx d,a,b
M = 1 __ev64_opaque__ __ev64_opaque__ *& int32_t evldwmx d,a,b
c d e f h
0 1 2 3 4 5 6 7
a b g
c d e f h a b g
b a h g e d c f
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-423
__ev_lhhesplat[u] __ev_lhhesplat[u]
Vector Load Half Word into Half Words Even and Splat [with Update]
d = __ev_lhhesplat (a,b) (U=0)
d = __ev_lhhesplatu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
UIMM b
EA temp + EXTZ(UIMM*2)
d
0:15
MEM(EA,2)
d
16:31
0x0000
d
32:47
MEM(EA,2)
d
48:63
0x0000
if (U=1) then a EA
The half word addressed by EA is loaded from memory and placed in the even half words of each
element of parameter d. The odd half words are zeroed.
If U=1 (with update), EA is placed into parameter a.
Figure 3-299 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-299. __ev_lhhesplat[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint16_t * 5-bit unsigned literal evlhhesplat d,b(a)
U = 1 __ev64_opaque__ uint16_t *& 5-bit unsigned literal evlhhesplatu d,b(a)
0
a b
Z Z a b Z a b Z
Z Z b a Z b a Z
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
SPE2 Programming Interface Manual, Rev. 1.0-2
3-424 Freescale Semiconductor
SPE2 Operations
__ev_lhhesplat[m]x __ev_lhhesplat[m]x
Vector Load Halfword into Halfwords Even and Splat [with Modify] Indexed
d = __ev_lhhesplatx (a,b) (M=0)
d = __ev_lhhesplatmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d
0:15
MEM(EA,2)
*
d
16:31
0x0000
d
32:47
MEM(EA,2)
*
d
48:63
0x0000
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The halfword addressed by EA is loaded from memory and placed in the even halfwords of each
word element of parameter d. The odd halfwords are zeroed.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-300 shows how bytes are loaded into d as determined by the endian mode.
Figure 3-300. __ev_lhhesplat[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ uint16_t * int32_t evlhhesplatx d,a,b
M = 1 __ev64_opaque__ uint16_t *& int32_t evlhhesplatmx d,a,b
0 1
a b
Z Z a b Z a b Z
Z Z b a Z b a Z
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-425
__ev_lhhossplat[u] __ev_lhhossplat[u]
Vector Load Half Word into Half Word Odd Signed and Splat [with Update]
d = __ev_lhhossplat (a,b) (U=0)
d = __ev_lhhossplatu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
UIMM b
EA temp + EXTZ(UIMM*2)
d
0:31
EXTS(MEM(EA,2))
d
32:63
EXTS(MEM(EA,2))
if (U=1) then a EA
The half word addressed by EA is loaded from memory and placed in the odd half words sign
extended in each element of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-301 shows how bytes are loaded into parameter d as determined by the endian mode.
In big-endian mode, the msb of parameter a is sign-extended.
In little-endian mode, the msb of parameter b is sign-extended.
Figure 3-301. __ev_lhhossplat[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint16_t * 5-bit unsigned literal evlhhossplat d,b(a)
U = 1 __ev64_opaque__ uint16_t *& 5-bit unsigned literal evlhhossplatu d,b(a)
0 1
a b
a b S S b S S a
b a S S a S S b
Memory
GPR in big endian
GPR in little endian
Byte address
S = sign
S = sign
SPE2 Programming Interface Manual, Rev. 1.0-2
3-426 Freescale Semiconductor
SPE2 Operations
__ev_lhhossplat[m]x __ev_lhhossplat[m]x
Vector Load Halfword into Halfword Odd Signed and Splat [with Modify] Indexed
d = __ev_lhhossplatx (a,b) (M=0)
d = __ev_lhhossplatmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d
0:31
EXTS
32
(MEM(EA,2)
*
)
d
32:63
EXTS
32
(MEM(EA,2)
*
)
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The halfword addressed by EA is loaded from memory and placed in the odd halfwords,
sign-extended into each word element of parameter d.
If M=1 (with modify), parameter a
32:63
is updated with an address value determined by the mode
specifier in parameter a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-302 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-302. __ev_lhhossplat[m]x Results in Big- and Little-Endian Modes
In big-endian memory, the most significant bit of a is sign extended. In little-endian memory, the
most significant bit of parameter b is sign extended.
Implementation dependent note: If the EA is not halfword aligned, an alignment exception may
occur.
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ uint16_t * int32_t evlhhossplatx d,a,b
M = 1 __ev64_opaque__ uint16_t *& int32_t evlhhossplatmx d,a,b
0 1
a b
a b S S b S S a
b a S S a S S b
Memory
GPR in big endian
GPR in little endian
Byte address
S = sign
S = sign
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-427
__ev_lhhousplat[u] __ev_lhhousplat[u]
Vector Load Half Word into Half Word Odd Unsigned and Splat [with Update]
d = __ev_lhhousplat (a,b) (U=0)
d = __ev_lhhousplatu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
UIMM b
EA temp + EXTZ(UIMM*2)
d
0:15
0x0000
d
16:31
MEM(EA,2)
d
32:47
0x0000
d
48:63
MEM(EA,2)
if (U=1) then a EA
The half word addressed by EA is loaded from memory and placed in the odd half words zero
extended in each element of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-303 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-303. __ev_lhhousplat[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint16_t * 5-bit unsigned literal evlhhousplat d,b(a)
U = 1 __ev64_opaque__ uint16_t *& 5-bit unsigned literal evlhhousplatu d,b(a)
0 1
a b
a b Z Z b Z Z a
b a Z Z a Z Z b
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
SPE2 Programming Interface Manual, Rev. 1.0-2
3-428 Freescale Semiconductor
SPE2 Operations
__ev_lhhousplat[m]x __ev_lhhousplat[m]x
Vector Load Halfword into Halfword Odd Unsigned and Splat [with Modify] Indexed
d = __ev_lhhousplatx (a,b) (M=0)
d = __ev_lhhousplatmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d
0:15
0x0000
d
16:31
MEM(EA,2)
*
d
32:47
0x0000
d
48:63
MEM(EA,2)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The half-word addressed by EA is loaded from memory and placed in the odd half words zero
extended into each word element of parameter d.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-304 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-304. __ev_lhhousplatx Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ uint16_t * int32_t evlhhousplatx d,a,b
M = 1 __ev64_opaque__ uint16_t *& int32_t evlhhousplatmx d,a,b
0 1
a b
a b Z Z b Z Z a
b a Z Z a Z Z b
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-429
__ev_lhhsplath[u] __ev_lhhsplath[u]
Vector Load Half Word into Half Word and Splat Half Words [with Update]
d = __ev_lhhsplath (a,b) (U=0)
d = __ev_lhhsplathu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA b + EXTZ(b*2)
d
0:15
MEM(EA,2)
d
16:31
MEM(EA,2)
d
32:47
MEM(EA,2)
d
48:63
MEM(EA,2)
if (U=1) then a EA
The half word addressed by EA is loaded from memory and placed into each half word element of
parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-305 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-305. __ev_lhhsplath[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter a is both
an input and output operand. Also, for the with update form (U=1),
parameter b cannot be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint16_t * 5-bit unsigned literal evlhhsplath d,b(a)
U = 1 __ev64_opaque__ uint16_t *& 5-bit unsigned literal evlhhsplathu d,b(a)
a a
0 1
a b
a b b b b a
b a b a a b a b
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-430 Freescale Semiconductor
SPE2 Operations
__ev_lhhsplath[m]x __ev_lhhsplath[m]x
Vector Load Halfword into Halfword and Splat Halfwords [with Modify] Indexed
d = __ev_lhhsplathx (a,b) (M=0)
d = __ev_lhhsplathmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0 0
else temp0 (a)
EA temp0 + (b)
d
0:15
MEM(EA,2)
*
d
16:31
MEM(EA,2)
*
d
32:47
MEM(EA,2)
*
d
48:63
MEM(EA,2)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The halfword addressed by EA is loaded from memory and placed into each halfword element of
parameter d.
If M=1 (with modify), a
32:63
is updated with an address value determined by the mode specifier
in a
0:3
. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-306 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-306. __ev_lhhsplathx Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half-word aligned. Also, if machine register r0 is used to pass
parameter a, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter a
is both an input and output operand.
M d a b Maps to
M = 0 __ev64_opaque__ uint16_t * int32_t evlhhsplathx d,a,b
M = 1 __ev64_opaque__ uint16_t *& int32_t evlhhsplathmx d,a,b
a a
0 1
a b
a b b b b a
b a b a a b a b
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-431
__ev_lower_eq __ev_lower_eq
Vector Lower Bits Equal
d = __ev_lower_eq (a,b)
if (a
32:63
= b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are equal to the lower 32 bits of
parameter b.
Figure 3-307. Vector Lower Equal (__ev_lower_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpeq x,a,b
0 31 32 63
a
b
=
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-432 Freescale Semiconductor
SPE2 Operations
__ev_lower_gts __ev_lower_gts
Vector Lower Bits Greater Than Signed
d = __ev_lower_gts (a,b)
if (a
32:63
>
signed
b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b.
Figure 3-308. Vector Lower Greater Than Signed (__ev_lower_gts)
d a b Maps to
Bool __ev64_opaque__ __ev64_opaque__ evcmpgts x,a,b
0 31 32 63
a
b
>
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-433
__ev_lower_gtu __ev_lower_gtu
Vector Lower Bits Greater Than Unsigned
d = __ev_lower_gtu (a,b)
if (a
32:63
>
unsigned
b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b.
Figure 3-309. Vector Lower Greater Than Unsigned (__ev_lower_gtu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpgtu x,a,b
0 31 32 63
a
b
>
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-434 Freescale Semiconductor
SPE2 Operations
__ev_lower_lts __ev_lower_lts
Vector Lower Bits Less Than Signed
d = __ev_lower_lts (a,b)
if (a
32:63
<
signed
b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are less than the lower 32 bits of
parameter b.
Figure 3-310. Vector Lower Less Than Signed (__ev_lower_lts)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmplts x,a,b
0 31 32 63
a
b
<
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-435
__ev_lower_ltu __ev_lower_ltu
Vector Lower Bits Less Than Unsigned
d = __ev_lower_ltu (a,b)
if (a
32:63
<
unsigned
b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are less than the lower 32 bits of
parameter b.
Figure 3-311. Vector Lower Less Than Unsigned (__ev_lower_ltu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpltu x,a,b
0 31 32 63
a
b
<
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-436 Freescale Semiconductor
SPE2 Operations
__ev_lvsl __ev_lvsl
Load Vector for Shift Left
d = __ev_lvsl (a,b)
if (a = 0) then temp
32
0
else temp (a
32:63
)
EA
0:31
temp + (b
32:63
)
sh EA
29:31
if (sh=0) then d
0:63
0x0001020304050607
if (sh=1) then d
0:63
0x0102030405060708
if (sh=2) then d
0:63
0x0203040506070809
if (sh=3) then d
0:63
0x030405060708090A
if (sh=4) then d
0:63
0x0405060708090A0B
if (sh=5) then d
0:63
0x05060708090A0B0C
if (sh=6) then d
0:63
0x060708090A0B0C0D
if (sh=7) then d
0:63
0x0708090A0B0C0D0E
The contents of parameters a and b are used to calculate an effective address. A control vector is
calculated based on the byte offset of the EA and placed into parameter d. The control vector may
be used by an __ev_perm2 instruction to do a simulated alignment of a misaligned big-endian
double word, or may be used for other computations such as a double length shift or rotate in
conjunction with __ev_lvsr.
Figure 3-312. Load Vector for Shift Left (__ev_lvsl)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evlvsl d,a,b
63
d 56
0 8 16 24 32 40 48 56
63
a
32
63
b
32
+
=
00010008
00000005
EA
0001000D Table Lookup
0B 0C 0A 09 08 07 06 05
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-437
__ev_lvsr __ev_lvsr
Load Vector for Shift Right
d = __ev_lvsr (a,b)
if (a = 0) then temp
32
0
else temp (a
32:63
)
EA
0:31
temp + (b
32:63
)
sh EA
29:31
if (sh=0) then d
0:63
0x08090A0B0C0D0E0F
if (sh=1) then d
0:63
0x0708090A0B0C0D0E
if (sh=2) then d
0:63
0x060708090A0B0C0D
if (sh=3) then d
0:63
0x05060708090A0B0C
if (sh=4) then d
0:63
0x0405060708090A0B
if (sh=5) then d
0:63
0x030405060708090A
if (sh=6) then d
0:63
0x0203040506070809
if (sh=7) then d
0:63
0x0102030405060708
The contents of parameters a and b are used to calculate an effective address. A control vector is
calculated based on the byte offset of the EA and placed into parameter d. The control vector may
be used by an __ev_perm2 instruction to do a simulated alignment of a misaligned little-endian
double word, or may be used for other computations such as a double length shift or rotate in
conjunction with __ev_lvsl.
Figure 3-313. Load Vector for Shift Right (__ev_lvsr)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evlvsr d,a,b
63
d 56
0 8 16 24 32 40 48 56
63
a
32
63
b
32
+
=
00010008
00000005
EA
0001000D Table Lookup
09 0A 08 07 06 05 04 03
SPE2 Programming Interface Manual, Rev. 1.0-2
3-438 Freescale Semiconductor
SPE2 Operations
__ev_lwbe[u] __ev_lwbe[u]
Vector Load Word into Four Bytes Even [with Update]
d = __ev_lwbe (a,b) (U=0)
d = __ev_lwbeu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:7
MEM(EA, 1)
d
8:15
0x0000
d
16:23
MEM(EA+1,1)
d
24:31
0x0000
d
32:39
MEM(EA+2, 1)
d
40:47
0x0000
d
48:55
MEM(EA+3,1)
d
56:63
0x0000
if (U=1) then a EA
The word addressed by EA is loaded from memory and placed into the even bytes of parameter d.
The odd bytes are zeroed.
If U=1 (with update), EA is placed into parameter a.
Figure 3-314 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-314. __ev_lwbe[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwbe d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwbeu d,b(a)
c d
0 1 2 3
a b
b Z c Z Z a Z d
Memory
GPR in big or little
Byte address
Z = zero
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-439
__ev_lwbe[m]x __ev_lwbe[m]x
Vector Load Word into Four Bytes Even [with Modify] Indexed
d = __ev_lwbex (a,b) (M=0)
d = __ev_lwbemx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp (a)
EA temp + (b)
d
0:7
MEM(EA,1)
d
8:15
0x0000
d
16:23
MEM(EA+1,1)
*
d
24:31
0x0000
d
32:39
MEM(EA+2,1)
*
d
40:47
0x0000
d
48:55
MEM(EA+3,1)
*
d
56:63
0x0000
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in the even bytes in parameter d.
The odd bytes are zeroed.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-315 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-315. __ev_lwbe[u]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
c d
0 1 2 3
a b
b Z c Z Z a Z d
Memory
GPR in big or little
Byte address
Z = zero
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-440 Freescale Semiconductor
SPE2 Operations
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwbex d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwbemx d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-441
__ev_lwbos[u] __ev_lwbos[u]
Vector Load Word into Four Bytes Odd Signed [with Update]
d = __ev_lwbos (a,b) (U=0)
d = __ev_lwbosu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:15
EXTS(MEM(EA,1))
d
16:31
EXTS(MEM(EA+1,1))
d
32:47
EXTS(MEM(EA+2,1))
d
48:63
EXTS(MEM(EA+3,1))
if (U=1) then a EA
The word addressed by EA is loaded from memory and placed in the odd bytes, sign-extended into
each half word element of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-316 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-316. __ev_lwbos[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwbos d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwbosu d,b(a)
c d
0 1 2 3
a b
S b S c d S a S
Memory
GPR in big or little
Byte address
Z = zero
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-442 Freescale Semiconductor
SPE2 Operations
__ev_lwbos[m]x __ev_lwbos[m]x
Vector Load Word into Four Bytes Odd Signed [with Modify] Indexed
d = __ev_lwbosx (a,b) (M=0)
d = __ev_lwbosmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp (a)
EA temp + (b)
d
0:15
EXTS
16
(MEM(EA,1))
d
16:31
EXTS
16
(MEM(EA+1,1)
*
)
d
32:47
EXTS
16
(MEM(EA+2,1)
*
)
d
48:63
EXTS
16
(MEM(EA+3,1)
*
)
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in the odd bytes, sign-extended into
each half word element of parameter d.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-317 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-317. __ev_lwbos[u]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with update form (U=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwbosx d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwbosmx d,a,b
c d
0 1 2 3
a b
S b S c d S a S
Memory
GPR in big or little
Byte address
S = sign
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-443
__ev_lwbou[u] __ev_lwbou[u]
Vector Load Word into Four Bytes Odd Unsigned [with Update]
d = __ev_lwbou (a,b) (U=0)
d = __ev_lwbouu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:15
EXTZ(MEM(EA,1))
d
16:31
EXTZ(MEM(EA+1,1))
d
32:47
EXTZ(MEM(EA+2,1))
d
48:63
EXTZ(MEM(EA+3,1))
if (U=1) then a EA
The word addressed by EA is loaded from memory and placed in the odd bytes, zero-extended into
each half word element of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-318 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-318. __ev_lwbou[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwbou d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwbouu d,b(a)
c d
0 1 2 3
a b
Z b Z c d Z a Z
Memory
GPR in big or little
Byte address
Z = zero
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-444 Freescale Semiconductor
SPE2 Operations
__ev_lwbou[m]x __ev_lwbou[m]x
Vector Load Word into Four Bytes Odd Unsigned [with Modify] Indexed
d = __ev_lwboux (a,b) (M=0)
d = __ev_lwboumx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp (a)
EA temp + (b)
d
0:15
EXTZ
16
(MEM(EA,1))
d
16:31
EXTZ
16
(MEM(EA+1,1)
*
)
d
32:47
EXTZ
16
(MEM(EA+2,1)
*
)
d
48:63
EXTZ
16
(MEM(EA+3,1)
*
)
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in the odd bytes, zero-extended into
each half word element of parameter d.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-319 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-319. __ev_lwbou[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwboux d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwboumx d,a,b
c d
0 1 2 3
a b
Z b Z c d Z a Z
Memory
GPR in big or little
Byte address
Z = zero
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-445
__ev_lwbsplatw[u] __ev_lwbsplatw[u]
Vector Load Word as Bytes and Splat Words [with Update]
d = __ev_lwbsplatw (a,b) (U=0)
d = __ev_lwbsplatwu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:7
MEM(EA,1)
d
8:15
MEM(EA+1,1)
d
16:23
MEM(EA+2,1)
d
24:31
MEM(EA+3,1)
d
32:39
MEM(EA,1)
d
40:47
MEM(EA+1,1)
d
48:55
MEM(EA+2,1)
d
56:63
MEM(EA+3,1)
if U=1 then a EA
The word addressed by EA is loaded from memory as a quad of bytes and placed in both word
elements of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-320 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-320. __ev_lwbsplatw[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwbsplatw d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwbsplatwu d,b(a)
c d
0 1 2 3
a b
c d a b d a b c
Memory
GPR in big and
Byte address
little endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-446 Freescale Semiconductor
SPE2 Operations
__ev_lwbsplatw[m]x __ev_lwbsplatw[m]x
Vector Load Word as Bytes and Splat Words [with Modify] Indexed
d = __ev_lwbsplatwx (a,b) (M=0)
d = __ev_lwbsplatwmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp (a)
EA temp+ (b)
d
0:7
MEM(EA,1)
d
8:15
MEM(EA+1,1)
*
d
16:23
MEM(EA+2,1)
*
d
24:31
MEM(EA+3,1)
*
d
32:63
d
0:31
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory as a quad of bytes and placed in both word
elements of parameter d.
If U=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-321 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-321. __ev_lwbsplatw[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwbsplatwx d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwbsplatwmx d,a,b
c d
0 1 2 3
a b
c d a b d a b c
Memory
GPR in big and
Byte address
little endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-447
__ev_lwhe[u] __ev_lwhe[u]
Vector Load Word into Two Half Words Even [with Update]
d = __ev_lwhe (a,b) (U=0)
d = __ev_lwheu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:15
MEM(EA,2)
d
16:31
0x0000
d
32:47
MEM(EA+2,2)
d
48:63
0x0000
if (U=1) then a EA
The word addressed by EA is loaded from memory and placed in the even half words in each word
element of parameter d. The odd half words are zeroed.
If U=1 (with update), EA is placed into parameter a.
Figure 3-322 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-322. __ev_lwhe[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwhe d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwheu d,b(a)
c d
0 1 2 3
a b
Z Z c d Z a b Z
Z Z d c Z b a Z
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
SPE2 Programming Interface Manual, Rev. 1.0-2
3-448 Freescale Semiconductor
SPE2 Operations
__ev_lwhe[m]x __ev_lwhe[m]x
Vector Load Word into Two Half Words Even [with Modify] Indexed
d = __ev_lwhex (a,b) (M=0)
d = __ev_lwhemx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0
else temp(a)
EA temp + (b)
d
0:15
MEM(EA,2)
*
d
16:31
0x0000
d
32:47
MEM(EA+2,2)
*
d
48:63
0x0000
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in the even half words in each word
element of parameter d. The odd half words are zeroed.
If U=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-323 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-323. __ev_lwhe[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwhex d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwhemx d,a,b
c d
0 1 2 3
a b
Z Z c d Z a b Z
Z Z d c Z b a Z
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-449
__ev_lwhos[u] __ev_lwhos[u]
Vector Load Word into Two Half Words Odd Signed (with sign extension) [with Update]
d = __ev_lwhos (a,b) (U=0)
d = __ev_lwhosu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:31
EXTS(MEM(EA,2))
d
32:63
EXTS(MEM(EA+2,2))
if (U=1) then a EA
The word addressed by EA is loaded from memory and placed in the odd half words sign extended
in each word element of parameter d.
Figure 3-324 shows how bytes are loaded into parameter d as determined by the endian mode.
In big-endian memory, the msbs of parameters a and c are sign-extended.
In little-endian memory, the msbs of parameters b and d are sign-extended.
Figure 3-324. __ev_lwhos[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwhos d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwhosu d,b(a)
c d
0 1 2 3
a b
a b S S d S S c
b a S S c S S d
Memory
GPR in big endian
GPR in little endian
Byte address
S = sign
S = sign
SPE2 Programming Interface Manual, Rev. 1.0-2
3-450 Freescale Semiconductor
SPE2 Operations
__ev_lwhos[m]x __ev_lwhos[m]x
Vector Load Word into Two Half Words Odd Signed [with Modify] Indexed (with sign extension)
d = __ev_lwhosx (a,b) (M=0)
d = __ev_lwhosmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp0
else temp (a)
EA temp + (b)
d
0:31
EXTS
32
(MEM(EA,2)
*
)
d
32:63
EXTS
32
(MEM(EA+2,2)
*
)
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in the odd half words sign-extended
in each word element of parameter d.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-325 shows how bytes are loaded into parameter d as determined by the endian mode.
In big-endian memory, the msbs of parameters a and c are sign-extended.
In little-endian memory, the msbs of parameters b and d are sign-extended.
Figure 3-325. __ev_lwhos[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwhosx d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwhosmx d,a,b
c d
0 1 2 3
a b
a b S S d S S c
b a S S c S S d
Memory
GPR in big endian
GPR in little endian
Byte address
S = sign
S = sign
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-451
__ev_lwhou[u] __ev_lwhou[u]
Vector Load Word into Two Half Words Odd Unsigned (zero-extended) [with Update]
d = __ev_lwhou (a,b) (U=0)
d = __ev_lwhouu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:15
0x0000
d
16:31
MEM(EA,2)
d
32:47
0x0000
d
48:63
MEM(EA+2,2)
if (U=1) then a EA
The word addressed by EA is loaded from memory and placed in the odd half words zero extended
in each word element of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-326 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-326. __ev_lwhou[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwhou d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwhouu d,b(a)
c d
0 1 2 3
a b
a b Z Z d Z Z c
b a Z Z c Z Z d
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
SPE2 Programming Interface Manual, Rev. 1.0-2
3-452 Freescale Semiconductor
SPE2 Operations
__ev_lwhou[m]x __ev_lwhou[m]x
Vector Load Word into Two Half Words Odd Unsigned [with Modify] Indexed (zero-extended)
d = __ev_lwhoux (a,b) (M=0)
d = __ev_lwhoumx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp (a)
EA temp + (b)
d
0:15
0x0000
d
16:31
MEM(EA,2)
*
d
32:47
0x0000
d
48:63
MEM(EA+2,2)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in the odd half words zero extended
in each word element of parameter d.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-327 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-327. __ev_lwhou[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwhoux d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwhoumx d,a,b
c d
0 1 2 3
a b
a b Z Z d Z Z c
b a Z Z c Z Z d
Memory
GPR in big endian
GPR in little endian
Byte address
Z = zero
Z = zero
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-453
__ev_lwhsplat[u] __ev_lwhsplat[u]
Vector Load Word into Two Half Words and Splat [with Update]
d = __ev_lwhsplat (a,b) (U=0)
d = __ev_lwhsplatu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA temp + EXTZ(b*4)
d
0:15
MEM(EA,2)
d
16:31
MEM(EA,2)
d
32:47
MEM(EA+2,2)
d
48:63
MEM(EA+2,2)
if U=1 then a EA
The word addressed by EA is loaded from memory and placed in both the even and odd half words
in each word element of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-328 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-328. __ev_lwhsplat[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwhsplat d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwhsplatu d,b(a)
c d
0 1 2 3
a b
a b c d d a b c
b a d c c b a d
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-454 Freescale Semiconductor
SPE2 Operations
__ev_lwhsplatw[u] __ev_lwhsplatw[u]
Vector Load Word as Half Words and Splat Words [with Update]
d = __ev_lwhsplatw (a,b) (U=0)
d = __ev_lwhsplatwu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
EA b + EXTZ(b*4)
d
0:15
MEM(EA,2)
d
16:31
MEM(EA+2,2)
d
32:47
MEM(EA,2)
d
48:63
MEM(EA+2,2)
if (U=1) then a EA
The word addressed by EA is loaded from memory as a pair of half words and placed in both word
elements of parameter d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-329 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-329. __ev_lwhsplatw[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwhsplatw d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwhsplatwu d,b(a)
c d
0 1 2 3
a b
c d a b d a b c
d c b a c b a d
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-455
__ev_lwhsplatw[m]x __ev_lwhsplatw[m]x
Vector Load Word as Half Words and Splat Words [with Modify] Indexed
d = __ev_lwhsplatwx (a,b) (M=0)
d = __ev_lwhsplatwmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp(a)
EA temp+ (b)
d
0:15
MEM(EA,2)
*
d
16:31
MEM(EA+2,2)
*
d
32:47
MEM(EA,2)
*
d
48:63
MEM(EA+2,2)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory as a pair of half words and placed in both word
elements of parameter d.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-330 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-330. __ev_lwhsplatw[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwhsplatwx d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwhsplatwmx d,a,b
c d
0 1 2 3
a b
c d a b d a b c
d c b a c b a d
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-456 Freescale Semiconductor
SPE2 Operations
__ev_lwhsplat[m]x __ev_lwhsplat[m]x
Vector Load Word into Two Half Words and Splat [with Modify] Indexed
d = __ev_lwhsplatx (a,b) (M=0)
d = __ev_lwhsplatmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp (a)
EA temp + (b)
d
0:15
MEM(EA,2)
*
d
16:31
MEM(EA,2)
*
d
32:47
MEM(EA+2,2)
*
d
48:63
MEM(EA+2,2)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in both the even and odd half words
in each word element of parameter d.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-331 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-331. __ev_lwhsplat[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwhsplatx d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwhsplatmx d,a,b
c d
0 1 2 3
a b
a b c d d a b c
b a d c c b a d
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-457
__ev_lwwsplat[u] __ev_lwwsplat[u]
Vector Load Word into Word and Splat [with Update]
d = __ev_lwwsplat (a,b) (U=0)
d = __ev_lwwsplatu (a,b) (U=1)
if (a = r0) then temp 0
else temp (a)
UIMM b
EA temp + EXTZ(UIMM*4)
d
0:31
MEM(EA,4)
d
32:63
MEM(EA,4)
if (U=1) then a EA
The word addressed by EA is loaded from memory and placed in both word elements of parameter
d.
If U=1 (with update), EA is placed into parameter a.
Figure 3-332 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-332. __ev_lwwsplat[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter a is both an input and output
operand. Also, for the with update form (U=1), parameter b cannot
be 0.
U d a b Maps to
U = 0 __ev64_opaque__ uint32_t * 5-bit unsigned literal evlwwsplat d,b(a)
U = 1 __ev64_opaque__ uint32_t *& 5-bit unsigned literal evlwwsplatu d,b(a)
c d
0 1 2 3
a b
c d a b d a b c
b a d c a d c b
Memory
GPR in big endian
GPR in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-458 Freescale Semiconductor
SPE2 Operations
__ev_lwwsplat[m]x __ev_lwwsplat[m]x
Vector Load Word into Word and Splat [with Modify] Indexed
d = __ev_lwwsplatx (a,b) (M=0)
d = __ev_lwwsplatmx (a,b) (M=1)
if a=0 & M=1 then take_illegal_exception
if a=0 & M=0 then temp 0
else temp (a)
EA temp + (b)
d
0:31
MEM(EA,4)
*
d
32:63
MEM(EA,4)
*
if (M=1) then a
32:63
calc_a_update(a,b)
* - may wrap at length boundary for M=1 and mode 1000.
The word addressed by EA is loaded from memory and placed in both word elements of parameter
d.
If M=1 (with modify), parameter a is updated with an address value determined by the mode
specifier in parameter b. See 3.2.3, Addressing Modes - Modify forms.
Figure 3-333 shows how bytes are loaded into parameter d as determined by the endian mode.
Figure 3-333. __ev_lwwsplat[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter a,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter a is both an input and
output operand.
U d a b Maps to
M = 0 __ev64_opaque__ uint32_t * int32_t evlwwsplatx d,a,b
M = 1 __ev64_opaque__ uint32_t *& int32_t evlwwsplatmx d,a,b
c d
0 1 2 3
a b
c d a b d a b c
b a d c a d c b
Memory
GPR in big endian
GPR in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-459
__ev_mar __ev_mar
Store Accumulator
d = __ev_mar ()
d
0:63
ACC
0:63
The contents of the accumulator are copied into parameter d. This is the method for saving the
accumulator.
Figure 3-334. Store Accumulator (__ev_mar)
d Maps to
__ev64_opaque__ evmar d
0 31 32 63
Accumulator
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-460 Freescale Semiconductor
SPE2 Operations
__ev_maxbpsh __ev_maxbpsh
Vector Maximum of Byte Pairs Signed to Half Word
d = __ev_maxbpsh (a)
d
0:15
EXTS(MAX
si
(a
0:7
,a
8:15
))
d
16:31
EXTS(MAX
si
(a
16:23
,a
24:31
))
d
32:47
EXTS(MAX
si
(a
32:39
,a
40:47
))
d
48:63
EXTS(MAX
si
(a
48:55
,a
56:63
))
Even/odd pairs of signed-integer byte elements of parameter a are compared and the larger of the
two signed-integer byte elements is sign-extended and placed into the corresponding half word
elements of parameter d.
Figure 3-335. Vector Maximum of Byte Pairs Signed to Half Word (__ev_maxbpsh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evmaxbpsh d,a
0 32 63
a
24 16 8 40 48 56
0 32 63
d
16 48
MAX
si
MAX
si
MAX
si
MAX
si
exts exts exts exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-461
__ev_maxbpuh __ev_maxbpuh
Vector Maximum of Byte Pairs Unsigned to Half Word
d = __ev_maxbpuh (a)
d
0:15
EXTZ(MAX
si
(a
0:7
,a
8:15
))
d
16:31
EXTZ(MAX
si
(a
16:23
,a
24:31
))
d
32:47
EXTZ(MAX
si
(a
32:39
,a
40:47
))
d
48:63
EXTZ(MAX
si
(a
48:55
,a
56:63
))
Even/odd pairs of unsigned-integer byte elements of parameter a are compared and the larger of
the two unsigned-integer byte elements is zero-extended and placed into the corresponding half
word element of parameter d.
Figure 3-336. Vector Maximum of Byte Pairs Unsigned to Half Word (__ev_maxbpuh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evmaxbpuh d,a
0 32 63
a
24 16 8 40 48 56
0 32 63
d
16 48
MAX
ui
MAX
ui
MAX
ui
MAX
ui
extz extz extz extz
SPE2 Programming Interface Manual, Rev. 1.0-2
3-462 Freescale Semiconductor
SPE2 Operations
__ev_maxbs __ev_maxbs
Vector Maximum Byte Signed
d = __ev_maxbs (a,b)
d
0:7
MAX
si
(a
0:7
,b
0:7
)
d
8:15
MAX
si
(a
8:15
,b
8:15
)
d
16:23
MAX
si
(a
16:23
,b
16:23
)
d
24:31
MAX
si
(a
24:31
,b
24:31
)
d
32:39
MAX
si
(a
32:39
,b
32:39
)
d
40:47
MAX
si
(a
40:47
,b
40:47
)
d
48:55
MAX
si
(a
48:55
,b
48:55
)
d
56:63
MAX
si
(a
56:63
,b
56:63
)
Each signed-integer byte element of parameter a is compared to the corresponding signed-integer
byte element of parameter b and the larger of the two signed-integer elements is placed into the
corresponding byte element of parameter d.
Figure 3-337. Vector Maximum Byte Signed (__ev_maxbs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxbs d,a,b
0 32 63
b
a
24 16 8 40 48 56
MAX
si
0 32 63
d
24 16 8 40 48 56
MAX
si
MAX
si
MAX
si
MAX
si
MAX
si
MAX
si
MAX
si
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-463
__ev_maxbu __ev_maxbu
Vector Maximum Byte Unsigned
d = __ev_maxbu (a,b)
d
0:7
MAX
ui
(a
0:7
,b
0:7
)
d
8:15
MAX
ui
(a
8:15
,b
8:15
)
d
16:23
MAX
ui
(a
16:23
,b
16:23
)
d
24:31
MAX
ui
(a
24:31
,b
24:31
)
d
32:39
MAX
ui
(a
32:39
,b
32:39
)
d
40:47
MAX
ui
(a
40:47
,b
40:47
)
d
48:55
MAX
ui
(a
48:55
,b
48:55
)
d
56:63
MAX
ui
(a
56:63
,b
56:63
)
Each signed-integer byte element of parameter a is compared to the corresponding signed-integer
byte element of parameter b and the larger of the two signed-integer elements is placed into the
corresponding byte element of parameter d.
Figure 3-338. Vector Maximum Byte Unsigned (__ev_maxbu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxbu d,a,b
0 32 63
b
a
24 16 8 40 48 56
MAX
ui
0 32 63
d
24 16 8 40 48 56
MAX
ui
MAX
ui
MAX
ui
MAX
ui
MAX
ui
MAX
ui
MAX
ui
SPE2 Programming Interface Manual, Rev. 1.0-2
3-464 Freescale Semiconductor
SPE2 Operations
__ev_maxds __ev_maxds
Vector Maximum Doubleword Signed
d = __ev_maxds (a,b)
d
0:63
MAX
si
(a
0:63
, b
0:63
)
The signed-integer doubleword in parameter a is compared to the signed-integer doubleword in
parameter b and the larger of the two doublewords is placed into parameter d.
Figure 3-339. Vector Maximum Doubleword Signed (__ev_maxds)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxds d,a,b
b
0 63
a
d
MAX
si
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-465
__ev_maxdu __ev_maxdu
Vector Maximum Doubleword Unsigned
d = __ev_maxdu (a,b)
d
0:63
MAX
ui
(a
0:63
, b
0:63
)
The unsigned-integer doubleword in parameter a is compared to the unsigned-integer doubleword
in parameter b and the larger of the two doublewords is placed into parameter d.
Figure 3-340. Vector Maximum Doubleword Unsigned (__ev_maxdu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxdu d,a,b
b
0 63
a
d
MAX
ui
SPE2 Programming Interface Manual, Rev. 1.0-2
3-466 Freescale Semiconductor
SPE2 Operations
__ev_maxhpsw __ev_maxhpsw
Vector Maximum of Half Word Pairs Signed to Word
d = __ev_maxhpsw (a)
d
0:31
EXTS(MAX
si
(a
0:15
,a
16:31
))
d
32:63
EXTS(MAX
si
(a
32:47
,a
48:63
))
Even/odd pairs of signed-integer half word elements of parameter a are compared and the larger
of the two signed-integer half word elements is sign-extended and placed into the corresponding
word element of parameter d.
Figure 3-341. Vector Maximum of Half Word Pairs Signed to Word (__ev_maxhpsw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evmaxhpsw d,a
0 32 63
a
16 48
0 32 63
d
MAX
si
exts
MAX
si
exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-467
__ev_maxhpuw __ev_maxhpuw
Vector Maximum of Half Word Pairs Unsigned to Word
d = __ev_maxhpuw (a)
d
0:31
EXTZ(MAX
si
(a
0:15
,a
16:31
))
d
32:63
EXTZ(MAX
si
(a
32:47
,a
48:63
))
Even/odd pairs of unsigned-integer half word elements of parameter a are compared and the larger
of the two unsigned-integer half word elements is zero-extended and placed into the corresponding
word element of parameter d.
Figure 3-342. Vector Maximum of Half Word Pairs Unsigned to Word (__ev_maxhpuw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evmaxhpuw d,a
0 32 63
a
16 48
0 32 63
d
MAX
ui
extz
MAX
ui
extz
SPE2 Programming Interface Manual, Rev. 1.0-2
3-468 Freescale Semiconductor
SPE2 Operations
__ev_maxhs __ev_maxhs
Vector Maximum Half Word Signed
d = __ev_maxhs (a,b)
d
0:15
MAX
si
(a
0:15
, b
0:15
)
d
16:31
MAX
si
(a
16:31
, b
16:31
)
d
32:47
MAX
si
(a
32:47
, b
32:47
)
d
48:63
MAX
si
(a
48:63
, b
48:63
)
Each signed-integer half word element of parameter a is compared to the corresponding
signed-integer half word element of parameter b and the larger of the two signed-integer elements
is placed into the corresponding half word element of parameter d.
Figure 3-343. Vector Maximum Half Word Signed (__ev_maxhs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxhs d,a,b
b
0 31 32 63
a
15 16 47 48
MAX
si
d
MAX
si
MAX
si
MAX
si
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-469
__ev_maxhu __ev_maxhu
Vector Maximum Half Word Unsigned
d = __ev_maxhu (a,b)
d
0:15
MAX
ui
(a
0:15
, b
0:15
)
d
16:31
MAX
ui
(a
16:31
, b
16:31
)
d
32:47
MAX
ui
(a
32:47
, b
32:47
)
d
48:63
MAX
ui
(a
48:63
, b
48:63
)
Each unsigned-integer half word element of parameter a is compared to the corresponding
unsigned-integer half word element of parameter b and the larger of the two unsigned-integer
elements is placed into the corresponding half word element of parameter d.
Figure 3-344. Vector Maximum Half Word Unsigned (__ev_maxhu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxhu d,a,b
b
0 31 32 63
a
15 16 47 48
MAX
ui
d
MAX
ui
MAX
ui
MAX
ui
SPE2 Programming Interface Manual, Rev. 1.0-2
3-470 Freescale Semiconductor
SPE2 Operations
__ev_maxmagws __ev_maxmagws
Vector Maximum Magnitude Word Signed
d = __ev_maxmagws (a,b)
if ((ABS(a
0:31
) > ABS(b
0:31
)) | ((a
0:31
>0) & (ABS(a
0:31
) = ABS(b
0:31
)))) then
temph1
0:31
a
0:31
else
temph1
0:31
b
0:31
endif
if ((ABS(a
32:63
) > ABS(b
32:63
)) | ((a
32:63
>0) & (ABS(a
32:63
) = ABS(b
32:63
)))) then
templ1
0:31
a
32:63
else
templ1
0:31
b
32:63
endif
d
0:31
temph1
0:31
; d
32:63
templ1
0:31
The magnitude of each signed-integer word element of parameter a is compared to the magnitude
of the corresponding signed-integer word element of parameter b. The word element from
parameter a or b with the larger magnitude is placed into the corresponding word element of
parameter d. If the magnitudes of parameters a and b are equal, a positive value is selected if either
word element is positive.
Figure 3-345. Vector Maximum Magnitude Word Signed (__ev_maxmagws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxmagws d,a,b
b
0 31 32 63
a
d
( (|a
0:31
| > |b
0:31
|)
1 0 1 0
| ((a
0:31
>0) & (|a
0:31
| = |b
0:31
|)) )
( (|a
32:63
| > |b
32:63
|)
| ((a
32:63
>0) & (|a
32:63
| = |b
32:63
|)) )
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-471
__ev_maxwpsd __ev_maxwpsd
Vector Maximum of Word Pair Signed to Double Word
d = __ev_maxwpsd (a)
d
0:63
EXTS(MAX
si
(a
0:31
,a
32:63
))
The signed-integer word elements of parameter a are compared and the larger of the two
signed-integer word elements is sign-extended and placed into parameter d.
Figure 3-346. Vector Maximum of Word Pairs Signed to Doubleword (__ev_maxwpsd)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evmaxwpsd d,a
0 32 63
a
0 63
d
MAX
si
exts
SPE2 Programming Interface Manual, Rev. 1.0-2
3-472 Freescale Semiconductor
SPE2 Operations
__ev_maxwpud __ev_maxwpud
Vector Maximum of Word Pair Unsigned to Double Word
d = __ev_maxwpud (a)
d
0:63
EXTZ(MAX
ui
(a
0:31
,a
32:63
))
The unsigned-integer word elements of parameter a are compared and the larger of the two
unsigned-integer word elements is zero-extended and placed into parameter d.
Figure 3-347. Vector Maximum of Word Pairs Unsigned to Double Word (__ev_maxwpud)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evmaxwpud d,a
0 32 63
a
0 63
d
MAX
ui
extz
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-473
__ev_maxws __ev_maxws
Vector Maximum Word Signed
d = __ev_maxws (a,b)
d
0:31
MAX
si
(a
0:31
, b
0:31
)
d
32:63
MAX
si
(a
32:63
, b
32:63
)
Each signed-integer word element of parameter a is compared to the corresponding signed-integer
word element of parameter b and the larger of the two signed-integer elements is placed into the
corresponding word element of parameter d.
Figure 3-348. Vector Maximum Word Signed (__ev_maxws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxws d,a,b
b
0 31 32 63
a
15 16 47 48
d
MAX
si
MAX
si
SPE2 Programming Interface Manual, Rev. 1.0-2
3-474 Freescale Semiconductor
SPE2 Operations
__ev_maxwu __ev_maxwu
Vector Maximum Word Unsigned
d = __ev_maxwu (a,b)
d
0:31
MAX
ui
(a
0:31
, b
0:31
)
d
32:63
MAX
ui
(a
32:63
, b
32:63
)
Each unsigned-integer word element of parameter a is compared to the corresponding
unsigned-integer word element of parameter b and the larger of the two unsigned-integer elements
is placed into the corresponding word element of parameter d.
Figure 3-349. Vector Maximum Word Unsigned (__ev_maxwu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmaxwu d,a,b
b
0 31 32 63
a
15 16 47 48
d
MAX
ui
MAX
ui
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-475
__ev_mbesmi[a] __ev_mbesmi[a]
Vector Multiply Bytes Even, Signed, Modulo, Integer (to Accumulator)
d = __ev_mbesmi (a,b) (A=0)
d = __ev_mbesmia (a,b) (A=1)
d
0:15
a
0:7
si
b
0:7
d
16:31
a
16:23
si
b
16:23
d
32:47
a
32:39
si
b
32:39
d
48:63
a
48:55
si
b
48:55
// update accumulator
if A = 1, then ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte signed
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding half words in parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-350. Vector Multiply Bytes Even, Signed, Modulo, Integer (to Accumulator)
(__ev_mbesmi[a])
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesmi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesmia d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
63
d (and Accumulator if A=1)
0 16 32 48
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
SPE2 Programming Interface Manual, Rev. 1.0-2
3-476 Freescale Semiconductor
SPE2 Operations
__ev_mbesmiaah __ev_mbesmiaah
Vector Multiply Byte Even, Signed, Modulo, Integer and Accumulate Half Words
d = __ev_mbesmiaah (a,b)
temp0
0:15
a
0:7
si
b
0:7
d
0:15
temp0
0:15
+ACC
0:15
temp1
0:15
a
16:23
si
b
16:23
d
16:31
temp1
0:15
+ACC
16:31
temp2
0:15
a
32:39
si
b
32:39
d
32:47
temp2
0:15
+ACC
32:47
temp3
0:15
a
48:55
si
b
48:55
d
48:63
temp3
0:15
+ACC
48:63
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte signed
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-351. Vector Multiply Bytes Even, Signed, Modulo, Integer and Accumulate Half
Words (__ev_mbesmiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesmiaah d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-477
__ev_mbesmianh __ev_mbesmianh
Vector Multiply Byte Even, Signed, Modulo, Integer and Accumulate Negative Half Words
d = __ev_mbesmianh (a,b)
temp0
0:15
a
0:7
si
b
0:7
d
0:15
ACC
0:15
-
temp0
0:15
temp1
0:15
a
16:23
si
b
16:23
d
16:31
ACC
16:31
- temp1
0:15
temp2
0:15
a
32:39
si
b
32:39
d
32:47
ACC
32:47
- temp2
0:15
temp3
0:15
a
48:55
si
b
48:55
d
48:63
ACC
48:63
- temp3
0:15
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte signed
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is
subtracted from the contents of the accumulator half words to form intermediate differences,
which are placed into the corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-352. Vector Multiply Bytes Even, Signed, Modulo, Integer and Accumulate
Negative Half Words (__ev_mbesmianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesmianh d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-478 Freescale Semiconductor
SPE2 Operations
__ev_mbessiaah __ev_mbessiaah
Vector Multiply Byte Even, Signed, Saturate, Integer and Accumulate Half Words
d = __ev_mbessiaah (a,b)
temp
0:15
a
0:7
si
b
0:7
temp
0:31
EXTS(ACC
0:15
) +
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
16:23
si
b
16:23
temp
0:31
EXTS(ACC
16:31
) +
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
32:39
si
b
32:39
temp
0:31
EXTS(ACC
32:47
) +
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
48:55
si
b
48:55
temp
0:31
EXTS(ACC
48:63
) +
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding even-numbered byte signed integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then added to the
corresponding half word in the accumulator, saturating if overflow or underflow occurs, and the
result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-479
Figure 3-353. Vector Multiply Bytes Even, Signed, Saturate, Integer and Accumulate Half
Words (__ev_mbessiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbessiaah d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-480 Freescale Semiconductor
SPE2 Operations
__ev_mbessianh __ev_mbessianh
Vector Multiply Byte Even, Signed, Saturate, Integer and Accumulate Half Words
d = __ev_mbessianh (a,b)
temp
0:15
a
0:7
si
b
0:7
temp
0:31
EXTS(ACC
0:15
) -
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
16:23
si
b
16:23
temp
0:31
EXTS(ACC
16:31
) -
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
32:39
si
b
32:39
temp
0:31
EXTS(ACC
32:47
) -
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
48:55
si
b
48:55
temp
0:31
EXTS(ACC
48:63
) -
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding even-numbered byte signed integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then subtracted from
the corresponding half word in the accumulator, saturating if overflow or underflow occurs, and
the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the subtraction, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-481
Figure 3-354. Vector Multiply Bytes Even, Signed, Saturate, Integer and Accumulate
Negative Half Words (__ev_mbessianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbessianh d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-482 Freescale Semiconductor
SPE2 Operations
__ev_mbesumi[a] __ev_mbesumi[a]
Vector Multiply Bytes Even, Signed by Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mbesumi (a,b) (A=0)
d = __ev_mbesumia (a,b) (A=1)
d
0:15
a
0:7
su
b
0:7
d
16:31
a
16:23
su
b
16:23
d
32:47
a
32:39
su
b
32:39
d
48:63
a
48:55
su
b
48:55
// update accumulator
if A = 1, then ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. Each
intermediate 16-bit product is added to the contents of the accumulator half words to form
intermediate sums, which are placed into the corresponding half words in parameter d. If A = 1,
the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-355. Vector Multiply Bytes Even, Signed by Unsigned, Modulo, Integer (to
Accumulator) (__ev_mbesumi[a])
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesumi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesumia d,a,b
x x x
63
a (signed) 56
0 8 16 24 32 40 48 56
63
d (and Accumulator if A=1)
0 16 32 48
63
b (unsigned) 56
0 8 16 24 32 40 48 56
x
intermediate products
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-483
__ev_mbesumiaah __ev_mbesumiaah
Vector Multiply Byte Even, Signed by Unsigned, Modulo, Integer and Accumulate Half Words
d = __ev_mbesumiaah (a,b)
temp0
0:15
a
0:7
su
b
0:7
d
0:15
temp0
0:15
+ACC
0:15
temp1
0:15
a
16:23
su
b
16:23
d
16:31
temp1
0:15
+ACC
16:31
temp2
0:15
a
32:39
su
b
32:39
d
32:47
temp2
0:15
+ACC
32:47
temp3
0:15
a
48:55
su
b
48:55
d
48:63
temp3
0:15
+ACC
48:63
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. Each
intermediate 16-bit product is added to the contents of the accumulator half words to form
intermediate sums, which are placed into the corresponding parameter d half words and into the
accumulator.
Other registers altered: ACC
Figure 3-356. Vector Multiply Bytes Even, Signed by Unsigned, Modulo, Integer and
Accumulate Half Words (__ev_mbesumiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesumiaah d,a,b
x x x
63
a (signed) 56
0 8 16 24 32 40 48 56
Accumulator
63
b (unsigned) 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-484 Freescale Semiconductor
SPE2 Operations
__ev_mbesumianh __ev_mbesumianh
Vector Multiply Byte Even, Signed by Unsigned, Modulo, Integer and Accumulate Negative Half
Words
d = __ev_mbesumianh (a,b)
temp0
0:15
a
0:7
su
b
0:7
d
0:15
ACC
0:15
-
temp0
0:15
temp1
0:15
a
16:23
su
b
16:23
d
16:31
ACC
16:31
- temp1
0:15
temp2
0:15
a
32:39
su
b
32:39
d
32:47
ACC
32:47
- temp2
0:15
temp3
0:15
a
48:55
su
b
48:55
d
48:63
ACC
48:63
- temp3
0:15
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. Each
intermediate 16-bit product is subtracted from the contents of the accumulator half words to form
intermediate differences, which are placed into the corresponding parameter d half words and into
the accumulator.
Other registers altered: ACC
Figure 3-357. Vector Multiply Bytes Even, Signed by Unsigned, Modulo, Integer and
Accumulate Negative Half Words (evmbesumianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesumianh d,a,b
x x x
63
a (signed) 56
0 8 16 24 32 40 48 56
Accumulator
63
b (unsigned) 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-485
__ev_mbesusiaah __ev_mbesusiaah
Vector Multiply Byte Even, Signed by Unsigned, Saturate, Integer and Accumulate Half Words
d = __ev_mbesusiaah (a,b)
temp
0:15
a
0:7
su
b
0:7
temp
0:31
EXTS(ACC
0:15
) +
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
16:23
su
b
16:23
temp
0:31
EXTS(ACC
16:31
) +
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
32:39
su
b
32:39
temp
0:31
EXTS(ACC
32:47
) +
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
48:55
su
b
48:55
temp
0:31
EXTS(ACC
48:63
) +
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding even-numbered byte signed integer elements in parameter a and unsigned
integer element in parameter b are multiplied, producing a 16-bit intermediate product. Each
16-bit product is then added to the corresponding half word in the accumulator, saturating if
overflow or underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-486 Freescale Semiconductor
SPE2 Operations
Figure 3-358. Vector Multiply Bytes Even, Signed by Unsigned, Saturate, Integer and
Accumulate Half Words (__ev_mbesusiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesusiaah d,a,b
x x x
63
a (signed) 56
0 8 16 24 32 40 48 56
Accumulator
63
b (unsigned) 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-487
__ev_mbesusianh __ev_mbesusianh
Vector Multiply Byte Even, Signed by Unsigned, Saturate, Integer and Accumulate Negative Half
Words
d = __ev_mbesusianh (a,b)
temp
0:15
a
0:7
su
b
0:7
temp
0:31
EXTS(ACC
0:15
) -
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
16:23
su
b
16:23
temp
0:31
EXTS(ACC
16:31
) -
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
32:39
su
b
32:39
temp
0:31
EXTS(ACC
32:47
) -
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
48:55
su
b
48:55
temp
0:31
EXTS(ACC
48:63
) -
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding even-numbered byte signed integer element in parameter a and unsigned
integer element in parameter b are multiplied, producing a 16-bit intermediate product. Each
16-bit product is then subtracted from the corresponding half word in the accumulator, saturating
if overflow or underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the subtraction, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-488 Freescale Semiconductor
SPE2 Operations
Figure 3-359. Vector Multiply Bytes Even, Signed by Unsigned, Saturate, Integer and
Accumulate Negative Half Words (__ev_mbesusianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbesusianh d,a,b
x x x
63
a (signed) 56
0 8 16 24 32 40 48 56
Accumulator
63
b (unsigned) 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-489
__ev_mbeumi[a] __ev_mbeumi[a]
Vector Multiply Bytes Even, Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mbeumi (a,b) (A=0)
d = __ev_mbeumia (a,b) (A=1)
d
0:15
a
0:7
ui
b
0:7
d
16:31
a
16:23
ui
b
16:23
d
32:47
a
32:39
ui
b
32:39
d
48:63
a
48:55
ui
b
48:55
// update accumulator
if A = 1, then ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte unsigned
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding half words in parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-360. Vector Multiply Bytes Even, Unsigned, Modulo, Integer (to Accumulator)
(__ev_mbeumi[a])
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbeumi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbeumia d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
63
d (and Accumulator if A=1)
0 16 32 48
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
SPE2 Programming Interface Manual, Rev. 1.0-2
3-490 Freescale Semiconductor
SPE2 Operations
__ev_mbeumiaah __ev_mbeumiaah
Vector Multiply Byte Even, Unsigned, Modulo, Integer and Accumulate Half Words
d = __ev_mbeumiaah (a,b)
temp0
0:15
a
0:7
ui
b
0:7
d
0:15
temp0
0:15
+ACC
0:15
temp1
0:15
a
16:23
ui
b
16:23
d
16:31
temp1
0:15
+ACC
16:31
temp2
0:15
a
32:39
ui
b
32:39
d
32:47
temp2
0:15
+ACC
32:47
temp3
0:15
a
48:55
ui
b
48:55
d
48:63
temp3
0:15
+ACC
48:63
// update accumulator
ACC
0:63
rD
0:63
For each half word element in the accumulator, the corresponding even-numbered byte unsigned
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-361. Vector Multiply Bytes Even, Unsigned, Modulo, Integer and Accumulate Half
Words (__ev_mbeumiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbeumiaah d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-491
__ev_mbeumianh __ev_mbeumianh
Vector Multiply Byte Even, Unsigned, Modulo, Integer and Accumulate Negative Half Words
d = __ev_mbeumianh (a,b)
temp0
0:15
a
0:7
ui
b
0:7
d
0:15
ACC
0:15
-
temp0
0:15
temp1
0:15
a
16:23
ui
b
16:23
d
16:31
ACC
16:31
- temp1
0:15
temp2
0:15
a
32:39
ui
b
32:39
d
32:47
ACC
32:47
- temp2
0:15
temp3
0:15
a
48:55
ui
b
48:55
d
48:63
ACC
48:63
- temp3
0:15
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding even-numbered byte unsigned
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is
subtracted from the contents of the accumulator half words to form intermediate differences,
which are placed into the corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-362. Vector Multiply Bytes Even, Unsigned, Modulo, Integer and Accumulate
Negative Half Words (__ev_mbeumianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbeumianh d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-492 Freescale Semiconductor
SPE2 Operations
__ev_mbeusiaah __ev_mbeusiaah
Vector Multiply Byte Even, Unsigned, Saturate, Integer and Accumulate Half Words
d = __ev_mbeusiaah (a,b)
temp
0:15
a
0:7
ui
b
0:7
temp
0:31
EXTZ(ACC
0:15
) +
EXTZ(temp
0:15
)
ovh0 temp
15
d
0:15
SATURATE(ovh0, 0, 0xFFFF, 0xFFFF, temp
16:31
)
temp
0:15
a
16:23
ui
b
16:23
temp
0:31
EXTZ(ACC
16:31
) +
EXTZ(temp
0:15
)
ovh1 temp
15
d
16:31
SATURATE(ovh1, 0, 0xFFFF, 0xFFFF, temp
16:31
)
temp
0:15
a
32:39
ui
b
32:39
temp
0:31
EXTZ(ACC
32:47
) +
EXTZ(temp
0:15
)
ovl0 temp
15
d
32:47
SATURATE(ovl0, 0, 0xFFFF, 0xFFFF, temp
16:31
)
temp
0:15
a
48:55
ui
b
48:55
temp
0:31
EXTZ(ACC
48:63
) +
EXTZ(temp
0:15
)
ovl1 temp
15
d
48:63
SATURATE(ovl1, 0, 0xFFFF, 0xFFFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding even-numbered byte unsigned integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then added to the
corresponding half word in the accumulator, saturating if overflow occurs, and the result is placed
in parameter d and the accumulator.
If there is an overflow from the addition, the overflow and summary overflow bits are recorded in
the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-493
Figure 3-363. Vector Multiply Bytes Even, Unsigned, Saturate, Integer and Accumulate
Half Words (__ev_mbeusiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbeusiaah d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
SPE2 Programming Interface Manual, Rev. 1.0-2
3-494 Freescale Semiconductor
SPE2 Operations
__ev_mbeusianh __ev_mbeusianh
Vector Multiply Byte Even, Unsigned, Saturate, Integer and Accumulate Negative Half Words
d = __ev_mbeusianh (a,b)
temp
0:15
a
0:7
ui
b
0:7
temp
0:31
EXTZ(ACC
0:15
) -
EXTZ(temp
0:15
)
ovh0 temp
15
d
0:15
SATURATE(ovh0, 0, 0x0000, 0x0000, temp
16:31
)
temp
0:15
a
16:23
ui
b
16:23
temp
0:31
EXTZ(ACC
16:31
) -
EXTZ(temp
0:15
)
ovh1 temp
15
d
16:31
SATURATE(ovh1, 0, 0x0000, 0x0000, temp
16:31
)
temp
0:15
a
32:39
ui
b
32:39
temp
0:31
EXTZ(ACC
32:47
) -
EXTZ(temp
0:15
)
ovl0 temp
15
d
32:47
SATURATE(ovl0, 0, 0x0000, 0x0000, temp
16:31
)
temp
0:15
a
48:55
ui
b
48:55
temp
0:31
EXTZ(ACC
48:63
) -
EXTZ(temp
0:15
)
ovl1 temp
15
d
48:63
SATURATE(ovl1, 0, 0x0000, 0x0000, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding even-numbered byte unsigned integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then subtracted from
the corresponding half word in the accumulator, saturating if underflow occurs, and the result is
placed in parameter d and the accumulator.
If there is an underflow from the subtraction, the overflow and summary overflow bits are recorded
in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-495
Figure 3-364. Vector Multiply Bytes Even, Unsigned, Saturate, Integer and Accumulate
Negative Half Words (__ev_mbeusianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbeusianh d,a,b
x x x
63
a 56
0 8 16 24 32 40 48 56
Accumulator
63
b 56
0 8 16 24 32 40 48 56
x
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-496 Freescale Semiconductor
SPE2 Operations
__ev_mbosmi[a] __ev_mbosmi[a]
Vector Multiply Bytes Odd, Signed, Modulo, Integer (to Accumulator)
d = __ev_mbosmi (a,b) (A=0)
d = __ev_mbosmia (a,b) (A=1)
d
0:15
a
8:15
si
b
8:15
d
16:31
a
24:31
si
b
24:31
d
32:47
a
40:47
si
b
40:47
d
48:63
a
56:63
si
b
56:63
// update accumulator
if A = 1, then ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte signed
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding half words in parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-365. Vector Multiply Bytes Odd, Signed, Modulo, Integer (to Accumulator)
(__ev_mbosmi[a])
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosmi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosmia d,a,b
x x x x
63
a
0 8 16 24 32 40 48 56
63
d (and Accumulator if A=1)
0 16 32 48
63
b
0 8 16 24 32 40 48 56
intermediate products
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-497
__ev_mbosmiaah __ev_mbosmiaah
Vector Multiply Byte Odd, Signed, Modulo, Integer and Accumulate Half Words
d = __ev_mbosmiaah (a,b)
temp0
0:15
a
8:15
si
b
8:15
d
0:15
temp0
0:15
+ACC
0:15
temp1
0:15
a
24:31
si
b
24:31
d
16:31
temp1
0:15
+ACC
16:31
temp2
0:15
a
40:47
si
b
40:47
d
32:47
temp2
0:15
+ACC
32:47
temp3
0:15
a
56:63
si
b
56:63
d
48:63
temp3
0:15
+ACC
48:63
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte signed
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-366. Vector Multiply Bytes Odd, Signed, Modulo, Integer and Accumulate Half
Words (__ev_mbosmiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosmiaah d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-498 Freescale Semiconductor
SPE2 Operations
__ev_mbosmianh __ev_mbosmianh
Vector Multiply Byte Odd, Signed, Modulo, Integer and Accumulate Negative Half Words
d = __ev_mbosmianh (a,b)
temp0
0:15
a
8:15
si
b
8:15
d
0:15
ACC
0:15
-
temp0
0:15
temp1
0:15
a
24:31
si
b
24:31
d
16:31
ACC
16:31
- temp1
0:15
temp2
0:15
a
40:47
si
b
40:47
d
32:47
ACC
32:47
- temp2
0:15
temp3
0:15
a
56:63
si
b
56:63
d
48:63
ACC
48:63
- temp3
0:15
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte signed
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is
subtracted from the contents of the accumulator half words to form intermediate differences,
which are placed into the corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-367. Vector Multiply Bytes Odd, Signed, Modulo, Integer and Accumulate
Negative Half Words (__ev_mbosmianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosmianh d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-499
__ev_mbossiaah __ev_mbossiaah
Vector Multiply Byte Odd, Signed, Saturate, Integer and Accumulate Half Words
d = __ev_mbossiaah (a,b)
temp
0:15
a
8:15
si
b
8:15
temp
0:31
EXTS(ACC
0:15
) +
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
24:31
si
b
24:31
temp
0:31
EXTS(ACC
16:31
) +
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
40:47
si
b
40:47
temp
0:31
EXTS(ACC
32:47
) +
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
56:63
si
b
56:63
temp
0:31
EXTS(ACC
48:63
) +
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding odd-numbered byte signed integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then added to the
corresponding half word in the accumulator, saturating if overflow or underflow occurs, and the
result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-500 Freescale Semiconductor
SPE2 Operations
Figure 3-368. Vector Multiply Bytes Odd, Signed, Saturate, Integer and Accumulate Half
Words (__ev_mbossiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbossiaah d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-501
__ev_mbossianh __ev_mbossianh
Vector Multiply Byte Odd, Signed, Saturate, Integer and Accumulate Negative Half Words
d = __ev_mbossianh (a,b)
temp
0:15
a
8:15
si
b
8:15
temp
0:31
EXTS(ACC
0:15
) -
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
24:31
si
b
24:31
temp
0:31
EXTS(ACC
16:31
) -
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
40:47
si
b
40:47
temp
0:31
EXTS(ACC
32:47
) -
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
56:63
si
b
56:63
temp
0:31
EXTS(ACC
48:63
) -
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding odd-numbered byte signed integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then subtracted from
the corresponding half word in the accumulator, saturating if overflow or underflow occurs, and
the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the subtraction, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-502 Freescale Semiconductor
SPE2 Operations
Figure 3-369. Vector Multiply Bytes Odd, Signed, Saturate, Integer and Accumulate
Negative Half Words (__ev_mbossianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbossianh d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-503
__ev_mbosumi[a] __ev_mbosumi[a]
Vector Multiply Bytes Odd, Signed by Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mbosumi (a,b) (A=0)
d = __ev_mbosumia (a,b) (A=1)
d
0:15
a
8:15
su
b
8:15
d
16:31
a
24:31
su
b
24:31
d
32:47
a
40:47
su
b
40:47
d
48:63
a
56:63
su
b
56:63
// update accumulator
if A = 1, then ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. Each
intermediate 16-bit product is added to the contents of the accumulator half words to form
intermediate sums, which are placed into the corresponding half words in parameter d. If A = 1,
the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-370. Vector Multiply Bytes Odd, Signed by Unsigned, Modulo, Integer (to
Accumulator) (__ev_mbosumi[a])
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosumi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosumia d,a,b
63
d (and Accumulator if A=1)
0 16 32 48
intermediate products
x x x x
63
a (signed)
0 8 16 24 32 40 48 56
63
b (unsigned)
0 8 16 24 32 40 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-504 Freescale Semiconductor
SPE2 Operations
__ev_mbosumiaah __ev_mbosumiaah
Vector Multiply Byte Odd, Signed by Unsigned, Modulo, Integer and Accumulate Half Words
d = __ev_mbosumiaah (a,b)
temp0
0:15
a
8:15
su
b
8:15
d
0:15
temp0
0:15
+ACC
0:15
temp1
0:15
a
24:31
su
b
24:31
d
16:31
temp1
0:15
+ACC
16:31
temp2
0:15
a
40:47
su
b
40:47
d
32:47
temp2
0:15
+ACC
32:47
temp3
0:15
a
56:63
su
b
56:63
d
48:63
temp3
0:15
+ACC
48:63
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. Each
intermediate 16-bit product is added to the contents of the accumulator half words to form
intermediate sums, which are placed into the corresponding parameter d half words and into the
accumulator.
Other registers altered: ACC
Figure 3-371. Vector Multiply Bytes Odd, Signed by Unsigned, Modulo, Integer and
Accumulate Half Words (__ev_mbosumiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosumiaah d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
x x x x
63
a (signed)
0 8 16 24 32 40 48 56
63
b (unsigned)
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-505
__ev_mbosumianh __ev_mbosumianh
Vector Multiply Byte Odd, Signed by Unsigned, Modulo, Integer and Accumulate Negative Half
Words
d = __ev_mbosumianh (a,b)
temp0
0:15
a
8:15
su
b
8:15
d
0:15
ACC
0:15
-
temp0
0:15
temp1
0:15
a
24:31
su
b
24:31
d
16:31
ACC
16:31
- temp1
0:15
temp2
0:15
a
40:47
su
b
40:47
d
32:47
ACC
32:47
- temp2
0:15
temp3
0:15
a
56:63
su
b
56:63
d
48:63
ACC
48:63
- temp3
0:15
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. Each
intermediate 16-bit product is subtracted from the contents of the accumulator half words to form
intermediate differences, which are placed into the corresponding parameter d half words and into
the accumulator.
Other registers altered: ACC
Figure 3-372. Vector Multiply Bytes Odd, Signed by Unsigned, Modulo, Integer and
Accumulate Negative Half Words (__ev_mbosumianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosumianh d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
x x x x
63
a (signed)
0 8 16 24 32 40 48 56
63
b (unsigned)
0 8 16 24 32 40 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-506 Freescale Semiconductor
SPE2 Operations
__ev_mbosusiaah __ev_mbosusiaah
Vector Multiply Byte Odd, Signed by Unsigned, Saturate, Integer and Accumulate Half Words
d = __ev_mbosusiaah (a,b)
temp
0:15
a
8:15
su
b
8:15
temp
0:31
EXTS(ACC
0:15
) +
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
24:31
su
b
24:31
temp
0:31
EXTS(ACC
16:31
) +
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
40:47
su
b
40:47
temp
0:31
EXTS(ACC
32:47
) +
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
56:63
su
b
56:63
temp
0:31
EXTS(ACC
48:63
) +
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding odd-numbered byte signed integer elements in parameter a and unsigned
integer element in parameter b are multiplied, producing a 16-bit intermediate product. Each
16-bit product is then added to the corresponding half word in the accumulator, saturating if
overflow or underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-507
Figure 3-373. Vector Multiply Bytes Odd, Signed by Unsigned, Saturate, Integer and
Accumulate Half Words (__ev_mbosusiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosusiaah d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
x x x x
63
a (signed)
0 8 16 24 32 40 48 56
63
b (unsigned)
0 8 16 24 32 40 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-508 Freescale Semiconductor
SPE2 Operations
__ev_mbosusianh __ev_mbosusianh
Vector Multiply Byte Odd, Signed by Unsigned, Saturate, Integer and Accumulate Negative Half
Words
d = __ev_mbosusianh (a,b)
temp
0:15
a
8:15
su
b
8:15
temp
0:31
EXTS(ACC
0:15
) -
EXTS(temp
0:15
)
ovh0 (temp
15
temp
16
)
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
24:31
su
b
24:31
temp
0:31
EXTS(ACC
16:31
) -
EXTS(temp
0:15
)
ovh1 (temp
15
temp
16
)
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
40:47
su
b
40:47
temp
0:31
EXTS(ACC
32:47
) -
EXTS(temp
0:15
)
ovl0 (temp
15
temp
16
)
d
32:47
SATURATE(ovl0, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
temp
0:15
a
56:63
su
b
56:63
temp
0:31
EXTS(ACC
48:63
) -
EXTS(temp
0:15
)
ovl1 (temp
15
temp
16
)
d
48:63
SATURATE(ovl1, temp
15
, 0x8000, 0x7FFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding odd-numbered byte signed integer element in parameter a and unsigned integer
element in parameter b are multiplied, producing a 16-bit intermediate product. Each 16-bit
product is then subtracted from the corresponding half word in the accumulator, saturating if
overflow or underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the subtraction, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-509
Figure 3-374. Vector Multiply Bytes Odd, Signed by Unsigned, Saturate, Integer and
Accumulate Negative Half Words (__ev_mbosusianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbosusianh d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
x x x x
63
a (signed)
0 8 16 24 32 40 48 56
63
b (unsigned)
0 8 16 24 32 40 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-510 Freescale Semiconductor
SPE2 Operations
__ev_mboumi[a] __ev_mboumi[a]
Vector Multiply Bytes Odd, Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mboumi (a,b) (A=0)
d = __ev_mboumia (a,b) (A=1)
d
0:15
a
8:15
ui
b
8:15
d
16:31
a
24:31
ui
b
24:31
d
32:47
a
40:47
ui
b
40:47
d
48:63
a
56:63
ui
b
56:63
// update accumulator
if A = 1, then ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte unsigned
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding half words in parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-375. Vector Multiply Bytes Odd, Unsigned, Modulo, Integer (to Accumulator)
(__ev_mboumi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmboumi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmboumia d,a,b
63
d (and Accumulator if A=1)
0 16 32 48
intermediate products
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-511
__ev_mboumiaah __ev_mboumiaah
Vector Multiply Byte Odd, Unsigned, Modulo, Integer and Accumulate Half Words
d = __ev_mboumiaah (a,b)
temp0
0:15
a
8:15
ui
b
8:15
d
0:15
temp0
0:15
+ACC
0:15
temp1
0:15
a
24:31
ui
b
24:31
rD
16:31
temp1
0:15
+ACC
16:31
temp2
0:15
a
40:47
ui
b
40:47
d
32:47
temp2
0:15
+ACC
32:47
temp3
0:15
a
56:63
ui
b
56:63
d
48:63
temp3
0:15
+ACC
48:63
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte unsigned
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is added
to the contents of the accumulator half words to form intermediate sums, which are placed into the
corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-376. Vector Multiply Bytes Odd, Unsigned, Modulo, Integer and Accumulate Half
Words (__ev_mboumiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmboumiaah d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-512 Freescale Semiconductor
SPE2 Operations
__ev_mboumianh __ev_mboumianh
Vector Multiply Byte Odd, Unsigned, Modulo, Integer and Accumulate Negative Half Words
d = __ev_mboumianh (a,b)
temp0
0:15
a
8:15
ui
b
8:15
d
0:15
ACC
0:15
-
temp0
0:15
temp1
0:15
a
24:31
ui
b
24:31
d
16:31
ACC
16:31
- temp1
0:15
temp2
0:15
a
40:47
ui
b
40:47
d
32:47
ACC
32:47
- temp2
0:15
temp3
0:15
a
56:63
ui
b
56:63
d
48:63
ACC
48:63
- temp3
0:15
// update accumulator
ACC
0:63
d
0:63
For each half word element in the accumulator, the corresponding odd-numbered byte unsigned
integer elements in parameters a and b are multiplied. Each intermediate 16-bit product is
subtracted from the contents of the accumulator half words to form intermediate differences,
which are placed into the corresponding parameter d half words and into the accumulator.
Other registers altered: ACC
Figure 3-377. Vector Multiply Bytes Odd, Unsigned, Modulo, Integer and Accumulate
Negative Half Words (__ev_mboumianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmboumianh d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-513
__ev_mbousiaah __ev_mbousiaah
Vector Multiply Byte Odd, Unsigned, Saturate, Integer and Accumulate Half Words
d = __ev_mbousiaah (a,b)
temp
0:15
a
8:15
ui
b
8:15
temp
0:31
EXTZ(ACC
0:15
) +
EXTZ(temp
0:15
)
ovh0 temp
15
d
0:15
SATURATE(ovh0, 0, 0xFFFF, 0xFFFF, temp
16:31
)
temp
0:15
a
24:31
ui
b
24:31
temp
0:31
EXTZ(ACC
16:31
) +
EXTZ(temp
0:15
)
ovh1 temp
15
d
16:31
SATURATE(ovh1, 0, 0xFFFF, 0xFFFF, temp
16:31
)
temp
0:15
a
40:47
ui
b
40:47
temp
0:31
EXTZ(ACC
32:47
) +
EXTZ(temp
0:15
)
ovl0 temp
15
d
32:47
SATURATE(ovl0, 0, 0xFFFF, 0xFFFF, temp
16:31
)
temp
0:15
a
56:63
ui
b
56:63
temp
0:31
EXTZ(ACC
48:63
) +
EXTZ(temp
0:15
)
ovl1 temp
15
d
48:63
SATURATE(ovl1, 0, 0xFFFF, 0xFFFF, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding odd-numbered byte unsigned integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then added to the
corresponding half word in the accumulator, saturating if overflow occurs, and the result is placed
in parameter d and the accumulator.
If there is an overflow from the addition, the overflow and summary overflow bits are recorded in
the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-514 Freescale Semiconductor
SPE2 Operations
Figure 3-378. Vector Multiply Bytes Odd, Unsigned, Saturate, Integer and Accumulate Half
Words (__ev_mbousiaah)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbousiaah d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
+ + + +
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-515
__ev_mbousianh __ev_mbousianh
Vector Multiply Byte Odd, Unsigned, Saturate, Integer and Accumulate Negative Half Words
d = __ev_mbousianh (a,b)
temp
0:15
a
8:15
ui
b
8:15
temp
0:31
EXTZ(ACC
0:15
) -
EXTZ(temp
0:15
)
ovh0 temp
15
d
0:15
SATURATE(ovh0, 0, 0x0000, 0x0000, temp
16:31
)
temp
0:15
a
24:31
ui
b
24:31
temp
0:31
EXTZ(ACC
16:31
) -
EXTZ(temp
0:15
)
ovh1 temp
15
d
16:31
SATURATE(ovh1, 0, 0x0000, 0x0000, temp
16:31
)
temp
0:15
a
40:47
ui
b
40:47
temp
0:31
EXTZ(ACC
32:47
) -
EXTZ(temp
0:15
)
ovl0 temp
15
d
32:47
SATURATE(ovl0, 0, 0x0000, 0x0000, temp
16:31
)
temp
0:15
a
56:63
ui
b
56:63
temp
0:31
EXTZ(ACC
48:63
) -
EXTZ(temp
0:15
)
ovl1 temp
15
d
48:63
SATURATE(ovl1, 0, 0x0000, 0x0000, temp
16:31
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh0 | ovh1
SPEFSCR
OV
ovl0 | ovl1
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh0 | ovh1
SPEFSCR
SOV
SPEFSCR
SOV
| ovl0 | ovl1
The corresponding odd-numbered byte unsigned integer elements in parameters a and b are
multiplied, producing a 16-bit intermediate product. Each 16-bit product is then subtracted from
the corresponding half word in the accumulator, saturating if underflow occurs, and the result is
placed in parameter d and the accumulator.
If there is an underflow from the subtraction, the overflow and summary overflow bits are recorded
in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-516 Freescale Semiconductor
SPE2 Operations
Figure 3-379. Vector Multiply Bytes Odd, Unsigned, Saturate, Integer and Accumulate
Negative Half Words (__ev_mbousianh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmbousianh d,a,b
Accumulator
intermediate products
63
d and Accumulator
0 16 32 48
- - - -
x x x x
63
a
0 8 16 24 32 40 48 56
63
b
0 8 16 24 32 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-517
__ev_mergehi __ev_mergehi
Vector Merge High
d = __ev_mergehi (a,b)
d
0:31
a
0:31
d
32:63
b
0:31
The high-order elements of parameters a and b are merged and placed into parameter d, as shown
in Figure 3-380.
Figure 3-380. High-Order Element Merging (__ev_mergehi)
NOTE
To perform a vector splat high, specify the same register in parameters
a and b.
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmergehi d,a,b
0 31 32 63
A
B
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-518 Freescale Semiconductor
SPE2 Operations
__ev_mergehilo __ev_mergehilo
Vector Merge High/Low
d = __ev_mergehilo (a,b)
d
0:31
a
0:31
d
32:63
b
32:63
The high-order element of parameter a and the low-order element of parameter b are merged and
placed into parameter d, as shown in Figure 3-381.
Figure 3-381. High-Order Element Merging (__ev_mergehilo)
Application note: With appropriate specification of parameters a and b, evmergehi, evmergelo,
evmergehilo, and evmergelohi provide a full 32-bit permute of two source parameters.
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmergehilo d,a,b
0 31 32 63
a
b
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-519
__ev_mergelo __ev_mergelo
Vector Merge Low
d = __ev_mergelo (a,b)
d
0:31
a
32:63
d
32:63
b
32:63
The low-order elements of parameters a and b are merged and placed in parameter d, as shown in
Figure 3-382.
Figure 3-382. Low-Order Element Merging (__ev_mergelo)
NOTE
To perform a vector splat low, specify the same register in parameters
a and b.
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmergelo d,a,b
0 31 32 63
a
b
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-520 Freescale Semiconductor
SPE2 Operations
__ev_mergelohi __ev_mergelohi
Vector Merge Low/High
d = __ev_mergelohi (a,b)
d
0:31
a
32:63
d
32:63
b
0:31
The low-order element of parameter a and the high-order element of parameter b are merged and
placed into parameter d, as shown in Figure 3-383.
Figure 3-383. Low-Order Element Merging (__ev_mergelohi)
NOTE
To perform a vector swap, specify the same register in parameters a
and b.
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmergelohi d,a,b
0 31 32 63
a
b
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-521
__ev_mhegsmfaa __ev_mhegsmfaa
Vector Multiply Half Words, Even, Guarded, Signed, Modulo, Fractional and Accumulate
d = __ev_mhegsmfaa (a,b)
temp
0:31
a
32:47
sf
b
32:47
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low even-numbered, half-word signed fractional elements in parameters a and
b are multiplied. The product is added to the contents of the 64-bit accumulator, and the result is
placed into parameter d and the accumulator.
NOTE
This sum is a modulo sum. Neither overflow check nor saturation is
performed. Any overflow of the 64-bit sum is not recorded into the
SPEFSCR. If the two input operands are both -1.0, the intermediate
product is represented as +1.0.
Figure 3-384. __ev_mhegsmfaa (Even Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhegsmfaa d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
ssss_ssss_ssss_ssss...ssss
SPE2 Programming Interface Manual, Rev. 1.0-2
3-522 Freescale Semiconductor
SPE2 Operations
__ev_mhegsmfan __ev_mhegsmfan
Vector Multiply Half Words, Even, Guarded, Signed, Modulo, Fractional and Accumulate
Negative
d = __ev_mhegsmfan (a,b)
temp
0:31
a
32:47
sf
b
32:47
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low even-numbered, half-word signed fractional elements in parameters a and
b are multiplied. The product is subtracted from the contents of the 64-bit accumulator, and the
result is placed into parameter d and the accumulator.
NOTE
This difference is a modulo difference. Neither overflow check nor
saturation is performed. Any overflow of the 64-bit difference is not
recorded into the SPEFSCR. If the two input operands are both -1.0,
the intermediate product is represented as +1.0.
Figure 3-385. __ev_mhegsmfan (Even Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhegsmfan d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
ssss_ssss_ssss_ssss...sss
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-523
__ev_mhegsmiaa __ev_mhegsmiaa
Vector Multiply Half Words, Even, Guarded, Signed, Modulo, Integer and Accumulate
d = __ev_mhegsmiaa (a,b)
temp
0:31
a
32:47
si
b
32:47
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low even-numbered half-word signed integer elements in parameters a and b
are multiplied. The intermediate product is sign-extended and added to the contents of the 64-bit
accumulator, and the resulting sum is placed into parameter d and the accumulator.
NOTE
This sum is a modulo sum. Neither overflow check nor saturation is
performed. Any overflow of the 64-bit sum is not recorded into the
SPEFSCR.
Figure 3-386. __ev_mhegsmiaa (Even Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhegsmiaa d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
ssss_ssss_ssss_ssss...ssss
SPE2 Programming Interface Manual, Rev. 1.0-2
3-524 Freescale Semiconductor
SPE2 Operations
__ev_mhegsmian __ev_mhegsmian
Vector Multiply Half Words, Even, Guarded, Signed, Modulo, Integer and Accumulate Negative
d = __ev_mhegsmian (a,b)
temp
0:31
a
32:47
si
b
32:47
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low even-numbered half-word signed integer elements in parameters a and b
are multiplied. The intermediate product is sign-extended and subtracted from the contents of the
64-bit accumulator, and the result is placed into parameter d and into the accumulator.
NOTE
This difference is a modulo difference. Neither overflow check nor
saturation is performed. Any overflow of the 64-bit difference is not
recorded into the SPEFSCR.
Figure 3-387. __ev_mhegsmian (Even Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhegsmian d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
ssss_ssss_ssss_ssss...ssss
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-525
__ev_mhegumiaa __ev_mhegumiaa
Vector Multiply Half Words, Even, Guarded, Unsigned, Modulo, Integer and Accumulate
d = __ev_mhegumiaa (a,b)
temp
0:31
a
32:47
ui
b
32:47
temp
0:63
EXTZ(temp
0:31
)
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low even-numbered half-word unsigned integer elements in parameters a and
b are multiplied. The intermediate product is zero-extended and added to the contents of the 64-bit
accumulator. The resulting sum is placed into parameter d and into the accumulator.
NOTE
This sum is a modulo sum. Neither overflow check nor saturation is
performed. Any overflow of the 64-bit sum is not recorded into the
SPEFSCR.
Figure 3-388. __ev_mhegumiaa (Even Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhegumiaa d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
uuuu_uuuu_uuuu_uuuu...uuu
SPE2 Programming Interface Manual, Rev. 1.0-2
3-526 Freescale Semiconductor
SPE2 Operations
__ev_mhegumian __ev_mhegumian
Vector Multiply Half Words, Even, Guarded, Unsigned, Modulo, Integer and Accumulate Negative
d = __ev_mhegumian (a,b)
temp
0:31
a
32:47
ui
b
32:47
temp
0:63
EXTZ(temp
0:31
)
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low even-numbered unsigned integer elements in parameters a and b are
multiplied. The intermediate product is zero-extended and subtracted from the contents of the
64-bit accumulator. The result is placed into parameter d and into the accumulator.
NOTE
This difference is a modulo difference. Neither overflow check nor
saturation is performed. Any overflow of the 64-bit difference is not
recorded into the SPEFSCR.
Figure 3-389. __ev_mhegumian (Even Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhegumian d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
uuuu_uuuu_uuuu_uuuu...uuu
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-527
__ev_mhesmf[a] __ev_mhesmf[a]
Vector Multiply Half Words, Even, Signed, Modulo, Fractional (to Accumulator)
d = __ev_mhesmf (a,b) (A = 0)
d = __ev_mhesmfa (a,b) (A = 1)
// high
d
0:31
(a
0:15
sf
b
0:15
)
// low
d
32:63
(a
32:47
sf
b
32:47
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The corresponding even-numbered half-word signed fractional elements in parameters a and b are
multiplied, and the 32 bits of each product are placed into the corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-390. Even Multiply of Two Signed Modulo Fractional Elements (to Accumulator)
(__ev_mhesmf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmfa d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator if __ev_mhesmfa)
X X
SPE2 Programming Interface Manual, Rev. 1.0-2
3-528 Freescale Semiconductor
SPE2 Operations
__ev_mhesmfaaw __ev_mhesmfaaw
Vector Multiply Half Words, Even, Signed, Modulo, Fractional and Accumulate into Words
d = __ev_mhesmfaaw (a,b)
// high
temp
0:31
(a
0:15
sf
b
0:15
)
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
(a
32:47
sf
b
32:47
)
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half-word signed
fractional elements in parameters a and b are multiplied. The 32 bits of each intermediate product
are added to the contents of the accumulator words to form intermediate sums, which are placed
into the corresponding parameter d words and into the accumulator.
Other registers altered: ACC
Figure 3-391. Even Form of Vector Half-Word Multiply (__ev_mhesmfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmfaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-529
__ev_mhesmfanw __ev_mhesmfanw
Vector Multiply Half Words, Even, Signed, Modulo, Fractional and Accumulate Negative into Words
d = __ev_mhesmfanw (a,b)
// high
temp
0:31
a
0:15
sf
b
0:15
d
0:31
ACC
0:31
- temp
0:31
// low
temp
0:31
a
32:47
sf
b
32:47
d
32:63
ACC
32:63
- temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half-word signed
fractional elements in parameters a and b are multiplied. The 32-bit intermediate products are
subtracted from the contents of the accumulator words to form intermediate differences, which are
placed into the corresponding parameter d words and into the accumulator.
Other registers altered: ACC
Figure 3-392. Even Form of Vector Half-Word Multiply (__ev_mhesmfanw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmfanw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-530 Freescale Semiconductor
SPE2 Operations
__ev_mhesmi[a] __ev_mhesmi[a]
Vector Multiply Half Words, Even, Signed, Modulo, Integer (to Accumulator)
d = __ev_mhesmi (a,b) (A = 0)
d = __ev_mhesmia (a,b) (A = 1)
// high
d
0:31
a
0:15
si
b
0:15
// low
d
32:63
a
32:47
si
b
32:47
// update accumulator
if A = 1, then ACC
0:63
d
0:63
The corresponding even-numbered half-word signed integer elements in parameters a and b are
multiplied. The two 32-bit products are placed into the corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-393. Even Form for Vector Multiply (to Accumulator) (__ev_mhesmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmia d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator for __ev_mhesmia)
X X
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-531
__ev_mhesmiaaw __ev_mhesmiaaw
Vector Multiply Half Words, Even, Signed, Modulo, Integer and Accumulate into Words
d = __ev_mhesmiaaw (a,b)
// high
temp
0:31
a
0:15
si
b
0:15
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
a
32:47
si
b
32:47
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half-word signed
integer elements in parameters a and b are multiplied. Each intermediate 32-bit product is added
to the contents of the accumulator words to form intermediate sums, which are placed into the
corresponding parameter d words and into the accumulator.
Other registers altered: ACC
Figure 3-394. Even Form of Vector Half-Word Multiply (__ev_mhesmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-532 Freescale Semiconductor
SPE2 Operations
__ev_mhesmianw __ev_mhesmianw
Vector Multiply Half Words, Even, Signed, Modulo, Integer and Accumulate Negative into Words
d = __ev_mhesmianw (a,b)
// high
temp0
0:31
a
0:15
si
b
0:15
d
0:31
ACC
0:31
temp0
0:31
// low
temp1
0:31
a
32:47
si
b
32:47
d
32:63
ACC
32:63
temp1
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half-word signed
integer elements in parameters a and b are multiplied. Each intermediate 32-bit product is
subtracted from the contents of the accumulator words to form intermediate differences, which are
placed into the corresponding parameter d words and into the accumulator.
Other registers altered: ACC
Figure 3-395. Even Form of Vector Half-Word Multiply (__ev_mhesmianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesmianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-533
__ev_mhessf[a] __ev_mhessf[a]
Vector Multiply Half Words, Even, Signed, Saturate, Fractional (to Accumulator)
d = __ev_mhessf (a,b) (A = 0)
d = __ev_mhessfa (a,b) (A = 1)
// high
temp
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000)
&
(b
0:15
= 0x8000) then
d
0:31
0x7FFF_FFFF //saturate
movh 1
else
d
0:31
temp
0:31
movh 0
// low
temp
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000)
&
(b
32:47
= 0x8000) then
d
32:63
0x7FFF_FFFF //saturate
movl 1
else
d
32:63
temp
0:31
movl 0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh
SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh
SPEFSCR
SOV
SPEFSCR
SOV
| movl
The corresponding even-numbered half-word signed fractional elements in parameters a and b are
multiplied. The 32 bits of each product are placed into the corresponding words of parameter d. If
both inputs are -1.0, the result saturates to the largest positive signed fraction and the overflow and
summary overflow bits are recorded in the SPEFSCR.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: SPEFSCR
ACC (if A = 1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-534 Freescale Semiconductor
SPE2 Operations
Figure 3-396. Even Multiply of Two Signed Saturate Fractional Elements (to Accumulator)
(__ev_mhessf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhessf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhessfa d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator if __ev_mhessa)
X X
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-535
__ev_mhessfaaw __ev_mhessfaaw
Vector Multiply Half Words, Even, Signed, Saturate, Fractional and Accumulate into Words
d = __ev_mhessfaaw (a,b)
// high
temp
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temp
0:63
EXTS(ACC
0:31
) + EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
temp
0:63
EXTS(ACC
32:63
) + EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl
The corresponding even-numbered half-word signed fractional elements in parameters a and b are
multiplied, producing a 32-bit product. If both inputs are 1.0, the result saturates to
0x7FFF_FFFF. Each 32-bit product is then added to the corresponding word in the accumulator,
saturating if overflow or underflow occurs, and the result is placed in parameter d and the
accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-536 Freescale Semiconductor
SPE2 Operations
Figure 3-397. Even Form of Vector Half-Word Multiply (__ev_mhessfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhessfaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-537
__ev_mhessfanw __ev_mhessfanw
Vector Multiply Half Words, Even, Signed, Saturate, Fractional and Accumulate Negative into Words
d = __ev_mhessfanw (a,b)
// high
temp
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temp
0:63
EXTS(ACC
0:31
) - EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
temp
0:63
EXTS(ACC
32:63
) - EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl
The corresponding even-numbered half-word signed fractional elements in parameters a and b are
multiplied, producing a 32-bit product. If both inputs are 1.0, the result saturates to
0x7FFF_FFFF. Each 32-bit product is then subtracted from the corresponding word in the
accumulator, saturating if overflow or underflow occurs, and the result is placed in parameter d
and the accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-538 Freescale Semiconductor
SPE2 Operations
Figure 3-398. Even Form of Vector Half-Word Multiply (__ev_mhessfanw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhessfanw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-539
__ev_mhessiaaw __ev_mhessiaaw
Vector Multiply Half Words, Even, Signed, Saturate, Integer and Accumulate into Words
d = __ev_mhessiaaw (a,b)
// high
temp
0:31
a
0:15
si
b
0:15
temp
0:63
EXTS(ACC
0:31
) + EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
32:47
si
b
32:47
temp
0:63
EXTS(ACC
32:63
) + EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding even-numbered half-word signed integer elements in parameters a and b are
multiplied, producing a 32-bit product. Each 32-bit product is then added to the corresponding
word in the accumulator, saturating if overflow or underflow occurs, and the result is placed in
parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-399. Even Form of Vector Half-Word Multiply (__ev_mhessiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhessiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-540 Freescale Semiconductor
SPE2 Operations
__ev_mhessianw __ev_mhessianw
Vector Multiply Half Words, Even, Signed, Saturate, Integer and Accumulate Negative into Words
d = __ev_mhessianw (a,b)
// high
temp
0:31
a
0:15
si
b
0:15
temp
0:63
EXTS(ACC
0:31
) - EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
32:47
si
b
32:47
temp
0:63
EXTS(ACC
32:63
) - EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, the corresponding even-numbered half-word signed
integer elements in parameters a and b are multiplied, producing a 32-bit product. Each 32-bit
product is then subtracted from the corresponding word in the accumulator, saturating if overflow
or underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-400. Even Form of Vector Half-Word Multiply (__ev_mhessianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhessianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-541
__ev_mhesumi[a] __ev_mhesumi[a]
Vector Multiply Half Words, Even, Signed by Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mhesumi (a,b) (A = 0)
d = __ev_mhesumia (a,b) (A = 1)
// high
d
0:31
(a
0:15
su
b
0:15
)
// low
d
32:63
(a
32:47
su
b
32:47
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The corresponding even-numbered half word signed integer elements in parameter a and unsigned
integer elements in parameter b are multiplied then placed into the corresponding words of
parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-401. Even Multiply of Signed and Unsigned Modulo Integer Elements (to
Accumulator) (__ev_mhesumi)
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesumia d,a,b
15 16 47 48 0 31 32 63
a (signed)
b (unsigned)
d (and accumulator if __ev_mhesumia)
X X
SPE2 Programming Interface Manual, Rev. 1.0-2
3-542 Freescale Semiconductor
SPE2 Operations
__ev_mhesumiaaw __ev_mhesumiaaw
Vector Multiply Half Words, Even, Signed by Unsigned, Modulo, Integer and Accumulate into
Words
d = __ev_mhesumiaaw (a,b)
// high
temp
0:31
(a
0:15
su
b
0:15
)
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
(a
32:47
su
b
32:47
)
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half word signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. The 32
bits of each intermediate product are added to the contents of the accumulator words to form
intermediate sums, which are placed into the corresponding parameter d words and into the
accumulator.
Other registers altered: ACC
Figure 3-402. Even Form of Vector Half Word Multiply Signed by Unsigned Integer and
Accumulate (__ev_mhesumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesumiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
+ +
d and Accumulator
a (signed)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-543
__ev_mhesumianw __ev_mhesumianw
Vector Multiply Half Words, Even, Signed by Unsigned, Modulo, Integer and Accumulate
Negative into Words
d = __ev_mhesumianw (a,b)
// high
temp
0:31
a
0:15
su
b
0:15
d
0:31
ACC
0:31
- temp
0:31
// low
temp
0:31
a
32:47
su
b
32:47
d
32:63
ACC
32:63
- temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half word signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. The
32-bit intermediate products are subtracted from the contents of the accumulator words to form
intermediate differences, which are placed into the corresponding parameter d words and into the
accumulator.
Other registers altered: ACC
Figure 3-403. Even Form of Vector Half Word Multiply Signed by Unsigned Integer and
Accumulate Negated (__ev_mhesumianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesumianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
d and Accumulator
a (signed)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-544 Freescale Semiconductor
SPE2 Operations
__ev_mhesusiaaw __ev_mhesusiaaw
Vector Multiply Half Words, Even, Signed by Unsigned, Saturate, Integer and Accumulate into
Words
d = __ev_mhesusiaaw (a,b)
// high
temph
0:31
a
0:15
su
b
0:15
temp
0:63
EXTS(ACC
0:31
) + EXTS(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
templ
0:31
a
32:47
su
b
32:47
temp
0:63
EXTS(ACC
32:63
) + EXTS(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding even-numbered half word signed integer element in parameter a and unsigned
integer element in parameter b are multiplied producing a 32-bit product. Each 32-bit product is
then added to the corresponding word in the accumulator saturating if overflow or underflow
occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-404. Even Form of Vector Half Word Multiply Signed by Unsigned Saturate
Integer and Accumulate (__ev_mhesusiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesusiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
+ +
d and Accumulator
a (signed)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-545
__ev_mhesusianw __ev_mhesusianw
Vector Multiply Half Words, Even, Signed by Unsigned, Saturate, Integer and Accumulate
Negative into Words
d = __ev_mhesusianw (a,b)
// high
temph
0:31
a
0:15
su
b
0:15
temp
0:63
EXTS(ACC
0:31
) - EXTS(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
templ
0:31
a
32:47
su
b
32:47
temp
0:63
EXTS(ACC
32:63
) - EXTS(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding even-numbered half word signed integer element in parameter a and unsigned
integer element in parameter b are multiplied producing a 32-bit product. Each 32-bit product is
then subtracted from the corresponding word in the accumulator saturating if overflow or
underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-405. Even Form of Vector Half Word Multiply Signed by Unsigned Integer and
Accumulate Negated (__ev_mhesusianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhesusianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
+ +
d and Accumulator
a (signed)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-546 Freescale Semiconductor
SPE2 Operations
__ev_mheumi[a] __ev_mheumi[a]
Vector Multiply Half Words, Even, Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mheumi (a,b) (A = 0)
d = __ev_mheumia (a,b) (A = 1)
// high
d
0:31
a
0:15
ui
b
0:15
// low
d
32:63
a
32:47
ui
b
32:47
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The corresponding even-numbered half-word unsigned integer elements in parameters a and b are
multiplied. The two 32-bit products are placed into the corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Figure 3-406. Vector Multiply Half Words, Even, Unsigned, Modulo, Integer (to
Accumulator) (__ev_mheumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmheumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmheumia d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator if __ev_mheumia)
X X
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-547
__ev_mheumiaaw __ev_mheumiaaw
Vector Multiply Half Words, Even, Unsigned, Modulo, Integer and Accumulate into Words
d = __ev_mheumiaaw (a,b)
// high
temp
0:31
a
0:15
ui
b
0:15
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
a
32:47
ui
b
32:47
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half-word unsigned
integer elements in parameters a and b are multiplied. Each intermediate product is added to the
contents of the corresponding accumulator words, and the sums are placed into the corresponding
parameter d and accumulator words.
Other registers altered: ACC
Figure 3-407. Even Form of Vector Half-Word Multiply (__ev_mheumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmheumiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-548 Freescale Semiconductor
SPE2 Operations
__ev_mheumianw __ev_mheumianw
Vector Multiply Half Words, Even, Unsigned, Modulo, Integer and Accumulate Negative into Words
d = __ev_mheumianw (a,b)
// high
temp
0:31
a
0:15
ui
b
0:15
d
0:31
ACC
0:31
- temp
0:31
// low
temp
0:31
a
32:47
ui
b
32:47
d
32:63
ACC
32:63
- temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding even-numbered half-word unsigned
integer elements in parameters a and b are multiplied. Each intermediate product is subtracted
from the contents of the corresponding accumulator words. The differences are placed into the
corresponding parameter d and accumulator words.
Other registers altered: ACC
Figure 3-408. Even Form of Vector Half-Word Multiply (__ev_mheumianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmheumianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
- -
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-549
__ev_mheusiaaw __ev_mheusiaaw
Vector Multiply Half Words, Even, Unsigned, Saturate, Integer and Accumulate into Words
d = __ev_mheusiaaw (a,b)
// high
temp
0:31
a
0:15
ui
b
0:15
temp
0:63
EXTZ(ACC
0:31
) + EXTZ(temp
0:31
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
//low
temp
0:31
a
32:47
ui
b
32:47
temp
0:63
EXTZ(ACC
32:63
) + EXTZ(temp
0:31
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding even-numbered half-word unsigned
integer elements in parameters a and b are multiplied, producing a 32-bit product. Each 32-bit
product is then added to the corresponding word in the accumulator, saturating if overflow occurs,
and the result is placed in parameter d and the accumulator.
If there is an overflow from the addition, the overflow and summary overflow bits are recorded in
the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-409. Even Form of Vector Half-Word Multiply (__ev_mheusiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmheusiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-550 Freescale Semiconductor
SPE2 Operations
__ev_mheusianw __ev_mheusianw
Vector Multiply Half Words, Even, Unsigned, Saturate, Integer and Accumulate Negative into Words
d = __ev_mheusianw (a,b)
// high
temp
0:31
a
0:15
ui
b
0:15
temp
0:63
EXTZ(ACC
0:31
) - EXTZ(temp
0:31
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0x0000_0000, 0x0000_0000, temp
32:63
)
//low
temp
0:31
a
32:47
ui
b
32:47
temp
0:63
EXTZ(ACC
32:63
) - EXTZ(temp
0:31
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0x0000_0000, 0x0000_0000, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding even-numbered half-word unsigned
integer elements in parameters a and b are multiplied, producing a 32-bit product. Each 32-bit
product is then subtracted from the corresponding word in the accumulator, saturating if underflow
occurs, and the result is placed in parameter d and the accumulator.
If there is an underflow from the subtraction, the overflow and summary overflow bits are recorded
in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-410. Even Form of Vector Half-Word Multiply (__ev_mheusianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmheusianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-551
__ev_mhogsmfaa __ev_mhogsmfaa
Vector Multiply Half Words, Odd, Guarded, Signed, Modulo, Fractional and Accumulate
d = __ev_mhogsmfaa (a,b)
temp
0:31
a
48:63
sf
b
48:63
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low odd-numbered half-word signed fractional elements in parameters a and b
are multiplied. The intermediate product is sign-extended to 64 bits and added to the contents of
the 64-bit accumulator. This result is placed into parameter d and into the accumulator.
NOTE
This sum is a modulo sum. Neither overflow check nor saturation is
performed. If an overflow from the 64-bit sum occurs, it is not
recorded into the SPEFSCR.
Figure 3-411. __ev_mhogsmfaa (Odd Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhogsmfaa d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
ssss_ssss_ssss_ssss...ssss
SPE2 Programming Interface Manual, Rev. 1.0-2
3-552 Freescale Semiconductor
SPE2 Operations
__ev_mhogsmfan __ev_mhogsmfan
Vector Multiply Half Words, Odd, Guarded, Signed, Modulo, Fractional and Accumulate Negative
d = __ev_mhogsmfan (a,b)
temp
0:31
a
48:63
sf
b
48:63
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low odd-numbered half-word signed fractional elements in parameters a and b
are multiplied. The intermediate product is sign-extended to 64 bits and subtracted from the
contents of the 64-bit accumulator. This result is placed into parameter d and into the accumulator.
NOTE
This difference is a modulo difference. Neither overflow check nor
saturation is performed. Any overflow of the 64-bit difference is not
recorded into the SPEFSCR. If the two input operands are both -1.0,
the intermediate product is represented as +1.0.
Figure 3-412. __ev_mhogsmfan (Odd Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhogsmfan d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
ssss_ssss_ssss_ssss...ssss
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-553
__ev_mhogsmiaa __ev_mhogsmiaa
Vector Multiply Half Words, Odd, Guarded, Signed, Modulo, Intege and Accumulate
d = __ev_mhogsmiaa (a,b)
temp
0:31
a
48:63
si
b
48:63
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low odd-numbered half-word signed integer elements in parameters a and b
are multiplied. The intermediate product is sign-extended to 64 bits and added to the contents of
the 64-bit accumulator. This sum is placed into parameter d and into the accumulator.
NOTE
This sum is a modulo sum. Neither overflow check nor saturation is
performed. An overflow from the 64-bit sum, if one occurs, is not
recorded into the SPEFSCR.
Figure 3-413. __ev_mhogsmiaa (Odd Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhogsmiaa d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
ssss_ssss_ssss_ssss...ssss
SPE2 Programming Interface Manual, Rev. 1.0-2
3-554 Freescale Semiconductor
SPE2 Operations
__ev_mhogsmian __ev_mhogsmian
Vector Multiply Half Words, Odd, Guarded, Signed, Modulo, Integer and Accumulate Negative
d = __ev_mhogsmian (a,b)
temp
0:31
a
48:63
si
b
48:63
temp
0:63
EXTS(temp
0:31
)
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low odd-numbered half-word signed integer elements in parameters a and b
are multiplied. The intermediate product is sign-extended to 64 bits and subtracted from the
contents of the 64-bit accumulator. This result is placed into parameter d and into the accumulator.
NOTE
This difference is a modulo difference. Neither overflow check nor
saturation is performed. Any overflow of the 64-bit difference is not
recorded into the SPEFSCR.
Figure 3-414. __ev_mhogsmian (Odd Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhogsmian d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
ssss_ssss_ssss_ssss...ssss
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-555
__ev_mhogumiaa __ev_mhogumiaa
Vector Multiply Half Words, Odd, Guarded, Unsigned, Modulo, Integer and Accumulate
d = __ev_mhogumiaa (a,b)
temp
0:31
a
48:63
ui
b
48:63
temp
0:63
EXTZ(temp
0:31
)
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low odd-numbered half-word unsigned integer elements in parameters a and b
are multiplied. The intermediate product is zero-extended to 64 bits and added to the contents of
the 64-bit accumulator. This sum is placed into parameter d and into the accumulator.
NOTE
This sum is a modulo sum. Neither overflow check nor saturation is
performed. An overflow from the 64-bit sum, if one occurs, is not
recorded into the SPEFSCR.
Figure 3-415. __ev_mhogumiaa (Odd Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhogumiaa d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
uuuu_uuuu_uuuu_uuuu...uuu
SPE2 Programming Interface Manual, Rev. 1.0-2
3-556 Freescale Semiconductor
SPE2 Operations
__ev_mhogumian __ev_mhogumian
Vector Multiply Half Words, Odd, Guarded, Unsigned, Modulo, Integer and Accumulate Negative
d = __ev_mhogumian (a,b)
temp
0:31
a
48:63
ui
b
48:63
temp
0:63
EXTZ(temp
0:31
)
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low odd-numbered half-word unsigned integer elements in parameters a and b
are multiplied. The intermediate product is zero-extended to 64 bits and subtracted from the
contents of the 64-bit accumulator. This result is placed into parameter d and into the accumulator.
NOTE
This difference is a modulo difference. Neither overflow check nor
saturation is performed. Any overflow of the 64-bit difference is not
recorded into the SPEFSCR.
Figure 3-416. __ev_mhogumian (Odd Form)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhogumian d,a,b
47 48 0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
uuuu_uuuu_uuuu_uuuu...uuu
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-557
__ev_mhosmf[a] __ev_mhosmf[a]
Vector Multiply Half Words, Odd, Signed, Modulo, Fractional (to Accumulator)
d = __ev_mhosmf (a,b) (A = 0)
d = __ev_mhosmfa (a,b) (A = 1)
// high
d
0:31
a
16:31
sf
b
16:31
// low
d
32:63
a
48:63
sf
b
48:63
// update accumulator
if A = 1, then ACC
0:63
d
0:63
The corresponding odd-numbered, half-word signed fractional elements in parameters a and b are
multiplied. Each product is placed into the corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-417. Vector Multiply Half Words, Odd, Signed, Modulo, Fractional (to
Accumulator) (__ev_mhosmf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmfa d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator if __ev_mhosmf)
X X
SPE2 Programming Interface Manual, Rev. 1.0-2
3-558 Freescale Semiconductor
SPE2 Operations
__ev_mhosmfaaw __ev_mhosmfaaw
Vector Multiply Half Words, Odd, Signed, Modulo, Fractional and Accumulate into Words
d = __ev_mhosmfaaw (a,b)
// high
temp
0:31
a
16:31
sf
b
16:31
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
a
48:63
sf
b
48:63
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half-word signed
fractional elements in parameters a and b are multiplied. The 32 bits of each intermediate product
is added to the contents of the corresponding accumulator word, and the results are placed into the
corresponding parameter d words and into the accumulator
Other registers altered: ACC
Figure 3-418. Odd Form of Vector Half-Word Multiply (__ev_mhosmfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmfaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-559
__ev_mhosmfanw __ev_mhosmfanw
Vector Multiply Half Words, Odd, Signed, Modulo, Fractional and Accumulate Negative into Words
d = __ev_mhosmfanw (a,b)
// high
temp
0:31
a
16:31
sf
b
16:31
d
0:31
ACC
0:31
- temp
0:31
// low
temp
0:31
a
48:63
sf
b
48:63
d
32:63
ACC
32:63
- temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half-word signed
fractional elements in parameters a and b are multiplied. The 32 bits of each intermediate product
is subtracted from the contents of the corresponding accumulator word. The word and the results
are placed into the corresponding parameter d word and into the accumulator.
Other registers altered: ACC
Figure 3-419. Odd Form of Vector Half-Word Multiply (__ev_mhosmfanw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmfanw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-560 Freescale Semiconductor
SPE2 Operations
__ev_mhosmi[a] __ev_mhosmi[a]
Vector Multiply Half Words, Odd, Signed, Modulo, Integer (to Accumulator)
d = __ev_mhosmi (a,b) (A = 0)
d = __ev_mhosmia (a,b) (A = 1)
// high
d
0:31
a
16:31
si
b
16:31
// low
d
32:63
a
48:63
si
b
48:63
// update accumulator
if A = 1, then ACC
0:63
d
0:63
The corresponding odd-numbered half-word signed integer elements in parameters a and b are
multiplied. The two 32-bit products are placed into the corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-420. Vector Multiply Half Words, Odd, Signed, Modulo, Integer (to Accumulator)
(__ev_mhosmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmia d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator if __ev_mhosmia)
X X
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-561
__ev_mhosmiaaw __ev_mhosmiaaw
Vector Multiply Half Words, Odd, Signed, Modulo, Integer and Accumulate into Words
d = __ev_mhosmiaaw (a,b)
// high
temp
0:31
a
16:31
si
b
16:31
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
a
48:63
si
b
48:63
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half-word signed
integer elements in parameters a and b are multiplied. Each intermediate 32-bit product is added
to the contents of the corresponding accumulator word and the results are placed into the
corresponding parameter d words and into the accumulator.
Other registers altered: ACC
Figure 3-421. Odd Form of Vector Half-Word Multiply (__ev_mhosmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-562 Freescale Semiconductor
SPE2 Operations
__ev_mhosmianw __ev_mhosmianw
Vector Multiply Half Words, Odd, Signed, Modulo, Integer and Accumulate Negative into Words
d = __ev_mhosmianw (a,b)
// high
temp
0:31
a
16:31
si
b
16:31
d
0:31
ACC
0:31
- temp
0:31
// low
temp
0:31
a
48:63
si
b
48:63
d
32:63
ACC
32:63
- temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half-word signed
integer elements in parameters a and b are multiplied. Each intermediate 32-bit product is
subtracted from the contents of the corresponding accumulator word and the results are placed into
the corresponding parameter d words and into the accumulator.
Other registers altered: ACC
Figure 3-422. Odd Form of Vector Half-Word Multiply (__ev_mhosmianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosmianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
- -
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-563
__ev_mhossf[a] __ev_mhossf[a]
Vector Multiply Half Words, Odd, Signed, Saturate, Fractional (to Accumulator)
d = __ev_mhossf (a,b) (A = 0)
d = __ev_mhossfa (a,b) (A = 1)
// high
temp
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000)
&
(b
16:31
= 0x8000) then
d
0:31
0x7FFF_FFFF //saturate
movh 1
else
d
0:31
temp
0:31
movh 0
// low
temp
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000)
&
(b
48:63
= 0x8000) then
d
32:63
0x7FFF_FFFF //saturate
movl 1
else
d
32:63
temp
0:31
movl 0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh
SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh
SPEFSCR
SOV
SPEFSCR
SOV
| movl
The corresponding odd-numbered half-word signed fractional elements in parameters a and b are
multiplied. The 32 bits of each product are placed into the corresponding words of parameter d. If
both inputs are -1.0, the result saturates to the largest positive signed fraction and the overflow and
summary overflow bits are recorded in the SPEFSCR.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: SPEFSCR
ACC (if A = 1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-564 Freescale Semiconductor
SPE2 Operations
Figure 3-423. Vector Multiply Half Words, Odd, Signed, Saturate, Fractional (to
Accumulator) (__ev_mhossf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhossf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhossfa d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator if __ev_mhossfa)
X X
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-565
__ev_mhossfaaw __ev_mhossfaaw
Vector Multiply Half Words, Odd, Signed, Saturate, Fractional and Accumulate into Words
d = __ev_mhossfaaw (a,b)
// high
temp
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temp
0:63
EXTS(ACC
0:31
) + EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
temp
0:63
EXTS(ACC
32:63
) + EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh ; SPEFSCR
OV
ovl | movl ;
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh ; SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl
The corresponding odd-numbered half-word signed fractional elements in parameters a and b are
multiplied, producing a 32-bit product. If both inputs are -1.0, the result saturates to
0x7FFF_FFFF. Each 32-bit product is then added to the corresponding word in the accumulator,
saturating if overflow or underflow occurs, and the result is placed in parameter d and the
accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-424. Odd Form of Vector Half-Word Multiply (__ev_mhossfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhossfaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-566 Freescale Semiconductor
SPE2 Operations
__ev_mhossfanw __ev_mhossfanw
Vector Multiply Half Words, Odd, Signed, Saturate, Fractional and Accumulate Negative into Words
d = __ev_mhossfanw (a,b)
// high
temp
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temp
0:63
EXTS(ACC
0:31
) - EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temp
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
temp
0:63
EXTS(ACC
32:63
) - EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh ; SPEFSCR
OV
ovl | movl ;
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh ; SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl ;
The corresponding odd-numbered half-word signed fractional elements in parameters a and b are
multiplied, producing a 32-bit product. If both inputs are -1.0, the result saturates to
0x7FFF_FFFF. Each 32-bit product is then subtracted from the corresponding word in the
accumulator, saturating if overflow or underflow occurs, and the result is placed in parameter d
and the accumulator.
If there is an overflow or underflow from either the multiply or the subtraction, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-425. Odd Form of Vector Half-Word Multiply (__ev_mhossfanw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhossfanw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
--
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-567
__ev_mhossiaaw __ev_mhossiaaw
Vector Multiply Half Words, Odd, Signed, Saturate, Integer and Accumulate into Words
d = __ev_mhossiaaw (a,b)
// high
temp
0:31
a
16:31
si
b
16:31
temp
0:63
EXTS(ACC
0:31
) + EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
48:63
si
b
48:63
temp
0:63
EXTS(ACC
32:63
) + EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding odd-numbered half-word signed integer elements in parameters a and b are
multiplied, producing a 32-bit product. Each 32-bit product is then added to the corresponding
word in the accumulator, saturating if overflow or underflow occurs, and the result is placed in
parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-426. Odd Form of Vector Half-Word Multiply (__ev_mhossiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhossiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-568 Freescale Semiconductor
SPE2 Operations
__ev_mhossianw __ev_mhossianw
Vector Multiply Half Words, Odd, Signed, Saturate, Integer and Accumulate Negative into Words
d = __ev_mhossianw (a,b)
// high
temp
0:31
a
16:31
si
b
16:31
temp
0:63
EXTS(ACC
0:31
) - EXTS(temp
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:31
a
48:63
si
b
48:63
temp
0:63
EXTS(ACC
32:63
) - EXTS(temp
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding odd-numbered half-word signed integer elements in parameter a and b are
multiplied, producing a 32-bit product. Each 32-bit product is then subtracted from the
corresponding word in the accumulator, saturating if overflow or underflow occurs, and the result
is placed in parameter d and the accumulator.
If there is an overflow or underflow from the subtraction, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-427. Odd Form of Vector Half-Word Multiply (__ev_mhossianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhossianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and Accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-569
__ev_mhosumi[a] __ev_mhosumi[a]
Vector Multiply Half Words, Odd, Signed by Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mhosumi (a,b) (A = 0)
d = __ev_mhosumia (a,b) (A = 1)
// high
d
0:31
(a
16:31
su
b
16:31
)
// low
d
32:63
(a
48:63
su
b
48:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The corresponding odd-numbered, half word signed integer element in parameter a and unsigned
integer element in parameter b are multiplied. Each product is placed into the corresponding words
of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-428. Vector Multiply Half Words, Odd, Signed by Unsigned, Modulo, Integer (to
Accumulator) (__ev_mhosumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosumia d,a,b
15 16 47 48 0 31 32 63
a (signed)
b (unsigned)
d (and accumulator if __ev_mhosumia)
X X
SPE2 Programming Interface Manual, Rev. 1.0-2
3-570 Freescale Semiconductor
SPE2 Operations
__ev_mhosumiaaw __ev_mhosumiaaw
Vector Multiply Half Words, Odd, Signed by Unsigned, Modulo, Integer and Accumulate into
Words
d = __ev_mhosumiaaw (a,b)
// high
temp
0:31
a
16:31
su
b
16:31
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
a
48:63
su
b
48:63
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half word signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. The 32
bits of each intermediate product is added to the contents of the corresponding accumulator word
and the results are placed into the corresponding parameter d words and into the accumulator
Other registers altered: ACC
Figure 3-429. Odd Form of Vector Half Word Multiply (__ev_mhosumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosumiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
+ +
d and Accumulator
a (signed)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-571
__ev_mhosumianw __ev_mhosumianw
Vector Multiply Half Words, Odd, Signed by Unsigned, Modulo, Integer and Accumulate
Negative into Words
d = __ev_mhosumianw (a,b)
// high
temp
0:31
a
16:31
su
b
16:31
d
0:31
ACC
0:31
- temp
0:31
// low
temp
0:31
a
48:63
su
b
48:63
d
32:63
ACC
32:63
- temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half word signed
integer element in parameter a and unsigned integer element in parameter b are multiplied. The 32
bits of each intermediate product is subtracted from the contents of the corresponding accumulator
word and the results are placed into the corresponding parameter d words and into the accumulator.
Other registers altered: ACC
Figure 3-430. Odd Form of Vector half word Multiply (__ev_mhosumianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosumianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
d and Accumulator
a (signed)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-572 Freescale Semiconductor
SPE2 Operations
__ev_mhosusiaaw __ev_mhosusiaaw
Vector Multiply Half Words, Odd, Signed by Unsigned, Saturate, Integer and Accumulate into
Words
d = __ev_mhosusiaaw (a,b)
// high
temph
0:31
a
16:31
su
b
16:31
temp
0:63
EXTS(ACC
0:31
) + EXTS(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
templ
0:31
a
48:63
su
b
48:63
temp
0:63
EXTS(ACC
32:63
) + EXTS(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding odd-numbered half word signed integer element in parameter a and unsigned
integer element in parameter b are multiplied producing a 32-bit product. Each 32-bit product is
then added to the corresponding word in the accumulator saturating if overflow or underflow
occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-431. Odd Form of Vector half word Multiply (__ev_mhosusiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosusiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
+ +
d and Accumulator
a (signed)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-573
__ev_mhosusianw __ev_mhosusianw
Vector Multiply Half Words, Odd, Signed by Unsigned, Saturate, Integer and Accumulate
Negative into Words
d = __ev_mhosusianw (a,b)
// high
temph
0:31
a
16:31
su
b
16:31
temp
0:63
EXTS(ACC
0:31
) - EXTS(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
templ
0:31
a
48:63
su
b
48:63
temp
0:63
EXTS(ACC
32:63
) - EXTS(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding odd-numbered half word signed integer element in parameter a and unsigned
integer element in parameter b are multiplied producing a 32-bit product. Each 32-bit product is
then subtracted from the corresponding word in the accumulator saturating if overflow or
underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from either the multiply or the subtraction, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-432. Odd Form of Vector half word Multiply (__ev_mhosusianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhosusianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b (unsigned)
X X
--
d and Accumulator
a (signed)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-574 Freescale Semiconductor
SPE2 Operations
__ev_mhoumi[a] __ev_mhoumi[a]
Vector Multiply Half Words, Odd, Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mhoumi (a,b) (A = 0)
d = __ev_mhoumia (a,b) (A = 1)
// high
d
0:31
a
16:31
ui
b
16:31
// low
d
32:63
a
48:63
ui
b
48:63
// update accumulator
if A = 1, then ACC
0:63
d
0:63
The corresponding odd-numbered half-word unsigned integer elements in parameters a and b are
multiplied. The two 32-bit products are placed into the corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-433. Vector Multiply Half Words, Odd, Unsigned, Modulo, Integer (to
Accumulator) (__ev_mhoumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhoumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhoumia d,a,b
15 16 47 48 0 31 32 63
a
b
d (and accumulator if __ev_mhoumi
X X
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-575
__ev_mhoumiaaw __ev_mhoumiaaw
Vector Multiply Half Words, Odd, Unsigned, Modulo, Integer and Accumulate into Words
d = __ev_mhoumiaaw (a,b)
// high
temp
0:31
a
16:31
ui
b
16:31
d
0:31
ACC
0:31
+ temp
0:31
// low
temp
0:31
a
48:63
ui
b
48:63
d
32:63
ACC
32:63
+ temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half-word unsigned
integer elements in parameters a and b are multiplied. Each intermediate product is added to the
contents of the corresponding accumulator word. The sums are placed into the corresponding
parameter d and accumulator words.
Other registers altered: ACC
Figure 3-434. Odd Form of Vector Half-Word Multiply (__ev_mhoumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhoumiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-576 Freescale Semiconductor
SPE2 Operations
__ev_mhoumianw __ev_mhoumianw
Vector Multiply Half Words, Odd, Unsigned, Modulo, Integer and Accumulate Negative into Words
d = __ev_mhoumianw (a,b)
// high
temp
0:31
a
0:15
ui
b
0:15
d
0:31
ACC
0:31
- temp
0:31
// low
temp
0:31
a
32:47
ui
b
32:47
d
32:63
ACC
32:63
- temp
0:31
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding odd-numbered half-word unsigned
integer elements in parameters a and b are multiplied. Each intermediate product is subtracted
from the contents of the corresponding accumulator word. The results are placed into the
corresponding parameter d and accumulator words.
Other registers altered: ACC
Figure 3-435. Odd Form of Vector Half-Word Multiply (__ev_mhoumianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhoumianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-577
__ev_mhousiaaw __ev_mhousiaaw
Vector Multiply Half Words, Odd, Unsigned, Saturate, Integer and Accumulate into Words
d = __ev_mhousiaaw (a,b)
// high
temp
0:31
a
16:31
ui
b
16:31
temp
0:63
EXTZ(ACC
0:31
) + EXTZ(temp
0:31
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
//low
temp
0:31
a
48:63
ui
b
48:63
temp
0:63
EXTZ(ACC
32:63
) + EXTZ(temp
0:31
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding odd-numbered half-word unsigned
integer elements in parameters a and b are multiplied, producing a 32-bit product. Each 32-bit
product is then added to the corresponding word in the accumulator, saturating if overflow occurs,
and the result is placed in parameter d and the accumulator.
If there is an overflow from the addition, the overflow and summary overflow bits are recorded in
the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-436. Odd Form of Vector Half Word Multiply (__ev_mhousiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhousiaaw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
+ +
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-578 Freescale Semiconductor
SPE2 Operations
__ev_mhousianw __ev_mhousianw
Vector Multiply Half Words, Odd, Unsigned, Saturate, Integer and Accumulate Negative into Words
d = __ev_mhousianw (a,b)
// high
temp
0:31
a
16:31
ui
b
16:31
temp
0:63
EXTZ(ACC
0:31
) - EXTZ(temp
0:31
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0, 0, temp
32:63
)
//low
temp
0:31
a
48:63
ui
b
48:63
temp
0:63
EXTZ(ACC
32:63
) - EXTZ(temp
0:31
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0, 0, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding odd-numbered half-word unsigned
integer elements in parameters a and b are multiplied, producing a 32-bit product. Each 32-bit
product is then subtracted from the corresponding word in the accumulator, saturating if overflow
occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow from the subtraction, the overflow and summary overflow bits are recorded
in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-437. Odd Form of Vector Half Word Multiply (__ev_mhousianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhousianw d,a,b
15 16 47 48 0 31 32 63
Intermediate product
Accumulator
b
X X
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-579
__ev_mhssf __ev_mhssf
Vector Multiply Half Word Signed, Saturate, Fractional
d = __ev_mhssf (a,b)
temp0
0:31
a
0:15
sf
b
0:15
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temp0
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temp1
0:31
a
16:31
sf
b
16:31
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temp1
0:31
0x7FFF_FFFF //saturate
movh 1
temp2
0:31
a
32:47
sf
b
32:47
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temp2
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
temp3
0:31
a
48:63
sf
b
48:63
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temp3
0:31
0x7FFF_FFFF //saturate
movl 1
d
0:15
temp0
0:15 ;
d
16:31
temp1
0:15
; d
32:47
temp2
0:15
; d
48:63
temp3
0:15
// update SPEFSCR
SPEFSCR
OVH
movh; SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh; SPEFSCR
SOV
SPEFSCR
SOV
| movl
For each half word element in the destination, corresponding half word pairs of signed fractional
elements in parameters a and b are multiplied, and the result is placed into the corresponding
parameter d half word. If both inputs of a multiply are -1.0, the result saturates to 0x7FFF. The
overflow and summary overflow bits are recorded in the SPEFSCR based on an underflow from
the multiply.
Other registers altered: SPEFSCR
Figure 3-438. Vector Multiply Half Word, Signed, Saturate, Fractional (__ev_mhssf)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhssf d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-580 Freescale Semiconductor
SPE2 Operations
__ev_mhssfr __ev_mhssfr
Vector Multiply Half Word Signed, Saturate, Fractional and Round
d = __ev_mhssfr (a,b)
For each half word element in the destination, corresponding half word pairs of signed fractional
elements in parameters a and b are multiplied, then rounded by adding 1/2 lsb, and the result is
placed into the corresponding parameter d half word. If both inputs of a multiply are -1.0, the result
saturates to 0x7FFF. The overflow and summary overflow bits are recorded in the SPEFSCR based
on an underflow from the multiply.
Other registers altered: SPEFSCR
Figure 3-439. Vector Multiply Half Word, Signed, Saturate, Fractional and Round
(__ev_mhssfr)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhssfr d,a,b
temp0
0:31
(a
0:15
sf
b
0:15
) + 0x0000_8000
if (a
0:15
= 0x8000) & (b
0:15
= 0x8000) then
temp0
0:31
0x7FFF_FFFF //saturate
movh 1
else
movh 0
temp1
0:31
(a
16:31
sf
b
16:31
) + 0x0000_8000
if (a
16:31
= 0x8000) & (b
16:31
= 0x8000) then
temp1
0:31
0x7FFF_FFFF //saturate
movh 1
temp2
0:31
(a
32:47
sf
b
32:47
) + 0x0000_8000
if (a
32:47
= 0x8000) & (b
32:47
= 0x8000) then
temp2
0:31
0x7FFF_FFFF //saturate
movl 1
else
movl 0
temp3
0:31
(a
48:63
sf
b
48:63
) + 0x0000_8000
if (a
48:63
= 0x8000) & (b
48:63
= 0x8000) then
temp3
0:31
0x7FFF_FFFF //saturate
movl 1
d
0:15
temp0
0:15
d
16:31
temp1
0:15
d
32:47
temp2
0:15
d
48:63
temp3
0:15
// update SPEFSCR
SPEFSCR
OVH
movh
SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh
SPEFSCR
SOV
SPEFSCR
SOV
| movl
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
+ 0x0000_8000
d
+ 0x0000_8000 + 0x0000_8000 + 0x0000_8000
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-581
__ev_mhssi __ev_mhssi
Vector Multiply Half Word Signed, Saturate, Integer
d = __ev_mhssi (a,b)
temp0
0:31
a
0:15
si
b
0:15
if (temp0
0:31
> 0x0000_7FFF) | (temp0
0:31
< 0xFFFF_8000) then
movh 1
temp0
16:31
SATURATE(movh, temp0
0
, 0x8000, 0x7FFF, temp0
16:31
)
else
movh 0
temp1
0:31
a
16:31
si
b
16:31
if (temp1
0:31
> 0x0000_7FFF) | (temp1
0:31
< 0xFFFF_8000) then
movh 1
temp1
16:31
SATURATE(movh, temp1
0
, 0x8000, 0x7FFF, temp1
16:31
)
temp2
0:31
a
32:47
si
b
32:47
if (temp2
0:31
> 0x0000_7FFF) | (temp2
0:31
< 0xFFFF_8000) then
movl 1
temp2
16:31
SATURATE(movl, temp2
0
, 0x8000, 0x7FFF, temp2
16:31
)
else
movl 0
temp3
0:31
a
48:63
si
b
48:63
if (temp3
0:31
> 0x0000_7FFF) | (temp3
0:31
< 0xFFFF_8000) then
movl 1
temp3
16:31
SATURATE(movl, temp3
0
, 0x8000, 0x7FFF, temp3
16:31
)
d
0:15
temp0
16:31 ;
d
16:31
temp1
16:31
; d
32:47
temp2
16:31
; d
48:63
temp3
16:31
// update SPEFSCR
SPEFSCR
OVH
movh; SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh; SPEFSCR
SOV
SPEFSCR
SOV
| movl
For each half word element in the destination, corresponding half word pairs of signed integer
elements in parameters a and b are multiplied, and the result is placed into the corresponding
parameter d half word. If the result exceeds the magnitude of a half word, the result saturates. The
overflow and summary overflow bits are recorded in the SPEFSCR based on an underflow or
overflow from the multiply.
Other registers altered: SPEFSCR
Figure 3-440. Vector Multiply Half Word, Signed, Saturate, Integer (__ev_mhssi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhssi d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-582 Freescale Semiconductor
SPE2 Operations
__ev_mhsusi __ev_mhsusi
Vector Multiply Half Word Signed by Unsigned, Saturate, Integer
d = __ev_mhsusi (a,b)
temp0
0:31
a
0:15
sui
b
0:15
if (temp0
0:31
> 0x0000_7FFF) | (temp0
0:31
< 0xFFFF_8000) then
movh 1
temp0
16:31
SATURATE(movh, temp0
0
, 0x8000, 0x7FFF, temp0
16:31
)
else
movh 0
temp1
0:31
a
16:31
sui
b
16:31
if (temp1
0:31
> 0x0000_7FFF) | (temp1
0:31
< 0xFFFF_8000) then
movh 1
temp1
16:31
SATURATE(movh, temp1
0
, 0x8000, 0x7FFF, temp1
16:31
)
temp2
0:31
a
32:47
sui
b
32:47
if (temp2
0:31
> 0x0000_7FFF) | (temp2
0:31
< 0xFFFF_8000) then
movl 1
temp2
16:31
SATURATE(movl, temp2
0
, 0x8000, 0x7FFF, temp2
16:31
)
else
movl 0
temp3
0:31
a
48:63
sui
b
48:63
if (temp3
0:31
> 0x0000_7FFF) | (temp3
0:31
< 0xFFFF_8000) then
movl 1
temp3
16:31
SATURATE(movl, temp3
0
, 0x8000, 0x7FFF, temp3
16:31
)
d
0:15
temp0
16:31 ;
d
16:31
temp1
16:31
; d
32:47
temp2
16:31
; d
48:63
temp3
16:31
// update SPEFSCR
SPEFSCR
OVH
movh; SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh; SPEFSCR
SOV
SPEFSCR
SOV
| movl
For each half word element in the destination, corresponding half word pairs of signed integer
elements in parameter a and unsigned integer elements in parameter b are multiplied, and the
result is placed into the corresponding parameter d half word. If the result exceeds the magnitude
of a signed half word, the result saturates. The overflow and summary overflow bits are recorded
in the SPEFSCR based on an underflow or overflow from the multiply.
Other registers altered: SPEFSCR
Figure 3-441. Vector Multiply Half Word, Signed by Unsigned, Saturate, Integer
(__ev_mhsusi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhsusi d,a,b
15 16 47 48 0 31 32 63
Intermediate
b (unsigned)
x
sui
x
sui
a (signed)
products
(4 32-bit)
x
sui
x
sui
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-583
__ev_mhumi __ev_mhumi
Vector Multiply Half Word Unsigned, Modulo, Integer
d = __ev_mhumi (a,b)
temph0
0:31
(a
0:15
ui
b
0:15
)
temph1
0:31
(a
16:31
ui
b
16:31
)
temph2
0:31
(a
32:47
ui
b
32:47
)
temph3
0:31
(a
48:63
ui
b
48:63
)
d
0:15
temph0
16:31
d
16:31
temph1
16:31
d
32:47
temph2
16:31
d
48:63
temph3
16:31
For each half word element in the destination, corresponding half word pairs of unsigned integer
elements in parameters a and b are multiplied producing a 32-bit product. The low order 16 bits
of the product is placed into the corresponding parameter d half word.
Note: The least significant 16 bits of the products are independent of whether the half word
elements in parameters a and b are treated as signed or unsigned 16-bit integers.
Figure 3-442. Vector Multiply Half Word, Unsigned, Modulo, Integer (__ev_mhumi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhumi d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-584 Freescale Semiconductor
SPE2 Operations
__ev_mhusi __ev_mhusi
Vector Multiply Half Word Unsigned, Saturate, Integer
d = __ev_mhusi (a,b)
temp0
0:31
a
0:15
ui
b
0:15
if (temp0
0:31
> 0x0000_FFFF) then
movh 1
temp0
16:31
SATURATE(movh, 0, 0xFFFF, 0xFFFF, temp0
16:31
)
else
movh 0
temp1
0:31
a
16:31
ui
b
16:31
if (temp1
0:31
> 0x0000_FFFF) then
movh 1
temp1
16:31
SATURATE(movh, 0, 0xFFFF, 0xFFFF, temp1
16:31
)
temp2
0:31
a
32:47
ui
b
32:47
if (temp2
0:31
> 0x0000_FFFF) then
movl 1
temp2
16:31
SATURATE(movl, 0, 0xFFFF, 0xFFFF, temp2
16:31
)
else
movl 0
temp3
0:31
a
48:63
ui
b
48:63
if (temp3
0:31
> 0x0000_FFFF) then
movl 1
temp3
16:31
SATURATE(movl, 0, 0xFFFF, 0xFFFF, temp3
16:31
)
d
0:15
temp0
16:31 ;
d
16:31
temp1
16:31
; d
32:47
temp2
16:31
; d
48:63
temp3
16:31
// update SPEFSCR
SPEFSCR
OVH
movh; SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh; SPEFSCR
SOV
SPEFSCR
SOV
| movl
For each half word element in the destination, corresponding half word pairs of unsigned integer
elements in parameters a and b are multiplied, and the result is placed into the corresponding
parameter d half word. If the result exceeds the magnitude of a half word, the result saturates. The
overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow from the
multiply.
Other registers altered: SPEFSCR
Figure 3-443. Vector Multiply Half Word, Unsigned, Saturate, Integer (__ev_mhusi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmhusi d,a,b
15 16 47 48 0 31 32 63
Intermediate
b
x x
a
products
(4 32-bit)
x x
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-585
__ev_minbpsh __ev_minbpsh
Vector Minimum of Byte Pairs Signed to Half Word
d = __ev_minbpsh (a)
d
0:15
EXTS(MIN
si
(a
0:7
,a
8:15
))
d
16:31
EXTS(MIN
si
(a
16:23
,a
24:31
))
d
32:47
EXTS(MIN
si
(a
32:39
,a
40:47
))
d
48:63
EXTS(MIN
si
(a
48:55
,a
56:63
))
Even/odd pairs of signed-integer byte elements of parameter a are compared and the smaller of the
two signed-integer byte elements is sign-extended and placed into the corresponding half word
element of parameter d.
Figure 3-444. Vector Minimum of Byte Pairs Signed to Half Word (__ev_minbpsh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evminbpsh d,a
0 32 63
a
24 16 8 40 48 56
0 32 63
d
16 48
MIN
si
MIN
si
MIN
si
MIN
si
exts exts exts exts
SPE2 Programming Interface Manual, Rev. 1.0-2
3-586 Freescale Semiconductor
SPE2 Operations
__ev_minbpuh __ev_minbpuh
Vector Minimum of Byte Pairs Unsigned to Half Word
d = __ev_minbpuh (a)
d
0:15
EXTZ(MIN
ui
(a
0:7
,a
8:15
))
d
16:31
EXTZ(MIN
ui
(a
16:23
,a
24:31
))
d
32:47
EXTZ(MIN
ui
(a
32:39
,a
40:47
))
d
48:63
EXTZ(MIN
ui
(a
48:55
,a
56:63
))
Even/odd pairs of unsigned-integer byte elements of parameter a are compared and the smaller of
the two unsigned-integer byte elements is zero-extended and placed into the corresponding half
word element of parameter d.
Figure 3-445. Vector Minimum of Byte Pairs Unsigned to Half Word (__ev_minbpuh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evminbpuh d,a
0 32 63
a
24 16 8 40 48 56
0 32 63
d
16 48
MIN
ui
MIN
ui
MIN
ui
MIN
ui
extz extz extz extz
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-587
__ev_minbs __ev_minbs
Vector Minimum Byte Signed
d = __ev_minbs (a,b)
d
0:7
MIN
si
(a
0:7
,b
0:7
)
d
8:15
MIN
si
(a
8:15
,b
8:15
)
d
16:23
MIN
si
(a
16:23
,b
16:23
)
d
24:31
MIN
si
(a
24:31
,b
24:31
)
d
32:39
MIN
si
(a
32:39
,b
32:39
)
d
40:47
MIN
si
(a
40:47
,b
40:47
)
d
48:55
MIN
si
(a
48:55
,b
48:55
)
d
56:63
MIN
si
(a
56:63
,b
56:63
)
Each signed-integer byte element of parameter a is compared to the corresponding signed-integer
byte element of parameter b and the smaller of the two signed-integer elements is placed into the
corresponding byte element of parameter d.
Figure 3-446. Vector Minimum Byte Signed (__ev_minbs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evminbs d,a,b
0 32 63
b
a
24 16 8 40 48 56
MIN
si
0 32 63
d
24 16 8 40 48 56
MIN
si
MIN
si
MIN
si
MIN
si
MIN
si
MIN
si
MIN
si
SPE2 Programming Interface Manual, Rev. 1.0-2
3-588 Freescale Semiconductor
SPE2 Operations
__ev_minbu __ev_minbu
Vector Minimum Byte Unsigned
d = __ev_minbu (a,b)
d
0:7
MIN
ui
(a
0:7
,b
0:7
)
d
8:15
MIN
ui
(a
8:15
,b
8:15
)
d
16:23
MIN
ui
(a
16:23
,b
16:23
)
d
24:31
MIN
ui
(a
24:31
,b
24:31
)
d
32:39
MIN
ui
(a
32:39
,b
32:39
)
d
40:47
MIN
ui
(a
40:47
,b
40:47
)
d
48:55
MIN
ui
(a
48:55
,b
48:55
)
d
56:63
MIN
ui
(a
56:63
,b
56:63
)
Each signed-integer byte element of parameter a is compared to the corresponding signed-integer
byte element of parameter b and the smaller of the two signed-integer elements is placed into the
corresponding byte element of parameter d.
Figure 3-447. Vector Minimum Byte Unsigned (__ev_minbu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evminbu d,a,b
0 32 63
b
a
24 16 8 40 48 56
MIN
ui
0 32 63
d
24 16 8 40 48 56
MIN
ui
MIN
ui
MIN
ui
MIN
ui
MIN
ui
MIN
ui
MIN
ui
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-589
__ev_minds __ev_minds
Vector Minimum Download Signed
d = __ev_minds (a,b)
d
0:63
MIN
si
(a
0:63
, b
0:63
)
The signed-integer doubleword in parameter a is compared to the signed-integer doubleword in
parameter b and the smaller of the two doublewords is placed into parameter d.
Figure 3-448. Vector Minimum Doubleword Signed (__ev_minds)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evminds d,a,b
b
0 63
a
d
MIN
si
SPE2 Programming Interface Manual, Rev. 1.0-2
3-590 Freescale Semiconductor
SPE2 Operations
__ev_mindu __ev_mindu
Vector Minimum Download Unsigned
d = __ev_mindu (a,b)
d
0:63
MIN
ui
(a
0:63
, b
0:63
)
The unsigned-integer doubleword in parameter a is compared to the unsigned-integer doubleword
in parameter b and the smaller of the two doublewords is placed into parameter d.
Figure 3-449. Vector Minimum Doubleword Unsigned (__ev_mindu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmindu d,a,b
b
0 63
a
d
MIN
ui
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-591
__ev_minhpsw __ev_minhpsw
Vector Minimum of Half Word Pairs Signed to Word
d = __ev_minhpsw (a)
d
0:31
EXTS(MIN
si
(a
0:15
,a
16:31
))
d
32:63
EXTS(MIN
si
(a
32:47
,a
48:63
))
Even/odd pairs of signed-integer half word elements of parameter a are compared and the smaller
of the two signed-integer half word elements is sign-extended and placed into the corresponding
word element of parameter d.
Figure 3-450. Vector Minimum of Half Word Pairs Signed to Word (evminhpsw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evminhpsw d,a
0 32 63
a
16 48
0 32 63
d
MIN
si
exts
MIN
si
exts
SPE2 Programming Interface Manual, Rev. 1.0-2
3-592 Freescale Semiconductor
SPE2 Operations
__ev_minhpuw __ev_minhpuw
Vector Minimum of Half Word Pairs Unsigned to Word
d = __ev_minhpuw (a)
d
0:31
EXTZ(MIN
ui
(a
0:15
,a
16:31
))
d
32:63
EXTZ(MIN
ui
(a
32:47
,a
48:63
))
Even/odd pairs of unsigned-integer half word elements of parameter a are compared and the
smaller of the two unsigned-integer half word elements is zero-extended and placed into the
corresponding word element of parameter d.
Figure 3-451. Vector Minimum of Half Word Pairs Unsigned to Word (__ev_minhpuw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evminhpuw d,a
0 32 63
a
16 48
0 32 63
d
MIN
ui
extz
MIN
ui
extz
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-593
__ev_minhs __ev_minhs
Vector Minimum Half Word Signed
d = __ev_minhs (a,b)
d
0:15
MIN
si
(a
0:15
, b
0:15
)
d
16:31
MIN
si
(a
16:31
, b
16:31
)
d
32:47
MIN
si
(a
32:47
, b
32:47
)
d
48:63
MIN
si
(a
48:63
, b
48:63
)
Each signed-integer half word element of parameter a is compared to the corresponding
signed-integer half word element of parameter b and the smaller of the two signed-integer
elements is placed into the corresponding half word element of parameter d.
Figure 3-452. Vector Minimum Half Word Signed (__ev_minhs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evminhs d,a,b
b
0 31 32 63
a
15 16 47 48
MIN
si
d
MIN
si
MIN
si
MIN
si
SPE2 Programming Interface Manual, Rev. 1.0-2
3-594 Freescale Semiconductor
SPE2 Operations
__ev_minhu __ev_minhu
Vector Minimum Half Word Unsigned
d = __ev_minhu (a,b)
d
0:15
MIN
ui
(a
0:15
, b
0:15
)
d
16:31
MIN
ui
(a
16:31
, b
16:31
)
d
32:47
MIN
ui
(a
32:47
, b
32:47
)
d
48:63
MIN
ui
(a
48:63
, b
48:63
)
Each unsigned-integer half word element of parameter a is compared to the corresponding
unsigned-integer half word element of parameter b and the smaller of the two unsigned-integer
elements is placed into the corresponding half word element of parameter d.
Figure 3-453. Vector Minimum Half Word Unsigned (__ev_minhu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evminhu d,a,b
b
0 31 32 63
a
15 16 47 48
MIN
ui
d
MIN
ui
MIN
ui
MIN
ui
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-595
__ev_minwpsd __ev_minwpsd
Vector Minimum of Word Pair Signed to Double Word
d = __ev_minwpsd (a)
d
0:63
EXTS(MIN
si
(a
0:31
,a
32:63
))
The signed-integer word elements of parameter a are compared and the smaller of the two
signed-integer word elements is sign-extended and placed into parameter d.
Figure 3-454. Vector Minimum of Word Pairs Signed to Double Word (__ev_minwpsd)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evminwpsd d,a
0 32 63
a
0 63
d
MIN
si
exts
SPE2 Programming Interface Manual, Rev. 1.0-2
3-596 Freescale Semiconductor
SPE2 Operations
__ev_minwpud __ev_minwpud
Vector Minimum of Word Pair Unsigned to Double Word
d = __ev_minwpud (a)
d
0:63
EXTZ(MIN
ui
(a
0:31
,a
32:63
))
The unsigned-integer word elements of parameter a are compared and the smaller of the two
unsigned-integer word elements is zero-extended and placed into parameter d.
Figure 3-455. Vector Minimum of Word Pairs Unsigned to Double Word (__ev_minwpud)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evminwpud d,a
0 32 63
a
0 63
d
MIN
ui
extz
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-597
__ev_minws __ev_minws
Vector Minimum Word Signed
d = __ev_minws (a,b)
d
0:31
MIN
si
(a
0:31
, b
0:31
)
d
32:63
MIN
si
(a
32:63
, b
32:63
)
Each signed-integer word element of parameter a is compared to the corresponding signed-integer
word element of parameter b and the smaller of the two signed-integer elements is placed into the
corresponding word element of parameter d.
Figure 3-456. Vector Minimum Word Signed (__ev_minws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evminws d,a,b
b
0 31 32 63
a
15 16 47 48
d
MIN
si
MIN
si
SPE2 Programming Interface Manual, Rev. 1.0-2
3-598 Freescale Semiconductor
SPE2 Operations
__ev_minwu __ev_minwu
Vector Minimum Word Unsigned
d = __ev_minwu (a,b)
d
0:31
MIN
ui
(a
0:31
, b
0:31
)
d
32:63
MIN
ui
(a
32:63
, b
32:63
)
Each unsigned-integer word element of parameter a is compared to the corresponding
unsigned-integer word element of parameter b and the smaller of the two unsigned-integer
elements is placed into the corresponding word element of parameter d.
Figure 3-457. Vector Minimum Word Unsigned (evminwu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evminwu d,a,b
b
0 31 32 63
a
15 16 47 48
d
MIN
ui
MIN
ui
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-599
__ev_mra __ev_mra
Initialize Accumulator
d = __ev_mra (a)
ACC
0:63
a
0:63
d
0:63
a
0:63
The contents of parameter a are written into the accumulator and copied into parameter d. This is
the method for initializing the accumulator.
Other registers altered: ACC
Figure 3-458. Initialize Accumulator (__ev_mra)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evmra d,a
0 31 32 63
a
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-600 Freescale Semiconductor
SPE2 Operations
__ev_mwehgsmf[a] __ev_mwehgsmf[a]
Vector Multiply Word Even High Guarded Signed, Modulo, Fractional (to Accumulator)
d = __ev_mwehgsmf (a,b) (A = 0)
d = __ev_mwehgsmfa (a,b) (A = 1)
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000
else
d
0:63
EXTS
64
(temp
0:47
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The even word signed fractional elements in parameters a and b are multiplied. The high order 48
bits of the 64-bit product are sign-extended to 64-bits to produce an intermediate product in 17.47
fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0. The
intermediate product is then placed into parameter d. If A = 1, the result in parameter d is also
placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-459. Vector Multiply Word Even, High, Guarded, Signed, Modulo, Fractional (to
Accumulator) (__ev_mwehgsmf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmfa d,a,b
0 31 32 63
b
X
a
0 63
d (and accumulator if A=1)
15 16 17
.
S........................S
S
47 0
exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-601
__ev_mwehgsmfaa __ev_mwehgsmfaa
Vector Multiply Word Even High Guarded Signed, Modulo, Fractional and Accumulate
d = __ev_mwehgsmfaa (a,b)
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000 + ACC
0:63
else
d
0:63
EXTS
64
(temp
0:47
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The even word signed fractional elements in parameters a and b are multiplied. The high order 48
bits of the 64-bit product are sign-extended to 64-bits to produce an intermediate product in 17.47
fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0. The
intermediate product is then added to the contents of the accumulator and the result is placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-460. Vector Multiply Word Even, High, Guarded, Signed, Modulo, Fractional and
Accumulate (__ev_mwehgsmfaa)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
+
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-602 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmfaa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-603
__ev_mwehgsmfan __ev_mwehgsmfan
Vector Multiply Word Even High, Guarded, Signed, Modulo, Fractional and Accumulate Negative
d = __ev_mwehgsmfan (a,b)
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
d
0:63
ACC
0:63
0x0000_8000_0000_0000
else
d
0:63
ACC
0:63
- EXTS
64
(temp
0:47
)
ACC
0:63
d
0:63
The even word signed fractional elements in parameters a and b are multiplied. The high order 48
bits of the 64-bit product are sign-extended to 64-bits to produce an intermediate product in 17.47
fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0. The
intermediate product is then subtracted from the contents of the accumulator and the result is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-461. Vector Multiply Word Even, High, Guarded, Signed, Modulo, Fractional and
Accumulate Negative (__ev_mwehgsmfan)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
-
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-604 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmfan d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-605
__ev_mwehgsmfr[a] __ev_mwehgsmfr[a]
Vector Multiply Word Even High Guarded Signed, Modulo, Fractional, Round (to Accumulator)
d = __ev_mwehgsmfr (a,b) (A = 0)
d = __ev_mwehgsmfra (a,b) (A = 1)
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000
else
temp
0:64
EXTS
65
(temp
0:63
)
tempr
0:64
ROUND(temp
0:64
,16)
d
0:63
EXTS
64
(tempr
0:48
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The even word signed fractional elements in parameters a and b are multiplied. The 64-bit
fractional product is sign-extended to 65 bits, rounded to 49-bits using the current rounding mode
in SPEFSCR, and the 49-bit value is sign-extended to 64-bits to produce a rounded intermediate
product in 17.47 fractional format. If both inputs are -1.0, the intermediate product is represented
as +1.0. The intermediate product is then placed into parameter d. If A = 1, the result in parameter
d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-462. Vector Multiply Word Even, High, Guarded, Signed, Modulo, Fractional,
Round (to Accumulator) (__ev_mwehgsmfr[a])
0 31 32 63
b
X
a
0 63
d (and accumulator if A=1)
15 16 17
.
S........................S
S
47 0
exts
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-606 Freescale Semiconductor
SPE2 Operations
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmfra d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-607
__ev_mwehgsmfraa __ev_mwehgsmfraa
Vector Multiply Word Even High Guarded Signed, Modulo, Fractional, Round and Accumulate
d = __ev_mwehgsmfraa (a,b)
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000 + ACC
0:63
else
temp
0:64
EXTS
65
(temp
0:63
)
tempr
0:64
ROUND(temp
0:64
,16)
d
0:63
EXTS
64
(tempr
0:48
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The even word signed fractional elements in parameters a and b are multiplied. The 64-bit
fractional product is sign-extended to 65 bits, rounded to 49-bits using the current rounding mode
in SPEFSCR, and the 49-bit value is sign-extended to 64-bits to produce an intermediate product
in 17.47 fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0.
The intermediate product is then added to the contents of the accumulator and the result is placed
into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-463. Vector Multiply Word Even, High, Guarded, Signed, Modulo, Fractional,
Round and Accumulate (__ev_mwehgsmfraa)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
+
d and Accumulator
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-608 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmfraa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-609
__ev_mwehgsmfran __ev_mwehgsmfran
Vector Multiply Word Even High, Guarded, Signed, Modulo, Fractional, Round and Accumulate
Negative
d = __ev_mwehgsmfran (a,b)
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
d
0:63
ACC
0:63
- 0x0000_8000_0000_0000
else
temp
0:64
EXTS
65
(temp
0:63
)
tempr
0:64
ROUND(temp
0:64
,16)
d
0:63
ACC
0:63
- EXTS
64
(tempr
0:48
)
ACC
0:63
d
0:63
The even word signed fractional elements in parameters a and b are multiplied. The 64-bit
fractional product is sign-extended to 65 bits, rounded to 49-bits using the current rounding mode
in SPEFSCR, and the 49-bit value is sign-extended to 64-bits to produce an intermediate product
in 17.47 fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0.
The intermediate product is then subtracted from the contents of the accumulator and the result is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-464. Vector Multiply Word Even, High, Guarded, Signed, Modulo, Fractional,
Round, and Accumulate Negative (__ev_mwehgsmfran)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
-
d and Accumulator
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-610 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwehgsmfran d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-611
__ev_mwhsmf[a] __ev_mwhsmf[a]
Vector Multiply Word High Signed, Modulo, Fractional (to Accumulator)
d = __ev_mwhsmf (a,b) (A = 0)
d = __ev_mwhsmfa (a,b) (A = 1)
// high
temp
0:63
a
0:31
sf
b
0:31
d
0:31
temp
0:31
// low
temp
0:63
a
32:63
sf
b
32:63
d
32:63
temp
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The corresponding word signed fractional elements in parameters a and b are multiplied, and bits
031 of the two products are placed into the two corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A =1)
Figure 3-465. Vector Multiply Word High Signed, Modulo, Fractional (to Accumulator)
(__ev_mwhsmf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhsmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhsmfa d,a,b
0 31 32 63
Intermediate product
b
X
d (and accumulator
a
X
if __ev_mshdmfa)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-612 Freescale Semiconductor
SPE2 Operations
__ev_mwhsmi[a] __ev_mwhsmi[a]
Vector Multiply Word High Signed, Modulo, Integer (to Accumulator)
d = __ev_mwhsmi (a,b) (A = 0)
d = __ev_mwhsmia (a,b) (A = 1)
// high
temp
0:63
a
0:31
si
b
0:31
d
0:31
temp
0:31
// low
temp
0:63
a
32:63
si
b
32:63
d
32:63
temp
0:31
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The corresponding word signed integer elements in parameters a and b are multiplied. Bits 031
of the two 64-bit products are placed into the two corresponding words of parameter d.
If A = 1,The result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-466. Vector Multiply Word High Signed, Modulo, Integer (to Accumulator)
(__ev_mwhsmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhsmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhsmia d,a,b
0 31 32 63
Intermediate product
b
X
d (and accumulator
a
X
if __ev_mwhsmia)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-613
__ev_mwhssf[a] __ev_mwhssf[a]
Vector Multiply Word High Signed, Saturate, Fractional (to Accumulator)
d = __ev_mwhssf (a,b) (A = 0)
d = __ev_mwhssfa (a,b) (A = 1)
The corresponding word signed fractional elements in parameters a and b are multiplied. Bits 031
of each product are placed into the corresponding words of parameter d. If both inputs are -1.0, the
result saturates to the largest positive signed fraction and the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC (if A = 1)
Figure 3-467. Vector Multiply Word High Signed, Saturate, Fractional (to Accumulator)
(__ev_mwhssf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssfa d,a,b
// high
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &
(b
0:31
= 0x8000_0000) then
d
0:31
0x7FFF_FFFF //saturate
movh 1
else
d
0:31
temp
0:31
movh 0
// low
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &
(b
32:63
= 0x8000_0000) then
d
32:63
0x7FFF_FFFF //saturate
movl 1
else
d
32:63
temp
0:31
movl 0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
;
// update SPEFSCR
SPEFSCR
OVH
movh
SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh
SPEFSCR
SOV
SPEFSCR
SOV
| movl
0 31 32 63
Intermediate product
b
X
d (and accumulator
a
X
if __ev_mwhssfa)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-614 Freescale Semiconductor
SPE2 Operations
__ev_mwhssfaaw __ev_mwhssfaaw
Vector Multiply Word High Signed, Saturate, Fractional and Accumulate into Words
d = __ev_mwhssfaaw (a,b)
// high
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
temph
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
temph
0:63
a
0:31
sf
b
0:31
movh 0
temp
0:63
EXTS(ACC
0:31
) + EXTS(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
32:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
templ
0:63
a
32:63
sf
b
32:63
movl 0
temp
0:63
EXTS(ACC
32:63
) + EXTS(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl
The corresponding signed fractional word elements in parameters a and b are multiplied producing
a 32-bit product. If both inputs are -1.0, the result saturates to the largest positive signed fraction.
Bits 0-31 of each product are then added to the corresponding word in the accumulator, saturating
if overflow or underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-615
Figure 3-468. Vector Multiply Word High Signed, Saturate, Fractional and Accumulate into
Words (__ev_mwhssfaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssfaaw d,a,b
0 31 32 63
Intermediate product
b
X
d and accumulator
a
X
+ +
accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-616 Freescale Semiconductor
SPE2 Operations
__ev_mwhssfaaw3 __ev_mwhssfaaw3
Vector Multiply Word High Signed, Saturate, Fractional and Accumulate into Words 3 operand
d = __ev_mwhssfaaw3 (a,b,c)
// high
if (b
0:31
= 0x8000_0000) &(c
0:31
= 0x8000_0000) then
temph
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
temph
0:63
b
0:31
sf
c
0:31
movh 0
temp
0:63
EXTS
64
(a
0:31
) + EXTS
64
(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
if (b
32:63
= 0x8000_0000) &(c
32:63
= 0x8000_0000) then
a
32:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
templ
0:63
b
32:63
sf
c
32:63
movl 0
temp
0:63
EXTS
64
(a
32:63
) + EXTS
64
(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl
The corresponding signed fractional word elements in parameters b and c are multiplied producing
a 32-bit product. If both inputs are -1.0, the result saturates to the largest positive signed fraction.
Bits 0:31 of each product are then added to the corresponding word in parameter a, saturating if
overflow or underflow occurs, and the result is placed into parameter d and the accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-617
Figure 3-469. Vector Multiply Word High Signed, Saturate, Fractional and Accumulate into
Words 3 op (__ev_mwhssfaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwhssfaaw3 d,b,c
0 31 32 63
Intermediate product
c
X
d and accumulator
b
X
+ +
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-618 Freescale Semiconductor
SPE2 Operations
__ev_mwhssfanw __ev_mwhssfanw
Vector Multiply Word High Signed, Saturate, Fractional and Accumulate Negative into Words
d = __ev_mwhssfanw (a,b)
// high
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then
temph
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
temph
0:63
a
0:31
sf
b
0:31
movh 0
temp
0:63
EXTS(ACC
0:31
) - EXTS(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
32:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
templ
0:63
a
32:63
sf
b
32:63
movl 0
temp
0:63
EXTS(ACC
32:63
) - EXTS(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl
The corresponding signed fractional word elements in parameters a and b are multiplied producing
a 32-bit product. If both inputs are -1.0, the result saturates to the largest positive signed fraction.
Bits 0-31 of each product are then subtracted from the corresponding word in the accumulator,
saturating if overflow or underflow occurs, and the result is placed in parameter d and the
accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-619
Figure 3-470. Vector Multiply Word High Signed, Saturate, Fractional and Accumulate
Negative into Words (__ev_mwhssfanw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssfanw d,a,b
0 31 32 63
Intermediate product
b
X
d and accumulator
a
X
- -
accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-620 Freescale Semiconductor
SPE2 Operations
__ev_mwhssfanw3 __ev_mwhssfanw3
Vector Multiply Word High Signed, Saturate, Fractional and Accumulate Negative into Words 3
operand
d = __ev_mwhssfanw3 (a,b,c)
// high
if (b
0:31
= 0x8000_0000) &(c
0:31
= 0x8000_0000) then
temph
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1
else
temph
0:63
b
0:31
sf
c
0:31
movh 0
temp
0:63
EXTS
64
(d
0:31
) - EXTS
64
(temph
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
if (b
32:63
= 0x8000_0000) &(c
32:63
= 0x8000_0000) then
a
32:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1
else
templ
0:63
b
32:63
sf
c
32:63
movl 0
temp
0:63
EXTS
64
(a
32:63
) - EXTS
64
(templ
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl| movl
The corresponding signed fractional word elements in parameters b and c are multiplied producing
a 32-bit product. If both inputs are -1.0, the result saturates to the largest positive signed fraction.
Bits 0:31 of each product are then subtracted from the corresponding word in parameter a,
saturating if overflow or underflow occurs, and the result is placed into parameter d and the
accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-621
Figure 3-471. Vector Multiply Word High Signed, Saturate, Fractional and Accumulate
Negative into Words 3 op (__ev_mwhssfanw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwhssfanw3 d,b,c
0 31 32 63
Intermediate product
c
X
d and accumulator
b
X
- -
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-622 Freescale Semiconductor
SPE2 Operations
__ev_mwhssfr[a] __ev_mwhssfr[a]
Vector Multiply Word High Signed, Saturate, Fractional and Round (to Accumulator)
d = __ev_mwhssfr (a,b) (A = 0)
d = __ev_mwhssfra (a,b) (A = 1)
The corresponding word signed fractional elements in parameters a and b are multiplied. Bits 0-32
of each product are incremented, and bits 0-31 of the sum are placed into the corresponding words
of parameter d, and the accumulator if A = 1. If both inputs are -1.0, the result saturates to
0x7FFF_FFFF, and the overflow and summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC (If A = 1)
Figure 3-472. Vector Multiply Word High Signed, Saturate, Fractional and Round (to
Accumulator) (__ev_mwhssfr[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssfra d,a,b
// high
temp
0:63
a
0:31
sf
b
0:31
if (a
0:31
= 0x8000_0000) &(b
0:31
=
0x8000_0000) then
d
0:31
0x7FFF_FFFF //saturate
movh 1
else
tempr
0:32
temp
0:32
+ 1
d
0:31
tempr
0:31
movh 0
// low
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
=
0x8000_0000) then
d
32:63
0x7FFF_FFFF //saturate
movl 1
else
tempr
0:32
temp
0:32
+ 1
d
32:63
tempr
0:31
movl 0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
movh
SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh
SPEFSCR
SOV
SPEFSCR
SOV
| movl
0 31 32 63
Intermediate product
b
X
d (and accumulator
a
X
if __ev_mwhssfra)
+ 1 + 1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-623
__ev_mwhssfraaw __ev_mwhssfraaw
Vector Multiply Word High Signed, Saturate, Fractional, Round and Accumulate into Words
d = __ev_mwhssfraaw (a,b)
// high
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then {
tempr
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1}
else {
temph
0:63
a
0:31
sf
b
0:31
tempr
0:32
temph
0:32
+ 1
movh 0}
temp
0:63
EXTS(ACC
0:31
) + EXTS(tempr
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then {
tempr
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1}
else {
templ
0:63
a
32:63
sf
b
32:63
tempr
0:32
templ
0:32
+ 1
movl 0}
temp
0:63
EXTS(ACC
32:63
) + EXTS(tempr
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl | movl
The corresponding signed fractional word elements in parameters a and b are multiplied. Bits 0-32
of each product are incremented to obtain an intermediate result. If both inputs are -1.0, the
intermediate result saturates to the largest positive signed fraction. Bits 0-31 of each intermediate
result are then added to the corresponding word in the accumulator, saturating if overflow or
underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from either the multiply or the addition, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-624 Freescale Semiconductor
SPE2 Operations
Figure 3-473. Vector Multiply Word High Signed, Saturate, Fractional, Round, and
Accumulate into Words (__ev_mwhssfraaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssfraaw d,a,b
d and accumulator
+ +
accumulator
0 31 32 63
Intermediate product
b
X
a
X
+ 1 + 1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-625
__ev_mwhssfraaw3 __ev_mwhssfraaw3
Vector Multiply Word High Signed, Saturate, Fractional, Round and Accumulate into Words 3
operand
d = __ev_mwhssfraaw3 (a,b,c)
// high
temph
0:63
b
0:31
sf
c
0:31
if (b
0:31
= 0x8000_0000) &(c
0:31
= 0x8000_0000) then {
temph
0:63
0x7FFF_FFFF_0000_0000 //saturate
movh 1}
else
movh 0
temph
0:66
ROUND((EXTS
67
(a
0:31
||
32
0) + EXTS
67
(temph
0:63
)),32)
ovh chk_ovf(temph
0:3
)
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
3:34
)
// low
templ
0:63
b
32:63
sf
c
32:63
if (b
32:63
= 0x8000_0000) &(c
32:63
= 0x8000_0000) then {
templ
0:63
0x7FFF_FFFF_0000_0000 //saturate
movl 1}
else
movl 0
templ
0:66
ROUND((EXTS
67
(a
32:63
||
32
0) + EXTS
67
(templ
0:63
)),32)
ovl chk_ovf(templ
0:3
)
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
3:34
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl | movl
The corresponding signed fractional word elements in parameters b and c are multiplied. If both
inputs are -1.0, the intermediate result saturates to the largest positive signed fraction. The 64-bit
products are added to the corresponding zero-padded word of parameter a, the sums are rounded
to 32 bits using the current fractional rounding mode in SPEFCSR, saturating if overflow or
underflow occurs, and the results are placed into the corresponding word of parameter d and the
accumulator. If there is an overflow or underflow from either the multiply or the addition with
round, the overflow and summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-626 Freescale Semiconductor
SPE2 Operations
Figure 3-474. Vector Multiply Word High Signed, Saturate, Fractional, Round, and
Accumulate into Words 3 op (__ev_mwhssfraaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwhssfraaw3 d,b,c
d and accumulator
+ +
a
0 31 32 63
Intermediate product
c
X
b
X
ROUND ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-627
__ev_mwhssfranw __ev_mwhssfranw
Vector Multiply Word High Signed, Saturate, Fractional, Round and Accumulate Negative into
Words
d = __ev_mwhssfranw (a,b)
// high
if (a
0:31
= 0x8000_0000) &(b
0:31
= 0x8000_0000) then {
tempr
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movh 1}
else {
temph
0:63
a
0:31
sf
b
0:31
tempr
0:32
temph
0:32
+ 1
movh 0}
temp
0:63
EXTS(ACC
0:31
) - EXTS(tempr
0:31
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then {
tempr
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
movl 1}
else {
templ
0:63
a
32:63
sf
b
32:63
tempr
0:32
templ
0:32
+ 1
movl 0}
temp
0:63
EXTS(ACC
32:63
) - EXTS(tempr
0:31
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl | movl
The corresponding signed fractional word elements in parameters a and b are multiplied. Bits 0-32
of each product are incremented to obtain an intermediate result. If both inputs are -1.0, the
intermediate result saturates to the largest positive signed fraction. Bits 0-31 of each intermediate
result are then subtracted from the corresponding word in the accumulator, saturating if overflow
or underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow or underflow from either the multiply or the subtract, the overflow and
summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-628 Freescale Semiconductor
SPE2 Operations
Figure 3-475. Vector Multiply Word High Signed, Saturate, Fractional, Round, and
Accumulate Negative into Words (__ev_mwhssfranw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhssfranw d,a,b
d and accumulator
- -
accumulator
0 31 32 63
Intermediate product
b
X
a
X
+ 1 + 1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-629
__ev_mwhssfranw3 __ev_mwhssfranw3
Vector Multiply Word High Signed, Saturate, Fractional, Round and Accumulate Negative into
Words 3 operand
d = __ev_mwhssfranw3 (a,b,c)
// high
temph
0:63
b
0:31
sf
c
0:31
if (b
0:31
= 0x8000_0000) &(c
0:31
= 0x8000_0000) then {
temph
0:63
0x7FFF_FFFF_0000_0000 //saturate
movh 1}
else
movh 0
temph
0:66
ROUND((EXTS
67
(a
0:31
||
32
0) - EXTS
67
(temph
0:63
)),32)
ovh chk_ovf(temph
0:3
)
d
0:31
SATURATE(ovh, temph
0
, 0x8000_0000, 0x7FFF_FFFF, temph
3:34
)
// low
templ
0:63
b
32:63
sf
c
32:63
if (b
32:63
= 0x8000_0000) &(c
32:63
= 0x8000_0000) then {
templ
0:63
0x7FFF_FFFF_0000_0000 //saturate
movl 1}
else
movl 0
templ
0:66
ROUND((EXTS
67
(a
32:63
||
32
0) - EXTS
67
(templ
0:63
)),32)
ovl chk_ovf(templ
0:3
)
d
32:63
SATURATE(ovl, templ
0
, 0x8000_0000, 0x7FFF_FFFF, templ
3:34
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh | movh
SPEFSCR
OV
ovl | movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh | movh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl | movl
The corresponding signed fractional word elements in parameters b and c are multiplied. If both
inputs are -1.0, the intermediate result saturates to the largest positive signed fraction. The 64-bit
products are subtracted from the corresponding zero-padded word of a, the differences are rounded
to 32 bits using the current fractional rounding mode in SPEFCSR, saturating if overflow or
underflow occurs, and the results are placed into the corresponding word of d and the accumulator.
If there is an overflow or underflow from either the multiply or the subtraction with round, the
overflow and summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
SPE2 Programming Interface Manual, Rev. 1.0-2
3-630 Freescale Semiconductor
SPE2 Operations
Figure 3-476. Vector Multiply Word High Signed, Saturate, Fractional, Round, and
Accumulate Negative into Words 3 op (__ev_mwhssfranw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwhssfranw3 d,b,c
d and accumulator
- -
a
0 31 32 63
Intermediate product
c
X
b
X
ROUND ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-631
__ev_mwhumi[a] __ev_mwhumi[a]
Vector Multiply Word High Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mwhumi (a,b) (A = 0)
d = __ev_mwhumia (a,b) (A = 1)
// high
temp
0:63
a
0:31
ui
b
0:31
d
0:31
temp
0:31
// low
temp
0:63
a
32:63
ui
b
32:63
d
32:63
temp
0:31
// update accumulator
if A = 1, ACC
0:63
d
0:63
The corresponding word unsigned integer elements in parameters a and b are multiplied. Bits 031
of the two products are placed into the two corresponding words of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-477. Vector Multiply Word High Unsigned, Modulo, Integer (to Accumulator)
(__ev_mwhumi[a])
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwhumi d,a,b
0 31 32 63
Intermediate product
b
X
d (and accumulator
a
X
if __ev_mwhumia)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-632 Freescale Semiconductor
SPE2 Operations
__ev_mwlsmiaaw __ev_mwlsmiaaw
Vector Multiply Word Low Signed, Modulo, Integer and Accumulate in Words
d = __ev_mwlsmiaaw (a,b)
// high
temp
0:63
a
0:31
si
b
0:31
d
0:31
ACC
0:31
+ temp
32:63
// low
temp
0:63
a
32:63
si
b
32:63
d
32:63
ACC
32:63
+ temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word signed integer elements in
parameters a and b are multiplied. The least significant 32 bits of each intermediate product is
added to the contents of the corresponding accumulator words, and the result is placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-478. Vector Multiply Word Low Signed, Modulo, Integer and Accumulate
in Words (__ev_mwlsmiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlsmiaaw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
+ +
d and accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-633
__ev_mwlsmiaaw3 __ev_mwlsmiaaw3
Vector Multiply Word Low Signed, Modulo, Integer and Accumulate in Words 3 operand
d = __ev_mwlsmiaaw3 (a,b,c)
// high
temp
0:63
b
0:31
si
c
0:31
d
0:31
a
0:31
+ temp
32:63
// low
temp
0:63
b
32:63
si
c
32:63
d
32:63
a
32:63
+ temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word signed integer elements in
parameters b and c are multiplied. The least significant 32 bits of each intermediate product is
added to the contents of the corresponding parameter a words, and the result is placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-479. Vector Multiply Word Low Signed, Modulo, Integer and Accumulate
in Words 3 op (__ev_mwlsmiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwlsmiaaw3 d,b,c
0 31 32 63
Intermediate product
c
X
a
b
X
+ +
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-634 Freescale Semiconductor
SPE2 Operations
__ev_mwlsmianw __ev_mwlsmianw
Vector Multiply Word Low Signed, Modulo, Integer and Accumulate Negative in Words
d = __ev_mwlsmianw (a,b)
// high
temp
0:63
a
0:31
si
b
0:31
d
0:31
ACC
0:31
- temp
32:63
// low
temp
0:63
a
32:63
si
b
32:63
d
32:63
ACC
32:63
- temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word elements in parameters a and
b are multiplied. The least significant 32 bits of each intermediate product is subtracted from the
contents of the corresponding accumulator words, and the result is placed in parameter d and the
accumulator.
Other registers altered: ACC
Figure 3-480. Vector Multiply Word Low Signed, Modulo, Integer and Accumulate
Negative in Words (__ev_mwlsmianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlsmfanw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
d and accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-635
__ev_mwlsmianw3 __ev_mwlsmianw3
Vector Multiply Word Low Signed, Modulo, Integer and Accumulate Negative in Words 3 operand
d = __ev_mwlsmianw3 (a,b)
// high
temp
0:63
b
0:31
si
c
0:31
d
0:31
a
0:31
- temp
32:63
// low
temp
0:63
b
32:63
si
c
32:63
d
32:63
a
32:63
- temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word elements in parameters b and
c are multiplied. The least significant 32 bits of each intermediate product is subtracted from the
contents of the corresponding accumulator words and the result is placed into parameter d and the
accumulator.
Other registers altered: ACC
Figure 3-481. Vector Multiply Word Low Signed, Modulo, Integer and Accumulate Negative
in Words 3 op (__ev_mwlsmianw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwlsmianw3 d,b,c
0 31 32 63
Intermediate product
c
X
a
b
X
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-636 Freescale Semiconductor
SPE2 Operations
__ev_mwlssiaaw __ev_mwlssiaaw
Vector Multiply Word Low Signed, Saturate, Integer and Accumulate in Words
d = __ev_mwlssiaaw (a,b)
// high
temp
0:63
a
0:31
si
b
0:31
temp
0:63
EXTS
64
(ACC
0:31
) + EXTS
64
(temp
32:63
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:63
a
32:63
si
b
32:63
temp
0:63
EXTS
64
(ACC
32:63
) + EXTS
64
(temp
32:63
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding word signed integer elements in parameters a and b are multiplied producing
a 64-bit product. The least significant 32 bits of each product is then added to the corresponding
word in the accumulator saturating if overflow or underflow occurs, and the result is placed into
parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-482. Vector Multiply Word Low Signed, Saturate, Integer and Accumulate in
Words (__ev_mwlssiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlssiaaw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
+ +
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-637
__ev_mwlssiaaw3 __ev_mwlssiaaw3
Vector Multiply Word Low Signed, Saturate, Integer and Accumulate in Words
d = __ev_mwlssiaaw3 (a,b,c)
// high
temp
0:63
b
0:31
si
c
0:31
temp
0:63
EXTS
64
(a
0:31
) + EXTS
64
(temp
32:63
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:63
b
32:63
si
c
32:63
temp
0:63
EXTS
64
(a
32:63
) + EXTS
64
(temp
32:63
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding word signed integer elements in parameters b and c are multiplied producing
a 64-bit product. The least significant 32 bits of each product is then added to the corresponding
word in parameter a saturating if overflow or underflow occurs, and the result is placed into
parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-483. Vector Multiply Word Low Signed, Saturate, Integer and Accumulate in
Words 3 op (__ev_mwlssiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
__ev_mwlssiaaw3 d,b,c
0 31 32 63
Intermediate product
c
X
a
b
X
+ +
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-638 Freescale Semiconductor
SPE2 Operations
__ev_mwlssianw __ev_mwlssianw
Vector Multiply Word Low Signed, Saturate, Integer and Accumulate Negative in Words
d = __ev_mwlssianw (a,b)
// high
temp
0:63
a
0:31
si
b
0:31
temp
0:63
EXTS
64
(ACC
0:31
) - EXTS
64
(temp
32:63
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:63
a
32:63
si
b
32:63
temp
0:63
EXTS
64
(ACC
32:63
) - EXTS
64
(temp
32:63
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding word signed integer elements in parameters a and b are multiplied producing
a 64-bit product. The least significant 32 bits of each product is then subtracted from the
corresponding word in the accumulator saturating if overflow or underflow occurs, and the result
is placed into parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-484. Vector Multiply Word Low Signed, Saturate, Integer and Accumulate
Negative in Words (__ev_mwlssianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlssianw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-639
__ev_mwlssianw3 __ev_mwlssianw3
Vector Multiply Word Low Signed, Saturate, Integer and Accumulate Negative in Words 3
operand
d = __ev_mwlssianw3 (a,b,c)
// high
temp
0:63
b
0:31
si
c
0:31
temp
0:63
EXTS
64
(a
0:31
) - EXTS
64
(temp
32:63
)
ovh (temp
31
temp
32
)
d
0:31
SATURATE(ovh, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// low
temp
0:63
b
32:63
si
c
32:63
temp
0:63
EXTS
64
(a
32:63
) - EXTS
64
(temp
32:63
)
ovl (temp
31
temp
32
)
d
32:63
SATURATE(ovl, temp
31
, 0x8000_0000, 0x7FFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The corresponding word signed integer elements in parameters b and c are multiplied producing
a 64-bit product. The least significant 32 bits of each product is then subtracted from the
corresponding word in parameter a saturating if overflow or underflow occurs, and the result is
placed into parameter d and the accumulator.
If there is an overflow or underflow from the addition, the overflow and summary overflow bits
are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-485. Vector Multiply Word Low Signed, Saturate, Integer and Accumulate
Negative in Words 3 op (__ev_mwlssianw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwlssianw3 d,b,c
0 31 32 63
Intermediate product
c
X
a
b
X
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-640 Freescale Semiconductor
SPE2 Operations
__ev_mwlumi[a] __ev_mwlumi[a]
Vector Multiply Word Low Unsigned, Modulo, Integer
d = __ev_mwlumi (a,b) (A = 0)
d = __ev_mwlumia (a,b) (A = 1)
// high
temp
0:63
a
0:31
ui
b
0:31
d
0:31
temp
32:63
// low
temp
0:63
a
32:63
ui
b
32:63
d
32:63
temp
32:63
// update accumulator
If A = 1 then ACC
0:63
d
0:63
The corresponding word unsigned integer elements in parameters a and b are multiplied. The least
significant 32 bits of each product are placed into the two corresponding words of parameter d.
NOTE
The least significant 32 bits of the product are independent of whether
the word elements in parameters a and b are treated as signed or
unsigned 32-bit integers.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Note that evmwlumi and evmwlumia can be used for signed or unsigned integers.
Figure 3-486. Vector Multiply Word Low Unsigned, Modulo, Integer (__ev_mwlumi[a])
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlumi d,a,b
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlumia d,a,b
0 31 32 63
Intermediate product
b
X
a
X
d (and accumulator
if __ev_mwlumia)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-641
__ev_mwlumiaaw __ev_mwlumiaaw
Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate in Words
d = __ev_mwlumiaaw (a,b)
// high
temp
0:63
a
0:31
ui
b
0:31
d
0:31
ACC
0:31
+ temp
32:63
// low
temp
0:63
a
32:63
ui
b
32:63
d
32:63
ACC
32:63
+ temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word unsigned integer elements in
parameters a and b are multiplied. The least significant 32 bits of each product are added to the
contents of the corresponding accumulator word, and the result is placed into the corresponding
parameter d and accumulator word.
Other registers altered: ACC
Figure 3-487. Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate in
Words (__ev_mwlumiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlumiaaw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
+ +
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-642 Freescale Semiconductor
SPE2 Operations
__ev_mwlumiaaw3 __ev_mwlumiaaw3
Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate in Words 3 operand
d = __ev_mwlumiaaw3 (a,b,c)
// high
temp
0:63
b
0:31
ui
c
0:31
d
0:31
a
0:31
+ temp
32:63
// low
temp
0:63
b
32:63
ui
c
32:63
d
32:63
a
32:63
+ temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word unsigned integer elements in
parameters b and c are multiplied. The least significant 32 bits of each product is added to the
contents of the corresponding parameter a word and the result is placed into parameter d and the
accumulator.
Other registers altered: ACC
Figure 3-488. Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate in
Words 3 op (__ev_mwlumiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwlumiaaw3 d,b,c
0 31 32 63
Intermediate product
c
X
a
b
X
+ +
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-643
__ev_mwlumianw __ev_mwlumianw
Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate Negative in Words
d = __ev_mwlumianw (a,b)
// high
temp
0:63
a
0:31
ui
b
0:31
d
0:31
ACC
0:31
- temp
32:63
// low
temp
0:63
a
32:63
ui
b
32:63
d
32:63
ACC
32:63
- temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word unsigned integer elements in
parameters a and b are multiplied. The least significant 32 bits of each product are subtracted from
the contents of the corresponding accumulator word, and the result is placed into parameter d and
the accumulator.
Other registers altered: ACC
Figure 3-489. Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate
Negative in Words (__ev_mwlumianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlumianw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-644 Freescale Semiconductor
SPE2 Operations
__ev_mwlumianw3 __ev_mwlumianw3
Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate Negative in Words 3
operand
d = __ev_mwlumianw3 (a,b,c)
// high
temp
0:63
b
0:31
ui
c
0:31
d
0:31
a
0:31
- temp
32:63
// low
temp
0:63
b
32:63
ui
c
32:63
d
32:63
a
32:63
- temp
32:63
// update accumulator
ACC
0:63
d
0:63
For each word element in the accumulator, the corresponding word unsigned integer elements in
parameters b and c are multiplied. The least significant 32 bits of each product is subtracted from
the contents of the corresponding parameter a word and the result is placed into parameter d and
the accumulator.
Other registers altered: ACC
Figure 3-490. Vector Multiply Word Low Unsigned, Modulo, Integer and Accumulate
Negative in Words 3 op (__ev_mwlumianw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwlumianw3 d,b,c
0 31 32 63
Intermediate product
c
X
a
b
X
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-645
__ev_mwlusiaaw __ev_mwlusiaaw
Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate in Words
d = __ev_mwlusiaaw (a,b)
// high
temp
0:63
a
0:31
ui
b
0:31
temp
0:63
EXTZ(ACC
0:31
) + EXTZ(temp
32:63
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
//low
temp
0:63
a
32:63
ui
b
32:63
temp
0:63
EXTZ(ACC
32:63
) + EXTZ(temp
32:63
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding word unsigned integer elements in
parameters a and b are multiplied, producing a 64-bit product. The least significant 32 bits of each
product are then added to the corresponding word in the accumulator, saturating if overflow
occurs, and the result is placed in parameter d and the accumulator.
If there is an overflow from the addition, the overflow and summary overflow bits are recorded in
the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-491. Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate
in Words (__ev_mwlusiaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlusiaaw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
+ +
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-646 Freescale Semiconductor
SPE2 Operations
__ev_mwlusiaaw3 __ev_mwlusiaaw3
Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate in Words 3 operand
d = __ev_mwlusiaaw3 (a,b,c)
// high
temp
0:63
b
0:31
ui
c
0:31
temp
0:63
EXTZ
64
(a
0:31
) + EXTZ
64
(temp
32:63
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
//low
temp
0:63
b
32:63
ui
c
32:63
temp
0:63
EXTZ
64
(a
32:63
) + EXTZ
64
(temp
32:63
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding word unsigned integer elements in
parameters b and c are multiplied producing a 64-bit product. The least significant 32 bits of each
product is then added to the corresponding word in parameter a saturating if overflow occurs, and
the result is placed into parameter d and the accumulator.
If there is an overflow from the addition, the overflow and summary overflow bits are recorded in
the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-492. Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate in
Words 3 op (__ev_mwlusiaaw3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwlusiaaw3 d,b,c
0 31 32 63
Intermediate product
c
X
a
b
X
+ +
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-647
__ev_mwlusianw __ev_mwlusianw
Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate Negative in Words
d = __ev_mwlusianw (a,b)
// high
temp
0:63
a
0:31
ui
b
0:31
temp
0:63
EXTZ(ACC
0:31
) - EXTZ(temp
32:63
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0x0000_0000, 0x0000_0000, temp
32:63
)
//low
temp
0:63
a
32:63
ui
b
32:63
temp
0:63
EXTZ(ACC
32:63
) - EXTZ(temp
32:63
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0x0000_0000, 0x0000_0000, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding word unsigned integer elements in
parameters a and b are multiplied, producing a 64-bit product. The least significant 32 bits of each
product are then subtracted from the corresponding word in the accumulator, saturating if
underflow occurs, and the result is placed in parameter d and the accumulator.
If there is an underflow from the subtraction, the overflow and summary overflow bits are recorded
in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-493. Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate
Negative in Words (__ev_mwlusianw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwlusianw d,a,b
0 31 32 63
Intermediate product
b
X
Accumulator
a
X
- -
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-648 Freescale Semiconductor
SPE2 Operations
__ev_mwlusianw3 __ev_mwlusianw3
Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate Negative in Words 3
operand
d = __ev_mwlusianw3 (a,b,c)
// high
temp
0:63
b
0:31
ui
c
0:31
temp
0:63
EXTZ
64
(a
0:31
) - EXTZ
64
(temp
32:63
)
ovh temp
31
d
0:31
SATURATE(ovh, 0, 0x0000_0000, 0x0000_0000, temp
32:63
)
//low
temp
0:63
b
32:63
ui
c
32:63
temp
0:63
EXTZ
64
(a
32:63
) - EXTZ
64
(temp
32:63
)
ovl temp
31
d
32:63
SATURATE(ovl, 0, 0x0000_0000, 0x0000_0000, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
For each word element in the accumulator, corresponding word unsigned integer elements in
parameters b and c are multiplied producing a 64-bit product. The least significant 32 bits of each
product is then subtracted from the corresponding word in parameter a saturating if underflow
occurs, and the result is placed into parameter d and the accumulator.
If there is an underflow from the subtraction, the overflow and summary overflow bits are recorded
in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-494. Vector Multiply Word Low Unsigned, Saturate, Integer and Accumulate
Negative in Words 3 op (__ev_mwlusianw3)
0 31 32 63
Intermediate product
c
X
a
b
X
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-649
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evmwlusianw3 d,b,c
SPE2 Programming Interface Manual, Rev. 1.0-2
3-650 Freescale Semiconductor
SPE2 Operations
__ev_mwohgsmf[a] __ev_mwohgsmf[a]
Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional (to Accumulator)
d = __ev_mwohgsmf (a,b) (A = 0)
d = __ev_mwohgsmfa (a,b) (A = 1)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000
else
d
0:63
EXTS
64
(temp
0:47
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The odd word signed fractional elements in parameters a and b are multiplied. The high order 48
bits of the 64-bit fractional product are sign-extended to 64-bits to produce an intermediate product
in 17.47 fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0.
The intermediate product is then placed into parameter d. If A = 1, the result in parameter d is also
placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-495. Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional (to
Accumulator) (__ev_mwohgsmf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmfa d,a,b
0 31 32 63
b
X
a
0 63
d (and accumulator if A=1)
15 16 17
.
S........................S
S
47 0
exts
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-651
__ev_mwohgsmfaa __ev_mwohgsmfaa
Vector Multiply Word Odd High Guarded Signed, Modulo, Fractional and Accumulate
d = __ev_mwohgsmfaa (a,b)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000 + ACC
0:63
else
d
0:63
EXTS
64
(temp
0:47
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The odd word signed fractional elements in parameters a and b are multiplied. The high order 48
bits of the 64-bit product are sign-extended to 64-bits to produce an intermediate product in 17.47
fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0. The
intermediate product is then added to the contents of the accumulator and the result is placed into
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-496. Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional and
Accumulate (__ev_mwohgsmfaa)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
+
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-652 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmfaa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-653
__ev_mwohgsmfan __ev_mwohgsmfan
Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional and Accumulate Negative
d = __ev_mwohgsmfan (a,b)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000 + ACC
0:63
else
d
0:63
ACC
0:63 -
EXTS
64
(temp
0:47
)
// update accumulator
ACC
0:63
d
0:63
The odd word signed fractional elements in parameters a and b are multiplied. The high order 48
bits of the 64-bit product are sign-extended to 64-bits to produce an intermediate product in 17.47
fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0. The
intermediate product is then subtracted from the contents of the accumulator and the result is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-497. Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional and
Accumulate Negative (__ev_mwohgsmfan)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
-
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-654 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmfan d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-655
__ev_mwohgsmfr[a] __ev_mwohgsmfr[a]
Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional, Round (to Accumulator)
d = __ev_mwohgsmfr (a,b) (A = 0)
d = __ev_mwohgsmfra (a,b) (A = 1)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000
else
temp
0:64
EXTS
65
(temp
0:63
)
tempr
0:64
ROUND(temp
0:64
,16)
d
0:63
EXTS
64
(tempr
0:48
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The odd word signed fractional elements in parameters a and b are multiplied. The 64-bit
fractional product is sign-extended to 65 bits, rounded to 49-bits using the current rounding mode
in SPEFSCR, and the 49-bit value is sign-extended to 64-bits to produce a rounded intermediate
product in 17.47 fractional format. If both inputs are -1.0, the intermediate product is represented
as +1.0. The intermediate product is then placed into parameter d. If A = 1, the result in parameter
d is also placed into the accumulator.
Other registers altered: ACC (If A = 1)
Figure 3-498. Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional,
Round (to Accumulator) (__ev_mwohgsmfr[a])
0 31 32 63
b
X
a
0 63
d (and accumulator if A=1)
15 16 17
.
S........................S
S
47 0
exts
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-656 Freescale Semiconductor
SPE2 Operations
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmfr d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmfra d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-657
__ev_mwohgsmfraa __ev_mwohgsmfraa
Vector Multiply Word Odd High Guarded Signed, Modulo, Fractional, Round and Accumulate
d = __ev_mwohgsmfraa (a,b)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
0:63
0x0000_8000_0000_0000 + ACC
0:63
else
temp
0:64
EXTS
65
(temp
0:63
)
tempr
0:64
ROUND(temp
0:64
,16)
d
0:63
EXTS
64
(tempr
0:48
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The odd word signed fractional elements in parameters a and b are multiplied. The 64-bit
fractional product is sign-extended to 65 bits, rounded to 49-bits using the current rounding mode
in SPEFSCR, and the 49-bit value is sign-extended to 64-bits to produce an intermediate product
in 17.47 fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0.
The intermediate product is then added to the contents of the accumulator and the result is placed
into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-499. Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional,
Round and Accumulate (__ev_mwohgsmfraa)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
+
d and Accumulator
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-658 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmfraa d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-659
__ev_mwohgsmfran __ev_mwohgsmfran
Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional, Round and Accumulate
Negative
d = __ev_mwohgsmfran (a,b)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
0:63
ACC
0:63
- 0x0000_8000_0000_0000
else
temp
0:64
EXTS
65
(temp
0:63
)
tempr
0:64
ROUND(temp
0:64
,16)
d
0:63
ACC
0:63
- EXTS
64
(tempr
0:48
)
ACC
0:63
d
0:63
The odd word signed fractional elements in parameters a and b are multiplied. The 64-bit
fractional product is sign-extended to 65 bits, rounded to 49-bits using the current rounding mode
in SPEFSCR, and the 49-bit value is sign-extended to 64-bits to produce an intermediate product
in 17.47 fractional format. If both inputs are -1.0, the intermediate product is represented as +1.0.
The intermediate product is then subtracted from the contents of the accumulator and the result is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-500. Vector Multiply Word Odd, High, Guarded, Signed, Modulo, Fractional,
Round and Accumulate Negative (__ev_mwohgsmfran)
0 31 32 63
b
X
a
0 63
intermediate product
15 16 17
.
S........................S
S
47 0
exts
Accumulator
-
d and Accumulator
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-660 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwohgsmfran d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-661
__ev_mwsmf[a] __ev_mwsmf[a]
Vector Multiply Word Signed, Modulo, Fractional (to Accumulator)
d = __ev_mwsmf (a,b) (A = 0)
d = __ev_mwsmfa (a,b) (A = 1)
d
0:63
a
32:63
sf
b
32:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The corresponding low word signed fractional elements in parameters a and b are multiplied. The
product is placed into parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-501. Vector Multiply Word Signed, Modulo, Fractional (to Accumulator)
(__ev_mwsmf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmfa d,a,b
0 31 32 63
d (and accumulator if
b
X
a
__ev_mwsmfa)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-662 Freescale Semiconductor
SPE2 Operations
__ev_mwsmfaa __ev_mwsmfaa
Vector Multiply Word Signed, Modulo, Fractional and Accumulate
d = __ev_mwsmfaa (a,b)
temp
0:63
a
32:63
sf
b
32:63
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low word signed fractional elements in parameters a and b are multiplied. The
intermediate product is added to the contents of the 64-bit accumulator and the result is placed in
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-502. Vector Multiply Word Signed, Modulo, Fractional and Accumulate
(__ev_mwsmfaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmfaa d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-663
__ev_mwsmfan __ev_mwsmfan
Vector Multiply Word Signed, Modulo, Fractional and Accumulate Negative
d = __ev_mwsmfan (a,b)
temp
0:63
a
32:63
sf
b
32:63
d
0:63
ACC
0:63
- temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low word signed fractional elements in parameters a and b are multiplied. The
intermediate product is subtracted from the contents of the accumulator, and the result is placed in
parameter d and the accumulator.
Other registers altered: ACC
Figure 3-503. Vector Multiply Word Signed, Modulo, Fractional, and Accumulate Negative
(__ev_mwsmfan)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmfan d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-664 Freescale Semiconductor
SPE2 Operations
__ev_mwsmi[a] __ev_mwsmi[a]
Vector Multiply Word Signed, Modulo, Integer (to Accumulator)
d = __ev_mwsmi (a,b) (A = 0)
d = __ev_mwsmia (a,b) (A = 1)
d
0:63
a
32:63
si
b
32:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The low word signed integer elements in parameters a and b are multiplied. The product is placed
into the parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-504. Vector Multiply Word Signed, Modulo, Integer (to Accumulator)
(__ev_mwsmi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmia d,a,b
0 31 32 63
d (and accumulator if __ev_mwsmia
b
X
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-665
__ev_mwsmiaa __ev_mwsmiaa
Vector Multiply Word Signed, Modulo, Integer and Accumulate
d = __ev_mwsmiaa (a,b)
temp
0:63
a
32:63
si
b
32:63
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The low word signed integer elements in parameters a and b are multiplied. The intermediate
product is added to the contents of the 64-bit accumulator, and the result is placed into parameter
d and the accumulator.
Other registers altered: ACC
Figure 3-505. Vector Multiply Word Signed, Modulo, Integer and Accumulate
(__ev_mwsmiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmiaa d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-666 Freescale Semiconductor
SPE2 Operations
__ev_mwsmian __ev_mwsmian
Vector Multiply Word Signed, Modulo, Integer and Accumulate Negative
d = __ev_mwsmian (a,b)
temp
0:63
a
32:63
si
b
32:63
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The corresponding low word signed integer elements in parameters a and b are multiplied. The
intermediate product is subtracted from the contents of the 64-bit accumulator and the result is
placed into parameter d and the accumulator.
Other registers altered: ACC
Figure 3-506. Vector Multiply Word Signed, Modulo, Integer and Accumulate Negative
(__ev_mwsmian)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwsmian d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-667
__ev_mwssf[a] __ev_mwssf[a]
Vector Multiply Word Signed, Saturate, Fractional (to Accumulator)
d = __ev_mwssf (a,b) (A = 0)
d = __ev_mwssfa (a,b) (A = 1)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) &(b
32:63
= 0x8000_0000) then
d
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
mov 1
else
d
0:63
temp
0:63
mov 0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0
SPEFSCR
OV
mov
SPEFSCR
SOV
SPEFSCR
SOV
| mov
The low word signed fractional elements in parameters a and b are multiplied. The 64-bit product
is placed into parameter d. If both inputs are -1.0, the result saturates to the largest positive signed
fraction, and the overflow and summary overflow bits are recorded in the SPEFSCR.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: SPEFSCR ACC (if A = 1)
Figure 3-507. Vector Multiply Word Signed, Saturate, Fractional (to Accumulator)
(__ev_mwssf[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwssf d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwssfa d,a,b
0 31 32 63
d (and accumulator if __ev_mwssf
b
X
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-668 Freescale Semiconductor
SPE2 Operations
__ev_mwssfaa __ev_mwssfaa
Vector Multiply Word Signed, Saturate, Fractional and Accumulate
d = __ev_mwssfaa (a,b)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
temp
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
mov 1
else
mov 0
temp
0:64
EXTS(ACC
0:63
) + EXTS(temp
0:63
)
ov (temp
0
temp
1
)
d
0:63
= SATURATE(ov, temp
0
, 0x8000000000000000, 0x7FFFFFFFFFFFFFFF, temp
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0 ; SPEFSCR
OV
mov | ov ; SPEFSCR
SOV
SPEFSCR
SOV
| mov | ov ;
The low word signed fractional elements in parameters a and b are multiplied, producing a 64-bit
product. If both inputs are -1.0, the product saturates to the largest positive signed fraction
(0x7FFFFFFFFFFFFFFF). The 64-bit product is then added to the accumulator to form an
intermediate sum. If the intermediate sum has overflowed, the appropriate saturation value
(0x7FFFFFFFFFFFFFFF if positive overflow or 0x8000000000000000 if negative
overflow) is placed into the accumulator word and the corresponding parameter d word.
Otherwise, the low 64 bits of the intermediate sum are placed into the accumulator word and the
corresponding parameter d word. The overflow and summary overflow bits are recorded to
indicate occurrence of saturation either the multiply or the addition.
If there is an overflow from the multiply, the overflow and summary overflow bits are recorded in
the SPEFSCR.
Note: There is no saturation on the addition with the accumulator.
Other registers altered: SPEFSCR ACC
Figure 3-508. Vector Multiply Word Signed, Saturate, Fractional and Accumulate
(__ev_mwssfaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwssfaa d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-669
__ev_mwssfan __ev_mwssfan
Vector Multiply Word Signed, Saturate, Fractional and Accumulate Negative
d = __ev_mwssfan (a,b)
temp
0:63
a
32:63
sf
b
32:63
if (a
32:63
= 0x8000_0000) & (b
32:63
= 0x8000_0000) then
temp
0:63
0x7FFF_FFFF_FFFF_FFFF //saturate
mov 1
else
mov 0
temp
0:64
EXTS(ACC
0:63
) - EXTS(temp
0:63
)
ov (temp
0
temp
1
)
d
0:63
= SATURATE(ov, temp
0
, 0x8000000000000000, 0x7FFFFFFFFFFFFFFF, temp
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0 ; SPEFSCR
OV
mov | ov ; SPEFSCR
SOV
SPEFSCR
SOV
| mov | ov
The low word signed fractional elements in parameters a and b are multiplied producing a 64-bit
product. If both inputs are -1.0, the product saturates to the largest positive signed fraction
(0x7FFFFFFFFFFFFFFF). The 64-bit product is then subtracted from the accumulator to form an
intermediate difference. If the intermediate difference has overflowed, the appropriate saturation
value (0x7FFFFFFFFFFFFFFF if positive overflow or 0x8000000000000000 if negative
overflow) is placed into the accumulator word and the corresponding parameter d word.
Otherwise, the low 64 bits of the intermediate difference are placed into the accumulator word and
the corresponding parameter d word. The overflow and summary overflow bits are recorded to
indicate occurrence of saturation either the multiply or the subtraction.
Other registers altered: SPEFSCR ACC
Figure 3-509. Vector Multiply Word Signed, Saturate, Fractional and Accumulate Negative
(__ev_mwssfan)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwssfan d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-670 Freescale Semiconductor
SPE2 Operations
__ev_mwssiaa __ev_mwssiaa
Vector Multiply Word Signed, Saturate, Integer and Accumulate
d = __ev_mwssiaa (a,b)
temp1
0:63
a
32:63
si
b
32:63
temp2
0:64
EXTS(ACC
0:63
) + EXTS(temp1
0:63
)
ov (temp2
0
temp2
1
)
d
0:63
= SATURATE(ov, temp2
0
, 0x8000000000000000, 0x7FFFFFFFFFFFFFFF, temp2
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The low word signed integer elements in parameters a and b are multiplied producing a 64-bit
product. The 64-bit product is then added to the accumulator to form an intermediate sum. If the
intermediate sum has overflowed, the appropriate saturation value (0x7FFFFFFFFFFFFFFF if
positive overflow or 0x8000000000000000 if negative overflow) is placed into the
accumulator word and the corresponding parameter d word. Otherwise, the low 64 bits of the
intermediate sum are placed into the accumulator word and the corresponding parameter d word.
The overflow and summary overflow bits are recorded to indicate occurrence of saturation in the
addition.
Other registers altered: SPEFSCR ACC
Figure 3-510. Vector Multiply Word Signed, Saturate, Integer and Accumulate
(__ev_mwssiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwssiaa d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
+
d and Accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-671
__ev_mwssian __ev_mwssian
Vector Multiply Word Signed, Saturate, Integer and Accumulate Negative
d = __ev_mwssian (a,b)
temp1
0:63
a
32:63
si
b
32:63
temp2
0:64
EXTS(ACC
0:63
) - EXTS(temp1
0:63
)
ov (temp2
0
temp2
1
)
d
0:63
= SATURATE(ov, temp2
0
, 0x8000000000000000, 0x7FFFFFFFFFFFFFFF, temp2
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The low word signed integer elements in parameters a and b are multiplied producing a 64-bit
product. The 64-bit product is then subtracted from the accumulator to form an intermediate
difference. If the intermediate difference has overflowed, the appropriate saturation value
(0x7FFFFFFFFFFFFFFF if positive overflow or 0x8000000000000000 if negative
overflow) is placed into the accumulator word and the corresponding parameter d word.
Otherwise, the low 64 bits of the intermediate difference are placed into the accumulator word and
the corresponding parameter d word. The overflow and summary overflow bits are recorded to
indicate occurrence of saturation in the addition.
Other registers altered: SPEFSCR ACC
Figure 3-511. Vector Multiply Word Signed, Saturate, Integer and Accumulate Negative
(__ev_mwssian)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwssian d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
d and Accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-672 Freescale Semiconductor
SPE2 Operations
__ev_mwssiw __ev_mwssiw
Vector Multiply Word Signed, Saturate, Integer Word
d = __ev_mwssiw (a,b)
// high
temp0
0:63
a
0:31
si
b
0:31
if (temp0
0:63
> 0x0000_0000_7FFF_FFFF) | (temp0
0:63
< 0xFFFF_FFFF_FFFF_8000) then
movh 1
temp0
32:63
SATURATE(movh, temp0
0
, 0x8000_0000, 0x7FFF_FFFF, temp0
32:63
)
else
movh 0
// low
temp1
0:63
a
0:31
si
b
0:31
if (temp1
0:63
> 0x0000_0000_7FFF_FFFF) | (temp1
0:63
< 0xFFFF_FFFF_FFFF_8000) then
movl 1
temp1
32:63
SATURATE(movl, temp1
0
, 0x8000_0000, 0x7FFF_FFFF, temp1
32:63
)
else
movl 0
d
0:31
temp0
32:63 ;
d
32:63
temp1
32:63
// update SPEFSCR
SPEFSCR
OVH
movh; SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh; SPEFSCR
SOV
SPEFSCR
SOV
| movl
For each word element in the destination, corresponding signed word integer elements in
parameters a and b are multiplied producing a 64-bit intermediate product. The least significant
32 bits of each product is then placed in the corresponding word in parameter d, saturating if
overflow occurs.
If there is an overflow, the overflow and summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-512. Vector Multiply Word Signed, Saturate, Integer Word (__ev_mwssiw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwssiw d,a,b
0 31 32 63
Intermediate product
b
X
sat
a
X
d
sat
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-673
__ev_mwumi[a] __ev_mwumi[a]
Vector Multiply Word Unsigned, Modulo, Integer (to Accumulator)
d = __ev_mwumi (a,b) (A = 0)
d = __ev_mwumia (a,b) (A = 1)
d
0:63
a
32:63
ui
b
32:63
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The low word unsigned integer elements in parameters a and b are multiplied to form a 64-bit
product that is placed into parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A = 1)
Figure 3-513. Vector Multiply Word Unsigned, Modulo, Integer (to Accumulator)
(__ev_mwumi[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwumi d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwumia d,a,b
0 31 32 63
d (an accumulator if __ev_mwuni
b
X
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-674 Freescale Semiconductor
SPE2 Operations
__ev_mwumiaa __ev_mwumiaa
Vector Multiply Word Unsigned, Modulo, Integer and Accumulate
d = __ev_mwumiaa (a,b)
temp
0:63
a
32:63
ui
b
32:63
d
0:63
ACC
0:63
+ temp
0:63
// update accumulator
ACC
0:63
d
0:63
The low word unsigned integer elements in parameters a and b are multiplied. The intermediate
product is added to the contents of the 64-bit accumulator, and the resulting value is placed into
the accumulator and into parameter d.
Other registers altered: ACC
Figure 3-514. Vector Multiply Word Unsigned, Modulo, Integer and Accumulate
(__ev_mwumiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwumiaa d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
+
d and accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-675
__ev_mwumian __ev_mwumian
Vector Multiply Word Unsigned, Modulo, Integer and Accumulate Negative
d = __ev_mwumian (a,b)
temp
0:63
a
32:63
ui
b
32:63
d
0:63
ACC
0:63
temp
0:63
// update accumulator
ACC
0:63
d
0:63
The low word unsigned integer elements in parameters a and b are multiplied. The intermediate
product is subtracted from the contents of the 64-bit accumulator, and the resulting value is placed
into the accumulator and into parameter d.
Other registers altered: ACC
Figure 3-515. Vector Multiply Word Unsigned, Modulo, Integer and Accumulate Negative
(__ev_mwumian)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwumian d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
d and accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-676 Freescale Semiconductor
SPE2 Operations
__ev_mwusiaa __ev_mwusiaa
Vector Multiply Word Unsigned, Saturate, Integer and Accumulate
d = __ev_mwusiaa (a,b)
temp1
0:63
a
32:63
ui
b
32:63
temp2
0:64
EXTZ(ACC
0:63
) + EXTZ(temp1
0:63
)
ov temp2
0
d
0:63
= SATURATE(ov, temp2
0
, 0xFFFFFFFFFFFFFFFF, 0xFFFFFFFFFFFFFFFF, temp2
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The low word unsigned integer elements in parameters a and b are multiplied producing a 64-bit
product. The 64-bit product is then added to the accumulator to form an intermediate sum. If the
intermediate sum has overflowed, the saturation value 0xFFFFFFFFFFFFFFFF is placed into
the accumulator word and the corresponding parameter d word. Otherwise, the low 64 bits of the
intermediate sum are placed into the accumulator word and the corresponding parameter d word.
The overflow and summary overflow bits are recorded to indicate occurrence of saturation in the
addition.
Other registers altered: SPEFSCR ACC
Figure 3-516. Vector Multiply Word Unsigned, Saturate, Integer and Accumulate
(__ev_mwusiaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwusiaa d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
+
d and Accumulator
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-677
__ev_mwusian __ev_mwusian
Vector Multiply Word Unsigned, Saturate, Integer and Accumulate Negative
d = __ev_mwusian (a,b)
temp1
0:63
a
32:63
ui
b
32:63
temp2
0:64
EXTZ(ACC
0:63
) - EXTZ(temp1
0:63
)
ov temp2
0
d
0:63
= SATURATE(ov, 0, 0x0000000000000000, 0x0000000000000000, temp2
1:64
)
// update accumulator
ACC
0:63
d
0:63
// update SPEFSCR
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The low word signed integer elements in parameters a and b are multiplied producing a 64-bit
product. The 64-bit product is then subtracted from the accumulator to form an intermediate
difference. If the intermediate difference has underflowed, the saturation value
0x0000000000000000 is placed into the accumulator word and the corresponding parameter
d word. Otherwise, the low 64 bits of the intermediate difference are placed into the accumulator
word and the corresponding parameter d word. The overflow and summary overflow bits are
recorded to indicate occurrence of saturation in the subtraction.
Other registers altered: SPEFSCR ACC
Figure 3-517. Vector Multiply Word Unsigned, Saturate, Integer and Accumulate Negative
(__ev_mwusian)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwusian d,a,b
0 31 32 63
Intermediate product
Accumulator
b
X
d and Accumulator
a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-678 Freescale Semiconductor
SPE2 Operations
__ev_mwusiw __ev_mwusiw
Vector Multiply Word Unsigned, Saturate, Integer Word
d = __ev_mwusiw (a,b)
// high
temp0
0:63
a
0:31
ui
b
0:31
if (temp0
0:63
> 0x0000_0000_FFFF_FFFF) then
movh 1
temp0
32:63
SATURATE(movh, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp0
32:63
)
else
movh 0
// low
temp1
0:63
a
0:31
ui
b
0:31
if (temp1
0:63
> 0x0000_0000_FFFF_FFFF) then
movl 1
temp1
32:63
SATURATE(movl, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, temp1
32:63
)
else
movl 0
d
0:31
temp0
32:63 ;
d
32:63
temp1
32:63
// update SPEFSCR
SPEFSCR
OVH
movh; SPEFSCR
OV
movl
SPEFSCR
SOVH
SPEFSCR
SOVH
| movh; SPEFSCR
SOV
SPEFSCR
SOV
| movl
For each word element in the destination, corresponding word unsigned word integer elements in
parameters a and b are multiplied producing a 64-bit intermediate product. The least significant
32 bits of each product is then placed in the corresponding word in parameter d, saturating if
overflow occurs.
If there is an overflow, the overflow and summary overflow bits are recorded in the SPEFSCR.
Other registers altered: SPEFSCR ACC
Figure 3-518. Vector Multiply Word Unsigned, Saturate, Integer Word (__ev_mwusiw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evmwusiw d,a,b
0 31 32 63
Intermediate product
b
X
sat
a
X
d
sat
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-679
__ev_nand __ev_nand
Vector NAND
d = __ev_nand (a,b)
d
0:31
(a
0:31
& b
0:31
)// Bitwise NAND
d
32:63
(a
32:63
& b
32:63
) // Bitwise NAND
Each element of parameters a and b are bitwise NANDed. The result is placed in the
corresponding element of parameter d.
Figure 3-519. Vector NAND (__ev_nand)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evnand d,a,b
0 31 32 63
a
b
NAND NAND
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-680 Freescale Semiconductor
SPE2 Operations
__ev_neg __ev_neg
Vector Negate
d = __ev_neg (a)
d
0:31
NEG(a
0:31
)
d
32:63
NEG(a
32:63
)
The negative of each element of parameter a is placed in parameter d. The negative of
0x8000_0000 (most negative number) returns 0x8000_0000. No overflow is detected.
Figure 3-520. Vector Negate (__ev_neg)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evneg d,a
0 31 32 63
A
NEG NEG
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-681
__ev_negb __ev_negb
Vector Negate Byte
d = __ev_negb (a)
d
0:7
NEG(a
0:7
)
d
8:15
NEG(a
8:15
)
d
16:23
NEG(a
16:23
)
d
24:31
NEG(a
24:31
)
d
32:39
NEG(a
32:39
)
d
40:47
NEG(a
40:47
)
d
48:55
NEG(a
48:55
)
d
56:63
NEG(a
56:63
)
The negated value of each byte element of parameter a is placed in the corresponding element of
parameter d. An initial value of 0x80 (most negative number) returns 0x80. No overflow is
detected.
Figure 3-521. Vector Negate Byte (evnegb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegb d,a
0 32 63
a
24 16 8 40 48 56
NEG
0 32 63
d
24 16 8 40 48 56
NEG NEG NEG NEG NEG NEG NEG
SPE2 Programming Interface Manual, Rev. 1.0-2
3-682 Freescale Semiconductor
SPE2 Operations
__ev_negbo __ev_negbo
Vector Negate Byte Odd
d = __ev_negbo (a)
d
0:7
a
0:7
d
8:15
NEG(a
8:15
)
d
16:23
a
16:23
d
24:31
NEG(a
24:31
)
d
32:39
a
32:39
d
40:47
NEG(a
40:47
)
d
48:55
a
48:55
d
56:63
NEG(a
56:63
)
The even byte elements of parameter a are placed into the corresponding bytes of parameter d
unchanged. The negated values of the odd byte elements of parameter a are placed into the
corresponding bytes of parameter d. An initial value of 0x80 (most negative number) returns 0x80.
No overflow is detected.
Figure 3-522. Vector Negate Byte Odd (__ev_negbo)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegbo d,a
0 32 63
a
24 16 8 40 48 56
0 32 63
d
24 16 8 40 48 56
NEG NEG NEG NEG
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-683
__ev_negbos __ev_negbos
Vector Negate Byte Odd and Saturate
d = __ev_negbos (a)
ovh 0
d
0:7
a
0:7
if (a
8:15
= 0x80) then
d
8:15
0x7F
ovh 1
else d
8:15
NEG(a
8:15
)
endif
d
16:23
a
16:23
if (a
24:31
= 0x80) then
d
24:31
0x7F
ovh 1
else d
24:31
NEG(a
24:31
)
endif
ovl 0
d
32:39
a
32:39
if (a
40:47
= 0x80) then
d
40:47
0x7F
ovl 1
else d
40:47
NEG(a
40:47
)
endif
d
48:55
a
48:55
if (a
56:63
= 0x80) then
d
56:63
0x7F
ovl 1
else d
56:63
NEG(a
56:63
)
endif
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The even byte elements of parameter a are placed into the corresponding bytes of parameter d
unchanged. The negated values of the odd byte elements of a are placed into the corresponding
bytes of parameter d. The negated value of 0x80 (most negative number) returns 0x7F. Any
overflow is reported in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-523. Vector Negate Byte Odd and Saturate (__ev_negbos)
0 32 63
a
24 16 8 40 48 56
0 32 63
d
24 16 8 40 48 56
NEG NEG NEG NEG
&SAT &SAT &SAT &SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-684 Freescale Semiconductor
SPE2 Operations
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegbos d,a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-685
__ev_negbs __ev_negbs
Vector Negate Byte and Saturate
d = __ev_negbs (a)
The negated value of each signed byte element of parameter a is placed in parameter d. The
negated value of 0x80 (most negative number) returns 0x7F. Any overflow is reported in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-524. Vector Negate Byte and Saturate (__ev_negbs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegbs d,a
if (a
0:7
= 0x80) then
d
0:7
0x7F
ovh 1
else
d
0:7
NEG(a
0:7
)
ovh 0
if (a
8:15
= 0x80) then
d
8:15
0x7F
ovh 1
else d
8:15
NEG(a
8:15
)
if (a
16:23
= 0x80) then
d
16:23
0x7F
ovh 1
else d
16:23
NEG(a
16:23
)
if (a
24:31
= 0x80) then
d
24:31
0x7F
ovh 1
else d
24:31
NEG(a
24:31
)
if (a
32:39
= 0x80) then
d
32:39
0x7F
ovl1
else
d
32:39
NEG(a
32:39
)
ovl 0
if (a
40:47
= 0x80) then
d
40:47
0x7F
ovl 1
else d
40:47
NEG(a
40:47
)
if (a
48:55
= 0x80) then
d
48:55
0x7F
ovl 1
else d
48:55
NEG(a
48:55
)
if (a
56:63
= 0x80) then
d
56:63
0x7F
ovl 1
else d
56:63
NEG(a
56:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
0 32 63
a
24 16 8 40 48 56
NEG
0 32 63
d
24 16 8 40 48 56
NEG NEG NEG NEG NEG NEG NEG
&SAT &SAT &SAT &SAT &SAT &SAT &SAT &SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-686 Freescale Semiconductor
SPE2 Operations
__ev_negd __ev_negd
Vector Negate Doubleword
d = __ev_negd (a)
d
0:63
NEG(a
0:63
)
The negated value of the doubleword in parameter a is placed into parameter d. The negative of
0x8000_0000_0000_0000 (most negative number) returns 0x8000_0000_0000_0000. No
overflow is detected.
Figure 3-525. Vector Negate Doubleword (__ev_negd)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegd d,a
0 63
a
NEG
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-687
__ev_negds __ev_negds
Vector Negate Doubleword and Saturate
d = __ev_negds (a)
if (a
0:63
= 0x8000_0000_0000_0000) then
d
0:63
0x7FFF_FFFF_FFFF_FFFF
ov 1
else
d
0:63
NEG(a
0:63
)
ov 0
endif
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The negated value of the doubleword in parameter a is placed into parameter d. The negative of
0x8000_0000_0000_0000 (most negative number) returns 0x7FFF_FFFF_FFFF_FFFF. Any
overflow is reported in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-526. Vector Negate Doubleword and Saturate (__ev_negds)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegds d,a
0 63
a
NEG&SAT
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-688 Freescale Semiconductor
SPE2 Operations
__ev_negh __ev_negh
Vector Negate Half Word
d = __ev_negh (a)
d
0:15
NEG(a
0:15
)
d
16:31
NEG(a
16:31
)
d
32:47
NEG(a
32:47
)
d
48:63
NEG(a
48:63
)
The negative of each element of parameter a is placed in parameter d. The negative of 0x8000
(most negative number) returns 0x8000. No overflow is detected.
Figure 3-527. Vector Negate Half Word (__ev_negh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegh d,a
d
0 31 32 63
a
NEG NEG NEG NEG
15 16 47 48 0 31 32 63 15 16 47 48
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-689
__ev_negho __ev_negho
Vector Negate Halfwords Odd
d = __ev_negho (a)
d
0:15
a
0:15
d
16:31
NEG(a
16:31
)
d
32:47
a
32:47
d
48:63
NEG(a
48:63
)
The even halfword elements of parameter a are placed into the corresponding halfwords of
parameter d unchanged. The negated values of the odd halfword elements of parameter a are
placed into the corresponding halfwords of parameter d. The negative of 0x8000 (most negative
number) returns 0x8000. No overflow is detected.
Figure 3-528. Vector Negate Halfwords Odd (__ev_negho)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegho d,a
d
0 31 32 63
a
NEG NEG
15 16 47 48 0 31 32 63 15 16 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-690 Freescale Semiconductor
SPE2 Operations
__ev_neghos __ev_neghos
Vector Negate Halfwords Odd and Saturate
d = __ev_neghos (a)
d
0:15
a
0:15
if (a
16:31
= 0x8000) then
d
16:31
0x7FFF
ovh 1
else
d
16:31
NEG(a
16:31
)
ovh 0
endif
d
32:47
a
32:47
if (a
48:63
= 0x8000) then
d
48:63
0x7FFF
ovl 1
else
d
48:63
NEG(a
48:63
)
ovl 0
endif
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The even halfword elements of parameter a are placed into the corresponding halfwords of
parameter d unchanged. The negated values of the odd halfword elements of parameter a are
placed into the corresponding halfwords of parameter d. The negative of 0x8000 (most negative
number) returns 0x7FFF. Any overflow is reported in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-529. Vector Negate Halfwords Odd and Saturate (__ev_neghos)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evneghos d,a
d
0 31 32 63
NEG&SAT NEG&SAT
15 16 47 48 0 31 32 63
a
15 16 47 48
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-691
__ev_neghs __ev_neghs
Vector Negate Half Word and Saturate
d = __ev_neghs (a)
if (a
0:15
= 0x8000) then
d
0:15
0x7FFF
ovh 1
else
d
0:15
NEG(a
0:15
)
ovh 0
endif
if (a
16:31
= 0x8000) then
d
16:31
0x7FFF
ovh 1
else d
16:31
NEG(a
16:31
)
if (a
32:47
= 0x8000) then else
d
32:47
0x7FFF
ovl1
else
d
32:47
NEG(a
32:47
)
ovl 0
endif
if (a
48:63
= 0x8000) then
d
48:63
0x7FFF
ovl 1
else d
48:63
NEG(a
48:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The negative of each half word element of parameter a is placed in parameter d. The negative of
0x8000 (most negative number) returns 0x7FFF. Any overflow is reported in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-530. Vector Negate Half Word and Saturate (__ev_neghs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evneghs d,a
d
0 31 32 63
a
NEG&SAT NEG&SAT NEG&SAT NEG&SAT
15 16 47 48 0 31 32 63 15 16 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-692 Freescale Semiconductor
SPE2 Operations
__ev_negs __ev_negs
Vector Negate (Word) and Saturate
d = __ev_negs (a)
if (a
0:31
= 0x8000_0000) then
d
0:31
0x7FFF_FFFF
ovh 1
else
d
0:31
NEG(a
0:31
)
ovh 0
endif
if (a
32:63
= 0x8000_0000) then
d
32:63
0x7FFF_FFFF
ovl 1
else
d
32:63
NEG(a
32:63
)
ovl 0
endif
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The negative of each word element of parameter a is placed in parameter d. The negative of
0x8000_0000 (most negative number) returns 0x7FFF_FFFF. Any overflow is reported in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-531. Vector Negate (Word) and Saturate (__ev_negs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegs d,a
0 31 32 63
a
NEG&SAT NEG&SAT
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-693
__ev_negwo __ev_negwo
Vector Negate Word Odd
d = __ev_negwo (a)
d
0:31
a
0:31
d
32:63
NEG(a
32:63
)
The value of the even word element of parameter a is placed into the even word of parameter d
unchanged, and the negated value of the odd word element of parameter a is placed into the odd
word of parameter d. The negative of 0x8000_0000 (most negative number) returns 0x8000_0000.
No overflow is detected.
Figure 3-532. Vector Negate Word Odd (__ev_negwo)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegwo d,a
0 31 32 63
a
NEG
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-694 Freescale Semiconductor
SPE2 Operations
__ev_negwos __ev_negwos
Vector Negate Word Odd and Saturate
d = __ev_negwos (a)
d
0:31
a
0:31
if (a
32:63
= 0x8000_0000) then
d
32:63
0x7FFF_FFFF
ovl 1
else
d
32:63
NEG(a
32:63
)
ovl 0
endif
SPEFSCR
OVH
0
SPEFSCR
OV
ovl
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The value of the even word element of parameter a is placed into the even word of parameter d
unchanged, and the negated value of the odd word element of parameter a is placed into the odd
word of parameter d. The negative of 0x8000_0000 (most negative number) returns
0x7FFF_FFFF. Any overflow is reported in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-533. Vector Negate Word Odd and Saturate (__ev_negwos)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evnegwos d,a
0 31 32 63
a
NEG&SAT
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-695
__ev_nor __ev_nor
Vector NOR
d = __ev_nor (a,b)
d
0:31
(a
0:31
| b
0:31
) // Bitwise NOR
d
32:63
(a
32:63
| b
32:63
) // Bitwise NOR
Each element of parameters a and b is bitwise NORed. The result is placed in the corresponding
element of parameter d.
NOTE
Use evnand or evnor for evnot.
Figure 3-534. Vector NOR (__ev_nor)
Simplified mnemonic: evnot d,a performs a complement register.
evnot d,a equivalent to evnor d,a,a
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evnor d,a,b
0 31 32 63
a
b
NOR NOR
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-696 Freescale Semiconductor
SPE2 Operations
__ev_or __ev_or
Vector OR
d = __ev_or (a,b)
d
0:31
a
0:31
| b
0:31
//Bitwise OR
d
32:63
a
32:63
| b
32:63
// Bitwise OR
Each element of parameters a and b is bitwise ORed. The result is placed in the corresponding
element of parameter d.
Figure 3-535. Vector OR (__ev_or)
Simplified mnemonic: evmr d,a handles moving of the full 64-bit SPE register.
evmr d,a equivalent to evor d,a,a
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evor d,a,b
0 31 32 63
a
b
OR OR
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-697
__ev_orc __ev_orc
Vector OR with Complement
d = __ev_orc (a,b)
d
0:31
a
0:31
| (b
0:31
) // Bitwise ORC
d
32:63
a
32:63
| (b
32:63
) // Bitwise ORC
Each element of parameter a is bitwise ORed with the complement of parameter b. The result is
placed in the corresponding element of parameter d.
Figure 3-536. Vector OR with Complement (__ev_orc)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evorc d,a,b
0 31 32 63
a
b
OR OR
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-698 Freescale Semiconductor
SPE2 Operations
__ev_perm __ev_perm
Vector Permute
d = __ev_perm (a,b)
sa b
0:7
sb b
8:15
sc b
16:23
sd b
24:31
se b
32:39
sf b
40:47
sg b
48:55
sh b
56:63
if (sa <
ui
8) then d
0:7
a
a*8:(a*8)+7
elseif (sa
0
=1) then d
0:7
0xFF else d
0:7
0x00
if (sb <
ui
8) then d
8:15
a
b*8:(b*8)+7
elseif (sb
0
=1) then d
8:15
0xFF else d
8:15
0x00
if (sc <
ui
8) then d
16:23
a
c*8:(c*8)+7
elseif (sc
0
=1) then d
16:23
0xFF else d
16:23
0x00
if (sd <
ui
8) then d
24:31
a
d*8:(d*8)+7
elseif (sd
0
=1) then d
24:31
0xFF else d
24:31
0x00
if (se <
ui
8) then d
32:39
a
e*8:(e*8)+7
elseif (se
0
=1) then d
32:39
0xFF else d
32:39
0x00
if (sf <
ui
8) then d
40:47
a
f*8:(f*8)+7
elseif (sf
0
=1) then d
40:47
0xFF else d
40:47
0x00
if (sg <
ui
8) then d
48:55
a
g*8:(g*8)+7
elseif (sg
0
=1) then d
48:23
0xFF else d
48:55
0x00
if (sh <
ui
8) then d
56:63
a
h*8:(h*8)+7
elseif (sh
0
=1) then d
16:23
0xFF else d
56:63
0x00
The contents of parameter b are used as a select vector. For each byte in the destination vector, a
byte is selected from either parameter a, a constant of 0xFF, or a constant of 0x00 by the
corresponding byte of the select vector. The selected byte values are placed into parameter d.
Figure 3-537. Vector Permute (__ev_perm)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evperm d,a,b
63
d 56
0 8 16 24 32 40 48 56
63
b 56
0 8 16 24 32 40 48 56
10 03 05 03 FE 06 14 01
63
a
0 8 16 24 32 40 48 56
7 6 5 4 3 2 1 0
0x00 0x00 0xFF
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-699
__ev_perm2 __ev_perm2
Vector Permute (form 2)
d = __ev_perm2 (a,b,c)
temp
0:127
b
0:63
| | c
0:63
tempctl
0:63
a
0:63
sa tempctl
0:7
sb tempctl
8:15
sc tempctl
16:23
sd tempctl
24:31
se tempctl
32:39
sf tempctl
40:47
sg tempctl
48:55
sh tempctl
56:63
if (sa <
ui
16) then d
0:7
temp
sa*8:(sa*8)+7
else d
0:7
0x00
if (sb <
ui
16) then d
8:15
temp
sb*8:(sb*8)+7
else d
8:15
0x00
if (sc <
ui
16) then d
16:23
temp
sc*8:(sc*8)+7
else d
16:23
0x00
if (sd <
ui
16) then d
24:31
temp
sd*8:(sd*8)+7
else d
24:31
0x00
if (se <
ui
16) then d
32:39
temp
se*8:(se*8)+7
else d
32:39
0x00
if (sf <
ui
16) then d
40:47
temp
sf*8:(sf*8)+7
else d
40:47
0x00
if (sg <
ui
16) then d
48:55
temp
sg*8:(sg*8)+7
else d
48:55
0x00
if (sh <
ui
16) then d
56:63
temp
sh*8:(sh*8)+7
else d
56:63
0x00
The contents of parameters b and c are concatenated into a 128-bit source vector consisting of
bytes 0-15. The contents of parameter a are copied to a temporary select vector. For each byte in
the destination vector, a byte is selected from either the source vector or a constant of 0x00 by the
corresponding byte of the select vector. The selected byte values are placed into parameter d.
Figure 3-538. Vector Permute (form 2) (__ev_perm2)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evperm2 d,b,c
63
d 56
0 8 16 24 32 40 48 56
63
a 56
0 8 16 24 32 40 48 56
10 0C 05 03 0E 0A 14 01
63
b
0 8 16 24 32 40 48 56
7 6 5 4 3 2 1 0
63
c 56
0 8 16 24 32 40 48 56
F E D C B A 9 8
0x00 0x00
SPE2 Programming Interface Manual, Rev. 1.0-2
3-700 Freescale Semiconductor
SPE2 Operations
__ev_perm3 __ev_perm3
Vector Permute (form 3)
d = __ev_perm3 (a,b,c)
temp
0:127
a
0:63
| | c
0:63
tempctl
0:63
b
0:63
sa tempctl
0:7
sb tempctl
8:15
sc tempctl
16:23
sd tempctl
24:31
se tempctl
32:39
sf tempctl
40:47
sg tempctl
48:55
sh tempctl
56:63
if (sa <
ui
16) then d
0:7
temp
sa*8:(sa*8)+7
else d
0:7
0x00
if (sb <
ui
16) then d
8:15
temp
sb*8:(sb*8)+7
else d
8:15
0x00
if (sc <
ui
16) then d
16:23
temp
sc*8:(sc*8)+7
else d
16:23
0x00
if (sd <
ui
16) then d
24:31
temp
sd*8:(sd*8)+7
else d
24:31
0x00
if (se <
ui
16) then d
32:39
temp
se*8:(se*8)+7
else d
32:39
0x00
if (sf <
ui
16) then d
40:47
temp
sf*8:(sf*8)+7
else d
40:47
0x00
if (sg <
ui
16) then d
48:55
temp
sg*8:(sg*8)+7
else d
48:55
0x00
if (sh <
ui
16) then d
56:63
temp
sh*8:(sh*8)+7
else d
56:63
0x00
The contents of parameters a and c are concatenated into a 128-bit source vector consisting of
bytes 0:15. The contents of parameter b are copied to a temporary select vector. For each byte in
the destination vector, a byte is selected from either the source vector or a constant of 0x00 by the
corresponding byte of the select vector. The selected byte values are placed into parameter d.
Figure 3-539. Vector Permute (form 3) (__ev_perm3)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evperm3 d,b,c
63
d 56
0 8 16 24 32 40 48 56
63
b 56
0 8 16 24 32 40 48 56
10 0C 05 03 0E 0A 14 01
63
a
0 8 16 24 32 40 48 56
7 6 5 4 3 2 1 0
63
c 56
0 8 16 24 32 40 48 56
F E D C B A 9 8
0x00 0x00
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-701
__ev_pksdshefrs __ev_pksdshefrs
Vector Pack Signed Doublewords to Signed Halfwords Even, Fractional, Round, and Saturate
d = __ev_pksdshefrs (a,b)
// h0
if (a
0:63
>=
si
0x0000_0000_7FFF_8000) | (a
0:63
<
si
0xFFFF_FFFF_7FFF_8000) then
ovh 1
temph
0:31
SATURATE(ovh, a
0
, 0x8000_0000, 0x7fff_0000, --------)
else
ovh 0
tempr
0:63
ROUND(a
0:63
,16)
temph
0:31
tempr
32:47
||
16
0
d
0:31
temph
0:31
// h2
if (b
0:63
>=
si
0x0000_0000_7FFF_8000) | (b
0:63
<
si
0xFFFF_FFFF_7FFF_8000) then
ovl 1
templ
0:31
SATURATE(ovl, b
0
, 0x8000_0000, 0x7fff_0000, --------)
else
ovl 0
tempr
0:63
ROUND(b
0:63
,16)
templ
0:31
tempr
32:47
||
16
0
d
32:63
templ
0:31
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The signed 64-bit guarded fractional elements of parameters a and b are rounded and saturated to
16 bits. The results in 1.15 fractional format are packed into the even halfwords of parameter d,
zeroing the odd halfwords. The original values are assumed to be in 33.31 fractional format as a result
of one or more guarded halfword fractional operations. Any saturation is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-540. Vector Pack Signed Doublewords to Signed Halfwords Even Fractional,
Round, and Saturate (__ev_pksdshefrs)
0 63
a
0 63
b
d
0 31 32 63 15 16 47 48
ROUND&SAT ROUND&SAT
31 32 33
.
31 32 33
.
SPE2 Programming Interface Manual, Rev. 1.0-2
3-702 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpksdshefrs d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-703
__ev_pksdswfrs __ev_pksdswfrs
Vector Pack Signed Doublewords to Signed Words Fractional, Round and Saturate
d = __ev_pksdswfrs (a,b)
// w0
if (a
0:63
>=
si
0x7FFF_FFFF_8000_0000) then
temph
0:63
0x7FFF_FFFF_0000_0000
ovh 1
else
temph
0:63
ROUND(a
0:63
,32)
ovh 0
d
0:31
temph
0:31
// w1
if (b
0:63
>=
si
0x7FFF_FFFF_8000_0000) then
templ
0:63
0x7FFF_FFFF_0000_0000
ovl 1
else
templ
0:63
ROUND(b
0:63
,32)
ovl 0
d
32:63
templ
0:31
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 64-bit fractional elements of parameters a and b are rounded and saturated to 32 bits
using the current rounding mode in SPEFSCR. The 32-bit results are packed into parameter d. Any
saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-541. Vector Pack Signed Doublewords to Signed Words Fractional, Round and
Saturate (__ev_pksdswfrs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpksdswfrs d,a,b
0 63
a
d
0 31 32 63
0 63
b
ROUND & SAT ROUND & SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-704 Freescale Semiconductor
SPE2 Operations
__ev_pksdsws __ev_pksdsws
Vector Pack Signed Doublewords to Signed Words and Saturate
d = __ev_pksdsws (a,b)
// w0
if ((a
0:63
<
si
0xFFFF_FFFF_8000_0000) | (a
0:63
>
si
0x0000_0000_7FFF_FFFF)) then ovh=1 else
ovh=0;
d
0:31
SATURATE(ovh, a
0
, 0x8000_0000, 0x7fff_ffff, a
32:63
)
// w1
if ((b
0:63
<
si
0xFFFF_FFFF_8000_0000) | (b
0:63
>
si
0x0000_0000_7FFF_FFFF)) then ovl=1 else
ovl=0;
d
32:63
SATURATE(ovl, b
0
, 0x8000_0000, 0x7fff_ffff, b
32:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 64-bit elements of parameters a and b are saturated to 32 bits. The 32-bit results are
packed into parameter d. Any saturation is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-542. Vector Pack Signed Doublewords to Signed Words and Saturate
(__ev_pksdsws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpksdsws d,a,b
0 63
a
d
0 31 32 63
0 63
b
SAT SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-705
__ev_pkshsbs __ev_pkshsbs
Vector Pack Signed Half Words to Signed Bytes and Saturate
d = __ev_pkshsbs (a,b)
// b0
if ((a
0:15
<
si
0xFF80) | (a
0:15
>
si
0x007F)) then ovb0=1 else ovb0=0;
d
0:7
SATURATE(ovb0, a
0
, 0x80, 0x7f, a
8:15
)
// b1
if ((a
16:31
<
si
0xFF80) | (a
16:31
>
si
0x007F)) then ovb1=1 else ovb1=0;
d
8:15
SATURATE(ovb1, a
16
, 0x80, 0x7f, a
24:31
)
// b2
if ((a
32:47
<
si
0xFF80) | (a
32:47
>
si
0x007F)) then ovb2=1 else ovb2=0;
d
16:23
SATURATE(ovb2, a
32
, 0x80, 0x7f, a
40:47
)
// b3
if ((a
48:63
<
si
0xFF80) | (a
48:63
>
si
0x007F)) then ovb3=1 else ovb3=0;
d
24:31
SATURATE(ovb3, a
48
, 0x80, 0x7f, a
56:63
)
// b4
if ((b
0:15
<
si
0xFF80) | (b
0:15
>
si
0x007F)) then ovb4=1 else ovb4=0;
d
32:39
SATURATE(ovb4, b
0
, 0x80, 0x7f, b
8:15
)
// b5
if ((b
16:31
<
si
0xFF80) | (b
16:31
>
si
0x007F)) then ovb5=1 else ovb5=0;
d
40:47
SATURATE(ovb5, b
16
, 0x80, 0x7f, b
24:31
)
// b6
if ((b
32:47
<
si
0xFF80) | (b
32:47
>
si
0x007F)) then ovb6=1 else ovb6=0;
d
48:55
SATURATE(ovb6, b
32
, 0x80, 0x7f, b
40:47
)
// b7
if ((b
48:63
<
si
0xFF80) | (b
48:63
>
si
0x007F)) then ovb7=1 else ovb7=0;
d
56:63
SATURATE(ovb7, b
48
, 0x80, 0x7f, b
56:63
)
ovh ovb0 | ovb1 | ovb2 | ovb3; ovl ovb4 | ovb5 | ovb6 | ovb7;
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl;
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl;
The signed 16-bit elements of parameters a and b are saturated to 8 bits. The 8-bit results are
packed into parameter d. Any saturation is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-543. Vector Pack Signed Half Words to Signed Bytes and Saturate
(__ev_pkshsbs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkshsbs d,a,b
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
63
b
0 16 32 48
Saturate
Transfers
SPE2 Programming Interface Manual, Rev. 1.0-2
3-706 Freescale Semiconductor
SPE2 Operations
__ev_pkshubs __ev_pkshubs
Vector Pack Signed Half Words to Unsigned Bytes and Saturate
d = __ev_pkshubs (a,b)
// b0
if ((a
0:15
<
si
0x0000) | (a
0:15
>
si
0x00FF)) then ovb0=1 else ovb0=0;
d
0:7
SATURATE(ovb0, a
0
, 0x00, 0xFF, a
8:15
)
// b1
if ((a
16:31
<
si
0x000) | (a
16:31
>
si
0x00FF)) then ovb1=1 else ovb1=0;
d
8:15
SATURATE(ovb1, a
16
, 0x00, 0xFF, a
24:31
)
// b2
if ((a
32:47
<
si
0x0000) | (a
32:47
>
si
0x00FF)) then ovb2=1 else ovb2=0;
d
16:23
SATURATE(ovb2, a
32
, 0x00, 0xFF, a
40:47
)
// b3
if ((a
48:63
<
si
0x0000) | (a
48:63
>
si
0x00FF)) then ovb3=1 else ovb3=0;
d
24:31
SATURATE(ovb3, a
48
, 0x00, 0xFF, a
56:63
)
// b4
if ((b
0:15
<
si
0x0000) | (b
0:15
>
si
0x00FF)) then ovb4=1 else ovb4=0;
d
32:39
SATURATE(ovb4, b
0
, 0x00, 0xFF, b
8:15
)
// b5
if ((b
16:31
<
si
0x0000) | (b
16:31
>
si
0x00FF)) then ovb5=1 else ovb5=0;
d
40:47
SATURATE(ovb5, b
16
, 0x00, 0xFF, b
24:31
)
// b6
if ((b
32:47
<
si
0x0000) | (b
32:47
>
si
0x00FF)) then ovb6=1 else ovb6=0;
d
48:55
SATURATE(ovb6, b
32
, 0x00, 0xFF, b
40:47
)
// b7
if ((b
48:63
<
si
0x0000) | (b
48:63
>
si
0x00FF)) then ovb7=1 else ovb7=0;
d
56:63
SATURATE(ovb7, b
48
, 0x00, 0xFF, b
56:63
)
ovh ovb0 | ovb1 | ovb2 | ovb3 ; ovl ovb4 | ovb5 | ovb6 | ovb7
SPEFSCR
OVH
ovh ; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh ; SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 16-bit elements of parameters a and b are saturated to 8-bit unsigned elements.
Negative elements saturate to 0. The 8-bit results are packed into parameter d. Any saturation is
recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-544. Vector Pack Signed Half Words to Unsigned Bytes and Saturate
(__ev_pkshubs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkshubs d,a,b
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
63
b
0 16 32 48
Saturate
Transfers
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-707
__ev_pkswgshefrs __ev_pkswgshefrs
Vector Pack Signed Words Guarded to Signed Halfwords Even Fractional Round and Saturate
d = __ev_pkswgshefrs (a,b)
// h0
if (a
0:63
>=
si
0x0000_7FFF_8000_0000) | (a
0:63
<
si
0xFFFF_7FFF_8000_0000) then
ovh 1
temph
0:31
SATURATE(ovh, a
0
, 0x8000_0000, 0x7fff_0000, --------)
else
ovh 0
tempr
0:63
ROUND(a
0:63
,32)
temph
0:31
tempr
16:31
||
16
0
d
0:31
temph
0:31
// h2
if (b
0:63
>=
si
0x0000_7FFF_8000_0000) | (b
0:63
<
si
0xFFFF_7FFF_8000_0000) then
ovl 1
templ
0:31
SATURATE(ovl, b
0
, 0x8000_0000, 0x7fff_0000, --------)
else
ovl 0
tempr
0:63
ROUND(b
0:63
,32)
templ
0:31
tempr
16:31
||
16
0
d
32:63
templ
0:31
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 64-bit elements of parameters a and b are rounded and saturated to 16 bits. The 16-bit
results in 1.15 fractional format are packed into the even halfwords of parameter d. The original
values are assumed to be in 17.47 fractional format as a result of one or more guarded word fractional
operations. Any saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-545. Vector Pack Signed Words Guarded to Signed Words Fractional Round and
Saturate (__ev_pkswgswfrs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkswgswfrs d,a,b
0 63
a
0 63
b
15 16 17
.
15 16 17
.
ROUND&SAT
d
0 31 32 63 15 16 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-708 Freescale Semiconductor
SPE2 Operations
__ev_pkswgswfrs __ev_pkswgswfrs
Vector Pack Signed Words Guarded to Signed Words Fractional Round and Saturate
d = __ev_pkswgswfrs (a,b)
// w0
if ((a
0:63
<
si
0xFFFF_7FFF_FFFF_8000) | (a
0:63
>=
si
0x0000_7FFF_FFFF_8000)) then
ovh=1
temph
0:31
SATURATE(ovh, a
0
, 0x8000_0000, 0x7fff_ffff, -----)
else
ovh=0
tempr
0:63
ROUND(a
0:63
, 16)
temph
0:31
tempr
16:47
d
0:31
temph
0:31
// w1
if ((b
0:63
<
si
0xFFFF_7FFF_FFFF_8000) | (b
0:63
>=
si
0x0000_7FFF_FFFF_8000)) then
ovl=1
templ
0:31
SATURATE(ovl, b
0
, 0x8000_0000, 0x7fff_ffff, -----)
else
ovl=0
tempr
0:63
ROUND(b
0:63
, 16)
templ
0:31
tempr
16:47
d
32:63
templ
0:31
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 64-bit elements of parameters a and b are saturated to 32 bits. The 32-bit results in 1.31
fractional format are packed into parameter d. The original values are assumed to be in 17.47 fractional
format as a result of one or more guarded word fractional operations. Any saturation is recorded in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-546. Vector Pack Signed Words Guarded to Signed Words Fractional Round and
Saturate (__ev_pkswgswfrs)
0 63
a
d
0 31 32 63
0 63
b
15 16 17
.
15 16 17
.
ROUND&SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-709
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkswgswfrs d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-710 Freescale Semiconductor
SPE2 Operations
__ev_pkswshfrs __ev_pkswshfrs
Vector Pack Signed Words to Signed Halfwords Fractional, Round and Saturate
d = __ev_pkswshfrs (a,b)
if (a
0:31
>=
si
0x7FFF_8000) then
ovh0 1; temph0
0:15
0x7FFF
else
ovh0 0; tempr
0:63
ROUND(EXTS
64
(a
0:31
),16); temph0
0:15
tempr
32:47
d
0:15
temph0
0:15
if (a
32:63
>=
si
0x7FFF_8000) then
ovh1 1; temph1
0:15
0x7FFF
else
ovh1 0; tempr
0:63
ROUND(EXTS
64
(a
32:63
),16); temph1
0:15
tempr
32:47
d
16:31
temph1
0:15
if (b
0:31
>=
si
0x7FFF_8000) then
ovh2 1; temph2
0:15
0x7FFF
else
ovh2 0; tempr
0:63
ROUND(EXTS
64
(b
0:31
),16); temph2
0:15
tempr
32:47
d
32:47
temph2
0:15
if (b
32:63
>=
si
0x7FFF_8000) then
ovh3 1; temph3
0:15
0x7FFF
else
ovh3 0; tempr
0:63
ROUND(EXTS
64
(b
32:63
),16); temph3
0:15
tempr
32:47
d
48:63
temph3
0:15
ovh ovh0 | ovh1; ovl ovh2 | ovh3
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The signed 32-bit fractional elements of parameters a and b are rounded and saturated to 16 bits
using the current rounding mode in SPEFSCR. The 16-bit results are packed into parameter d. Any
saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-547. Vector Pack Signed Words to Signed Halfwords Fractional, Round and
Saturate (__ev_pkswshfrs)
0 31 32 63
a
d
0 31 32 63 15 16 47 48
0 31 32 63
b
RND & SAT RND & SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-711
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkswshfrs d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-712 Freescale Semiconductor
SPE2 Operations
__ev_pkswshilvfrs __ev_pkswshilvfrs
Vector Pack Signed Words to Signed Halfwords Interleaved, Fractional, Round and Saturate
d = __ev_pkswshilvfrs (a,b)
if (a
0:31
>=
si
0x7FFF_8000) then
ovh0 1; temph0
0:15
0x7FFF
else
ovh0 0; tempr
0:63
ROUND(EXTS
64
(a
0:31
),16); temph0
0:15
tempr
32:47
d
0:15
temph0
0:15
if (a
32:63
>=
si
0x7FFF_8000) then
ovh2 1; temph2
0:15
0x7FFF
else
ovh2 0; tempr
0:63
ROUND(EXTS
64
(a
32:63
),16); temph2
0:15
tempr
32:47
d
32:47
temph2
0:15
if (b
0:31
>=
si
0x7FFF_8000) then
ovh1 1; temph2
0:15
0x7FFF
else
ovh1 0; tempr
0:63
ROUND(EXTS
64
(b
0:31
),16); temph1
0:15
tempr
32:47
d
16:31
temph1
0:15
if (b
32:63
>=
si
0x7FFF_8000) then
ovh3 1; temph3
0:15
0x7FFF
else
ovh3 0; tempr
0:63
ROUND(EXTS
64
(b
32:63
),16); temph3
0:15
tempr
32:47
d
48:63
temph3
0:15
ovh ovh0 | ovh1; ovl ovh2 | ovh3
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The signed 32-bit fractional elements of parameters a and b are rounded and saturated to 16 bits
using the current rounding mode in SPEFSCR. The 16-bit results are packed into parameter d with
interleaving. Any saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-548. Vector Pack Signed Words to Signed Halfwords Interleaved Fractional,
Round and Saturate (__ev_pkswshilvfrs)
0 31 32 63
a
d
0 31 32 63 15 16 47 48
0 31 32 63
b
RND & SAT RND & SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-713
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkswshilvfrs d,a,b
SPE2 Programming Interface Manual, Rev. 1.0-2
3-714 Freescale Semiconductor
SPE2 Operations
__ev_pkswshilvs __ev_pkswshilvs
Vector Pack Signed Words to Signed Halfwords Interleaved and Saturate
d = __ev_pkswshilvs (a,b)
// h0
if ((a
0:31
<
si
0xFFFF8000) | (a
0:31
>
si
0x00007FFF)) then ovh0=1 else ovh0=0;
d
0:15
SATURATE(ovh0, a
0
, 0x8000, 0x7fff, a
16:31
)
// h2
if ((a
32:63
<
si
0xFFFF8000) | (a
32:63
>
si
0x00007FFF)) then ovh2=1 else ovh2=0;
d
32:47
SATURATE(ovh2, a
32
, 0x8000, 0x7fff, a
48:63
)
// h1
if ((b
0:31
<
si
0xFFFF8000) | (b
0:31
>
si
0x00007FFF)) then ovh1 else ovh1=0;
d
16:23
SATURATE(ovh1, b
0
, 0x8000, 0x7fff, b
16:31
)
// h3
if ((b
32:63
<
si
0xFFFF8000) | (b
32:63
>
si
0x00007FFF)) then ovh3=1 else ovh3=0;
d
48:63
SATURATE(ovh3, b
32
, 0x8000, 0x7fff, b
48:63
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 32-bit elements of parameters a and b are saturated to 16 bits. The 16-bit results are
packed into parameter d with interleaving. Any saturation is recorded in the SPEFSCR overflow
and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-549. Vector Pack Signed Words to Signed Halfwords Interleaved and Saturate
(__ev_pkswshilvs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkswshilvs d,a,b
0 31 32 63
a
d
0 31 32 63 15 16 47 48
0 31 32 63
b
SAT SAT
SAT SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-715
__ev_pkswshs __ev_pkswshs
Vector Pack Signed Words to Signed Half Words and Saturate
d = __ev_pkswshs (a,b)
// h0
if ((a
0:31
<
si
0xFFFF8000) | (a
0:31
>
si
0x00007FFF)) then ovh0=1 else ovh0=0;
d
0:15
SATURATE(ovh0, a
0
, 0x8000, 0x7fff, a
16:31
)
// h1
if ((a
32:63
<
si
0xFFFF8000) | (a
32:63
>
si
0x00007FFF)) then ovh1=1 else ovh1=0;
d
16:31
SATURATE(ovh1, a
32
, 0x8000, 0x7fff, a
48:63
)
// h2
if ((b
0:31
<
si
0xFFFF8000) | (b
0:31
>
si
0x00007FFF)) then ovh21 else ovh2=0;
d
32:47
SATURATE(ovh2, b
0
, 0x8000, 0x7fff, b
16:31
)
// h3
if ((b
32:63
<
si
0xFFFF8000) | (b
32:63
>
si
0x00007FFF)) then ovh3=1 else ovh3=0;
d
48:63
SATURATE(ovh3, b
32
, 0x8000, 0x7fff, b
48:63
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 32-bit elements of parameters a and b are saturated to 16 bits. The 16-bit results are
packed into parameter d. Any saturation is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-550. Vector Pack Signed Words to Signed Half Words and Saturate
(__ev_pkswshs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkswshs d,a,b
0 31 32 63
a
d
0 31 32 63 15 16 47 48
0 31 32 63
b
SAT
SAT SAT SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-716 Freescale Semiconductor
SPE2 Operations
__ev_pkswuhs __ev_pkswuhs
Vector Pack Signed Words to Unsigned Half Words and Saturate
d = __ev_pkswuhs (a,b)
// h0
if ((a
0:31
<
si
0x00000000) | (a
0:31
>
si
0x0000FFFF)) then ovh0=1 else ovh0=0;
d
0:15
SATURATE(ovh0, a
0
, 0x0000, 0xFFFF, a
16:31
)
// h1
if ((a
32:63
<
si
0x00000000) | (a
32:63
>
si
0x0000FFFF)) then ovh1=1 else ovh1=0;
d
16:31
SATURATE(ovh1, a
32
, 0x0000, 0xFFFF, a
48:63
)
// h2
if ((b
0:31
<
si
0x00000000) | (b
0:31
>
si
0x0000FFFF)) then ovh21 else ovh2=0;
d
32:47
SATURATE(ovh2, b
0
, 0x0000, 0xFFFF, b
16:31
)
// h3
if ((b
32:63
<
si
0x00000000) | (b
32:63
>
si
0x0000FFFF)) then ovh3=1 else ovh3=0;
d
48:63
SATURATE(ovh3, b
32
, 0x0000, 0xFFFF, b
48:63
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 32-bit elements of parameters a and b are saturated to 16-bit unsigned elements.
Negative elements saturate to 0. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-551. Vector Pack Signed Words to Unsigned Half Words and Saturate
(__ev_pkswuhs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkswuhs d,a,b
0 31 32 63
a
d
0 31 32 63 15 16 47 48
0 31 32 63
b
SAT
SAT SAT SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-717
__ev_pkuduws __ev_pkuduws
Vector Pack Unsigned Doublewords to Unsigned Words and Saturate
d = __ev_pkuduws (a,b)
// w0
if (a
0:63
>
ui
0x0000_0000_FFFF_FFFF) then ovh=1 else ovh=0;
d
0:31
SATURATE(ovh, a
0
, 0xffff_ffff, 0xffff_ffff, a
32:63
)
// w1
if (b
0:63
>
ui
0x0000_0000_FFFF_FFFF) then ovl=1 else ovl=0;
d
32:63
SATURATE(ovl, b
0
, 0xffff_ffff, 0xffff_ffff, b
32:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 64-bit elements of parameters a and b are saturated to 32 bits. The 32-bit results are
packed into parameter d. Any saturation is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-552. Vector Pack Unsigned Doublewords to Unsigned Words and Saturate
(__ev_pkuduws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkuduws d,a,b
0 63
a
d
0 31 32 63
0 63
b
SAT SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-718 Freescale Semiconductor
SPE2 Operations
__ev_pkuhubs __ev_pkuhubs
Vector Pack Unsigned Half Words to Unsigned Bytes and Saturate
d = __ev_pkuhubs (a,b)
// b0
if (a
0:15
>
ui
0x00FF) then ovb0=1 else ovb0=0;
d
0:7
SATURATE(ovb0, 0, 0xFF, 0xFF, a
8:15
)
// b1
if (a
16:31
>
ui
0x00FF) then ovb1=1 else ovb1=0;
d
8:15
SATURATE(ovb1, 0, 0xFF, 0xFF, a
24:31
)
// b2
if (a
32:47
>
ui
0x00FF) then ovb2=1 else ovb2=0;
d
16:23
SATURATE(ovb2, 0, 0xFF, 0xFF, a
40:47
)
// b3
if (a
48:63
>
ui
0x00FF) then ovb3=1 else ovb3=0;
d
24:31
SATURATE(ovb3, 0, 0xFF, 0xFF, a
56:63
)
// b4
if (b
0:15
>
ui
0x00FF) then ovb4=1 else ovb4=0;
d
32:39
SATURATE(ovb4, 0, 0xFF, 0xFF, b
8:15
)
// b5
if (b
16:31
>
ui
0x00FF) then ovb5=1 else ovb5=0;
d
40:47
SATURATE(ovb5, 0, 0xFF, 0xFF, b
24:31
)
// b6
if (b
32:47
>
ui
0x00FF) then ovb6=1 else ovb6=0;
d
48:55
SATURATE(ovb6, 0, 0xFF, 0xFF, b
40:47
)
// b7
if (b
48:63
>
ui
0x00FF) then ovb7=1 else ovb7=0;
d
56:63
SATURATE(ovb7, 0, 0xFF, 0xFF, b
56:63
)
ovh ovb0 | ovb1 | ovb2 | ovb3 ; ovl ovb4 | ovb5 | ovb6 | ovb7
SPEFSCR
OVH
ovh ; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh ; SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 16-bit elements of parameters a and b are saturated to 8 bits. The 8-bit unsigned
results are packed into parameter d. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-553. Vector Pack Unsigned Half Words to Unsigned Bytes and Saturate
(__ev_pkuhubs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkuhubs d,a,b
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
63
b
0 16 32 48
Saturate
Transfers
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-719
__ev_pkuwuhs __ev_pkuwuhs
Vector Pack Unsigned Words to Unsigned Half Words and Saturate
d = __ev_pkuwuhs (a,b)
// h0
if (a
0:31
>
ui
0x0000FFFF) then ovh0=1 else ovh0=0;
d
0:15
SATURATE(ovh0, 0, 0xFFFF, 0xFFFF, a
16:31
)
// h1
if ((a
32:63
>
ui
0x0000FFFF) then ovh1=1 else ovh1=0;
d
16:31
SATURATE(ovh1, 0, 0xFFFF, 0xFFFF, a
48:63
)
// h2
if ((b
0:31
>
ui
0x0000FFFF) then ovh2=1 else ovh2=0;
d
32:47
SATURATE(ovh2, 0, 0xFFFF, 0xFFFF, b
16:31
)
// h3
if ((b
32:63
>
ui
0x0000FFFF) then ovh3=1 else ovh3=0;
d
48:63
SATURATE(ovh3, 0, 0xFFFF, 0xFFFF, b
48:63
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 32-bit elements of parameters a and b are saturated to 16 bits. The 16-bit unsigned
results are packed into parameter d. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-554. Vector Pack Unsigned Words to Unsigned Half Words and Saturate
(__ev_pkuwuhs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evpkuwuhs d,a,b
0 31 32 63
a
d
0 31 32 63 15 16 47 48
0 31 32 63
b
SAT
SAT SAT SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-720 Freescale Semiconductor
SPE2 Operations
__ev_popcntb __ev_popcntb
Vector Population Count Byte
d = __ev_popcntb (a)
d
0:7
POPCNT(a
0:7
)
d
8:15
POPCNT(a
8:15
)
d
16:23
POPCNT(a
16:23
)
d
24:31
POPCNT(a
24:31
)
d
32:39
POPCNT(a
32:39
)
d
40:47
POPCNT(a
40:47
)
d
48:55
POPCNT(a
48:55
)
d
56:63
POPCNT(a
56:63
)
The number of set bits in each byte element of parameter a are counted, and the results are placed
into the corresponding element of parameter d.
Figure 3-555. Vector Population Count Byte (__ev_popcntb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evpopcntb d,a
0 40 32 63
a
d
POPC POPC POPC POPC POPC POPC POPC POPC
8 16 24 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-721
__ev_rlb __ev_rlb
Vector Rotate Left Byte
d = __ev_rlb (a,b)
nb0 b
5:7
nb1 b
13:15
nb2 b
21:23
nb3 b
29:31
nb4 b
37:39
nb5 b
45:47
nb6 b
53:55
nb7 b
61:63
d
0:7
ROTL(a
0:7
, nb0)
d
8:15
ROTL(a
8:15
, nb1)
d
16:23
ROTL(a
16:23
, nb2)
d
24:31
ROTL(a
24:31
, nb3)
d
32:39
ROTL(a
32:39
, nb4)
d
40:47
ROTL(a
40:47
, nb5)
d
48:55
ROTL(a
48:55
, nb6)
d
56:63
ROTL(a
56:63
, nb7)
Each of the byte elements of parameter a are rotated left by an amount specified in the lower 3 bits
of the corresponding byte elements of parameter b. The result is placed into parameter d. The
separate rotate amounts for each element are specified by the lower 3 bits in each byte element of
parameter b that lie in bit positions 5:7, 13:15, 21:23, 29:31, 37:39, 45:47 53:55, and 61:63.
Figure 3-556. Vector Rotate Left Byte (__ev_rlb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evrlb d,a,b
Bytes rotate by
value specified in nb[x]
7 8 5 15 16 13 23 24 21 31 32 29 39 40 37 47 48 45 55 56 53 0 63 61
b
nb7 nb6 nb5 nb4 nb3 nb2 nb1 nb0
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
a
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-722 Freescale Semiconductor
SPE2 Operations
__ev_rlbi __ev_rlbi
Vector Rotate Left Byte Immediate
d = __ev_rlbi (a,b)
UIMM EXTZ(b)
n UIMM
d
0:7
ROTL(a
0:7
, n)
d
8:15
ROTL(a
8:15
, n)
d
16:23
ROTL(a
16:23
, n)
d
24:31
ROTL(a
24:31
, n)
d
32:39
ROTL(a
32:39
, n)
d
40:47
ROTL(a
40:47
, n)
d
48:55
ROTL(a
48:55
, n)
d
56:63
ROTL(a
56:63
, n)
Each of the byte elements of parameter a are rotated left by the UIMM value formed from
parameter b and the results are placed in parameter d.
NOTE
Values greater than 7 are illegal for the UIMM value contained in
parameter b.
Figure 3-557. Vector Rotate Left Byte Immediate (__ev_rlbi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evrlbi d,a,b
Bytes rotated by
value specified in UIMM
7 8 1516 23 24 3132 3940 4748 5556 0 63
a
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-723
__ev_rlh __ev_rlh
Vector Rotate Left Half Word
d = __ev_rlh (a,b)
nh0 b
12:15
nh1 b
28:31
nh2 b
44:47
nh3 b
60:63
d
0:15
ROTL(a
0:15
,nh0)
d
16:31
ROTL(a
16:31
,nh1)
d
32:47
ROTL(a
32:47
,nh2)
d
48:63
ROTL(a
48:63
,nh3)
Each of the half word elements of parameter a are rotated left by an amount specified in the low
order four bits or the corresponding half word elements of parameter b. The result is placed into
parameter d. The separate rotate amounts for each element are specified by 4 bits in parameter b
that lie in bit positions 12-15, 28-31, 44-47 and 60-63.
Figure 3-558. Vector Rotate Left Half Word (__ev_rlh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evrlh d,a,b
b
d
0 31 32 63
a
15 16 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-724 Freescale Semiconductor
SPE2 Operations
__ev_rlhi __ev_rlhi
Vector Rotate Left Half Word Immediate
d = __ev_rlhi (a,b)
UIMM EXTZ(b)
n UIMM
d
0:15
ROTL(a
0:15
,n)
d
16:31
ROTL(a
16:31
,n)
d
32:47
ROTL(a
32:47
,n)
d
48:63
ROTL(a
48:63
,n)
Each of the half word elements of parameter a are rotated left by the immediate value specified in
UIMM and formed from parameter b. The result is placed into parameter d.
NOTE
Values greater than 15 are illegal for the UIMM value contained in
parameter b.
Figure 3-559. Vector Rotate Left Half Word Immediate (evrlhi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evrlhi d,a,b
UIMM
d
0 31 32 63
a
15 16 47 48
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-725
__ev_rlw __ev_rlw
Vector Rotate Left Word
d = __ev_rlw (a,b)
nh b
27:31
nl b
59:63
d
0:31
ROTL(a
0:31
, nh)
d
32:63
ROTL(a
32:63
, nl)
Each of the high and low elements of parameter a is rotated left by an amount specified in
parameter b. The result is placed into parameter d. Rotate values for each element of parameter a
are found in bit positions b[2731] and b[5963].
Figure 3-560. Vector Rotate Left Word (__ev_rlw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evrlw d,a,b
0 31 32 63
a
b
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-726 Freescale Semiconductor
SPE2 Operations
__ev_rlwi __ev_rlwi
Vector Rotate Left Word Immediate
d = __ev_rlwi (a,b)
n UIMM
d
0:31
ROTL(a
0:31
, n)
d
32:63
ROTL(a
32:63
, n)
Both the high and low elements of parameter a are rotated left by an amount specified by a 5-bit
immediate value contained in parameter b. The results are placed in parameter d.
Figure 3-561. Vector Rotate Left Word Immediate (__ev_rlwi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evrlwi d,a,b
0 31 32 63
a
UIMM
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-727
__ev_rnddnw __ev_rnddnw
Vector Round Doubleword to Nearest Even Word
d = __ev_rnddnw (a)
if (a
31:63
=
1
0 || 0x8000_0000) then temp
0:32
a
0:32
// check for even 0.5
else temp
0:32
a
0:32
+ 1 // modulo sum
d
0:63
temp
0:31
||
32
0
The 64-bit value in parameter a is rounded into 32 bits using round_to_nearest_even rounding. The
result is placed into the most significant 32 bits of parameter d, zeroing out the low order 32 bits of
parameter d.
Figure 3-562. Vector Round Doubleword to Nearest Even Word (__ev_rnddnw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrnddnw d,a
0 63
a
d
0 31 32 63
ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-728 Freescale Semiconductor
SPE2 Operations
__ev_rnddnwss __ev_rnddnwss
Vector Round Doubleword to Nearest Even Word Signed and Saturate
d = __ev_rnddnwss (a)
if (a
0:63
>=
si
0x7FFF_FFFF_8000_0000) then
temp
0:32
0x7FFF_FFFF ||
1
0
ov 1
else
ov 0
if (a
31:63
=
1
0 || 0x8000_0000) then temp
0:32
a
0:32
// check for even 0.5
else temp
0:32
a
0:32
+ 1 // modulo sum
d
0:63
temp
0:31
||
32
0
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The signed 64-bit value in parameter a is rounded with saturation into 32 bits using
round_to_nearest_even rounding. The 32-bit result is placed into the most significant word
element of parameter d, zeroing out the low word element. The overflow and summary overflow bits
are recorded in the SPEFSCR based on an overflow from the round operation.
Other registers altered: SPEFSCR
Figure 3-563. Vector Round Doubleword to Nearest Even Word Signed and Saturate
(__ev_rnddnwss)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrnddnwss d,a
0 63
a
d
0 31 32 63
ROUND & SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-729
__ev_rnddnwus __ev_rnddnwus
Vector Round Doubleword to Nearest Even Word Unsigned and Saturate
d = __ev_rnddnwus (a)
if (a
0:63
>=
ui
0xFFFF_FFFF_8000_0000) then
temp
0:32
0xFFFF_FFFF ||
1
0
ov 1
else
ov 0
if (a
31:63
=
1
0 || 0x8000_0000) then temp
0:32
a
0:32
// check for even 0.5
else temp
0:32
a
0:32
+ 1 // modulo sum
d
0:63
temp
0:31
||
32
0
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The unsigned 64-bit value in parameter a is rounded with saturation into 32 bits using
round_to_nearest_even rounding. The 32-bit result is placed into the most significant word
element of parameter d, zeroing out the low word element. The overflow and summary overflow bits
are recorded in the SPEFSCR based on an overflow from the round operation.
Other registers altered: SPEFSCR
Figure 3-564. Vector Round Doubleword to Nearest Even Word Unsigned and Saturate
(__ev_rnddnwus)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrnddnwus d,a
0 63
a
d
0 31 32 63
ROUND & SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-730 Freescale Semiconductor
SPE2 Operations
__ev_rnddw __ev_rnddw
Vector Round Doubleword to Word
d = __ev_rnddw (a)
d
0:63
(a
0:63
+0x0000_0000_8000_0000) & 0xFFFF_FFFF_0000_0000 // Modulo sum
The 64-bit value in parameter a is rounded into 32 bits. The result is placed into the most
significant 32 bits of parameter d, zeroing out the low order 32 bits of parameter d.
Figure 3-565. Vector Round Doubleword to Word (__ev_rnddw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrnddw d,a
0 63
a
d
0 31 32 63
ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-731
__ev_rnddwss __ev_rnddwss
Vector Round Doubleword to Word Signed and Saturate
d = __ev_rnddwss (a)
if (a
0:63
>=
si
0x7FFF_FFFF_8000_0000) then
temp
0:32
0x7FFF_FFFF ||
1
0
ov 1
else
temp
0:32
(a
0:32
+ 1)
ov 0
d
0:63
temp
0:31
||
32
0
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The signed 64-bit value in parameter a is rounded with saturation into 32 bits. The 32-bit result is
placed into the most significant word element of parameter d, zeroing out the low word element. The
overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow from the
round operation.
Other registers altered: SPEFSCR
Figure 3-566. Vector Round Doubleword to Word Signed and Saturate (__ev_rnddwss)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrnddwss d,a
0 63
a
d
0 31 32 63
ROUND & SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-732 Freescale Semiconductor
SPE2 Operations
__ev_rnddwus __ev_rnddwus
Vector Round Doubleword to Word Unsigned and Saturate
d = __ev_rnddwus (a)
if (a
0:63
>=
ui
0xFFFF_FFFF_8000_0000) then
temp
0:32
0xFFFF_FFFF ||
1
0
ov 1
else
temp
0:32
(a
0:32
+ 1)
ov 0
d
0:63
temp
0:31
||
32
0
// update SPEFSCR
SPEFSCR
OVH
0; SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The unsigned 64-bit value in parameter a is rounded with saturation into 32 bits. The 32-bit result
is placed into the most significant word element of parameter d, zeroing out the low word element.
The overflow and summary overflow bits are recorded in the SPEFSCR based on an overflow from
the round operation.
Other registers altered: SPEFSCR
Figure 3-567. Vector Round Doubleword to Word Unsigned and Saturate (__ev_rnddwus)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrnddwus d,a
0 63
a
d
0 31 32 63
ROUND & SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-733
__ev_rndhb __ev_rndhb
Vector Round Halfword to Byte
d = __ev_rndhb (a)
d
0:15
(a
0:15
+0x0080) & 0xFF00
d
16:31
(a
16:31
+0x0080) & 0xFF00
d
32:47
(a
32:47
+0x0080) & 0xFF00
d
48:63
(a
48:63
+0x0080) & 0xFF00
The 16-bit elements of parameter a are rounded into 8 bits. The 8-bit results are placed in the most
significant 8 bits of each halfword element of parameter d, zeroing out the low order 8 bits of each
halfword element.
Figure 3-568. Vector Round Halfword to Byte (__ev_rndhb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndhb d,a
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
ROUND ROUND ROUND ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-734 Freescale Semiconductor
SPE2 Operations
__ev_rndhbss __ev_rndhbss
Vector Round Halfword to Byte Signed and Saturate
d = __ev_rndhbss (a)
if (a
0:15
>=
si
0x7F80) then
temp0
0:15
= 0x7F00; ovh0 1
else
temp0
0:15
= (a
0:15
+0x0080); ovh0 0
d
0:15
temp0
0:15
& 0xFF00
if (a
16:31
>=
si
0x7F80) then
temp1
0:15
= 0x7F00; ovh1 1
else
temp1
0:15
= (a
16:31
+0x0080); ovh1 0
d
16:31
temp1
0:15
& 0xFF00
if (a
32:47
>=
si
0x7F80) then
temp2
0:15
= 0x7F00; ovh2 1
else
temp2
0:15
= (a
32:47
+0x0080); ovh2 0
d
32:47
temp2
0:15
& 0xFF00
if (a
48:63
>=
si
0x7F80) then
temp3
0:15
= 0x7F00; ovh3 1
else
temp3
0:15
= (a
48:63
+0x0080); ovh3 0
d
48:63
temp3
0:15
& 0xFF00
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 16-bit elements of parameter a are rounded with saturation into 8 bits. The 8-bit results
are placed in the most significant 8 bits of each halfword element of parameter d, zeroing out the
low order 8 bits of each halfword element. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-569. Vector Round Halfword to Byte Signed and Saturate (__ev_rndhbss)
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
ROUND&SAT ROUND&SAT ROUND&SAT ROUND&SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-735
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndhbss d,a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-736 Freescale Semiconductor
SPE2 Operations
__ev_rndhbus __ev_rndhbus
Vector Round Half Word to Byte Unsigned and Saturate
d = __ev_rndhbus (a)
if (a
0:15
>=
ui
0xFF80) then
temp0
0:15
= 0xFF00; ovh0 1
else
temp0
0:15
= (a
0:15
+0x0080); ovh0 0
d
0:15
temp0
0:15
& 0xFF00
if (a
16:31
>=
ui
0xFF80) then
temp1
0:15
= 0xFF00; ovh1 1
else
temp1
0:15
= (a
16:31
+0x0080); ovh1 0
d
16:31
temp1
0:15
& 0xFF00
if (a
32:47
>=
ui
0xFF80) then
temp2
0:15
= 0xFF00; ovh02 1
else
temp2
0:15
= (a
32:47
+0x0080); ovh2 0
d
32:47
temp2
0:15
& 0xFF00
if (a
48:63
>=
ui
0xFF80) then
temp3
0:15
= 0xFF00; ovh3 1
else
temp3
0:15
= (a
48:63
+0x0080); ovh3 0
d
48:63
temp3
0:15
& 0xFF00
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The unsigned 16-bit elements of parameter a are rounded with saturation into 8 bits. The 8-bit
results are placed in the most significant 8 bits of each halfword element of parameter d, zeroing
out the low order 8 bits of each halfword element. Any saturation is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-570. Vector Round Half Word to Byte Unsigned and Saturate (__ev_rndhbus)
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
ROUND&SAT ROUND&SAT ROUND&SAT ROUND&SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-737
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndhbus d,a
SPE2 Programming Interface Manual, Rev. 1.0-2
3-738 Freescale Semiconductor
SPE2 Operations
__ev_rndhnb __ev_rndhnb
Vector Round Halfword to Nearest Even Byte
d = __ev_rndhnb (a)
if (a
7:15
=
1
0 || 0x80) then temp
0:8
a
0:8
// check for even 0.5
else temp
0:8
a
0:8
+ 1 // modulo sum
d
0:15
temp
0:7
||
8
0
if (a
23:31
=
1
0 || 0x80) then temp
0:8
a
16:24
// check for even 0.5
else temp
0:8
a
16:24
+ 1 // modulo sum
d
16:31
temp
0:7
||
8
0
if (a
39:47
=
1
0 || 0x80) then temp
0:8
a
32:40
// check for even 0.5
else temp
0:8
a
32:40
+ 1 // modulo sum
d
32:47
temp
0:7
||
8
0
if (a
55:63
=
1
0 || 0x80) then temp
0:8
a
48:56
// check for even 0.5
else temp
0:8
a
48:56
+ 1 // modulo sum
d
48:63
temp
0:7
||
8
0
The 16-bit elements of parameter a are rounded into 8 bits using round_to_nearest_even rounding.
The 8-bit results are placed in the most significant 8 bits of each halfword element of parameter d,
zeroing out the low order 8 bits of each halfword element.
Figure 3-571. Vector Round Halfword to Nearest Even Byte (__ev_rndhnb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndhnb d,a
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
ROUND ROUND ROUND ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-739
__ev_rndhnbss __ev_rndhnbss
Vector Round Halfword to Nearest Even Byte Signed and Saturate
d = __ev_rndhnbss (a)
if (a
0:15
>=
si
0x7F80) then temp
0:8
= (0x7F ||
1
0); ovh0 1
else
ovh0 0
if (a
7:15
=
1
0 || 0x80) then temp
0:8
a
0:8
// check for even 0.5
else temp
0:8
a
0:8
+ 1 // modulo sum
d
0:15
temp
0:7
||
8
0
if (a
16:31
>=
si
0x7F80) then temp
0:8
= (0x7F ||
1
0); ovh1 1
else
ovh1 0
if (a
23:31
=
1
0 || 0x80) then temp
0:8
a
16:24
// check for even 0.5
else temp
0:8
a
16:24
+ 1 // modulo sum
d
16:31
temp
0:7
||
8
0
if (a
32:47
>=
si
0x7F80) then temp
0:8
= (0x7F ||
1
0); ovh2 1
else
ovh2 0
if (a
39:47
=
1
0 || 0x80) then temp
0:8
a
32:40
// check for even 0.5
else temp
0:8
a
32:40
+ 1 // modulo sum
d
32:47
temp
0:7
||
8
0
if (a
48:63
>=
si
0x7F80) then temp
0:8
= (0x7F ||
1
0); ovh3 1
else
ovh3 0
if (a
55:63
=
1
0 || 0x80) then temp
0:8
a
48:56
// check for even 0.5
else temp
0:8
a
48:56
+ 1 // modulo sum
d
48:63
temp
0:7
||
8
0
ovh ovh0 | ovh1; ovl ovh2 | ovh3
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The signed 16-bit elements of parameter a are rounded with saturation into 8 bits using
round_to_nearest_even rounding. The 8-bit results are placed in the most significant 8 bits of each
halfword element of parameter d, zeroing out the low order 8 bits of each halfword element. Any
saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-572. Vector Round Halfword to Nearest Even Byte Signed and Saturate
(__ev_rndhnbss)
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
ROUND&SAT ROUND&SAT ROUND&SAT ROUND&SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-740 Freescale Semiconductor
SPE2 Operations
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndhnbss d,a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-741
__ev_rndhnbus __ev_rndhnbus
Vector Round Halfword to Nearest Byte Unsigned and Saturate
d = __ev_rndhnbus (a)
if (a
0:15
>=
ui
0xFF80) then temp
0:8
= (0xFF ||
1
0); ovh0 1
else
ovh0 0
if (a
7:15
=
1
0 || 0x80) then temp
0:8
a
0:8
// check for even 0.5
else temp
0:8
a
0:8
+ 1 // modulo sum
d
0:15
temp
0:7
||
8
0
if (a
16:31
>=
ui
0xFF80) then temp
0:8
= (0xFF ||
1
0); ovh1 1
else
ovh1 0
if (a
23:31
=
1
0 || 0x80) then temp
0:8
a
16:24
// check for even 0.5
else temp
0:8
a
16:24
+ 1 // modulo sum
d
16:31
temp
0:7
||
8
0
if (a
32:47
>=
ui
0xFF80) then temp
0:8
= (0xFF ||
1
0); ovh2 1
else
ovh2 0
if (a
39:47
=
1
0 || 0x80) then temp
0:8
a
32:40
// check for even 0.5
else temp
0:8
a
32:40
+ 1 // modulo sum
d
32:47
temp
0:7
||
8
0
if (a
48:63
>=
ui
0xFF80) then temp
0:8
= (0xFF ||
1
0); ovh3 1
else
ovh3 0
if (a
55:63
=
1
0 || 0x80) then temp
0:8
a
48:56
// check for even 0.5
else temp
0:8
a
48:56
+ 1 // modulo sum
d
48:63
temp
0:7
||
8
0
ovh ovh0 | ovh1; ovl ovh2 | ovh3
SPEFSCR
OVH
ovh; SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh; SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The unsigned 16-bit elements of parameter a are rounded with saturation into 8 bits using
round_to_nearest_even rounding. The 8-bit results are placed in the most significant 8 bits of each
halfword element of parameter d, zeroing out the low order 8 bits of each halfword element. Any
saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-573. Vector Round Halfword to Nearest Even Byte Unsigned and Saturate
(__ev_rndhnbus)
63
d 56
0 8 16 24 32 40 48 56
63
a
0 16 32 48
ROUND&SAT ROUND&SAT ROUND&SAT ROUND&SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-742 Freescale Semiconductor
SPE2 Operations
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndhnbus d,a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-743
__ev_rndwh __ev_rndwh
Vector Round Word to Halfword
d = __ev_rndw (a)
d = __ev_rndwh (a)
d
0:31
(a
0:31
+ 0x0000_8000) & 0xFFFF_0000 // Modulo sum
d
32:63
(a
32:63
+ 0x0000_8000) & 0xFFFF_0000 // Modulo sum
The 32-bit elements of parameter a are rounded into 16 bits. The result is placed into parameter d.
The resulting 16 bits are placed in the most significant 16 bits of each element of parameter d,
zeroing out the low order 16 bits of each element.
Note: __ev_rndw and __ev_rndwh are the same instruction
Figure 3-574. Vector Round Word to Halfword (__ev_rndwh(__ev_rndw))
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrnd d,a
__ev64_opaque__ __ev64_opaque__ evrndwh d,a
0 31 32 63
a
d
0 31 32 63 15 16 47 48
ROUND ROUND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-744 Freescale Semiconductor
SPE2 Operations
__ev_rndwhss __ev_rndwhss
Vector Round Word to Halfword Signed and Saturate
d = __ev_rndwhss (a)
if (a
0:31
>=
si
0x7FFF_8000) then
temph
0:31
= 0x7FFF_0000; ovh 1
else
temph
0:31
= (a
0:31
+ 0x0000_8000); ovh 0
d
0:31
temph
0:31
& 0xFFFF_0000
if (a
32:63
>=
si
0x7FFF_8000) then
templ
0:31
= 0x7FFF_0000; ovl 1
else
templ
0:31
= (a
32:63
+ 0x0000_8000); ovl 0
d
32:63
templ
0:31
& 0xFFFF_0000
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The signed 32-bit elements of parameter a are rounded with saturation into 16 bits. The 16-bit
results are placed in the most significant 16 bits of each word element of parameter d, zeroing out
the low order 16 bits of each word element. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-575. Vector Round Word to Halfword Signed and Saturate (__ev_rndwhss)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndwhss d,a
0 31 32 63
a
d
0 31 32 63 15 16 47 48
ROUND&SAT ROUND&SAT
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-745
__ev_rndwhus __ev_rndwhus
Vector Round Word to Halfword Unsigned and Saturate
d = __ev_rndwhus (a)
if (a
0:31
>=
ui
0xFFFF_8000) then
temph
0:31
= 0xFFFF_0000; ovh 1
else
temph
0:31
= (a
0:31
+ 0x0000_8000); ovh 0
d
0:31
temph
0:31
& 0xFFFF_0000
if (a
32:63
>=
ui
0xFFFF_8000) then
templ
0:31
= 0xFFFF_0000; ovl 1
else
templ
0:31
= (a
32:63
+ 0x0000_8000); ovl 0
d
32:63
templ
0:31
& 0xFFFF_0000
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The unsigned 32-bit elements of parameter a are rounded with saturation into 16 bits. The 16-bit
results are placed in the most significant 16 bits of each word element of parameter d, zeroing out
the low order 16 bits of each word element. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-576. Vector Round Word to Halfword Unsigned and Saturate (__ev_rndwhus)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndwhus d,a
0 31 32 63
a
d
0 31 32 63 15 16 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-746 Freescale Semiconductor
SPE2 Operations
__ev_rndwnh __ev_rndwnh
Vector Round Word to Nearest Even Halfword
d = __ev_rndwnh (a)
if (a
15:31
=
1
0 || 0x8000) then temp
0:16
a
0:16
// check for even 0.5
else temp
0:16
a
0:16
+ 1 // modulo sum
d
0:31
temp
0:15
||
16
0
if (a
47:63
=
1
0 || 0x8000) then temp
0:16
a
32:48
// check for even 0.5
else temp
0:16
a
32:48
+ 1 // modulo sum
d
32:63
temp
0:15
||
16
0
The 32-bit elements of parameter a are rounded into 16 bits using round_to_nearest_even
rounding. The resulting 16 bits are placed in the most significant 16 bits of each element of
parameter d, zeroing out the low order 16 bits of each element.
Figure 3-577. Vector Round Word to Nearest Even Halfword (__ev_rndwnh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndwnh d,a
0 31 32 63
a
d
0 31 32 63 15 16 47 48
ROUND ROUND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-747
__ev_rndwnhss __ev_rndwnhss
Vector Round Word to Nearest Even Halfword Signed and Saturate
d = __ev_rndwnhss (a)
if (a
0:31
>=
si
0x7FFF_8000) then
temph
0:16
= 0x7FFF ||
1
0; ovh 1
else
ovh 0
if (a
15:31
=
1
0 || 0x8000) then temph
0:16
a
0:16
// check for even 0.5
else temph
0:16
a
0:16
+ 1 // modulo sum
d
0:31
temph
0:15
||
16
0
if (a
32:63
>=
si
0x7FFF_8000) then
templ
0:16
= 0x7FFF ||
1
0; ovl 1
else
ovl 0
if (a
47:63
=
1
0 || 0x8000) then templ
0:16
a
32:48
// check for even 0.5
else templ
0:16
a
32:48
+ 1 // modulo sum
d
32:63
templ
0:15
||
16
0
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The signed 32-bit elements of parameter a are rounded with saturation into 16 bits using
round_to_nearest_even rounding. The 16-bit results are placed in the most significant 16 bits of
each word element of parameter d, zeroing out the low order 16 bits of each word element. Any
saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-578. Vector Round Word to Nearest Even Halfword Signed and Saturate
(__ev_rndwnhss)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndwnhss d,a
0 31 32 63
a
d
0 31 32 63 15 16 47 48
ROUND&SAT ROUND&SAT
SPE2 Programming Interface Manual, Rev. 1.0-2
3-748 Freescale Semiconductor
SPE2 Operations
__ev_rndwnhus __ev_rndwnhus
Vector Round Word to Nearest Even Halfword Unsigned and Saturate
d = __ev_rndwnhus (a)
if (a
0:31
>=
ui
0xFFFF_8000) then
temph
0:16
= 0xFFFF ||
1
0; ovh 1
else
ovh 0
if (a
15:31
=
1
0 || 0x8000) then temph
0:16
a
0:16
// check for even 0.5
else temph
0:16
a
0:16
+ 1 // modulo sum
d
0:31
temph
0:15
||
16
0
if (a
32:63
>=
ui
0xFFFF_8000) then
templ
0:16
= 0xFFFF ||
1
0; ovl 1
else
ovl 0
if (a
47:63
=
1
0 || 0x8000) then templ
0:16
a
32:48
// check for even 0.5
else templ
0:16
a
32:48
+ 1 // modulo sum
d
32:63
templ
0:15
||
16
0
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The unsigned 32-bit elements of parameter a are rounded with saturation into 16 bits. The 16-bit
results are placed in the most significant 16 bits of each word element of parameter d, zeroing out
the low order 16 bits of each word element. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-579. Vector Round Word to Nearest Even Halfword Unsigned and Saturate
(__ev_rndwnhus)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evrndwnhus d,a
0 31 32 63
a
d
0 31 32 63 15 16 47 48
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-749
__ev_sad2sh[a] __ev_sad2sh[a]
Vector Sum of Absolute Differences of 2 Signed Half Words (to Accumulator)
d = __ev_sad2sh (a,b) (A = 0)
d = __ev_sad2sha (a,b) (A = 1)
temp0
0:15
ABS(b
0:15
- a
0:15
)
temp1
0:15
ABS(b
16:31
- a
16:31
)
temp2
0:15
ABS(b
32:47
- a
32:47
)
temp3
0:15
ABS(b
48:63
- a
48:63
)
d
0:31
EXTZ(temp0
0:15
) + EXTZ(temp1
0:15
)
d
32:63
EXTZ(temp2
0:15
) + EXTZ(temp3
0:15
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of two signed half word elements of parmeter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 17-bit differences from each group are summed
together, and the results are placed into the word elements of parameter d. If A = 1, the result in
parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-580. Vector Sum of Absolute Differences of 2 Signed Half Words (to
Accumulator) (__ev_sad2sh[a])
0 32 63
b
a
16 48
-
0 63
d (and Accumulator if A=1)
- - -
ABS ABS ABS ABS
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-750 Freescale Semiconductor
SPE2 Operations
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad2sh d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad2sha d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-751
__ev_sad2shaaw __ev_sad2shaaw
Vector Sum of Absolute Differences of 2 Signed Half Words and Accumulate into Words
d = __ev_sad2shaaw (a,b)
temp0
0:15
ABS(b
0:15
- a
0:15
)
temp1
0:15
ABS(b
16:31
- a
16:31
)
temp2
0:15
ABS(b
32:47
- a
32:47
)
temp3
0:15
ABS(b
48:63
- a
48:63
)
d
0:31
ACC
0:31
+ EXTZ(temp0
0:15
) + EXTZ(temp1
0:15
)
d
32:63
ACC
32:63
+ EXTZ(temp2
0:15
) + EXTZ(temp3
0:15
)
// update accumulator
ACC
0:63
d
0:63
Groups of two signed half word elements of parameter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 17-bit differences from each group are summed
together, then added to the corresponding word elements in the accumulator, and the results are
placed into the word elements of parameter d and the accumulator.
Other registers altered: ACC
Figure 3-581. Vector Sum of Absolute Differences of 2 Signed Half Words and Accumulate
(__ev_sad2shaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad2shaaw d,a,b
+
SUM
+
SUM
0 32 63
b
a
16 48
-
0 63
d and Accumulator
- - -
ABS ABS ABS ABS
0 63
Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-752 Freescale Semiconductor
SPE2 Operations
__ev_sad2uh[a] __ev_sad2uh[a]
Vector Sum of Absolute Differences of 2 Unsigned Half Words (to Accumulator)
d = __ev_sad2uh (a,b) (A = 0)
d = __ev_sad2uha (a,b) (A = 1)
temp0
0:15
ABS(b
0:15
- a
0:15
)
temp1
0:15
ABS(b
16:31
- a
16:31
)
temp2
0:15
ABS(b
32:47
- a
32:47
)
temp3
0:15
ABS(b
48:63
- a
48:63
)
d
0:31
EXTZ(temp0
0:15
) + EXTZ(temp1
0:15
)
d
32:63
EXTZ(temp2
0:15
) + EXTZ(temp3
0:15
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of two unsigned half word elements of parameter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 16-bit differences from each group are summed
together, and the results are placed into the word elements of parameter d. If A = 1, the result in
parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-582. Vector Sum of Absolute Differences of 2 Unsigned Half Words (to
Accumulator) (__ev_sad2uh[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad2uh d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad2uha d,a,b
0 32 63
b
a
16 48
-
0 63
d (and Accumulator if A=1)
- - -
ABS ABS ABS ABS
SUM SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-753
__ev_sad2uhaaw __ev_sad2uhaaw
Vector Sum of Absolute Differences of 2 Unsigned Half Words and Accumulate into Words
d = __ev_sad2uhaaw (a,b)
temp0
0:15
ABS(b
0:15
- a
0:15
)
temp1
0:15
ABS(b
16:31
- a
16:31
)
temp2
0:15
ABS(b
32:47
- a
32:47
)
temp3
0:15
ABS(b
48:63
- a
48:63
)
d
0:31
ACC
0:31
+ EXTZ(temp0
0:15
) + EXTZ(temp1
0:15
)
d
32:63
ACC
32:63
+ EXTZ(temp2
0:15
) + EXTZ(temp3
0:15
)
// update accumulator
ACC
0:63
d
0:63
Groups of two unsigned half word elements of parameter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 16-bit differences from each group are summed
together, then added to the corresponding word elements in the accumulator, and the results are
placed into the word elements of parameter d and the accumulator.
Other registers altered: ACC
Figure 3-583. Vector Sum of Absolute Differences of 2 Unsigned Half Words and
Accumulate (__ev_sad2uhaaw)
+
SUM
+
SUM
0 32 63
b
a
16 48
-
0 63
d and Accumulator
- - -
ABS ABS ABS ABS
0 63
Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-754 Freescale Semiconductor
SPE2 Operations
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad2uhaaw d,a,b
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-755
__ev_sad4sb[a] __ev_sad4sb[a]
Vector Sum of Absolute Differences of 4 Signed Bytes (to Accumulator)
d = __ev_sad4sb (a,b) (A = 0)
d = __ev_sad4sba (a,b) (A = 1)
temp0
0:7
ABS(b
0:7
- a
0:7
)
temp1
0:7
ABS(b
8:15
- a
8:15
)
temp2
0:7
ABS(b
16:23
- a
16:23
)
temp3
0:7
ABS(b
24:31
- a
24:31
)
temp4
0:7
ABS(b
32:39
- a
32:39
)
temp5
0:7
ABS(b
40:47
- a
40:47
)
temp6
0:7
ABS(b
48:55
- a
48:55
)
temp7
0:7
ABS(b
56:63
- a
56:63
)
d
0:31
EXTZ(temp0
0:7
) + EXTZ(temp1
0:7
) + EXTZ(temp2
0:7
) + EXTZ(temp3
0:7
)
d
32:63
EXTZ(temp4
0:7
) + EXTZ(temp5
0:7
) + EXTZ(temp6
0:7
) + EXTZ(temp7
0:7
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of four signed byte elements of parameter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 8-bit differences from each group are summed
together, and the results are placed into the word elements of parameter d. If A = 1, the result in
parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-584. Vector Sum of Absolute Differences of 4 Signed Bytes (to Accumulator
(__ev_sad4sb[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad4sb d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad4sba d,a,b
0 32 63
b
a
24 16 8 40 48 56
-
0 63
d (and Accumulator if A=1)
- - - - - - -
ABS ABS ABS ABS ABS ABS ABS ABS
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-756 Freescale Semiconductor
SPE2 Operations
__ev_sad4sbaaw __ev_sad4sbaaw
Vector Sum of Absolute Differences of 4 Signed Bytes and Accumulate into Words
d = __ev_sad4sbaaw (a,b)
temp0
0:7
ABS(b
0:7
- a
0:7
)
temp1
0:7
ABS(b
8:15
- a
8:15
)
temp2
0:7
ABS(b
16:23
- a
16:23
)
temp3
0:7
ABS(b
24:31
- a
24:31
)
temp4
0:7
ABS(b
32:39
- a
32:39
)
temp5
0:7
ABS(b
40:47
- a
40:47
)
temp6
0:7
ABS(b
48:55
- a
48:55
)
temp7
0:7
ABS(b
56:63
- a
56:63
)
d
0:31
ACC
0:31
+ EXTZ(temp0
0:7
) + EXTZ(temp1
0:7
) + EXTZ(temp2
0:7
) + EXTZ(temp3
0:7
)
d
32:63
ACC
32:63
+ EXTZ(temp4
0:7
) + EXTZ(temp5
0:7
) + EXTZ(temp6
0:7
) + EXTZ(temp7
0:7
)
// update accumulator
ACC
0:63
d
0:63
Groups of four signed byte elements of parameter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 8-bit differences from each group are summed
together, then added to the respective word elements of the accumulator, and the result is placed
in parameter d and the accumulator.
Other registers altered: ACC
Figure 3-585. Vector Sum of Absolute Differences of 4 Signed Bytes and Accumulate
Words (__ev_sad4sbaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad4sbaaw d,a,b
0 32 63
b
a
24 16 8 40 48 56
- - - - - - - -
ABS ABS ABS ABS ABS ABS ABS ABS
0 63
d and Accumulator
0 63
Accumulator
+ +
SUM SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-757
__ev_sad4ub[a] __ev_sad4ub[a]
Vector Sum of Absolute Differences of 4 Unsigned Bytes (to Accumulator)
d = __ev_sad4ub (a,b) (A = 0)
d = __ev_sad4uba (a,b) (A = 1)
temp0
0:7
ABS(b
0:7
- a
0:7
)
temp1
0:7
ABS(b
8:15
- a
8:15
)
temp2
0:7
ABS(b
16:23
- a
16:23
)
temp3
0:7
ABS(b
24:31
- raA
24:31
)
temp4
0:7
ABS(b
32:39
- a
32:39
)
temp5
0:7
ABS(b
40:47
- a
40:47
)
temp6
0:7
ABS(b
48:55
- a
48:55
)
temp7
0:7
ABS(b
56:63
- a
56:63
)
d
0:31
EXTZ(temp0
0:7
) + EXTZ(temp1
0:7
) + EXTZ(temp2
0:7
) + EXTZ(temp3
0:7
)
d
32:63
EXTZ(temp4
0:7
) + EXTZ(temp5
0:7
) + EXTZ(temp6
0:7
) + EXTZ(temp7
0:7
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of four signed byte elements of parameter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 9-bit differences from each group are summed
together, and the results are placed into the word elements of parameter d. If A = 1, the result in
parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-586. Vector Sum of Absolute Differences of 4 Unsigned Bytes (to Accumulator
(__ev_sad4ub[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad4ub d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad4uba d,a,b
0 32 63
b
a
24 16 8 40 48 56
-
0 63
d (and Accumulator if A=1)
- - - - - - -
ABS ABS ABS ABS ABS ABS ABS ABS
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-758 Freescale Semiconductor
SPE2 Operations
__ev_sad4ubaaw __ev_sad4ubaaw
Vector Sum of Absolute Differences of 4 Unsigned Bytes and Accumulate into Words
d = __ev_sad4ubaaw (a,b)
temp0
0:7
ABS(b
0:7
- a
0:7
)
temp1
0:7
ABS(b
8:15
- a
8:15
)
temp2
0:7
ABS(b
16:23
- a
16:23
)
temp3
0:7
ABS(b
24:31
- a
24:31
)
temp4
0:7
ABS(b
32:39
- a
32:39
)
temp5
0:7
ABS(b
40:47
- a
40:47
)
temp6
0:7
ABS(b
48:55
- a
48:55
)
temp7
0:7
ABS(b
56:63
- a
56:63
)
d
0:31
ACC
0:31
+ EXTZ(temp0
0:7
) + EXTZ(temp1
0:7
) + EXTZ(temp2
0:7
) + EXTZ(temp3
0:7
)
d
32:63
ACC
32:63
+ EXTZ(temp4
0:7
) + EXTZ(temp5
0:7
) + EXTZ(temp6
0:7
) + EXTZ(temp7
0:7
)
// update accumulator
ACC
0:63
d
0:63
Groups of four unsigned byte elements of parameter a are subtracted from the corresponding
elements of parameter b, the absolute values of the 9-bit differences from each group are summed
together, then added to the respective word elements of the accumulator, and the result is placed
in parameter d and the accumulator.
Other registers altered: ACC
Figure 3-587. Vector Sum of Absolute Differences of 4 Unsigned Bytes and Accumulate
Words (__ev_sad4ubaaw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsad4ubaaw d,a,b
0 32 63
b
a
24 16 8 40 48 56
- - - - - - - -
ABS ABS ABS ABS ABS ABS ABS ABS
0 63
d and Accumulator
0 63
Accumulator
+
SUM
+
SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-759
__ev_sadsw[a] __ev_sadsw[a]
Vector Sum of Absolute Differences of Signed Words (to Accumulator)
d = __ev_sadsw (a,b) (A = 0)
d = __ev_sadswa (a,b) (A = 1)
temph
0:31
ABS(b
0:31
- a
0:31
)
templ
0:31
ABS(b
32:63
- a
32:63
)
d
0:63
EXTZ(temph
0:31
) + EXTZ(temph
0:31
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed word elements of parameter a are subtracted from the corresponding elements of
parameter b, the absolute values of the 32-bit differences are summed together, and the result is
placed in parameter d. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-588. Vector Sum of Absolute Differences of Signed Words (to Accumulator)
(__ev_sadsw[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsadsw d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsadswa d,a,b
0 32 63
b
a
0 63
d (and Accumulator if A=1)
- -
ABS ABS
SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-760 Freescale Semiconductor
SPE2 Operations
__ev_sadswaa __ev_sadswaa
Vector Sum of Absolute Differences of Signed Words and Accumulate
d = __ev_sadswaa (a,b)
temph
0:31
ABS(b
0:31
- a
0:31
)
templ
0:31
ABS(b
32:63
- a
32:63
)
d
0:63
EXTZ(temph
0:31
) + EXTZ(temph
0:31
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The signed word elements of parameter a are subtracted from the corresponding elements of
parameter b, the absolute values of the 32-bit differences are summed together then added to the
contents of the accumulator, and the result is placed in parameter d and the accumulator.
Other registers altered: ACC
Figure 3-589. Vector Sum of Absolute Differences of Signed Words and Accumulate
(__ev_sadswaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsadswaa d,a,b
0 63
d and Accumulator
0 63
Accumulator
+
0 32 63
b
a
- -
ABS ABS
SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-761
__ev_saduw[a] __ev_saduw[a]
Vector Sum of Absolute Differences of Unsigned Words (to Accumulator)
d = __ev_saduw (a,b) (A = 0)
d = __ev_saduwa (a,b) (A = 1)
temph
0:31
ABS(b
0:31
- a
0:31
)
templ
0:31
ABS(b
32:63
- a
32:63
)
d
0:63
EXTZ(temph
0:31
) + EXTZ(temph
0:31
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The unsigned word elements of parameter a are subtracted from the corresponding elements of
parameter b, the absolute values of the 33-bit differences are summed together, and the result is
placed in parameter d. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-590. Vector Sum of Absolute Differences of Unsigned Words (to Accumulator)
(__ev_saduw[a])
A d a b Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsaduw d,a,b
A = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsaduwa d,a,b
0 32 63
b
a
0 63
d (and Accumulator if A=1)
- -
ABS ABS
SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-762 Freescale Semiconductor
SPE2 Operations
__ev_saduwaa __ev_saduwaa
Vector Sum of Absolute Differences of Unsigned Words and Accumulate
d = __ev_saduwaa (a,b)
temph
0:31
ABS(b
0:31
- a
0:31
)
templ
0:31
ABS(b
32:63
- a
32:63
)
d
0:63
EXTZ(temph
0:31
) + EXTZ(temph
0:31
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The unsigned word elements of parameter a are subtracted from the corresponding elements of
parameter b, the absolute values of the 33-bit differences are summed together then added to the
contents of the accumulator, and the result is placed in parameter d and the accumulator.
Other registers altered: ACC
Figure 3-591. Vector Sum of Absolute Differences of Unsigned Words and Accumulate
(__ev_saduwaa)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsaduwaa d,a,b
0 63
d and Accumulator
0 63
Accumulator
+
0 32 63
b
a
- -
ABS ABS
SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-763
__ev_satsbub __ev_satsbub
Vector Saturate Signed Bytes to Unsigned Byte Range
d = __ev_satsbub (a)
// b0
if (a
0:7
<
si
0x00) then ovb0=1 else ovb0=0;
d
0:7
SATURATE(ovb0, 0, 0x00, 0x00, a
0:7
)
// b1
if (a
8:15
<
si
0x00) then ovb1=1 else ovb1=0;
d
8:15
SATURATE(ovb1, 0, 0x00, 0x00, a
8:15
)
// b2
if (a
16:23
<
si
0x00) then ovb2=1 else ovb2=0;
d
16:23
SATURATE(ovb2, 0, 0x00, 0x00, a
16:23
)
// b3
if (a
24:31
<
si
0x00) then ovb3=1 else ovb3=0;
d
24:31
SATURATE(ovb3, 0, 0x00, 0x00, a
24:31
)
// b4
if (a
32:39
<
si
0x00) then ovb4=1 else ovb4=0;
d
32:39
SATURATE(ovb4, 0, 0x00, 0x00, a
32:39
)
// b5
if (a
40:47
<
si
0x00) then ovb5=1 else ovb5=0;
d
40:47
SATURATE(ovb5, 0, 0x00, 0x00, a
40:47
)
// b6
if (a
48:55
<
si
0x00) then ovb6=1 else ovb6=0;
d
48:55
SATURATE(ovb6, 0, 0x00, 0x00, a
48:55
)
// b7
if (a
56:63
<
si
0x00) then ovb7=1 else ovb7=0;
d
56:63
SATURATE(ovb7, 0, 0x00, 0x00, a
56:63
)
ovh ovb0 | ovb1 | ovb2 | ovb3
ovl ovh4 | ovb5 | ovb6 | ovb7
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 8-bit elements of parameter a are saturated to 8-bit unsigned values. Negative elements
saturate to 0. The 8-bit results are placed into parameter d. Any saturation is recorded in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-592. Vector Saturate Signed Byte to Unsigned Byte Range (__ev_satsbub)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatsbub d,a
0 31 32 63
a
15 16 47 48 39 40 55 56 23 24 7 8
0 31 32 63
d
15 16 47 48 39 40 55 56 23 24 7 8
Saturate value to
unsigned-byte range
SPE2 Programming Interface Manual, Rev. 1.0-2
3-764 Freescale Semiconductor
SPE2 Operations
__ev_satsdsw __ev_satsdsw
Vector Saturate Signed Double Word to Signed Word Range
d = __ev_satsdsw (a)
if ((a
0:63
<
si
0xFFFF_FFFF_FFFF_8000) | (a
0:63
>
si
0x0000_0000_7FFF_FFFF)) then ov=1
else ov=0;
d
0:63
EXTS(SATURATE(ov, a
0
, 0x8000_0000, 0x7fff_ffff, a
32:63
))
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov.
The signed 64-bit value in parameter a is saturated to a 32-bit signed value. The 32-bit value is
sign-extended to 64 bits and placed into parameter d. Any saturation is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-593. Vector Saturate Signed Doubleword to Signed Word Range (__ev_satsdsw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatsdsw d,a
0 63
d
0 63
a
sss..................................................s
Saturate value to
signed-word range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-765
__ev_satsduw __ev_satsduw
Vector Saturate Signed Double Word to Unsigned Word Range
d = __ev_satsduw (a)
if ((a
0:63
<
si
0x0000_0000_0000_0000) | (a
0:63
>
si
0x0000_0000_FFFF_FFFF)) then ov=1
else ov=0;
d
0:63
EXTZ(SATURATE(ov, a
0
, 0x0000_0000, 0xFFFF_FFFF, a
32:63
))
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov.
The signed 64-bit value in parameter a is saturated to a 32-bit unsigned value. The 32-bit value is
zero-extended to 64 bits and placed into parameter d. Any saturation is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-594. Vector Saturate Signed Double Word to Unsigned Word Range
(__ev_satsduw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatsduw d,a
0 63
d
0 63
a
zzz................................................z
Saturate value to
unsigned-word range
SPE2 Programming Interface Manual, Rev. 1.0-2
3-766 Freescale Semiconductor
SPE2 Operations
__ev_satshsb __ev_satshsb
Vector Saturate Signed Half Words to Signed Byte Range
d = __ev_satshsb (a)
// h0
if ((a
0:15
<
si
0xFF80) | (a
0:15
>
si
0x007F)) then ovh0=1 else ovh0=0;
d
0:15
EXTS(SATURATE(ovh0, a
0
, 0x80, 0x7f, a
8:15
))
// h1
if ((a
16:31
<
si
0xFF80) | (a
16:31
>
si
0x007F)) then ovh1=1 else ovh1=0;
d
16:31
EXTS(SATURATE(ovh1, a
16
, 0x80, 0x7f, a
24:31
))
// h2
if ((a
32:47
<
si
0xFF80) | (a
32:47
>
si
0x007F)) then ovh2=1 else ovh2=0;
d
32:47
EXTS(SATURATE(ovh2, a
32
, 0x80, 0x7f, a
40:47
))
// h3
if ((a
48:63
<
si
0xFF80) | (a
48:63
>
si
0x007F)) then ovh3=1 else ovh3=0;
d
48:63
EXTS(SATURATE(ovh3, a
48
, 0x80, 0x7f, a
56:63
))
ovh ovh0 | ovh1
ovl ovb2 | ovb3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 16-bit elements of parameter a are saturated to 8-bit signed values. The 8-bit values are
sign-extended to half words and placed into parameter d. Any saturation is recorded in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-595. Vector Saturate Signed Half Words to Signed Byte Range (__ev_satshsb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatshsb d,a
0 32 63
d
16 48
0 32 63
a
16 48
sss........s sss........s sss........s sss........s
Saturate values to
signed-byte range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-767
__ev_satshub __ev_satshub
Vector Saturate Signed Half Words to Unsigned Byte Range
d = __ev_satshub (a)
// b0
if ((a
0:15
<
si
0x0000) | (a
0:15
>
si
0x00FF)) then ovh0=1 else ovh0=0;
d
0:15
EXTZ(SATURATE(ovh0, a
0
, 0x00, 0xFF, a
8:15
))
// b1
if ((a
16:31
<
si
0x000) | (a
16:31
>
si
0x00FF)) then ovh1=1 else ovh1=0;
d
16:31
EXTZ(SATURATE(ovh1, a
16
, 0x00, 0xFF, a
24:31
))
// b2
if ((a
32:47
<
si
0x0000) | (a
32:47
>
si
0x00FF)) then ovh2=1 else ovh2=0;
d
32:47
EXTZ(SATURATE(ovh2, a
32
, 0x00, 0xFF, a
40:47
))
// b3
if ((a
48:63
<
si
0x0000) | (a
48:63
>
si
0x00FF)) then ovh3=1 else ovh3=0;
d
48:63
EXTZ(SATURATE(ovh3, a
48
, 0x00, 0xFF, a
56:63
))
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 16-bit elements of parameter a are saturated to 8-bit unsigned values. Negative
elements saturate to 0. The 8-bit results are zero-extended to half words and placed into parameter
d. Any saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-596. Vector Saturate Signed Half Words to Unsigned Byte Range (__ev_satshub)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatshub d,a
0 32 63
d
16 48
0 32 63
a
16 48
zzz.......z zzz.......z zzz.......z zzz......z
Saturate values to
unsigned-byte range
SPE2 Programming Interface Manual, Rev. 1.0-2
3-768 Freescale Semiconductor
SPE2 Operations
__ev_satshuh __ev_satshuh
Vector Saturate Signed Half Words to Unsigned Half Word Range
d = __ev_satshuh (a)
// h0
if (a
0:15
<
si
0x0000) then ovh0=1 else ovh0=0;
d
0:15
SATURATE(ovh0, 0, 0x0000, 0x0000, a
0:15
)
// h1
if (a
16:31
<
si
0x000) then ovh1=1 else ovh1=0;
d
16:31
SATURATE(ovh1, 0, 0x0000, 0x0000, a
16:31
)
// h2
if (a
32:47
<
si
0x0000) then ovh2=1 else ovh2=0;
d
32:47
SATURATE(ovh2, 0, 0x0000, 0x0000, a
32:47
)
// h3
if (a
48:63
<
si
0x0000) then ovh3=1 else ovh3=0;
d
48:63
SATURATE(ovh3, 0, 0x0000, 0x0000, a
48:63
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 16-bit elements of parameter a are saturated to 16-bit unsigned values. Negative
elements saturate to 0. The 16-bit results are placed into parameter d. Any saturation is recorded
in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-597. Vector Saturate Signed Half Words to Unsigned Half Word Range
(__ev_satshuh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatshuh d,a
0 32 63
d
16 48
0 32 63
a
16 48
z z z z
Saturate values to
unsigned-half word
range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-769
__ev_satswgsdf __ev_satswgsdf
Vector Saturate Signed Word Guarded to Signed Doubleword Fractional
d = __ev_satswgsdf (a)
if ((a
0:63
<
si
0xFFFF_8000_0000_0000) | (a
0:63
>
si
0x0000_7FFF_FFFF_FFFF)) then
ov=1
temp
0:63
SATURATE(ov, a
0
, 0x8000_0000_0000_0000, 0x7fff_ffff_ffff_0000, -----)
else
ov=0
temp
0:63
a
16:63
||
16
0)
d
0:63
temp
0:63
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The signed 64-bit guarded word fraction in 17.47 fractional format in parameter a is saturated to
a 64-bit fraction in 1.63 fractional format, zeroing the low-order 16 bits. The result is placed into
parameter d. The original value is assumed to be in 17.47 fractional format as a result of one or more
guarded word fractional operations. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-598. Vector Saturate Signed Word Guarded to Signed Doubleword Fractional
(__ev_satswgsdf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatswgsdf d,a
0 63
a
d
47 48 63
15 16 17
.
SAT
0 1
.
SPE2 Programming Interface Manual, Rev. 1.0-2
3-770 Freescale Semiconductor
SPE2 Operations
__ev_satswsh __ev_satswsh
Vector Saturate Signed Words to Signed Half Word Range
d = __ev_satswsh (a)
// w0
if ((a
0:31
<
si
0xFFFF8000) | (a
0:31
>
si
0x00007FFF)) then ovh=1 else ovh=0;
d
0:31
EXTS(SATURATE(ovh, a
0
, 0x8000, 0x7fff, a
16:31
))
// w1
if ((a
32:63
<
si
0xFFFF8000) | (a
32:63
>
si
0x00007FFF)) then ovl=1 else ovl=0;
d
32:63
EXTS(SATURATE(ovl, a
32
, 0x8000, 0x7fff, a
48:63
))
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 32-bit elements of parameter a are saturated to 16-bit signed values. The 16-bit results
are sign-extended to 32-bits and placed into parameter d. Any saturation is recorded in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-599. Vector Saturate Signed Words to Signed Half Word Range (__ev_satswsh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatswsh d,a
0 32 63
d
0 32 63
a
sss......................s sss......................s
Saturate values to
signed-half word range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-771
__ev_satswuh __ev_satswuh
Vector Saturate Signed Words to Unsigned Half Word Range
d = __ev_satswuh (a)
// w0
if ((a
0:31
<
si
0x00000000) | (a
0:31
>
si
0x0000FFFF)) then ovh=1 else ovh=0;
d
0:31
EXTZ(SATURATE(ovh, a
0
, 0x0000, 0xFFFF, a
16:31
))
// w1
if ((a
32:63
<
si
0x00000000) | (a
32:63
>
si
0x0000FFFF)) then ovl=1 else ovl=0;
d
32:63
EXTZ(SATURATE(ovl, a
32
, 0x0000, 0xFFFF, a
48:63
))
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 32-bit elements of parameter a are saturated to 16-bit unsigned values. Negative
elements saturate to 0. The values are zero-extended to 32-bits and placed into parameter d. Any
saturation is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-600. Vector Saturate Signed Words to Unsigned Half Word Range
(__ev_satswuh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatswuh d,a
0 32 63
d
0 32 63
a
zzz....................z zzz....................z
Saturate values to
unsigned-half word range
SPE2 Programming Interface Manual, Rev. 1.0-2
3-772 Freescale Semiconductor
SPE2 Operations
__ev_satswuw __ev_satswuw
Vector Pack Signed Words to Unsigned Word Range
d = __ev_satswuw (a)
// w0
if (a
0:31
<
si
0x00000000) then ovh=1 else ovh=0;
d
0:31
SATURATE(ovh, 0, 0x00000000, 0x00000000, a
0:31
)
// w1
if (a
32:63
<
si
0x00000000) then ovl=1 else ovl=0;
d
32:63
SATURATE(ovl, 0, 0x00000000, 0x00000000, a
32:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The signed 32-bit elements of parameter a are saturated to 32-bit unsigned values. Negative
elements saturate to 0. The results are placed into parameter d. Any saturation is recorded in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-601. Vector Saturate Signed Words to Unsigned Word Range (__ev_satswuw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatswuw d,a
0 32 63
d
0 32 63
a
z z
Saturate values to
unsigned-word range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-773
__ev_satuduw __ev_satuduw
Vector Saturate Unsigned Double Word to Unsigned Word Range
d = __ev_satuduw (a)
if (a
0:63
>
ui
0x0000_0000_FFFF_FFFF) then ov=1 else ov=0;
d
0:63
EXTZ(SATURATE(ov, 0, 0xFFFF_FFFF, 0xFFFF_FFFF, a
32:63
))
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov.
The unsigned 64-bit value in parameter a is saturated to a 32-bit unsigned value. The 32-bit value
is zero-extended to 64 bits and placed into parameter d. Any saturation is recorded in the
SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-602. Vector Saturate Unsigned Double Word to Unsigned Word Range
(__ev_satuduw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatuduw d,a
0 63
d
0 63
a
zzz................................................z
Saturate value to
unsigned-word range
SPE2 Programming Interface Manual, Rev. 1.0-2
3-774 Freescale Semiconductor
SPE2 Operations
__ev_satubsb __ev_satubsb
Vector Saturate Unsigned Bytes to Signed Byte Range
d = __ev_satubsb (a)
// b0
if (a
0:7
>
ui
0x7F) then ovb0=1 else ovb0=0;
d
0:7
SATURATE(ovb0, 0, 0x7F, 0x7F, a
0:7
)
// b1
if (a
8:15
>
ui
0x7F) then ovb1=1 else ovb1=0;
d
8:15
SATURATE(ovb1, 0, 0x7F, 0x7F, a
8:15
)
// b2
if (a
16:23
>
ui
0x7F) then ovb2=1 else ovb2=0;
d
16:23
SATURATE(ovb2, 0, 0x7F, 0x7F, a
16:23
)
// b3
if (a
24:31
>
ui
0x7F) then ovb3=1 else ovb3=0;
d
24:31
SATURATE(ovb3, 0, 0x7F, 0x7F, a
24:31
)
// b4
if (a
32:39
>
ui
0x7F) then ovb4=1 else ovb4=0;
d
32:39
SATURATE(ovb4, 0, 0x7F, 0x7F, a
32:39
)
// b5
if (a
40:47
>
ui
0x7F) then ovb5=1 else ovb5=0;
d
40:47
SATURATE(ovb5, 0, 0x7F, 0x7F, a
40:47
)
// b6
if (a
48:55
>
ui
0x7F) then ovb6=1 else ovb6=0;
d
48:55
SATURATE(ovb6, 0, 0x7F, 0x7F, a
48:55
)
// b7
if (a
56:63
>
ui
0x7F) then ovb7=1 else ovb7=0;
d
56:63
SATURATE(ovb7, 0, 0x7F, 0x7F, a
56:63
)
ovh ovb0 | ovb1 | ovb2 | ovb3
ovl ovh4 | ovb5 | ovb6 | ovb7
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 8-bit elements of parameter a are saturated to 8-bit signed values. The 8-bit results
are placed into parameter d. Any saturation is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-603. Vector Saturate Unsigned Byte to Signed Byte Range (__ev_satubsb)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatubsb d,a
0 31 32 63
a
15 16 47 48 39 40 55 56 23 24 7 8
0 31 32 63
d
15 16 47 48 39 40 55 56 23 24 7 8
Saturate value to
signed-byte range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-775
__ev_satuhsh __ev_satuhsh
Vector Saturate Unsigned Half Words to Signed Half Word Range
d = __ev_satuhsh (a)
// h0
if (a
0:15
>
ui
0x7FFF) then ovh0=1 else ovh0=0;
d
0:15
SATURATE(ovh0, 0, 0x7FFF, 0x7FFF, a
0:15
)
// h1
if (a
16:31
>
ui
0x7FFF) then ovh1=1 else ovh1=0;
d
16:31
SATURATE(ovh1, 0, 0x7FFF, 0x7FFF, a
16:31
)
// h2
if (a
32:47
>
ui
0x7FFF) then ovh2=1 else ovh2=0;
d
32:47
SATURATE(ovh2, 0, 0x7FFF, 0x7FFF, a
32:47
)
// h3
if (a
48:63
>
ui
0x7FFF) then ovh3=1 else ovh3=0;
d
48:63
SATURATE(ovh3, 0, 0x7FFF, 0x7FFF, a
48:63
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 16-bit elements of parameter a are saturated to 16-bit signed values. The 16-bit
results are placed into parameter d. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-604. Vector Saturate Unsigned Half Words to Signed Half Word Range
(__ev_satuhsh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatuhsh d,a
0 32 63
d
16 48
0 32 63
a
16 48
z z z z
Saturate values to
signed-half word range
SPE2 Programming Interface Manual, Rev. 1.0-2
3-776 Freescale Semiconductor
SPE2 Operations
__ev_satuhub __ev_satuhub
Vector Saturate Unsigned Half Words to Unsigned Byte Range
d = __ev_satuhub (a)
// h0
if (a
0:15
>
ui
0x00FF) then ovh0=1 else ovh0=0;
d
0:15
EXTZ(SATURATE(ovh0, 0, 0xFF, 0xFF, a
8:15
))
// h1
if (a
16:31
>
ui
0x00FF) then ovh1=1 else ovh1=0;
d
16:31
EXTZ(SATURATE(ovh1, 0, 0xFF, 0xFF, a
24:31
))
// h2
if (a
32:47
>
ui
0x00FF) then ovh2=1 else ovh2=0;
d
32:47
EXTZ(SATURATE(ovh2, 0, 0xFF, 0xFF, a
40:47
))
// h3
if (a
48:63
>
ui
0x00FF) then ovh3=1 else ovh3=0;
d
48:63
EXTZ(SATURATE(ovh3, 0, 0xFF, 0xFF, a
56:63
))
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 16-bit elements of parameter a are saturated to 8-bit unsigned values. The 8-bit
results are zero-extended to half words and placed into parameter d. Any saturation is recorded in
the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-605. Vector Saturate Unsigned Half Words to Unsigned Byte Range
(__ev_satuhub)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatuhub d,a
0 32 63
d
16 48
0 32 63
a
16 48
zzz.......z zzz.......z zzz.......z zzz......z
Saturate values to
unsigned-byte range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-777
__ev_satuwsw __ev_satuwsw
Vector Saturate Unsigned Words to Signed Word Range
d = __ev_satuwsw (a)
// w0
if (a
0:31
>
ui
0x7FFFFFFF) then ovh=1 else ovh=0;
d
0:31
SATURATE(ovh, 0, 0x7FFFFFFF, 0x7FFFFFFF, a
0:31
)
// w1
if ((a
32:63
>
ui
0x7FFFFFFFF) then ovl=1 else ovl=0;
d
32:63
SATURATE(ovl, 0, 0x7FFFFFFF, 0x7FFFFFFF, a
32:63
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 32-bit elements of parameter a are saturated to 32-bit signed values. The 32-bit
results are placed into parameter d. Any saturation is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-606. Vector Saturate Unsigned Words to Signed Word Range (__ev_satuwsw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatuwsw d,a
0 32 63
d
0 32 63
a
z z
Saturate values to
signed-word range
SPE2 Programming Interface Manual, Rev. 1.0-2
3-778 Freescale Semiconductor
SPE2 Operations
__ev_satuwuh __ev_satuwuh
Vector Saturate Unsigned Words to Unsigned Half Word Range
d = __ev_satuwuh (a)
// w0
if (a
0:31
>
ui
0x0000FFFF) then ovh=1 else ovh=0;
d
0:15
EXTZ(SATURATE(ovh, 0, 0xFFFF, 0xFFFF, a
16:31
))
// w1
if ((a
32:63
>
ui
0x0000FFFF) then ovl=1 else ovl=0;
d
16:31
EXTZ(SATURATE(ovl, 0, 0xFFFF, 0xFFFF, a
48:63
))
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The unsigned 32-bit elements of parameters a and b are saturated to 16-bit unsigned values. The
16-bit results are zero-extended into words and placed into parameter d. Any saturation is recorded
in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-607. Vector Saturate Unsigned Words to Unsigned Half Word Range
(__ev_satuwuh)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsatuwuh d,a
0 32 63
d
0 32 63
a
zzz....................z zzz....................z
Saturate values to
unsigned-half word range
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-779
__ev_select_eq __ev_select_eq
Vector Select Equal
e = __ev_select_eq (a,b,c,d)
if (a
0:31
= b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
= b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameters c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a = b? c : d.
Figure 3-608. Vector Select Equal (__ev_select_eq)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evcmpeq x,a,b
evsel e,c,d,x
0 31 32 63
a
b
=
=
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-780 Freescale Semiconductor
SPE2 Operations
__ev_select_gts __ev_select_gts
Vector Select Greater Than Signed
e = __ev_select_gts (a,b,c,d)
if (a
0:31
>
signed
b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
>
signed
b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a > b ? c : d.
Figure 3-609. Vector Select Greater Than Signed (__ev_select_gts)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evcmpgts x,a,b
evsel e,c,d,x
0 31 32 63
a
b
>
>
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-781
__ev_select_gtu __ev_select_gtu
Vector Select Greater Than Unsigned
e = __ev_select_gtu (a,b,c,d)
if (a
0:31
>
unsigned
c
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
>
unsigned
b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a > b? c : d.
Figure 3-610. Vector Select Greater Than Unsigned (__ev_select_gtu)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evcmpgtu x,a,b
evsel e,c,d,x
0 31 32 63
a
b
>
>
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-782 Freescale Semiconductor
SPE2 Operations
__ev_select_lts __ev_select_lts
Vector Select Less Than Signed
e = __ev_select_lts (a,b,c,d)
if (a
0:31
<
signed
b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
<
signed
b
32:63
) then e c
32:63
else e d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a < b? c : d.
Figure 3-611. Vector Select Less Than Signed (__ev_select_lts)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evcmplts x,a,b
evsel e,c,d,x
0 31 32 63
a
b
<
<
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-783
__ev_select_ltu __ev_select_ltu
Vector Select Less Than Unsigned
e = __ev_select_ltu (a,b,c,d)
if (a
0:31
<<
unsigned
b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
<
unsigned
b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a < b? c : d.
Figure 3-612. Vector Select Less Than Unsigned (__ev_select_ltu)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evcmpltu x,a,b
evsel e,c,d,x
0 31 32 63
a
b
<
<
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-784 Freescale Semiconductor
SPE2 Operations
__ev_selbit __ev_selbit
Vector Select Bit
d = __ev_selbit (a,b,c)
temp
0:63
64
0
do i=0 to 63
if (a
i
= 0) then temp
i
b
i
else temp
i
c
i
end
d
0:63
temp
0:63
For each bit in parameter a that contains the value 0, the corresponding bit in parameter b is
selected. For each bit in parameter a that contains the value 1, the corresponding bit in parameter
c is selected. The selected bits are then placed into parameter d.
Figure 3-613. Vector Select Bit (__ev_selbit)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evselbit d,b,c
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
0 31 32 63
b
c
temp
a
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
d
0 1 0 1 1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-785
__ev_selbitm0 __ev_selbitm0
Vector Select Bit if Mask is 0
d = __ev_selbitm0 (a,b,c)
temp
0:63
64
0
do i=0 to 63
if (b
i
= 0) then temp
i
c
i
else temp
i
a
i
end
d
0:63
temp
0:63
For each bit in parameter b that contains the value 0, the corresponding bit in parameter c is
selected. For each bit in parameter b that contains the value 1, the corresponding bit in parameter
a is selected. The selected bits are then placed into parameter d.
Figure 3-614. Vector Select Bit if Mask is 0 (__ev_selbitm0)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evselbitm0 d,b,c
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
0 31 32 63
c
a
temp
b
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
d
0 1 0 1 1
SPE2 Programming Interface Manual, Rev. 1.0-2
3-786 Freescale Semiconductor
SPE2 Operations
__ev_selbitm1 __ev_selbitm1
Vector Select Bit if Mask is 1
d = __ev_selbitm1 (a,b,c)
temp
0:63
64
0
do i=0 to 63
if (b
i
= 0) then temp
i
a
i
else temp
i
c
i
end
d
0:63
temp
0:63
For each bit in parameter b that contains the value 1, the corresponding bit in parameter c is
selected. For each bit in parameter b that contains the value 0, the corresponding bit in parameter
a is selected. The selected bits are then placed into parameter d.
Figure 3-615. Vector Select Bit if Mask is 1 (__ev_selbitm1)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ d a
evselbitm1 d,b,c
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
0 31 32 63
a
c
temp
b
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
d
0 1 0 1 1
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-787
__ev_seteqb[_rc] __ev_seteqb[_rc]
Vector Set if Equal Byte [and Record]
d = __ev_seteqb (a,b) (Rc = 0)
d = __ev_seteqb_rc (a,b) (Rc = 1)
do i=0 to 63 by 8
if a
i:i+7
=
b
i:i+7
then d
i:i+7
=
8
1 else d
i:i+7
=
8
0
end
c0 (a
0:7
=
b
0:7
)
| (a
8:15
=
b
8:15
)
| (a
16:23
=
b
16:23
)
| (a
24:31
=
b
24:31
)
c1 (a
0:7
=
b
0:7
)
& (a
8:15
=
b
8:15
)
& (a
16:23
=
b
16:23
)
& (a
24:31
=
b
24:31
)
c2 (a
0:7
=
b
0:7
)
| (a
8:15
=
b
8:15
)
| (a
16:23
=
b
16:23
)
| (a
24:31
=
b
24:31
) |
(a
32:39
=
b
32:39
)
| (a
40:47
=
b
40:47
)
| (a
48:55
=
b
48:55
)
| (a
56:63
=
b
56:63
)
c3 (a
0:7
=
b
0:7
)
& (a
8:15
=
b
8:15
)
& (a
16:23
=
b
16:23
)
& (a
24:31
=
b
24:31
) &
(a
32:39
=
b
32:39
)
& (a
40:47
=
b
40:47
)
& (a
48:55
=
b
48:55
)
& (a
56:63
=
b
56:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each byte element in parameter a is compared to the corresponding element in parameter b. The
corresponding element of parameter d is set to all 1s if the element in parameter a is equal to the
element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if any of the high-order four elements of parameter a is equal to the corresponding element
of parameter b; it is cleared otherwise. The next bit in cr0 is set if all of the high-order four
elements of parameter a are equal to the corresponding element of parameter b; it is cleared
otherwise. The third bit of cr0 is set if any element of parameter a is equal to the corresponding
element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the elements of
parameter a are equal to the corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-616. Vector Set if Equal Byte [and Record] (__ev_seteqb[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evseteqb d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evseteqb. d,a,b
OR
AND
OR
AND
cr0
0 32 63
b
a
24 16 8 40 48 56
= = = = = = = =
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-788 Freescale Semiconductor
SPE2 Operations
__ev_seteqh[_rc] __ev_seteqh[_rc]
Vector Set if Equal Half Word [and Record]
d = __ev_seteqh (a,b) (Rc = 0)
d = __ev_seteqh_rc (a,b) (Rc = 1)
do i=0 to 63 by 16
if a
i:i+15
=
b
i:i+15
then d
i:i+15
=
16
1 else d
i:i+15
=
16
0
end
c0 (a
0:15
=
b
0:15
)
| (a
16:31
=
b
16:31
)
c1 (a
0:15
=
b
0:15
)
& (a
16:31
=
b
16:31
)
c2 (a
0:15
=
b
0:15
)
| (a
16:31
=
b
16:31
)
| (a
32:47
=
b
32:47
)
| (a
48:63
=
b
48:63
)
c3 (a
0:15
=
b
0:15
)
& (a
16:31
=
b
16:31
)
& (a
32:47
=
b
32:47
)
& (a
48:63
=
b
48:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each half word element in parameter a is compared to the corresponding element in parameter b.
The corresponding element of parameter d is set to all 1s if the element in parameter a is equal
to the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if either of the high-order two elements of parameter a is equal to the corresponding element
of parameter b; it is cleared otherwise. The next bit in cr0 is set if both of the high-order two
elements of parameter a are equal to the corresponding element of parameter b; it is cleared
otherwise. The third bit of cr0 is set if any element of parameter a is equal to the corresponding
element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the elements of
parameter a are equal to the corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-617. Vector Set if Equal Half Word [and Record] (__ev_seteqh[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evseteqh d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evseteqh. d,a,b
b
0 31 32 63
a
15 16 47 48
= = = =
d
OR
AND
OR
AND
cr0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-789
__ev_seteqw[_rc] __ev_seteqw[_rc]
Vector Set if Equal Word [and Record]
d = __ev_seteqw (a,b) (Rc = 0)
d = __ev_seteqw_rc (a,b) (Rc = 1)
if a
0:31
=
b
0:31
then d
0:31
=
32
1 else d
0:31
=
32
0
if a
32:63
=
b
32:63
then d
32:63
=
32
1 else d
32:63
=
32
0
c0 (a
0:31
=
b
0:31
)
c1 (a
32:63
=
b
32:63
)
c2 (a
0:31
=
b
0:31
)
| (a
32:63
=
b
32:63
)
c3 (a
0:31
=
b
0:31
)
& (a
32:63
=
b
32:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each word element in parameter a is compared to the corresponding element in parameter b. The
corresponding element of parameter d is set to all 1s if the element in parameter a is equal to the
element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if the high-order element of parameter a is equal to the high-order element of parameter b;
it is cleared otherwise. The next bit in cr0 is set if the low-order element of parameter a is equal
to the low-order element of parameter b; it is cleared otherwise. The third bit of cr0 is set if any
element of parameter a is equal to the corresponding element of parameter b; it is cleared
otherwise. The last bit of cr0 is set if both elements of parameter a are equal to the corresponding
elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-618. Vector Set if Equal Word (__ev_seteqw[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evseteqw d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evseteqw. d,a,b
b
0 31 32 63
a
= =
d
OR
AND
cr0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-790 Freescale Semiconductor
SPE2 Operations
__ev_setgtbs[_rc] __ev_setgtbs[_rc]
Vector Set if Greater Than Byte Signed [and Record]
d = __ev_setgtbs (a,b) (Rc = 0)
d = __ev_setgtbs_rc (a,b) (Rc = 1)
do i=0 to 63 by 8
if a
i:i+7
>
si
b
i:i+7
then d
i:i+7
=
8
1 else d
i:i+7
=
8
0
end
c0 (a
0:7
>
si
b
0:7
)
| (a
8:15
>
si
b
8:15
)
| (a
16:23
>
si
b
16:23
)
| (a
24:31
>
si
b
24:31
)
c1 (d
0:7
>
si
b
0:7
)
& (a
8:15
>
si
b
8:15
)
& (a
16:23
>
si
b
16:23
)
& (a
24:31
>
si
b
24:31
)
c2 (a
0:7
>
si
b
0:7
)
| (a
8:15
>
si
b
8:15
)
| (a
16:23
>
si
b
16:23
)
| (a
24:31
>
si
b
24:31
) |
(a
32:39
>
si
b
32:39
)
| (a
40:47
>
si
b
40:47
)
| (a
48:55
>
si
b
48:55
)
| (a
56:63
>
si
b
56:63
)
c3 (a
0:7
>
si
b
0:7
)
& (a
8:15
>
si
b
8:15
)
& (a
16:23
>
si
b
16:23
)
& (a
24:31
>
si
b
24:31
) &
(a
32:39
>
si
b
32:39
)
& (a
40:47
>
si
b
40:47
)
& (a
48:55
>
si
b
48:55
)
& (a
56:63
>
si
b
56:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each signed byte element in parameter a is compared to the corresponding signed byte element in
parameter b. The corresponding element of parameter d is set to all 1s if the element in parameter
a is greater than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if any of the high-order four elements of parameter a is greater than the corresponding
element of parameter b; it is cleared otherwise. The next bit in cr0 is set if all of the high-order
four elements of parameter a are greater than the corresponding element of parameter b; it is
cleared otherwise. The third bit of cr0 is set if any element of parameter a is greater than the
corresponding element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the
elements of parameter a are greater than the corresponding elements of parameter b; it is cleared
otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-619. Vector Set if Greater than Byte Signed [and Record] (__ev_setgtbs[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtbs d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtbs. d,a,b
OR
AND
OR
AND
cr0
0 32 63
b
a
24 16 8 40 48 56
>
si
>
si
>
si
>
si
>
si
>
si
>
si
>
si
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-791
__ev_setgtbu[_rc] __ev_setgtbu[_rc]
Vector Set if Greater Than Byte Unsigned [and Record]
d = __ev_setgtbu (a,b) (Rc = 0)
d = __ev_setgtbu_rc (a,b) (Rc = 1)
do i=0 to 63 by 8
if a
i:i+7
>
ui
b
i:i+7
then d
i:i+7
=
8
1 else d
i:i+7
=
8
0
end
c0 (a
0:7
>
ui
b
0:7
)
| (a
8:15
>
ui
b
8:15
)
| (a
16:23
>
ui
b
16:23
)
| (a
24:31
>
ui
b
24:31
)
c1 (a
0:7
>
ui
b
0:7
)
& (a
8:15
>
ui
b
8:15
)
& (a
16:23
>
ui
b
16:23
)
& (a
24:31
>
ui
b
24:31
)
c2 (a
0:7
>
ui
b
0:7
)
| (a
8:15
>
ui
b
8:15
)
| (a
16:23
>
ui
b
16:23
)
| (a
24:31
>
ui
b
24:31
) |
(a
32:39
>
ui
b
32:39
)
| (a
40:47
>
ui
b
40:47
)
| (a
48:55
>
ui
b
48:55
)
| (a
56:63
>
ui
b
56:63
)
c3 (a
0:7
>
ui
b
0:7
)
& (a
8:15
>
ui
b
8:15
)
& (a
16:23
>
ui
b
16:23
)
& (a
24:31
>
ui
b
24:31
) &
(a
32:39
>
ui
b
32:39
)
& (a
40:47
>
ui
b
40:47
)
& (a
48:55
>
ui
b
48:55
)
& (a
56:63
>
ui
b
56:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each unsigned byte element in parameter a is compared to the corresponding unsigned byte
element in parameter b. The corresponding element of parameter d is set to all 1s if the element
in parameter a is greater than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if any of the high-order four elements of parameter a is greater than the corresponding
element of parameter b; it is cleared otherwise. The next bit in cr0 is set if all of the high-order
four elements of parameter a are greater than the corresponding element of parameter b; it is
cleared otherwise. The third bit of cr0 is set if any element of parameter a is greater than the
corresponding element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the
elements of parameter a are greater than the corresponding elements of parameter b; it is cleared
otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-620. Vector Set if Greater than Byte Unsigned [and Record] (__ev_setgtbu[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtbu d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtbu. d,a,b
OR
AND
OR
AND
cr0
0 32 63
b
a
24 16 8 40 48 56
>
ui
>
ui
>
ui
>
ui
>
ui
>
ui
>
ui
>
ui
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-792 Freescale Semiconductor
SPE2 Operations
__ev_setgths[_rc] __ev_setgths[_rc]
Vector Set if Greater Than Half Word SIgned [and Record]
d = __ev_setgths (a,b) (Rc = 0)
d = __ev_setgths_rc (a,b) (Rc = 1)
do i=0 to 63 by 16
if a
i:i+15
>
si
b
i:i+15
then d
i:i+15
=
16
1 else d
i:i+15
=
16
0
end
c0 (a
0:15
>
si
b
0:15
)
| (a
16:31
>
si
b
16:31
)
c1 (a
0:15
>
si
b
0:15
)
& (a
16:31
>
si
b
16:31
)
c2 (a
0:15
>
si
b
0:15
)
| (a
16:31
>
si
b
16:31
)
| (a
32:47
>
si
b
32:47
)
| (a
48:63
>
si
b
48:63
)
c3 (a
0:15
>
si
b
0:15
)
& (a
16:31
>
si
b
16:31
)
& (a
32:47
>
si
b
32:47
)
& (a
48:63
>
si
b
48:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each signed half word element in parameter a is compared to the corresponding signed element in
parameter b. The corresponding element of parameter d is set to all 1s if the element in parameter
a is greater than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if either of the high-order two elements of parameter a is greater than the corresponding
element of parameter b; it is cleared otherwise. The next bit in cr0 is set if both of the high-order
two elements of parameter a are greater than the corresponding element of parameter b; it is
cleared otherwise. The third bit of cr0 is set if any element of parameter a is greater than the
corresponding element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the
elements of parameter a are greater than the corresponding elements of parameter b; it is cleared
otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-621. Vector Set if Greater than Half Word Signed [and Record]
(__ev_setgths[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgths d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgths. d,a,b
b
0 31 32 63
a
15 16 47 48
>
si
>
si
>
si
>
si
d
OR
AND
OR
AND
cr0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-793
__ev_setgthu[_rc] __ev_setgthu[_rc]
Vector Set if Greater Than Half Word Unsigned [and Record]
d = __ev_setgthu (a,b) (Rc = 0)
d = __ev_setgthu_rc (a,b) (Rc = 1)
do i=0 to 63 by 16
if a
i:i+15
>
ui
b
i:i+15
then d
i:i+15
=
16
1 else d
i:i+15
=
16
0
end
c0 (a
0:15
>
ui
b
0:15
)
| (a
16:31
>
ui
b
16:31
)
c1 (a
0:15
>
ui
b
0:15
)
& (a
16:31
>
ui
b
16:31
)
c2 (a
0:15
>
ui
b
0:15
)
| (a
16:31
>
ui
b
16:31
)
| (a
32:47
>
ui
b
32:47
)
| (a
48:63
>
ui
b
48:63
)
c3 (a
0:15
>
ui
b
0:15
)
& (a
16:31
>
ui
b
16:31
)
& (a
32:47
>
ui
b
32:47
)
& (a
48:63
>
ui
b
48:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each unsigned half word element in parameter a is compared to the corresponding unsigned
element in parameter b. The corresponding element of parameter d is set to all 1s if the element
in parameter a is greater than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if either of the high-order two elements of parameter a is greater than the corresponding
element of parameter b; it is cleared otherwise. The next bit in cr0 is set if both of the high-order
two elements of parameter a are greater than the corresponding element of parameter b; it is
cleared otherwise. The third bit of cr0 is set if any element of parameter a is greater than the
corresponding element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the
elements of parameter a are greater than the corresponding elements of parameter b; it is cleared
otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-622. Vector Set if Greater than Half Word Unsigned [and Record]
(__ev_setgthu[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgthu d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgthu. d,a,b
b
0 31 32 63
a
15 16 47 48
>
ui
>
ui
>
ui
>
ui
d
OR
AND
OR
AND
cr0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-794 Freescale Semiconductor
SPE2 Operations
__ev_setgtws[_rc] __ev_setgtws[_rc]
Vector Set if Greater Than Word Signed [and Record]
d = __ev_setgtws (a,b) (Rc = 0)
d = __ev_setgtws_rc (a,b) (Rc = 1)
if a
0:31
>
si
b
0:31
then d
0:31
=
32
1 else d
0:31
=
32
0
if a
32:63
>
si
b
32:63
then d
32:63
=
32
1 else d
32:63
=
32
0
c0 (a
0:31
>
si
b
0:31
)
c1 (a
32:63
>
si
b
32:63
)
c2 (a
0:31
>
si
b
0:31
)
| (a
32:63
>
si
b
32:63
)
c3 (a
0:31
>
si
b
0:31
)
& (a
32:63
>
si
b
32:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each signed word element in parameter a is compared to the corresponding signed word element
in parameter b. The corresponding element of parameter d is set to all 1s if the element in
parameter a is greater than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if the high-order element of parameter a is greater than the high-order element of parameter
b; it is cleared otherwise. The next bit in cr0 is set if the low-order element of parameter a is greater
than the low-order element of parameter b; it is cleared otherwise. The third bit of cr0 is set if any
element of parameter a is greater than the corresponding element of parameter b; it is cleared
otherwise. The last bit of cr0 is set if both elements of parameter a are greater than the
corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-623. Vector Set if Greater than Word Signed (__ev_setgtws[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtws d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtws. d,a,b
b
0 31 32 63
a
>
si
>
si
d
OR
AND
cr0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-795
__ev_setgtwu[_rc] __ev_setgtwu[_rc]
Vector Set if Greater Than Word Unsigned [and Record]
d = __ev_setgtwu (a,b) (Rc = 0)
d = __ev_setgtwu_rc (a,b) (Rc = 1)
if a
0:31
>
ui
b
0:31
then d
0:31
=
32
1 else d
0:31
=
32
0
if a
32:63
>
ui
b
32:63
then d
32:63
=
32
1 else d
32:63
=
32
0
c0 (a
0:31
>
ui
b
0:31
)
c1 (a
32:63
>
ui
b
32:63
)
c2 (a
0:31
>
ui
b
0:31
)
| (a
32:63
>
ui
b
32:63
)
c3 (a
0:31
>
ui
b
0:31
)
& (a
32:63
>
ui
b
32:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each unsigned word element in parameter a is compared to the corresponding unsigned word
element in parameter b. The corresponding element of parameter d is set to all 1s if the element
in parameter a is greater than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if the high-order element of parameter a is greater than the high-order element of parameter
b; it is cleared otherwise. The next bit in cr0 is set if the low-order element of parameter a is greater
than the low-order element of parameter b; it is cleared otherwise. The third bit of cr0 is set if any
element of parameter a is greater than the corresponding element of parameter b; it is cleared
otherwise. The last bit of cr0 is set if both elements of parameter a are greater than the
corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-624. Vector Set if Greater than Word Unsigned (__ev_setgtwu[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtwu d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetgtwu. d,a,b
b
0 31 32 63
a
>
ui
>
ui
d
OR
AND
cr0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-796 Freescale Semiconductor
SPE2 Operations
__ev_setltbs[_rc] __ev_setltbs[_rc]
Vector Set if Greater Than Byte Signed [and Record]
d = __ev_setltbs (a,b) (Rc = 0)
d = __ev_setltbs_rc (a,b) (Rc = 1)
do i=0 to 63 by 8
if a
i:i+7
<
si
b
i:i+7
then d
i:i+7
=
8
1 else d
i:i+7
=
8
0
end
c0 (a
0:7
<
si
b
0:7
)
| (a
8:15
<
si
b
8:15
)
| (a
16:23
<
si
b
16:23
)
| (a
24:31
<
si
b
24:31
)
c1 (a
0:7
<
si
b
0:7
)
& (a
8:15
<
si
b
8:15
)
& (a
16:23
<
si
b
16:23
)
& (a
24:31
<
si
b
24:31
)
c2 (a
0:7
<
si
b
0:7
)
| (a
8:15
<
si
b
8:15
)
| (a
16:23
<
si
b
16:23
)
| (a
24:31
<
si
b
24:31
) |
(a
32:39
<
si
b
32:39
)
| (a
40:47
<
si
b
40:47
)
| (a
48:55
<
si
b
48:55
)
| (a
56:63
<
si
b
56:63
)
c3 (a
0:7
<
si
b
0:7
)
& (a
8:15
<
si
b
8:15
)
& (a
16:23
<
si
b
16:23
)
& (a
24:31
<
si
b
24:31
) &
(a
32:39
<
si
b
32:39
)
& (a
40:47
<
si
b
40:47
)
& (a
48:55
<
si
b
48:55
)
& (a
56:63
<
si
b
56:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each signed byte element in parameter a is compared to the corresponding signed byte element in
parameter b. The corresponding element of parameter d is set to all 1s if the element in parameter
a is greater than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if any of the high-order four elements of parameter a is less than the corresponding element
of parameter b; it is cleared otherwise. The next bit in cr0 is set if all of the high-order four
elements of parameter a are less than the corresponding element of parameter b; it is cleared
otherwise. The third bit of cr0 is set if any element of parameter a is less than the corresponding
element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the elements of
parameter a are less than the corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-625. Vector Set if Less Than Byte Signed [and Record] (__ev_setltbs[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltbs d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltbs. d,a,b
OR
AND
OR
AND
cr0
0 32 63
b
a
24 16 8 40 48 56
<
si
<
si
<
si
<
si
<
si
<
si
<
si
<
si
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-797
__ev_setltbu[_rc] __ev_setltbu[_rc]
Vector Set if Less Than Byte Unsigned [and Record]
d = __ev_setltbu (a,b) (Rc = 0)
d = __ev_setltbu_rc (a,b) (Rc = 1)
do i=0 to 63 by 8
if a
i:i+7
<
ui
b
i:i+7
then d
i:i+7
=
8
1 else d
i:i+7
=
8
0
end
c0 (a
0:7
<
ui
b
0:7
)
| (a
8:15
<
ui
b
8:15
)
| (a
16:23
<
ui
b
16:23
)
| (a
24:31
<
ui
b
24:31
)
c1 (a
0:7
<
ui
b
0:7
)
& (a
8:15
<
ui
b
8:15
)
& (a
16:23
<
ui
b
16:23
)
& (a
24:31
<
ui
b
24:31
)
c2 (a
0:7
<
ui
b
0:7
)
| (a
8:15
<
ui
b
8:15
)
| (a
16:23
<
ui
b
16:23
)
| (a
24:31
<
ui
b
24:31
) |
(a
32:39
<
ui
b
32:39
)
| (a
40:47
<
ui
b
40:47
)
| (a
48:55
<
ui
b
48:55
)
| (a
56:63
<
ui
b
56:63
)
c3 (a
0:7
<
ui
b
0:7
)
& (a
8:15
<
ui
b
8:15
)
& (a
16:23
<
ui
b
16:23
)
& (a
24:31
<
ui
b
24:31
) &
(a
32:39
<
ui
b
32:39
)
& (a
40:47
<
ui
b
40:47
)
& (a
48:55
<
ui
b
48:55
)
& (a
56:63
<
ui
b
56:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each unsigned byte element in parameter a is compared to the corresponding unsigned byte
element in parameter b. The corresponding element of parameter d is set to all 1s if the element
in parameter a is less than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if any of the high-order four elements of parameter a is less than the corresponding element
of parameter b; it is cleared otherwise. The next bit in cr0 is set if all of the high-order four
elements of parameter a are less than the corresponding element of parameter b; it is cleared
otherwise. The third bit of cr0 is set if any element of parameter a is less than the corresponding
element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the elements of
parameter a are less than the corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-626. Vector Set if Less Than Byte Unsigned [and Record] (__ev_setltbu[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltbu d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltbu. d,a,b
OR
AND
OR
AND
cr0
0 32 63
b
a
24 16 8 40 48 56
<
ui
<
ui
<
ui
<
ui
<
ui
<
ui
<
ui
<
ui
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-798 Freescale Semiconductor
SPE2 Operations
__ev_setlths[_rc] __ev_setlths[_rc]
Vector Set if Greater Than Half Word SIgned [and Record]
d = __ev_setlths (a,b) (Rc = 0)
d = __ev_setlths_rc (a,b) (Rc = 1)
do i=0 to 63 by 16
if a
i:i+15
<
si
b
i:i+15
then d
i:i+15
=
16
1 else d
i:i+15
=
16
0
end
c0 (a
0:15
<
si
b
0:15
)
| (a
16:31
<
si
b
16:31
)
c1 (a
0:15
<
si
b
0:15
)
& (a
16:31
<
si
b
16:31
)
c2 (a
0:15
<
si
b
0:15
)
| (a
16:31
<
si
b
16:31
)
| (a
32:47
<
si
b
32:47
)
| (a
48:63
<
si
b
48:63
)
c3 (a
0:15
<
si
b
0:15
)
& (a
16:31
<
si
b
16:31
)
& (a
32:47
<
si
b
32:47
)
& (a
48:63
<
si
b
48:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each signed half word element in parameter a is compared to the corresponding signed element in
parameter b. The corresponding element of parameter d is set to all 1s if the element in parameter
a is less than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if either of the high-order two elements of parameter a is less than the corresponding element
of parameter b; it is cleared otherwise. The next bit in cr0 is set if both of the high-order two
elements of parameter a are less than the corresponding element of parameter b; it is cleared
otherwise. The third bit of cr0 is set if any element of parameter a is less than the corresponding
element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the elements of
parameter a are less than the corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-627. Vector Set if Less than Half Word Signed [and Record] (__ev_setlths[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetlths d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetlths. d,a,b
b
0 31 32 63
a
15 16 47 48
<
si
<
si
<
si
<
si
d
OR
AND
OR
AND
cr0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-799
__ev_setlthu[_rc] __ev_setlthu[_rc]
Vector Set if Less Than Half Word Unsigned [and Record]
d = __ev_setlthu (a,b) (Rc = 0)
d = __ev_setlthu_rc (a,b) (Rc = 1)
do i=0 to 63 by 16
if a
i:i+15
<
ui
b
i:i+15
then d
i:i+15
=
16
1 else d
i:i+15
=
16
0
end
c0 (a
0:15
<
ui
b
0:15
)
| (a
16:31
<
ui
b
16:31
)
c1 (a
0:15
<
ui
b
0:15
)
& (a
16:31
<
ui
b
16:31
)
c2 (a
0:15
<
ui
b
0:15
)
| (a
16:31
<
ui
b
16:31
)
| (a
32:47
<
ui
b
32:47
)
| (a
48:63
<
ui
b
48:63
)
c3 (a
0:15
<
ui
b
0:15
)
& (a
16:31
<
ui
b
16:31
)
& (a
32:47
<
ui
b
32:47
)
& (a
48:63
<
ui
b
48:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each unsigned half word element in parameter a is compared to the corresponding unsigned
element in parameter b. The corresponding element of parameter d is set to all 1s if the element
in parameter a is less than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if either of the high-order two elements of parameter a is less than the corresponding element
of parameter b; it is cleared otherwise. The next bit in cr0 is set if both of the high-order two
elements of parameter a are less than the corresponding element of parameter b; it is cleared
otherwise. The third bit of cr0 is set if any element of parameter a is less than the corresponding
element of parameter b; it is cleared otherwise. The last bit of cr0 is set if all of the elements of
parameter a are less than the corresponding elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-628. Vector Set if Less than Half Word Unsigned [and Record] (__ev_setlthu[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetlthu d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetlthu. d,a,b
b
0 31 32 63
a
15 16 47 48
<
ui
<
ui
<
ui
<
ui
d
OR
AND
OR
AND
cr0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-800 Freescale Semiconductor
SPE2 Operations
__ev_setltws[_rc] __ev_setltws[_rc]
Vector Set if Less Than Word Signed [and Record]
d = __ev_setltws (a,b) (Rc = 0)
d = __ev_setltws_rc (a,b) (Rc = 1)
if a
0:31
<
si
b
0:31
then d
0:31
=
32
1 else d
0:31
=
32
0
if a
32:63
<
si
b
32:63
then d
32:63
=
32
1 else d
32:63
=
32
0
c0 (a
0:31
<
si
b
0:31
)
c1 (a
32:63
<
si
b
32:63
)
c2 (a
0:31
<
si
b
0:31
)
| (a
32:63
<
si
b
32:63
)
c3 (a
0:31
<
si
b
0:31
)
& (a
32:63
<
si
b
32:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each signed word element in parameter a is compared to the corresponding signed word element
in parameter b. The corresponding element of parameter d is set to all 1s if the element in
parameter a is less than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if the high-order element of parameter a is less than the high-order element of parameter b;
it is cleared otherwise. The next bit in cr0 is set if the low-order element of parameter a is less than
the low-order element of parameter b; it is cleared otherwise. The third bit of cr0 is set if any
element of parameter a is less than the corresponding element of parameter b; it is cleared
otherwise. The last bit of cr0 is set if both elements of parameter a are less than the corresponding
elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-629. Vector Set if Less than Word Signed (__ev_setltws[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltws d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltws. d,a,b
b
0 31 32 63
a
<
si
<
si
d
OR
AND
cr0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-801
__ev_setltwu[_rc] __ev_setltwu[_rc]
Vector Set if Less Than Word Unsigned [and Record]
d = __ev_setltwu (a,b) (Rc = 0)
d = __ev_setltwu_rc (a,b) (Rc = 1)
if a
0:31
<
ui
b
0:31
then d
0:31
=
16
1 else d
0:31
=
16
0
if a
32:63
<
ui
b
32:63
then d
32:63
=
16
1 else d
32:63
=
16
0
c0 (a
0:31
<
ui
b
0:31
)
c1 (a
32:63
<
ui
b
32:63
)
c2 (a
0:31
<
ui
b
0:31
)
| (a
32:63
<
ui
b
32:63
)
c3 (a
0:31
<
ui
b
0:31
)
& (a
32:63
<
ui
b
32:63
)
if (Rc = 1) then CR
0:3
c0 || c1 || c2 || c3
Each unsigned word element in parameter a is compared to the corresponding unsigned word
element in parameter b. The corresponding element of parameter d is set to all 1s if the element
in parameter a is less than the element in parameter b, and is cleared to all 0s otherwise.
If Rc=1, then cr0 is updated according to the comparison results. The most significant bit in cr0
is set if the high-order element of parameter a is less than the high-order element of parameter b;
it is cleared otherwise. The next bit in cr0 is set if the low-order element of parameter a is less than
the low-order element of parameter b; it is cleared otherwise. The third bit of cr0 is set if any
element of parameter a is less than the corresponding element of parameter b; it is cleared
otherwise. The last bit of cr0 is set if both elements of parameter a are less than the corresponding
elements of parameter b; it is cleared otherwise.
Other registers altered: Condition Register (CR0) (if Rc=1)
Figure 3-630. Vector Set if Less than Word Unsigned (__ev_setltwu[_rc])
Rc d a b Maps to
Rc = 0 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltwu d,a,b
Rc = 1 __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsetltwu. d,a,b
b
0 31 32 63
a
<
ui
<
ui
d
OR
AND
cr0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-802 Freescale Semiconductor
SPE2 Operations
__ev_sl __ev_sl
Vector Shift Left
d = __ev_sl (a,b)
sh b
57:63
if b
57
= 0 then
d
0:63
EXTZ(a
0:63-sh
)
else
d
64
0
The value in parameter a is shifted left by sh bit positions specified in parameter b
57:63
, filling
vacated bit positions with zeros, and the result is placed into parameter d. Shift amounts of 64 to
127 give a zero result.
Figure 3-631. Vector Shift Left (__ev_sl)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsl d,a,b
Bits shifted by
value specified in b
57:63
0 63
d
0 63
a
00..0
0 63
b
57
sh //
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-803
__ev_sli __ev_sli
Vector Shift Left Immediate
d = __ev_sli (a,b)
UIMM b
n UIMM
d
0:63
EXTZ(a
0:63-n
)
The value in parameter a is shifted left by n bit positions specified by the 5-bit unsigned literal
(UIMM) provided by parameter b, filling vacated bit positions with zeros, and the result is placed
into parameter d.
Figure 3-632. Vector Shift Left Immediate (__ev_sli)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsli d,a,b
Bits shifted by
value specified in UIMM
0 63
d
0 63
a
00..0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-804 Freescale Semiconductor
SPE2 Operations
__ev_slb __ev_slb
Vector Shift Left Byte
d = __ev_slb (a,b)
nb0 b
4:7
nb1 b
12:15
nb2 b
20:23
nb3 b
28:31
nb4 b
36:39
nb5 b
44:47
nb6 b
52:55
nb7 b
60:63
d
0:7
SL(a
0:7
, nb0)
d
8:15
SL(a
8:15
, nb1)
d
16:23
SL(a
16:23
, nb2)
d
24:31
SL(a
24:31
, nb3)
d
32:39
SL(a
32:39
, nb4)
d
40:47
SL(a
40:47
, nb5)
d
48:55
SL(a
48:55
, nb6)
d
56:63
SL(a
56:63
, nb7)
Each of the byte elements of parameter a are shifted left by an amount specified in the
corresponding byte elements of parameter b. The result is placed into parameter d. The separate
shift amounts for each element are specified by the lower 5 bits in each byte element of parameter
b that lie in bit positions 4-7, 12-15, 20-23, 28-31, 36-39, 44-47, 52-55, and 60-63.
Shift amounts from 8 to 15 give a zero result.
Figure 3-633. Vector Shift Left Byte (__ev_slb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evslb d,a,b
Bytes shifted by
value specified in nb[x]
7 8 3 15 16 11 23 24 19 31 32 27 39 40 35 47 48 43 55 56 51 0 63 59
b
nb7 nb6 nb5 nb4 nb3 nb2 nb1 nb0
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
a
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-805
__ev_slbi __ev_slbi
Vector Shift Left Byte Immediate
d = __ev_slbi (a,b)
UIMM b
n UIMM
d
0:7
SL(a
0:7
, n)
d
8:15
SL(a
8:15
, n)
d
16:23
SL(a
16:23
, n)
d
24:31
SL(a
24:31
, n)
d
32:39
SL(a
32:39
, n)
d
40:47
SL(a
40:47
, n)
d
48:55
SL(a
48:55
, n)
d
56:63
SL(a
56:63
, n)
Each of the byte elements of parameter a are shifted left by the unsigned literal value (UIMM)
provided by parameter b (range of 0-7) and the results are placed in parameter d.
Shift amounts greater than 7 are illegal.
Figure 3-634. Vector Shift Left Byte Immediate (__ev_slbi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evslbi d,a,b
Bytes shifted by
value specified in UIMM
7 8 1516 23 24 3132 3940 4748 5556 0 63
a
7 8 1516 23 24 3132 3940 4748 5556 0 63
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-806 Freescale Semiconductor
SPE2 Operations
__ev_slh __ev_slh
Vector Shift Left Half Word
d = __ev_slh (a,b)
nh0 b
11:15
nh1 b
27:31
nh2 b
43:47
nh3 b
59:63
d
0:15
SL(a
0:15
, nh0)
d
16:31
SL(a
16:31
, nh1)
d
32:47
SL(a
32:47
, nh2)
d
48:63
SL(a
48:63
, nh3)
Each of the half word elements of parameter a are shifted left by an amount specified in the
corresponding half word elements of parameter b. The result is placed into parameter d. The
separate shift amounts for each element are specified by 5 bits in parameter b that lie in bit
positions 11-15, 27-31, 43-47 and 59-63.
Shift amounts from 16 to 31 give a zero result.
Figure 3-635. Vector Shift Left Half Word (__ev_slh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evslh d,a,b
d
0 31 32 63 27 59
Half word shifted by
value specified in nh1
Half word shifted by
value specified in nh3
0 31 32 63
a
15 16 47 48
b
15 16 11 47 48 43
nh3 nh2 nh1 nh0
Half word shifted by
value specified in nh2
Half word shifted by
value specified in nh0
0 31 32 63 15 16 47 48
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-807
__ev_slhi __ev_slhi
Vector Shift Left Half Word Immediate
d = __ev_slhi (a,b)
UIMM b
n UIMM
d
0:15
SL(a
0:15
, n)
d
16:31
SL(a
16:31
, n)
d
32:47
SL(a
32:47
, n)
d
48:63
SL(a
48:63
, n)
Each of the half word elements of parameter a are shifted left by the unsigned literal value (UIMM)
provided by parameter b (range of 0-15) and the results are placed in parameter d.
Shift amounts greater than 15 are illegal.
Figure 3-636. Vector Shift Left Half Word (__ev_slhi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evslhi d,a,b
d
0 31 32 63
a
15 16 47 48
Half words are shifted left by
value specified in UIMM
0 31 32 63 15 16 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-808 Freescale Semiconductor
SPE2 Operations
__ev_sloi __ev_sloi
Vector Shift Left by Octet Immediate
d = __ev_sloi (a,b)
n b * 8
d
0:63
SL(a
0:63
, n)
The value in parameter a is shifted left by the number of byte positions contained in parameter b,
filling vacated byte positions with zeros, and the result is placed into parameter d.
Figure 3-637. Vector Shift Left by Octet Immediate (__ev_sloi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 3-bit unsigned literal evsloi d,a,b
7 8 1516 2324 31 32 39 40 47 48 55 56 0 63
a
A B C D E F G H
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
d, b=3
D E F G H 0 0 0
7 8 1516 2324 31 32 39 40 47 48 55 56 0 63
d, b=6
G H 0 0 0 0 0 0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-809
__ev_slw __ev_slw
Vector Shift Left Word
d = __ev_slw (a,b)
nh b
26:31
nl b
58:63
d
0:31
SL(a
0:31
, nh)
d
32:63
SL(a
32:63
, nl)
Each of the high and low elements of parameter a are shifted left by an amount specified in
parameter b. The result is placed into parameter d. The separate shift amounts for each element
are specified by 6 bits in parameter b that lie in bit positions 2631 and 5863.
Shift amounts from 32 to 63 give a zero result.
Figure 3-638. Vector Shift Left Word (__ev_slw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evslw d,a,b
0 31 32 63
a
b
d
nh nl
0 31 32 63 26 58
High word shifted by
value specified in nh
Low word shifted by
value specified in nl
25 57
SPE2 Programming Interface Manual, Rev. 1.0-2
3-810 Freescale Semiconductor
SPE2 Operations
__ev_slwi __ev_slwi
Vector Shift Left Word Immediate
d = __ev_slwi (a,b)
UIMM b
n UIMM
d
0:31
SL(a
0:31
, n)
d
32:63
SL(a
32:63
, n)
Both high and low elements of parameter a are shifted left by the 5-bit unsigned literal (UIMM)
provided by parameter b, and the results are placed in parameter d.
Figure 3-639. Vector Shift Left Word Immediate (__ev_slwi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evslwi d,a,b
0 31 32 63
a
d
High and low words shifted by UIMM value
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-811
__ev_splatb __ev_splatb
Vector Splat Byte
d = __ev_splatb (a,b)
temp
0:7
a
0+(b*8):7+(b*8)
d
0:7
temp
0:7
d
8:15
temp
0:7
d
16:23
temp
0:7
d
24:31
temp
0:7
d
32:39
temp
0:7
d
40:47
temp
0:7
d
48:55
temp
0:7
d
56:63
temp
0:7
The byte element in parameter a specified by parameter b is placed into all byte elements of
parameter d. Byte 0 is the most-significant byte.
Figure 3-640. Vector Splat Byte (__ev_splatb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 3-bit unsigned literal evsplatb d,a,b
b
0 1 2 3
0 32 63
d
24 16 8 40 48 56
0 32 63
a
24 16 8 40 48 56
4 5 6 7
SPE2 Programming Interface Manual, Rev. 1.0-2
3-812 Freescale Semiconductor
SPE2 Operations
__ev_splatfi[a] __ev_splatfi[a]
Vector Splat Fractional Immediate (to Accumulator)
d = __ev_splatfi (a) (A = 0)
d = __ev_splatfia (a) (A = 1)
SIMM a
d
0:31
SIMM ||
27
0
d
32:63
SIMM ||
27
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is padded with trailing zeros and placed in both elements of parameter
d, as shown in Figure 3-641. The SIMM ends up in bit positions d
0:4
and d
32:36
. If A = 1, the result
in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-641. Vector Splat Fractional Immediate (to Accumulator) (__ev_splatfi[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatfi d,a
__ev64_opaque__ 5-bit signed literal evsplatfia d,a
SIMM
d (and Accumulator if A=1)
0 31 32 63
SABCD
SABCD000...........000000 SABCD000...........000000
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-813
__ev_splatfib[a] __ev_splatfib[a]
Vector Splat Fractional Immediate Byte (to Accumulator)
d = __ev_splatfib (a) (A = 0)
d = __ev_splatfiba (a) (A = 1)
SIMM a
d
0:7
SIMM ||
3
0
d
8:15
SIMM ||
3
0
d
16:23
SIMM ||
3
0)
d
24:31
SIMM ||
3
0
d
32:39
SIMM ||
3
0
d
40:47
SIMM ||
3
0
d
48:55
SIMM ||
3
0
d
56:63
SIMM ||
3
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is padded with trailing zeros and placed into all byte elements of
parameter d. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-642. Vector Splat Fractional Immediate Byte (to Accumulator) (__ev_splatfib[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatfib d,a
__ev64_opaque__ 5-bit signed literal evsplatfiba d,a
SIMM SABCD
0 32 63
d (and Accumulator if A=1)
24 16 8 40 48 56
SABCD000
SPE2 Programming Interface Manual, Rev. 1.0-2
3-814 Freescale Semiconductor
SPE2 Operations
__ev_splatfibo[a] __ev_splatfibo[a]
Vector Splat Fractional Immediate Byte Odd (to Accumulator)
d = __ev_splatfibo (a) (A = 0)
d = __ev_splatfiboa (a) (A = 1)
SIMM a
d
0:7
8
0
d
8:15
SIMM ||
3
0
d
16:23
8
0
d
24:31
SIMM ||
3
0
d
32:39
8
0
d
40:47
SIMM ||
3
0
d
48:55
8
0
d
56:63
SIMM ||
3
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is padded with trailing zeros and placed into all odd byte elements of
parameter d, zeroing the even byte elements. If A = 1, the result in parameter d is also placed into
the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-643. Vector Splat Fractional Immediate Byte Odd (to Accumulator)
(__ev_splatfibo[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatfibo d,a
__ev64_opaque__ 5-bit signed literal evsplatfiboa d,a
SIMM SABCD
0 32 63
d (and Accumulator if A=1)
24 16 8 40 48 56
SABCD000
8
0
8
0
8
0
8
0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-815
__ev_splatfid[a] __ev_splatfid[a]
Vector Splat Fractional Immediate Doubleword (to Accumulator)
d = __ev_splatfid (a) (A = 0)
d = __ev_splatfida (a) (A = 1)
SIMM a
d
0:63
SIMM ||
59
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is padded with trailing zeros and placed into parameter d. The SIMM
ends up in bit positions d
0:4
. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-644. Vector Splat Fractional Immediate Doubleword (to Accumulator)
(__ev_splatfid[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatfid d,a
__ev64_opaque__ 5-bit signed literal evsplatfida d,a
SIMM SABCD
0 63
d (and Accumulator if A=1)
SABCD00000000000....................................0000000000000000
SPE2 Programming Interface Manual, Rev. 1.0-2
3-816 Freescale Semiconductor
SPE2 Operations
__ev_splatfih[a] __ev_splatfih[a]
Vector Splat Fractional Immediate Half Word (to Accumulator)
d = __ev_splatfih (a) (A = 0)
d = __ev_splatfiha (a) (A = 1)
SIMM a
d
0:15
SIMM ||
11
0
d
16:31
SIMM ||
11
0
d
32:47
SIMM ||
11
0
d
48:63
SIMM ||
11
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is padded with trailing zeros and placed into all halfword elements of
parameter d, as shown in Figure 3-645. The SIMM ends up in bit positions d
0:4
, d
16:20
, d
32:36
, and
d
48:52
. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-645. Vector Splat Fractional Immediate Halfword (to Accumulator)
(__ev_splatfih[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatfih d,a
__ev64_opaque__ 5-bit signed literal evsplatfiha d,a
SIMM SABCD
15 16 47 48 0 31 32 63
d (and Accumulator if A=1) SABCD00..00 SABCD00..00 SABCD00..00 SABCD00..00
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-817
__ev_splatfiho[a] __ev_splatfiho[a]
Vector Splat Fractional Immediate Half Word Odd (to Accumulator)
d = __ev_splatfiho (a) (A = 0)
d = __ev_splatfihoa (a) (A = 1)
SIMM a
d
0:15
16
0
d
16:31
SIMM ||
11
0
d
32:47
16
0
d
48:63
SIMM ||
11
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is padded with trailing zeros and placed into the odd halfword elements
of parameter d, zeroing the even halfword elements. The SIMM ends up in bit positions d
16:20
and
d
48:52
. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-646. Vector Splat Fractional Immediate Halfword Odd (to Accumulator)
(__ev_splatfiho[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatfiho d,a
__ev64_opaque__ 5-bit signed literal evsplatfihoa d,a
SIMM SABCD
15 16 47 48 0 31 32 63
SABCD00..00 0000........0000 SABCD00..00 0000........0000 d (and Accumulator if A=1)
SPE2 Programming Interface Manual, Rev. 1.0-2
3-818 Freescale Semiconductor
SPE2 Operations
__ev_splatfio[a] __ev_splatfio[a]
Vector Splat Fractional Immediate Odd (to Accumulator)
d = __ev_splatfio (a) (A = 0)
d = __ev_splatfioa (a) (A = 1)
SIMM a
d
0:31
32
0
d
32:63
SIMM ||
27
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is padded with trailing zeros and placed into the odd word element of
parameter d, zeroing the even word element. The SIMM ends up in bit positions d
32:36
. If A = 1,
the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-647. Vector Splat Fractional Immediate Odd (to Accumulator) (__ev_splatfio[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatfio d,a
__ev64_opaque__ 5-bit signed literal evsplatfioa d,a
SIMM SABCD
0 31 32 63
SABCD000....................00000000 00000000................00000000
d (and Accumulator if A=1)
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-819
__ev_splath __ev_splath
Vector Splat Half Word
d = __ev_splath (a,b)
hh
b
temp
0:15
a
0+(hh*16):15+(hh*16)
d
0:15
temp
0:15
d
16:31
temp
0:15
d
32:47
temp
0:15
d
48:63
temp
0:15
The half word element in parameter a specified by the 2-bit unsigned value in parameter b (hh)
is placed into all half word elements of parameter d, as shown in Figure 3-648. Half word 0 is the
most-significant half word.
Figure 3-648. Vector Splat Half Word (__ev_splath)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 2-bit unsigned literal evsplath d,a,b
15 16 47 48 0 31 32 63
d
15 16 47 48 0 31 32 63
a
hh
0 1 2 3
SPE2 Programming Interface Manual, Rev. 1.0-2
3-820 Freescale Semiconductor
SPE2 Operations
__ev_splati[a] __ev_splati[a]
Vector Splat Immediate (to Accumulator)
d = __ev_splati (a) (A = 0)
d = __ev_splatia (a) (A = 1)
SIMM a
d
0:31
EXTS
32
(SIMM)
d
32:63
EXTS
32
(SIMM)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is sign extended and placed in both elements of parameter d. If A = 1,
the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-649. __ev_splati[a] Sign Extend (to Accumulator)
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplati d,a
__ev64_opaque__ 5-bit signed literal evsplatia d,a
SIMM
d (and Accumulator if A=1)
0 31 32 63
SABCD
SSS......................SABCD SSS......................SABCD
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-821
__ev_splatib[a] __ev_splatib[a]
Vector Splat Immediate Byte (to Accumulator)
d = __ev_splatib (a) (A = 0)
d = __ev_splatiba (a) (A = 1)
SIMM a
d
0:7
EXTS
8
(SIMM)
d
8:15
EXTS
8
(SIMM)
d
16:23
EXTS
8
(SIMM)
d
24:31
EXTS
8
(SIMM)
d
32:39
EXTS
8
(SIMM)
d
40:47
EXTS
8
(SIMM)
d
48:55
EXTS
8
(SIMM)
d
56:63
EXTS
8
(SIMM)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is sign-extended and placed into all byte elements of parameter d. If A
= 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-650. Vector Splat Immediate Byte (to Accumulator) (__ev_splatib[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatib d,a
__ev64_opaque__ 5-bit signed literal evsplatiba d,a
SIMM SABCD
0 32 63
d (and Accumulator if A=1)
24 16 8 40 48 56
exts
SPE2 Programming Interface Manual, Rev. 1.0-2
3-822 Freescale Semiconductor
SPE2 Operations
__ev_splatibe[a] __ev_splatibe[a]
Vector Splat Immediate Byte Even (to Accumulator)
d = __ev_splatibe (a) (A = 0)
d = __ev_splatibea (a) (A = 1)
SIMM a
d
0:7
EXTS
8
(SIMM)
d
8:15
8
0
d
16:23
EXTS
8
(SIMM)
d
24:31
8
0
d
32:39
EXTS
8
(SIMM)
d
40:47
8
0
d
48:55
EXTS
8
(SIMM)
d
56:63
8
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is sign-extended and placed into the even byte elements of parameter
d, zeroing the odd byte elements. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-651. Vector Splat Immediate Byte Even (to Accumulator) (__ev_splatibe[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatibe d,a
__ev64_opaque__ 5-bit signed literal evsplatibea d,a
SIMM SABCD
0 32 63
d (and Accumulator if A=1)
24 16 8 40 48 56
exts
8
0
8
0
8
0
8
0
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-823
__ev_splatid[a] __ev_splatid[a]
Vector Splat Immediate Doubleword (to Accumulator)
d = __ev_splatid (a) (A = 0)
d = __ev_splatida (a) (A = 1)
SIMM a
d
0:63
EXTS
64
(SIMM)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is sign extended and placed into parameter d. If A = 1, the result in
parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-652. Vector Splat Immediate Doubleword (to Accumulator) (__ev_splatid[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatid d,a
__ev64_opaque__ 5-bit signed literal evsplatida d,a
SIMM
d (and Accumulator if A=1)
0 63
SABCD
SSSSSSSSSS...........................................................SSSSSSABCD
SPE2 Programming Interface Manual, Rev. 1.0-2
3-824 Freescale Semiconductor
SPE2 Operations
__ev_splatie[a] __ev_splatie[a]
Vector Splat Immediate Even (to Accumulator)
d = __ev_splatie (a) (A = 0)
d = __ev_splatiea (a) (A = 1)
SIMM a
d
0:31
EXTS
32
(SIMM)
d
32:63
32
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is sign extended and placed in the even word element of parameter d,
zeroing the odd word element. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-653. Vector Splat Immediate Even (to Accumulator) (__ev_splatie[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatie d,a
__ev64_opaque__ 5-bit signed literal evsplatiea d,a
SIMM
d (and Accumulator if A=1)
0 31 32 63
SABCD
00000000................00000000 SSS......................SABCD
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-825
__ev_splatih[a] __ev_splatih[a]
Vector Splat Immediate Half Word (to Accumulator)
d = __ev_splatih (a) (A = 0)
d = __ev_splatiha (a) (A = 1)
SIMM a
d
0:15
EXTS
16
(SIMM)
d
16:31
EXTS
16
(SIMM)
d
32:47
EXTS
16
(SIMM)
d
48:63
EXTS
16
(SIMM)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is sign extended and placed into all halfword elements of parameter d.
If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-654. Vector Splat Immediate Halfword (to Accumulator) (__ev_splatih[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatih d,a
__ev64_opaque__ 5-bit signed literal evsplatiha d,a
SIMM SABCD
15 16 47 48 0 31 32 63
d (and Accumulator if A=1) SS...SSABCD SS...SSABCD SS...SSABCD SS...SSABCD
SPE2 Programming Interface Manual, Rev. 1.0-2
3-826 Freescale Semiconductor
SPE2 Operations
__ev_splatihe[a] __ev_splatihe[a]
Vector Splat Immediate Halfword Even (to Accumulator)
d = __ev_splatihe (a) (A = 0)
d = __ev_splatihea (a) (A = 1)
SIMM a
d
0:15
EXTS
16
(SIMM)
d
16:31
16
0
d
32:47
EXTS
16
(SIMM)
d
48:63
16
0
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The 5-bit immediate value is sign extended and placed into the even halfword elements of
parameter d, zeroing the odd halfword elements. If A = 1, the result in parameter d is also placed
into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-655. Vector Splat Immediate Halfword Even (to Accumulator) (__ev_splatihe[a])
d a Maps to
__ev64_opaque__ 5-bit signed literal evsplatihe d,a
__ev64_opaque__ 5-bit signed literal evsplatihea d,a
SIMM SABCD
15 16 47 48 0 31 32 63
d (and Accumulator if A=1) 0000........0000 SS...SSABCD 0000........0000 SS...SSABCD
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-827
__ev_srbis __ev_srbis
Vector Shift Right Byte Immediate Signed
d = __ev_srbis (a,b)
UIMM b
n UIMM
d
0:7
EXTS(a
0:7-n
)
d
8:15
EXTS(a
8:15-n
)
d
16:23
EXTS(a
16:23-n
)
d
24:31
EXTS(a
24:31-n
)
d
32:39
EXTS(a
32:39-n
)
d
40:47
EXTS(a
40:47-n
)
d
48:55
EXTS(a
48:55-n
)
d
56:63
EXTS(a
56:63-n
)
Each of the byte elements of parameter a are shifted right by the unsigned literal value (UIMM) in
parameter b (range 0-7). Bits in the most significant positions vacated by the shift are filled with
a copy of the sign bit.
Shift amounts greater than 7 are illegal.
Figure 3-656. Vector Shift Right Byte Immediate Signed (__ev_srbis)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsrbis d,a,b
Bytes shifted by
value specified in UIMM
7 8 1516 23 24 3132 3940 4748 5556 0 63
a
7 8 1516 23 24 3132 3940 4748 5556 0 63
d
ssss ssss ssss ssss ssss ssss ssss ssss
SPE2 Programming Interface Manual, Rev. 1.0-2
3-828 Freescale Semiconductor
SPE2 Operations
__ev_srbiu __ev_srbiu
Vector Shift Right Byte Immediate Unsigned
d = __ev_srbiu (a,b)
UIMM b
n UIMM
d
0:7
EXTZ(a
0:7-n
)
d
8:15
EXTZ(a
8:15-n
)
d
16:23
EXTZ(a
16:23-n
)
d
24:31
EXTZ(a
24:31-n
)
d
32:39
EXTZ(a
32:39-n
)
d
40:47
EXTZ(a
40:47-n
)
d
48:55
EXTZ(a
48:55-n
)
d
56:63
EXTZ(a
56:63-n
)
Each of the byte elements of parameter a are shifted right by the unsigned literal value (UIMM) in
parameter b (range 0-7). Bits in the most significant positions vacated by the shift are filled with
zeros.
Shift amounts greater than 7 are illegal.
Figure 3-657. Vector Shift Right Byte Immediate Unsigned (__ev_srbiu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsrbiu d,a,b
Bytes shifted by
value specified in UIMM
7 8 1516 23 24 3132 3940 4748 5556 0 63
a
7 8 1516 23 24 3132 3940 4748 5556 0 63
d
000 000 000 000 000 000 000 000
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-829
__ev_srbs __ev_srbs
Vector Shift Right Byte Signed
d = __ev_srbs (a,b)
nb0 b
4:7
nb1 b
12:15
nb2 b
20:23
nb3 b
28:31
nb4 b
36:39
nb5 b
44:47
nb6 b
52:55
nb7 b
60:63
d
0:7
EXTS(a
0:7-nb0
)
d
8:15
EXTS(a
8:15-nb1
)
d
16:23
EXTS(a
16:23-nb2
)
d
24:31
EXTS(a
24:31-nb3
)
d
32:39
EXTS(a
32:39-nb4
)
d
40:47
EXTS(a
40:47-nb5
)
d
48:55
EXTS(a
48:55-nb6
)
d
56:63
EXTS(a
56:63-nb7
)
Each of the byte elements of parameter a are shifted right by an amount specified in the
corresponding byte elements of parameter b. Bits in the most significant positions vacated by the
shift are filled with a copy of the sign bit and the result is placed into parameter d. The separate
shift amounts for each element are specified by the lower 5 bits in each byte element of parameter
b that lie in bit positions 4-7, 12-15, 20-23, 28-31, 36-39, 44-47, 52-55, and 60-63.
Shift amounts from 8 to 15 give a result of 8 sign bits.
Figure 3-658. Vector Shift Right Byte Signed (__ev_srbs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsrbs d,a,b
Bytes shifted by
value specified in nb[x]
7 8 3 15 16 11 23 24 19 31 32 27 39 40 35 47 48 43 55 56 51 0 63 59
b
nb7 nb6 nb5 nb4 nb3 nb2 nb1 nb0
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
a
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
d
ssss sss ssssss ssss ssssss sss ssssss ssss
SPE2 Programming Interface Manual, Rev. 1.0-2
3-830 Freescale Semiconductor
SPE2 Operations
__ev_srbu __ev_srbu
Vector Shift Right Byte Unsigned
d = __ev_srbu (a,b)
nb0 b
4:7
nb1 b
12:15
nb2 b
20:23
nb3 b
28:31
nb4 b
36:39
nb5 b
44:47
nb6 b
52:55
nb7 b
60:63
d
0:7
EXTZ(a
0:7-nb0
)
d
8:15
EXTZ(a
8:15-nb1
)
d
16:23
EXTZ(a
16:23-nb2
)
d
24:31
EXTZ(a
24:31-nb3
)
d
32:39
EXTZ(a
32:39-nb4
)
d
40:47
EXTZ(a
40:47-nb5
)
d
48:55
EXTZ(a
48:55-nb6
)
d
56:63
EXTZ(a
56:63-nb7
)
Each of the byte elements of parameter a are shifted right by an amount specified in the
corresponding byte elements of parameter b. Bits in the most significant positions vacated by the
shift are filled with zeros and the result is placed into parameter d. The separate shift amounts for
each element are specified by the lower 5 bits in each byte element of parameter b that lie in bit
positions 4-7, 12-15, 20-23, 28-31, 36-39, 44-47, 52-55, and 60-63.
Shift amounts from 8 to 31 give a result of zero.
Figure 3-659. Vector Shift Right Byte Unsigned (__ev_srbu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsrbu d,a,b
Bytes shifted by
value specified in nb[x]
7 8 3 15 16 11 23 24 19 31 32 27 39 40 35 47 48 43 55 56 51 0 63 59
b
nb7 nb6 nb5 nb4 nb3 nb2 nb1 nb0
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
a
7 8 15 16 23 24 31 32 39 40 47 48 55 56 0 63
d
000 00 00000 000 00000 00 00000 000
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-831
__ev_srhis __ev_srhis
Vector Shift Right Half Word Immediate Signed
d = __ev_srhis (a,b)
UIMM b
n UIMM
d
0:15
EXTS(a
0:15-n
)
d
16:31
EXTS(a
16:31-n
)
d
32:47
EXTS(a
32:47-n
)
d
48:63
EXTS(a
48:63-n
)
Each of the half word elements of parameter a are shifted right by the unsigned literal value
(UIMM) in parameter b (range 0-15). Bits in the most significant positions vacated by the shift are
filled with a copy of the sign bit.
Shift amounts greater than 15 are illegal.
Figure 3-660. Vector Shift Right Half Word Immediate Signed (__ev_srhis)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsrhis d,a,b
d
0 31 32 63
a
15 16 47 48
Half word shifted by
value specified in UIMM
0 31 32 63 15 16 47 48
SSSSSS SSSSSS SSSSSS SSSSSS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-832 Freescale Semiconductor
SPE2 Operations
__ev_srhiu __ev_srhiu
Vector Shift Right Half Word Immediate Unsigned
d = __ev_srhiu (a,b)
UIMM b
n UIMM
d
0:15
EXTZ(a
0:15-n
)
d
16:31
EXTZ(a
16:31-n
)
d
32:47
EXTZ(a
32:47-n
)
d
48:63
EXTZ(a
48:63-n
)
Each of the half word elements of parameter a are shifted right by the unsigned literal value
(UIMM) in parameter b (range 0-15). Bits in the most significant positions vacated by the shift are
filled with zeros. The results are placed into parameter d.
Shift amounts greater than 15 are illegal.
Figure 3-661. Vector Shift Right Half Word Immediate Unsigned (__ev_srhiu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsrhiu d,a,b
d
0 31 32 63
a
15 16 47 48
Half word shifted by
value specified in UIMM
0 31 32 63 15 16 47 48
00000 000000 000000 000000
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-833
__ev_srhs __ev_srhs
Vector Shift Right Half Word Signed
d = __ev_srhs (a,b)
nh0 b
11:15
nh1 b
27:31
nh2 b
43:47
nh3 b
59:63
d
0:15
EXTS(a
0:15-nh0
)
d
16:31
EXTS(a
16:31-nh1
)
d
32:47
EXTS(a
32:47-nh2
)
d
48:63
EXTS(a
48:63-nh3
)
Each of the half word elements of parameter a are shifted right by an amount specified in the
corresponding half word elements of parameter b. The result is placed into parameter d. The
separate shift amounts for each element are specified by 5 bits in parameter b that lie in bit
positions 11-15, 27-31, 43-47 and 59-63. Bits in the most significant positions vacated by the shift
are filled with a copy of the sign bit
Shift amounts from 16 to 31 give a result of 16 sign bits.
Figure 3-662. Vector Shift Right Half Word Signed (__ev_srhs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsrhs d,a,b
d
0 31 32 63 27 59
Half word shifted by
value specified in nh1
Half word shifted by
value specified in nh3
0 31 32 63
a
15 16 47 48
b
15 16 11 47 48 43
nh3 nh2 nh1 nh0
Half word shifted by
value specified in nh2
Half word shifted by
value specified in nh0
0 31 32 63 15 16 47 48
SSSSSS SSSS SSSSSSSSSS SSSSSSSSS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-834 Freescale Semiconductor
SPE2 Operations
__ev_srhu __ev_srhu
Vector Shift Right Half Word Unsigned
d = __ev_srhu (a,b)
nh0 b
11:15
nh1 b
27:31
nh2 b
43:47
nh3 b
59:63
d
0:15
EXTZ(a
0:15-nh0
)
d
16:31
EXTZ(a
16:31-nh1
)
d
32:47
EXTZ(a
32:47-nh2
)
d
48:63
EXTZ(a
48:63-nh3
)
Each of the half word elements of parameter a are shifted right by an amount specified in the
corresponding half word elements of parameter b. The result is placed into parameter d. The
separate shift amounts for each element are specified by 5 bits in parameter b that lie in bit
positions 11-15, 27-31, 43-47 and 59-63. Bits in the most significant positions vacated by the shift
are filled with zeros.
Shift amounts from 16 to 31 give a zero result.
Figure 3-663. Vector Shift Right Half Word Unsigned (__ev_srhu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsrhu d,a,b
d
0 31 32 63 27 59
Half word shifted by
value specified in nh1
Half word shifted by
value specified in nh3
0 31 32 63
a
15 16 47 48
b
15 16 11 47 48 43
nh3 nh2 nh1 nh0
Half word shifted by
value specified in nh2
Half word shifted by
value specified in nh0
0 31 32 63 15 16 47 48
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-835
__ev_sris __ev_sris
Vector Shift Right Immediate Signed
d = __ev_sris (a,b)
n b
d
0:63
EXTS(a
0:63-n
)
The value in parameter a is shifted right by the number of bit positions specified by parameter b,
filling vacated bit positions with the sign of parameter a, and the result is placed into parameter d.
Note: arbitrary shifts may be performed by using __ev_sris in conjunction with __ev_srois.
Figure 3-664. Vector Shift Right Immediate Signed (__ev_sris)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsris d,a,b
Bits shifted by
value specified in parameter b (values 0-31)
0 63
a
0 63
d
ssss
SPE2 Programming Interface Manual, Rev. 1.0-2
3-836 Freescale Semiconductor
SPE2 Operations
__ev_sriu __ev_sriu
Vector Shift Right Immediate
d = __ev_sriu (a,b)
n b
d
0:63
EXTZ(a
0:63-n
)
The value in parameter a is shifted right by the number of bit positions specified by parameter b,
filling vacated bit positions with zeros, and the result is placed into parameter d.
Note: arbitrary shifts may be performed by using __ev_sriu in conjunction with __ev_sroiu.
Figure 3-665. Vector Shift Right Immediate (evsri)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsriu d,a,b
Bits shifted by
value specified in parameter b (values 0-31)
0 63
a
0 63
d
000
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-837
__ev_srois __ev_srois
Vector Shift Right by Octet Immediate Signed
d = __ev_srois (a,b)
n b * 8
d
0:63
EXTS(a
n:63
)
The value in parameter a is shifted right by the number of byte positions contained in parameter
b, filling vacated byte positions with the sign of parameter a, and the result is placed into parameter
d.
Note: arbitrary shifts may be performed by using __ev_sris in conjunction with __ev_srois.
Figure 3-666. Vector Shift Right by Octet Immediate Signed (__ev_srois)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 3-bit unsigned literal evsrois d,a,b
7 8 1516 2324 31 32 39 40 47 48 55 56 0 63
a
A B C D E F G H
7 8 15 16 2324 31 32 39 40 47 48 55 56 0 63
d (b=3)
0 0 0 A B C D E
7 8 1516 2324 3132 3940 4748 5556 0 63
d (b=6)
0 0 0 0 0 0 A B
SPE2 Programming Interface Manual, Rev. 1.0-2
3-838 Freescale Semiconductor
SPE2 Operations
__ev_sroiu __ev_sroiu
Vector Shift Right by Octet Immediate Unsigned
d = __ev_sroiu (a,b)
n b * 8
d
0:63
EXTZ(a
n:63
)
The value in parameter a is shifted right by the number of byte positions contained in parameter
b, filling vacated byte positions with zeros, and the result is placed into parameter d.
Note: arbitrary shifts may be performed by using __ev_sriu in conjunction with __ev_sroiu.
Figure 3-667. Vector Shift Right by Octet Immediate (__ev_sroiu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 3-bit unsigned literal evsroiu d,a,b
7 8 1516 2324 31 32 39 40 47 48 55 56 0 63
a
A B C D E F G H
7 8 15 16 2324 31 32 39 40 47 48 55 56 0 63
d (b=3)
0 0 0 A B C D E
7 8 1516 2324 3132 3940 4748 5556 0 63
d (b=6)
0 0 0 0 0 0 A B
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-839
__ev_srs __ev_srs
Vector Shift Right Signed
d = __ev_srs (a,b)
sh b
57:63
s a
0
if b
57
= 0 then
d
0:63
EXTS(a
0:63-sh
)
else
d
64
s
The value in parameter a is shifted right arithmetically by sh bit positions specified in parameter
b
57:63
, filling vacated bit positions with the sign of parameter a, and the result is placed into
parameter d. Shift amounts of 64 to 127 give a result of 0 if parameter a is positive, or -1 if
parameter a is negative.
Figure 3-668. Vector Shift Right Signed (__ev_srs)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsrs d,a,b
Bits shifted by
value specified in b
57:63
0 63
a
0 63
d
ss..ss
0 63
b
57
sh //
s
SPE2 Programming Interface Manual, Rev. 1.0-2
3-840 Freescale Semiconductor
SPE2 Operations
__ev_sru __ev_sru
Vector Shift Right Unsigned
d = __ev_sru (a,b)
sh b
57:63
if b
57
= 0 then
d
0:63
EXTZ(a
0:63-sh
)
else
d
64
0
The value in parameter a is shifted right by sh bit positions specified in parameter b
57:63
, filling
vacated bit positions with zeros, and the result is placed into parameter d. Shift amounts of 64 to
127 give a zero result.
Figure 3-669. Vector Shift Right Unsigned (__ev_sru)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsru d,a,b
Bits shifted by
value specified in b
57:63
0 63
a
0 63
d
00..0
0 63
b
57
sh
//
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-841
__ev_srwis __ev_srwis
Vector Shift Right Word Immediate Signed
d = __ev_srwis(a,b)
n UIMM
d
0:31
EXTS(a
0:31n
)
d
32:63
EXTS(b
32:63n
)
Both high and low elements of parameter a are shifted right by the 5-bit unsigned immediate
(UIMM) value in parameter b. Bits in the most significant positions vacated by the shift are filled
with a copy of the sign bit. The results are placed into parameter d.
Figure 3-670. Vector Shift Right Word Immediate Signed (__ev_srwis)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsrwis d,a,b
0 31 32 63
a
d
High and low words shifted by UIMM value
SPE2 Programming Interface Manual, Rev. 1.0-2
3-842 Freescale Semiconductor
SPE2 Operations
__ev_srwiu __ev_srwiu
Vector Shift Right Word Immediate Unsigned
d = __ev_srwiu(a,b)
n UIMM
d
0:31
EXTZ(a
0:31n
)
d
32:63
EXTZ(a
32:63n
)
Both high and low elements of parameter a are shifted right by the 5-bit unsigned immediate
(UIMM) value in parameter b; 0 bits are shifted in to the most significant position. Bits in the most
significant positions vacated by the shift are filled with a zero bit. The results are placed into
parameter d.
Figure 3-671. Vector Shift Right Word Immediate Unsigned (__ev_srwiu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ 5-bit unsigned literal evsrwiu d,a,b
0 31 32 63
a
d
High and low words shifted by UIMM value
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-843
__ev_srws __ev_srws
Vector Shift Right Word Signed
d = __ev_srws (a,b)
nh b
26:31
nl b
58:63
d
0:31
EXTS(a
0:31nh
)
d
32:63
EXTS(a
32:63nl
)
Both the high and low elements of parameter a are shifted right by an amount specified in
parameter b. The result is placed into parameter d. The separate shift amounts for each element
are specified by 6 bits in parameter b that lie in bit positions 2631 and 5863. The sign bits are
shifted in to the most significant position.
Shift amounts from 32 to 63 give a result of 32 sign bits.
Figure 3-672. Vector Shift Right Word Signed (__ev_srws)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsrws d,a,b
0 31 32 63
a
b
d
nh nl
0 31 32 63 26 58
Low word shifted by
value specified in nl
High word shifted by
value specified in nh
25 57
SPE2 Programming Interface Manual, Rev. 1.0-2
3-844 Freescale Semiconductor
SPE2 Operations
__ev_srwu __ev_srwu
Vector Shift Right Word Unsigned
d = __ev_srwu (a,b)
nh b
26:31
nl b
58:63
d
0:31
EXTZ(a
0:31nh
)
d
32:63
EXTZ(a
32:63nl
)
Both the high and low elements of parameter a are shifted right by an amount specified in
parameter b. The result is placed into parameter d. The separate shift amounts for each element
are specified by 6 bits in parameter b that lie in bit positions 2631 and 5863. Zero bits are shifted
in to the most significant position.
Shift amounts from 32 to 63 give a zero result.
Figure 3-673. Vector Shift Right Word Unsigned (__ev_srwu)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsrwu d,a,b
0 31 32 63
a
b
d
nh nl
0 31 32 63 26 58
High word shifted by
value specified in nl
High word shifted by
value specified in nh
57 25
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-845
__ev_stdb[u] __ev_stdb[u]
Vector Store Double of Eight Bytes [with Update]
__ev_stdb (a,b,c) (U = 0)
__ev_stdbu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*8)
MEM(EA,1) a
0:7
MEM(EA+1,1) a
8:15
MEM(EA+2,1) a
16:23
MEM(EA+3,1) a
24:31
MEM(EA+4,1) a
32:39
MEM(EA+5,1) a
40:47
MEM(EA+6,1) a
48:55
MEM(EA+7,1) a
56:63
if (U=1) then b EA
The contents of parameter a are stored as eight bytes in storage addressed by the effective address
(EA).
If U=1 (with update), EA is placed into parameter b.
Figure 3-674 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-674. __ev_stdb[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter b, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter b is both
an input and output operand. Also, for the with update form (U=1),
parameter c cannot be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evstdb a,c(b)
U = 1 void __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evstdbu a,c(b)
c d e f h a b g
0 1 2 3 4 5 6 7
c d e f h a b g
parameter a
Memory in big or little
Byte address
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-846 Freescale Semiconductor
SPE2 Operations
__ev_stdb[m]x __ev_stdb[m]x
Vector Store Double of Eight Bytes [with Modify] Indexed
__ev_stdbx (a,b,c) (M = 0)
__ev_stdbmx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,1) a
0:7
MEM(EA+1,1)
*
a
8:15
MEM(EA+2,1)
*
a
16:23
MEM(EA+3,1)
*
a
24:31
MEM(EA+4,1)
*
a
32:39
MEM(EA+5,1)
*
a
40:47
MEM(EA+6,1)
*
a
48:55
MEM(EA+7,1)
*
a
56:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The contents of parameter a are stored as eight bytes in storage addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-675 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-675. __ev_stdb[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter c, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter c
is both an input and output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ __ev64_opaque__ * int32_t evstdbx a,b,c
M = 1 void __ev64_opaque__ __ev64_opaque__ *& int32_t evstdbmx a,b,c
c d e f h a b g
0 1 2 3 4 5 6 7
c d e f h a b g
GPR
Memory in big or little
Byte address
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-847
__ev_stdd[u] __ev_stdd[u]
Vector Store Double of Double [with Update]
__ev_stdd (a,b,c) (U = 0)
__ev_stddu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*8)
MEM(EA,8) a
0:63
if (U=1) then b EA
The contents of parameter a are stored as a double word in storage addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-676 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-676. __ev_stdd[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter b, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter b is both
an input and output operand. Also, for the with update form (U=1),
parameter c cannot be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evstdd a,c(b)
U = 1 void __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evstddu a,c(b)
c d e f h a b g
0 1 2 3 4 5 6 7
c d e f h a b g
f e d c a h g b
parameter a
Memory in big endian
Memory in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-848 Freescale Semiconductor
SPE2 Operations
__ev_stdd[m]x __ev_stdd[m]x
Vector Store Double of Double [with Modify] Indexed
__ev_stddx (a,b,c) (M = 0)
__ev_stddmx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp(b)
EA temp + (c)
MEM(EA,8)
*
a
0:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The contents of parameter a are stored as a doubleword in storage addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-677 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-677. __ev_stdd[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter c, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter c
is both an input and output operand.
U returns a b c Maps to
M = 0 void __ev64_opaque__ __ev64_opaque__ * int32_t evstddx a,b,c
M = 1 void __ev64_opaque__ __ev64_opaque__ *& int32_t evstddmx a,b,c
c d e f h a b g
c d e f h a b g
f e d c a h g b
GPR
Memory in big endian
Memory in little endian
0 1 2 3 4 5 6 7 Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-849
__ev_stdh[u] __ev_stdh[u]
Vector Store Double of Four Half Words [with Update]
__ev_stdh (a,b,c) (U = 0)
__ev_stdhu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp b
EA temp + EXTZ(c*8)
MEM(EA,2) a
0:15
MEM(EA+2,2) a
16:31
MEM(EA+4,2) a
32:47
MEM(EA+6,2) a
48:63
if (U=1) then b EA
The contents of parameter a are stored as four half words in storage addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-678 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-678. __ev_stdh[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter b, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter b is both
an input and output operand. Also, for the with update form (U=1),
parameter c cannot be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evstdh a,c(b)
U = 1 void __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evstdhu a,c(b)
c d e f h a b g
c d e f h a b g
d c f e g b a h
parameter a
Memory in big endian
Memory in little endian
0 1 2 3 4 5 6 7 Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-850 Freescale Semiconductor
SPE2 Operations
__ev_stdh[m]x __ev_stdh[m]x
Vector Store Double of Four Half Words [with Modify] Indexed
__ev_stdhx (a,b,c) (M = 0)
__ev_stdhmx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,2)
*
a
0:15
MEM(EA+2,2)
*
a
16:31
MEM(EA+4,2)
*
a
32:47
MEM(EA+6,2)
*
a
48:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The contents of parameter a are stored as four halfwords in storage addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-679 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-679. __ev_stdh[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter c, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter c
is both an input and output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ __ev64_opaque__ * int32_t evstdhx a,b,c
M = 1 void __ev64_opaque__ __ev64_opaque__ *& int32_t evstdhmx a,b,c
c d e f h a b g
c d e f h a b g
d c f e g b a h
GPR
Memory in big endian
Memory in little endian
0 1 2 3 4 5 6 7 Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-851
__ev_stdw[u] __ev_stdw[u]
Vector Store Double of Two Words [with Update]
__ev_stdw (a,b,c) (U = 0)
__ev_stdwu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*8)
MEM(EA,4) a
0:31
MEM(EA+4,4) a
32:63
if (U=1) then b EA
The contents of parameter a are stored as two words in storage addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-680 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-680. __ev_stdw[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter b, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter b is both
an input and output operand. Also, for the with update form (U=1),
parameter c cannot be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ __ev64_opaque__ * 5-bit unsigned literal evstdw a,c(b)
U = 1 void __ev64_opaque__ __ev64_opaque__ *& 5-bit unsigned literal evstdwu a,c(b)
c d e f h a b g
0 1 2 3 4 5 6 7
c d e f h a b g
b a h g e d c f
parameter a
Memory in big endian
Memory in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-852 Freescale Semiconductor
SPE2 Operations
__ev_stdw[m]x __ev_stdw[m]x
Vector Store Double of Two Words [with Modify] Indexed
__ev_stdwx (a,b,c) (M = 0)
__ev_stdwmx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,4)
*
a
0:31
MEM(EA+4,4)
*
a
32:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The contents of parameter a are stored as two words in storage addressed by EA.
If M=1 (with modify), parameter b
32:63
is updated with an address value determined by the mode
specifier in parameter b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-681 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-681. __ev_stdw[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
double-word aligned. Also, if machine register r0 is used to pass
parameter c, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter c
is both an input and output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ __ev64_opaque__ * int32_t evstdwx a,b,c
M = 1 void __ev64_opaque__ __ev64_opaque__ *& int32_t evstdwmx a,b,c
c d e f h a b g
0 1 2 3 4 5 6 7
c d e f h a b g
b a h g e d c f
GPR
Memory in big endian
Memory in little endian
Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-853
__ev_sthb[u] __ev_sthb[u]
Vector Store Half Word of Two Bytes [with Update]
__ev_sthb (a,b,c) (U = 0)
__ev_sthbu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*2)
MEM(EA,1) a
48:55
MEM(EA+1,1) a
56:63
if U=1 then b EA
The two lower byte elements of parameter a are stored as two bytes in storage addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-682 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-682. __ev_sthb[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half word aligned. Also, if machine register r0 is used to pass
parameter b, then a zero value is used as the base value when
calculating EA. For the with update form (U=1), parameter b is both
an input and output operand. Also, for the with update form (U=1),
parameter c cannot be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint16_t * 5-bit unsigned literal evsthb a,c(b)
U = 1 void __ev64_opaque__ uint16_t *& 5-bit unsigned literal evsthbu a,c(b)
h g
g h
parameter a
Memory in big or little
0 1 Byte address
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-854 Freescale Semiconductor
SPE2 Operations
__ev_sthb[m]x __ev_sthb[m]x
Vector Store Half Word of Two Bytes [with Modify] Indexed
__ev_sthbx (a,b,c) (M = 0)
__ev_sthbmx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp(b)
EA temp + (c)
MEM(EA,1) a
48:55
MEM(EA+1,1)
*
a
56:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The two lower byte elements of parameter a are stored as two bytes in storage addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-683 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-683. __ev_sthb[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
half word aligned. Also, if machine register r0 is used to pass
parameter c, then a zero value is used as the base value when
calculating EA. And, for the with modify form (M=1), parameter c
is both an input and output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ uint16_t * int32_t evsthbx a,b,c
M = 1 void __ev64_opaque__ uint16_t *& int32_t evsthbmx a,b,c
h g
g h
GPR
Memory in big or little
0 1 Byte address
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-855
__ev_stwb[u] __ev_stwb[u]
Vector Store Word of Four Bytes [with Update]
d = __ev_stwb (a,b,c) (U = 0)
d = __ev_stwbu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*4)
MEM(EA,1) a
32:39
MEM(EA+1,1) a
40:47
MEM(EA+2,1) a
48:55
MEM(EA+3,1) a
56:63
if U=1 then b EA
The four lower byte elements of parameter a are stored as four bytes in storage addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-684 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-684. __ev_stwb[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter b,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter b is both an input and output
operand. Also, for the with update form (U=1), parameter c cannot
be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint32_t * 5-bit unsigned literal evstwb a,c(b)
U = 1 void __ev64_opaque__ uint32_t *& 5-bit unsigned literal evstwbu a,c(b)
e f h g
g h e f
parameter a
Memory in big or little
0 1 2 3 Byte address
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-856 Freescale Semiconductor
SPE2 Operations
__ev_stwb[m]x __ev_stwb[m]x
Vector Store Word of Four Bytes [with Modify] Indexed
__ev_stwbx (a,b,c) (M = 0)
__ev_stwbmx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,1) a
32:39
MEM(EA+1,1)
*
a
40:47
MEM(EA+2,1)
*
a
48:55
MEM(EA+3,1)
*
a
56:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The four lower byte elements of parameter a are stored as four bytes in storage addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-685 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-685. __ev_stwb[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter c,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter c is both an input and
output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ uint32_t * int32_t evstwbx a,b,c
M = 1 void __ev64_opaque__ uint32_t *& int32_t evstwbmx a,b,c
e f h g
g h e f
GPR
Memory in big or little
0 1 2 3 Byte address
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-857
__ev_stwbe[u] __ev_stwbe[u]
Vector Store Word of Four Bytes from Even [with Update]
__ev_stwbe (a,b,c) (U = 0)
__ev_stwbeu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*4)
MEM(EA,1) a
0:7
MEM(EA+1,1) a
16:23
MEM(EA+2,1) a
32:39
MEM(EA+3,1) a
48:55
if (U=1) then b EA
The even byte elements of parameter a are stored as four bytes in storage addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-686 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-686. __ev_stwbe[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter b,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter b is both an input and output
operand. Also, for the with update form (U=1), parameter c cannot
be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint32_t * 5-bit unsigned literal evstwbe a,c(b)
U = 1 void __ev64_opaque__ uint32_t *& 5-bit unsigned literal evstwbeu a,c(b)
c d e f h a b g
e g a c
parameter a
Memory in big or little
0 1 2 3 Byte address
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-858 Freescale Semiconductor
SPE2 Operations
__ev_stwbe[m]x __ev_stwbe[m]x
Vector Store Word of Four Bytes from Even [with Modify] Indexed
__ev_stwbex (a,b,c) (M = 0)
__ev_stwbemx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,1) a
0:7
MEM(EA+1,1)
*
a
16:23
MEM(EA+2,1)
*
a
32:39
MEM(EA+3,1)
*
a
48:55
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The even byte elements of parameter a are stored as four bytes in storage addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-687 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-687. __ev_stwbe[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter c,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter c is both an input and
output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ uint32_t * int32_t evstwbex a,b,c
M = 1 void __ev64_opaque__ uint32_t *& int32_t evstwbemx a,b,c
c d e f h a b g
e g a c
GPR
Memory in big or little
0 1 2 3 Byte address
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-859
__ev_stwbo[u] __ev_stwbo[u]
Vector Store Word of Four Bytes from Odd [with Update]
__ev_stwbo (a,b,c) (U = 0)
__ev_stwbou (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*4)
MEM(EA,1) a
8:15
MEM(EA+1,1) a
24:31
MEM(EA+2,1) a
40:47
MEM(EA+3,1) a
56:63
if (U=1) then b EA
The odd byte elements of parameter a are stored as four bytes in storage addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-688 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-688. __ev_stwbo[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter b,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter b is both an input and output
operand. Also, for the with update form (U=1), parameter c cannot
be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint32_t * 5-bit unsigned literal evstwbo a,c(b)
U = 1 void __ev64_opaque__ uint32_t *& 5-bit unsigned literal evstwbou a,c(b)
c d e f h a b g
f h b d
parameter a
Memory in big or little
0 1 2 3 Byte address
endian
SPE2 Programming Interface Manual, Rev. 1.0-2
3-860 Freescale Semiconductor
SPE2 Operations
__ev_stwbo[m]x __ev_stwbo[m]x
Vector Store Word of Four Bytes from Odd [with Modify] Indexed
__ev_stwbox (a,b,c) (M = 0)
__ev_stwbomx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,1) a
8:15
MEM(EA+1,1)
*
a
24:31
MEM(EA+2,1)
*
a
40:47
MEM(EA+3,1)
*
a
56:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The odd byte elements of parameter a are stored as four bytes in storage addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-689 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-689. __ev_stwbo[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter c,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter c is both an input and
output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ uint32_t * int32_t evstwbox a,b,c
M = 1 void __ev64_opaque__ uint32_t *& int32_t evstwbomx a,b,c
c d e f h a b g
f h b d
GPR
Memory in big or little
0 1 2 3 Byte address
endian
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-861
__ev_stwhe[u] __ev_stwhe[u]
Vector Store Word of Two Half Words from Even [with Update]
__ev_stwhe (a,b,c) (U = 0)
__ev_stwheu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*4)
MEM(EA,2) a
0:15
MEM(EA+2,2) a
32:47
if (U=1) then b EA
The even half words from each element of parameter a are stored as two half words in storage
addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-690 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-690. __ev_stwhe[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter b,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter b is both an input and output
operand. Also, for the with update form (U=1), parameter c cannot
be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint32_t * 5-bit unsigned literal evstwhe a,c(b)
U = 1 void __ev64_opaque__ uint32_t *& 5-bit unsigned literal evstwheu a,c(b)
c d e f h a b g
e f a b
f e b a
parameter a
Memory in big endian
Memory in little endian
0 1 2 3 Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-862 Freescale Semiconductor
SPE2 Operations
__ev_stwhe[m]x __ev_stwhe[m]x
Vector Store Word of Two Half Words from Even [with Modify] Indexed
__ev_stwhex (a,b,c) (M = 0)
__ev_stwhemx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,2)
*
a
0:15
MEM(EA+2,2)
*
a
32:47
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The even halfwords from each element of parameter a are stored as two halfwords in storage
addressed by EA.
If M=1 (with modify), parameter b
32:63
is updated with an address value determined by the mode
specifier in parameter b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-691 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-691. __ev_stwhe[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter c,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter c is both an input and
output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ uint32_t * int32_t evstwhex a,b,c
M = 1 void __ev64_opaque__ uint32_t *& int32_t evstwhemx a,b,c
c d e f h a b g
e f a b
f e b a
GPR
Memory in big endian
Memory in little endian
0 1 2 3 Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-863
__ev_stwho[u] __ev_stwho[u]
Vector Store Word of Two Half Words from Odd [with Update]
__ev_stwho (a,b,c) (U = 0)
__ev_stwhou (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*4)
MEM(EA,2) a
16:31
MEM(EA+2,2) a
48:63
if (U=1) then b EA
The odd half words from each element of parameter a are stored as two half words in storage
addressed by EA.
If U=1 (with update), EA is placed into parameter b.
Figure 3-692. __ev_stwho[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter b,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter b is both an input and output
operand. Also, for the with update form (U=1), parameter c cannot
be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint32_t * 5-bit unsigned literal evstwho a,c(b)
U = 1 void __ev64_opaque__ uint32_t *& 5-bit unsigned literal evstwhou a,c(b)
c d e f h a b g
0 1 2 3
g h c d
h g d c
parameter a
Memory in big endian
Memory in little endian
Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-864 Freescale Semiconductor
SPE2 Operations
__ev_stwho[m]x __ev_stwho[m]x
Vector Store Word of Two Half Words from Odd [with Modify] Indexed
__ev_stwhox (a,b,c) (M = 0)
__ev_stwhomx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,2)
*
a
16:31
MEM(EA+2,2)
*
a
48:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The odd halfwords from each element of parameter a are stored as two halfwords in storage
addressed by EA.
If M=1 (with modify), b
32:63
is updated with an address value determined by the mode specifier
in b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-693 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-693. __ev_stwho[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter c,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter c is both an input and
output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ uint32_t * int32_t evstwhox a,b,c
M = 1 void __ev64_opaque__ uint32_t *& int32_t evstwhomx a,b,c
c d e f h a b g
g h c d
h g d c
GPR
Memory in big endian
Memory in little endian
0 1 2 3 Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-865
__ev_stwwe[u] __ev_stwwe[u]
Vector Store Word of Word from Even [with Update]
__ev_stwwe (a,b,c) (U = 0)
__ev_stwweu (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*4)
MEM(EA,4) a
0:31
if (U=1) then b EA
The even word of parameter a is stored in storage addressed by EA.
Figure 3-694 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-694. __ev_stwwe[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter b,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter b is both an input and output
operand. Also, for the with update form (U=1), parameter c cannot
be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint32_t * 5-bit unsigned literal evstwwe a,c(b)
U = 1 void __ev64_opaque__ uint32_t *& 5-bit unsigned literal evstwweu a,c(b)
c d e f h a b g
c d a b
b a d c
parameter a
Memory in big endian
Memory in little endian
0 1 2 3 Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-866 Freescale Semiconductor
SPE2 Operations
__ev_stwwe[m]x __ev_stwwe[m]x
Vector Store Word of Word from Even [with Modify] Indexed
__ev_stwwex (a,b,c) (M = 0)
__ev_stwwemx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,4)
*
a
0:31
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The even word of parameter a is stored in storage addressed by EA.
If M=1 (with modify), parameter b
32:63
is updated with an address value determined by the mode
specifier in parameter b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-695 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-695. __ev_stwwe[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter c,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter c is both an input and
output operand.
M returns a b Maps to
M = 0 void __ev64_opaque__ uint32_t * evstwwex a,b,c
M = 1 void __ev64_opaque__ uint32_t *& evstwwemx a,b,c
c d e f h a b g
c d a b
b a d c
GPR
Memory in big endian
Memory in little endian
0 1 2 3 Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-867
__ev_stwwo[u] __ev_stwwo[u]
Vector Store Word of Word from Odd [with Update]
__ev_stwwo (a,b,c) (U = 0)
__ev_stwwou (a,b,c) (U = 1)
if (b = r0) then temp 0
else temp (b)
EA temp + EXTZ(c*4)
MEM(EA,4) a
32:63
if (U=1) then b EA
The odd word of parameter a is stored in storage addressed by EA.
Figure 3-696 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-696. __ev_stwwo[u] Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter b,
then a zero value is used as the base value when calculating EA. For
the with update form (U=1), parameter b is both an input and output
operand. Also, for the with update form (U=1), parameter c cannot
be 0.
U returns a b c Maps to
U = 0 void __ev64_opaque__ uint32_t * 5-bit unsigned literal evstwwo a,c(b)
U = 1 void __ev64_opaque__ uint32_t *& 5-bit unsigned literal evstwwou a,c(b)
c d e f h a b g
g h e f
f e h g
parameter a
Memory in big endian
Memory in little endian
0 1 2 3 Byte address
SPE2 Programming Interface Manual, Rev. 1.0-2
3-868 Freescale Semiconductor
SPE2 Operations
__ev_stwwo[m]x __ev_stwwo[m]x
Vector Store Word of Word from Odd [with Modify] Indexed
__ev_stwwox (a,b,c) (M = 0)
__ev_stwwomx (a,b,c) (M = 1)
if b=0 & M=1 then take_illegal_exception
if b=0 & M=0 then temp 0
else temp (b)
EA temp + (c)
MEM(EA,4)
*
a
32:63
if (M=1) then b
32:63
calc_b_update(b,c)
* - may wrap at length boundary for M=1 and mode 1000.
The odd word of parameter a is stored in storage addressed by EA.
If M=1 (with modify), parameter b
32:63
is updated with an address value determined by the mode
specifier in parameter b
0:3
. See Section 3.2.3, Addressing Modes - Modify forms.
Figure 3-697 shows how bytes are stored in memory as determined by the endian mode.
Figure 3-697. __ev_stwwo[m]x Results in Big- and Little-Endian Modes
NOTE
An alignment exception may occur if the effective address (EA) is not
word aligned. Also, if machine register r0 is used to pass parameter c,
then a zero value is used as the base value when calculating EA. And,
for the with modify form (M=1), parameter c is both an input and
output operand.
M returns a b c Maps to
M = 0 void __ev64_opaque__ uint32_t * int32_t evstwwox a,b,c
M = 1 void __ev64_opaque__ uint32_t *& int32_t evstwwomx a,b,c
c d e f h a b g
g h e f
f e h g
GPR
Memory in big endian
Memory in little endian
0 1 2 3 Byte address
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-869
__ev_subf2add2h __ev_subf2add2h
Vector Subtract from Upper 2 / Add Lower 2 Half Words
d = __ev_subf2add2h (a,b)
d
0:15
b
0:15
- a
0:15
// Modulo
d
16:31
b
16:31
- a
16:31
// Modulo
d
32:47
b
32:47
+ a
32:47
// Modulo
d
48:63
b
48:63
+ a
48:63
// Modulo
The upper two half word elements of parameter a are subtracted from the upper 2 half word
elements of parameter b, the lower two half word elements of parameter a are added to the lower
two half word elements of parameter b, and the results are placed in parameter d. The sum and
difference are modulo.
Figure 3-698. Vector Subtract from Upper 2 / Add Lower 2 Half Words (__ev_subf2add2h)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubf2add2h d,a,b
a
0 31 32 63
b
15 16 47 48
+
d
+ - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-870 Freescale Semiconductor
SPE2 Operations
__ev_subf2add2hss __ev_subf2add2hss
Vector Subtract from Upper 2 / Add Lower 2 Half Words Signed and Saturate
d = __ev_subf2add2hss (a,b)
// h0
temp
0:31
EXTS(b
0:15
) - EXTS(a
0:15
)
ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) - EXTS(a
16:31
)
ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) + EXTS(a
32:47
)
ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) + EXTS(a
48:63
)
ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The upper two signed half word elements of parameter a are subtracted from the upper two signed
half word elements of parameter b, the lower two signed half word elements of parameter a are
added to the lower two signed half word elements of parameter b, saturating if overflow or
underflow occurs, and the results are placed in parameter d. Any overflow or underflow is recorded
in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-699. Vector Subtract from Upper 2 / Add Lower 2 Half Words Signed and Saturate
(__ev_subf2add2hss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubf2add2hss d,a,b
a
0 31 32 63
b
15 16 47 48
+
d
+ - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-871
__ev_subfaddh __ev_subfaddh
Vector Subtract from / Add Half Words
d = __ev_subfaddh (a,b)
d
0:15
b
0:15
- a
0:15
// Modulo difference, b - a
d
16:31
b
16:31
+ a
16:31
// Modulo sum
d
32:47
b
32:47
- a
32:47
// Modulo difference, b - a
d
48:63
b
48:63
+ a
48:63
// Modulo sum
The even half word elements of parameter a are subtracted from the even half word elements of
parameter b, the odd half word elements of parameter a are added to the odd half word elements
of parameter b, and the results are placed in parameter d. The sum and difference are modulo.
Figure 3-700. Vector Subtract from / Add Half Words (__ev_subfaddh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfaddh d,a,b
a
0 31 32 63
b
15 16 47 48
+
d
- + -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-872 Freescale Semiconductor
SPE2 Operations
__ev_subfaddhss __ev_subfaddhss
Vector Subtract from / Add Half Words Signed and Saturate
d = __ev_subfaddhss (a,b)
// h0
temp
0:31
EXTS(b
0:15
) - EXTS(a
0:15
)
ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) + EXTS(a
16:31
)
ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) - EXTS(a
32:47
)
ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) + EXTS(a
48:63
)
ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The even signed half word elements of parameter a are subtracted from the even signed half word
elements of parameter b, the odd signed half word elements of parameter a are added to the odd
signed half word elements of parameter b, saturating if overflow or underflow occurs, and the
results are placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-701. Vector Subtract from / Add Half Words Signed and Saturate
(__ev_subfaddhss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfaddhss d,a,b
a
0 31 32 63
b
15 16 47 48
+
d
- + -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-873
__ev_subfaddhx __ev_subfaddhx
Vector Subtract from / Add Half Words Exchanged
d = __ev_subfaddhx (a,b)
// h0
d
0:15
b
0:15
- a
16:31
// modulo difference
// h1
d
16:31
b
16:31
+ a
0:15
// modulo sum
// h2
d
32:47
b
32:47
- a
48:63
// modulo difference
// h3
d
48:63
b
48:63
+ a
32:47
// modulo sum
The odd exchanged half word elements of parameter a are subtracted from the even half word
elements of parameter b, the even exchanged half words of parameter a are added to the odd half
words of parameter b, and the results are placed in parameter d. The sum and differences are
modulo.
Figure 3-702. Vector Subtract from / Add Half Words Exchanged (__ev_subfaddhx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfaddhx d,a,b
a
0 31 32 63
b
15 16 47 48
+
d
- + -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-874 Freescale Semiconductor
SPE2 Operations
__ev_subfaddhxss __ev_subfaddhxss
Vector Subtract from / Add Half Words Exchanged, Signed and Saturate
d = __ev_subfaddhxss (a,b)
// h0
temp
0:31
EXTS(b
0:15
) - EXTS(a
16:31
)
ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) + EXTS(a
0:15
)
ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) - EXTS(a
48:63
)
ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) + EXTS(a
32:47
)
ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The odd exchanged signed half word elements of parameter a are subtracted from the even signed
half word elements of parameter b, the even exchanged signed half words of parameter a are added
to the odd signed half words of parameter b, saturating if overflow or underflow occurs, and the
results are placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-703. Vector Subtract from / Add Half Words Exchanged, Signed and Saturate
(__ev_subfaddhxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfaddhxss d,a,b
a
0 31 32 63
b
15 16 47 48
+
d
- + -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-875
__ev_subfaddw __ev_subfaddw
Vector Subtract from / Add Word
d = __ev_subfaddw (a,b)
d
0:31
b
0:31
- a
0:31
// Modulo
d
32:63
b
32:63
+ a
32:63
// Modulo
The high word element of parameter a is subtracted from the high word element of parameter b,
the low word element of parameter a is added to the low word element of parameter b, and the
results are placed in parameter d.
Figure 3-704. Vector Subtract from / Add Word (__ev_subfaddw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfaddw d,a,b
0 31 32 63
b
a
- +
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-876 Freescale Semiconductor
SPE2 Operations
__ev_subfaddwss __ev_subfaddwss
Vector Subtract from / Add Word Signed and Saturate
d = __ev_subfaddwss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) - EXTS(a
0:31
)
ovh temp
0
temp
1
d
0:15
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:31
EXTS(b
32:63
) + EXTS(a
32:63
)
ovl temp
0
temp
1
d
16:31
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The high word element of parameter a is subtracted from the high word element of parameter b,
saturating if overflow or underflow occurs, the low word element of parameter a is added to the
low word element of parameter b, saturating if overflow or underflow occurs, and the results are
placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-705. Vector Subtract from / Add Word Signed and Saturate (__ev_subfaddwss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfaddwss d,a,b
0 31 32 63
b
a
- +
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-877
__ev_subfaddwx __ev_subfaddwx
Vector Add / Subtract from Word Exchanged
d = __ev_subfaddwx (a,b)
d
0:31
b
0:31
- a
32:63
// Modulo
d
32:63
b
32:63
+ a
0:31
// Modulo
The low word element of parameter a is subtracted from the high word element of parameter b,
the high word element of parameter a is added to the low word element of parameter b, and the
results are placed in parameter d.
Figure 3-706. Vector Subtract from / Add Word Exchanged (__ev_subfaddwx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfaddwx d,a,b
0 31 32 63
b
a
- +
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-878 Freescale Semiconductor
SPE2 Operations
__ev_subfaddwxss __ev_subfaddwxss
Vector Add / Subtract from Word Exchanged Signed and Saturate
d = __ev_subfaddwxss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) - EXTS(a
32:63
)
ovh temp
0
temp
11
d
0:15
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:31
EXTS(b
32:63
) + EXTS(a
0:31
)
ovl temp
0
temp
1
d
16:31
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The low word element of parameter a is subtracted from the high word element of parameter b,
saturating if overflow or underflow occurs, the high word element of parameter a is added to the
low word element of parameter b, saturating if overflow or underflow occurs, and the results are
placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-707. Vector Add / Subtract from Word Exchanged Signed and Saturate
(__ev_addsubfwxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evaddsubfwxss d,a,b
0 31 32 63
b
a
- +
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-879
__ev_subfb __ev_subfb
Vector Subtract from Byte
d = __ev_subfb (a,b)
d
0:7
b
0:7
- a
0:7
// Modulo difference
d
8:15
b
8:15
- a
8:15
// Modulo difference
d
16:23
b
16:23
- a
16:23
// Modulo difference
d
24:31
b
24:31
- a
24:31
// Modulo difference
d
32:39
b
32:39
- a
32:39
// Modulo difference
d
40:47
b
40:47
- a
40:47
// Modulo difference
d
48:55
b
48:55
- a
48:55
// Modulo difference
d
56:63
b
56:63
- a
56:63
// Modulo difference
The eight byte elements of parameter a are subtracted from the corresponding elements of
parameter b and the results are placed in parameter d. The difference is a modulo difference.
Figure 3-708. Vector Subtract from Byte (__ev_subfb)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfb d,a,b
0 32 63
a
b
24 16 8 40 48 56
-
0 32 63
d
24 16 8 40 48 56
- - - - - - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-880 Freescale Semiconductor
SPE2 Operations
__ev_subfbss __ev_subfbss
Vector Subtract from Byte Signed and Saturate
d = __ev_subfbss (a,b)
temp
0:8
EXTS(b
0:7
) - EXTS(a
0:7
); ovb0 temp
0
temp
1
d
0:7
SATURATE(ovb0, temp
0
, 0x80, 0x7f, temp
1:8
)
temp
0:8
EXTS(b
8:15
) - EXTS(a
8:15
); ovb1 temp
0
temp
1
d
8:15
SATURATE(ovb1, temp
0
, 0x80, 0x7f, temp
1:8
)
temp
0:8
EXTS(b
16:23
) - EXTS(a
16:23
); ovb2 temp
0
temp
1
d
16:23
SATURATE(ovb2, temp
0
, 0x80, 0x7f, temp
1:8
)
temp
0:8
EXTS(b
24:31
) - EXTS(a
24:31
); ovb3 temp
0
temp
1
d
24:31
SATURATE(ovb3, temp
0
, 0x80, 0x7f, temp
1:8
)
temp
0:8
EXTS(b
32:39
) - EXTS(a
32:39
); ovb4 temp
0
temp
1
d
32:39
SATURATE(ovb4, temp
0
, 0x80, 0x7f, temp
1:8
)
temp
0:8
EXTS(b
40:47
) - EXTS(a
40:47
); ovb5 temp
0
temp
1
d
40:47
SATURATE(ovb5, temp
0
, 0x80, 0x7f, temp
1:8
)
temp
0:8
EXTS(b
48:55
) - EXTS(a
48:55
); ovb6 temp
0
temp
1
d
48:55
SATURATE(ovb6, temp
0
, 0x80, 0x7f, temp
1:8
)
temp
0:8
EXTS(b
56:63
) - EXTS(a
56:63
); ovb7 temp
0
temp
1
d
56:63
SATURATE(ovb7, temp
0
, 0x80, 0x7f, temp
1:8
)
ovh ovb0 | ovb1 | ovb2 | ovb3
ovl ovh4 | ovh5 | ovb6 | ovb7
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The eight signed byte elements of parameter a are subtracted from the corresponding signed
elements of parameter b, saturating if overflow or underflow occurs, and the results are placed in
parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and summary
overflow bits.
Other registers altered: SPEFSCR
Figure 3-709. Vector Subtract from Byte Signed and Saturate (__ev_subfbss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfbss d,a,b
0 32 63
a
b
24 16 8 40 48 56
-
0 32 63
d
24 16 8 40 48 56
- - - - - - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-881
__ev_subfbus __ev_subfbus
Vector Subtract from Byte Unsigned and Saturate
d = __ev_subfbus (a,b)
temp
0:8
EXTZ(b
0:7
) - EXTZ(a
0:7
); ovb0 temp
0
d
0:7
SATURATE(ovb0, temp
0
, 0x0, 0x0, temp
1:8
)
temp
0:8
EXTZ(b
8:15
) - EXTZ(a
8:15
); ovb1 temp
0
d
8:15
SATURATE(ovb1, temp
0
, 0x0, 0x0, temp
1:8
)
temp
0:8
EXTZ(b
16:23
) - EXTZ(a
16:23
); ovb2 temp
0
d
16:23
SATURATE(ovb2, temp
0
, 0x0, 0x0, temp
1:8
)
temp
0:8
EXTZ(b
24:31
) - EXTZ(a
24:31
); ovb3 temp
0
d
24:31
SATURATE(ovb3, temp
0
, 0x0, 0x0, temp
1:8
)
temp
0:8
EXTZ(b
32:39
) - EXTZ(a
32:39
); ovb4 temp
0
d
32:39
SATURATE(ovb4, temp
0
, 0x0, 0x0, temp
1:8
)
temp
0:8
EXTZ(b
40:47
) - EXTZ(a
40:47
); ovb5 temp
0
d
40:47
SATURATE(ovb5, temp
0
, 0x0, 0x0, temp
1:8
)
temp
0:8
EXTZ(b
48:55
) - EXTZ(a
48:55
); ovb6 temp
0
d
48:55
SATURATE(ovb6, temp
0
, 0x0, 0x0, temp
1:8
)
temp
0:8
EXTZ(b
56:63
) - EXTZ(a
56:63
); ovb7 temp
0
d
56:63
SATURATE(ovb7, temp
0
, 0x0, 0x0, temp
1:8
)
ovh ovb0 | ovb1 | ovb2 | ovb3
ovl ovh4 | ovh5 | ovb6 | ovb7
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
The eight unsigned byte elements of parameter a are subtracted from the corresponding unsigned
elements of parameter b, saturating if underflow occurs, and the results are placed in parameter d.
Any underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-710. Vector Subtract from Byte Unsigned and Saturate (__ev_subfbus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfbus d,a,b
0 32 63
a
b
24 16 8 40 48 56
-
0 32 63
d
24 16 8 40 48 56
- - - - - - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-882 Freescale Semiconductor
SPE2 Operations
__ev_subfd __ev_subfd
Vector Subtract from Doubleword
d = __ev_subfd (a,b)
d
0:63
b
0:63
- a
0:63
The 64-bit value in a is subtracted from the 64-bit value in parameter b and the results are placed
into parameter d.
Figure 3-711. Vector Subtract from Doubleword (__ev_subfd)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfd d,a,b
0 63
b
a
-
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-883
__ev_subfdss __ev_subfdss
Vector Subtract from Doubleword Signed and Saturate
d = __ev_subfdss (a,b)
temp
0:64
EXTS
65
(b
0:63
) - EXTS
65
(a
0:63
)
ov temp
0
temp
1
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7fff_ffff_ffff_ffff, temp
1:64
)
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov.
The signed doubleword in parameter a is subtracted from the signed doubleword in parameter b,
saturating if overflow or underflow occurs, and the result is placed into parameter d. Any overflow
or underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-712. Vector Subtract from Doubleword Signed and Saturate (__ev_subfdss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfdss d,a,b
0 63
a
b
-
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-884 Freescale Semiconductor
SPE2 Operations
__ev_subfdus __ev_subfdus
Vector Subtract from Doubleword Unsigned and Saturate
d = __ev_subfdus (a,b)
temp
0:64
EXTZ
65
(b
0:63
) - EXTZ
65
(a
0:63
)
ov temp
0
d
0:63
SATURATE(ov, temp
0
, 0x0000_0000_0000_0000, 0x0000_0000_0000_0000, temp
1:64
)
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov.
The unsigned doubleword in parameter a is subtracted from the unsigned doubleword in parameter
b, saturating if underflow occurs, and the result is placed into parameter d. Any underflow is
recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-713. Vector Subtract from Doubleword Unsigned and Saturate (__ev_subfdus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfdus d,a,b
0 63
a
b
-
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-885
__ev_subfh __ev_subfh
Vector Subtract from Half Word
d = __ev_subfh (a,b)
d
0:15
b
0:15
- a
0:15
// Modulo difference
d
16:31
b
16:31
- a
16:31
// Modulo difference
d
32:47
b
32:47
- a
32:47
// Modulo difference
d
48:63
b
48:63
- a
48:63
// Modulo difference
The four half word elements of parameter a are subtracted from the four half word elements of
parameter b and the results are placed in parameter d. The difference is a modulo difference.
Figure 3-714. Vector Subtract from Half Word (__ev_subfh)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfh d,a,b
a
0 31 32 63
b
15 16 47 48
-
d
- - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-886 Freescale Semiconductor
SPE2 Operations
__ev_subfhhisw __ev_subfhhisw
Vector Subtract from Halfwords High Signed to Words
d = __ev_subfhhisw (a,b)
d
0:31
EXTS
32
(b
0:15
) - EXTS
32
(a
0:15
)// Modulo
d
32:63
EXTS
32
(b
16:31
) - EXTS
32
(a
16:31
) // Modulo
The high halfword elements of parameter a are sign-extended to 32 bits and subtracted from the
respective sign-extended high halfword elements of parameter b and the 32-bit results are placed
into parameter d. The difference is a modulo difference.
Figure 3-715. Vector Subtract from Halfwords High Signed to Words (__ev_subfhhisw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhhisw d,a,b
b
0 31 32 63
a
15 16 47 48
d
- -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-887
__ev_subfhhiuw __ev_subfhhiuw
Vector Subtract from Half Words High Unsigned to Words
d = __ev_subfhhiuw (a,b)
d
0:31
EXTZ
32
(b
0:15
) - EXTZ
32
(a
0:15
)// Modulo
d
32:63
EXTZ
32
(b
16:31
) - EXTZ
32
(a
16:31
) // Modulo
The high halfword elements of parameter a are zero-extended to 32 bits and subtracted from the
respective zero-extended high halfword elements of parameter b and the 32-bit results are placed
into parameter d. The difference is a modulo difference.
Figure 3-716. Vector Subtract from Halfwords High Unsigned to Words (__ev_subfhhiuw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhhiuw d,a,b
b
0 31 32 63
a
15 16 47 48
d
- -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-888 Freescale Semiconductor
SPE2 Operations
__ev_subfhlosw __ev_subfhlosw
Vector Subtract from Halfwords Low Signed to Words
d = __ev_subfhlosw (a,b)
d
0:31
EXTS
32
(b
32:47
) - EXTS
32
(a
32:47
)// Modulo
d
32:63
EXTS
32
(b
48:63
) - EXTS
32
(a
48:63
) // Modulo
The low halfword elements of parameter a are sign-extended to 32 bits and subtracted from the
respective sign-extended low halfword elements of parameter b and the 32-bit results are placed
into parameter d. The sum is a modulo sum.
Figure 3-717. Vector Subtract from Halfwords Low Signed to Words (__ev_subfhlosw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhlosw d,a,b
b
0 31 32 63
a
15 16 47 48
d
- -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-889
__ev_subfhlouw __ev_subfhlouw
Vector Subtract from Halfwords Low Unsigned to Words
d = __ev_subfhlouw (a,b)
d
0:31
EXTZ
32
(b
32:47
) - EXTZ
32
(a
32:47
)// Modulo
d
32:63
EXTZ
32
(b
48:63
) - EXTZ
32
(a
48:63
) // Modulo
The low halfword elements of parameter a are zero-extended to 32 bits and subtracted from the
respective zero-extended low halfword elements of parameter b and the 32-bit results are placed
into parameter d. The sum is a modulo sum.
Figure 3-718. Vector Subtract from Halfwords Low Unsigned to Words (__ev_subfhlouw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhlouw d,a,b
b
0 31 32 63
a
15 16 47 48
d
- -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-890 Freescale Semiconductor
SPE2 Operations
__ev_subfhss __ev_subfhss
Vector Subtract from Half Words Signed and Saturate
d = __ev_subfhss (a,b)
// h0
temp
0:31
EXTS(b
0:15
) - EXTS(a
0:15
)
ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) - EXTS(a
16:31
)
ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) - EXTS(a
32:47
)
ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) - EXTS(a
48:63
)
ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The four signed half word elements of parameter a are subtracted from the corresponding four
signed half word elements of parameter b, saturating if overflow or underflow occurs, and the
results are placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-719. Vector Subtract from Half Words Signed and Saturate (__ev_subfhss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhss d,a,b
a
0 31 32 63
b
15 16 47 48
-
d
- - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-891
__ev_subfhus __ev_subfhus
Vector Subtract from Half Words Unsigned and Saturate
d = __ev_subfhus (a,b)
// h0
temp
0:31
EXTZ(b
0:15
) - EXTZ(a
0:15
)
ovh0 temp
15
d
0:15
SATURATE(ovh0, temp
15
, 0x0000, 0x0000, temp
16:31
)
// h1
temp
0:31
EXTZ(b
16:31
) - EXTZ(a
16:31
)
ovh1 temp
15
d
16:31
SATURATE(ovh1, temp
15
, 0x0000, 0x0000, temp
16:31
)
// h2
temp
0:31
EXTZ(b
32:47
) - EXTZ(a
32:47
)
ovh2 temp
15
d
32:47
SATURATE(ovh2, temp
15
, 0x0000, 0x0000, temp
16:31
)
// h3
temp
0:31
EXTZ(b
48:63
) - EXTZ(a
48:63
)
ovh3 temp
15
d
48:63
SATURATE(ovh3, temp
15
, 0x0000, 0x0000, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The four unsigned half word elements of parameter a are subtracted from four exchanged unsigned
half word elements of parameter b, saturating if underflow occurs, and the results are placed in
parameter d. Any underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-720. Vector Subtract from Half Words Unsigned and Saturate (__ev_subfhus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhus d,a,b
a
0 31 32 63
b
15 16 47 48
-
d
- - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-892 Freescale Semiconductor
SPE2 Operations
__ev_subfhx __ev_subfhx
Vector Subtract from Half Words Exchanged
d = __ev_subfhx (a,b)
// h0
d
0:15
b
0:15
- a
16:31
// modulo difference
// h1
d
16:31
b
16:31
- a
0:15
// modulo difference
// h2
d
32:47
b
32:47
- a
48:63
// modulo difference
// h3
d
48:63
b
48:63
- a
32:47
// modulo difference
The four exchanged half word elements of parameter a are subtracted from the four half word
elements of parameter b. The results are placed into parameter d. The difference is a modulo
difference.
Figure 3-721. Vector Subtract from Half Words Exchanged (__ev_subfhx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhx d,a,b
a
0 31 32 63
b
15 16 47 48
-
d
- - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-893
__ev_subfhxss __ev_subfhxss
Vector Subtract from Half Words Exchanged, Signed and Saturate
d = __ev_subfhxss (a,b)
// h0
temp
0:31
EXTS(b
0:15
) - EXTS(a
16:31
)
ovh0 temp
15
temp
16
d
0:15
SATURATE(ovh0, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h1
temp
0:31
EXTS(b
16:31
) - EXTS(a
0:15
)
ovh1 temp
15
temp
16
d
16:31
SATURATE(ovh1, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h2
temp
0:31
EXTS(b
32:47
) - EXTS(a
48:63
)
ovh2 temp
15
temp
16
d
32:47
SATURATE(ovh2, temp
15
, 0x8000, 0x7fff, temp
16:31
)
// h3
temp
0:31
EXTS(b
48:63
) - EXTS(a
32:47
)
ovh3 temp
15
temp
16
d
48:63
SATURATE(ovh3, temp
15
, 0x8000, 0x7fff, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The four exchanged signed half word elements of parameter a are subtracted from four signed half
word elements of parameter b, saturating if overflow or underflow occurs, and the results are
placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-722. Vector Subtract from Half Words Exchanged Signed and Saturate
(__ev_subfhxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhxss d,a,b
a
0 31 32 63
b
15 16 47 48
-
d
- - -
SPE2 Programming Interface Manual, Rev. 1.0-2
3-894 Freescale Semiconductor
SPE2 Operations
__ev_subfhxus __ev_subfhxus
Vector Subtract from Half Words Exchanged, Unsigned and Saturate
d = __ev_subfhxus (a,b)
// h0
temp
0:31
EXTZ(b
0:15
) - EXTZ(a
16:31
)
ovh0 temp
15
d
0:15
SATURATE(ovh0, temp
15
, 0x0000, 0x0000, temp
16:31
)
// h1
temp
0:31
EXTZ(b
16:31
) - EXTZ(a
0:15
)
ovh1 temp
15
d
16:31
SATURATE(ovh1, temp
15
, 0x0000, 0x0000, temp
16:31
)
// h2
temp
0:31
EXTZ(b
32:47
) - EXTZ(a
48:63
)
ovh2 temp
15
d
32:47
SATURATE(ovh2, temp
15
, 0x0000, 0x0000, temp
16:31
)
// h3
temp
0:31
EXTZ(b
48:63
) - EXTZ(a
32:47
)
ovh3 temp
15
d
48:63
SATURATE(ovh3, temp
15
, 0x0000, 0x0000, temp
16:31
)
ovh ovh0 | ovh1
ovl ovh2 | ovh3
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The four exchanged unsigned half word elements of parameter a are subtracted from four unsigned
half word elements of parameter b, saturating if underflow occurs, and the results are placed in
parameter d. Any underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-723. Vector Subtract from Exchanged Half Words Unsigned and Saturate
(__ev_subfhxus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfhxus d,a,b
a
0 31 32 63
b
15 16 47 48
-
d
- - -
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-895
__ev_subfsmiaa __ev_subfsmiaa
Vector Subtract from Accumulator to Accumulator
d = __ev_subfsmiaa (a)
d
0:63
ACC
0:63
- a
0:63
// update accumulator
ACC
0:63
d
0:63
The 64-bit value in parameter a is subtracted from the value in the accumulator and the difference
is placed into parameter d and the accumulator.
This instruction can be used for signed or unsigned integers or fractions.
Other registers altered: ACC
Figure 3-724. Vector Subtract from Accumulator to Accumulator (__ev_subfsmiaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsubfsmiaa d,a
0 63
a
Accumulator
-
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-896 Freescale Semiconductor
SPE2 Operations
__ev_subfsmiaaw __ev_subfsmiaaw
Vector Subtract Signed, Modulo, Integer to Accumulator Word
d = __ev_subfsmiaaw (a)
// high
d
0:31
ACC
0:31
a
0:31
// low
d
32:63
ACC
32:63
a
32:63
// update accumulator
ACC
0:63
d
0:63
Each word element in parameter a is subtracted from the corresponding element in the
accumulator and the difference is placed into the corresponding parameter d word and into the
accumulator.
Other registers altered: ACC
Figure 3-725. Vector Subtract Signed, Modulo, Integer to Accumulator Word
(__ev_subfsmiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsubfsmiaaw d,a
0 31 32 63
Accumulator
a
d and accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-897
__ev_subfssiaa __ev_subfssiaa
Vector Subtract from Signed, Saturate, Integer to Accumulator
d = __ev_subfssiaa (a)
temp
0:64
EXTS(ACC
0:63
) - EXTS(a
0:63
)
ov temp
0
temp
1
d
0:63
SATURATE(ov, temp
0
, 0x8000_0000_0000_0000, 0x7fff_ffff_ffff_ffff, temp
1:64
)
// update accumulator
ACC
0:63
d
0:63
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The signed 64-bit value in parameter a is subtracted from the value in the accumulator saturating
if overflow or underflow occurs, and the results are placed in parameter d and the accumulator.
Any overflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-726. Vector Subtract from Signed, Saturate, Integer to Accumulator
(__ev_subfssiaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsubfssiaa d,a
0 63
a
Accumulator
-
ssat
d and Accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-898 Freescale Semiconductor
SPE2 Operations
__ev_subfssiaaw __ev_subfssiaaw
Vector Subtract Signed, Saturate, Integer to Accumulator Word
d = __ev_subfssiaaw (a)
// high
temp
0:63
EXTS(ACC
0:31
) - EXTS(a
0:31
)
ovh temp
31
temp
32
d
0:31
SATURATE(ovh, temp
31
, 0x80000000, 0x7fffffff, temp
32:63
)
// low
temp
0:63
EXTS(ACC
32:63
) - EXTS(a
32:63
)
ovl temp
31
temp
32
d
32:63
SATURATE(ovl, temp
31
, 0x80000000, 0x7fffffff, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
Each signed integer word element in parameter a is sign-extended and subtracted from the
corresponding sign-extended element in the accumulator, saturating if overflow or underflow
occurs, and the results are placed in parameter d and the accumulator. Any overflow is recorded in
the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-727. Vector Subtract Signed, Saturate, Integer to Accumulator Word
(__ev_subfssiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsubfssiaaw d,a
0 31 32 63
Accumulator
a
d and accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-899
__ev_subfumiaaw __ev_subfumiaaw
Vector Subtract Unsigned, Modulo, Integer to Accumulator Word
d = __ev_subfumiaaw (a)
// high
d
0:31
ACC
0:31
a
0:31
// low
d
32:63
ACC
32:63
a
32:63
// update accumulator
ACC
0:63
d
0:63
Each unsigned integer word element in parameter a is subtracted from the corresponding element
in the accumulator, and the results are placed in the corresponding parameter d and into the
accumulator.
Other registers altered: ACC
Figure 3-728. Vector Subtract Unsigned, Modulo, Integer to Accumulator Word
(__ev_subfumiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsubfumiaaw d,a
0 31 32 63
Accumulator
a
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-900 Freescale Semiconductor
SPE2 Operations
__ev_subfusiaa __ev_subfusiaa
Vector Subtract from (Accumulator) Unsigned, Saturate, Integer to Accumulator
d = __ev_subfusiaa (a)
temp
0:64
EXTZ(ACC
0:63
) - EXTZ(a
0:63
)
ov temp
0
d
0:63
SATURATE(ov, temp
0
, 0xffff_ffff_ffff_ffff, 0xffff_ffff_ffff_ffff, temp
1:64
)
// update accumulator
ACC
0:63
d
0:63
SPEFSCR
OVH
0
SPEFSCR
OV
ov
SPEFSCR
SOV
SPEFSCR
SOV
| ov
The unsigned 64-bit value in parameter a is subtracted from the value in the accumulator saturating
if underflow occurs, and the results are placed in parameter d and the accumulator. Any underflow
is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-729. Vector Subtract from Unsigned, Saturate, Integer to Accumulator
(__ev_subfusiaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsubfusiaa d,a
0 63
a
Accumulator
-
usat
d and Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-901
__ev_subfusiaaw __ev_subfusiaaw
Vector Subtract Unsigned, Saturate, Integer to Accumulator Word
d = __ev_subfusiaaw (a)
// high
temp
0:63
EXTZ(ACC
0:31
) - EXTZ(a
0:31
)
ovh temp
31
d
0:31
SATURATE(ovh, temp
31
, 0x00000000, 0x00000000, temp
32:63
)
// low
temp
0:63
EXTZ(ACC
32:63
) - EXTZ(a
32:63
)
ovl temp
31
d
32:63
SATURATE(ovl, temp
31
, 0x00000000, 0x00000000, temp
32:63
)
// update accumulator
ACC
0:63
d
0:63
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl
Each unsigned integer word element in parameter a is zero-extended and subtracted from the
corresponding zero-extended element in the accumulator, saturating if underflow occurs, and the
results are placed in parameter d and the accumulator. Any underflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR ACC
Figure 3-730. Vector Subtract Unsigned, Saturate, Integer to Accumulator Word
(__ev_subfusiaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsubfusiaaw d,a
0 31 32 63
Accumulator
a
d and accumulator
SPE2 Programming Interface Manual, Rev. 1.0-2
3-902 Freescale Semiconductor
SPE2 Operations
__ev_subfw __ev_subfw
Vector Subtract from Word
d = __ev_subfw (a,b)
d
0:31
b
0:31
- a
0:31
// Modulo difference
d
32:63
b
32:63
- a
32:63
// Modulo difference
Each signed integer element of parameter a is subtracted from the corresponding element of
parameter b, and the results are placed into parameter d.
Figure 3-731. Vector Subtract from Word (__ev_subfw)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfw d,a,b
0 31 32 63
b
a
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-903
__ev_subfwegsf __ev_subfwegsf
Vector Subtract from Word Even Guarded Signed Fraction
d = __ev_subfwegsf (a,b)
d
0:63
(EXTS
48
(b
0:31
) ||
16
0) - (EXTS
48
(a
0:31
) ||
16
0)
The even word elements of parameters a and b are sign-extended with 16 guard bits and padded
with 16 0s, and then the parameter a value is subtracted from the parameter b value to produce a
64-bit difference, and the result is placed into parameter d.
Note: __ev_subfwegsf is used to subtract 1.31 fractions to produce a 17.47 fractional difference.
Figure 3-732. Vector Subtract from Word Even Guarded Signed Fraction
(__ev_subfwegsf)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwegsf d,a,b
0 31 32 63
b
a
0 63
a
15 16 17
.
-
SPE2 Programming Interface Manual, Rev. 1.0-2
3-904 Freescale Semiconductor
SPE2 Operations
__ev_subfwegsi __ev_subfwegsi
Vector Subtract from Word Even Guarded Signed Integer
d = __ev_subfwegsi (a,b)
d
0:63
EXTS
64
(b
0:31
) - EXTS
64
(a
0:31
)
The even word element of parameter a is sign-extended to 64 bits and subtracted from the
sign-extended even word element of parameter b to produce a 64-bit difference, and the result is
placed into parameter d.
Note: __ev_subfwegsi can also be used to subtract 1.31 fractions to produce a 33.31 fractional
difference.
Figure 3-733. Vector Subtract from Word Even Guarded Signed Integer (__ev_subfwegsi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwegsi d,a,b
0 31 32 63
b
a
-
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-905
__ev_subfwogsf __ev_subfwogsf
Vector Subtract from Word Odd Guarded Signed Fraction
d = __ev_subfwogsf (a,b)
d
0:63
(EXTS
48
(b
32:63
) ||
16
0) - (EXTS
48
(a
32:63
) ||
16
0)
The odd word elements of parameters a and b are sign-extended with 16 guard bits and padded
with 16 0s, and then the parameter a value is subtracted from the parameter b value to produce a
64-bit difference, and the result is placed into parameter d.
Note: __ev_subfwogsf is used to subtract 1.31 fractions to produce a 17.47 fractional difference.
Figure 3-734. Vector Subtract from Word Odd Guarded Signed Fraction (__ev_subfwogsf)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwogsf d,a,b
0 31 32 63
b
a
0 63
a
15 16 17
.
-
SPE2 Programming Interface Manual, Rev. 1.0-2
3-906 Freescale Semiconductor
SPE2 Operations
__ev_subfwogsi __ev_subfwogsi
Vector Subtract from Word Odd Guarded Signed Integer
d = __ev_subfwogsi (a,b)
d
0:63
EXTS
64
(b
32:63
) - EXTS
64
(a
32:63
)
The odd word element of parameter a is sign-extended to 64 bits and subtracted from the
sign-extended odd word element of parameter b to produce a 64-bit difference, and the result is
placed into parameter d.
Note: __ev_subfwogsi can also be used to subtract 1.31 fractions to produce a 33.31 fractional
difference.
Figure 3-735. Vector Subtract from Word Odd Guarded Signed Integer (__ev_subfwogsi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev_subfwogsi d,a,b
0 31 32 63
b
a
-
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-907
__ev_subfwss __ev_subfwss
Vector Subtract from Word
d = __ev_subfwss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) - EXTS(a
0:31
)
ovh temp
0
temp
1
d
0:31
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:32
EXTS(b
32:63
) - EXTS(a
32:63
)
ovl temp
0
temp
1
d
32:63
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The high word element of parameter a is subtracted from the high word element of parameter b,
saturating if overflow or underflow occurs, the low word element of parameter a is subtracted from
the low word element of parameter b, saturating if overflow or underflow occurs, and the results
are placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR overflow and
summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-736. Vector Subtract from Word Signed and Saturate (__ev_subfwss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwss d,a,b
0 31 32 63
b
a
- -
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-908 Freescale Semiconductor
SPE2 Operations
__ev_subfwus __ev_subfwus
Vector Subtract from Word Unsigned and Saturate
d = __ev_subfwus (a,b)
// h0
temp
0:32
EXTZ(b
0:31
) - EXTZ(a
0:31
)
ovh temp
0
d
0:31
SATURATE(ovh, temp
0
, 0x0000_0000, 0x0000_0000, temp
1:32
)
// h1
temp
0:32
EXTZ(b
32:63
) - EXTZ(a
32:63
)
ovl temp
0
d
32:63
SATURATE(ovl, temp
0
, x0000_0000, 0x0000_0000, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The high word element of parameter a is subtracted from the high word element of parameter b,
saturating if underflow occurs, the low word element of parameter a is subtracted from the low
word element of parameter b, saturating if underflow occurs, and the results are placed in
parameter d. Any underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-737. Vector Subtract from Word Unsigned and Saturate (__ev_subfwus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwus d,a,b
0 31 32 63
b
a
- -
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-909
__ev_subfwx __ev_subfwx
Vector Subtract from Word Exchanged
d = __ev_subfwx (a,b)
d
0:31
b
0:31
- a
32:63
// Modulo difference
d
32:63
b
32:63
- a
0:31
// Modulo difference
The exchanged signed integer word elements of parameter a are subtracted from the corresponding
elements of parameter b and the results are placed into parameter d.
Figure 3-738. Vector Subtract from Word Exchanged (__ev_subfwx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwx d,a,b
0 31 32 63
b
a
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-910 Freescale Semiconductor
SPE2 Operations
__ev_subfwxss __ev_subfwxss
Vector Subtract from Word Exchanged Signed and Saturate
d = __ev_subfwxss (a,b)
// h0
temp
0:32
EXTS(b
0:31
) - EXTS(a
32:63
)
ovh temp
0
temp
1
d
0:15
SATURATE(ovh, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
// h1
temp
0:31
EXTS(b
32:63
) - EXTS(a
0:31
)
ovl temp
0
temp
1
d
16:31
SATURATE(ovl, temp
0
, 0x8000_0000, 0x7fff_ffff, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The low word element of parameter a is subtracted from the high word element of parameter b,
saturating if overflow or underflow occurs, the high word element of parameter a is subtracted
from the low word element of parameter b, saturating if overflow or underflow occurs, and the
results are placed in parameter d. Any overflow or underflow is recorded in the SPEFSCR
overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-739. Vector Subtract from Word Exchanged Signed and Saturate
(__ev_subfwxss)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwxss d,a,b
0 31 32 63
b
a
- -
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-911
__ev_subfwxus __ev_subfwxus
Vector Subtract from Word Exchanged Unsigned and Saturate
d = __ev_subfwxus (a,b)
// h0
temp
0:32
EXTZ(b
0:31
) - EXTZ(a
32:63
)
ovh temp
0
d
0:15
SATURATE(ovh, temp
0
, 0x0000_0000, 0x0000_0000, temp
1:32
)
// h1
temp
0:31
EXTZ(b
32:63
) - EXTZ(a
0:31
)
ovl temp
0
d
16:31
SATURATE(ovl, temp
0
, x0000_0000, 0x0000_0000, temp
1:32
)
SPEFSCR
OVH
ovh
SPEFSCR
OV
ovl
SPEFSCR
SOVH
SPEFSCR
SOVH
| ovh
SPEFSCR
SOV
SPEFSCR
SOV
| ovl.
The high word element of parameter a is subtracted from the high word element of parameter b,
saturating if underflow occurs, the low word element of parameter a is subtracted from the low
word element of parameter b, saturating if underflow occurs, and the results are placed in
parameter d. Any underflow is recorded in the SPEFSCR overflow and summary overflow bits.
Other registers altered: SPEFSCR
Figure 3-740. Vector Subtract from Word Exchanged Unsigned and Saturate
(__ev_subfwxus)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evsubfwxus d,a,b
0 31 32 63
b
a
- -
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-912 Freescale Semiconductor
SPE2 Operations
__ev_subifb __ev_subifb
Vector Subtract Immediate from Byte
d = __ev_subifb (a,b)
UIMM a
d
0:7
b
0:7
- EXTZ(UIMM) // Modulo
d
8:15
b
8:15
- EXTZ(UIMM) // Modulo
d
16:23
b
16:23
- EXTZ(UIMM) // Modulo
d
4:31
b
24:31
- EXTZ(UIMM) // Modulo
d
32:39
b
32:39
- EXTZ(UIMM) // Modulo
d
40:47
b
40:47
- EXTZ(UIMM) // Modulo
d
48:55
b
48:55
- EXTZ(UIMM) // Modulo
d
56:63
b
56:63
- EXTZ(UIMM) // Modulo
The 5-bit unsigned literal provided by parameter a (UIMM) is zero-extended and subtracted from
the eight byte elements of parameter b and the results are placed in parameter d. Note that the same
value is subtracted from all elements of parameter b. UIMM is 5 bits.
Figure 3-741. Vector Subtract Immediate from Byte (__ev_subifb)
d a b Maps to
__ev64_opaque__ 5-bit unsigned literal __ev64_opaque__ evsubifb d,a,b
0 32 63
UIMM
b
24 16 8 40 48 56
-
0 32 63
d
24 16 8 40 48 56
- - - - - - -
UIMM UIMM UIMM UIMM UIMM UIMM UIMM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-913
__ev_subifh __ev_subifh
Vector Subtract Immediate from Half Word
d = __ev_subifh (a,b)
UIMM a
d
0:15
b
0:15
- EXTZ(UIMM)// Modulo difference
d
16:31
b
16:31
- EXTZ(UIMM)// Modulo difference
d
32:47
b
32:47
- EXTZ(UIMM)// Modulo difference
d
48:63
b
48:63
- EXTZ(UIMM)// Modulo difference
The 5-bit unsigned literal provided by parameter a (UIMM) is zero-extended and subtracted from
the four half word elements of parameter b and the results are placed in parameter d. Note that the
same value is subtracted from all elements of parameter b. UIMM is 5 bits.
Figure 3-742. Vector Subtract Immediate from Half Word (__ev_subifh)
d a b Maps to
__ev64_opaque__ 5-bit unsigned literal __ev64_opaque__ evsubifh d,a,b
0 31 32 63
b
15 16 47 48
-
d
UIMM
-
UIMM
-
UIMM
-
UIMM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-914 Freescale Semiconductor
SPE2 Operations
__ev_subifw __ev_subifw
Vector Subtract Immediate from Word
d = __ev_subifw (a,b)
UIMM a
d
0:31
b
0:31
- EXTZ(UIMM) // Modulo difference
d
32:63
b
32:63
- EXTZ(UIMM)// Modulo difference
The 5-bit unsigned literal provided by parameter a (UIMM) is zero-extended and subtracted from
both the high and low elements of parameter b. Note that the same value is subtracted from both
elements of parameter b. UIMM is 5 bits.
Figure 3-743. Vector Subtract Immediate from Word (__ev_subifw)
d a b Maps to
__ev64_opaque__ 5-bit unsigned literal __ev64_opaque__ evsubifw d,a,b
0 31 32 63
b
UIMM
d and Accumulator
UIMM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-915
__ev_sum2his[a] __ev_sum2his[a]
Vector Sum of 2 Half Words Interleaved Signed (to Accumulator)
d = __ev_sum2his (a) (A = 0)
d = __ev_sum2hisa (a) (A = 1)
d
0:31
EXTS(a
0:15
) + EXTS(a
32:47
)
d
32:63
EXTS(a
16:31
) + EXTS(a
48:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of two interleaved signed half word elements of parameter a are summed together, and the
results are placed into the word elements of parameter d. If A = 1, the result in parameter d is also
placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-744. Vector Sum of 2 Half Words Interleaved Signed (to Accumulator)
(__ev_sum2his[a])
A d a Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ evsum2his d,a
A = 1 __ev64_opaque__ __ev64_opaque__ evsum2hisa d,a
0 32 63
a
16 48
0 63
d (and Accumulator if A=1)
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-916 Freescale Semiconductor
SPE2 Operations
__ev_sum2hisaaw __ev_sum2hisaaw
Vector Sum of 2 Half Words Interleaved Signed and Accumulate into Words
d = __ev_sum2hisaaw (a)
d
0:31
ACC
0:31
+EXTS(a
0:15
) + EXTS(a
32:47
)
d
32:63
ACC
32:63
+ EXTS(a
16:31
) + EXTS(a
48:63
)
// update accumulator
ACC
0:63
d
0:63
Groups of two interleaved signed half word elements of parameter a are summed together, then
added to the corresponding word elements in the accumulator, and the results are placed into the
word elements of parameter d and the accumulator.
Other registers altered: ACC
Figure 3-745. Vector Sum of 2 Half Words Interleaved Signed and Accumulate
(__ev_sum2hisaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsum2hisaaw d,a
+
SUM
+
SUM
0 32 63
a
16 48
0 63
d and Accumulator
0 63
Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-917
__ev_sum2hs[a] __ev_sum2hs[a]
Vector Sum of 2 Half Words Signed (to Accumulator)
d = __ev_sum2hs (a) (A = 0)
d = __ev_sum2hsa (a) (A = 1)
d
0:31
EXTS(a
0:15
) + EXTS(a
16:315
)
d
32:63
EXTS(a
32:47
) + EXTS(a
48:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of two signed half word elements of parameter a are summed together, and the results are
placed into the word elements of parameter d. If A = 1, the result in parameter d is also placed into
the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-746. Vector Sum of 2 Half Words Signed (to Accumulator) (__ev_sum2hs[a])
A d a Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ evsum2hs d,a
A = 1 __ev64_opaque__ __ev64_opaque__ evsum2hsa d,a
0 32 63
a
16 48
0 63
d (and Accumulator if A=1)
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-918 Freescale Semiconductor
SPE2 Operations
__ev_sum2hsaaw __ev_sum2hsaaw
Vector Sum of 2 Half Words Signed and Accumulate into Words
d = __ev_sum2hsaaw (a)
d
0:31
ACC
0:31
+EXTS(a
0:15
) + EXTS(a
16:315
)
d
32:63
ACC
32:63
+ EXTS(a
32:47
) + EXTS(a
48:63
)
// update accumulator
ACC
0:63
d
0:63
Groups of two signed half word elements of parameter a are summed together, then added to the
corresponding word elements in the accumulator, and the results are placed into the word elements
of parameter d and the accumulator.
Other registers altered: ACC
Figure 3-747. Vector Sum of 2 Half Words Signed and Accumulate (__ev_sum2hsaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsum2hsaaw d,a
+
SUM
+
SUM
0 32 63
a
16 48
0 63
d and Accumulator
0 63
Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-919
__ev_sum2hu[a] __ev_sum2hu[a]
Vector Sum of 2 Half Words Unsigned (to Accumulator)
d = __ev_sum2hu (a) (A = 0)
d = __ev_sum2hua (a) (A = 1)
d
0:31
EXTZ(a
0:15
) + EXTZ(a
16:315
)
d
32:63
EXTZ(a
32:47
) + EXTZ(a
48:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of two unsigned half word elements of parameter a are summed together, and the results
are placed into the word elements of parameter d. If A = 1, the result in parameter d is also placed
into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-748. Vector Sum of 2 Half Words Unsigned (to Accumulator) (__ev_sum2hu[a])
A d a Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ evsum2hu d,a
A = 1 __ev64_opaque__ __ev64_opaque__ evsum2hua d,a
0 32 63
a
16 48
0 63
d (and Accumulator if A=1)
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-920 Freescale Semiconductor
SPE2 Operations
__ev_sum2huaaw __ev_sum2huaaw
Vector Sum of 2 Half Words Unsigned and Accumulate into Words
d = __ev_sum2huaaw (a)
d
0:31
ACC
0:31
+EXTZ(a
0:15
) + EXTZ(a
16:315
)
d
32:63
ACC
32:63
+ EXTZ(a
32:47
) + EXTZ(a
48:63
)
// update accumulator
ACC
0:63
d
0:63
Groups of two unsigned half word elements of parameter a are summed together, then added to
the corresponding word elements in the accumulator, and the results are placed into the word
elements of parameter d and the accumulator.
Other registers altered: ACC
Figure 3-749. Vector Sum of 2 Half Words Signed and Accumulate (__ev_sum2huaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsum2huaaw d,a
+
SUM
+
SUM
0 32 63
a
16 48
0 63
d and Accumulator
0 63
Accumulator
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-921
__ev_sum4bs[a] __ev_sum4bs[a]
Vector Sum of 4 Bytes SIgned (to Accumulator)
d = __ev_sum4bs (a) (A = 0)
d = __ev_sum4bsa (a) (A = 1)
d
0:31
EXTS(a
0:7
) + EXTS(a
8:15
) + EXTS(a
16:23
) + EXTS(a
24:31
)
d
32:63
EXTS(a
32:39
)+ EXTS(a
40:47
) + EXTS(a
48:55
) + EXTS(a
56:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of four signed byte elements of parameter a are summed together and the results are placed
into the word elements of parameter d. If A = 1, the result in parameter d is also placed into the
accumulator.
Other registers altered: ACC (if A=1)
Figure 3-750. Vector Sum of 4 Bytes Signed (to Accumulator) (__ev_sum4bs[a])
A d a Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ evsum4bs d,a
A = 1 __ev64_opaque__ __ev64_opaque__ evsum4bsa d,a
0 32 63
a
24 16 8 40 48 56
0 63
d (and Accumulator if A=1)
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-922 Freescale Semiconductor
SPE2 Operations
__ev_sum4bsaaw __ev_sum4bsaaw
Vector Sum of 4 Bytes Signed and Accumulate into Words
d = __ev_sum4bsaaw (a)
d
0:31
ACC
0:31
+ EXTS(a
0:7
) + EXTS(a
8:15
) + EXTS(a
16:23
) + EXTS(a
24:31
)
d
32:63
ACC
32:63
+ EXTS(a
32:39
)+ EXTS(a
40:47
) + EXTS(a
48:55
) + EXTS(a
56:63
)
// update accumulator
ACC
0:63
d
0:63
Groups of four signed byte elements of parameter a are summed together, then added to the
respective word elements of the accumulator, and the result is placed in parameter d and the
accumulator.
Other registers altered: ACC
Figure 3-751. Vector Sum of 4 Bytes Signed and Accumulate into Words
(__ev_sum4bsaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsum4bsaaw d,a
0 32 63
a
24 16 8 40 48 56
0 63
d and Accumulator
0 63
Accumulator
+
SUM
+
SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-923
__ev_sum4bu[a] __ev_sum4bu[a]
Vector Sum of 4 Bytes Unsigned (to Accumulator)
d = __ev_sum4bu (a) (A = 0)
d = __ev_sum4bua (a) (A = 1)
d
0:31
EXTZ(a
0:7
) + EXTZ(a
8:15
) + EXTZ(a
16:23
) + EXTZ(a
24:31
)
d
32:63
EXTZ(a
32:39
)+ EXTZ(a
40:47
) + EXTZ(a
48:55
) + EXTZ(a
56:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
Groups of four unsigned byte elements of parameter a are summed together and the results are
placed into the word elements of parameter d. If A = 1, the result in parameter d is also placed into
the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-752. Vector Sum of 4 Bytes Unsigned (to Accumulator) (__ev_sum4bs[a])
A d a Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ evsum4bu d,a
A = 1 __ev64_opaque__ __ev64_opaque__ evsum4bua d,a
0 32 63
a
24 16 8 40 48 56
0 63
d (and Accumulator if A=1)
SUM SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-924 Freescale Semiconductor
SPE2 Operations
__ev_sum4buaaw __ev_sum4buaaw
Vector Sum of 4 Bytes Unsigned and Accumulate into Words
d = __ev_sum4buaaw (a)
d
0:31
ACC
0:31
+ EXTZ(a
0:7
) + EXTZ(a
8:15
) + EXTZ(a
16:23
) + EXTZ(a
24:31
)
d
32:63
ACC
32:63
+ EXTZ(a
32:39
)+ EXTZ(a
40:47
) + EXTZ(a
48:55
) + EXTZ(a
56:63
)
// update accumulator
ACC
0:63
d
0:63
Groups of four unsigned byte elements of parameter a are summed together, then added to the
respective word elements of the accumulator, and the result is placed in parameter d and the
accumulator.
Other registers altered: ACC
Figure 3-753. Vector Sum of 4 Bytes Unsigned and Accumulate into Words
(__ev_sum4buaaw)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsum4buaaw d,a
0 32 63
a
24 16 8 40 48 56
0 63
d and Accumulator
0 63
Accumulator
+
SUM
+
SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-925
__ev_sumws[a] __ev_sumws[a]
Vector Sum of Words Signed (to Accumulator)
d = __ev_sumws (a) (A = 0)
d = __ev_sumwsa (a) (A = 1)
d
0:63
EXTS(a
0:31
) + EXTS(a
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The signed word elements of parameter a are summed together, and the result is placed in
parameter d. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-754. Vector Sum of Words Signed (to Accumulator) (__ev_sumws)
A d a Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ evsumws d,a
A = 1 __ev64_opaque__ __ev64_opaque__ evsumwsa d,a
0 32 63
a
0 63
d (and Accumulator if A=1)
SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-926 Freescale Semiconductor
SPE2 Operations
__ev_sumwsaa __ev_sumwsaa
Vector Sum of Words Signed and Accumulate
d = __ev_sumwsaa (a)
d
0:63
EXTS(a
0:31
) + EXTS(a
32:63
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The signed word elements of parameter a are summed together then added to the contents of the
accumulator, and the result is placed in parameter d and the accumulator.
Other registers altered: ACC
Figure 3-755. Vector Sum of Words Signed and Accumulate (__ev_sumwsaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsumwsaa d,a
0 63
d and Accumulator
0 63
Accumulator
+
0 32 63
a
SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-927
__ev_sumwu[a] __ev_sumwu[a]
Vector Sum of Words Unsigned (to Accumulator)
d = __ev_sumwu (a) (A = 0)
d = __ev_sumwua (a) (A = 1)
d
0:63
EXTZ(a
0:31
) + EXTZ(a
32:63
)
// update accumulator
if A = 1 then ACC
0:63
d
0:63
The unsigned word elements of parameter a are summed together, and the result is placed in
parameter d. If A = 1, the result in parameter d is also placed into the accumulator.
Other registers altered: ACC (if A=1)
Figure 3-756. Vector Sum of Words Unsigned (to Accumulator) (__ev_sumwu)
A d a Maps to
A = 0 __ev64_opaque__ __ev64_opaque__ evsumwu d,a
A = 1 __ev64_opaque__ __ev64_opaque__ evsumwua d,a
0 32 63
a
0 63
d (and Accumulator if A=1)
SUM
SPE2 Programming Interface Manual, Rev. 1.0-2
3-928 Freescale Semiconductor
SPE2 Operations
__ev_sumwuaa __ev_sumwuaa
Vector Sum of Words Unsigned and Accumulate
d = __ev_sumwuaa (a)
d
0:63
EXTZ(a
0:31
) + EXTZ(a
32:63
) + ACC
0:63
// update accumulator
ACC
0:63
d
0:63
The unsigned word elements of parameter a are summed together then added to the contents of the
accumulator, and the result is placed in parameter d and the accumulator.
Other registers altered: ACC
Figure 3-757. Vector Sum of Words Unsigned and Accumulate (__ev_sumwuaa)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evsumwuaa d,a
0 63
d and Accumulator
0 63
Accumulator
+
0 32 63
a
SUM
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-929
__ev_swapbhilo __ev_swapbhilo
Vector Swap Bytes High/Low
d = __ev_swapbhilo (a,b)
d
0:7
a
8:15
d
8:15
a
0:7
d
16:23
a
24:31
d
24:31
a
16:23
d
32:39
b
40:47
d
40:47
b
32:39
d
48:55
b
56:63
d
56:63
b
48:55
Pairs of byte elements within the high half words of parameter a and the low half words of
parameter b are swapped and placed in parameter d.
Figure 3-758. Vector Swap Bytes High/Low (__ev_swapbhilo)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswapbhilo d,a,b
0 32 63
d
24 16 8 40 48 56
0 32 63
a
24 16 8 40 48 56
0 32 63
b
24 16 8 40 48 56
SPE2 Programming Interface Manual, Rev. 1.0-2
3-930 Freescale Semiconductor
SPE2 Operations
__ev_swapblohi __ev_swapblohi
Vector Swap Bytes Low/High
d = __ev_swapblohi (a,b)
d
0:7
a
40:47
d
8:15
a
32:39
d
16:23
a
56:63
d
24:31
a
48:55
d
32:39
b
8:15
d
40:47
b
0:7
d
48:55
b
24:31
d
56:63
b
16:23
Pairs of byte elements within the low half words of parameter a and the high half words of
parameter b are swapped and placed in parameter d.
Figure 3-759. Vector Swap Bytes Low/High (__ev_swapblohi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswapblohi d,a,b
0 32 63
d
24 16 8 40 48 56
0 32 63
a
24 16 8 40 48 56
0 32 63
b
24 16 8 40 48 56
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-931
__ev_swaphe __ev_swaphe
Vector Swap Half Words Even
d = __ev_swaphe (a,b)
d
0:63
a
32:47
|| b
16:31
|| a
0:15
|| b
48:63
The even half word elements in parameter a are swapped and then merged with the odd half word
elements in parameter b and placed into parameter d.
Figure 3-760. Vector Swap Half Words Even (__ev_swaphe)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswaphe d,a,b
b
15 16 47 48 0 31 32 63
a
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-932 Freescale Semiconductor
SPE2 Operations
__ev_swaphhi __ev_swaphhi
Vector Swap Half Words High
d = __ev_swaphhi (a,b)
d
0:63
a
16:31
|| a
0:15
|| b
32:47
|| b
48:63
The most significant two half word elements in parameter a are swapped and merged with the least
significant two half word elements in parameter b and placed into parameter d.
Figure 3-761. Vector Swap Half Words High (__ev_swaphhi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswaphhi d,a,b
b
15 16 47 48 0 31 32 63
a
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-933
__ev_swaphhilo __ev_swaphhilo
Vector Swap Half Words High/Low
d = __ev_swaphhilo (a,b)
d
0:63
a
16:31
|| a
0:15
|| b
48:63
|| b
32:48
The most significant two half word elements in parameter a are swapped are merged with the
swapped least significant two half word elements in parameter b and placed into parameter d.
Figure 3-762. Vector Swap Half Words High/Low (__ev_swaphhilo)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswaphhilo d,a,b
b
15 16 47 48 0 31 32 63
a
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-934 Freescale Semiconductor
SPE2 Operations
__ev_swaphlo __ev_swaphlo
Vector Swap Half Words Low
d = __ev_swaphlo (a,b)
d
0:63
b
0:15
|| b
16:31
|| a
48:63
|| a
32:47
The least significant two half word elements in parameter a are swapped and then merged with the
most significant two half word elements in parameter b and placed into parameter d.
Figure 3-763. Vector Swap Half Words Low (__ev_swaphlo)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswaphlo d,a,b
b
15 16 47 48 0 31 32 63
a
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-935
__ev_swaphlohi __ev_swaphlohi
Vector Swap Half Words Low/High
d = __ev_swaphlohi (a,b)
d
0:63
a
48:63
|| a
32:47
|| b
16:31
|| b
0:15
The least significant two half word elements in parameter a are swapped and merged with the
swapped most significant two half word elements in parameter b and placed into parameter d.
Figure 3-764. Vector Swap Half Words Low/High (__ev_swaphlohi)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswaphlohi d,a,b
b
15 16 47 48 0 31 32 63
a
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-936 Freescale Semiconductor
SPE2 Operations
__ev_swapho __ev_swapho
Vector Swap Half Words Odd
d = __ev_swapho (a,b)
d
0:63
b
0:15
|| a
48:63
|| b
32:47
|| a
16:31
The odd half word elements in parameter a are swapped and then merged with the even half word
elements in parameter b and placed into parameter d.
Figure 3-765. Vector Swap Half Words Odd (__ev_swapho)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evswapho d,a,b
b
15 16 47 48 0 31 32 63
a
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-937
__ev_unpkhibsi __ev_unpkhibsi
Vector Unpack High Bytes as Signed Integers
d = __ev_unpkhibsi (a)
d
0:15
EXTS(a
0:7
)
d
16:31
EXTS(a
8:15
)
d
32:47
EXTS(a
16:23
)
d
48:63
EXTS(a
24:31
)
The high four signed byte elements in parameter a are sign-extended into half words and placed
in the same order into parameter d.
Figure 3-766. Vector Unpack High Bytes as Signed Integer (__ev_unpkhibsi)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpkhibsi d,a
0 32 63
d
16 48
0 32 63
a
24 16 8 40 48 56
sss........s sss........s sss........s sss........s
SPE2 Programming Interface Manual, Rev. 1.0-2
3-938 Freescale Semiconductor
SPE2 Operations
__ev_unpkhibui __ev_unpkhibui
Vector Unpack High Bytes as Unsigned Integers
d = __ev_unpkhibui (a)
d
0:15
EXTZ(a
0:7
)
d
16:31
EXTZ(a
8:15
)
d
32:47
EXTZ(a
16:23
)
d
48:63
EXTZ(a
24:31
)
The high four unsigned byte elements in parameter a are zero-extended into half words and placed
in the same order into parameter d.
Figure 3-767. Vector Unpack High Bytes as Unsigned Integers (__ev_unpkhibui)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpkhibui d,a
0 32 63
d
16 48
0 32 63
a
24 16 8 40 48 56
zzz......z zzz......z zzz......z zzz......z
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-939
__ev_unpkhihf __ev_unpkhihf
Vector Unpack High Half Words as Fractional
d = __ev_unpkhihf (a)
d
0:31
a
0:15
||
16
0
d
32:63
a
16:31
||
16
0
The half word elements 0 and 1 in parameter a are padded with 16 zeros, and the results are placed
into word elements 0 and 1 respectively of parameter d.
Figure 3-768. Vector Unpack High Half Words as Fractional (__ev_unpkhihf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpkhihf d,a
0000_0000_0000_0000 0000_0000_0000_0000
0 31 32 63
a
d
15 16 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-940 Freescale Semiconductor
SPE2 Operations
__ev_unpkhihsi __ev_unpkhihsi
Vector Extract High Half Words as Signed Integers
d = __ev_unpkhihsi (a)
d
0:31
EXTS(a
0:15
)
d
32:63
EXTS(a
16:31
)
The half word elements 0 and 1 in parameter a are sign-extended to 32 bits, and the results are
placed into word elements 0 and 1 respectively of parameter d.
Figure 3-769. Vector Extract High Half Words as Signed Integers (__ev_unpkhihsi)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpkhihsi d,a
0 31 32 63
a
d
s
s s ssss_ssss_ssss_ssss ssss_ssss_ssss_ssss
15 16 17 47 48
s
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-941
__ev_unpkhihui __ev_unpkhihui
Vector Extract High Half Words as Signed Integers
d = __ev_unpkhihui (a)
d
0:31
EXTZ(a
0:15
)
d
32:63
EXTZ(a
16:31
)
The half word elements 0 and 1 in parameter a are zero-extended to 32 bits, and the results are
placed into word elements 0 and 1 respectively of parameter d.
Figure 3-770. Vector Extract High Half Words as Unsigned Integers (__ev_unpkhihui)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpkhihui d,a
0 31 32 63
a
d 0000_0000_0000_0000 0000_0000_0000_0000
15 16 17 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-942 Freescale Semiconductor
SPE2 Operations
__ev_unpkhiwgsf __ev_unpkhiwgsf
Vector Unpack High Word to Guarded Signed Fraction
d = __ev_unpkhiwgsf (a)
d
0:63
(EXTS
48
(a
0:31
) ||
16
0)
The even word element of parameter a is sign-extended with 16 guard bits and padded with 16 0s,
and the result is placed into parameter d.
Note: __ev_unpkhiwgsf is used to convert a 1.31 fractional word to a 17.47 fractional format.
Figure 3-771. Vector Unpack High Word to Guarded Signed Fraction (__ev_unpkhiwgsf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpkhiwgsf d,a
31 32 63
a
0
S
0
d
47 48 63 15 16 17
.
S........................S
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-943
__ev_unpklobsi __ev_unpklobsi
Vector Unpack Low Bytes as Signed Integers
d = __ev_unpklobsi (a)
d
0:15
EXTS(a
32:39
)
d
16:31
EXTS(a
40:47
)
d
32:47
EXTS(a
48:55
)
d
48:63
EXTS(a
56:63
)
The low four signed byte elements in parameter a are sign-extended into half words and placed in
the same order into parameter d.
Figure 3-772. Vector Unpack Low Bytes as Signed Integers (__ev_unpklobsi)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpklobsi d,a
0 32 63
d
16 48
0 32 63
a
24 16 8 40 48 56
sss........s sss........s sss........s sss........s
SPE2 Programming Interface Manual, Rev. 1.0-2
3-944 Freescale Semiconductor
SPE2 Operations
__ev_unpklobui __ev_unpklobui
Vector Unpack Low Bytes as Unsigned Integers
d = __ev_unpklobui (a)
d
0:15
EXTZ(a
32:39
)
d
16:31
EXTZ(a
40:47
)
d
32:47
EXTZ(a
48:55
)
d
48:63
EXTZ(a
56:63
)
The low four unsigned byte elements in parameter a are zero-extended into half words and placed
in the same order into parameter d.
Figure 3-773. Vector Unpack Low Bytes as Unsigned Integers (__ev_unpklobui)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpklobui d,a
0 32 63
d
16 48
0 32 63
a
24 16 8 40 48 56
zzz......z zzz......z zzz......z zzz......z
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-945
__ev_unpklohf __ev_unpklohf
Vector Unpack Low Half Words as Fractional
d = __ev_unpklohf (a)
d
0:31
a
32:47
||
16
0
d
32:63
a
48:63
||
16
0
The half word elements 2 and 3 in parameter a are padded with 16 zeros, and the results are placed
into word elements 0 and 1 respectively of parameter d.
Figure 3-774. Vector Unpack Low Half Words as Fractional (__ev_unpklohf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpklohf d,a
0000_0000_0000_0000 0000_0000_0000_0000
0 31 32 63
a
d
15 16 17 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-946 Freescale Semiconductor
SPE2 Operations
__ev_unpklohsi __ev_unpklohsi
Vector Unpack Low Half Words as Signed Integers
d = __ev_unpklohsi (a)
d
0:31
EXTS(a
32:47
)
d
32:63
EXTS(a
48:63
)
The half word elements 2 and 3 in parameter a are sign-extended to 32 bits, and the results are
placed into word elements 0 and 1 respectively of parameter d.
Figure 3-775. Vector Unpack Low Half Words as Signed Integers (__ev_unpklohsi)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpklohsi d,a
0 31 32 63
a
d s s ssss_ssss_ssss_ssss ssss_ssss_ssss_ssss
15 16 17 47 48
s s
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-947
__ev_unpklohui __ev_unpklohui
Vector Unpack Low Half Words as Unsigned Integers
d = __ev_unpklohui (a)
d
0:31
EXTZ(a
32:47
)
d
32:63
EXTZ(a
48:63
)
The half word elements 2 and 3 in parameter a are zero-extended to 32 bits, and the results are
placed into word elements 0 and 1 respectively of parameter d.
Figure 3-776. Vector Unpack Low Half Words as Unsigned Integers (__ev_unpklohui)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpklohui d,a
0 31 32 63
a
d 0000_0000_0000_0000 0000_0000_0000_0000
15 16 17 47 48
SPE2 Programming Interface Manual, Rev. 1.0-2
3-948 Freescale Semiconductor
SPE2 Operations
__ev_unpklowgsf __ev_unpklowgsf
Vector Unpack Low Word to Guarded Signed Fraction
d = __ev_unpklowgsf (a)
d
0:63
(EXTS
48
(a
32:63
) ||
16
0)
The odd word element of parameter a is sign-extended with 16 guard bits and padded with 16 0s,
and the result is placed into parameter d.
Note: __ev_unpklowgsf is used to convert a 1.31 fractional word to a 17.47 fractional format.
Figure 3-777. Vector Unpack Low Word to Guarded Signed Fraction(__ev_unpklowgsf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evunpklowgsf d,a
31 32 63
a
0
S
0
d
47 48 63 15 16 17
.
S........................S
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-949
__ev_upper_eq __ev_upper_eq
Vector Upper Bits Equal
d = __ev_upper_eq (a,b)
if (a
0:31
= b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are equal to the upper 32 bits of
parameter b.
Figure 3-778. Vector Upper Equal(__ev_upper_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpeq x,a,b
0 31 32 63
a
b
=
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-950 Freescale Semiconductor
SPE2 Operations
__ev_upper_gts __ev_upper_gts
Vector Upper Bits Greater Than Signed
d = __ev_upper_gts (a,b)
if (a
0:31
>
signed
b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are greater than the upper 32 bits of
parameter b.
Figure 3-779. Vector Upper Greater Than Signed (__ev_upper_gts)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpgts x,a,b
0 31 32 63
a
b
>
D
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-951
__ev_upper_gtu __ev_upper_gtu
Vector Upper Bits Greater Than Unsigned
d = __ev_upper_gtu (a,b)
if (a
0:31
>
unsigned
b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are greater than the upper 32 bits of
parameter b.
Figure 3-780. Vector Upper Greater Than Unsigned (__ev_upper_gtu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpgtu x,a,b
0 31 32 63
a
b
>
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-952 Freescale Semiconductor
SPE2 Operations
__ev_upper_lts __ev_upper_lts
Vector Upper Bits Less Than Signed
d = __ev_upper_lts (a,b)
if (a
0:31
<
signed
b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b.
Figure 3-781. Vector Upper Less Than Signed (__ev_upper_lts)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmplts x,a,b
0 31 32 63
a
b
<
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-953
__ev_upper_ltu __ev_upper_ltu
Vector Upper Bits Less Than Unsigned
d = __ev_upper_ltu (a,b)
if (a
0:31
<
unsigned
b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b.
Figure 3-782. Vector Upper Less Than Unsigned (__ev_upper_ltu)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evcmpltu x,a,b
0 31 32 63
a
b
<
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-954 Freescale Semiconductor
SPE2 Operations
__ev_xor __ev_xor
Vector XOR
d = __ev_xor (a,b)
d
0:31
a
0:31
b
0:31
// Bitwise XOR
d
32:63
a
32:63
b
32:63
// Bitwise XOR
Each element of parameters a and b is exclusive-ORed. The results are placed in parameter d.
Figure 3-783. Vector XOR (__ev_xor)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evxor d,a,b
0 31 32 63
a
b
XOR XOR
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-955
__ev_xtrb __ev_xtrb
Vector Extract Byte
d = __ev_xtrb (a,b,c)
temp
0:7
a
0+(c*8):7+(c*8)
if (b=0) then d
0:7
temp
0:7
else d
0:7
0x00
if (b=1) then d
8:15
temp
0:7
else d
8:15
0x00
if (b=2) then d
16:23
temp
0:7
else d
16:23
0x00
if (b=3) then d
24:31
temp
0:7
else d
24:31
0x00
if (b=4) then d
32:39
temp
0:7
else d
32:39
0x00
if (b=5) then d
40:47
temp
0:7
else d
40:47
0x00
if (b=6) then d
48:55
temp
0:7
else d
48:55
0x00
if (b=7) then d
56:63
temp
0:7
else d
56:63
0x00
The byte element of parameter a specified by the 3-bit unsigned literal in parameter c is placed into
the byte element of parameter d specified by the 3-bit unsigned literal in parameter b, zero-filling
the remaining bytes of parameter d. Byte 0 is the most-significant byte.
Figure 3-784. Vector Extract Byte (__ev_xtrb)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ 3-bit unsigned literal 3-bit unsigned literal evxtrb d,a,b,c
c
0 1 2 3
0 32 63
a
24 16 8 40 48 56
4 5 6 7
b
=
7
0 32 63
d
24 16 8 40 48 56
b
=
6
b
=
5
b
=
4
b
=
3
b
=
2
b
=
1
b
=
0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-956 Freescale Semiconductor
SPE2 Operations
__ev_xtrd __ev_xtrd
Vector Extract Double Word
d = __ev_xtrd (a,b,c)
offset c
n (offset * 8)
temp
0:127
a
0:63
|| b
0:63
d
0:63
temp
0+n:63+n
A double word is extracted from the concatenation of parameter a and parameter b beginning at
byte offset provided by the 3-bit unsigned literal in parameter c and placed into parameter d.
offset must be in the range [1:7].
Figure 3-785. Vector Extract Double Word (__ev_xtrd)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ 3-bit unsigned literal evxtrd d,a,b,c
c (offset)
b
d
a
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-957
__ev_xtrh __ev_xtrh
Vector Extract Half Word
d = __ev_xtrh (a,b,c)
temp
0:15
a
0+(c*16):15+(c*16)
if (b=0) then d
0:15
temp
0:15
else d
0:15
0x0000
if (b=1) then d
16:31
temp
0:15
else d
16:31
0x0000
if (b=2) then d
32:47
temp
0:15
else d
32:47
0x0000
if (b=3) then d
48:63
temp
0:15
else d
48:63
0x0000
The half word element of parameter a specified by the 2-bit unsigned literal in parameter c is
placed into the half word element of parameter d specified by the 2-bit unsigned literal in
parameter b, zero-filling the remaining half words of parameter d. Half word 0 is the
most-significant half word.
Figure 3-786. Vector Extract Half Word (__ev_xtrh)
d a b c Maps to
__ev64_opaque__ __ev64_opaque__ 2-bit unsigned literal 2-bit unsigned literal evxtrh d,a,b,c
15 16 47 48 0 31 32 63
d
15 16 47 48 0 31 32 63
a
c
0 1 2 3
0 0 0 0
b=3 b=2 b=1 b=0
SPE2 Programming Interface Manual, Rev. 1.0-2
3-958 Freescale Semiconductor
SPE2 Operations
3.7.2 EFP2 Intrinsic Definitions
This section provides the intrinsic definitions for the Embedded Floating-Point Version 2 (EFP2)
APU.
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-959
__ev_all_fs_eq __ev_all_fs_eq
Vector All Floating-Point Equal
d = __ev_all_fs_eq (a,b)
if ( (a
0:31
= b
0:31
) & (a
32:63
= b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are equal to the upper 32 bits of
parameter b and the lower 32 bits of parameter a are equal to the lower 32 bits of parameter b.
Figure 3-787. Vector All Floating-Point Equal (__ev_all_fs_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmplt x,a,b
0 31 32 63
a
b
OR
AND
x
&
d
=
=
SPE2 Programming Interface Manual, Rev. 1.0-2
3-960 Freescale Semiconductor
SPE2 Operations
__ev_all_fs_gt __ev_all_fs_gt
Vector All Floating-Point Greater Than
d = __ev_all_fs_gt (a,b)
if ( a
0:31
> b
0:31
) & (a
32:63
> b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are greater than the upper
32 bits of parameter b and the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b.
Figure 3-788. Vector All Floating-Point Greater Than (__ev_all_fs_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmpgt x,a,b
0 31 32 63
a
b
> >
OR
AND
x
&
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-961
__ev_all_fs_lt __ev_all_fs_lt
Vector All Floating-Point Less Than
d = __ev_all_fs_lt (a,b)
if ( (a
0:31
< b
0:31
) & (a
32:63
< b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b, and the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
Figure 3-789. Vector All Floating-Point Less Than (__ev_all_fs_lt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmplt x,a,b
0 31 32 63
a
b
< <
OR
AND
x
&
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-962 Freescale Semiconductor
SPE2 Operations
__ev_all_fs_tst_eq __ev_all_fs_tst_eq
Vector All Floating-Point Test Equal
d = __ev_all_fs_tst_eq (a,b)
if ( (a
0:31
=
unsigned
b
0:31
) & (a
32:63
=
unsigned
b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are equal to the upper 32 bits of
parameter b, and the lower 32 bits of parameter a are equal to the lower 32 bits of parameter b.
This intrinsic differs from __ev_all_fs_eq because no exceptions are taken during its execution. If
strict IEEE 754 compliance is required, use __ev_all_fs_eq instead.
Figure 3-790. Vector All Floating-Point Test Equal (__ev_all_fs_tst_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststeq x,a,b
0 31 32 63
a
b
= =
x
&
d
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-963
__ev_all_fs_tst_gt __ev_all_fs_tst_gt
Vector All Floating-Point Test Greater Than
d = __ev_all_fs_tst_gt (a,b)
if ( (a
0:31
> b
0:31
) & (a
32:63
> b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are greater than the upper 32 bits
of parameter b and the lower 32 bits of parameter a are greater than the lower 32 bits of parameter
b. This intrinsic differs from __ev_all_fs_gt because no exceptions are taken during its execution.
If strict IEEE 754 compliance is required, use __ev_all_fs_gt instead.
Figure 3-791. Vector All Floating-Point Test Greater Than (__ev_all_fs_tst_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststgt x,a,b
0 31 32 63
a
b
> >
x
&
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-964 Freescale Semiconductor
SPE2 Operations
__ev_all_fs_tst_lt __ev_all_fs_tst_lt
Vector All Floating-Point Test Less Than
d = __ev_all_fs_tst_lt (a,b)
if ( (a
0:31
< b
0:31
) & (a
32:63
< b
32:63
)) then d true
else d false
This intrinsic returns true if both the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b and the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
This intrinsic differs from __ev_all_fs_lt because no exceptions are taken during its execution. If
strict IEEE 754 compliance is required, use __ev_all_fs_lt instead.
Figure 3-792. Vector All Floating-Point Test Less Than (__ev_all_fs_tst_lt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststlt x,a,b
0 31 32 63
a
b
<
<
x
&
d
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-965
__ev_any_fs_eq __ev_any_fs_eq
Vector Any Floating-Point Equal
d = __ev_any_fs_eq (a,b)
if ( (a
0:31
= b
0:31
) | (a
32:63
= b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are equal to the upper 32 bits
of parameter b or the lower 32 bits of parameter a are equal to the lower 32 bits of parameter b.
Figure 3-793. Vector Any Floating-Point Equal (__ev_any_fs_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmpeq x,a,b
0 31 32 63
a
b
= =
x
|
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-966 Freescale Semiconductor
SPE2 Operations
__ev_any_fs_gt __ev_any_fs_gt
Vector Any Floating-Point Greater Than
d = __ev_any_fs_gt (a,b)
if ( (a
0:31
> b
0:31
) | (a
32:63
> b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are greater than the upper 32
bits of parameter b or the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b.
Figure 3-794. Vector Any Floating-Point Greater Than (__ev_any_fs_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmpgt x,a,b
0 31 32 63
a
b
> >
x
|
d
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-967
__ev_any_fs_lt __ev_any_fs_lt
Vector Any Floating-Point Less Than
d = __ev_any_fs_lt (a,b)
if ( (a
0:31
< b
0:31
) | (a
32:63
< b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are less than the upper 32 bits
of parameter b or the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
Figure 3-795. Vector Any Floating-Point Less Than (__ev_any_fs_lt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmplt x,a,b
0 31 32 63
a
b
< <
x
|
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-968 Freescale Semiconductor
SPE2 Operations
__ev_any_fs_tst_eq __ev_any_fs_tst_eq
Vector Any Floating-Point Test Equal
d = __ev_any_fs_tst_eq (a,b)
if ( (a
0:31
= b
0:31
) | (a
32:63
= b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are equal to the upper 32 bits
of parameter b or the lower 32 bits of parameter a are equal to the lower 32 bits of parameter b.
This intrinsic differs from __ev_any_fs_eq because no exceptions are taken during its execution.
If strict IEEE 754 compliance is required, use __ev_any_fs_eq instead.
Figure 3-796. Vector Any Floating-Point Test Equal (__ev_any_fs_tst_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststeq x,a,b
0 31 32 63
a
b
x
|
d
=
=
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-969
__ev_any_fs_tst_gt __ev_any_fs_tst_gt
Vector Any Floating-Point Test Greater Than
d = __ev_any_fs_tst_gt (a,b)
if ( (a
0:31
> b
0:31
) | (a
32:63
> b
2:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are greater than the upper 32
bits of parameter b or the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b. This intrinsic differs from __ev_any_fs_gt because no exceptions are taken during its
execution. If strict IEEE 754 compliance is required, use __ev_any_fs_gt instead.
Figure 3-797. Vector Any Floating-Point Test Greater Than (__ev_any_fs_tst_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststgt x,a,b
0 31 32 63
a
b
> >
x
|
d
OR
AND
SPE2 Programming Interface Manual, Rev. 1.0-2
3-970 Freescale Semiconductor
SPE2 Operations
__ev_any_fs_tst_lt __ev_any_fs_tst_lt
Vector Any Floating-Point Test Less Than
d = __ev_any_fs_tst_lt (a,b)
if ( (a
0:31
< b
0:31
) || (a
32:63
< b
32:63
)) then d true
else d false
This intrinsic returns true if either the upper 32 bits of parameter a are less than the upper 32 bits
of parameter b or the lower 32 bits of parameter a are less than the lower 32 bits of parameter b.
This intrinsic differs from __ev_any_fs_lt because no exceptions are taken during its execution. If
strict IEEE 754 compliance is required, use __ev_any_fs_lt instead.
Figure 3-798. Vector Any Floating-Point Test Less Than (__ev_any_fs_tst_lt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststlt x,a,b
0 31 32 63
a
b
< <
x
|
d
OR
AND
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-971
__ev_fsabs __ev_fsabs
Vector Floating-Point Absolute Value
d = __ev_fsabs (a)
d
0:31
0b0 || a
1:31
d
32:63
0b0 || a
33:63
The signed bits of each element of parameter a are cleared, and the result is placed into parameter
d. No exceptions are taken during the execution of this instruction.
Figure 3-799. Vector Floating-Point Absolute Value (__ev_fsabs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsabs d,a
0 31 32 63
a
d
ABS ABS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-972 Freescale Semiconductor
SPE2 Operations
__ev_fsadd __ev_fsadd
Vector Floating-Point Add
d = __ev_fsadd (a,b)
d
0:31
a
0:31
+
sp
b
0:31
d
32:63
a
32:63
+
sp
b
32:63
The single-precision floating-point value of each element of parameter a is added to the
corresponding element in parameter b, and the results are placed in parameter d.
If an overflow condition is detected or the contents of parameters a or b are NaN or Infinity, the
result is an appropriately signed maximum floating-point value.
If an underflow condition is detected, the result is an appropriately signed floating-point 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a or parameter b are +inf, -inf, Denorm, or NaN
FOFV, FOFVH if an overflow occurs
FUNF, FUNFH if an underflow occurs
FINXS, FG, FGH, FX, FXH if the result is inexact or overflow occurred and overflow
exceptions are disabled
Figure 3-800. Vector Floating-Point Add (__ev_fsadd)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsadd d,a,b
0 31 32 63
a
b
+
sp
+
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-973
__ev_fsaddsub __ev_fsaddsub
Vector Floating-Point Single-Precision Add / Subtract
d = __ev_fsaddsub (a,b)
d
0:31
a
0:31
+
sp
b
0:31
d
32:63
a
32:63
-
sp
b
32:63
The high order single-precision floating-point element of parameter a is added to the
corresponding element of parameter b, the low order single-precision floating-point element of
parameter b is subtracted from the corresponding element of parameter a, and the results are stored
in parameter d. If an element of parameter a is NaN or infinity, the corresponding result is either
pmax or nmax (as appropriate). Otherwise, if an element of parameter b is NaN or infinity, the
corresponding result is either pmax or nmax (as appropriate). Otherwise, if an overflow occurs,
pmax or nmax (as appropriate) is stored in the corresponding element of parameter d. If an
underflow occurs, +0 (for rounding modes RN, RZ, RP) or -0 (for rounding mode RM) is stored
in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or parameter b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-801. Vector Floating-Point Single-Precision Add / Subtract (__ev_fsaddsub)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsaddsub d,a,b
0 31 32 63
a
b
+
sp
-
sp
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-974 Freescale Semiconductor
SPE2 Operations
__ev_fsaddsubx __ev_fsaddsubx
Vector Floating-Point Single-Precision Add / Subtract Exchanged
d = __ev_fsaddsubx (a,b)
d
0:31
a
32:63
+
sp
b
0:31
d
32:63
a
0:31
-
sp
b
32:63
The high-order single-precision floating-point element of parameter b is added to the low-order
element of parameter a, the low-order single-precision floating-point element of parameter b is
subtracted from the high-order element of parameter a, and the results are stored in parameter d.
If an element of parameter a is NaN or infinity, the corresponding result is either pmax or nmax
(as appropriate). Otherwise, if an element of parameter b is NaN or infinity, the corresponding
result is either pmax or nmax (as appropriate). Otherwise, if an overflow occurs, pmax or nmax (as
appropriate) is stored in the corresponding element of parameter d. If an underflow occurs, +0 (for
rounding modes RN, RZ, RP) or -0 (for rounding mode RM) is stored in the corresponding
element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-802. Vector Floating-Point Add / Subtract Exchanged (__ev_fsaddsubx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsaddsubx d,a,b
0 31 32 63
a
b
+
sp
-
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-975
__ev_fsaddx __ev_fsaddx
Vector Floating-Point Single-Precision Add Exchanged
d = __ev_fsaddx (a,b)
d
0:31
a
32:63
+
sp
b
0:31
d
32:63
a
0:31
+
sp
b
32:63
The high-order single-precision floating-point element of parameter b is added to the low-order
element of parameter a, the low-order single-precision floating-point element of parameter b is
added to the high-order element of parameter a, and the results are stored in parameter d. If an
element of parameter a is NaN or infinity, the corresponding result is either pmax or nmax (as
appropriate). Otherwise, if an element of parameter b is NaN or infinity, the corresponding result
is either pmax or nmax (as appropriate). Otherwise, if an overflow occurs, pmax or nmax (as
appropriate) is stored in the corresponding element of parameter d. If an underflow occurs, +0 (for
rounding modes RN, RZ, RP) or -0 (for rounding mode RM) is stored in the corresponding
element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-803. Vector Floating-Point Add Exchanged (__ev_fsaddx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsaddx d,a,b
0 31 32 63
a
b
+
sp
+
sp
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-976 Freescale Semiconductor
SPE2 Operations
__ev_fscfsf __ev_fscfsf
Vector Convert Floating-Point from Signed Fraction
d = __ev_fscfsf (a)
d
0:31
CnvtI32ToFP32Sat(a
0:31
, SIGN, UPPER, F)
d
32:63
CnvtI32ToFP32Sat(a
32:63
, SIGN, LOWER, F)
The signed fractional values in each element of parameter a are converted to the nearest
single-precision floating-point value using the current rounding mode and placed in parameter d.
The following status bits are set in the SPEFSCR:
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-804. Vector Convert Floating-Point from Signed Fraction (__ev_fscfsf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfscfsf d,a
0 31 32 63
b
d
s
s
s
Exponent Mantissa s Exponent Mantissa
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-977
__ev_fscfsi __ev_fscfsi
Vector Convert Floating-Point from Signed Integer
d = __ev_fscfsi (a)
d
0:31
CnvtSI32ToFP32Sat(a
0:31
, SIGN, UPPER, I)
d
32:63
CnvtSI32ToFP32Sat(a
32:63
, SIGN, LOWER, I)
The signed integer values in each element in parameter a are converted to the nearest
single-precision floating-point value using the current rounding mode and placed in parameter d.
The following status bits are set in the SPEFSCR:
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-805. Vector Convert Floating-Point from Signed Integer (__ev_fscfsi)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfscfsi d,a
0 31 32 63
b
d
s
s
s
Exponent Mantissa s Exponent Mantissa
SPE2 Programming Interface Manual, Rev. 1.0-2
3-978 Freescale Semiconductor
SPE2 Operations
__ev_fscfuf __ev_fscfuf
Vector Convert Floating-Point from Unsigned Fraction
d = __ev_fscfuf (a)
d
0:31
CnvtI32ToFP32Sat(a
0:31
, UNSIGN, UPPER, F)
d
32:63
CnvtI32ToFP32Sat(a
32:63
, UNSIGN, LOWER, F)
The unsigned fractional values in each element of parameter a are converted to the nearest
single-precision floating-point value using the current rounding mode and placed in parameter d.
The following status bits are set in the SPEFSCR:
FINXS, FG, FX if the result is inexact
Figure 3-806. Vector Convert Floating-Point from Unsigned Fraction (__ev_fscfuf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfscfuf d,a
0 31 32 63
b
d s=0 Exponent Mantissa s=0 Exponent Mantissa
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-979
__ev_fscfui __ev_fscfui
Vector Convert Floating-Point from Unsigned Integer
d = __ev_fscfui (a)
d
0:31
CnvtI32ToFP32Sat(a
031
, UNSIGN, UPPER, I)
d
32:63
CnvtI32ToFP32Sat(a
32:63
, UNSIGN, LOWER, I)
The unsigned integer value in each element of parameter a are converted to the nearest
single-precision floating-point value using the current rounding mode and placed in parameter d.
The following status bits are set in the SPEFSCR:
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-807. Vector Convert Floating-Point from Unsigned Integer (__ev_fscfui)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfscfui d,a
0 31 32 63
b
d s=0 Exponent Mantissa s=0 Exponent Mantissa
SPE2 Programming Interface Manual, Rev. 1.0-2
3-980 Freescale Semiconductor
SPE2 Operations
__ev_fsctsf __ev_fsctsf
Vector Convert Floating-Point to Signed Fraction
d = __ev_fsctsf (a)
d
0:31
CnvtFP32ToISat(a
0:31
, SIGN, UPPER, ROUND, F)
d
32:63
CnvtFP32ToISat(a
32:63
, SIGN, LOWER, ROUND, F)
The single-precision floating-point value in each element of parameter a is converted to a signed
fraction using the current rounding mode and the results are placed in parameter d. The result
saturates if it cannot be represented in a 32-bit fraction. NaNs are converted to 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a are +inf, -inf, Denorm, or NaN or parameter
a cannot be represented in the target format
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-808. Vector Convert Floating-Point to Signed Fraction (__ev_x)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsctsf d,a
0 31 32 63
a
d
s
s
s
Exponent Mantissa s Exponent Mantissa
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-981
__ev_fsctsi __ev_fsctsi
Vector Convert Floating-Point to Signed Integer
d = __ev_fsctsi (a)
d
0:31
CnvtFP32ToISat(a
0:31
, SIGN, UPPER, ROUND, I)
d
32:63
CnvtFP32ToISat(a
32:63
, SIGN, LOWER, ROUND, I)
The single-precision floating-point value in each element of parameter a is converted to a signed
integer using the current rounding mode, and the results are placed in parameter d. The result
saturates if it cannot be represented in a 32-bit integer. NaNs are converted to 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a are +inf, -inf, Denorm or NaN or parameter a
cannot be represented in the target format
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-809. Vector Convert Floating-Point to Signed Integer (__ev_fsctsi)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsctsi d,a
0 31 32 63
a
d
s
s
s
Exponent Mantissa s Exponent Mantissa
SPE2 Programming Interface Manual, Rev. 1.0-2
3-982 Freescale Semiconductor
SPE2 Operations
__ev_fsctsiz __ev_fsctsiz
Vector Convert Floating-Point to Signed Integer with Round Toward Zero
d = __ev_fsctsiz (a)
d
0:31
CnvtFP32ToISat(a
0:31
, SIGN, UPPER, TRUNC, I)
d
32:63
CnvtFP32ToISat(a
32:63
, SIGN, LOWER, TRUNC, I)
The single-precision floating-point value in each element of parameter a is converted to a signed
integer using the rounding mode Round Towards Zero, and the results are placed in parameter d.
The result saturates if it cannot be represented in a 32-bit integer. NaNs are converted to 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a are +inf, -inf, Denorm, or NaN or if parameter
a cannot be represented in the target format
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-810. Vector Convert Floating-Point to Signed Integer with Round
Toward Zero (__ev_fsctsiz)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsctsiz d,a
0 31 32 63
a
d
s
s
s
Exponent Mantissa s Exponent Mantissa
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-983
__ev_fsctuf __ev_fsctuf
Vector Convert Floating-Point to Unsigned Fraction
d = __ev_fsctuf (a)
d
0:31
CnvtFP32ToISat(a
0:31
, UNSIGN, UPPER, ROUND, F)
d
32:63
CnvtFP32ToISat(a
32:63
, UNSIGN, LOWER, ROUND, F)
The single-precision floating-point value in each element of parameter a is converted to an
unsigned fraction using the current rounding mode, and the results are placed in parameter d. The
result saturates if it cannot be represented in a 32-bit unsigned fraction. NaNs are converted to 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a are +inf, -inf, Denorm, or NaN or if parameter
a cannot be represented in the target format
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-811. Vector Convert Floating-Point to Unsigned Fraction (__ev_fsctuf)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsctuf d,a
0 31 32 63
a
d s Exponent Mantissa s Exponent Mantissa
SPE2 Programming Interface Manual, Rev. 1.0-2
3-984 Freescale Semiconductor
SPE2 Operations
__ev_fsctui __ev_fsctui
Vector Convert Floating-Point to Unsigned Integer
d = __ev_fsctui (a)
d
0:31
CnvtFP32ToISat(a
0:31
, UNSIGN, UPPER, ROUND, I)
d
32:63
CnvtFP32ToISat(a
32:63
, UNSIGN, LOWER, ROUND, I)
The single-precision floating-point value in each element of parameter a is converted to an
unsigned integer using the current rounding mode, and the results are placed in parameter d. The
result saturates if it cannot be represented in a 32-bit unsigned integer. NaNs are converted to 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a are +inf, -inf, Denorm or NaN or parameter a
cannot be represented in the target format
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-812. Vector Convert Floating-Point to Unsigned Integer (__ev_fsctui)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsctui d,a
0 31 32 63
a
d s Exponent Mantissa s Exponent Mantissa
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-985
__ev_fsctuiz __ev_fsctuiz
Vector Convert Floating-Point to Unsigned Integer with Round toward Zero
d = __ev_fsctuiz (a)
d
0:31
CnvtFP32ToISat(a
0:31
, UNSIGN, UPPER, TRUNC, I)
d
32:63
CnvtFP32ToISat(a
32:63
, UNSIGN, LOWER, TRUNC, I)
The single-precision floating-point value in each element of parameter a is converted to an
unsigned integer using the rounding mode Round towards Zero, and the results are placed in
parameter d. The result saturates if it cannot be represented in a 32-bit unsigned integer. NaNs are
converted to 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a are +inf, -inf, Denorm, or NaN or parameter
a cannot be represented in the target format
FINXS, FG, FGH, FX, FXH if the result is inexact
Figure 3-813. Vector Convert Floating-Point to Unsigned Integer with Round
Toward Zero (__ev_fsctuiz)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsctuiz d,a
0 31 32 63
a
d s Exponent Mantissa s Exponent Mantissa
SPE2 Programming Interface Manual, Rev. 1.0-2
3-986 Freescale Semiconductor
SPE2 Operations
__ev_fsdiff __ev_fsdiff
Vector Floating-Point Single-Precision Differences
d = __ev_fsdiff (a,b)
d
0:31
a
0:31
-
sp
a
32:63
d
32:63
b
0:31
-
sp
b
32:63
The low-order single-precision floating-point element of parameter a is subtracted from the
high-order element of parameter a, the low-order single-precision floating-point element of
parameter b is subtracted from the high-order element of parameter b, and the results are stored in
parameter d. If the high-order element of parameter a or b is NaN or infinity, the corresponding
result is either pmax or nmax (as appropriate). Otherwise, if the low order element of parameter a
or b is NaN or infinity, the corresponding result is either pmax or nmax (as appropriate).
Otherwise, if an overflow occurs, pmax or nmax (as appropriate) is stored in the corresponding
element of parameter d. If an underflow occurs, +0 (for rounding modes RN, RZ, RP) or -0 (for
rounding mode RM) is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-814. Vector Floating-Point Single-Precision Differences (__ev_fsdiff)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsdiff d,a,b
0 31 32 63
a
b
-
sp
-
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-987
__ev_fsdiffsum __ev_fsdiffsum
Vector Floating-Point Single-Precision Difference / Sum
d = __ev_fsdiffsum (a,b)
d
0:31
a
0:31
-
sp
a
32:63
d
32:63
b
0:31
+
sp
b
32:63
The low-order single-precision floating-point element of parameter a is subtracted from the
high-order element of parameter a, the low-order single-precision floating-point element of
parameter b is added to the high-order element of parameter b, and the results are stored in
parameter d. If the high-order element of parameter a or b is NaN or infinity, the corresponding
result is either pmax or nmax (as appropriate). Otherwise, if the low order element of parameter a
or b is NaN or infinity, the corresponding result is either pmax or nmax (as appropriate).
Otherwise, if an overflow occurs, pmax or nmax (as appropriate) is stored in the corresponding
element of parameter d. If an underflow occurs, +0 (for rounding modes RN, RZ, RP) or -0 (for
rounding mode RM) is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-815. Vector Floating-Point Single-Precision Difference / Sum (__ev_fsdiffsum)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsdiffsum d,a,b
0 31 32 63
a
b
-
sp
+
sp
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-988 Freescale Semiconductor
SPE2 Operations
__ev_fsdiv __ev_fsdiv
Vector Floating-Point Divide
d = __ev_fsdiv (a,b)
d
0:31
a
0:31
sp
b
0:31
d
32:63
a
32:63
sp
d
32:63
The single-precision floating-point value in each element of parameter a is divided by the
corresponding elements in parameter b, and the results are placed in parameter d.
If an overflow is detected, parameter b is a Denorm (or 0 value), or parameter a is a NaN or Infinity
and parameter b is a normalized number, the result is an appropriately signed maximum
floating-point value.
If an underflow is detected or parameter b is a NaN or Infinity, the result is an appropriately signed
floating-point 0.
The following status bits are set in the SPEFSCR:
FINV, FINVH if the contents of parameter a or b are +inf, -inf, Denorm, or NaN
FOFV, FOFVH if an overflow occurs
FUNV, FUNVH if an underflow occurs
FDBZS, FDBZ, FDBZH if a divide by zero occurs
FINXS, FG, FGH, FX, FXH if the result is inexact or overflow occurred and overflow
exceptions are disabled
Figure 3-816. Vector Floating-Point Divide (__ev_fsdiv)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsdiv d,a,b
0 31 32 63
a (dividends)
b (divisors)
B
sp
A B
sp
A
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-989
__ev_fsmax __ev_fsmax
Vector Floating-Point Single-Precision Maximum
d = __ev_fsmax (a,b)
ah a
0:31
bh b
0:31
if (ah < bh) then temph bh
else temph ah
d
0:31
temph
al a
32:63
bl b
32:63
if (al < bl) then templ bl
else templ al
d
32:63
templ
Each single-precison floating-point element of parameter a is compared against the corresponding
elements of parameter b. The larger element is selected and placed into the corresponding element
of parameter d. The maximum of +0 and -0 is +0.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken, and parameter d is not updated.
Otherwise, the comparison proceeds after treating NaNs, Infinities, and Denorms as normalized
numbers, using their values of e and f directly. If the selected element is denorm, the result is a
same signed zero. If the selected element is +NaN or +infinity, the corresponding result is pmax.
Otherwise, if the selected element is -NaN or -infinity, the corresponding result is nmax.
Figure 3-817. Vector Floating-Point Single-Precision Maximum (__ev_fsmax)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsmax d,a,b
0 31 32 63
a
b
Max Max
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-990 Freescale Semiconductor
SPE2 Operations
__ev_fsmin __ev_fsmin
Vector Floating-Point Single-Precision Minimum
d = __ev_fsmin (a,b)
ah a
0:31
bh b
0:31
if (ah < bh) then temph ah
else temph bh
d
0:31
temph
al a
32:63
bl b
32:63
if (al < bl) then templ al
else templ bl
d
32:63
templ
Each single-precison floating-point element of parameter a is compared against the corresponding
element of parameter b. The smaller element is selected and placed into the corresponding element
of parameter d. The minimum of +0 and -0 is -0.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken, and parameter d is not updated.
Otherwise, the comparison proceeds after treating NaNs, Infinities, and Denorms as normalized
numbers, using their values of e and f directly. If the selected element is denorm, the result is a
same signed zero. If the selected element is +NaN or +infinity, the corresponding result is pmax.
Otherwise, if the selected element is -NaN or -infinity, the corresponding result is nmax.
Figure 3-818. Vector Floating-Point Single-Precision Maximum (__ev_fsmin)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsmin d,a,b
0 31 32 63
a
b
Min Min
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-991
__ev_fsmul __ev_fsmul
Vector Floating-Point Multiply
d = __ev_fsmul (a,b)
d
0:31
a
0:31
sp
b
0:31
d
32:63
a
32:63
sp
b
32:63
Each single-precision floating-point element of parameter a is multiplied with the corresponding
element of parameter b, and the result is stored in parameter d. If an overflow is likely, pmax or
nmax is stored in parameter d. If an underflow is likely, +0, or 0 is stored in parameter d. The
following condition defines when an overflow is likely and the corresponding result for each
element of the vector:
ei = (ea 127) + (eb 127) + 127
if (sa = sb) then
if (ei 127) then r = pmax
else if (ei < 126) then r = +0
else
if (ei 127) then r = nmax
else if (ei < 126) then r = 0
If the contents of parameter a or b are +inf, inf, Denorm, QNaN, or SNaN, at least one of
the SPEFSCR[FINVH] or SPEFSCR[FINV] bits is set.
If an overflow occurs or is likely, at least one of the SPEFSCR[FOVFH] or
SPEFSCR[FOVF] bits is set.
If an underflow occurs or is likely, at least one of the SPEFSCR[FUNFH] or
SPEFSCR[FUNF] bits is set.
If the exception is enabled for the high or low element in which the error occurs, the
exception is taken.
Figure 3-819. Vector Floating-Point Multiply (__ev_fsmul)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsmul d,a,b
0 31 32 63
a
b
B x
sp
A B x
sp
A
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-992 Freescale Semiconductor
SPE2 Operations
__ev_fsmule __ev_fsmule
Vector Floating-Point Single-Precision Multiply By Even Element
d = __ev_fsmule (a,b)
d
0:31
a
0:31
sp
b
0:31
d
32:63
a
0:31
sp
b
32:63
The single-precision floating-point elements of parameter b are multiplied by the even
(high-order) element of parameter a, and the results are stored in parameter d. If an element of
parameter b or the even element of parameter a is either zero (or a denormalized number optionally
transformed to zero by the implementation), the corresponding result is a properly signed zero.
Otherwise, if an element of parameter b or the even element of parameter a is either NaN or
infinity, the corresponding result is either pmax (a
sign
==b
sign
), or nmax (a
sign
!=b
sign
). Otherwise, if
an overflow occurs, pmax or nmax (as appropriate) is stored in the corresponding element of
parameter d. If an underflow occurs, +0 or -0 (as appropriate) is stored in the corresponding
element of parameter d.
Exceptions:
If the contents of either element of parameter b or the even element of parameter a is Infinity,
Denorm, or NaN, SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are
cleared appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not
updated. Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an
underflow occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow
exceptions are enabled and a corresponding status bit is set, an interrupt is taken. If any of these
interrupts are taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow exception is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-820. Vector Floating-Point Multiply By Even Element (__ev_fsmule)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsmule d,a,b
0 31 32 63
a
b
x
sp x
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-993
__ev_fsmulo __ev_fsmulo
Vector Floating-Point Single-Precision Multiply By Odd Element
d = __ev_fsmulo (a,b)
d
0:31
a
32:63
sp
b
0:31
d
32:63
a
32:63
sp
b
32:63
The single-precision floating-point elements of parameter b are multiplied by the odd (low-order)
element of parameter a, and the results are stored in parameter d. If an element of parameter b or
the odd element of parameter a is either zero (or a denormalized number optionally transformed
to zero by the implementation), the corresponding result is a properly signed zero. Otherwise, if
an element of parameter b or the odd element of parameter a is either NaN or infinity, the
corresponding result is either pmax (a
sign
==b
sign
), or nmax (a
sign
!=b
sign
). Otherwise, if an overflow
occurs, pmax or nmax (as appropriate) is stored in the corresponding element of parameter d. If an
underflow occurs, +0 or -0 (as appropriate) is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter b or the odd element of parameter a is Infinity,
Denorm, or NaN, SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are
cleared appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not
updated. Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an
underflow occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow
exceptions are enabled and a corresponding status bit is set, an interrupt is taken. If any of these
interrupts are taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow exception is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-821. Vector Floating-Point Multiply By Even Element (__ev_fsmulo)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsmulo d,a,b
0 31 32 63
a
b
x
sp x
sp
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-994 Freescale Semiconductor
SPE2 Operations
__ev_fsmulx __ev_fsmulx
Vector Floating-Point Single-Precision Multiply Exchanged
d = __ev_fsmulx (a,b)
d
0:31
a
32:63
sp
b
0:31
d
32:63
a
0:31
sp
b
32:63
The high-order single-precision floating-point element of parameter b is multiplied by the
low-order element of parameter a, the low-order single-precision floating-point element of
parameter b is multiplied by the high-order element of parameter a, and the results are stored in
parameter d. If an element of parameter a or b is either zero (or a denormalized number optionally
transformed to zero by the implementation), the corresponding result is a properly signed zero.
Otherwise, if an element of parameter a or b is either NaN or infinity, the corresponding result is
either pmax (a
sign
==b
sign
), or nmax (a
sign
!=b
sign
). Otherwise, if an overflow occurs, pmax or nmax
(as appropriate) is stored in the corresponding element of parameter d. If an underflow occurs, +0
or -0 (as appropriate) is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow exception is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-822. Vector Floating-Point Multiply By Even Element (__ev_fsmulo)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfsmulx d,a,b
0 31 32 63
a
b
x
sp x
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-995
__ev_fsnabs __ev_fsnabs
Vector Floating-Point Negative Absolute Value
d = __ev_fsnabs (a)
d
0:31
0b1 || a
1:31
d
32:63
0b1 || a
33:63
The signed bits of each element of parameter a are all set and the result is placed into parameter
d. No exceptions are taken during the execution of this instruction.
Figure 3-823. Vector Floating-Point Negative Absolute Value (__ev_fsnabs)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsnabs d,a
0 31 32 63
a
d
NABS NABS
SPE2 Programming Interface Manual, Rev. 1.0-2
3-996 Freescale Semiconductor
SPE2 Operations
__ev_fsneg __ev_fsneg
Vector Floating-Point Negate
d = __ev_fsneg (a)
d
0:31
a
0
|| a
1:31
d
32:63
a
32
|| a
33:63
The signed bits of each element of parameter a are complemented and the result is placed into
parameter d. No exceptions are taken during the execution of this instruction.
Figure 3-824. Vector Floating-Point Negate (__ev_fsneg)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfsneg d,a
0 31 32 63
a
d s Exponent Mantissa s Exponent Mantissa
s Exponent Mantissa s Exponent Mantissa
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-997
__ev_fssqrt __ev_fssqrt
Vector Floating-Point Single-Precision Square Root
d = __ev_fssqrt (a)
d
0:31
SQRT(a
0:31
)
d
32:63
SQRT( a
32:63
)
The square root of each single-precision floating-point element of parameter a is calculated, and
the results are stored in parameter d. If an element of parameter a is zero or denorm, the result is
a same signed zero. If an element of parameter a is +NaN or +infinity, the corresponding result is
pmax. Otherwise, if an element of parameter a is non-zero and has a negative sign, including -NaN
or -infinity, the corresponding result is TBD. Otherwise, if an underflow occurs, +0 (for rounding
modes RN, RZ, RP) or -0 (for rounding mode RM) is stored in the corresponding element of
parameter d.
Exceptions:
If the contents of either element of parameter a are non-zero and have a negative sign, or are
Infinity, Denorm, or NaN, SPEFSCR
FINV,FINVH
are set appropriately, and
SPEFSCR
FGH,FXH,FG,FX
are cleared appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken
and parameter d is not updated. Otherwise, if an underflow occurs, SPEFSCR
FUNF,FUNFH
are set
appropriately. If underflow exceptions are enabled and a corresponding status bit is set, an
interrupt is taken. If any of these interrupts are taken, parameter d is not updated.
If either result element of this instruction is inexact, or underflows but underflow exceptions are
disabled, and no other interrupt is taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point
inexact exception is enabled, an interrupt is taken using the floating-point round interrupt vector.
In this case, parameter d is updated with the truncated result(s). The FG and FX bits are properly
updated to allow rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an underflow interrupt is taken, or if an invalid
operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-825. Vector Floating-Point Multiply By Even Element (__ev_fssqrt)
d a Maps to
__ev64_opaque__ __ev64_opaque__ evfssqrt d,a
0 31 32 63
a
SQRT
SQRT
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-998 Freescale Semiconductor
SPE2 Operations
__ev_fssub __ev_fssub
Vector Floating-Point Subtract
d = __ev_fssub (a,b)
d
0:31
a
0:31
sp
b
0:31
d
32:63
a
32:63
sp
b
32:63
Each single-precision floating-point element of parameter b is subtracted from the corresponding
element of parameter a and the result is stored in parameter d. If an overflow is likely, pmax or
nmax is stored in parameter d. If an underflow is likely, +0 or 0 is stored in parameter d. The
following condition defines how boundary cases of inputs (+inf, inf, Denorm, QNaN, SNaN) are
treated, when an overflow is likely, and the corresponding result for each element of the vector:
if ((sa = 0) & (sb = 1)) then
if (max(ea, eb) 127) then r = pmax
else if ((sa = 1) & (sb = 0)) then
if (max(ea, eb) 127) then r = nmax
else if (sa = sb) then
// Boundary case to be defined later
If the contents of parameter a or b are +inf, inf, Denorm, QNaN, or SNaN, at least one of
the SPEFSCR[FINVH] or SPEFSCR[FINV] bits is set.
If an overflow occurs or is likely, the SPEFSCR[FOVFH] or SPEFSCR[FOVF] bits is set.
If an underflow occurs or is likely, at least one of the SPEFSCR[FUNFH] or
SPEFSCR[FUNF] bits is set.
If the exception is enabled for the high or low element in which the error occurs, the
exception is taken.
Figure 3-826. Vector Floating-Point Subtract (__ev_fssub)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfssub d,a,b
0 31 32 63
a
b
sp
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-999
__ev_fssubadd __ev_fssubadd
Vector Floating-Point Single-Precision Subtract / Add
d = __ev_fssubadd (a,b)
d
0:31
a
0:31
-
sp
b
0:31
d
32:63
a
32:63
+
sp
b
32:63
The high-order floating-point element of parameter b is subtracted from the corresponding
element of parameter a, the low-order floating-point element of parameter b is subtracted from the
corresponding element of parameter a, and the results are stored in parameter d. If an element of
parameter a is NaN or infinity, the corresponding result is either pmax or nmax (as appropriate).
Otherwise, if an element of parameter b is NaN or infinity, the corresponding result is either nmax
or pmax (as appropriate). Otherwise, if an overflow occurs, pmax or nmax (as appropriate) is stored
in the corresponding element of parameter d. If an underflow occurs, +0 (for rounding modes RN,
RZ, RP) or -0 (for rounding mode RM) is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-827. Vector Floating-Point Subtract / Add(__ev_fssubadd)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfssubadd d,a,b
0 31 32 63
a
b
sp +
sp
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1000 Freescale Semiconductor
SPE2 Operations
__ev_fssubaddx __ev_fssubaddx
Vector Floating-Point Single-Precision Subtract / Add Exchanged
d = __ev_fssubaddx (a,b)
d
0:31
a
32:63
-
sp
b
0:31
d
32:63
a
0:31
+
sp
b
32:63
The high-order floating-point element of parameter b is subtracted from the low-order element of
parameter a, the low-order floating-point element of parameter b is added to the high-order from
the corresponding element of parameter a, and the results are stored in parameter d. If an element
of parameter a is NaN or infinity, the corresponding result is either pmax or nmax (as appropriate).
Otherwise, if an element of parameter b is NaN or infinity, the corresponding result is either nmax
or pmax (as appropriate). Otherwise, if an overflow occurs, pmax or nmax (as appropriate) is stored
in the corresponding element of parameter d. If an underflow occurs, +0 (for rounding modes RN,
RZ, RP) or -0 (for rounding mode RM) is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-828. Vector Floating-Point Subtract / Add Exchanged (__ev_fssubaddx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfssubaddx d,a,b
0 31 32 63
a
b
sp +
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1001
__ev_fssubx __ev_fssubx
Vector Floating-Point Single-Precision Subtract Exchanged
d = __ev_fssubx (a,b)
d
0:31
a
32:63
-
sp
b
0:31
d
32:63
a
0:31
-
sp
b
32:63
The high-order floating-point element of parameter b is subtracted from the low-order element of
parameter a, the low-order floating-point element of parameter b is subtracted from the high-order
element of parameter a, and the results are stored in parameter d. If an element of parameter a is
NaN or infinity, the corresponding result is either pmax or nmax (as appropriate). Otherwise, if an
element of parameter b is NaN or infinity, the corresponding result is either nmax or pmax (as
appropriate). Otherwise, if an overflow occurs, pmax or nmax (as appropriate) is stored in the
corresponding element of parameter d. If an underflow occurs, +0 (for rounding modes RN, RZ,
RP) or -0 (for rounding mode RM) is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN,
SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared
appropriately. If SPEFSCR
FINVE
is set, an interrupt is taken and parameter d is not updated.
Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow
occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either underflow or overflow exceptions are
enabled and a corresponding status bit is set, an interrupt is taken. If any of these interrupts are
taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are
disabled, and no other interrupt is taken, or underflows but underflow exceptions are disabled, and
no other interrupt is taken, SPEFSCR
FINXS
is set. If the floating-point inexact exception is
enabled, an interrupt is taken using the floating-point round interrupt vector. In this case, parameter
d is updated with the truncated result(s). The FG and FX bits are properly updated to allow
rounding to be performed in the interrupt handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an
invalid operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-829. Vector Floating-Point Subtract Exchanged (__ev_fssubx)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfssubx d,a,b
0 31 32 63
a
b
sp -
sp
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1002 Freescale Semiconductor
SPE2 Operations
__ev_fssum __ev_fssum
Vector Floating-Point Single-Precision Sums
d = __ev_fssum (a,b)
d
0:31
a
0:31
+
sp
a
32:63
d
32:63
b
0:31
+
sp
b
32:63
The high-order floating-point element of parameter a is added to the low-order element of parameter a, the
high-order floating-point element of parameter b is added to the low-order element of parameter b, and the
results are stored in parameter d. If the high-order element of parameter a or b is NaN or infinity, the
corresponding result is either pmax or nmax (as appropriate). Otherwise, if the low order element of
parameter a or b is NaN or infinity, the corresponding result is either pmax or nmax (as appropriate).
Otherwise, if an overflow occurs, pmax or nmax (as appropriate) is stored in the corresponding element of
parameter d. If an underflow occurs, +0 (for rounding modes RN, RZ, RP) or -0 (for rounding mode RM)
is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN, SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared appropriately. If SPEFSCR
FINVE
is set, an
interrupt is taken and parameter d is not updated. Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either
underflow or overflow exceptions are enabled and a corresponding status bit is set, an interrupt is taken. If
any of these interrupts are taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are disabled, and
no other interrupt is taken, or underflows but underflow exceptions are disabled, and no other interrupt is
taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point inexact exception is enabled, an interrupt is taken
using the floating-point round interrupt vector. In this case, parameter d is updated with the truncated
result(s). The FG and FX bits are properly updated to allow rounding to be performed in the interrupt
handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an invalid
operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-830. Vector Floating-Point Sums (__ev_fssum)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfssum d,a,b
0 31 32 63
a
b
+
sp +
sp
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1003
__ev_fssumdiff __ev_fssumdiff
Vector Floating-Point Single-Precision Sum / Difference
d = __ev_fssumdiff (a,b)
d
0:31
a
0:31
+
sp
a
32:63
d
32:63
b
0:31
-
sp
b
32:63
The high-order floating-point element of parameter a is added to the low-order element of parameter a, the
low-order floating-point element of parameter b is subtracted from the high-order element of parameter b,
and the results are stored in parameter d. If the high-order element of parameter a or b is NaN or infinity,
the corresponding result is either pmax or nmax (as appropriate). Otherwise, if the low order element of
parameter a or b is NaN or infinity, the corresponding result is either pmax or nmax (as appropriate).
Otherwise, if an overflow occurs, pmax or nmax (as appropriate) is stored in the corresponding element of
parameter d. If an underflow occurs, +0 (for rounding modes RN, RZ, RP) or -0 (for rounding mode RM)
is stored in the corresponding element of parameter d.
Exceptions:
If the contents of either element of parameter a or b are Infinity, Denorm, or NaN, SPEFSCR
FINV,FINVH
are set appropriately, and SPEFSCR
FGH,FXH,FG,FX
are cleared appropriately. If SPEFSCR
FINVE
is set, an
interrupt is taken and parameter d is not updated. Otherwise, if an overflow occurs, SPEFSCR
FOVF,FOVFH
are set appropriately, or if an underflow occurs, SPEFSCR
FUNF,FUNFH
are set appropriately. If either
underflow or overflow exceptions are enabled and a corresponding status bit is set, an interrupt is taken. If
any of these interrupts are taken, parameter d is not updated.
If either result element of this instruction is inexact, or overflows but overflow exceptions are disabled, and
no other interrupt is taken, or underflows but underflow exceptions are disabled, and no other interrupt is
taken, SPEFSCR
FINXS,FINXSH
is set. If the floating-point inexact exception is enabled, an interrupt is taken
using the floating-point round interrupt vector. In this case, parameter d is updated with the truncated
result(s). The FG and FX bits are properly updated to allow rounding to be performed in the interrupt
handler.
FG and FX (FGH and FXH) are cleared if an overflow or underflow interrupt is taken, or if an invalid
operation/input error is signaled for the low (high) element (regardless of FINVE).
Figure 3-831. Vector Floating-Point Sum / Difference (__ev_fssumdiff)
d a b Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfssumdiff d,a,b
0 31 32 63
a
b
+
sp -
sp
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1004 Freescale Semiconductor
SPE2 Operations
__ev_lower_fs_eq __ev_lower_fs_eq
Vector Lower Bits Floating-Point Equal
d = __ev_lower_fs_eq (a,b)
if (a
32:63
= b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are equal to the lower 32 bits of
parameter b.
Figure 3-832. Vector Lower Floating-Point Equal (__ev_lower_fs_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmpeq x,a,b
0 31 32 63
a
b
=
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1005
__ev_lower_fs_gt __ev_lower_fs_gt
Vector Lower Bits Floating-Point Greater Than
d = __ev_lower_fs_gt (a,b)
if (a
32:63
> b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b.
Figure 3-833. Vector Lower Floating-Point Greater Than (__ev_lower_fs_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmpgt x,a,b
0 31 32 63
a
b
>
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1006 Freescale Semiconductor
SPE2 Operations
__ev_lower_fs_lt _ev_lower_fs_lt
Vector Lower Bits Floating-Point Less Than
d = __ev_lower_fs_lt (a,b)
if (a
32:63
< b
32:63
) then d true
else dfalse
This intrinsic returns true if the lower 32 bits of parameter a are less than the lower 32 bits of
parameter b.
Figure 3-834. Vector Lower Floating-Point Less Than (__ev_lower_fs_lt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmplt x,a,b
0 31 32 63
a
b
<
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1007
__ev_lower_fs_tst_eq __ev_lower_fs_tst_eq
Vector Lower Bits Floating-Point TestEqual
d = __ev_lower_fs_tst_eq (a,b)
if (a
32:63
= b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are equal to the lower 32 bits of
parameter b. This intrinsic differs from __ev_lower_fs_eq because no exceptions are taken during
its execution. If strict IEEE 754 compliance is required, use __ev_lower_fs_eq instead.
Figure 3-835. Vector Lower Floating-Point Test Equal (__ev_lower_fs_tst_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststeq x,a,b
0 31 32 63
a
b
=
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1008 Freescale Semiconductor
SPE2 Operations
__ev_lower_fs_tst_gt __ev_lower_fs_tst_gt
Vector Lower Bits Floating-Point Test Greater Than
d = __ev_lower_fs_tst_gt (a,b)
if (a
32:63
> b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are greater than the lower 32 bits of
parameter b. This intrinsic differs from __ev_lower_fs_gt because no exceptions are taken during
its execution. If strict IEEE 754 compliance is required, use __ev_lower_fs_gt instead.
Figure 3-836. Vector Lower Floating-Point Test Greater Than (__ev_lower_fs_tst_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststgt x,a,b
0 31 32 63
a
b
>
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1009
__ev_lower_fs_tst_lt __ev_lower_fs_tst_lt
Vector Lower Bits Floating-Point Test Less Than
d = __ev_lower_fs_tst_lt (a,b)
if (a
32:63
< b
32:63
) then d true
else d false
This intrinsic returns true if the lower 32 bits of parameter a are less than the lower 32 bits of
parameter b. This intrinsic differs from __ev_lower_fs_lt because no exceptions are taken during
its execution. If strict IEEE 754 compliance is required, use __ev_lower_fs_lt instead.
Figure 3-837. Vector Lower Floating-Point Test Less Than (__ev_lower_fs_tst_lt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststlt x,a,b
0 31 32 63
a
b
<
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1010 Freescale Semiconductor
SPE2 Operations
__ev_select_fs_eq __ev_select_fs_eq
Vector Select Floating-Point Equal
e = __ev_select_fs_eq (a,b,c,d)
if (a
0:31
= b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
= b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in the C programming language. For example, the aforementioned intrinsic
maps to the following logical expression: a = b? c : d.
Figure 3-838. Vector Select Floating-Point Equal (__ev_select_fs_eq)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfscmpeq x,a,b
evsel e,c,d,x
0 31 32 63
a
b
=
=
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1011
__ev_select_fs_gt __ev_select_fs_gt
Vector Select Floating-Point Greater Than
e = ___ev_select_fs_gt (a,b,c,d)
if (a
0:31
> b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
> b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a > b ? c : d.
Figure 3-839. Vector Select Floating-Point Greater Than (__ev_select_fs_gt)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfscmpgt x,a,b
evsel e,c,d,x
0 31 32 63
a
b
>
>
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1012 Freescale Semiconductor
SPE2 Operations
__ev_select_fs_lt __ev_select_fs_lt
Vector Select Floating-Point Less Than
e = __ev_select_fs_lt (a,b,c,d)
if (a
0:31
< b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
< b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a < b? c : d.
Figure 3-840. Vector Select Floating-Point Less Than (__ev_select_fs_lt)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfscmplt x,a,b
evsel e,c,d,x
0 31 32 63
a
b
<
<
0 31
0
31
e
c
true false
0 31
d
32
63
32 63
true false
32 63
c d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1013
__ev_select_fs_tst_eq __ev_select_fs_tst_eq
Vector Select Floating-Point Test Equal
e = __ev_select_fs_tst_eq (a,b,c,d)
if (a
0:31
= b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
= b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a =b? c : d. This intrinsic differs from __ev_select_fs_eq because no exceptions are
taken during its execution. If strict IEEE 754 compliance is required, use __ev_select_fs_eq
instead.
Figure 3-841. Vector Select Floating-Point Test Equal (__ev_select_fs_tst_eq)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfststeq x,a,b
evsel e,c,d,x
0 31 32 63
a
b
=
=
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1014 Freescale Semiconductor
SPE2 Operations
__ev_select_fs_tst_gt __ev_select_fs_tst_gt
Vector Select Floating-Point Test Greater Than
e = ___ev_select_fs_tst_gt (a,b,c,d)
if (a
0:31
> b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
> b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a > b ? c : d. This intrinsic differs from __ev_select_fs_gt because no exceptions are
taken during its execution. If strict IEEE 754 compliance is required, use __ev_select_fs_gt
instead.
Figure 3-842. Vector Select Floating-Point Test Greater Than (__ev_select_fs_tst_gt)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfststgt x,a,b
evsel e,c,d,x
0 31 32 63
a
b
>
>
0 31
0
31
e
c
true false
0 31
d
32
63
32 63
true false
32 63
c d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1015
__ev_select_fs_tst_lt __ev_select_fs_tst_lt
Vector Select Floating-Point Test Less Than
e = __ev_select_fs_tst_lt (a,b,c,d)
if (a
0:31
< b
0:31
) then e
0:31
c
0:31
else e
0:31
d
0:31
if (a
32:63
< b
32:63
) then e
32:63
c
32:63
else e
32:63
d
32:63
This intrinsic returns a concatenated value of the upper and lower bits of parameter c or d based
on the sizes of the upper and lower bits of parameters a and b. The __ev_select_* functions work
like the ? : operator in C. For example, the aforementioned intrinsic maps to the following logical
expression: a < b? c : d. This intrinsic differs from __ev_select_fs_lt because no exceptions are
taken during its execution. If strict IEEE 754 compliance is required, use __ev_select_fs_lt
instead.
Figure 3-843. Vector Select Floating-Point Test Less Than (__ev_select_fs_tst_lt)
e a b c d Maps to
__ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ __ev64_opaque__ evfststlt x,a,b
evsel e,c,d,x
0 31 32 63
a
b
<
<
0 31
0
31
e
c
True False
0 31
d
32
63
32 63
True False
32 63
c d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1016 Freescale Semiconductor
SPE2 Operations
__ev_upper_fs_eq __ev_upper_fs_eq
Vector Upper Bits Floating-Point Equal
d = __ev_upper_fs_eq (a,b)
if (a
0:31
= b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are equal to the upper 32 bits of
parameter b.
Figure 3-844. Vector Upper Floating-Point Equal(__ev_upper_fs_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmpeq x,a,b
0 31 32 63
a
b
=
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1017
__ev_upper_fs_gt __ev_upper_fs_gt
Vector Upper Bits Floating-Point Greater Than
d = __ev_upper_fs_gt (a,b)
if (a
0:31
> b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are greater than the upper 32 bits of
parameter b.
Figure 3-845. Vector Upper Floating-Point Greater Than (__ev_upper_fs_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmpgt x,a,b
0 31 32 63
a
b
>
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1018 Freescale Semiconductor
SPE2 Operations
__ev_upper_fs_lt __ev_upper_fs_lt
Vector Upper Bits Floating-Point Less Than
d = __ev_upper_fs_lt (a,b)
if (a
0:31
< b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b.
Figure 3-846. Vector Upper Floating-Point Less Than (__ev_upper_fs_lt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfscmplt x,a,b
0 31 32 63
a
b
<
d
Intrinsic Definitions
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1019
__ev_upper_fs_tst_eq __ev_upper_fs_tst_eq
Vector Upper Bits Floating-Point Test Equal
d = __ev_upper_fs_tst_eq (a,b)
if (a
0:31
= b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are equal to the upper 32 bits of
parameter b. This intrinsic differs from __ev_upper_fs_eq because no exceptions are taken during
its execution. If strict IEEE 754 compliance is required, use __ev_upper_fs_eq instead.
Figure 3-847. Vector Upper Floating-Point Test Equal (__ev_upper_fs_tst_eq)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststeq x,a,b
0 31 32 63
a
b
=
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1020 Freescale Semiconductor
SPE2 Operations
__ev_upper_fs_tst_gt __ev_upper_fs_tst_gt
Vector Upper Bits Floating-Point Test Greater Than
d = __ev_upper_fs_tst_gt (a,b)
if (a
0:31
> b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are greater than the upper 32 bits of
parameter b. This intrinsic differs from __ev_upper_fs_gt because no exceptions are taken during
its execution. If strict IEEE 754 compliance is required, use __ev_upper_fs_gt instead.
Figure 3-848. Vector Upper Floating-Point Test Greater Than (__ev_upper_fs_tst_gt)
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststgt x,a,b
0 31 32 63
a
b
>
d
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1021
__ev_upper_fs_tst_lt __ev_upper_fs_tst_lt
Vector Upper Bits Floating-Point TestLess Than
d = __ev_upper_fs_tst_lt (a,b)
if (a
0:31
< b
0:31
) then d true
else d false
This intrinsic returns true if the upper 32 bits of parameter a are less than the upper 32 bits of
parameter b. This intrinsic differs from __ev_upper_fs_lt because no exceptions are taken during
its execution. If strict IEEE 754 compliance is required, use __ev_upper_fs_lt instead.
Figure 3-849. Vector Upper Floating-Point Test Less Than (__ev_upper_fs_tst_lt)
3.8 Basic Instruction Mapping
//
uint32_t __brinc( uint32_t a, uint32_t b );
__ev64_opaque__ __ev_circinc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_abs( __ev64_opaque__ a );
__ev64_opaque__ __ev_absb( __ev64_opaque__ a );
__ev64_opaque__ __ev_absbs( __ev64_opaque__ a );
__ev64_opaque__ __ev_absd( __ev64_opaque__ a );
__ev64_opaque__ __ev_absdifsb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_absdifsh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_absdifsw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_absdifub( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_absdifuh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_absdifuw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_absds( __ev64_opaque__ a );
__ev64_opaque__ __ev_absh( __ev64_opaque__ a );
__ev64_opaque__ __ev_abshs( __ev64_opaque__ a );
__ev64_opaque__ __ev_abss( __ev64_opaque__ a );
d a b Maps to
_Bool __ev64_opaque__ __ev64_opaque__ evfststlt x,a,b
0 31 32 63
a
b
<
d
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1022 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_add2subf2h( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_add2subf2hss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsmiaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_addb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addbss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addbus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addd( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_adddss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_adddus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhhisw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhhiuw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhlosw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhlouw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addhxus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addib( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_addih( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_addiw( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_addsmiaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_addssiaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_addssiaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_addsubfh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsubfhss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsubfhx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsubfhxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsubfw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsubfwss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsubfwx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addsubfwxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addumiaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_addusiaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_addusiaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_addw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwegsf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwegsi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwogsf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwogsi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_addwxus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_and( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_andc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgbs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgbsr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgbu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgbur( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgds( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgdsr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgdu( __ev64_opaque__ a, __ev64_opaque__ b );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1023
__ev64_opaque__ __ev_avgdur( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avghs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avghsr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avghu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avghur( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgwsr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgwu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_avgwur( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_clrbe( __ev64_opaque__ a, 4-bit unsigned literal );
__ev64_opaque__ __ev_clrbo( __ev64_opaque__ a, 4-bit unsigned literal );
__ev64_opaque__ __ev_clrh( __ev64_opaque__ a, 4-bit unsigned literal );
__ev64_opaque__ __ev_cntlsh( __ev64_opaque__ a );
__ev64_opaque__ __ev_cntlsw( __ev64_opaque__ a );
__ev64_opaque__ __ev_cntlzh( __ev64_opaque__ a );
__ev64_opaque__ __ev_cntlzw( __ev64_opaque__ a );
__ev64_opaque__ __ev_diff2his( __ev64_opaque__ a );
__ev64_opaque__ __ev_diff2hisa( __ev64_opaque__ a );
__ev64_opaque__ __ev_diff2hisaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_divs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_divu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_divws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_divwsf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_divwu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_divwuf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlveb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlveh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlveob( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlveoh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlvob( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlvoeb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlvoeh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dlvoh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_dotp4hgasmf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotp4hgasmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasmiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasmiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotp4hgasumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasumia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasumiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgasumiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotp4hgaumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgaumia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgaumiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgaumiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotp4hgssmf(__ev64_opaque__ a,__ev64_opaque__ b);
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1024 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_dotp4hgssmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgssmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgssmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotp4hgssmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgssmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgssmiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hgssmiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotp4hxgasmf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgasmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgasmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgasmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotp4hxgasmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgasmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgasmiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgasmiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotp4hxgssmf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgssmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgssmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgssmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotp4hxgssmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgssmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgssmiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotp4hxgssmiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotpbasmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbasmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbasmiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbasmiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpbasumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbasumia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbasumiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbasumiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpbaumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbaumia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbaumiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpbaumiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphasmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasmiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasmiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphassf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassfaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassfaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphassfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassfraaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassfraaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphassi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphassiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphasumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasumia(__ev64_opaque__ a,__ev64_opaque__ b);
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1025
__ev64_opaque__ __ev_dotphasumiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasumiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphasusi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasusia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasusiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphasusiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphaumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphaumia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphaumiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphaumiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphausi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphausia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphausiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphausiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphihcsmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcsmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcsmiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcsmiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotphihcssf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssfaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssfaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotphihcssfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssfraaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssfraaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotphihcssi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphihcssiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotphssmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphssmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphssmiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphssmiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphsssf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssfaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssfaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphsssfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssfraaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssfraaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotphsssi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotphsssiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotplohcsmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcsmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcsmiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcsmiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotplohcssf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssfa(__ev64_opaque__ a,__ev64_opaque__ b);
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1026 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_dotplohcssfaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssfaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotplohcssfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssfraaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssfraaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotplohcssi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotplohcssiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotpwasmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasmiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasmiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwassi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwassia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwassiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwassiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwasumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasumia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasumiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasumiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwasusi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasusia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasusiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwasusiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwaumi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwaumia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwaumiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwaumiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwausi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwausia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwausiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwausiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwcsmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcsmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcsmiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcsmiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwcssf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssfaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssfaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwcssfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssfraaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssfraaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwcssi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssiaaw(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwcssiaaw3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwgasmf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgasmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgasmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgasmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1027
__ev64_opaque__ __ev_dotpwgasmfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgasmfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgasmfraa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgasmfraa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwgssmf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgssmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgssmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgssmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwgssmfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgssmfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgssmfraa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwgssmfraa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwssmi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwssmia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwssmiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwssmiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwsssi(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwsssia(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwsssiaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwsssiaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwxgasmf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgasmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgasmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgasmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwxgasmfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgasmfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgasmfraa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgasmfraa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_dotpwxgssmf(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgssmfa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgssmfaa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgssmfaa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__ c);
__ev64_opaque__ __ev_dotpwxgssmfr(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgssmfra(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgssmfraa(__ev64_opaque__ a,__ev64_opaque__ b);
__ev64_opaque__ __ev_dotpwxgssmfraa3(__ev64_opaque__ a,__ev64_opaque__ b,__ev64_opaque__
c);
__ev64_opaque__ __ev_eqv( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_extsb( __ev64_opaque__ a );
__ev64_opaque__ __ev_extsbh( __ev64_opaque__ a );
__ev64_opaque__ __ev_extsh( __ev64_opaque__ a );
__ev64_opaque__ __ev_extsw( __ev64_opaque__ a );
__ev64_opaque__ __ev_extzb( __ev64_opaque__ a );
__ev64_opaque__ __ev_insb( __ev64_opaque__ a, __ev64_opaque__ b, 3-bit unsigned literal,
3-bit unsigned literal );
__ev64_opaque__ __ev_insh( __ev64_opaque__ a, __ev64_opaque__ b, 2-bit unsigned literal,
2-bit unsigned literal );
__ev64_opaque__ __ev_ilveh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_ilveoh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_ilvhih( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_ilvhiloh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_ilvloh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_ilvlohih( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_ilvoeh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_ilvoh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_lvsl( __ev64_opaque__ a, __ev64_opaque__ b );
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1028 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_lvsr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mar( );
__ev64_opaque__ __ev_maxbpsh( __ev64_opaque__ a );
__ev64_opaque__ __ev_maxbpuh( __ev64_opaque__ a );
__ev64_opaque__ __ev_maxbs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxbu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxds( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxdu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxhpsw( __ev64_opaque__ a );
__ev64_opaque__ __ev_maxhpuw( __ev64_opaque__ a );
__ev64_opaque__ __ev_maxhs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxhu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxmagws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxwpsd( __ev64_opaque__ a );
__ev64_opaque__ __ev_maxwpud( __ev64_opaque__ a );
__ev64_opaque__ __ev_maxws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_maxwu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesmi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesmia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesmiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesmianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbessiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbessianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesumiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesumianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesusiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbesusianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbeumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbeumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbeumiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbeumianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbeusiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbeusianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosmi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosmia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosmiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosmianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbossiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbossianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosumiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosumianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosusiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbosusianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mboumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mboumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mboumiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mboumianh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbousiaah( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mbousianh( __ev64_opaque__ a, __ev64_opaque__ b );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1029
__ev64_opaque__ __ev_mergehi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mergehilo( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mergelo( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mergelohi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhegsmfaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhegsmfan( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhegsmiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhegsmian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhegumiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhegumian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmfanw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesmianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhessf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhessfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhessfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhessfanw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhessiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhessianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesumiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesumianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesusiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhesusianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mheumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mheumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mheumiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mheumianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mheusiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mheusianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhogsmfaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhogsmfan( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhogsmiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhogsmian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhogumiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhogumian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmfanw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosmianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhossf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhossfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhossfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhossfanw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhossiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhossianw( __ev64_opaque__ a, __ev64_opaque__ b );
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1030 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_mhosumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosumiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosumianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosusiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhosusianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhoumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhoumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhoumiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhoumianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhousiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhousianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhssf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhssfr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhssi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhsusi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mhusi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_minbpsh( __ev64_opaque__ a );
__ev64_opaque__ __ev_minbpuh( __ev64_opaque__ a );
__ev64_opaque__ __ev_minbs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_minbu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_minds( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mindu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_minhpsw( __ev64_opaque__ a );
__ev64_opaque__ __ev_minhpuw( __ev64_opaque__ a );
__ev64_opaque__ __ev_minhs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_minhu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_minwpsd( __ev64_opaque__ a );
__ev64_opaque__ __ev_minwpud( __ev64_opaque__ a );
__ev64_opaque__ __ev_minws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_minwu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mra( __ev64_opaque__ a );
__ev64_opaque__ __ev_mwhsmf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhsmfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmfaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmfan( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmfr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmfra( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmfraa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwehgsmfran( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhsmi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhsmia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssfaaw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwhssfanw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssfanw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c);
__ev64_opaque__ __ev_mwhssfr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssfra( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssfraaw( __ev64_opaque__ a, __ev64_opaque__ b );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1031
__ev64_opaque__ __ev_mwhssfraaw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c);
__ev64_opaque__ __ev_mwhssfranw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssfranw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c);
__ev64_opaque__ __ev_mwhumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlsmiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlsmiaaw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwlsmianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlsmianw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwlssiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlssiaaw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwlssianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlssianw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwlumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlumiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlumiaaw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwlumianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlumianw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwlusiaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlusiaaw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwlusianw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwlusianw3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_mwohgsmf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwohgsmfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwohgsmfaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwohgsmfan( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwohgsmfr( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwohgsmfra( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwohgsmfraa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwohgsmfran( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmfaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmfan( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwsmian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwssf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwssfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwssfaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwssfan( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwssiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwssian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwssiw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwumi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwumia( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwumiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwumian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwusiaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwusian( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwusiw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_nand( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_neg( __ev64_opaque__ a );
__ev64_opaque__ __ev_negb( __ev64_opaque__ a );
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1032 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_negbo( __ev64_opaque__ a );
__ev64_opaque__ __ev_negbos( __ev64_opaque__ a );
__ev64_opaque__ __ev_negbs( __ev64_opaque__ a );
__ev64_opaque__ __ev_negd( __ev64_opaque__ a );
__ev64_opaque__ __ev_negds( __ev64_opaque__ a );
__ev64_opaque__ __ev_negh( __ev64_opaque__ a );
__ev64_opaque__ __ev_negho( __ev64_opaque__ a );
__ev64_opaque__ __ev_neghos( __ev64_opaque__ a );
__ev64_opaque__ __ev_neghs( __ev64_opaque__ a );
__ev64_opaque__ __ev_negs( __ev64_opaque__ a );
__ev64_opaque__ __ev_negwo( __ev64_opaque__ a );
__ev64_opaque__ __ev_negwos( __ev64_opaque__ a );
__ev64_opaque__ __ev_nor( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_or( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_orc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_perm( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_perm2( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_perm3( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_pksdshefrs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pksdswfrs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pksdsws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkshsbs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkshubs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkswgshefrs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkswgswfrs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkswshfrs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkswshilvfrs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkswshilvs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkswshs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkswuhs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkuduws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkuhubs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_pkuwuhs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_popcntb( __ev64_opaque__ a );
__ev64_opaque__ __ev_rlb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_rlbi( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_rlh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_rlhi( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_rlw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_rlwi( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_rnddnw( __ev64_opaque__ a );
__ev64_opaque__ __ev_rnddnwss( __ev64_opaque__ a );
__ev64_opaque__ __ev_rnddnwus( __ev64_opaque__ a );
__ev64_opaque__ __ev_rnddw( __ev64_opaque__ a );
__ev64_opaque__ __ev_rnddwss( __ev64_opaque__ a );
__ev64_opaque__ __ev_rnddwus( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndhb( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndhbss( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndhbus( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndhnb( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndhnbss( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndhnbus( __ev64_opaque__ a );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1033
__ev64_opaque__ __ev_rndw( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndwh( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndwhss( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndwhus( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndwnh( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndwnhss( __ev64_opaque__ a );
__ev64_opaque__ __ev_rndwnhus( __ev64_opaque__ a );
__ev64_opaque__ __ev_sad2sh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad2sha( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad2shaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad2uh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad2uha( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad2uhaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad4sb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad4sba( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad4sbaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad4ub( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad4uba( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sad4ubaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sadsw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sadswa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sadswaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_saduw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_saduwa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_saduwaa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_satsbub( __ev64_opaque__ a );
__ev64_opaque__ __ev_satsdsw( __ev64_opaque__ a );
__ev64_opaque__ __ev_satsduw( __ev64_opaque__ a );
__ev64_opaque__ __ev_satshsb( __ev64_opaque__ a );
__ev64_opaque__ __ev_satshub( __ev64_opaque__ a );
__ev64_opaque__ __ev_satshuh( __ev64_opaque__ a );
__ev64_opaque__ __ev_satswgsdf( __ev64_opaque__ a );
__ev64_opaque__ __ev_satswsh( __ev64_opaque__ a );
__ev64_opaque__ __ev_satswuh( __ev64_opaque__ a );
__ev64_opaque__ __ev_satswuw( __ev64_opaque__ a );
__ev64_opaque__ __ev_satuduw( __ev64_opaque__ a );
__ev64_opaque__ __ev_satubsb( __ev64_opaque__ a );
__ev64_opaque__ __ev_satuhsh( __ev64_opaque__ a );
__ev64_opaque__ __ev_satuhub( __ev64_opaque__ a );
__ev64_opaque__ __ev_satuwsw( __ev64_opaque__ a );
__ev64_opaque__ __ev_satuwuh( __ev64_opaque__ a );
__ev64_opaque__ __ev_selbit( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_selbitm0( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_selbitm1( __ev64_opaque__ a, __ev64_opaque__ b, __ev64_opaque__ c );
__ev64_opaque__ __ev_seteqb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_seteqb_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_seteqh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_seteqh_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_seteqw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_seteqw_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtbs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtbs_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtbu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtbu_rc( __ev64_opaque__ a, __ev64_opaque__ b );
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1034 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_setgths( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgths_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgthu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgthu_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtws_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtwu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setgtwu_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltbs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltbs_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltbu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltbu_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setlths( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setlths_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setlthu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setlthu_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltws_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltwu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_setltwu_rc( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sl( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sli( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_slb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_slbi( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_slh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_slhi( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_sloi( __ev64_opaque__ a, 3-bit unsigned literal );
__ev64_opaque__ __ev_slw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_slwi( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_splatb( __ev64_opaque__ a, 3-bit unsigned literal );
__ev64_opaque__ __ev_splatfi( 5-bit signed literal );
__ev64_opaque__ __ev_splatfia( 5-bit signed literal );
__ev64_opaque__ __ev_splatfib( 5-bit signed literal );
__ev64_opaque__ __ev_splatfiba( 5-bit signed literal );
__ev64_opaque__ __ev_splatfibo( 5-bit signed literal );
__ev64_opaque__ __ev_splatfiboa( 5-bit signed literal );
__ev64_opaque__ __ev_splatfid( 5-bit signed literal );
__ev64_opaque__ __ev_splatfida( 5-bit signed literal );
__ev64_opaque__ __ev_splatfih( 5-bit signed literal );
__ev64_opaque__ __ev_splatfiha( 5-bit signed literal );
__ev64_opaque__ __ev_splatfiho( 5-bit signed literal );
__ev64_opaque__ __ev_splatfihoa( 5-bit signed literal );
__ev64_opaque__ __ev_splatfio( 5-bit signed literal );
__ev64_opaque__ __ev_splatfioa( 5-bit signed literal );
__ev64_opaque__ __ev_splath( __ev64_opaque__ a, 2-bit unsigned literal );
__ev64_opaque__ __ev_splati( 5-bit signed literal );
__ev64_opaque__ __ev_splatia( 5-bit signed literal );
__ev64_opaque__ __ev_splatib( 5-bit signed literal );
__ev64_opaque__ __ev_splatiba( 5-bit signed literal );
__ev64_opaque__ __ev_splatibe( 5-bit signed literal );
__ev64_opaque__ __ev_splatibea( 5-bit signed literal );
__ev64_opaque__ __ev_splatid( 5-bit signed literal );
__ev64_opaque__ __ev_splatida( 5-bit signed literal );
__ev64_opaque__ __ev_splatie( 5-bit signed literal );
__ev64_opaque__ __ev_splatiea( 5-bit signed literal );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1035
__ev64_opaque__ __ev_splatih( 5-bit signed literal );
__ev64_opaque__ __ev_splatiha( 5-bit signed literal );
__ev64_opaque__ __ev_splatihe( 5-bit signed literal );
__ev64_opaque__ __ev_splatihea( 5-bit signed literal );
__ev64_opaque__ __ev_srbis( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_srbiu( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_srbs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_srbu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_srhis( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_srhiu( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_srhs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_srhu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sris( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_sriu( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_srois( __ev64_opaque__ a, 3-bit unsigned literal );
__ev64_opaque__ __ev_sroiu( __ev64_opaque__ a, 3-bit unsigned literal );
__ev64_opaque__ __ev_srs( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_sru( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_srwis( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_srwiu( __ev64_opaque__ a, 5-bit unsigned literal );
__ev64_opaque__ __ev_srws( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_srwu( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subf2add2h( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subf2add2hss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddhss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddhx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddhxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddwss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddwx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfaddwxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfb( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfbss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfbus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfd( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfdss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfdus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfh( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhhisw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhhiuw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhlosw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhlouw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfhxus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfsmiaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_subfsmiaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_subfssiaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_subfssiaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_subfumiaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_subfusiaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_subfusiaaw( __ev64_opaque__ a );
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1036 Freescale Semiconductor
SPE2 Operations
__ev64_opaque__ __ev_subfw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwegsf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwegsi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwogsf( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwogsi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwx( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwxss( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subfwxus( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_subifb( 5-bit unsigned literal, __ev64_opaque__ b );
__ev64_opaque__ __ev_subifh( 5-bit unsigned literal, __ev64_opaque__ b );
__ev64_opaque__ __ev_subifw( 5-bit unsigned literal, __ev64_opaque__ b );
__ev64_opaque__ __ev_sum2his( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2hisa( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2hisaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2hs( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2hsa( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2hsaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2hu( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2hua( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum2huaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum4bs( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum4bsa( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum4bsaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum4bu( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum4bua( __ev64_opaque__ a );
__ev64_opaque__ __ev_sum4buaaw( __ev64_opaque__ a );
__ev64_opaque__ __ev_sumws( __ev64_opaque__ a );
__ev64_opaque__ __ev_sumwsa( __ev64_opaque__ a );
__ev64_opaque__ __ev_sumwsaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_sumwu( __ev64_opaque__ a );
__ev64_opaque__ __ev_sumwua( __ev64_opaque__ a );
__ev64_opaque__ __ev_sumwuaa( __ev64_opaque__ a );
__ev64_opaque__ __ev_swapbhilo( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_swapblohi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_swaphe( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_swaphhi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_swaphhilo( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_swaphlo( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_swaphlohi( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_swapho( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_unpkhibsi( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpkhibui( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpkhihf( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpkhihsi( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpkhihui( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpkhiwgsf( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpklobsi( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpklobui( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpklohf( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpklohsi( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpklohui( __ev64_opaque__ a );
__ev64_opaque__ __ev_unpklowgsf( __ev64_opaque__ a );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1037
__ev64_opaque__ __ev_xor( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_xtrb(__ev64_opaque__ a,3-bit unsigned literal,3-bit unsigned literal);
__ev64_opaque__ __ev_xtrd( __ev64_opaque__ a, __ev64_opaque__ b, 3-bit unsigned literal);
__ev64_opaque__ __ev_xtrh(__ev64_opaque__ a,2-bit unsigned literal,2-bit unsigned literal);
# COMPARE PREDICATES
NOTE
The __ev_select_* operations work much like the ? : operator does in
C. For example:
__ev_select_gts(a,b,c,d) maps to the logical expression a > b ? c : d.
The following code shows an example of the assembly code:
evcmpgts crfD, A, B
evsel ret, C, D, crfD
_Bool __ev_any_gts( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_gts( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_gts( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_gts( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_gts( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_gtu(__ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_gtu( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_gtu( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_gtu( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_gtu( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_lts( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_lts( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_lts( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_lts( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_lts( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_ltu( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_ltu( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_ltu( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_ltu( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_ltu( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_eq( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_eq( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
# LOAD/STORE
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1038 Freescale Semiconductor
SPE2 Operations
NOTE
The 5-bit unsigned literal in the immediate form is scaled by the size
of the load or store to determine how many bytes the pointer 'p' is
offset by. The size of the load is determined by the first letter after the
'l': 'd'double-word (8 bytes), 'w'word (4 bytes), 'h'half word
(2 bytes). For details, see Chapter 5, Programming Interface
Examples.
__ev64_opaque__ __ev_lbbsplatb( uint8_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lbbsplatbu( uint8_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lbbsplatbx( uint8_t * p, int32_t offset );
__ev64_opaque__ __ev_lbbsplatbmx( uint8_t * &p, int32_t offset );
__ev64_opaque__ __ev_ldb( __ev64_opaque__ * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_ldbu( __ev64_opaque__ * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_ldbx( __ev64_opaque__ * p, int32_t offset );
__ev64_opaque__ __ev_ldbmx( __ev64_opaque__ * &p, int32_t offset );
__ev64_opaque__ __ev_ldd( __ev64_opaque__ * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lddu( __ev64_opaque__ * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lddx( __ev64_opaque__ * p, int32_t offset );
__ev64_opaque__ __ev_lddmx( __ev64_opaque__ * &p, int32_t offset );
__ev64_opaque__ __ev_ldh( __ev64_opaque__ * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_ldhu( __ev64_opaque__ * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_ldhx( __ev64_opaque__ * p, int32_t offset );
__ev64_opaque__ __ev_ldhmx( __ev64_opaque__ * &p, int32_t offset );
__ev64_opaque__ __ev_ldw( __ev64_opaque__ * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_ldwu( __ev64_opaque__ * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_ldwx( __ev64_opaque__ * p, int32_t offset );
__ev64_opaque__ __ev_ldwmx( __ev64_opaque__ * &p, int32_t offset );
__ev64_opaque__ __ev_lhhesplat( uint16_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhesplatu( uint16_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhesplatx( uint16_t * p, int32_t offset );
__ev64_opaque__ __ev_lhhesplatmx( uint16_t * &p, int32_t offset );
__ev64_opaque__ __ev_lhhossplat( uint16_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhossplatu( uint16_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhossplatx( uint16_t * p, int32_t offset );
__ev64_opaque__ __ev_lhhossplatmx( uint16_t * &p, int32_t offset );
__ev64_opaque__ __ev_lhhousplat( uint16_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhousplatu( uint16_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhousplatx( uint16_t * p, int32_t offset );
__ev64_opaque__ __ev_lhhousplatmx( uint16_t * &p, int32_t offset );
__ev64_opaque__ __ev_lhhsplath( uint16_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhsplathu( uint16_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lhhsplathx( uint16_t * p, int32_t offset );
__ev64_opaque__ __ev_lhhsplathmx( uint16_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwbe( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwbeu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwbex( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwbemx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwbos( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwbosu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwbosx( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwbosmx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwbou( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwbouu( uint32_t * &p, 5-bit unsigned literal );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1039
__ev64_opaque__ __ev_lwboux( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwboumx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwbsplatw( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwbsplatwu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwbsplatwx( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwbsplatwmx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwhe( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwheu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhex( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwhemx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwhos( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhosu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhosx( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwhosmx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwhou( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhouu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhoux( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwhoumx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwhsplat( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhsplatu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhsplatw( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhsplatwu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwhsplatwx( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwhsplatwmx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwhsplatx( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwhsplatmx( uint32_t * &p, int32_t offset );
__ev64_opaque__ __ev_lwwsplat( uint32_t * p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwwsplatu( uint32_t * &p, 5-bit unsigned literal );
__ev64_opaque__ __ev_lwwsplatx( uint32_t * p, int32_t offset );
__ev64_opaque__ __ev_lwwsplatmx( uint32_t * &p, int32_t offset );
void __ev_stdb( __ev64_opaque__ a, __ev64_opaque__ * p, 5-bit unsigned literal );
void __ev_stdbu( __ev64_opaque__ a, __ev64_opaque__ * &p, 5-bit unsigned literal );
void __ev_stdbx( __ev64_opaque__ a, __ev64_opaque__ * p, int32_t offset );
void __ev_stdbmx( __ev64_opaque__ a, __ev64_opaque__ * &p, int32_t offset );
void __ev_stdd( __ev64_opaque__ a, __ev64_opaque__ * p, 5-bit unsigned literal );
void __ev_stddu( __ev64_opaque__ a, __ev64_opaque__ * &p, 5-bit unsigned literal );
void __ev_stddx( __ev64_opaque__ a, __ev64_opaque__ * p, int32_t offset );
void __ev_stddmx( __ev64_opaque__ a, __ev64_opaque__ * &p, int32_t offset );
void __ev_stdh( __ev64_opaque__ a, __ev64_opaque__ * p, 5-bit unsigned literal );
void __ev_stdhu( __ev64_opaque__ a, __ev64_opaque__ * &p, 5-bit unsigned literal );
void __ev_stdhx( __ev64_opaque__ a, __ev64_opaque__ * p, int32_t offset );
void __ev_stdhmx( __ev64_opaque__ a, __ev64_opaque__ * &p, int32_t offset );
void __ev_stdw( __ev64_opaque__ a, __ev64_opaque__ * p, 5-bit unsigned literal );
void __ev_stdwu( __ev64_opaque__ a, __ev64_opaque__ * &p, 5-bit unsigned literal );
void __ev_stdwx( __ev64_opaque__ a, __ev64_opaque__ * p, int32_t offset );
void __ev_stdwmx( __ev64_opaque__ a, __ev64_opaque__ * &p, int32_t offset );
void __ev_sthb( __ev64_opaque__ a, uint16_t * p, 5-bit unsigned literal );
void __ev_sthbu( __ev64_opaque__ a, uint16_t * &p, 5-bit unsigned literal );
void __ev_sthbx( __ev64_opaque__ a, uint16_t * p, int32_t offset );
void __ev_sthbmx( __ev64_opaque__ a, uint16_t * &p, int32_t offset );
void __ev_stwb( __ev64_opaque__ a, uint32_t * p, 5-bit unsigned literal );
void __ev_stwbu( __ev64_opaque__ a, uint32_t * &p, 5-bit unsigned literal );
void __ev_stwbx( __ev64_opaque__ a, uint32_t * p, int32_t offset );
void __ev_stwbmx( __ev64_opaque__ a, uint32_t * &p, int32_t offset );
void __ev_stwbe( __ev64_opaque__ a, uint32_t * p, 5-bit unsigned literal );
void __ev_stwbeu( __ev64_opaque__ a, uint32_t * &p, 5-bit unsigned literal );
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1040 Freescale Semiconductor
SPE2 Operations
void __ev_stwbex( __ev64_opaque__ a, uint32_t * p, int32_t offset );
void __ev_stwbemx( __ev64_opaque__ a, uint32_t * &p, int32_t offset );
void __ev_stwbo( __ev64_opaque__ a, uint32_t * p, 5-bit unsigned literal );
void __ev_stwbou( __ev64_opaque__ a, uint32_t * &p, 5-bit unsigned literal );
void __ev_stwbox( __ev64_opaque__ a, uint32_t * p, int32_t offset );
void __ev_stwbomx( __ev64_opaque__ a, uint32_t * &p, int32_t offset );
void __ev_stwhe( __ev64_opaque__ a, uint32_t * p, 5-bit unsigned literal );
void __ev_stwheu( __ev64_opaque__ a, uint32_t * &p, 5-bit unsigned literal );
void __ev_stwhex( __ev64_opaque__ a, uint32_t * p, int32_t offset );
void __ev_stwhemx( __ev64_opaque__ a, uint32_t * &p, int32_t offset );
void __ev_stwho( __ev64_opaque__ a, uint32_t * p, 5-bit unsigned literal );
void __ev_stwhou( __ev64_opaque__ a, uint32_t * &p, 5-bit unsigned literal );
void __ev_stwhox( __ev64_opaque__ a, uint32_t * p, int32_t offset );
void __ev_stwhomx( __ev64_opaque__ a, uint32_t * &p, int32_t offset );
void __ev_stwwe( __ev64_opaque__ a, uint32_t * p, 5-bit unsigned literal );
void __ev_stwweu( __ev64_opaque__ a, uint32_t * &p, 5-bit unsigned literal );
void __ev_stwwex( __ev64_opaque__ a, uint32_t * p, int32_t offset );
void __ev_stwwemx( __ev64_opaque__ a, uint32_t * &p, int32_t offset );
void __ev_stwwo( __ev64_opaque__ a, uint32_t * p, 5-bit unsigned literal );
void __ev_stwwou( __ev64_opaque__ a, uint32_t * &p, 5-bit unsigned literal );
void __ev_stwwox( __ev64_opaque__ a, uint32_t * p, int32_t offset );
void __ev_stwwomx( __ev64_opaque__ a, uint32_t * &p, int32_t offset );
//***************************
// maps to __ev_mhoumi
__ev64_opaque__ __ev_mhoumf( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mheumi
__ev64_opaque__ __ev_mheumf( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhoumia
__ev64_opaque__ __ev_mhoumfa( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mheumia
__ev64_opaque__ __ev_mheumfa( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhousiaaw
__ev64_opaque__ __ev_mhousfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhoumiaaw
__ev64_opaque__ __ev_mhoumfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mheusiaaw
__ev64_opaque__ __ev_mheusfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mheumiaaw
__ev64_opaque__ __ev_mheumfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhousianw
__ev64_opaque__ __ev_mhousfanw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhoumianw
__ev64_opaque__ __ev_mhoumfanw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mheusianw
__ev64_opaque__ __ev_mheusfanw( __ev64_opaque__ a, __ev64_opaque__ b );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1041
// maps to __ev_mheumianw
__ev64_opaque__ __ev_mheumfanw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhogumiaa
__ev64_opaque__ __ev_mhogumfaa( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhegumiaa
__ev64_opaque__ __ev_mhegumfaa( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhogumian
__ev64_opaque__ __ev_mhogumfan( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mhegumian
__ev64_opaque__ __ev_mhegumfan( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mwhumi
__ev64_opaque__ __ev_mwhumf( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mwhumia
__ev64_opaque__ __ev_mwhumfa( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssiaaw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmi(a,b);
__ev_addssiaaw(temp);
}
__ev64_opaque__ __ev_mwhsmiaaw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmi(a,b);
__ev_addsmiaaw(temp);
}
__ev64_opaque__ __ev_mwhusiaaw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhumi(a,b);
__ev_addusiaaw(temp);
}
__ev64_opaque__ __ev_mwhumiaaw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhumi(a,b);
__ev_addumiaaw(temp);
}
// maps to __ev_mwhusiaaw
__ev64_opaque__ __ev_mwhusfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mwhumiaaw
__ev64_opaque__ __ev_mwhumfaaw( __ev64_opaque__ a, __ev64_opaque__ b );
__ev64_opaque__ __ev_mwhssianw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmi(a,b);
__ev_subfssiaaw(temp);
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1042 Freescale Semiconductor
SPE2 Operations
}
__ev64_opaque__ __ev_mwhsmianw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmi(a,b);
__ev_subfsmiaaw(temp);
}
__ev64_opaque__ __ev_mwhusianw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhumi(a,b);
__ev_subfusiaaw(temp);
}
__ev64_opaque__ __ev_mwhumianw( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhumi(a,b);
__ev_subfumiaaw(temp);
}
__ev64_opaque__ __ev_mwhgssfaa( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhssf(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwsmiaa(temp, (__ev64_u32__){1, 1});
}
__ev64_opaque__ __ev_mwhgsmfaa( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmf(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwsmiaa(temp, (__ev64_u32__){1, 1});
}
__ev64_opaque__ __ev_mwhgsmiaa( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmi(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwsmiaa(temp, (__ev64_u32__){1, 1});
}
__ev64_opaque__ __ev_mwhgumiaa( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhumi(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwumiaa(temp, (__ev64_u32__){1, 1});
}
// maps to __ev_mwhgumiaa
__ev64_opaque__ __ev_mwhgumfaa( __ev64_opaque__ a, __ev64_opaque__ b );
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1043
__ev64_opaque__ __ev_mwhgssfan( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhssf(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwsmian(temp, (__ev64_u32__){1, 1});
}
__ev64_opaque__ __ev_mwhgsmfan( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmf(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwsmian(temp, (__ev64_u32__){1, 1});
}
__ev64_opaque__ __ev_mwhgsmian( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhsmi(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwsmian(temp, (__ev64_u32__){1, 1});
}
__ev64_opaque__ __ev_mwhgumian( __ev64_opaque__ a, __ev64_opaque__ b ) {
__ev64_opaque__ temp = __ev_mwhumi(a, b);
// Note: the upper 32 bits of the immediate is a do not care. Therefore
// we spec {1, 1} because it can easily be generated by a __ev_splati(1)
__ev_mwumian(temp, (__ev64_u32__){1, 1});
}
// maps to __ev_mwhgumian
__ev64_opaque__ __ev_mwhgumfan( __ev64_opaque__ a, __ev64_opaque__ b );
NOTE:
An optimizing compiler should be able to improve performance by scheduling
the instructions implementing an intrinsic, that is, __ev_mwhgumfan.
// maps to __ev_mwumi
__ev64_opaque__ __ev_mwumf( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mwumia
__ev64_opaque__ __ev_mwumfa( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mwumiaa
__ev64_opaque__ __ev_mwumfaa( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_mwumian
__ev64_opaque__ __ev_mwumfan( __ev64_opaque__ a, __ev64_opaque__ b );
// maps to __ev_addusiaaw
__ev64_opaque__ __ev_addusfaaw( __ev64_opaque__ a );
// maps to __ev_addumiaaw
__ev64_opaque__ __ev_addumfaaw( __ev64_opaque__ a );
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1044 Freescale Semiconductor
SPE2 Operations
// maps to __ev_addsmiaaw
__ev64_opaque__ __ev_addsmfaaw( __ev64_opaque__ a );
// maps to __ev_addssiaaw
__ev64_opaque__ __ev_addssfaaw( __ev64_opaque__ a );
// maps to __ev_subfusiaaw
__ev64_opaque__ __ev_subfusfaaw( __ev64_opaque__ a );
// maps to __ev_subfumiaaw
__ev64_opaque__ __ev_subfumfaaw( __ev64_opaque__ a );
// maps to __ev_subfsmiaaw
__ev64_opaque__ __ev_subfsmfaaw( __ev64_opaque__ a );
// maps to __ev_subfssiaaw
__ev64_opaque__ __ev_subfssfaaw( __ev64_opaque__ a );
# Floating-Point SIMD Instructions
__ev64_opaque__ __ev_fsabs( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsadd( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsaddsub( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsaddsubx( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsaddx( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fscfsf( __ev64_opaque__ a);
__ev64_opaque__ __ev_fscfsi( __ev64_opaque__ a);
__ev64_opaque__ __ev_fscfuf( __ev64_opaque__ a);
__ev64_opaque__ __ev_fscfui( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsctsf( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsctsi( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsctsiz( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsctuf( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsctui( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsctuiz( __ev64_opaque__ a);
__ev64_opaque__ __ev_fsdiff( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsdiffsum( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsdiv( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsmax( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsmin( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsmul( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsmule( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsmulo( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsmulx( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fsnabs( __ev64_opaque__ a)
__ev64_opaque__ __ev_fsneg( __ev64_opaque__ a);
__ev64_opaque__ __ev_fssqrt( __ev64_opaque__ a);
__ev64_opaque__ __ev_fssub( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fssubadd( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fssubaddx( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fssubx( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fssum( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_fssumdiff( __ev64_opaque__ a, __ev64_opaque__ b);
Basic Instruction Mapping
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 3-1045
# COMPARE PREDICATES
NOTE
The __ev_select_* operations work much like the ? : operator does in
C. For example:
__ev_select_fs_gts(a,b,c,d) maps to the logical expression a > b ? c : d.
_Bool __ev_any_fs_gt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_fs_gt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_fs_gt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_fs_gt( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_fs_gt( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_fs_lt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_fs_lt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_fs_lt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_fs_lt( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_fs_lt( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_fs_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_fs_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_fs_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_fs_eq( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_fs_eq( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_fs_tst_gt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_fs_tst_gt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_fs_tst_gt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_fs_tst_gt( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_fs_tst_gt( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_fs_tst_lt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_fs_tst_lt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_fs_tst_lt( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_fs_tst_lt( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_fs_tst_lt( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
_Bool __ev_any_fs_tst_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_all_fs_tst_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_upper_fs_tst_eq( __ev64_opaque__ a, __ev64_opaque__ b);
_Bool __ev_lower_fs_tst_eq( __ev64_opaque__ a, __ev64_opaque__ b);
__ev64_opaque__ __ev_select_fs_tst_eq( __ev64_opaque__ a, __ev64_opaque__ b,
__ev64_opaque__ c, __ev64_opaque__ d);
SPE2 Programming Interface Manual, Rev. 1.0-2
3-1046 Freescale Semiconductor
SPE2 Operations
SPE2 Programming Interface Manual, Rev. 1.0-2
Freescale Semiconductor 4-1
Chapter 4
Additional Operations
4.1 Data Manipulation
The intrinsics in Section Chapter 3, SPE2 Operations act like functions with parameters that are
passed by value (except for the update forms of load and store intrinsics which are noted).
Figure 4-1 and Figure 4-2 show the layout of a __ev64_opaque__ variable in the register with
reference to creation, insertion, and extraction routines (regardless of endianess).
Figure 4-2 shows byte, half-word, and word ordering.
4.1.1 Creation Intrinsics
These intrinsics create new generic 64-bit opaque data types from the given inputs passed by value.
More specifically, they are created from the following inputs: 1 signed or unsigned 64-bit integer,
2 single-precision floats, 2 signed or unsigned 32-bit integers, or 4 signed or unsigned 16-bit
integers.
__ev64_opaque__ __ev_create_u64( uint64_t a );
__ev64_opaque__ __ev_create_s64( int64_t a );
0 31 32 63
Upper Word
(Word A)
0
Lower Word
(Word B)
1