Arithmetic

The arithmetic part of IR APIs.

tvm.aipu.script.ir.arithmetic.vadd(x, y, mask=None, saturate=False, out_sign=None, r=None)

Computes the addition on active elements of x with the corresponding elements of y.

The inactive elements of result vector are determined by r.
The feature Flexible Width Vector is supported.
The feature Multiple Width Vector is supported.

   x: 1  2  3  4  5  6   7   8
   y: 1  2  3  4  5  6   7   8
mask: T  T  T  T  F  F   T   T
   z: 9  8  7  6  4  3   2   1

 out = S.vadd(x, y, mask, r=z)
 out: 2  4  6  8  4  3  14  16
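
The diagram above can be reproduced with a small NumPy reference model. This is only a sketch of the documented semantics on the host, not the DSL implementation; `masked_add` is a hypothetical helper name that does not exist in the API.

```python
import numpy as np


def masked_add(x, y, mask, r):
    """Reference model: x + y where mask is True, the value of r elsewhere."""
    x, y, r = np.asarray(x), np.asarray(y), np.asarray(r)
    mask = np.asarray(mask, dtype=bool)
    return np.where(mask, x + y, r)


x = np.array([1, 2, 3, 4, 5, 6, 7, 8])
y = np.array([1, 2, 3, 4, 5, 6, 7, 8])
mask = [True, True, True, True, False, False, True, True]
z = np.array([9, 8, 7, 6, 4, 3, 2, 1])

out = masked_add(x, y, mask, z)
print(out.tolist())  # [2, 4, 6, 8, 4, 3, 14, 16]
```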

Parameters

x, y (Union[PrimExpr, int, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.
saturate (Optional[bool]): Whether to saturate the result.
out_sign (Optional[str]): Whether the output is signed or unsigned. Only needed for integer operations. None means the same as the operands, in which case both operands must have the same sign; "u" means unsigned and "s" means signed.
r (Optional[Union[PrimExpr, int, float]]): The value of the inactive elements in the result vector. If it is a scalar, it will be automatically broadcast. None means the inactive elements of the result vector are undefined.
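
To illustrate the saturate and scalar-broadcast behavior described above, here is a minimal NumPy sketch. It assumes that saturate=True clamps the result to the range of the output dtype, which is the usual meaning of saturating arithmetic. The non-saturated line shows NumPy's wraparound, which is not necessarily what the hardware does. `saturating_add` is a hypothetical helper, not part of the API.

```python
import numpy as np


def saturating_add(x, y, dtype=np.int8):
    """Add in a wider integer type, then clamp to the range of dtype."""
    info = np.iinfo(dtype)
    wide = np.asarray(x, dtype=np.int64) + np.asarray(y, dtype=np.int64)
    return np.clip(wide, info.min, info.max).astype(dtype)


va = np.array([100, -100, 50], dtype=np.int8)
vb = np.array([100, -100, 20], dtype=np.int8)

wrapped = va + vb                     # plain int8 arithmetic wraps around in NumPy
saturated = saturating_add(va, vb)    # clamped to [-128, 127]
scalar_case = saturating_add(va, 3)   # the scalar 3 is broadcast to every lane

print(wrapped.tolist())      # [-56, 56, 70]
print(saturated.tolist())    # [127, -128, 70]
print(scalar_case.tolist())  # [103, -97, 53]
```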

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”, “float16/32”.

Examples

vc = S.vadd(va, vb)
vc = S.vadd(va, 3)
vc = S.vadd(va, vb, saturate=True)
vc = S.vadd(va, vb, out_sign="u")
vc = S.vadd(va, vb, mask=S.tail_mask(n, 8))
vc = S.vadd(va, vb, mask="3T5F", r=vb)
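
The string masks in these examples look like a run-length notation. The sketch below assumes that "3T5F" means three active lanes followed by five inactive ones, and that a missing count means one (so "T7F" is one active lane followed by seven inactive). This reading is inferred from the examples only, and `expand_mask` is a hypothetical helper, not an API function.

```python
import re


def expand_mask(spec):
    """Expand a run-length string such as "3T5F" into a list of booleans."""
    if not re.fullmatch(r"(\d*[TF])+", spec):
        raise ValueError(f"unsupported mask string: {spec!r}")
    lanes = []
    for count, flag in re.findall(r"(\d*)([TF])", spec):
        # An omitted count is treated as a single lane (assumption).
        lanes.extend([flag == "T"] * int(count or 1))
    return lanes


print(expand_mask("3T5F"))
print(expand_mask("T7F"))
```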

Parameters

x, y (Union[PrimExpr, int]): The operands. If either one is a scalar, it will be automatically broadcast.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”.

Examples

vc = S.vaddh(va, vb)
vc = S.vaddh(va, 3)

Parameters

x (PrimExpr): The operand, a vector.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.
saturate (Optional[bool]): Whether to saturate the result.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”, “float16/32”.

Examples

vc = S.vabs(va)
vc = S.vabs(va, saturate=True)
vc = S.vabs(va, mask="3T5F")
vc = S.vabs(va, mask=S.tail_mask(n, 8))

Parameters

x, y (Union[PrimExpr, int, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.
saturate (Optional[bool]): Whether to saturate the result.
out_sign (Optional[str]): Whether the output is signed or unsigned. Only needed for integer operations. None means the same as the operands, in which case both operands must have the same sign; "u" means unsigned and "s" means signed.
r (Optional[Union[PrimExpr, int, float]]): The value of the inactive elements in the result vector. If it is a scalar, it will be automatically broadcast. None means the inactive elements of the result vector are undefined.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”, “float16/32”.

Examples

vc = S.vsub(va, vb)
vc = S.vsub(va, 3)
vc = S.vsub(va, vb, saturate=True)
vc = S.vsub(va, vb, out_sign="u")
vc = S.vsub(va, vb, mask=S.tail_mask(n, 8))
vc = S.vsub(va, vb, mask="3T5F", r=vb)

Parameters

x, y (Union[PrimExpr, int]): The operands. If either one is a scalar, it will be automatically broadcast.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”.

Examples

vc = S.vsubh(va, vb)
vc = S.vsubh(va, 3)

Parameters

x, y (Union[PrimExpr, int, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.
out_sign (Optional[str]): Whether the output is signed or unsigned. Only needed for integer operations. None means the same as the operands, in which case both operands must have the same sign; "u" means unsigned and "s" means signed.
r (Optional[Union[PrimExpr, int, float]]): The value of the inactive elements in the result vector. If it is a scalar, it will be automatically broadcast. None means the inactive elements of the result vector are undefined.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int16/32”, “uint16/32”, “float16/32”.

Examples

vc = S.vmul(va, vb)
vc = S.vmul(va, 3)
vc = S.vmul(va, vb, out_sign="u")
vc = S.vmul(va, vb, mask="3T5F")
vc = S.vmul(va, vb, mask=S.tail_mask(n, 8))
vc = S.vmul(va, vb, mask="T7F", r=vb)

Parameters

x, y (Union[PrimExpr, int]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.
out_sign (Optional[str]): Whether the output is signed or unsigned. Only needed for integer operations. None means the same as the operands, in which case both operands must have the same sign; "u" means unsigned and "s" means signed.
r (Optional[Union[PrimExpr, int, float]]): The value of the inactive elements in the result vector. If it is a scalar, it will be automatically broadcast. None means the inactive elements of the result vector are undefined.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”.

Examples

vc = S.vmull(va, vb)
vc = S.vmull(va, 3)
vc = S.vmull(va, vb, out_sign="u")
vc = S.vmull(va, vb, mask="3T5F")
vc = S.vmull(va, vb, mask=S.tail_mask(n, 8))
vc = S.vmull(va, vb, mask="T7F", r=vb)

Parameters

x, y (Union[PrimExpr, int]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.
out_sign (Optional[str]): Whether the output is signed or unsigned. Only needed for integer operations. None means the same as the operands, in which case both operands must have the same sign; "u" means unsigned and "s" means signed.
r (Optional[Union[PrimExpr, int, float]]): The value of the inactive elements in the result vector. If it is a scalar, it will be automatically broadcast. None means the inactive elements of the result vector are undefined.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”.

Examples

vc = S.vmulh(va, vb)
vc = S.vmulh(va, 3)
vc = S.vmulh(va, vb, out_sign="u")
vc = S.vmulh(va, vb, mask="3T5F")
vc = S.vmulh(va, vb, mask=S.tail_mask(n, 8))
vc = S.vmulh(va, vb, mask="T7F", r=vb)

Parameters

x, y (Union[PrimExpr, int, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”, “float32”.

Examples

vc = S.vdiv(va, vb)
vc = S.vdiv(va, 3)
vc = S.vdiv(va, vb, mask="3T5F")
vc = S.vdiv(va, vb, mask=S.tail_mask(n, 8))

Parameters

x, y (Union[PrimExpr, int]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”.

Examples

vc = S.vmod(va, vb)
vc = S.vmod(va, 3)
vc = S.vmod(va, vb, mask="3T5F")
vc = S.vmod(va, vb, mask=S.tail_mask(n, 8))

Parameters

x, y (Union[PrimExpr, int]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

# Only supported integer cases:
case  result dtype  x.dtype   y.dtype
1     "int16"       "int8"    "int8"
2     "int16"       "int8"    "uint8"
3     "int16"       "uint8"   "int8"
4     "uint16"      "uint8"   "uint8"
5     "int32"       "int16"   "int16"
6     "int32"       "int16"   "uint16"
7     "int32"       "uint16"  "int16"
8     "uint32"      "uint16"  "uint16"

Examples

out0 = S.vdot(x, y)
out1 = S.vdot(x, 3, mask)
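
The dtype table above shows the result being twice as wide as the operands. One reading consistent with that is that each result element is the sum of the products of two adjacent operand elements. The NumPy sketch below models that reading only. It is an assumption, not a statement of the instruction's exact behavior, and `pairwise_dot` is a hypothetical helper name.

```python
import numpy as np


def pairwise_dot(x, y, out_dtype=np.int16):
    """Assumed model: multiply lane by lane, then sum every two adjacent products."""
    x = np.asarray(x).astype(out_dtype)
    # A scalar y is broadcast to the shape of x, as the parameter text describes.
    y = np.broadcast_to(np.asarray(y), x.shape).astype(out_dtype)
    products = x * y
    return products.reshape(-1, 2).sum(axis=1, dtype=out_dtype)


x = np.array([1, 2, 3, 4, 5, 6, 7, 8], dtype=np.int8)
y = np.array([1, 1, 2, 2, 3, 3, 4, 4], dtype=np.uint8)

out0 = pairwise_dot(x, y)   # int8 x uint8 -> int16, case 2 in the table
out1 = pairwise_dot(x, 3)   # scalar operand broadcast to all lanes

print(out0.tolist())  # [3, 14, 33, 60]
print(out1.tolist())  # [9, 21, 33, 45]
```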

Parameters

x, y (Union[PrimExpr, int, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

Supported integer cases:
case  result dtype  x.dtype    y.dtype
1     "int32"       "int8"     "int8"
2     "int32"       "int8"     "uint8"
3     "int32"       "uint8"    "int8"
4     "uint32"      "uint8"    "uint8"

Supported floating cases:
case  result dtype  x.dtype    y.dtype
1     "float32"     "float16"  "float16"

Examples

out0 = S.vqdot(x, y)
out1 = S.vqdot(x, 3, mask)

Parameters

acc (PrimExpr): The accumulator register, which must be initialized.
x, y (Union[PrimExpr, int, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

# Only supported integer cases:
case  acc.dtype  x.dtype   y.dtype
1     "int16"    "int8"    "int8"
2     "int16"    "int8"    "uint8"
3     "int16"    "uint8"   "int8"
4     "uint16"   "uint8"   "uint8"
5     "int32"    "int16"   "int16"
6     "int32"    "int16"   "uint16"
7     "int32"    "uint16"  "int16"
8     "uint32"   "uint16"  "uint16"

Examples

acc = S.int32x8(0)
out = S.vdpa(acc, x, y)

acc = S.int32x8(0)
out = S.vdpa(acc, x, y, mask)

Parameters

acc (PrimExpr): The accumulator register, which must be initialized.
x, y (Union[PrimExpr, int, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

Supported integer cases:
case  acc.dtype  x.dtype    y.dtype
1     "int32"    "int8"     "int8"
2     "int32"    "int8"     "uint8"
3     "int32"    "uint8"    "int8"
4     "uint32"   "uint8"    "uint8"

Supported floating cases:
case  acc.dtype  x.dtype    y.dtype
1     "float32"  "float16"  "float16"

Examples

acc = S.int32x8(0)
out = S.vqdpa(acc, x, y)

acc = S.int32x8(0)
out = S.vqdpa(acc, x, y, mask)

Parameters

x (PrimExpr): The operand.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”, “float16/32”.

Examples

out = S.vrpadd(x)
out = S.vrpadd(x, mask)

Parameters

ptr (Pointer): The pointer to the memory where the result will be stored. It can be a scalar or vector float32 pointer, and the memory it points to must be large enough to hold a 4x4 float32 matrix in row-major order.
x (PrimExpr): The operand x, of vector type float16x16, representing 4x4 fp16 elements in row-major order.
y (PrimExpr): The operand y, of vector type float16x16, representing 4x4 fp16 elements in column-major order.

Supported DType

“float16”.

Examples

# The "vc_fp32_ptr" can be a scalar or vector float32 pointer, as long as the memory
# that it points to is large enough to store 4x4 float32 data.
S.vmml(vc_fp32_ptr, va_fp16x16, vb_fp16x16)
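
The layout requirements on x and y are easy to get wrong, so here is a NumPy sketch of how the two float16x16 operands could be packed and what result to expect. It assumes the operation is a 4x4 matrix multiplication with a float32 result, as the parameter descriptions suggest. `matmul_4x4` is a hypothetical host-side model, not the DSL call itself.

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.standard_normal((4, 4)).astype(np.float16)
b = rng.standard_normal((4, 4)).astype(np.float16)

# Pack the operands into flat 16-element vectors.
va_fp16x16 = a.reshape(16)      # x: row-major
vb_fp16x16 = b.T.reshape(16)    # y: column-major (same as b.flatten(order="F"))


def matmul_4x4(x16, y16):
    """Assumed model: (4x4 row-major x) @ (4x4 column-major y) -> 16 float32 values."""
    x = x16.reshape(4, 4).astype(np.float32)
    y = y16.reshape(4, 4).T.astype(np.float32)  # undo the column-major packing
    return (x @ y).reshape(16)                  # row-major float32 result


out = matmul_4x4(va_fp16x16, vb_fp16x16)
expected = a.astype(np.float32) @ b.astype(np.float32)
print(np.allclose(out.reshape(4, 4), expected))  # True
```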

Parameters

acc_ptr (Pointer): The pointer to the memory where the result will be stored. It can be a scalar or vector float32 pointer, and the memory it points to must be large enough to hold a 4x4 float32 matrix in row-major order.
x (PrimExpr): The operand x, of vector type float16x16, representing 4x4 fp16 elements in row-major order.
y (PrimExpr): The operand y, of vector type float16x16, representing 4x4 fp16 elements in column-major order.

Supported DType

“float16”.

Examples

# The "vc_fp32_ptr" can be a scalar or vector float32 pointer, as long as the memory
# that it points to is large enough to store 4x4 float32 data.
S.vmma(vc_fp32_ptr, va_fp16x16, vb_fp16x16)

Parameters

acc (PrimExpr): The accumulator register, which must be initialized.
x, y (Union[PrimExpr, float]): The operands. In the vector case, a scalar operand will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

“float32”.

Examples

acc = S.float32x8(10)
out = S.fma(acc, x, y, mask)

scalar_out = S.fma(scalar_acc, scalar_x, scalar_y)
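
As a rough host-side model of these examples, the sketch below assumes that fma computes acc + x * y per lane in float32, which is the conventional meaning of a fused multiply-add. It leaves the mask out, and NumPy rounds the product and the sum separately, whereas a true fused operation rounds once. `fma_model` is a hypothetical helper.

```python
import numpy as np


def fma_model(acc, x, y):
    """Assumed model: acc + x * y in float32, with scalars broadcast."""
    acc = np.asarray(acc, dtype=np.float32)
    x = np.asarray(x, dtype=np.float32)
    y = np.asarray(y, dtype=np.float32)
    return acc + x * y


acc = np.full(8, 10, dtype=np.float32)   # plays the role of S.float32x8(10)
x = np.arange(8, dtype=np.float32)
out = fma_model(acc, x, 2.0)             # scalar y is broadcast

scalar_out = fma_model(10.0, 3.0, 4.0)   # scalar form: 10 + 3 * 4

print(out.tolist())        # [10.0, 12.0, 14.0, 16.0, 18.0, 20.0, 22.0, 24.0]
print(float(scalar_out))   # 22.0
```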

Parameters

acc (PrimExpr): The accumulator register, which must be initialized.
x, y (Union[PrimExpr, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

# Only supported floating cases:
case  acc.dtype  x.dtype    y.dtype
1     "float32"  "float16"  "float16"

Examples

acc = S.float32x8(10)
out = S.vfmae(acc, x, y)

acc = S.float32x8(10)
out = S.vfmae(acc, x, y, mask)

Parameters

acc (PrimExpr): The accumulator register, which must be initialized.
x, y (Union[PrimExpr, float]): The operands. If either one is a scalar, it will be automatically broadcast.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

# Only supported floating cases:
case  acc.dtype  x.dtype    y.dtype
1     "float32"  "float16"  "float16"

Examples

acc = S.float32x8(10)
out = S.vfmao(acc, x, y)

acc = S.float32x8(10)
out = S.vfmao(acc, x, y, mask)

Parameters

x (PrimExpr): The operand, a vector.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

“float16/32”.

Examples

vc = S.vrint(va)
vc = S.vrint(va, mask="3T5F")
vc = S.vrint(va, mask=S.tail_mask(n, 8))

Parameters

x, min_val, max_val (Union[PrimExpr, int, float]): The operands. If any of them is a scalar, it will be automatically broadcast. Note that min_val must be less than max_val.
mask (Optional[Union[Tuple[bool], List[bool], numpy.ndarray[bool], str, PrimExpr]]): The predication mask to indicate which elements of the vector are active for the operation. None means all elements are active.

Returns

ret (PrimExpr): The result expression.

Supported DType

“int8/16/32”, “uint8/16/32”, “float16/32”.

Examples

b = S.clip(a, -10, 10)
vc = S.clip(va, vb, vc)
vc = S.clip(va, 3, 30)
vc = S.clip(va, vb, vc, mask="3T5F")
vc = S.clip(va, vb, vc, mask=S.tail_mask(n, 8))
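
For reference, the clamping these examples perform can be modeled on the host with `numpy.clip`, using either scalar bounds (broadcast to every lane) or per-lane vector bounds. This is a sketch of the expected values with all lanes active, not the DSL implementation; the array names are illustrative.

```python
import numpy as np

va = np.array([-20, -5, 0, 7, 15, 40, 3, 100], dtype=np.int32)

# Scalar bounds, broadcast to every lane (compare S.clip(va, 3, 30)).
scalar_bounds = np.clip(va, 3, 30)

# Per-lane bounds (compare S.clip(va, vb, vc)); each lane needs min < max.
v_min = np.array([0, 0, 0, 0, 10, 10, 10, 10], dtype=np.int32)
v_max = np.array([5, 5, 5, 5, 50, 50, 50, 50], dtype=np.int32)
vector_bounds = np.clip(va, v_min, v_max)

print(scalar_bounds.tolist())  # [3, 3, 3, 7, 15, 30, 3, 30]
print(vector_bounds.tolist())  # [0, 0, 0, 5, 15, 40, 10, 50]
```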
