C++ API example demonstrating how to perform reduced-precision matrix-matrix multiplication using MatMul, and how the accuracy of the result compares to floating-point computation.
Concepts:
Static and dynamic quantization
Asymmetric quantization
Zero points: dnnl::primitive_attr::set_zero_points_mask()
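The concepts above can be illustrated without the library. The following is a minimal sketch in plain C++, not the library's implementation and not the example's actual code: it assumes unsigned 8-bit asymmetric quantization with one scale and one zero point per tensor, derives those parameters from the data at run time (the dynamic case; in the static case they would be known ahead of time), multiplies with 32-bit integer accumulation, and dequantizes so the result can be compared against a floating-point reference. All names (`QParams`, `compute_qparams`, `quantize`, `quantized_matmul`, `reference_matmul`, `max_abs_diff`) are hypothetical helpers introduced for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper type: per-tensor quantization parameters.
struct QParams {
    float scale;             // real-value size of one integer step
    std::int32_t zero_point; // integer code that represents real 0.0
};

// Dynamic quantization: derive scale and zero point from the observed range.
// The range is widened to include 0.0 so that zero is exactly representable.
QParams compute_qparams(const std::vector<float> &x) {
    assert(!x.empty());
    float lo = 0.0f, hi = 0.0f;
    for (float v : x) {
        lo = std::min(lo, v);
        hi = std::max(hi, v);
    }
    float scale = (hi - lo) / 255.0f;
    if (scale == 0.0f) scale = 1.0f; // all-zero input: any scale works
    long zp = std::lround(-lo / scale);
    zp = std::min(255L, std::max(0L, zp));
    return QParams{scale, static_cast<std::int32_t>(zp)};
}

// Asymmetric quantization to u8: q = clamp(round(x / scale) + zero_point).
std::vector<std::uint8_t> quantize(const std::vector<float> &x, const QParams &q) {
    std::vector<std::uint8_t> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        long v = std::lround(x[i] / q.scale) + q.zero_point;
        out[i] = static_cast<std::uint8_t>(std::min(255L, std::max(0L, v)));
    }
    return out;
}

// C (MxN) = A (MxK) * B (KxN), row-major. Zero points are subtracted before
// the integer multiply; the s32 accumulator is scaled back to float at the end.
std::vector<float> quantized_matmul(const std::vector<std::uint8_t> &a, const QParams &qa,
        const std::vector<std::uint8_t> &b, const QParams &qb, int M, int K, int N) {
    std::vector<float> c(static_cast<std::size_t>(M) * N);
    for (int m = 0; m < M; ++m)
        for (int n = 0; n < N; ++n) {
            std::int32_t acc = 0;
            for (int k = 0; k < K; ++k)
                acc += (static_cast<std::int32_t>(a[m * K + k]) - qa.zero_point)
                        * (static_cast<std::int32_t>(b[k * N + n]) - qb.zero_point);
            c[m * N + n] = qa.scale * qb.scale * static_cast<float>(acc);
        }
    return c;
}

// Floating-point reference used to measure the quantization error.
std::vector<float> reference_matmul(const std::vector<float> &a,
        const std::vector<float> &b, int M, int K, int N) {
    std::vector<float> c(static_cast<std::size_t>(M) * N, 0.0f);
    for (int m = 0; m < M; ++m)
        for (int n = 0; n < N; ++n)
            for (int k = 0; k < K; ++k)
                c[m * N + n] += a[m * K + k] * b[k * N + n];
    return c;
}

// Largest element-wise absolute difference between two equally sized results.
float max_abs_diff(const std::vector<float> &x, const std::vector<float> &y) {
    assert(x.size() == y.size());
    float d = 0.0f;
    for (std::size_t i = 0; i < x.size(); ++i)
        d = std::max(d, std::fabs(x[i] - y[i]));
    return d;
}
```

To use the sketch, call `compute_qparams` and `quantize` on each input matrix, pass the results to `quantized_matmul`, and compare against `reference_matmul` with `max_abs_diff`. The per-element rounding error is at most half a quantization step for values inside the observed range, so the error in the product grows with the inner dimension K and with the magnitude of the inputs.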