This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][MC][MI100+] Enable VOP3 variants of dot2c/dot4c/dot8c opcodes
ClosedPublic

Authored by dp on Nov 22 2022, 6:25 AM.

Details

Summary

Unfortunately AMD documentation does not specify which modifiers should be supported for these instructions.
The patch enables abs, neg, clamp and omod modifiers for v_dot2c_f32_f16 and clamp for remaining dotc opcodes, but it is a somewhat arbitrary decision.

Diff Detail