This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Add WMMA clang builtins
ClosedPublic

Authored by piotr on Jun 30 2022, 3:01 PM.

Details

Summary

Add WMMA clang builtins and tests. Extra changes in code
are needed to handle function overloads.

WavefrontSize 32:
builtin_amdgcn_wmma_f32_16x16x16_f16_w32
builtin_amdgcn_wmma_f32_16x16x16_bf16_w32
builtin_amdgcn_wmma_f16_16x16x16_f16_w32
builtin_amdgcn_wmma_bf16_16x16x16_bf16_w32
builtin_amdgcn_wmma_i32_16x16x16_iu8_w32
builtin_amdgcn_wmma_i32_16x16x16_iu4_w32

WavefrontSize 64:
builtin_amdgcn_wmma_f32_16x16x16_f16_w64
builtin_amdgcn_wmma_f32_16x16x16_bf16_w64
builtin_amdgcn_wmma_f16_16x16x16_f16_w64
builtin_amdgcn_wmma_bf16_16x16x16_bf16_w64
builtin_amdgcn_wmma_i32_16x16x16_iu8_w64
builtin_amdgcn_wmma_i32_16x16x16_iu4_w64

Diff Detail

Event Timeline

piotr created this revision.Jun 30 2022, 3:01 PM
Herald added a project: Restricted Project. · View Herald TranscriptJun 30 2022, 3:01 PM
piotr requested review of this revision.Jun 30 2022, 3:01 PM
Herald added a project: Restricted Project. · View Herald TranscriptJun 30 2022, 3:01 PM
This revision is now accepted and ready to land.Jun 30 2022, 3:44 PM
This revision was landed with ongoing or failed builds.Jun 30 2022, 11:56 PM
This revision was automatically updated to reflect the committed changes.