This is needed for the upcoming implementation of the
new 8x32x16 and 32x8x16 variants of WMMA instructions
introduced in CUDA 9.1.
NFC intended, modulo name change.
Paths
| Differential D44719
[NVPTX] Make tensor shape part of WMMA intrinsic's name. ClosedPublic Authored by tra on Mar 20 2018, 4:50 PM.
Details Summary This is needed for the upcoming implementation of the NFC intended, modulo name change.
Diff Detail
Event TimelineHerald added subscribers: hiraditya, sanjoy, jholewinski. · View Herald TranscriptMar 20 2018, 4:50 PM This revision is now accepted and ready to land.Mar 21 2018, 4:34 AM Closed by commit rL328158: [NVPTX] Make tensor shape part of WMMA intrinsic's name. (authored by tra). · Explain WhyMar 21 2018, 3:00 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 139223 clang/lib/CodeGen/CGBuiltin.cpp
llvm/include/llvm/IR/IntrinsicsNVVM.td
llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp
llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
llvm/test/CodeGen/NVPTX/wmma.py
|