This is an archive of the discontinued LLVM Phabricator instance.

[NVPTX, CUDA] Added support for m8n32k16 and m32n8k16 variants of wmma instructions.
ClosedPublic

Authored by tra on Mar 29 2018, 3:54 PM.

Diff Detail

Repository
rL LLVM

Event Timeline

tra created this revision.Mar 29 2018, 3:54 PM
jlebar accepted this revision.Mar 29 2018, 4:14 PM
jlebar added inline comments.
clang/lib/Driver/ToolChains/Cuda.cpp
578 ↗(On Diff #140354)

CUDA-9.1?

This revision is now accepted and ready to land.Mar 29 2018, 4:14 PM
tra updated this revision to Diff 140363.Mar 29 2018, 4:42 PM

Updated comments.

tra marked an inline comment as done.Mar 29 2018, 4:43 PM
tra updated this revision to Diff 142262.Apr 12 2018, 2:30 PM

Updated BuiltinsNVPTX.def and tests to deal with changes in the way we deal with required features in TARGET_BUILTIN.

tra updated this revision to Diff 143003.EditedApr 18 2018, 2:53 PM

Rebased to TOT

This revision was automatically updated to reflect the committed changes.