This work introduces : wgmma.fence.aligned, wgmma.commit.group.sync.aligned and wgmma.wait.group.sync.aligned Ops. They are used to syncronize warpgroup level matrix multiply-accumulate instructions, as known as WGMMA.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td | ||
---|---|---|
1429 | can we add a description for each op that at least points to the proper place in the cuda documentation? |
can we add a description for each op that at least points to the proper place in the cuda documentation?