This allows user to give both the thread ids and dimension of the threads we want to distribute on.
This means we can use it to distribute on warps as well.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Looks good overall! Thanks for getting this to work with warps!
mlir/lib/Dialect/GPU/TransformOps/GPUTransformOps.cpp | ||
---|---|---|
437 | is threadOpsUpdated size guaranteed to be the same as globalBlockDims size? If not, this might result in out of bounds access. Maybe have an assert that threadOps size is 3? |
Comment Actions
address review comment
mlir/lib/Dialect/GPU/TransformOps/GPUTransformOps.cpp | ||
---|---|---|
437 | Good point, added an assert |
is threadOpsUpdated size guaranteed to be the same as globalBlockDims size? If not, this might result in out of bounds access. Maybe have an assert that threadOps size is 3?