Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Not sure about the defaults but looks good otherwise. Also +1 to @bondhugula comments. Please address those first.
mlir/include/mlir/Dialect/GPU/Transforms/Passes.h | ||
---|---|---|
129–131 | Why do we need to give these defaults, at all? |
mlir/include/mlir/Dialect/GPU/Transforms/Passes.h | ||
---|---|---|
129–131 | @bondhugula @herhut I have removed the default arguments here, and left newer defaults ones in the class constructor - should help someone immediately know what should go in there at a glance. | |
mlir/lib/Dialect/GPU/Transforms/SerializeToCubin.cpp | ||
148–149 | Removed. |
Changing the defaults makes some integration tests fail if a toolchain doesn't support sm_80, e.g. mlir/test/Integration/GPU/CUDA:all-reduce-max.mlir.test
Error: failed with error code the provided PTX was compiled with an unsupported toolchain.[ptxas application ptx input, line 5; fatal : Unsupported .version 7.5; current version is '7.4']
Was it really necessary to also change the defaults?
Any reason to default this to old hardware - how about sm_80 instead?