Add a 16-bit version of cudaMemset in cudaRuntimeWrappers and update the GPU-to-LLVM lowering.
Repository: rG LLVM Github Monorepo
[MLIR][GPU] Add 16-bit version of cudaMemset in cudaRuntimeWrappers
mlir/lib/Conversion/GPUCommon/GPUToLLVMConversion.cpp | ||
---|---|---|
1377–1378 | Nit: Avoid multiple calls to getIntOr... |
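To illustrate the nit above, here is a minimal sketch of querying a value once and reusing the result. `ValueType`, `bitWidth()` and `isSupportedMemsetWidth` are hypothetical stand-ins, not the code under review, and the 16/32-bit check is an assumption based on the patch description.

```cpp
#include <cassert>

// Hypothetical stand-in for a type with a bit-width query; it counts how
// often the query runs so the effect of caching is observable.
struct ValueType {
  explicit ValueType(unsigned w) : width(w) {}
  unsigned bitWidth() const {
    ++queries;
    return width;
  }
  unsigned width;
  mutable int queries = 0;
};

// Assumption: only 16- and 32-bit values are handled by the memset lowering.
bool isSupportedMemsetWidth(const ValueType &type) {
  // Query once and reuse the local instead of calling the getter per check.
  const unsigned bitWidth = type.bitWidth();
  assert(bitWidth != 0 && "expected a sized value type");
  return bitWidth == 16 || bitWidth == 32;
}
```

The local also gives the width a name that later code (such as choosing between the 16- and 32-bit paths) can reuse.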
mlir/lib/Conversion/GPUCommon/GPUToLLVMConversion.cpp | ||
---|---|---|
1396–1398 | Use the ternary operator to initialize the builder ref (...& builder = ... ? memset32... : memset16...;) to avoid duplication. |
mlir/lib/Conversion/GPUCommon/GPUToLLVMConversion.cpp | ||
---|---|---|
1396–1398 | This isn't what I meant -- this won't save anything in terms of duplication. Use the ternary operator to assign the right builder. |
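A minimal, self-contained sketch of what the reviewer appears to be asking for: bind a reference to the right builder with the ternary operator, then write the call once. `CallBuilder`, `emitMemset` and the callee names `memset16`/`memset32` are hypothetical stand-ins for illustration only and do not reproduce the actual code in GPUToLLVMConversion.cpp.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a per-function call builder.
struct CallBuilder {
  std::string callee;
  // Pretends to emit a call and returns its textual form.
  std::string create(const std::string &args) const {
    return callee + "(" + args + ")";
  }
};

// Emits a memset call for a 16- or 32-bit value (assumed supported widths).
std::string emitMemset(unsigned bitWidth, const std::string &args) {
  assert((bitWidth == 16 || bitWidth == 32) && "unsupported memset width");
  static const CallBuilder memset16Builder{"memset16"};
  static const CallBuilder memset32Builder{"memset32"};

  // Both operands are lvalues of the same type, so the conditional
  // expression is an lvalue and the reference binds without copying.
  const CallBuilder &builder =
      bitWidth == 32 ? memset32Builder : memset16Builder;

  // The call site is written once, independent of the chosen width.
  return builder.create(args);
}
```

Wrapping each branch's full call in a ternary would still repeat the argument list; selecting only the builder keeps a single call site.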
Please add in sorted order. One place above.