User Details
- User Since
- Sep 6 2020, 6:45 AM (142 w, 5 d)
Today
Address comments.
Mon, May 29
[MLIR][GPU] Add 16-bit version of cudaMemset in cudaRuntimeWrappers
Sun, May 28
Dec 4 2022
Sync with upstream
Dec 3 2022
Remove whitespace errors.
Dec 2 2022
Address comments
Nov 30 2022
Update SPIR-V lowerings.
Address comments
Clang-format
Jun 29 2021
Jun 10 2021
Jun 9 2021
This is a great addition. We could also bring in a scaling op that scales an mmaMatrix by a given value. Maybe I can take that up.
May 27 2021
Thanks! This is something I wasn't aware of. BTW, I tested these on a Turing GPU with CUDA 10.2 and they passed, but they may fail on other devices.
This is a great addition.
May 25 2021
May 22 2021
Great to see this, but I am not fully sure why the earlier key did not work. Isn't the underlying data used to generate the hash code for ArrayRef<> and StringRef<>? Here are some references from llvm/include/llvm/ADT/ArrayRef.h and llvm/lib/Support/StringRef.cpp:
template <typename T> hash_code hash_value(ArrayRef<T> S) {
  return hash_combine_range(S.begin(), S.end());
}

// Implementation of StringRef hashing.
hash_code llvm::hash_value(StringRef S) {
  return hash_combine_range(S.begin(), S.end());
}
Fix commit summary and title.
Changes in this diff :-
Closing revision as changes here were moved to D95331.
Closing revision, as recent changes (removal of mlir-cuda-runner and others) have baked the functionality of this patch into convert-gpu-to-nvvm and gpu-to-cubin.
Changes in this diff :-
Changes in this diff:-
May 21 2021
Rebase on upstream/main.
Rebase on upstream/main.
Changes in this diff :-
Changes in this diff :-
May 14 2021
Changes in this diff :-
1.) Fix formatting in WmmaOpsToNvvmLowering.cpp.
Changes in this diff :-
1.) Address comments on previous diff (343284).
May 5 2021
Changes in this diff :-
1.) Clang-format fix.
2.) Added TODO to generate MMAMatrix via ODS.
Changes in this diff :-
1.) Address comments on previous diff (342267).
Changes in this diff :-
1.) Address comments on previous diff (342265).
May 2 2021
Changes in this diff :-
1.) Make changes to operate with the newly introduced gpu.mma_matrix type.
Changes in this diff :-
1.) Rebase on upstream/main.
3.) Make changes to operate with the newly introduced gpu.mma_matrix type.
Changes in this diff :-
1.) Rebase on upstream/main.
2.) Address comments on previous diff (324324).
3.) Remove gpu.mmafragment and introduce the gpu.mma_matrix type.
Apr 27 2021
Apr 12 2021
Hi @ThomasRaoux,
Sorry for the late reply. Great to hear that these ops can be reused in the IREE pipeline too. I was busy with some parallel work that uses these ops, getting it ready for an upcoming submission. The comments regarding the types are still to be addressed. I will certainly work on this, but I will only get started on any major changes next week. As you mention, it would be great to know what your plans are and how you wish to proceed.
Mar 7 2021
Hi all, thanks for the valuable comments. @ThomasRaoux, thanks for clarifying things on the SPIR-V side.
Mar 3 2021
Feb 17 2021
Changes in this diff :-
Changes in this diff :-
Changes in this diff:-
Feb 9 2021
Hi, thanks for the comments.
Feb 4 2021
Changes in this diff :-
1.) Modify the test case to use the !gpu.mmafragment type introduced in revision D95330.
Changes in this diff:-
1.) Add CL option to pass index-bitwidth for LLVM lowering passes on the device side.
Changes in this diff :-
1.) Rebase on master to drop the use of LLVMType.
2.) Make changes to use the !gpu.mmafragment type introduced in parent revision D95330.
Feb 3 2021
Issues in diff 318905 :-
1.) The design used memrefs to model mmafragments, and these were allocated in `.local` space in the generated PTX. This completely defeated the purpose of wmmaOps, which is to use operands in `.reg` space.
Jan 24 2021
Jan 8 2021
Dec 5 2020
Address comments on diff 308060
Nov 27 2020
Sep 17 2020
Remove include directive "block.h" from "TestAffineLoopParametricTiling.cpp"
Address comments on diff (291964).
Sep 16 2020
Sep 15 2020
Address comments on diff (291253).
Sep 11 2020
Remove auto from appropriate places in TestAffineLoopParametricTiling.cpp
Address comments on diff (290923).
Sep 10 2020
Address comments on diff (290734).
Sep 9 2020
Address comments on initial diff (290671)