User Details
- User Since
- Nov 1 2021, 3:10 AM (99 w, 5 d)
Aug 17 2023
Update alignment for int32 type lds_size argument
Make lds size argument 32 bit int type.
Jul 19 2023
Jun 5 2023
May 30 2023
Update test.
May 29 2023
May 19 2023
May 11 2023
Rebase.
May 5 2023
Updated patch as per feedback from @Pierre-vh
May 4 2023
expandImpliedFeatures gets all the features implied by current GPU. But wavefrontsize32 and 64 are not part of the feature list for gfx10 and above targets in AMDGPU.td. AFAIU, It is assumed those subtarget supports both the features if they are not part of the list. So GPUFeatureBits cannot be relied on to query FeatureWavefrontSize32/64.
May 1 2023
Remove wavefrontsize64 function if compiling in wave32 mode.
Apr 29 2023
Updated patch to delete the function if FeatureWavefrontSize32 is not supported by current GPU.
Apr 27 2023
Apr 25 2023
Rebase.
Apr 21 2023
Apr 17 2023
Apr 14 2023
Port MBUF related tests from global-isel containing all the test combinations.
Apr 12 2023
Apr 11 2023
Apr 10 2023
Rebase. Update test.
Apr 9 2023
Apr 6 2023
Apr 4 2023
Changes done as per review by @arsenm
Hi @nikic. I have made the changes that were suggested previously. Could you please review?
Rebase. Update test control-flow-fastregalloc.ll
Apr 2 2023
Rebase.
Changes done as per @arsenm review.
Mar 30 2023
Rebase
Fix mul.ll test failure.
Mar 29 2023
Mar 16 2023
Mar 10 2023
Feb 28 2023
Use ReversePostOrderTraversal list to compare machine instructions from different basic blocks.
Feb 21 2023
Feb 13 2023
Rebase. Ping
Feb 10 2023
Feb 9 2023
Introduce SIInstrWorklist. It has a std::set with comparison operator to store instructions as per order in machine function. This tries to solve the previous issue with sorting the vector in every iteration.
Feb 5 2023
Jan 21 2023
Make changes to legalize soffset and rsrc together.
Jan 16 2023
Rebase
Jan 10 2023
Rebase
Jan 9 2023
Rebase
Jan 5 2023
Use loadMBUFScalarOperandFromVGPR to build waterfall loop for soffset and srsrc.
Jan 4 2023
Jan 2 2023
Remove check for src array type.
Drop last zero from resulting NewIndices after matching the types.
Drop triple from test. Updated test checks with update_test_checks.py.
Added new tests for struct of array type and to check new zero index in resulting gep.
Dec 23 2022
Dec 15 2022
Rebase. Use poison for srsrc operands.
Rebase. Ping
Dec 13 2022
Removed inreg to params of tests. Made srsrc operand undef.
Dec 12 2022
Rebase. Use -passes=instcombine in test.
Dec 9 2022
Made changes to only identify copy and its result used by soffset of MUBUF/MTBUF. needToBeConvertedToVALU returns false if such pattern is found.
This also fixes vgpr-descriptor-waterfall-loop-idom-update.ll test, where the previous revision of the patch doesn't generate waterfall loop.
Dec 7 2022
Rebase.
Dec 5 2022
Thanks @nikic for feedback. Made changes to match Src ResultElementType with GEP SrcElementType by adding additional zero indices to Src gep.
Dec 4 2022
Update code as per @jmmartinez comments.
Nov 30 2022
Added comments to tests.
Rebase.
Nov 24 2022
Rebase.
Nov 23 2022
Nov 17 2022
Nov 16 2022
Nov 8 2022
Nov 7 2022
changes as per review suggestion.