User Details
- User Since: Dec 16 2022, 3:39 AM (15 w, 2 d)
Fri, Mar 31
OutOfCache updated the diff for D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
- add internal error if the mask size is not 32 or 64
Thu, Mar 30
OutOfCache added inline comments to D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
OutOfCache updated the diff for D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
- Simplify SelDAG with call to SIInstrInfo::readlaneVGPRToSGPR
- Add comment to clarify that current GISel test is incorrect
- remove [VCC] form pseudo instruction def
OutOfCache added inline comments to D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
Tue, Mar 28
After further discussion, @mareko is right and the waitcnts are necessary. Thanks for bringing that up!
OutOfCache added a comment to D146829: [AMDGPU] Remove unnecessary waitcnts.
OutOfCache added inline comments to D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
Mon, Mar 27
OutOfCache added a comment to D146829: [AMDGPU] Remove unnecessary waitcnts.
OutOfCache added inline comments to D146829: [AMDGPU] Remove unnecessary waitcnts.
OutOfCache updated the summary of D146829: [AMDGPU] Remove unnecessary waitcnts.
OutOfCache updated the diff for D146829: [AMDGPU] Remove unnecessary waitcnts.
- editing DSInstructions.td instead of SIInsertWaitcnts.cpp
Fri, Mar 24
OutOfCache updated the summary of D146829: [AMDGPU] Remove unnecessary waitcnts.
OutOfCache requested review of D146829: [AMDGPU] Remove unnecessary waitcnts.
Tue, Mar 21
OutOfCache updated the diff for D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
- Fix typos
- Rename variable
OutOfCache added inline comments to D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
OutOfCache updated the diff for D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
- Simplify GISel Implementation by using legalizeOperands
- Remove unnecessary checks.
- Remove redundant tests and move GISel ones to SDAG tests
- Increase readability.
Fri, Mar 17
OutOfCache added inline comments to D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
OutOfCache updated the summary of D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
OutOfCache added reviewers for D146287: [AMDGPU][GISel] Add inverse ballot intrinsic: tsymalla, nhaehnle, arsenm, critson.
OutOfCache requested review of D146287: [AMDGPU][GISel] Add inverse ballot intrinsic.
Feb 22 2023
[AMDGPU] Improved wide multiplies
Feb 21 2023
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Remove gfx11 from MIR tests
OutOfCache committed rGc9fd858172d0: [AMDGPU] MIR-Tests for Multiplication using KBA (authored by OutOfCache).
[AMDGPU] MIR-Tests for Multiplication using KBA
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Added MIR Tests for gfx10 and gfx11.
Feb 20 2023
OutOfCache committed rG959216f9b1f1: [AMDGPU] MIR-Tests for Multiplication using KBA (authored by OutOfCache).
[AMDGPU] MIR-Tests for Multiplication using KBA
Feb 14 2023
OutOfCache added inline comments to D140208: [AMDGPU] Improved wide multiplies.
Jan 23 2023
OutOfCache added inline comments to D140208: [AMDGPU] Improved wide multiplies.
OutOfCache added inline comments to D140208: [AMDGPU] Improved wide multiplies.
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [AMDGPU] reverting separation of for-loops
Jan 22 2023
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [AMDGPU] reverting separation of for-loops
Jan 20 2023
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [AMDGPU] reverting separation of for-loops
Jan 16 2023
OutOfCache added inline comments to D140208: [AMDGPU] Improved wide multiplies.
Jan 11 2023
OutOfCache added inline comments to D140208: [AMDGPU] Improved wide multiplies.
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [AMDGPU] Removing redundant zero-check for mults
Jan 10 2023
OutOfCache added inline comments to D140208: [AMDGPU] Improved wide multiplies.
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [GISel] Adding KnownBitsAnalysis to Legalizer
- [AMDGPU] Wide multiplies with Known Bits Analysis
- [AMDGPU] Improved wide multiplies tests
- [GISel/AMDGPU] caching results of isZero()
- [AMDGPU] Inlining conditions in buildMultiply
- [AMDGPU] Removing redundant zero-check for mults
OutOfCache committed rGf33633f51243: [AMDGPU] adding test for partially masked operands (authored by OutOfCache).
[AMDGPU] adding test for partially masked operands
Jan 5 2023
OutOfCache added inline comments to D140907: [GlobalISel] New combine to commute constant operands to the RHS.
Dec 27 2022
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [AMDGPU] Wide multiplies with Known Bits Analysis
- [AMDGPU] Improved wide multiplies tests
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [AMDGPU] Improved wide multiplies tests
Dec 26 2022
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [GISel] Adding KnownBitsAnalysis to Legalizer
- [AMDGPU] Wide multiplies with Known Bits Analysis
- [AMDGPU] Improved wide multiplies tests
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Fixing constructor issues by moving the default parameter to the header.
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Fixing constructor issues by moving the default parameter to the header.
Dec 24 2022
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Improved tests with better naming.
Better variable naming in LegalizerInfo.
Added constructor again because that was causing issues.
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
- [AMDGPU] Improved naming of mul-known-bits tests
- [AMDGPU] Better variable naming in LegalizerInfo
Dec 23 2022
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Improved tests with better naming.
Better variable naming in LegalizerInfo.
Added constructor again because that was causing issues.
Dec 20 2022
OutOfCache committed rG5ee13e6c6527: [AMDGPU] Wide multiplies tests for D140208 (authored by OutOfCache).
[AMDGPU] Wide multiplies tests for D140208
Dec 16 2022
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Adding the new test file
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Addressing the redundant constructor and minor format issues
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Adding missing commits
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Fixing indentation
OutOfCache retitled D140208: [AMDGPU] Improved wide multiplies from [AMDGPU] Improved wide multiplies tests to [AMDGPU] Improved wide multiplies.
OutOfCache updated the diff for D140208: [AMDGPU] Improved wide multiplies.
Added missing commits
OutOfCache requested review of D140208: [AMDGPU] Improved wide multiplies.