
critson (Carl Ritson)
User

Projects

User does not belong to any projects.

User Details

User Since
Sep 4 2018, 4:49 AM (247 w, 3 d)

Recent Activity

Yesterday

critson committed rG2e87ed80b23a: [AMDGPU] WQM: Allow insertion of exact mode transition as terminator (authored by critson).
Thu, Jun 1, 10:01 PM · Restricted Project, Restricted Project
critson closed D151797: [AMDGPU] WQM: Allow insertion of exact mode transition as terminator.
Thu, Jun 1, 10:01 PM · Restricted Project, Restricted Project
critson added inline comments to D151797: [AMDGPU] WQM: Allow insertion of exact mode transition as terminator.
Thu, Jun 1, 6:33 PM · Restricted Project, Restricted Project
critson added a comment to D151797: [AMDGPU] WQM: Allow insertion of exact mode transition as terminator.

Missing test updates?

Thu, Jun 1, 12:52 AM · Restricted Project, Restricted Project

Wed, May 31

critson added a comment to D145329: AMDGPU: Always split blocks for si_end_cf.

Sorry for the delay. I believe D151797 should allow this to proceed.

Wed, May 31, 5:11 AM · Restricted Project, Restricted Project
critson requested review of D151797: [AMDGPU] WQM: Allow insertion of exact mode transition as terminator.
Wed, May 31, 5:06 AM · Restricted Project, Restricted Project

Wed, May 17

critson committed rG9602c7a0817f: [AMDGPU][Uniformity] V_MBCNT* is never uniform (authored by critson).
Wed, May 17, 9:51 PM · Restricted Project, Restricted Project
critson closed D150759: [AMDGPU][Uniformity] V_MBCNT* is never uniform.
Wed, May 17, 9:50 PM · Restricted Project, Restricted Project
critson requested review of D150759: [AMDGPU][Uniformity] V_MBCNT* is never uniform.
Wed, May 17, 12:26 AM · Restricted Project, Restricted Project
critson committed rG0b9571fe598b: [AMDGPU] Add missing whitespace in SIInstrInfo header (NFC) (authored by critson).
Wed, May 17, 12:24 AM · Restricted Project, Restricted Project
critson committed rGcd811e2421a4: [AMDGPU][UniformityAnalysis] Fix typos in test comment (NFC) (authored by critson).
Wed, May 17, 12:23 AM · Restricted Project, Restricted Project

Mon, May 8

critson accepted D150105: [AMDGPU] Recompute liveness in SIOptimizeExecMaskingPreRA.

LGTM

Mon, May 8, 4:53 PM · Restricted Project, Restricted Project

Apr 25 2023

critson accepted D149184: [AMDGPU] Delete test for illegal v_cndmask_b16_dpp.

LGTM

Apr 25 2023, 5:27 PM · Restricted Project, Restricted Project

Apr 7 2023

critson added a comment to D147721: [AMDGPU] Fix a case of updating LiveIntervals in SIOptimizeExecMaskingPreRA.

I assume D129208 is the test case for this, so it does not need any new tests?

Apr 7 2023, 1:36 AM · Restricted Project, Restricted Project

Apr 5 2023

critson added a comment to D145329: AMDGPU: Always split blocks for si_end_cf.

My intention is to sit down and try to make WQM work with this.
It is a non-trivial change, so no promises, but I will try to look at it in the next few days.

Apr 5 2023, 3:16 AM · Restricted Project, Restricted Project

Mar 27 2023

critson accepted D145315: AMDGPU: Fix missing MIR serialization for PSInputAddr/PSInputEnable.

LGTM

Mar 27 2023, 5:48 PM · Restricted Project, Restricted Project
critson added a comment to D145315: AMDGPU: Fix missing MIR serialization for PSInputAddr/PSInputEnable.

LGTM, but one question about the diff preparation.

Mar 27 2023, 5:14 PM · Restricted Project, Restricted Project

Mar 13 2023

critson committed rG2aaaed3527de: [IRLinker] Fix mapping of declaration metadata (authored by critson).
Mar 13 2023, 4:43 PM · Restricted Project, Restricted Project
critson closed D145318: [IRLinker] Fix mapping of declaration metadata.
Mar 13 2023, 4:43 PM · Restricted Project, Restricted Project

Mar 12 2023

critson added inline comments to D145318: [IRLinker] Fix mapping of declaration metadata.
Mar 12 2023, 6:25 PM · Restricted Project, Restricted Project
critson updated the diff for D145318: [IRLinker] Fix mapping of declaration metadata.
  • Exclude global objects without metadata from unmapped set
  • Add assertion to cover the case of intrinsics with metadata
Mar 12 2023, 6:22 PM · Restricted Project, Restricted Project

Mar 9 2023

critson added a comment to D145318: [IRLinker] Fix mapping of declaration metadata.

Can you re-upload with context?

Mar 9 2023, 11:02 PM · Restricted Project, Restricted Project
critson updated the diff for D145318: [IRLinker] Fix mapping of declaration metadata.
  • Add missing context
Mar 9 2023, 11:01 PM · Restricted Project, Restricted Project
critson reopened D145318: [IRLinker] Fix mapping of declaration metadata.
Mar 9 2023, 11:01 PM · Restricted Project, Restricted Project
critson updated the diff for D145318: [IRLinker] Fix mapping of declaration metadata.
  • Add missing erasure of functions which are erased through renaming.
Mar 9 2023, 12:02 AM · Restricted Project, Restricted Project

Mar 8 2023

critson added a reverting change for rGbe0ef4b95d94: [IRLinker] Fix mapping of declaration metadata: rGe510d0bda087: Revert "[IRLinker] Fix mapping of declaration metadata".
Mar 8 2023, 3:43 AM · Restricted Project, Restricted Project
critson committed rGe510d0bda087: Revert "[IRLinker] Fix mapping of declaration metadata" (authored by critson).
Mar 8 2023, 3:43 AM · Restricted Project, Restricted Project
critson added a reverting change for D145318: [IRLinker] Fix mapping of declaration metadata: rGe510d0bda087: Revert "[IRLinker] Fix mapping of declaration metadata".
Mar 8 2023, 3:43 AM · Restricted Project, Restricted Project

Mar 7 2023

critson committed rGbe0ef4b95d94: [IRLinker] Fix mapping of declaration metadata (authored by critson).
Mar 7 2023, 11:18 PM · Restricted Project, Restricted Project
critson closed D145318: [IRLinker] Fix mapping of declaration metadata.
Mar 7 2023, 11:17 PM · Restricted Project, Restricted Project

Mar 5 2023

critson updated the diff for D145318: [IRLinker] Fix mapping of declaration metadata.
  • Remove unnecessary erase calls, simplifying implementation
  • Extend comment
Mar 5 2023, 11:16 PM · Restricted Project, Restricted Project
critson committed rG8e1038bd1596: [ValueMapper] Preserve poison types during value mapping (authored by critson).
Mar 5 2023, 9:22 PM · Restricted Project, Restricted Project
critson closed D145317: [ValueMapper] Preserve poison types during value mapping.
Mar 5 2023, 9:22 PM · Restricted Project, Restricted Project
critson added a comment to D145318: [IRLinker] Fix mapping of declaration metadata.

Is this a case where we didn't in the past have metadata attachments on declarations requiring mapping, and now we do? Also, the example uses a "!types" metadata - I'm familiar with !type metadata on definitions, but haven't seen !types before - is this new metadata? I can't find any uses of it in the test suite or any documentation.

Mar 5 2023, 6:43 PM · Restricted Project, Restricted Project
critson added a comment to D145329: AMDGPU: Always split blocks for si_end_cf.

Unfortunately this interferes with WQM mode change insertion.
You can see this in the reordered s_or + s_and instruction pairs.
I guess this was always a risk with block splitting.

Mar 5 2023, 6:16 PM · Restricted Project, Restricted Project
critson requested review of D145318: [IRLinker] Fix mapping of declaration metadata.
Mar 5 2023, 12:21 AM · Restricted Project, Restricted Project

Mar 4 2023

critson requested review of D145317: [ValueMapper] Preserve poison types during value mapping.
Mar 4 2023, 10:25 PM · Restricted Project, Restricted Project

Feb 22 2023

critson added a comment to D144034: [AMDGPU][GFX11] Legalize and select partial NSA MIMG instructions.

Thank you for implementing this.
I have left a few minor comments inline.

Feb 22 2023, 12:31 AM · Restricted Project, Restricted Project

Dec 19 2022

critson committed rG5bc703f75557: [AMDGPU] Replace getPhysRegClass with getPhysRegBaseClass (authored by critson).
Dec 19 2022, 11:23 PM · Restricted Project, Restricted Project
critson closed D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.
Dec 19 2022, 11:23 PM · Restricted Project, Restricted Project
critson committed rGd393d0d24239: [TableGen] Emit table mapping physical registers to base classes (authored by critson).
Dec 19 2022, 10:46 PM · Restricted Project, Restricted Project
critson closed D139616: [TableGen] Emit table mapping physical registers to base classes.
Dec 19 2022, 10:46 PM · Restricted Project, Restricted Project
critson added inline comments to D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.
Dec 19 2022, 1:18 AM · Restricted Project, Restricted Project

Dec 15 2022

critson added a comment to D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.

@arsenm - please can you elucidate the changes you'd like made?

Dec 15 2022, 5:52 PM · Restricted Project, Restricted Project

Dec 13 2022

critson added inline comments to D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.
Dec 13 2022, 6:28 PM · Restricted Project, Restricted Project
critson updated the diff for D139616: [TableGen] Emit table mapping physical registers to base classes.
  • Extend available base class range by one
Dec 13 2022, 3:32 PM · Restricted Project, Restricted Project

Dec 9 2022

critson updated the diff for D139616: [TableGen] Emit table mapping physical registers to base classes.
  • Address reviewer comments
Dec 9 2022, 5:36 PM · Restricted Project, Restricted Project
critson added inline comments to D139616: [TableGen] Emit table mapping physical registers to base classes.
Dec 9 2022, 5:35 PM · Restricted Project, Restricted Project
critson updated the diff for D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.
  • Avoid passing virtual registers to getPhysRegBaseClass in IsCopyFromSGPR
Dec 9 2022, 3:19 PM · Restricted Project, Restricted Project
critson updated the diff for D139616: [TableGen] Emit table mapping physical registers to base classes.
  • Use assertion instead of range check when resolving base classes
Dec 9 2022, 3:13 PM · Restricted Project, Restricted Project

Dec 8 2022

critson updated the diff for D139616: [TableGen] Emit table mapping physical registers to base classes.
  • Fix trivial off by one error in base class table
Dec 8 2022, 10:33 PM · Restricted Project, Restricted Project
critson updated the diff for D139616: [TableGen] Emit table mapping physical registers to base classes.
  • Address reviewer feedback
  • Add lit test
Dec 8 2022, 9:45 PM · Restricted Project, Restricted Project
critson added inline comments to D139616: [TableGen] Emit table mapping physical registers to base classes.
Dec 8 2022, 9:45 PM · Restricted Project, Restricted Project
critson updated the diff for D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.
  • Remove stray code from earlier revision
Dec 8 2022, 3:36 AM · Restricted Project, Restricted Project
critson updated the diff for D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.
  • Turn this into a proof of concept for D139616
Dec 8 2022, 3:11 AM · Restricted Project, Restricted Project
critson requested review of D139616: [TableGen] Emit table mapping physical registers to base classes.
Dec 8 2022, 2:55 AM · Restricted Project, Restricted Project

Dec 6 2022

critson added a comment to D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.

Ideally this would be a static table, but the changes required to TableGen to understand register classes for this are non-trivial.

Why? The regclass structure is exactly represented there already

Dec 6 2022, 11:23 PM · Restricted Project, Restricted Project
critson added a comment to D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.

Ideally this would be a static table, but the changes required to TableGen to understand register classes for this are non-trivial.

Dec 6 2022, 5:50 AM · Restricted Project, Restricted Project
critson requested review of D139422: [AMDGPU] Accelerate SIRegisterInfo::getPhysRegClass.
Dec 6 2022, 5:48 AM · Restricted Project, Restricted Project

Oct 27 2022

critson committed rGa3646ec1bc66: [AMDGPU] Add pseudo wavemode to optimize strict_wqm (authored by critson).
Oct 27 2022, 5:46 PM · Restricted Project, Restricted Project
critson closed D136813: [AMDGPU] Add pseudo wavemode to optimize strict_wqm.
Oct 27 2022, 5:46 PM · Restricted Project, Restricted Project
critson added inline comments to D136813: [AMDGPU] Add pseudo wavemode to optimize strict_wqm.
Oct 27 2022, 2:04 AM · Restricted Project, Restricted Project

Oct 26 2022

critson requested review of D136813: [AMDGPU] Add pseudo wavemode to optimize strict_wqm.
Oct 26 2022, 9:03 PM · Restricted Project, Restricted Project

Oct 25 2022

critson accepted D136718: AMDGPU: Fix assert when trying to overextend liverange.

LGTM

Oct 25 2022, 8:01 PM · Restricted Project, Restricted Project

Oct 5 2022

critson committed rGc316332e1789: [Sink] Allow sinking of invariant loads across critical edges (authored by critson).
Oct 5 2022, 5:22 PM · Restricted Project, Restricted Project
critson closed D135133: [Sink] Allow sinking of invariant loads across critical edges.
Oct 5 2022, 5:21 PM · Restricted Project, Restricted Project

Oct 4 2022

critson updated the diff for D135133: [Sink] Allow sinking of invariant loads across critical edges.
  • Add test llvm/test/Transforms/Sink/invariant-load.ll
Oct 4 2022, 6:06 PM · Restricted Project, Restricted Project
critson abandoned D135071: [AMDGPU] Implement TargetTransformInfo canAlwaysSinkRead.

Superseded by D135133.

Oct 4 2022, 1:03 AM · Restricted Project, Restricted Project
critson abandoned D135070: [TTI][Sink] Add target support for sinking more reads.

Logically I'd like some kind of control over the sinking process, but I guess that can be handled by removing invariant metadata as appropriate.

Oct 4 2022, 1:03 AM · Restricted Project, Restricted Project
critson requested review of D135133: [Sink] Allow sinking of invariant loads across critical edges.
Oct 4 2022, 1:02 AM · Restricted Project, Restricted Project

Oct 3 2022

critson abandoned D134723: [AMDGPU] Set memory bound occupancy based on addressable VGPRs.
Oct 3 2022, 4:07 AM · Restricted Project, Restricted Project
critson requested review of D135071: [AMDGPU] Implement TargetTransformInfo canAlwaysSinkRead.
Oct 3 2022, 4:06 AM · Restricted Project, Restricted Project
critson requested review of D135070: [TTI][Sink] Add target support for sinking more reads.
Oct 3 2022, 4:04 AM · Restricted Project, Restricted Project

Oct 1 2022

critson committed rGa35013bec68d: [AMDGPU][GFX11] Mitigate VALU mask write hazard (authored by critson).
Oct 1 2022, 12:49 AM · Restricted Project, Restricted Project
critson closed D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.
Oct 1 2022, 12:49 AM · Restricted Project, Restricted Project

Sep 29 2022

critson added a comment to D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.

Beyond DPP which other exotic forms are you looking for?

The ones I can find at the moment are:

V_ADDC_U32_sdwa
V_CNDMASK_B16_sdwa
V_CNDMASK_B32_sdwa
V_SUBBREV_U32_sdwa
V_SUBB_U32_sdwa

Sep 29 2022, 2:53 PM · Restricted Project, Restricted Project
critson updated the diff for D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.
  • Change mitigation strategy to s_waitcnt_depctr after SALU
Sep 29 2022, 1:37 AM · Restricted Project, Restricted Project
critson added a comment to D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.

Sure, although...

There is still a small problem that the list of instructions here is missing exotic forms like V_CNDMASK_B32_dpp.

... this still bothers me.

Sep 29 2022, 1:31 AM · Restricted Project, Restricted Project

Sep 28 2022

critson updated the diff for D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.
  • Add DPP forms and associated tests
Sep 28 2022, 11:52 PM · Restricted Project, Restricted Project
critson added a comment to D134780: [AMDGPU] Add MIMG NSA threshold configuration attribute.

Out of ~20000 pipelines, ~2000 had higher VGPR usage with threshold 2 and ~1000 had lower VGPR usage.

That's weird. I can't see why enabling NSA would consistently cause higher vgpr usage.

FYI I looked at one case where this happens and it was caused by GCNNSAReassign making strange (well, different) decisions. So now I need to try to understand what that pass does.

Sep 28 2022, 8:07 PM · Restricted Project, Restricted Project
critson committed rG266b5dbc5dd4: [AMDGPU] Add MIMG NSA threshold configuration attribute (authored by critson).
Sep 28 2022, 4:04 AM · Restricted Project, Restricted Project
critson closed D134780: [AMDGPU] Add MIMG NSA threshold configuration attribute.
Sep 28 2022, 4:04 AM · Restricted Project, Restricted Project
critson updated the diff for D134780: [AMDGPU] Add MIMG NSA threshold configuration attribute.
  • Address reviewer feedback
Sep 28 2022, 3:39 AM · Restricted Project, Restricted Project
critson added a comment to D134780: [AMDGPU] Add MIMG NSA threshold configuration attribute.

Seems reasonable.

I would also be in favour of changing the default to 2. That would tend to introduce fewer "mov"s to shuffle data around, at the expense of using a larger encoding on average for mimg instructions. But I think that is a good trade-off to make for the sake of performance.

Sep 28 2022, 3:04 AM · Restricted Project, Restricted Project

Sep 27 2022

critson added a comment to D134723: [AMDGPU] Set memory bound occupancy based on addressable VGPRs.

So this was based on dealing with the temporary 128 VGPR limit on GFX11, which has now been removed.
On that basis I will probably abandon this revision; however, I do still have concerns about this magic number 4.

Sep 27 2022, 6:54 PM · Restricted Project, Restricted Project
critson requested review of D134780: [AMDGPU] Add MIMG NSA threshold configuration attribute.
Sep 27 2022, 6:31 PM · Restricted Project, Restricted Project
critson added a comment to D134723: [AMDGPU] Set memory bound occupancy based on addressable VGPRs.

I'm not sure where this is currently tested, so I don't know where to add a new test.
Do any reviewers know?

Sep 27 2022, 2:07 AM · Restricted Project, Restricted Project
critson requested review of D134723: [AMDGPU] Set memory bound occupancy based on addressable VGPRs.
Sep 27 2022, 2:06 AM · Restricted Project, Restricted Project

Sep 20 2022

critson updated the diff for D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.
  • Address reviewer comments
Sep 20 2022, 7:13 PM · Restricted Project, Restricted Project
critson added inline comments to D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.
Sep 20 2022, 7:13 PM · Restricted Project, Restricted Project

Sep 18 2022

critson requested review of D134151: [AMDGPU][GFX11] Mitigate VALU mask write hazard.
Sep 18 2022, 5:26 PM · Restricted Project, Restricted Project
critson committed rG930315f6aa58: [AMDGPU] Fix isSGPRReg for special registers (authored by critson).
Sep 18 2022, 4:50 PM · Restricted Project, Restricted Project
critson closed D134025: [AMDGPU] Fix isSGPRReg for special registers.
Sep 18 2022, 4:50 PM · Restricted Project, Restricted Project

Sep 16 2022

critson added a comment to D134025: [AMDGPU] Fix isSGPRReg for special registers.

What context is passing this through?

Sep 16 2022, 6:33 AM · Restricted Project, Restricted Project
critson requested review of D134025: [AMDGPU] Fix isSGPRReg for special registers.
Sep 16 2022, 1:36 AM · Restricted Project, Restricted Project

Aug 30 2022

critson added a comment to D132447: AMDGPU: Add a pass to fix SGPR liveness.

I think the code is correct, but this needs a MIR test checking the behaviour of the pass.

Aug 30 2022, 1:26 AM · Restricted Project, Restricted Project

Aug 23 2022

critson added inline comments to D132447: AMDGPU: Add a pass to fix SGPR liveness.
Aug 23 2022, 6:04 AM · Restricted Project, Restricted Project

Jul 29 2022

critson committed rG4c4db81630e2: [AMDGPU] Extend SILoadStoreOptimizer to s_load instructions (authored by critson).
Jul 29 2022, 7:42 PM · Restricted Project, Restricted Project
critson closed D130742: [AMDGPU] Extend SILoadStoreOptimizer to s_load instructions.
Jul 29 2022, 7:41 PM · Restricted Project, Restricted Project

Jul 28 2022

critson added a comment to D130742: [AMDGPU] Extend SILoadStoreOptimizer to s_load instructions.

Compilation testing on graphics shader corpus (~10k pipelines) for Navi10.

Jul 28 2022, 11:14 PM · Restricted Project, Restricted Project