Page MenuHomePhabricator

kerbowa (Austin Kerbow)
User

Projects

User does not belong to any projects.

User Details

User Since
Dec 31 2018, 12:07 PM (128 w, 3 d)

Recent Activity

May 14 2021

kerbowa requested review of D102536: [AMDGPU] Rename MUBUF_Invalidate to MUBUF_CacheControl.
May 14 2021, 2:32 PM · Restricted Project

May 11 2021

kerbowa committed rG4433f4601e8a: [AMDGPU] Fix extra waitcnt being added with BUFFER_INVL2 (authored by kerbowa).
[AMDGPU] Fix extra waitcnt being added with BUFFER_INVL2
May 11 2021, 1:19 PM
kerbowa closed D102252: [AMDGPU] Fix extra waitcnt being added with BUFFER_INVL2.
May 11 2021, 1:19 PM · Restricted Project
kerbowa requested review of D102252: [AMDGPU] Fix extra waitcnt being added with BUFFER_INVL2.
May 11 2021, 9:46 AM · Restricted Project

May 6 2021

kerbowa committed rG172d746e167b: [AMDGPU][NFC] Fix typos in SIFormMemoryClauses description (authored by kerbowa).
[AMDGPU][NFC] Fix typos in SIFormMemoryClauses description
May 6 2021, 7:49 AM

May 5 2021

kerbowa committed rG6617a5a5eaee: [AMDGPU] Move insertion of function entry waitcnt later (authored by kerbowa).
[AMDGPU] Move insertion of function entry waitcnt later
May 5 2021, 5:59 PM
kerbowa closed D101380: [AMDGPU] Move insertion of function entry waitcnt later.
May 5 2021, 5:59 PM · Restricted Project
kerbowa committed rGf5199d7ae0ed: [AMDGPU] Revise handling of preexisting waitcnt (authored by kerbowa).
[AMDGPU] Revise handling of preexisting waitcnt
May 5 2021, 5:22 PM
kerbowa closed D100281: [AMDGPU] Revise handling of preexisting waitcnt.
May 5 2021, 5:21 PM · Restricted Project

May 2 2021

kerbowa added inline comments to D101380: [AMDGPU] Move insertion of function entry waitcnt later.
May 2 2021, 2:16 PM · Restricted Project
kerbowa updated the diff for D101380: [AMDGPU] Move insertion of function entry waitcnt later.

Address comments.

May 2 2021, 2:06 PM · Restricted Project
kerbowa updated the diff for D100281: [AMDGPU] Revise handling of preexisting waitcnt.

Simplify conditional and add comment. Add test with KILL.

May 2 2021, 1:35 PM · Restricted Project

Apr 27 2021

kerbowa requested review of D101380: [AMDGPU] Move insertion of function entry waitcnt later.
Apr 27 2021, 9:39 AM · Restricted Project
kerbowa updated the diff for D100281: [AMDGPU] Revise handling of preexisting waitcnt.

Use isMetaInstruction; move func entry waitcnt change to separate patch.

Apr 27 2021, 9:22 AM · Restricted Project

Apr 12 2021

kerbowa added inline comments to D100281: [AMDGPU] Revise handling of preexisting waitcnt.
Apr 12 2021, 11:54 AM · Restricted Project
kerbowa added inline comments to D100281: [AMDGPU] Revise handling of preexisting waitcnt.
Apr 12 2021, 8:12 AM · Restricted Project

Apr 11 2021

kerbowa requested review of D100281: [AMDGPU] Revise handling of preexisting waitcnt.
Apr 11 2021, 9:28 PM · Restricted Project

Mar 19 2021

kerbowa added a comment to D98940: [AMDGPU] Allow index optimisation in SIPreEmitPeephole for bundles.

It seems like there is an issue with defining exactly which bundles should be allowed to be modified and what passes should be allowed to modify them. There is already a pass that will do unbundling and that seems like the best place to do that if we enabled it.

Mar 19 2021, 10:21 AM · Restricted Project

Feb 9 2021

kerbowa added a comment to D96336: [AMDGPU] Save VGPR of whole wave when spilling.

If every SGPR spill that goes to scratch has to do an extra store+load (or multiple) then is that not potentially worse than the performance hit of reserving an entire VGPR for spilling in the case that we know we are going to have to use one? (I guess perhaps we have no way of knowing we need one?)

Feb 9 2021, 10:35 PM · Restricted Project

Feb 1 2021

kerbowa committed rGe068e236c3b8: [AMDGPU] Fix release build after 0397dca0. (authored by kerbowa).
[AMDGPU] Fix release build after 0397dca0.
Feb 1 2021, 8:57 AM
kerbowa committed rG0397dca0214a: [AMDGPU] Fix crash with sgpr spills to vgpr disabled (authored by kerbowa).
[AMDGPU] Fix crash with sgpr spills to vgpr disabled
Feb 1 2021, 8:42 AM
kerbowa closed D95768: [AMDGPU] Fix crash with sgpr spills to vgpr disabled.
Feb 1 2021, 8:42 AM · Restricted Project

Jan 31 2021

kerbowa requested review of D95768: [AMDGPU] Fix crash with sgpr spills to vgpr disabled.
Jan 31 2021, 3:56 PM · Restricted Project

Jan 26 2021

kerbowa committed rG2291bd137d12: [AMDGPU] Update subtarget features for new target ID support (authored by kerbowa).
[AMDGPU] Update subtarget features for new target ID support
Jan 26 2021, 11:30 AM
kerbowa closed D85882: [AMDGPU] Update subtarget features for new target ID support.
Jan 26 2021, 11:29 AM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Rebase.

Jan 26 2021, 11:18 AM · Restricted Project

Jan 23 2021

kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Rebase. Address comments.

Jan 23 2021, 12:45 AM · Restricted Project

Jan 21 2021

kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Jan 21 2021, 11:18 PM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Rebase after 560d7e04113bf.

Jan 21 2021, 11:07 PM · Restricted Project

Jan 17 2021

kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Jan 17 2021, 10:39 PM · Restricted Project
kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Jan 17 2021, 1:54 PM · Restricted Project
kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Jan 17 2021, 12:52 PM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Don't use Optional type for TargetID.

Jan 17 2021, 11:55 AM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Rebase.

Jan 17 2021, 1:07 AM · Restricted Project

Jan 4 2021

kerbowa added a comment to D93715: AMDGPU: Don't fold AGPR copy pairs that need a temp VGPR.

I'm working on eliminating some AGPR->VGPR->AGPR copies which commonly cause problems in loops where the VGPR is considered live across iterations by the register allocator even if the VGPR copy should be sunk. Why do we need this temp VGPR?

Jan 4 2021, 8:43 AM · Restricted Project

Dec 8 2020

kerbowa committed rG4aa842a800b5: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing (authored by kerbowa).
[AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing
Dec 8 2020, 12:25 PM
kerbowa closed D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.
Dec 8 2020, 12:24 PM · Restricted Project
kerbowa updated the diff for D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

Add M0 to defs.

Dec 8 2020, 12:08 PM · Restricted Project

Dec 7 2020

kerbowa added a comment to D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

JBTW, having a need to def M0 probably defeats the idea of potential rescheduling which justifies the separation of the indirect access methods.

Dec 7 2020, 8:34 PM · Restricted Project

Dec 6 2020

kerbowa added a comment to D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

ping

Dec 6 2020, 1:08 PM · Restricted Project

Nov 30 2020

kerbowa updated the diff for D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

Fix test.

Nov 30 2020, 10:22 PM · Restricted Project
kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Nov 30 2020, 11:34 AM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Rebase. Move some utility functions to header.

Nov 30 2020, 11:21 AM · Restricted Project

Nov 29 2020

kerbowa added a comment to D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

EPSDB is passing.

Nov 29 2020, 12:29 AM · Restricted Project
kerbowa updated the diff for D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

Address comments.

Nov 29 2020, 12:27 AM · Restricted Project

Nov 9 2020

kerbowa added a comment to D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

Is there a practical reason to have different pseudos for set_idx and movrel methods and not just expand a common pseudo accordingly post RA?

It's a good question but in the end, I chose not to since m0 initializations can be rescheduled, and combing the pseudo's would disallow this. The other issue is that the movrel pseudo includes the scalar version which is used on targets with VGPR indexing as well. I'm open to trying it if you think it seems more practical.

Both methods use M0, aren't they?

Nov 9 2020, 7:39 PM · Restricted Project
kerbowa added a comment to D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

Is there a practical reason to have different pseudos for set_idx and movrel methods and not just expand a common pseudo accordingly post RA?

Nov 9 2020, 12:20 PM · Restricted Project

Nov 8 2020

kerbowa updated the diff for D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.

Remove dead code.

Nov 8 2020, 11:40 PM · Restricted Project
kerbowa requested review of D91048: [AMDGPU] Add new pseudos for indirect addressing with VGPR Indexing.
Nov 8 2020, 11:28 PM · Restricted Project

Oct 28 2020

kerbowa committed rGde518673436e: [AMDGPU] Add Reset function to GCNHazardRecognizer (authored by kerbowa).
[AMDGPU] Add Reset function to GCNHazardRecognizer
Oct 28 2020, 4:33 PM
kerbowa closed D90347: [AMDGPU] Add Reset function to GCNHazardRecognizer.
Oct 28 2020, 4:33 PM · Restricted Project
kerbowa requested review of D90347: [AMDGPU] Add Reset function to GCNHazardRecognizer.
Oct 28 2020, 3:13 PM · Restricted Project
kerbowa committed rG8b127a8661a1: [AMDGPU] Fix inserting combined s_nop in bundles (authored by kerbowa).
[AMDGPU] Fix inserting combined s_nop in bundles
Oct 28 2020, 2:35 PM
kerbowa closed D90334: [AMDGPU] Fix inserting combined s_nop in bundles.
Oct 28 2020, 2:34 PM · Restricted Project
kerbowa added inline comments to D90334: [AMDGPU] Fix inserting combined s_nop in bundles.
Oct 28 2020, 2:16 PM · Restricted Project
kerbowa added inline comments to D90334: [AMDGPU] Fix inserting combined s_nop in bundles.
Oct 28 2020, 1:41 PM · Restricted Project
kerbowa requested review of D90334: [AMDGPU] Fix inserting combined s_nop in bundles.
Oct 28 2020, 12:23 PM · Restricted Project

Oct 26 2020

kerbowa updated the summary of D85882: [AMDGPU] Update subtarget features for new target ID support.
Oct 26 2020, 11:17 PM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Update, rebase, and merge with change of default patch.

Oct 26 2020, 11:16 PM · Restricted Project

Oct 20 2020

kerbowa committed rGebdcef20ce29: [AMDGPU] Avoid inserting noops during scheduling (authored by kerbowa).
[AMDGPU] Avoid inserting noops during scheduling
Oct 20 2020, 5:12 PM
kerbowa closed D89754: [AMDGPU] Avoid inserting noops during scheduling.
Oct 20 2020, 5:12 PM · Restricted Project
kerbowa committed rG37d907899f49: [HazardRec] Allow inserting multiple wait-states simultaneously (authored by kerbowa).
[HazardRec] Allow inserting multiple wait-states simultaneously
Oct 20 2020, 5:06 PM
kerbowa closed D89753: [HazardRec] Allow inserting multiple wait-states simultaneously.
Oct 20 2020, 5:06 PM · Restricted Project
kerbowa added a comment to D89753: [HazardRec] Allow inserting multiple wait-states simultaneously.

Do we properly count number of pre-existing wait states if we have s_nop > 0?

Oct 20 2020, 11:52 AM · Restricted Project
kerbowa updated the diff for D89753: [HazardRec] Allow inserting multiple wait-states simultaneously.

Address comments.

Oct 20 2020, 11:42 AM · Restricted Project
kerbowa added inline comments to D89754: [AMDGPU] Avoid inserting noops during scheduling.
Oct 20 2020, 10:01 AM · Restricted Project

Oct 19 2020

kerbowa requested review of D89754: [AMDGPU] Avoid inserting noops during scheduling.
Oct 19 2020, 5:14 PM · Restricted Project
kerbowa requested review of D89753: [HazardRec] Allow inserting multiple wait-states simultaneously.
Oct 19 2020, 5:11 PM · Restricted Project
kerbowa abandoned D75030: [AMDGPU] Use conservative defaults for XNACK/SRAM ECC.
Oct 19 2020, 9:36 AM · Restricted Project
kerbowa abandoned D86042: [AMDGPU] Use correct defaults for xnack and sramecc.
Oct 19 2020, 9:35 AM · Restricted Project

Oct 16 2020

kerbowa committed rG978fbd8268ce: [AMDGPU] Run hazard recognizer pass later (authored by kerbowa).
[AMDGPU] Run hazard recognizer pass later
Oct 16 2020, 12:16 PM
kerbowa closed D89077: [AMDGPU] Run hazard recognizer pass later.
Oct 16 2020, 12:16 PM · Restricted Project

Oct 11 2020

kerbowa updated the diff for D89077: [AMDGPU] Run hazard recognizer pass later.

Add test.

Oct 11 2020, 10:28 PM · Restricted Project

Oct 8 2020

kerbowa added a comment to D89077: [AMDGPU] Run hazard recognizer pass later.

Is this now running after the waitcnt insertion pass? That would avoid the NOPs currently being inserted to split memory clauses that are not necessary as the waitcnt instructions will split the clauses.

Oct 8 2020, 5:42 PM · Restricted Project
kerbowa added a comment to D89077: [AMDGPU] Run hazard recognizer pass later.

Is this now running after the waitcnt insertion pass? That would avoid the NOPs currently being inserted to split memory clauses that are not necessary as the waitcnt instructions will split the clauses.

Oct 8 2020, 5:41 PM · Restricted Project
kerbowa committed rGa4f35ab23281: [AMDGPU] Fix mai hazard VALU to LD/ST (authored by kerbowa).
[AMDGPU] Fix mai hazard VALU to LD/ST
Oct 8 2020, 5:13 PM
kerbowa closed D89079: [AMDGPU] Fix mai hazard VALU to LD/ST.
Oct 8 2020, 5:13 PM · Restricted Project
kerbowa updated the diff for D89079: [AMDGPU] Fix mai hazard VALU to LD/ST.

Wt -> Wr

Oct 8 2020, 4:50 PM · Restricted Project
kerbowa requested review of D89079: [AMDGPU] Fix mai hazard VALU to LD/ST.
Oct 8 2020, 4:38 PM · Restricted Project
kerbowa requested review of D89077: [AMDGPU] Run hazard recognizer pass later.
Oct 8 2020, 4:10 PM · Restricted Project

Sep 16 2020

kerbowa added inline comments to D87621: [AMDGPU] Add XDL resource to scheduling model.
Sep 16 2020, 8:13 PM · Restricted Project

Sep 14 2020

kerbowa committed rGf859c30ecbbb: [AMDGPU] Add XDL resource to scheduling model (authored by kerbowa).
[AMDGPU] Add XDL resource to scheduling model
Sep 14 2020, 1:49 PM
kerbowa closed D87621: [AMDGPU] Add XDL resource to scheduling model.
Sep 14 2020, 1:49 PM · Restricted Project
kerbowa requested review of D87621: [AMDGPU] Add XDL resource to scheduling model.
Sep 14 2020, 10:58 AM · Restricted Project

Aug 16 2020

kerbowa requested review of D86042: [AMDGPU] Use correct defaults for xnack and sramecc.
Aug 16 2020, 2:21 PM · Restricted Project

Aug 14 2020

kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Aug 14 2020, 8:50 AM · Restricted Project

Aug 13 2020

kerbowa updated the summary of D85882: [AMDGPU] Update subtarget features for new target ID support.
Aug 13 2020, 10:51 PM · Restricted Project
kerbowa updated the summary of D85882: [AMDGPU] Update subtarget features for new target ID support.
Aug 13 2020, 7:24 PM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Rename "Default" to "Any". Address comments.

Aug 13 2020, 7:21 PM · Restricted Project
kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Aug 13 2020, 4:21 PM · Restricted Project
kerbowa committed rG7d1cb187fbee: [AMDGPU] Fix FP/BP spills when MUBUF constant offset exceeded (authored by kerbowa).
[AMDGPU] Fix FP/BP spills when MUBUF constant offset exceeded
Aug 13 2020, 2:12 PM
kerbowa closed D85772: [AMDGPU] Fix FP/BP spills when MUBUF constant offset exceeded.
Aug 13 2020, 2:12 PM · Restricted Project
kerbowa updated the diff for D85882: [AMDGPU] Update subtarget features for new target ID support.

Re-add DoesNotSupport features. Use enum class. No fatal error on setting mismatch.

Aug 13 2020, 1:53 PM · Restricted Project
kerbowa updated the diff for D85772: [AMDGPU] Fix FP/BP spills when MUBUF constant offset exceeded.

Fix typo.

Aug 13 2020, 11:08 AM · Restricted Project
kerbowa added a reviewer for D85772: [AMDGPU] Fix FP/BP spills when MUBUF constant offset exceeded: hsmhsm.
Aug 13 2020, 10:53 AM · Restricted Project
kerbowa updated the diff for D85772: [AMDGPU] Fix FP/BP spills when MUBUF constant offset exceeded.

As discussed offline with Mahesha, this and https://reviews.llvm.org/D85767 are fixing the same issue, we decided I would take the fix forward.

Aug 13 2020, 10:52 AM · Restricted Project
kerbowa added inline comments to D85882: [AMDGPU] Update subtarget features for new target ID support.
Aug 13 2020, 8:39 AM · Restricted Project
kerbowa requested review of D85882: [AMDGPU] Update subtarget features for new target ID support.
Aug 13 2020, 12:21 AM · Restricted Project

Aug 11 2020

kerbowa requested review of D85772: [AMDGPU] Fix FP/BP spills when MUBUF constant offset exceeded.
Aug 11 2020, 12:51 PM · Restricted Project

Aug 7 2020

kerbowa committed rG04bd5b528620: [AMDGPU] Fix not rescheduling without clustering (authored by vangthao95).
[AMDGPU] Fix not rescheduling without clustering
Aug 7 2020, 11:18 AM