Page MenuHomePhabricator

cdevadas (Christudasan Devadasan)
User

Projects

User does not belong to any projects.

User Details

User Since
Apr 29 2019, 11:13 PM (202 w, 6 d)

Recent Activity

Tue, Mar 7

cdevadas committed rG2171f04c121e: [AMDGPU] Extend WorkGroupID* codegen for compute shaders (authored by cdevadas).
[AMDGPU] Extend WorkGroupID* codegen for compute shaders
Tue, Mar 7, 6:22 PM · Restricted Project, Restricted Project
cdevadas closed D145045: [AMDGPU] Extend WorkGroupID* codegen for compute shaders.
Tue, Mar 7, 6:22 PM · Restricted Project, Restricted Project
cdevadas updated the diff for D145045: [AMDGPU] Extend WorkGroupID* codegen for compute shaders.

Split the assertion into smaller chunks.

Tue, Mar 7, 3:33 AM · Restricted Project, Restricted Project

Sun, Mar 5

cdevadas added a comment to D145045: [AMDGPU] Extend WorkGroupID* codegen for compute shaders.

Ping

Sun, Mar 5, 8:41 PM · Restricted Project, Restricted Project

Wed, Mar 1

cdevadas updated the diff for D145045: [AMDGPU] Extend WorkGroupID* codegen for compute shaders.

Extended only for amdgpu_cs and when the subtarget has architected SGPRs enabled.

Wed, Mar 1, 11:02 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D145045: [AMDGPU] Extend WorkGroupID* codegen for compute shaders.

Renamed the testcase.

Wed, Mar 1, 1:47 AM · Restricted Project, Restricted Project
cdevadas requested review of D145045: [AMDGPU] Extend WorkGroupID* codegen for compute shaders.
Wed, Mar 1, 1:27 AM · Restricted Project, Restricted Project

Feb 16 2023

cdevadas added a comment to D143754: [MachineInstr] Introduce generic predicated copy opcode.

The IsPredicable field will always be turned to 1 for Pred_Copy to indicate that it is the Predicated version of the copy.

Feb 16 2023, 10:05 AM · Restricted Project, Restricted Project
cdevadas added a comment to D143753: [MachineInstr] Introduce TII buildCopy helper functions (NFC)..

Hey,

What's the long term plan for this helper?

I am not mad at the refactoring itself, but I'm wondering what kind of criteria we should use to decide when we add such helper.

The real motivation for these buildCopy helper functions is to minimize the additional code changes required while introducing getCopyOpcode() as part of D143754.
One could probably say that "still you could have avoided the buildCopy functions, and instead replace TargetOpcode::COPY with getCopyOpcode()".
I think it would be ok to refactor the code if found an opportunity while working around that code.

For instance, should we do the same thing for PHI, INSERT_SUBREG, etc. and why or why not.
Also, should these helper be virtual or not?

As I said earlier, COPY wasn't a random pick, but rather a code refactor for another patch.

I guess where I am going is what do you intend to simplify/achieve with them?

I felt it was better with buildCopy rather than the direct replacement of TargetOpcode::COPY with getCopyOpcode() in their original BuildMI instances. You tend to agree with that towards the end - "(I actually like the patch)".

The bottom line is without a proper rationale on why we need this helper, I feel that it gets difficult to know which method should be used to construct a copy.
Right now we have BuildMI (and the underlying MachineInstrBuilder object that can be called directly), MachineFunction::CreateMachineInstr, MachineIRBuilder::buildInstr (BTW MachineIRBuilder has its own MachineIRBuilder::buildCopy) and we add a new method here.

I agree. All my buildCopy instances currently return MachineInstr*. I wasn't sure they should return MachineInstrBuilder or any other form as you mentioned. It can be worked out if anyone has a better suggestion.
And about MachineIRBuilder::buildCopy, I feel it is better to retain this very specific instance considering the actual function being called ( the generic MachineIRBuilder::buildInstr) and the operands used which are different from the instances I added.

I believe we need to disambiguate when this one should be used and why.

Admittedly I'm playing the devil's advocate here (I actually like the patch) but I think it is important to flush out the goals and intended usages to be able to better guide future users of this (and the other) API(s).

Cheers,
-Quentin

Feb 16 2023, 8:32 AM · Restricted Project, Restricted Project

Feb 15 2023

cdevadas added a comment to D143754: [MachineInstr] Introduce generic predicated copy opcode.

RFC posted https://discourse.llvm.org/t/rfc-introduce-generic-predicated-copy-opcode/68494

Feb 15 2023, 7:53 AM · Restricted Project, Restricted Project

Feb 13 2023

cdevadas committed rG1c9e6238fe2f: [AMDGPU] Allow architected SGPRs for workgroup IDs (authored by cdevadas).
[AMDGPU] Allow architected SGPRs for workgroup IDs
Feb 13 2023, 8:44 AM · Restricted Project, Restricted Project
cdevadas closed D143707: [AMDGPU] Allow architected SGPRs for workgroup IDs.
Feb 13 2023, 8:44 AM · Restricted Project, Restricted Project
cdevadas committed rGb35d0a6834f8: [AMDGPU] Add switch to enable architected SGPRs. (authored by cdevadas).
[AMDGPU] Add switch to enable architected SGPRs.
Feb 13 2023, 8:41 AM · Restricted Project, Restricted Project
cdevadas closed D143706: [AMDGPU] Add switch to enable architected SGPRs..
Feb 13 2023, 8:41 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D143707: [AMDGPU] Allow architected SGPRs for workgroup IDs.

Added an assertion as per review comment.

Feb 13 2023, 6:39 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D143707: [AMDGPU] Allow architected SGPRs for workgroup IDs.

Adjusted NumRequiredSystemSGPRs before force initializing the 16 input SGPRs.

Feb 13 2023, 1:29 AM · Restricted Project, Restricted Project

Feb 10 2023

cdevadas added a comment to D143754: [MachineInstr] Introduce generic predicated copy opcode.

D143757 in the review chain enables predicated copy for AMDGPU target.
The application of pred-copy is here: D143762.
Testcase llvm/test/CodeGen/AMDGPU/whole-wave-register-copy.ll added along with D124196 demonstrates the need for pred-copy.

Feb 10 2023, 10:05 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.

Rebased after whole-wave copy implementation.

Feb 10 2023, 9:56 AM · Restricted Project, Restricted Project
cdevadas requested review of D143762: [AMDGPU] Enable whole wave register copy.
Feb 10 2023, 9:51 AM · Restricted Project, Restricted Project
cdevadas requested review of D143759: [AMDGPU] Implement whole wave register spill.
Feb 10 2023, 9:30 AM · Restricted Project, Restricted Project
cdevadas requested review of D143758: [CodeGen] MRI call back in TargetMachine.
Feb 10 2023, 9:28 AM · Restricted Project, Restricted Project
cdevadas requested review of D143757: [AMDGPU] Enable predicated copy right from instruction selection.
Feb 10 2023, 9:26 AM · Restricted Project, Restricted Project
cdevadas requested review of D143756: [AMDGPU] Use buildCopy and isCopy helper functions (NFC)..
Feb 10 2023, 9:24 AM · Restricted Project, Restricted Project
cdevadas requested review of D143754: [MachineInstr] Introduce generic predicated copy opcode.
Feb 10 2023, 9:20 AM · Restricted Project, Restricted Project
cdevadas requested review of D143753: [MachineInstr] Introduce TII buildCopy helper functions (NFC)..
Feb 10 2023, 9:17 AM · Restricted Project, Restricted Project
cdevadas requested review of D143752: [MachineInstr] Use isCopy helper function (NFC)..
Feb 10 2023, 9:15 AM · Restricted Project, Restricted Project
cdevadas added reviewers for D143742: [VirtRegMap] Further optimize emitting KILL for copy: qcolombet, MatzeB, stoklund, arsenm.
Feb 10 2023, 9:13 AM · Restricted Project, Restricted Project
cdevadas requested review of D143742: [VirtRegMap] Further optimize emitting KILL for copy.
Feb 10 2023, 8:04 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D143741: [AMDGPU] Add pre-commit test for optimized KILL insertion..
Feb 10 2023, 7:54 AM · Restricted Project, Restricted Project
cdevadas requested review of D143741: [AMDGPU] Add pre-commit test for optimized KILL insertion..
Feb 10 2023, 7:50 AM · Restricted Project, Restricted Project
cdevadas added inline comments to D143707: [AMDGPU] Allow architected SGPRs for workgroup IDs.
Feb 10 2023, 3:11 AM · Restricted Project, Restricted Project
cdevadas added inline comments to D143707: [AMDGPU] Allow architected SGPRs for workgroup IDs.
Feb 10 2023, 2:52 AM · Restricted Project, Restricted Project
cdevadas requested review of D143707: [AMDGPU] Allow architected SGPRs for workgroup IDs.
Feb 10 2023, 12:56 AM · Restricted Project, Restricted Project
cdevadas requested review of D143706: [AMDGPU] Add switch to enable architected SGPRs..
Feb 10 2023, 12:51 AM · Restricted Project, Restricted Project

Dec 23 2022

cdevadas abandoned D134951: [CodeGen][RegAllocFast] Add MRI delegate callback to notify VReg spill.

D138656 handled it better and hence abanding it.

Dec 23 2022, 11:37 PM · Restricted Project, Restricted Project
cdevadas abandoned D138515: [CodeGen][RegAllocFast] Map PhysReg to its current VirtReg.
Dec 23 2022, 11:35 PM · Restricted Project, Restricted Project

Dec 21 2022

cdevadas added a reverting change for rG40ba0942e2ab: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs: rGa3028239a751: Revert "[AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs".
Dec 21 2022, 2:51 AM · Restricted Project, Restricted Project
cdevadas added a reverting change for D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs: rGa3028239a751: Revert "[AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs".
Dec 21 2022, 2:51 AM · Restricted Project, Restricted Project
cdevadas committed rGa3028239a751: Revert "[AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs" (authored by cdevadas).
Revert "[AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs"
Dec 21 2022, 2:50 AM · Restricted Project, Restricted Project

Dec 20 2022

cdevadas added a comment to D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.

This patch causes OpenMC offloaded via OpenMP on AMDGPUs to crash at runtime. It looks like some corruption in the memory address.
You can find build instructions here: https://github.com/jtramm/openmc_offloading_builder

The commit before this one works fine though, assuming you cherry picked https://reviews.llvm.org/rGee1d000d43321590771a2f047c8c55d07d09ad28 first as it landed after.
I assume other codes will be impacted too.

@jtramm @ronlieb @jhuber6 FYI

Dec 20 2022, 12:12 AM · Restricted Project, Restricted Project

Dec 16 2022

cdevadas committed rG40ba0942e2ab: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs (authored by cdevadas).
[AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs
Dec 16 2022, 10:27 PM · Restricted Project, Restricted Project
cdevadas closed D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.
Dec 16 2022, 10:27 PM · Restricted Project, Restricted Project
cdevadas committed rGb5efec4b27bf: [CodeGen] Additional Register argument to… (authored by cdevadas).
[CodeGen] Additional Register argument to…
Dec 16 2022, 10:26 PM · Restricted Project, Restricted Project
cdevadas closed D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.
Dec 16 2022, 10:26 PM · Restricted Project, Restricted Project
cdevadas committed rGce02d5a53956: [CodeGen] Use cloneVirtualRegister in LiveIntervals and LiveRangeEdit (authored by cdevadas).
[CodeGen] Use cloneVirtualRegister in LiveIntervals and LiveRangeEdit
Dec 16 2022, 10:25 PM · Restricted Project, Restricted Project
cdevadas closed D138517: [CodeGen] Use cloneVirtualRegister in LiveIntervals and LiveRangeEdit.
Dec 16 2022, 10:24 PM · Restricted Project, Restricted Project
cdevadas committed rG2f23f5c0d53c: [CodeGen] Use delegate to notify targets when virtual registers are created (authored by cdevadas).
[CodeGen] Use delegate to notify targets when virtual registers are created
Dec 16 2022, 10:24 PM · Restricted Project, Restricted Project
cdevadas closed D134950: [CodeGen] Use delegate to notify targets when virtual registers are created.
Dec 16 2022, 10:23 PM · Restricted Project, Restricted Project
cdevadas committed rG29247824f57e: [AMDGPU][SIFrameLowering] Use the right frame register in CSR spills (authored by cdevadas).
[AMDGPU][SIFrameLowering] Use the right frame register in CSR spills
Dec 16 2022, 10:23 PM · Restricted Project, Restricted Project
cdevadas closed D134949: [AMDGPU][SIFrameLowering] Use the right frame register in CSR spills.
Dec 16 2022, 10:23 PM · Restricted Project, Restricted Project
cdevadas committed rG7a72a9358014: [AMDGPU] Preserve only the inactive lanes of scratch vgprs (authored by cdevadas).
[AMDGPU] Preserve only the inactive lanes of scratch vgprs
Dec 16 2022, 10:22 PM · Restricted Project, Restricted Project
cdevadas closed D134526: [AMDGPU] Preserve only the inactive lanes of scratch vgprs.
Dec 16 2022, 10:22 PM · Restricted Project, Restricted Project
cdevadas committed rG20a940f1e217: [AMDGPU][SIFrameLowering] Unify PEI SGPR spill saves and restores (authored by cdevadas).
[AMDGPU][SIFrameLowering] Unify PEI SGPR spill saves and restores
Dec 16 2022, 10:21 PM · Restricted Project, Restricted Project
cdevadas closed D132436: [AMDGPU][SIFrameLowering] Unify PEI SGPR spill saves and restores.
Dec 16 2022, 10:20 PM · Restricted Project, Restricted Project
cdevadas committed rGb25b4c0ab4ad: [AMDGPU] Separate out SGPR spills to VGPR lanes during PEI (authored by cdevadas).
[AMDGPU] Separate out SGPR spills to VGPR lanes during PEI
Dec 16 2022, 10:20 PM · Restricted Project, Restricted Project
cdevadas closed D124195: [AMDGPU] Separate out SGPR spills to VGPR lanes during PEI.
Dec 16 2022, 10:20 PM · Restricted Project, Restricted Project
cdevadas committed rG5ebe91fcb2a1: [AMDGPU] Correctly set IsKill flag for VGPR spills in the prolog (authored by cdevadas).
[AMDGPU] Correctly set IsKill flag for VGPR spills in the prolog
Dec 16 2022, 10:19 PM · Restricted Project, Restricted Project
cdevadas closed D124194: [AMDGPU] Correctly set IsKill flag for VGPR spills in the prolog.
Dec 16 2022, 10:19 PM · Restricted Project, Restricted Project
cdevadas committed rGaf5e5c40ff73: [AMDGPU] Add WWM reserved VGPRs to WWMSpills (authored by cdevadas).
[AMDGPU] Add WWM reserved VGPRs to WWMSpills
Dec 16 2022, 10:18 PM · Restricted Project, Restricted Project
cdevadas closed D124193: [AMDGPU] Add WWM reserved VGPRs to WWMSpills.
Dec 16 2022, 10:18 PM · Restricted Project, Restricted Project
cdevadas committed rG5692a7e84e12: [AMDGPU] Callee must always spill writelane VGPRs (authored by cdevadas).
[AMDGPU] Callee must always spill writelane VGPRs
Dec 16 2022, 10:17 PM · Restricted Project, Restricted Project
cdevadas closed D124192: [AMDGPU] Callee must always spill writelane VGPRs.
Dec 16 2022, 10:17 PM · Restricted Project, Restricted Project

Dec 15 2022

cdevadas updated the diff for D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.

rebase

Dec 15 2022, 10:37 AM · Restricted Project, Restricted Project
cdevadas committed rG229c466bc8fb: [AMDGPU] Test fixup (authored by cdevadas).
[AMDGPU] Test fixup
Dec 15 2022, 10:07 AM · Restricted Project, Restricted Project

Dec 14 2022

cdevadas added a comment to D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.

I will wait for a couple of days before I upstream it to see if there is any comment from others.

Dec 14 2022, 9:35 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.

Fixed the comment appropriately.

Dec 14 2022, 9:18 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D134949: [AMDGPU][SIFrameLowering] Use the right frame register in CSR spills.

code rebase

Dec 14 2022, 9:14 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D134526: [AMDGPU] Preserve only the inactive lanes of scratch vgprs.

code rebase

Dec 14 2022, 9:11 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D132436: [AMDGPU][SIFrameLowering] Unify PEI SGPR spill saves and restores.

code rebase

Dec 14 2022, 9:07 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D124195: [AMDGPU] Separate out SGPR spills to VGPR lanes during PEI.

code rebase

Dec 14 2022, 9:06 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D124194: [AMDGPU] Correctly set IsKill flag for VGPR spills in the prolog.

code rebase

Dec 14 2022, 9:01 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D124193: [AMDGPU] Add WWM reserved VGPRs to WWMSpills.

Code rebase

Dec 14 2022, 8:59 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D124192: [AMDGPU] Callee must always spill writelane VGPRs.

Rebased

Dec 14 2022, 8:53 AM · Restricted Project, Restricted Project
cdevadas added inline comments to D139433: [amdgpu] Reimplement LDS lowering.
Dec 14 2022, 8:34 AM · Restricted Project, Restricted Project

Dec 12 2022

cdevadas updated the diff for D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.

Addressed the review comment.

Dec 12 2022, 1:00 AM · Restricted Project, Restricted Project

Dec 8 2022

cdevadas added a comment to D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.

Ping

Dec 8 2022, 10:19 AM · Restricted Project, Restricted Project

Dec 5 2022

cdevadas added a comment to D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.

Ping. Much appreciate if this can be reviewed.

Dec 5 2022, 9:21 AM · Restricted Project, Restricted Project

Nov 29 2022

cdevadas added a comment to D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.

Ping

Nov 29 2022, 8:26 PM · Restricted Project, Restricted Project

Nov 24 2022

cdevadas added a comment to D138515: [CodeGen][RegAllocFast] Map PhysReg to its current VirtReg.

Posted D138656 to pass the virtual register as an additional argument to the spiller interface. This review can be abandoned once D138656 gets approved.

Nov 24 2022, 5:52 AM · Restricted Project, Restricted Project
cdevadas added a comment to D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.

This is a better and a stable approach than D138515.
The use case of the extra argument can be found in D124196. See the implementations SIInstrInfo::storeRegToStackSlot and SIInstrInfo::loadRegFromStackSlot.

Nov 24 2022, 5:50 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.

Implemented the WWM spill during RegAllocFast using the additional argument to the spiller interface introduced with patch D138656.

Nov 24 2022, 5:02 AM · Restricted Project, Restricted Project
cdevadas requested review of D138656: [CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot.
Nov 24 2022, 4:30 AM · Restricted Project, Restricted Project

Nov 23 2022

cdevadas added a comment to D138515: [CodeGen][RegAllocFast] Map PhysReg to its current VirtReg.

The idea of mapping a physical register into the current virtual register in RegAllocFast would be likely a risky choice with a global variable. The same was true for the initial patch I posted D134951 which does the same thing specifically around the spill and reload functions via. the delegate.
The effort was to identify the physical register allocated for some special machine operands (whole-wave vector registers for AMDGPU), especially inside storeRegToStackSlot & loadRegFromStackSlot spill interfaces called from fast regalloc that directly spills the physRegs.
The better approach would be by passing an additional parameter (the virtual register) to the spiller interface. The interface is common to Greedy, PEI, and to some other places that have no use for this additional parameter. Still, I think it is the right way to handle the problem.

Nov 23 2022, 11:55 AM · Restricted Project, Restricted Project
cdevadas added a comment to D138515: [CodeGen][RegAllocFast] Map PhysReg to its current VirtReg.

See the Usecase of PhysToCurrentVirtReg in D124196 (look for getPhysToCurrentVirtReg)

Nov 23 2022, 9:50 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D138515: [CodeGen][RegAllocFast] Map PhysReg to its current VirtReg.

PhysToCurrentVirtReg should also be updated while reloading a physical register from the spill location. It happens when a RegMask (for instance, a function call) clobbers a register that is used in an instruction later in the BB. Also, at a BB entry, the LiveIn registers are reloaded. The reload point begins a new live range and it should be properly reflected in the mapping.

Nov 23 2022, 9:47 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.

Rebase + Incorporated changes after D138515 to move the handling of physReg to current VirtReg mapping entirely into the generic design.

Nov 23 2022, 9:25 AM · Restricted Project, Restricted Project

Nov 22 2022

cdevadas added a comment to D138515: [CodeGen][RegAllocFast] Map PhysReg to its current VirtReg.

This should replace the patch D134951 which used the delegate method for a specific purpose.
This is a more generic approach for targets to help identify the exact live range (virtual register) a physical register was currently assigned for.
While the allocation is in progress, this would serve the purpose of mapping a physical register to its virtual register.

Nov 22 2022, 12:27 PM · Restricted Project, Restricted Project
cdevadas requested review of D138517: [CodeGen] Use cloneVirtualRegister in LiveIntervals and LiveRangeEdit.
Nov 22 2022, 11:26 AM · Restricted Project, Restricted Project
cdevadas requested review of D138515: [CodeGen][RegAllocFast] Map PhysReg to its current VirtReg.
Nov 22 2022, 11:05 AM · Restricted Project, Restricted Project

Nov 21 2022

cdevadas added a comment to D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.

Ping

Nov 21 2022, 3:13 AM · Restricted Project, Restricted Project

Nov 15 2022

cdevadas committed rG349b6afef47b: [AMDGPU] Remove the assertion for MUBUF instruction with voffset (authored by cdevadas).
[AMDGPU] Remove the assertion for MUBUF instruction with voffset
Nov 15 2022, 10:40 PM · Restricted Project, Restricted Project
cdevadas closed D137892: [AMDGPU] Remove the assertion for MUBUF instruction with voffset.
Nov 15 2022, 10:40 PM · Restricted Project, Restricted Project
cdevadas updated the diff for D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.

Rebase + Suggestions incorporated.

Nov 15 2022, 10:44 AM · Restricted Project, Restricted Project
cdevadas added inline comments to D124196: [AMDGPU][SILowerSGPRSpills] Spill SGPRs to virtual VGPRs.
Nov 15 2022, 10:30 AM · Restricted Project, Restricted Project
cdevadas updated the diff for D134949: [AMDGPU][SIFrameLowering] Use the right frame register in CSR spills.

Removed the IsIgnored flag and special-handled FP spill & restore cases.

Nov 15 2022, 4:46 AM · Restricted Project, Restricted Project
cdevadas added a comment to D134949: [AMDGPU][SIFrameLowering] Use the right frame register in CSR spills.

For the downstream case, can we just erase the entry in the map after emitting the spill?

Can do that.

Nov 15 2022, 3:52 AM · Restricted Project, Restricted Project

Nov 14 2022

cdevadas added inline comments to D124195: [AMDGPU] Separate out SGPR spills to VGPR lanes during PEI.
Nov 14 2022, 11:06 PM · Restricted Project, Restricted Project
cdevadas updated the diff for D124195: [AMDGPU] Separate out SGPR spills to VGPR lanes during PEI.

Renamed IsPEI to IsPrologEpilog & removed the unwanted comment.

Nov 14 2022, 10:14 PM · Restricted Project, Restricted Project
cdevadas added inline comments to D124195: [AMDGPU] Separate out SGPR spills to VGPR lanes during PEI.
Nov 14 2022, 10:08 PM · Restricted Project, Restricted Project
cdevadas updated the diff for D137892: [AMDGPU] Remove the assertion for MUBUF instruction with voffset.

Fixed a typo in the IR test file name.

Nov 14 2022, 4:00 AM · Restricted Project, Restricted Project