tstellarAMD (Tom Stellard)
Disabled

Projects

User does not belong to any projects.

User Details

User Since
Dec 3 2014, 1:14 PM (433 w, 4 d)
Roles
Disabled

Recent Activity

Feb 27 2020

GitHub <noreply@github.com> committed rGc34f93cb415d: Merge pull request #38 from yxsamliu/addrcast5 (authored by tstellarAMD).
Merge pull request #38 from yxsamliu/addrcast5
Feb 27 2020, 3:36 AM
tstellarAMD committed rG6baedc3f128b: Merge branch 'amd-master' of into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' of into amd-common
Feb 27 2020, 3:35 AM
tstellarAMD committed rG0e8fdb31ca56: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 27 2020, 3:34 AM
tstellarAMD committed rGd06cef3ec6b6: AMDGPU/SI: Don't emit buffer_wbinvl1_vol on SI (authored by tstellarAMD).
AMDGPU/SI: Don't emit buffer_wbinvl1_vol on SI
Feb 27 2020, 3:34 AM
tstellarAMD committed rG1bfee011e794: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 27 2020, 3:30 AM
tstellarAMD committed rG2c933ed880af: AMDGPU: Fix lit test broken by previous commit (authored by tstellarAMD).
AMDGPU: Fix lit test broken by previous commit
Feb 27 2020, 3:25 AM
GitHub <noreply@github.com> committed rG8244ae74a85f: Merge pull request #23 from alex-t/amd-common (authored by tstellarAMD).
Merge pull request #23 from alex-t/amd-common
Feb 27 2020, 3:25 AM
GitHub <noreply@github.com> committed rG0651842fd97e: Merge pull request #22 from RadeonOpenCompute/amd-common-long-branch (authored by tstellarAMD).
Merge pull request #22 from RadeonOpenCompute/amd-common-long-branch
Feb 27 2020, 3:23 AM
tstellarAMD committed rGdac126b84afc: AMDGPU/SI: Emit long branch when branch size is >= 15 bits (authored by tstellarAMD).
AMDGPU/SI: Emit long branch when branch size is >= 15 bits
Feb 27 2020, 3:23 AM
GitHub <noreply@github.com> committed rG4f77e98ae350: Merge pull request #12 from RadeonOpenCompute/addrspacecast-fixes (authored by tstellarAMD).
Merge pull request #12 from RadeonOpenCompute/addrspacecast-fixes
Feb 27 2020, 3:19 AM
tstellarAMD committed rG6da97268804d: [Loads] Fix crash in is isDereferenceableAndAlignedPointer() (authored by tstellarAMD).
[Loads] Fix crash in is isDereferenceableAndAlignedPointer()
Feb 27 2020, 3:19 AM
tstellarAMD committed rG9966a4033e1b: [ValueTracking] Fix crash in GetPointerBaseWithConstantOffset() (authored by tstellarAMD).
[ValueTracking] Fix crash in GetPointerBaseWithConstantOffset()
Feb 27 2020, 3:19 AM
GitHub <noreply@github.com> committed rG56460b2e8322: Merge pull request #8 from kzhuravl/memory-model-fixes (authored by tstellarAMD).
Merge pull request #8 from kzhuravl/memory-model-fixes
Feb 27 2020, 3:16 AM
GitHub <noreply@github.com> committed rG51f36424c298: Merge pull request #9 from yxsamliu/fix-atomic (authored by tstellarAMD).
Merge pull request #9 from yxsamliu/fix-atomic
Feb 27 2020, 3:16 AM
tstellarAMD committed rGcd1c5b637611: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 27 2020, 3:15 AM
GitHub <noreply@github.com> committed rG5f5b0c34b99e: Merge pull request #10 from kzhuravl/memory-model (authored by tstellarAMD).
Merge pull request #10 from kzhuravl/memory-model
Feb 27 2020, 3:14 AM
tstellarAMD committed rG490571d4b97f: Merge 'amd-master' into 'amd-common' (authored by tstellarAMD).
Merge 'amd-master' into 'amd-common'
Feb 27 2020, 3:12 AM
GitHub <noreply@github.com> committed rGe2d25cb00374: Merge pull request #5 from kzhuravl/synchscopes (authored by tstellarAMD).
Merge pull request #5 from kzhuravl/synchscopes
Feb 27 2020, 3:11 AM
GitHub <noreply@github.com> committed rGef20578e7d0c: Merge pull request #4 from changpeng/mimg-intrinsics (authored by tstellarAMD).
Merge pull request #4 from changpeng/mimg-intrinsics
Feb 27 2020, 3:11 AM
tstellarAMD committed rG1ab793bea333: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 27 2020, 3:10 AM
tstellarAMD committed rGe275d85ba14d: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 27 2020, 3:10 AM

Feb 26 2020

GitHub <noreply@github.com> committed rGa4b676d52144: Merge pull request #20 from yxsamliu/null9 (authored by tstellarAMD).
Merge pull request #20 from yxsamliu/null9
Feb 26 2020, 11:22 PM
tstellarAMD committed rG3cdc6f757705: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 26 2020, 11:21 PM
tstellarAMD committed rG5f10b2c2ee7c: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 26 2020, 11:21 PM
tstellarAMD committed rG88f4fe6de69a: Merge 'amd-master' into amd-common (authored by tstellarAMD).
Merge 'amd-master' into amd-common
Feb 26 2020, 11:19 PM
GitHub <noreply@github.com> committed rG1fdb9ed60f85: Merge pull request #16 from yxsamliu/conv3 (authored by tstellarAMD).
Merge pull request #16 from yxsamliu/conv3
Feb 26 2020, 11:17 PM
tstellarAMD committed rG69cd0483059f: Revert "Do not validate pch when -fno-validate-pch is set." (authored by tstellarAMD).
Revert "Do not validate pch when -fno-validate-pch is set."
Feb 26 2020, 11:09 PM
GitHub <noreply@github.com> committed rGe5476175eb42: Merge pull request #10 from RadeonOpenCompute/amd-common-fix-pch (authored by tstellarAMD).
Merge pull request #10 from RadeonOpenCompute/amd-common-fix-pch
Feb 26 2020, 11:09 PM
tstellarAMD committed rG880138bea657: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 26 2020, 11:08 PM
tstellarAMD committed rG535ba5a074ed: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 26 2020, 11:08 PM
tstellarAMD committed rG4952cacfd76e: Merge remote-tracking branch 'amd/amd-master' into amd-common (authored by tstellarAMD).
Merge remote-tracking branch 'amd/amd-master' into amd-common
Feb 26 2020, 11:08 PM
tstellarAMD committed rG80afb2f9ba51: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 26 2020, 11:07 PM
tstellarAMD committed rG5db19dc58240: Merge branch 'amd-master' into amd-common (authored by tstellarAMD).
Merge branch 'amd-master' into amd-common
Feb 26 2020, 11:07 PM

Feb 3 2017

tstellarAMD added a comment to D24930: libclc: Remove code that is not licensed under either the UIUC or MIT license.

Why is this necessary? Is the explicit mention of "merge ... sublicense and/or sell" the dealbreaker?

The license is not an approved license for LLVM projects. If we want to add it back, we need to ask for an exception for this license from the LLVM foundation.

Is there a description of how to do that? Has the process been started?
Is there an alternative implementation with a compatible license?

Feb 3 2017, 11:53 AM
tstellarAMD added inline comments to D29473: [AMDGPU] Unroll preferences improvements.
Feb 3 2017, 8:33 AM
tstellarAMD added inline comments to D29473: [AMDGPU] Unroll preferences improvements.
Feb 3 2017, 6:39 AM
tstellarAMD added a comment to D24930: libclc: Remove code that is not licensed under either the UIUC or MIT license.

Why is this necessary? Is the explicit mention of "merge ... sublicense and/or sell" the dealbreaker?

Feb 3 2017, 6:07 AM

Feb 2 2017

tstellarAMD added a comment to D20116: Add speculatable function attribute.

LGTM. But let's wait a little to give @sanjoy or @chandlerc a chance to comment if they feel the need.

I repeat here one of the important earlier comments from @tstellarAMD, since it is not in the description and is easy to miss:

I added two new Intrinsic attributes IntrNoSideEffects and IntrHasSideEffects,
which make it possible to specify all the possible memory interaction / side effect
combinations. With these properties in place, it should be possible in the future
to drop the 'no side effect' portion of the intrinsic memory properties once targets
have been updated to use these new properties.
Feb 2 2017, 12:19 PM
tstellarAMD updated the diff for D24929: libclc: Remove copyright headers from top of files.

Add some build fixes from awatry.

Feb 2 2017, 9:53 AM
tstellarAMD updated the diff for D24930: libclc: Remove code that is not licensed under either the UIUC or MIT license.

Also remove tgamma which calls lgamma.

Feb 2 2017, 9:51 AM
tstellarAMD updated the diff for D29452: AMDGPU/SI: DAGMutation for removing deps between CSEs.

Correctly remove predecessors and clean up some of the loops.

Feb 2 2017, 9:15 AM
tstellarAMD created D29452: AMDGPU/SI: DAGMutation for removing deps between CSEs.
Feb 2 2017, 7:07 AM

Feb 1 2017

tstellarAMD accepted D20552: AMDGPU: Allow clustering flat memory operations.
Feb 1 2017, 10:59 AM
tstellarAMD added a comment to D28993: AMDGPU: Try to select SMEM opcodes for llvm.amdgcn.buffer.load.

Another possible issue is that SMEM instructions ignore bits of the resource descriptor. So you would need some way to tell the compiler that it wouldn't be ignoring some relevant resource bits by selecting to SMEM.

Feb 1 2017, 10:16 AM
tstellarAMD added a comment to D27586: AMDGPU/SI: Add llvm.amdgcn.s.buffer.load intrinsic.

Would you please describe the purpose of this patch? It's not obvious why it's useful.

Feb 1 2017, 9:59 AM
tstellarAMD updated the diff for D20116: Add speculatable function attribute.

Update definition in LangRef.

Feb 1 2017, 9:35 AM
tstellarAMD added a comment to D27586: AMDGPU/SI: Add llvm.amdgcn.s.buffer.load intrinsic.

I haven't looked in too much detail yet. I assume getelementptr doesn't work with these pointers, so it would be good to have a negative test which ensures that GEP use fails.

Feb 1 2017, 3:56 AM

Jan 31 2017

tstellarAMD added a comment to D28993: AMDGPU: Try to select SMEM opcodes for llvm.amdgcn.buffer.load.

What if something else has written to the buffer in the same shader? That would make using smem instructions illegal.

Jan 31 2017, 5:37 PM
tstellarAMD abandoned D6820: Add LLVMSetTargetMachineMCOption() to the C API.
Jan 31 2017, 5:36 PM
tstellarAMD added a comment to D27586: AMDGPU/SI: Add llvm.amdgcn.s.buffer.load intrinsic.

How is this different from using amdgcn.buffer.load if D28993 lands (which is not certain)?

I don't think it's legal to select amdgcn.buffer.load to SMRD unless you can prove that it is uniform. llvm.amdgcn.s.buffer.load is known to always be uniform.

Jan 31 2017, 4:54 PM
tstellarAMD added a comment to D27586: AMDGPU/SI: Add llvm.amdgcn.s.buffer.load intrinsic.

How is this different from using amdgcn.buffer.load if D28993 lands (which is not certain)?

Jan 31 2017, 4:34 PM
tstellarAMD created D29342: TableGen: Add support for intrinsic patterns with vAny.
Jan 31 2017, 11:49 AM
tstellarAMD added inline comments to D27586: AMDGPU/SI: Add llvm.amdgcn.s.buffer.load intrinsic.
Jan 31 2017, 11:46 AM
tstellarAMD updated the diff for D27586: AMDGPU/SI: Add llvm.amdgcn.s.buffer.load intrinsic.

More test cases and rebase on top of latest master.

Jan 31 2017, 11:44 AM
tstellarAMD accepted D28784: AMDGPU: Check users in fneg/fabs select combine.
Jan 31 2017, 10:51 AM · Restricted Project
tstellarAMD accepted D28783: AMDGPU: Check if users of fneg can fold mods.
Jan 31 2017, 10:50 AM
tstellarAMD created D29325: AMDGPU/SI: Attempt to fix inst-select-load-smrd.mir on some builds.
Jan 31 2017, 6:20 AM
tstellarAMD updated the diff for D24929: libclc: Remove copyright headers from top of files.

Fix gen_convert.py and add LLVM file headers to all files.

Jan 31 2017, 5:01 AM
tstellarAMD updated the diff for D24930: libclc: Remove code that is not licensed under either the UIUC or MIT license.

Remove lgamma functions too.

Jan 31 2017, 2:21 AM

Jan 30 2017

tstellarAMD accepted D29302: AMDGPU: Allow sinking of addressing modes for atomic_inc/dec.

LGTM.

Jan 30 2017, 3:23 PM
tstellarAMD abandoned D13808: ELF: Add AMDGPU relocation types.
Jan 30 2017, 3:20 PM
tstellarAMD accepted D28936: AMDGPU: Always allocate emergency stack slot at offset 0.

LGTM.

Jan 30 2017, 3:18 PM
tstellarAMD accepted D28833: AMDGPU: Generalize matching of v_med3_f32.

LGTM.

Jan 30 2017, 3:16 PM
tstellarAMD accepted D28886: AMDGPU: Use source mods with fcanonicalize.

LGTM.

Jan 30 2017, 3:15 PM
tstellarAMD accepted D28912: AMDGPU: Fold fneg into fmin/fmax_legacy.

LGTM.

Jan 30 2017, 4:19 AM
tstellarAMD accepted D28063: AMDGPU: Make i32 uaddo/usubo legal.

LGTM.

Jan 30 2017, 4:18 AM
tstellarAMD accepted D29187: AMDGPU: Implement hook for InferAddressSpaces.

LGTM.

Jan 30 2017, 4:01 AM
tstellarAMD accepted D29070: AMDGPU: Run AMDGPUCodeGenPrepare after inlining.

LGTM.

Jan 30 2017, 4:00 AM
tstellarAMD accepted D29210: AMDGPU: Fix atomic_inc/atomic_dec + ds_swizzle not being divergent.

LGTM.

Jan 30 2017, 4:00 AM

Jan 27 2017

tstellarAMD updated the diff for D29049: TableGen: Fix infinite recursion in RegisterBankEmitter.

Keep track of visited register class and make sure that equivalent register
classes are both added to the CoverageData.

Jan 27 2017, 1:44 PM

Jan 24 2017

tstellarAMD added inline comments to D29068: AMDGPU/SI: Move some ISel helpers into utils so they can be shared with GISel.
Jan 24 2017, 6:56 PM
tstellarAMD added reviewers for D29105: Fix regalloc assignment of overlapping registers: MatzeB, qcolombet.
Jan 24 2017, 5:10 PM
tstellarAMD updated the diff for D26730: AMDGPU/GlobalISel: Add support for simple shaders.

Rebase on top of master.

Jan 24 2017, 7:03 AM

Jan 23 2017

tstellarAMD created D29068: AMDGPU/SI: Move some ISel helpers into utils so they can be shared with GISel.
Jan 23 2017, 7:06 PM
tstellarAMD created D29049: TableGen: Fix infinite recursion in RegisterBankEmitter.
Jan 23 2017, 2:24 PM
tstellarAMD accepted D28787: AMDGPU: Propagate fast math flags in fneg combines.

LGTM.

Jan 23 2017, 9:18 AM
tstellarAMD accepted D28995: AMDGPU: Remove a useless VI SMRD pattern.
Jan 23 2017, 8:10 AM

Dec 22 2016

tstellarAMD accepted D27822: AMDGPU: Constant fold when immediate is materialized.

LGTM.

Dec 22 2016, 8:33 AM
tstellarAMD accepted D27881: AMDGPU: Custom lower more vector operations.

LGTM.

Dec 22 2016, 8:31 AM
tstellarAMD accepted D27882: AMDGPU: Use i16 for i16 shift amount.

LGTM.

Dec 22 2016, 8:29 AM
tstellarAMD accepted D27886: AMDGPU: Fix missing 16-bit cmpx instructions.

LGTM.

Dec 22 2016, 8:27 AM

Dec 21 2016

tstellarAMD added a comment to D27964: AMDGPU/R600: Serialize vector trunc stores to private AS.

Why are all the test changes for local memory read/write, when the code changes behavior for private memory stores?

Dec 21 2016, 7:26 PM
tstellarAMD accepted D27884: AMDGPU: Use i16 comparison instructions.

LGTM.

Dec 21 2016, 6:56 PM
tstellarAMD accepted D27887: AMDGPU: Fix missing commute table entries for cmpx.

LGTM.

Dec 21 2016, 6:55 PM
tstellarAMD accepted D27888: AMDGPU: Implement isFMAFasterThanFMulAndFAdd for f16.

LGTM.

Dec 21 2016, 6:54 PM
tstellarAMD accepted D27889: AMDGPU: Enable some f32 fadd/fsub combines for f16.

LGTM.

Dec 21 2016, 6:52 PM
tstellarAMD accepted D27891: AMDGPU: Implement f16 fcanonicalize.

LGTM.

Dec 21 2016, 6:52 PM
tstellarAMD accepted D27892: AMDGPU: Update isFPImmLegal for f16.

LGTM.

Dec 21 2016, 6:51 PM
tstellarAMD accepted D27895: AMDGPU: Move combines into separate functions.

LGTM.

Dec 21 2016, 6:50 PM
tstellarAMD accepted D27903: AMDGPU: Form more FMAs if fusion is allowed.

LGTM.

Dec 21 2016, 6:49 PM
tstellarAMD accepted D27904: AMDGPU: Check fast math flags in fadd/fsub combines .

LGTM.

Dec 21 2016, 6:47 PM
tstellarAMD accepted D27914: AMDGPU: Custom lower f16 fdiv.

LGTM.

Dec 21 2016, 6:46 PM
tstellarAMD accepted D27989: AMDGPU: split ret/noret patterns for global atomics.

LGTM.

Dec 21 2016, 6:44 PM
tstellarAMD accepted D28004: AMDGPU: Invert cmp + select with constant.

LGTM.

Dec 21 2016, 6:40 PM
tstellarAMD added inline comments to D26730: AMDGPU/GlobalISel: Add support for simple shaders.
Dec 21 2016, 3:34 PM
tstellarAMD updated the diff for D26730: AMDGPU/GlobalISel: Add support for simple shaders.

Address review comments.

Dec 21 2016, 3:34 PM

Dec 20 2016

tstellarAMD accepted D27864: AMDGPU: Don't add same instruction multiple times to worklist.

LGTM.

Dec 20 2016, 10:30 AM
tstellarAMD accepted D27890: AMDGPU: Run fp combine tests on VI.

LGTM.

Dec 20 2016, 10:17 AM
tstellarAMD added inline comments to D27834: AMDGPU/SI: Add a MachineMemOperand when lowering llvm.amdgcn.buffer.load.*.
Dec 20 2016, 8:36 AM

Dec 19 2016

tstellarAMD accepted D27905: AMDGPU: Swap order of operands in fadd/fsub combine .

LGTM.

Dec 19 2016, 2:21 PM