sheredom (Neil Henning)
User

Projects

User does not belong to any projects.

User Details

User Since
Dec 6 2013, 2:54 AM (258 w, 3 d)

All things performance at AMD: current focus - Vulkan!

Recent Activity

Thu, Nov 8

sheredom added inline comments to D54042: [AMDGPU] Extend the SI Load/Store optimizer to combine more things..
Thu, Nov 8, 9:25 AM · Restricted Project

Tue, Nov 6

sheredom updated the diff for D54042: [AMDGPU] Extend the SI Load/Store optimizer to combine more things..

We discussed this on an internal AMD meeting Monday 5th November 2018, and came to the conclusion that even though I do want the scalar load combining to be brought upstream, it would be better as a separate change so that we can get broader testing across the users of our AMDGPU backend.

Tue, Nov 6, 2:39 AM · Restricted Project

Mon, Nov 5

sheredom committed rL346128: [AMDGPU] Fix the new atomic optimizer in pixel shaders..
[AMDGPU] Fix the new atomic optimizer in pixel shaders.
Mon, Nov 5, 4:07 AM
sheredom closed D53930: [AMDGPU] Fix the new atomic optimizer in pixel shaders..
Mon, Nov 5, 4:07 AM · Restricted Project
sheredom added inline comments to D53930: [AMDGPU] Fix the new atomic optimizer in pixel shaders..
Mon, Nov 5, 4:05 AM · Restricted Project
sheredom updated the diff for D53930: [AMDGPU] Fix the new atomic optimizer in pixel shaders..

Fix review comments made by @nhaehnle

Mon, Nov 5, 4:05 AM · Restricted Project

Fri, Nov 2

sheredom created D54042: [AMDGPU] Extend the SI Load/Store optimizer to combine more things..
Fri, Nov 2, 11:27 AM · Restricted Project
sheredom committed rL345962: [AMDGPU] UBSan bug fix for r345710.
[AMDGPU] UBSan bug fix for r345710
Fri, Nov 2, 3:27 AM

Thu, Nov 1

sheredom updated the diff for D53930: [AMDGPU] Fix the new atomic optimizer in pixel shaders..

Review fixes.

Thu, Nov 1, 3:25 AM · Restricted Project

Wed, Oct 31

sheredom created D53930: [AMDGPU] Fix the new atomic optimizer in pixel shaders..
Wed, Oct 31, 6:46 AM · Restricted Project
sheredom committed rL345710: [AMDGPU] support image load/store a16.
[AMDGPU] support image load/store a16
Wed, Oct 31, 3:38 AM
sheredom closed D53750: [AMDGPU] support image load/store a16.
Wed, Oct 31, 3:38 AM · Restricted Project

Mon, Oct 29

sheredom updated the diff for D53750: [AMDGPU] support image load/store a16.

Added an additional lit test for the dim variants (including the 2darraymsaa requested by @nhaehnle), and change the naming of the a16.d16 tests to vNf16.

Mon, Oct 29, 2:43 AM · Restricted Project

Fri, Oct 26

sheredom created D53750: [AMDGPU] support image load/store a16.
Fri, Oct 26, 2:24 AM · Restricted Project

Oct 19 2018

sheredom added a comment to D42885: [AMDGPU] intrintrics for byte/short load/store.

Maybe a dumb question - but why can't we just use the tbuffer load/store instead of these? It already upcasts for you (the zext/sext is built in depending on the nfmt I believe).

Oct 19 2018, 1:55 AM

Oct 10 2018

sheredom committed rL344128: Fix an ordering bug in the scalarizer..
Fix an ordering bug in the scalarizer.
Oct 10 2018, 2:29 AM
sheredom closed D52540: Fix an ordering bug in the scalarizer..
Oct 10 2018, 2:29 AM

Oct 8 2018

sheredom committed rL343973: [AMDGPU] Add an AMDGPU specific atomic optimizer..
[AMDGPU] Add an AMDGPU specific atomic optimizer.
Oct 8 2018, 8:51 AM
sheredom closed D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..
Oct 8 2018, 8:51 AM · Restricted Project
sheredom updated the diff for D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..

Rebased ontop of tip to get the IRBuilder change in https://reviews.llvm.org/D52087, and incorporate the requested changes by @nhaehnle resulting from that.

Oct 8 2018, 4:49 AM · Restricted Project
sheredom committed rL343962: [IRBuilder] Fixup CreateIntrinsic to allow specifying Types to Mangle..
[IRBuilder] Fixup CreateIntrinsic to allow specifying Types to Mangle.
Oct 8 2018, 3:35 AM
sheredom closed D52087: [IRBuilder] Fixup CreateIntrinsic to allow specifying Types to Mangle..
Oct 8 2018, 3:35 AM

Oct 5 2018

sheredom committed rL343842: Add missing period to comment to match style of file..
Add missing period to comment to match style of file.
Oct 5 2018, 2:42 AM

Sep 28 2018

sheredom added a comment to D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

Thanks - LGTM. As before, I'd prefer to have the baseline tests in place before the patch. Let me know if I should commit on your behalf.

Sep 28 2018, 7:30 AM
sheredom updated the diff for D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

Fix latest review comments.

Sep 28 2018, 7:22 AM
sheredom added inline comments to D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..
Sep 28 2018, 6:27 AM
sheredom retitled D52548: Stop instcombining propagating wider shufflevector arguments to predecessors. from Stop instcombining introducing undef's in div/rem instructions. to Stop instcombining propagating wider shufflevector arguments to predecessors..
Sep 28 2018, 6:12 AM
sheredom added inline comments to D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..
Sep 28 2018, 2:43 AM
sheredom updated the diff for D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

Move the shuffle widens check into Instructions.h, and call it increasesLength to match the existing changesLength call that was already on ShuffleVectorInst.

Sep 28 2018, 2:42 AM
sheredom added inline comments to D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..
Sep 28 2018, 1:59 AM
sheredom updated the diff for D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

Changed the check to not ever push wider shufflevector stuff back onto predecessor instructions as per spatel's suggestion!

Sep 28 2018, 1:50 AM

Sep 27 2018

sheredom updated the diff for D52540: Fix an ordering bug in the scalarizer..

Incorporate a related test case that my approach also fixes from https://bugs.llvm.org/show_bug.cgi?id=28911

Sep 27 2018, 7:16 AM
sheredom added a comment to D52540: Fix an ordering bug in the scalarizer..

dstenb described a possible fix in a comment of PR28911 and we've been using that fix for a long time now for our
out-of-tree target without problems.

Perhaps that fix is a hack and using RPOT is the proper way to deal with this, I've no idea. I just wanted to point out the
possibility.

Sep 27 2018, 6:23 AM
sheredom added a comment to D52540: Fix an ordering bug in the scalarizer..

Is this the same problem as described in https://bugs.llvm.org/show_bug.cgi?id=28911 ?

Sep 27 2018, 5:30 AM
sheredom updated the diff for D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

Rebased on tip trunk.

Sep 27 2018, 2:07 AM

Sep 26 2018

sheredom updated the diff for D52556: Add a test case showing the instcombine fail from D52548.
  • Added a comment explaining each of the 3 variants for each of sdiv/srem/udiv/urem
  • Fixed the whitespace issue
  • Removed the FMF flags that were a legacy from the original shader this was reduced form
Sep 26 2018, 10:42 AM
sheredom updated the diff for D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

Removed the fdiv/frem as they weren't currently inst simplifying the bad behaviour, and used the utils script to update the test case.

Sep 26 2018, 8:47 AM
sheredom added a comment to D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

See https://reviews.llvm.org/D52556 for just the test case with the bad output expected.

Sep 26 2018, 8:20 AM
sheredom created D52556: Add a test case showing the instcombine fail from D52548.
Sep 26 2018, 8:20 AM
sheredom added a comment to D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..

Just to be clear you want me to commit the tests in a separate commit with the bad output first?

Sep 26 2018, 7:31 AM
sheredom created D52548: Stop instcombining propagating wider shufflevector arguments to predecessors..
Sep 26 2018, 6:36 AM
sheredom created D52540: Fix an ordering bug in the scalarizer..
Sep 26 2018, 3:06 AM

Sep 25 2018

sheredom added a comment to D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..

What happens if a shader already does "if (threadID == 0) { do_atomic(); }"? Is the optimization skipped in this case?

Sep 25 2018, 1:55 AM · Restricted Project

Sep 24 2018

sheredom updated the diff for D52087: [IRBuilder] Fixup CreateIntrinsic to allow specifying Types to Mangle..

Removed the no-args overload as per Nicolai's suggestion.

Sep 24 2018, 7:20 AM
sheredom added inline comments to D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..
Sep 24 2018, 7:20 AM · Restricted Project

Sep 14 2018

sheredom updated the diff for D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..

Added an interaction with the DominatorTree, so that if its present in the PassManager we can preserve + update it.

Sep 14 2018, 6:25 AM · Restricted Project
sheredom added inline comments to D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..
Sep 14 2018, 4:38 AM · Restricted Project
sheredom created D52087: [IRBuilder] Fixup CreateIntrinsic to allow specifying Types to Mangle..
Sep 14 2018, 4:35 AM
sheredom added inline comments to D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..
Sep 14 2018, 2:01 AM · Restricted Project

Sep 13 2018

sheredom added inline comments to D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..
Sep 13 2018, 6:12 AM · Restricted Project

Sep 12 2018

sheredom updated the diff for D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..

Update to include tip LLVM changes (DivergenceAnalysis -> LegacyDivergenceAnalysis, AMDGPUAS is become just an enum now).

Sep 12 2018, 2:44 AM · Restricted Project
sheredom created D51969: [AMDGPU] Add an AMDGPU specific atomic optimizer..
Sep 12 2018, 2:37 AM · Restricted Project

Sep 9 2016

sheredom added a comment to D21723: [RFC] Enhance synchscope representation.

This is something we (Codeplay) would like to see upstream, it is a much cleaner solution than the metadata workarounds everyone (including us) have been using to fix this.

Sep 9 2016, 2:20 AM

Apr 27 2016

sheredom closed D19478: Remove assert mandating you can only use SPIR target with OpenCL.

Thanks!

Apr 27 2016, 1:28 AM

Apr 26 2016

sheredom added a comment to D19478: Remove assert mandating you can only use SPIR target with OpenCL.

So we build a bunch of internal libraries in a mix of OpenCL and C++, and then link them all together to create SPIR libraries that can be fed to calls to clLinkProgram and linked against user kernels.

Apr 26 2016, 2:19 AM

Apr 25 2016

sheredom retitled D19478: Remove assert mandating you can only use SPIR target with OpenCL from to Remove assert mandating you can only use SPIR target with OpenCL.
Apr 25 2016, 3:38 AM

Jan 29 2015

sheredom updated the diff for D7245: Fix OpenCL 1.2 double as an optional core feature behaviour.

Ran clang-format on the file, and there was a TON of line changes out-with the patch. Extracted the formatted lines from my original patch only, and updated the patch.

Jan 29 2015, 4:58 AM
sheredom updated the test plan for D7245: Fix OpenCL 1.2 double as an optional core feature behaviour.
Jan 29 2015, 2:40 AM
sheredom retitled D7245: Fix OpenCL 1.2 double as an optional core feature behaviour from to Fix OpenCL 1.2 double as an optional core feature behaviour.
Jan 29 2015, 2:39 AM