All things performance at AMD: current focus - Vulkan!
- User Since: Dec 6 2013, 2:54 AM (258 w, 3 d)
Thu, Nov 8
Tue, Nov 6
We discussed this in an internal AMD meeting on Monday 5th November 2018 and concluded that, although I do want the scalar load combining to be brought upstream, it would be better as a separate change so that we can get broader testing across the users of our AMDGPU backend.
Mon, Nov 5
Fix review comments made by @nhaehnle
Fri, Nov 2
Thu, Nov 1
Wed, Oct 31
Mon, Oct 29
Added an additional lit test for the dim variants (including the 2darraymsaa requested by @nhaehnle), and changed the naming of the a16.d16 tests to vNf16.
Fri, Oct 26
Oct 19 2018
Maybe a dumb question - but why can't we just use the tbuffer load/store instead of these? It already upcasts for you (the zext/sext is built in depending on the nfmt I believe).
Oct 10 2018
Oct 8 2018
Oct 5 2018
Sep 28 2018
Fix latest review comments.
Move the shuffle widens check into Instructions.h, and call it increasesLength to match the existing changesLength call that was already on ShuffleVectorInst.
Changed the check so that it never pushes wider shufflevectors back onto predecessor instructions, as per spatel's suggestion!
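For readers unfamiliar with the distinction between changesLength and increasesLength on ShuffleVectorInst, here is a minimal standalone sketch of the idea. It does not use LLVM; `ShuffleShape` and its two fields are hypothetical names invented for illustration, and the comparison shown is an assumption about what "increases length" means (result wider than the inputs), not a copy of the patch.

```cpp
#include <cstddef>

// Hypothetical stand-in for a shufflevector instruction: it records only the
// element counts, not the operands or the mask contents.
struct ShuffleShape {
  std::size_t NumSourceElts; // elements in each input vector
  std::size_t NumMaskElts;   // elements in the mask, i.e. the result length

  // True when the result length differs from the input length.
  bool changesLength() const { return NumMaskElts != NumSourceElts; }

  // True only when the result is wider than the inputs.
  bool increasesLength() const { return NumMaskElts > NumSourceElts; }
};
```

Under this sketch, a shuffle from 2 to 4 elements both changes and increases length, while a shuffle from 4 to 2 elements changes length without increasing it.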
Sep 27 2018
Incorporate a related test case from https://bugs.llvm.org/show_bug.cgi?id=28911 that my approach also fixes.
Rebased on tip trunk.
Sep 26 2018
- Added a comment explaining each of the 3 variants for each of sdiv/srem/udiv/urem
- Fixed the whitespace issue
- Removed the FMF flags that were a legacy from the original shader this was reduced from
Removed the fdiv/frem as they weren't currently inst simplifying the bad behaviour, and used the utils script to update the test case.
See https://reviews.llvm.org/D52556 for just the test case with the bad output expected.
Just to be clear: you want me to commit the tests in a separate commit, with the bad output, first?
Sep 25 2018
Sep 24 2018
Removed the no-args overload as per Nicolai's suggestion.
Sep 14 2018
Added an interaction with the DominatorTree, so that if it's present in the PassManager we can preserve and update it.
Sep 13 2018
Sep 12 2018
Update to include tip LLVM changes (DivergenceAnalysis -> LegacyDivergenceAnalysis, AMDGPUAS is now just an enum).
Sep 9 2016
This is something we (Codeplay) would like to see upstream; it is a much cleaner solution than the metadata workarounds everyone (including us) has been using to fix this.
Apr 27 2016
Apr 26 2016
So we build a bunch of internal libraries in a mix of OpenCL and C++, and then link them all together to create SPIR libraries that can be fed to calls to clLinkProgram and linked against user kernels.
Apr 25 2016
Jan 29 2015
Ran clang-format on the file, and there were a TON of line changes outside the patch. Extracted the formatted lines from my original patch only, and updated the patch.