Details: https://reviews.llvm.org/D96805 changed GCNTTIImpl::getCFInstrCost to return 1 for PHI nodes for TTI::TCK_CodeSize and TTI::TCK_SizeAndLatency. This is incorrect because the value moves that result from PHI lowering are inserted into the predecessors of the basic block, not into the block itself. As a result of this change, LoopRotate and LoopUnroll were broken because the size/cost of the loop header and loop body was estimated incorrectly.

Fixes SWDEV-289429: 10-11% performance drop observed with ROC_OCL_Perf_Linpack_DGEMM_W32

Differential Revision: https://reviews.llvm.org/D105104
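To illustrate why the PHI cost matters for the loop passes, here is a minimal, self-contained sketch of a size estimate over a loop header. It is a toy model, not the LLVM code: `Opcode`, `CostKind`, `getInstrCostSketch`, and `estimateBlockSize` are hypothetical names, the flat cost of 1 for non-PHI instructions is a placeholder, and treating PHI as free for the two size-based cost kinds is an assumption about the intended behavior based on the summary above.

```cpp
#include <cassert>
#include <vector>

// Toy stand-ins for LLVM opcodes and TTI cost kinds (hypothetical, not the LLVM enums).
enum class Opcode { PHI, Br, Ret, Add };
enum class CostKind { RecipThroughput, Latency, CodeSize, SizeAndLatency };

// Sketch of a per-instruction cost hook.
// Assumption: PHI is free for size-based cost kinds, because the moves produced
// by PHI lowering are placed in the predecessor blocks, not in this block.
int getInstrCostSketch(Opcode Op, CostKind Kind) {
  const bool SizeBased =
      Kind == CostKind::CodeSize || Kind == CostKind::SizeAndLatency;
  if (Op == Opcode::PHI && SizeBased)
    return 0;
  return 1; // Placeholder cost for everything else in this toy model.
}

// Sums the per-instruction costs of one basic block.
int estimateBlockSize(const std::vector<Opcode> &Block, CostKind Kind) {
  int Size = 0;
  for (Opcode Op : Block)
    Size += getInstrCostSketch(Op, Kind);
  return Size;
}
```

In this model a loop header consisting of two PHI nodes, one add, and one branch has an estimated size of 2. Charging 1 per PHI would give 4, and an inflated estimate of that sort is what can push a header or loop body over the thresholds that passes such as LoopRotate and LoopUnroll compare against.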
Details
- Reviewers
- None
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
This seems to be a duplicate of D105104?
Instead of committing that patch, you created a new revision :)