We set AddedComplexity = 100 for s_load patterns to prefer them over
global loads, but for s_buffer_load patterns there is no need to do
this and it was quietly overriding the AddedComplexity of each
individual GCNPat that is defined inside SMLoad_Pattern (but in practice
that did not appear to make any difference).
Details
Details
- Reviewers
arsenm kosarev - Group Reviewers
Restricted Project - Commits
- rGf34a1953ce33: [AMDGPU] Fix AddedComplexity for s_buffer_load patterns. NFCI.
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/lib/Target/AMDGPU/SMInstructions.td | ||
---|---|---|
904 | Can you just move this up instead of indenting all the patterns? |
Can you just move this up instead of indenting all the patterns?