Kernel arguments are likely read by all workitems and should not bypass
the cache. This fixes a performance hit in sub-dword argument loads.
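
As a rough illustration of the summary above, the change amounts to dropping one bit from the set of flags attached to a kernel-argument load. The sketch below is not the code from SIISelLowering.cpp: the enum, its values, and the helper names are hypothetical stand-ins, and the two flags other than the non-temporal one are placeholders assumed only so the example has something to keep.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for memory-operand flags. Names and values are
// illustrative only and are not taken from the patch.
enum MemFlag : std::uint32_t {
  NonTemporal     = 1u << 0, // hint: data is not reused, load may bypass cache
  Dereferenceable = 1u << 1, // placeholder: address is known to be valid
  Invariant       = 1u << 2, // placeholder: memory is not modified
};

// Assumed flag set for a kernel-argument load before the change.
constexpr std::uint32_t kernargLoadFlagsBefore() {
  return NonTemporal | Dereferenceable | Invariant;
}

// Flag set after the change: only the non-temporal hint is cleared, so
// workitems reading the same argument can be served from cache.
constexpr std::uint32_t kernargLoadFlagsAfter() {
  return kernargLoadFlagsBefore() & ~static_cast<std::uint32_t>(NonTemporal);
}

// True if a load with these flags carries the cache-bypass hint.
constexpr bool bypassesCache(std::uint32_t Flags) {
  return (Flags & NonTemporal) != 0;
}
```

With these stand-ins, `bypassesCache(kernargLoadFlagsBefore())` is true and `bypassesCache(kernargLoadFlagsAfter())` is false, while the remaining flags are left untouched.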
Differential D43249
[AMDGPU] Remove non-temporal flag from argument loads
Closed, Public
Authored by rampitec on Feb 13 2018, 11:21 AM.
Event Timeline
Feb 13 2018, 11:21 AM: Herald added subscribers: tpr, dstuttard, yaxunl and 2 others.
Feb 14 2018, 9:15 AM: This revision is now accepted and ready to land.
Feb 14 2018, 10:07 AM: Closed by commit rL325146: [AMDGPU] Remove non-temporal flag from argument loads (authored by rampitec).
This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 134087
lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/kernel-args.ll
test/CodeGen/AMDGPU/llvm.amdgcn.buffer.store.format.d16.ll
test/CodeGen/AMDGPU/llvm.amdgcn.image.d16.ll
test/CodeGen/AMDGPU/llvm.amdgcn.tbuffer.store.d16.ll