This is less important than increase threshold for private memory,
but still brings performance improvements in a wide range of tests.
Unrolling more for local memory serves three purposes: it allows
to combine ds operations if offset becomes static, saves registers
used for offsets in case of static offsets, and allows better lds
latency hiding.
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM
Event Timeline
Comment Actions
LGTM except the line should be "else if" or begin a newline!
lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp | ||
---|---|---|
87 ↗ | (On Diff #93194) | Should this be "} else if (AS == ASST.LOCAL_ADDRESS) { |