This is an archive of the discontinued LLVM Phabricator instance.

Boost unroll threshold for loops reading local memory
ClosedPublic

Authored by rampitec on Mar 27 2017, 5:21 PM.

Details

Summary

This is less important than increasing the threshold for private memory,
but it still brings performance improvements in a wide range of tests.
Unrolling more for local memory serves three purposes: it allows ds
operations to be combined if the offset becomes static, saves registers
used for offsets when the offsets are static, and allows better LDS
latency hiding.
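
The idea can be sketched roughly as follows. This is a standalone illustration, not the code in the patch: the `AddrSpace` enum, the `pickUnrollThreshold` helper, and all three threshold values are hypothetical and chosen only to show a larger boost for private memory and a smaller one for local memory.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical address-space tags; not the real AMDGPU numbering.
enum class AddrSpace { Global, Local, Private };

// Hypothetical thresholds, chosen only for illustration.
constexpr unsigned kDefaultThreshold = 300;
constexpr unsigned kLocalThreshold = 1000;
constexpr unsigned kPrivateThreshold = 2000;

// Returns an unroll threshold for a loop, given the address space of each
// memory access in its body. Private accesses get the largest boost, local
// accesses a smaller one, and everything else keeps the default.
unsigned pickUnrollThreshold(const std::vector<AddrSpace> &accesses) {
  unsigned threshold = kDefaultThreshold;
  for (AddrSpace as : accesses) {
    if (as == AddrSpace::Private) {
      threshold = std::max(threshold, kPrivateThreshold);
    } else if (as == AddrSpace::Local) {
      threshold = std::max(threshold, kLocalThreshold);
    }
  }
  return threshold;
}
```

Under these assumed values, a loop that only reads local memory gets the local threshold, while a loop that touches both local and private memory keeps the larger private threshold regardless of access order.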

Diff Detail

Repository
rL LLVM

Event Timeline

rampitec created this revision. Mar 27 2017, 5:21 PM
cfang accepted this revision. Mar 28 2017, 2:13 PM

LGTM, except that the line should be "else if" or begin on a new line.

lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
87 ↗(On Diff #93194)

Should this be "} else if (AS == ASST.LOCAL_ADDRESS) {"?

This revision is now accepted and ready to land. Mar 28 2017, 2:13 PM
rampitec updated this revision to Diff 93308. Mar 28 2017, 3:03 PM

Fixed the "else if" statement.

rampitec marked an inline comment as done. Mar 28 2017, 3:04 PM
This revision was automatically updated to reflect the committed changes.