This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Remove non-temporal flag from argument loads
ClosedPublic

Authored by rampitec on Feb 13 2018, 11:21 AM.

Details

Summary

Kernel arguments are likely to be read by all workitems, so their loads
should not bypass the cache. This fixes a performance hit in sub-dword
argument loads.
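
For illustration only, the effect of dropping the flag can be sketched with a self-contained model. The enum below is a local stand-in whose names echo LLVM's MachineMemOperand flags, and kernargLoadFlags and hasNonTemporalHint are hypothetical helpers, not functions from the backend. The exact flag set on argument loads is an assumption here; the patch itself is the authority on what changed.

```cpp
#include <cstdint>

// Local stand-in for memory-operand flags. The names echo LLVM's
// MachineMemOperand flags, but this enum is a hypothetical model and does
// not include or depend on any LLVM header.
enum MemFlags : std::uint32_t {
  MONone = 0,
  MOLoad = 1u << 0,
  MONonTemporal = 1u << 3,
  MODereferenceable = 1u << 4,
  MOInvariant = 1u << 5,
};

// Hypothetical helper: builds the flag set for a kernel-argument load.
// Assumption: argument loads are dereferenceable and invariant; the
// non-temporal hint is the only flag that varies.
constexpr std::uint32_t kernargLoadFlags(bool markNonTemporal) {
  return MOLoad | MODereferenceable | MOInvariant |
         (markNonTemporal ? MONonTemporal : MONone);
}

// Hypothetical helper: true if the load carries the non-temporal hint,
// which a backend may lower to a cache-bypassing access.
constexpr bool hasNonTemporalHint(std::uint32_t flags) {
  return (flags & MONonTemporal) != 0;
}

// With the flag: the load is marked non-temporal.
static_assert(hasNonTemporalHint(kernargLoadFlags(true)),
              "non-temporal hint expected when requested");
// Without the flag: the load stays cacheable, but remains invariant.
static_assert(!hasNonTemporalHint(kernargLoadFlags(false)),
              "no non-temporal hint expected");
static_assert((kernargLoadFlags(false) & MOInvariant) != 0,
              "invariant flag is kept either way");
```

In this model, kernargLoadFlags(false) corresponds to the intent of the summary: the load keeps its other properties and only loses the hint that could make it bypass the cache.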

Event Timeline

rampitec created this revision. Feb 13 2018, 11:21 AM
kzhuravl accepted this revision. Feb 14 2018, 9:15 AM

LGTM.

This revision is now accepted and ready to land. Feb 14 2018, 9:15 AM
This revision was automatically updated to reflect the committed changes.