To ensure correct behaviour NSA instructions should not be claused.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
As I understand it, this is only needed for GFX10.1, not for GFX10.3. So I think it probably deserves its own target feature which would be included in GFX10_1_Bugs.
llvm/lib/Target/AMDGPU/SIInsertHardClauses.cpp | ||
---|---|---|
80 | Maybe sink this down to just before we return HARDCLAUSE_VMEM? |
Maybe sink this down to just before we return HARDCLAUSE_VMEM?