If a target can encode multiple wait-states into a noop allow emitting such
instructions directly.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
There's a comment in AMDGPUTargetMachine.cpp that you could remove:
// FIXME: This stand-alone pass will emit indiv. S_NOP 0, as needed. It would // be better for it to emit S_NOP <N> when possible.
Comment Actions
In "HazardRecognizerMode", yes. It uses SIInstrInfo::getNumWaitStates which accounts for noops with immediates > 0.
The post-RA scheduler should not insert these combined noops so we did not handle it there.
I think unsigned would make more sense, and WaitStates seems like AMDGPU-specific terminology.