When the initial propagation determines that a VMEM load does not have to run
in WQM, make it run in Exact mode instead if the remainder of the shader does
not care which mode the load runs in. Helper lanes are disabled in Exact mode,
so this can reduce the bandwidth required for loads.
The cost in pixel shaders is typically two additional SALU instructions and
two additional SGPRs, which are needed for the live mask.
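
To illustrate where the bandwidth saving comes from, here is a minimal
standalone sketch. It is not the code in this change and does not use any LLVM
API: wholeQuadMask, chooseLoadMode, lanesIssuingLoad and the two boolean inputs
are hypothetical names made up for the illustration, and a 64-lane wave is
assumed.

```cpp
#include <bitset>
#include <cstdint>

enum class Mode { WQM, Exact };

// Whole-quad mask: enable all four lanes of every quad that contains at
// least one live lane (the effect of a whole-quad-mode exec mask).
std::uint64_t wholeQuadMask(std::uint64_t live) {
  std::uint64_t wqm = 0;
  for (int quad = 0; quad < 16; ++quad) {
    const std::uint64_t quadBits = std::uint64_t{0xF} << (quad * 4);
    if (live & quadBits)
      wqm |= quadBits;
  }
  return wqm;
}

// Hypothetical decision rule: a load that does not itself need WQM runs in
// Exact mode only when the rest of the shader does not care either way.
Mode chooseLoadMode(bool loadNeedsWQM, bool restOfShaderDoesNotCare) {
  return (!loadNeedsWQM && restOfShaderDoesNotCare) ? Mode::Exact : Mode::WQM;
}

// Number of lanes that would issue the load under the given mode.
int lanesIssuingLoad(std::uint64_t live, Mode mode) {
  const std::uint64_t exec = (mode == Mode::WQM) ? wholeQuadMask(live) : live;
  return static_cast<int>(std::bitset<64>(exec).count());
}
```

With a live mask of 0x11 (one live pixel in each of two quads), the WQM exec
mask is 0xFF, so eight lanes issue the load; in Exact mode only the two live
lanes do.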
TII->usesVM_CNT(MI)