Improves VGPR usage in some shaderdb shaders, but
introduces some SGPR spilling regressions caused by incidental
scheduling changes later.
Details
Diff Detail
Event Timeline
lib/Transforms/InstCombine/InstCombineSimplifyDemanded.cpp | ||
---|---|---|
1734 | Gather4 opcodes always return 4 VGPRs, and DMASK has a different meaning for them. Specifically, Gather4 reads 4 texels from memory, and DMASK selects which color component is returned for those texels (e.g. 4x red channel, or 4x green channel). So the compiler shouldn't change DMASK for gather4 opcodes. |
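To make the concern in the comment above concrete, here is a minimal, self-contained C++ sketch of the kind of guard being asked for. It is not the code from the patch: `ImageOp`, `IsGather4`, and `narrowDMask` are hypothetical names invented for illustration, and the sketch assumes that for ordinary (non-gather4) image opcodes the enabled DMASK components are returned packed in ascending bit order.

```cpp
#include <cstdint>

// Hypothetical stand-in for an image intrinsic call; not an LLVM type.
struct ImageOp {
  bool IsGather4;  // true for gather4-style opcodes
  uint32_t DMask;  // 4-bit component mask
};

// DemandedElts: bit i is set if the i-th *returned* element is used.
// Returns the DMASK the operation should carry after simplification.
uint32_t narrowDMask(const ImageOp &Op, uint32_t DemandedElts) {
  // Gather4 always returns 4 VGPRs, and DMASK selects the channel that is
  // fetched for all 4 texels, so the mask must be left untouched.
  if (Op.IsGather4)
    return Op.DMask;

  // Assumption: for other image ops, each set DMASK bit yields one result
  // element, packed in ascending bit order. Keep only the bits whose
  // result element is actually demanded.
  uint32_t NewDMask = 0;
  unsigned ResultIdx = 0;
  for (unsigned Bit = 0; Bit < 4; ++Bit) {
    const uint32_t Mask = 1u << Bit;
    if (!(Op.DMask & Mask))
      continue;
    if (DemandedElts & (1u << ResultIdx))
      NewDMask |= Mask;
    ++ResultIdx;
  }
  return NewDMask;
}
```

Under those assumptions, a regular sample with `DMask = 0xF` whose users only read elements 0 and 2 would be narrowed to `0x5`, whereas a gather4 with `DMask = 0x2` keeps `0x2` no matter which result elements are demanded.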