v3i16 and v3f16 currently cannot be legalized and lowered so they should
not be emitted by inst combining.
Moved the check down to still allow extracting 1 or 2 elements via the dmask.
Fixes image intrinsics being combined to return v3x16.
Differential D84223
[AMDGPU] Don't combine memory intrs to v3i16 Flakebi on Jul 21 2020, 2:14 AM. Authored by
Details
v3i16 and v3f16 currently cannot be legalized and lowered so they should Moved the check down to still allow extracting 1 or 2 elements via the dmask. Fixes image intrinsics being combined to return v3x16.
Diff Detail
Event TimelineComment Actions I’m also trying to get it working properly (currently for SDag). I think I got the legalization/widening part working but I’m still trying to figure out how to select the right instruction patterns. The next two weeks I’m on vacation, so it will still take a while. I think Marek wants a slightly quicker fix, probably something in mesa hit this. |