Remove the restriction, from the mve tail predication pass, that the all masked vectors instructions need to be 128-bits. This allows us to supported extending loads and truncating stores.
Details
Details
Diff Detail
Diff Detail
Event Timeline
Comment Actions
All the codegen tests are dlstp.32 loops, but do we have or need to have some tests with size = 8, 16, or 64 too?
Comment Actions
Added a test for i8 to i16 conversions. I remembered that there's more tests in the parent ticket that handle sizes other than 32.