Currently we only reduce vector.reduce.add to sdot if the vectors are either <8 x i8> or <16 x i8>.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Thanks for adding the tests! Could you also add a few variants where the number of elements is not a multiple of 16?
this will generate a massive amount of code, so it might be better to just keep the one with 33 elements.