As per the discussion in D103818, so far, this does not appear to be worthwhile.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Why is this better? vinsertf128 tends to be faster than broadcasts
llvm/test/CodeGen/X86/vector-shuffle-256-v4.ll | ||
---|---|---|
1016 ↗ | (On Diff #350299) | The AVX1 shuffle looks to be much better............ |