As per the discussion in D103818, so far, this does not appear to be worthwhile.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Why is this better? vinsertf128 tends to be faster than broadcasts
| llvm/test/CodeGen/X86/vector-shuffle-256-v4.ll | ||
|---|---|---|
| 1016 ↗ | (On Diff #350299) | The AVX1 shuffle looks to be much better............ |