The FP16 broadcast and transpose can always use the same instructions as are used for i16 vectors, with or without +fullfp16. This fills in some extra costs to make sure we get it right.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo