This adds some basic fadd/fmul reduction costs for MVE/NEON. It reduces by halving the vector size until it it gets scalarized, with some additional costs for fp16 which may require extracting the top lanes.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo