As noticed in D90554 , the AVX2 costs for 256-bit vectors did not include FMAXNUM entries, so we fell back to AVX1 which assumes those ops will be split into 128-bit halves.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/lib/Target/X86/X86TargetTransformInfo.cpp | ||
---|---|---|
2531 | Nit: the comments should be MAXSS + CMPUNORDSS. |
Nit: the comments should be MAXSS + CMPUNORDSS.
The same below.