FP16 to FP32 converts can be handled in MVE lane interleaving, much like the sext/zext lowering we do. This expands the pass with fpext and fptrunc handling, and basic fp operations allowing more efficient lowering of fp vectors.
Details
Details
Diff Detail
Diff Detail
Event Timeline
llvm/lib/Target/ARM/MVELaneInterleavingPass.cpp | ||
---|---|---|
130 | Can you clarify this a bit? I am no following the "extra VCVT's" => "cheap" reasoning. I am guessing you meant expensive? |
Can you clarify this a bit? I am no following the "extra VCVT's" => "cheap" reasoning. I am guessing you meant expensive?