With DQI but without VLX, lower v2i64 and v4i64 MUL operations with v8i64 MUL (vpmullq).
Updated cost table accordingly.
Paths
| Differential D26011
[X86][AVX512DQ] Improve lowering of MUL v2i64 and v4i64 ClosedPublic Authored by RKSimon on Oct 26 2016, 1:55 PM.
Details Summary With DQI but without VLX, lower v2i64 and v4i64 MUL operations with v8i64 MUL (vpmullq). Updated cost table accordingly.
Diff Detail
Event TimelineRKSimon updated this object. igorb edited edge metadata. Comment ActionsLGTM
This revision is now accepted and ready to land.Oct 27 2016, 7:01 AM Closed by commit rL285304: [X86][AVX512DQ] Improve lowering of MUL v2i64 and v4i64 (authored by RKSimon). · Explain WhyOct 27 2016, 8:36 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 75943 lib/Target/X86/X86ISelLowering.cpp
lib/Target/X86/X86TargetTransformInfo.cpp
test/Analysis/CostModel/X86/arith.ll
test/CodeGen/X86/avx512-arith.ll
|
Hello,
It is possible to implement this logic in td file, similar to multiclass avx512_var_shift_w_lowering<..> implementation.