The different variants of the idot8 codegen DAG patterns are not generated by llvm-tablegen because doing so would hugely increase compile time. Instead, support the pattern that the clang front end generates after reordering the additions in the integer-dot8 source-language pattern.
Details
Diff Detail
- Repository
- rL LLVM
Event Timeline
lib/Target/AMDGPU/VOP3PInstructions.td | ||
---|---|---|
290–291 ↗ | (On Diff #171943) | How much of an increase? I would be surprised if adding anything here caused a noticeable difference. |
lib/Target/AMDGPU/VOP3PInstructions.td | ||
---|---|---|
290–291 ↗ | (On Diff #171943) | It's huge, up to a few hours. There are multiplications too, and the tree height is 8. The number of patterns grows exponentially, roughly 4^8, and that is without applying the associative property any further. |
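
For a sense of scale, the rough estimate in the reply above can be worked out directly. This is only a back-of-the-envelope sketch: the factor of 4 variants per tree level and the height of 8 are taken from the comment as stated, and the helper name `pattern_variants` is hypothetical, not part of TableGen or the patch.

```python
def pattern_variants(per_level: int, height: int) -> int:
    """Rough count of pattern variants if each tree level
    independently multiplies the number of orderings by `per_level`.
    (Hypothetical helper; models only the estimate quoted above.)"""
    return per_level ** height


# Assumed from the comment: about 4 variants per level, tree height 8.
counts = [pattern_variants(4, h) for h in range(1, 9)]

for height, count in enumerate(counts, start=1):
    print(f"height {height}: ~{count} patterns")
```

At height 8 this gives 4^8 = 65,536 variants for a single dot8 pattern, before any additional reassociation is considered, which is consistent with the reported jump in build time.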