This patch lowers the SAD intrinsics to native LLVM IR. Comes with an LLVM patch (D45723).
Details
Details
Diff Detail
Diff Detail
- Repository
- rC Clang
Event Timeline
clang/lib/CodeGen/CGBuiltin.cpp | ||
---|---|---|
8426 ↗ | (On Diff #142914) | Size the ShuffleMask to N when it's created. Then you can use just direct assign each array entry in the loops. This will remove the need for the clear() in the later loop. It will also remove the hidden code that checks if we need to grow on every call to push_back. |
8431 ↗ | (On Diff #142914) | You can just pass AD twice. You don't need to create an Undef value. It will get optimized later. |
This clear isn't needed.