Need to fix ther cost estimation for split loads, since we look at the
subregs already, no need to permute them, need just to estimate
subregister insert, if it is smaller than the real register. Also, using
split loads, it might be profitable already to vectorize smaller trees
with gathering of the loads.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp | ||
---|---|---|
4553 | What in particular needs fixing with SK_InsertSubvector ? |
llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp | ||
---|---|---|
4553 | In many cases X86 model fallbacks to the default (very pessimistic!) case and estimates it as extracts+gathers. |
llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp | ||
---|---|---|
4553 | I'll check it later. Plus, need to fux a regression caused by thus patch, which I missed in the latest update. |
What in particular needs fixing with SK_InsertSubvector ?