For Neon, the default nonconst stride cost is conservative,
and it is a local variable, which is not convenience to
to tune the loop vectorize.
So I try to use a option, which is similar to SVEGatherOverhead brought in D115143.
Details
Details
- Reviewers
david-arm dmgreen - Commits
- rGdf19d872277f: [LV] Add option to tune the cost model, NFC
Diff Detail
Diff Detail
Event Timeline
Comment Actions
Sounds OK to me. Feel free to add an option for MaxMergeDistance too if it is useful.