The CS preinliner computes sample counts for outlined functions globally, resulting in a more accurate function profile than the actual thinlto postlink sample inliner does. I've seen that turning on --overwrite-existing-weights for postlink results in general 0.5% to 1% code size win (perf wise is netural) for CSSPGO accross four of our big internal services, but slight code size regression for non-CS AutoFDO. I'm turning on the switch for preinlined profile only.
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Unit Tests
Unit Tests