It is possible to merge reuse and reorder shuffles and reduce the total
cost of the vectorization tree/number of final instructions.
LG after addressing all comments.
May be just something like "CommonCost"? The "dead" part of cost is used at the end only.
This phrase "Before this patch..." looks misplaced here, inside comment. May be move it to summary?
Didn't find them as well...