If we know the source operands are already in range for PACKS/PACKUS then expand these to generic shuffle+truncate IR to allow us to further combine them.
DAG can lower these back to PACKSS/PACKUS without much trouble if they don't fold.
This was noticed while investigating PR39984.
It looks like we're actually better off truncating before shuffling - it creates more IR but gives us more scope for folding - I'll revert rL359191