The existing unpck instruction lowering was based on matching explicit shuffle patterns and missed many alternative shuffle masks (notably commuted masks and masks with duplicate inputs).
This patch adds lowerVectorUnpack(), which can be used to thoroughly match any unpckl/unpckh pattern.
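
To illustrate the kind of matching described above, here is a minimal standalone sketch. It is not the code in this patch: matchUnpackMask() and UnpackMatch are hypothetical names, the mask is a plain std::vector<int> rather than an LLVM type, and it assumes a single 128-bit lane where indices 0..N-1 select from the first input, N..2N-1 from the second, and negative values are undef. Instead of comparing against a fixed list of masks, it checks each element's position and records which input feeds the even and odd result lanes, so commuted and duplicate-input masks fall out naturally.

```cpp
#include <optional>
#include <vector>

// Hypothetical result type: which input feeds each unpack operand.
struct UnpackMatch {
  bool High; // true for an unpckh-style mask, false for unpckl
  int Op0;   // input (0 or 1) feeding even result lanes, -1 if all undef
  int Op1;   // input (0 or 1) feeding odd result lanes, -1 if all undef
};

// Hypothetical helper: try to match Mask as unpckl, then as unpckh.
// Assumes one 128-bit lane; wider vectors would need per-lane handling.
std::optional<UnpackMatch> matchUnpackMask(const std::vector<int> &Mask) {
  const int N = static_cast<int>(Mask.size());
  if (N < 2 || N % 2 != 0)
    return std::nullopt;

  for (bool High : {false, true}) {
    int Ops[2] = {-1, -1}; // input chosen for even / odd lanes so far
    bool Ok = true;
    for (int i = 0; i < N && Ok; ++i) {
      const int M = Mask[i];
      if (M < 0)
        continue; // undef element matches anything
      if (M >= 2 * N) {
        Ok = false; // index outside both inputs
        break;
      }
      const int Input = M / N; // 0 = first input, 1 = second input
      const int Elt = M % N;   // element index within that input
      const int Expected = i / 2 + (High ? N / 2 : 0);
      int &Op = Ops[i % 2];
      if (Elt != Expected || (Op >= 0 && Op != Input))
        Ok = false; // wrong element, or lanes disagree on the input
      else
        Op = Input;
    }
    if (Ok)
      return UnpackMatch{High, Ops[0], Ops[1]};
  }
  return std::nullopt;
}
```

With four elements, this sketch accepts the canonical mask {0, 4, 1, 5}, the commuted mask {4, 0, 5, 1} (operands swapped), and the duplicate-input mask {0, 0, 1, 1} (both operands are the first input), all as unpckl. Note that an all-undef mask trivially matches as unpckl with both operands left at -1; a real lowering would presumably handle that case before reaching this point.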