These cases uses the same codegen as AVX2 (pshuflw/pshufd) for the sub-128bit vector deinterleaving, and unpcklqdq for v2i64.
It's going to take a while to add full interleaved cost coverage, but since these are the same for SSE2 -> AVX2 it should be an easy win.
Fixes PR47437
Looking at llvm-project/llvm/test/CodeGen/X86/vector-interleaved-load-i16-stride-2.ll,
VF4 codegen is really different between SSE2 and AVX2.