This was crashing with the example from:
https://llvm.org/PR49716
...and I hopefully stubbed that out with a283d7258360, but judging by the difference between the SSE and AVX code, I think we need to try harder to match the pattern.
This matcher code was adapted from another pmadd pattern match in D49636, but it needs different ops to deal with size mismatches.
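
For reference, here is a rough scalar model of what a pmadd-style operation computes. It assumes the pattern in question is x86 PMADDWD (the text above only says "pmadd"), and the name pmaddwd_ref is made up for illustration; it is not part of LLVM or of this patch. Note that the result has half as many lanes as the inputs, each twice as wide, which is the kind of size relationship a matcher has to account for.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical scalar reference for PMADDWD (assumed operation, see above):
// multiply corresponding signed 16-bit lanes, then add each adjacent pair of
// products into one signed 32-bit lane.
template <std::size_t N>
std::array<std::int32_t, N / 2> pmaddwd_ref(const std::array<std::int16_t, N> &a,
                                            const std::array<std::int16_t, N> &b) {
  static_assert(N % 2 == 0, "need an even number of 16-bit lanes");
  std::array<std::int32_t, N / 2> out{};
  for (std::size_t i = 0; i < N / 2; ++i) {
    // Do the arithmetic in 64 bits so the sum itself cannot overflow.
    std::int64_t lo = std::int64_t(a[2 * i]) * b[2 * i];
    std::int64_t hi = std::int64_t(a[2 * i + 1]) * b[2 * i + 1];
    // Truncate to 32 bits; the only wrapping case is all four inputs
    // equal to -32768, which yields INT32_MIN.
    out[i] = static_cast<std::int32_t>(static_cast<std::uint32_t>(lo + hi));
  }
  return out;
}
```

With inputs {1, 2, 3, 4} and {5, 6, 7, 8}, the model produces two 32-bit lanes: 1*5 + 2*6 and 3*7 + 4*8.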