This patch adds support for decoding XOP VPPERM instruction when it represents a basic shuffle.
The mask decoding required the existing MCInstrLowering code to be updated to support binary shuffles - the implementation now matches what is done in X86InstrComments.cpp. This should be useful for some AVX512 binary shuffles (VPERMT2 etc.) as well.
Split off from D18189
Add a comment here to explain. Something like: if there's really only one source operand, fix the mask so we can print all elements in one span.