Extending inline assembly support, compatible with GCC as folowing:
"k" constraint hints the compiler to select any of AVX512 k0-k7 registers.
"Yk" constraint is a subset of "k" excluding k0 which is not allowd to be used as a mask.
This patch is complimented by the following review:
D25063
Let's check size 2 after size 1, it seems more logical, and is 1 is probably the common case.