This bug is introduced by r211144. When building a v4i8 vector, v4i8 will be widened to v4i16 after legalization. Then the element of operand is smaller than the element of result, but previous commit can only handle the contrary condition. This patch is going to handle this scenario and to generate optimized codes like ZIP1. Please review.