- No need to call areAllUsersVectorized, because the cost is calculated later only if the instruction has one use and gets vectorized.
- Calculate the cost of the dead extractelement more precisely, using the type of its vector operand rather than the resulting vector type.
Part of D57059.
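
To illustrate why the operand type matters for the dead extractelement cost, here is a minimal toy sketch. It is not the patch and does not use LLVM's cost model API: `VecTy`, `RegBits`, `extractEltCost` and `deadExtractSavings` are hypothetical names, and the 128-bit register width and the 1-vs-2 cost rule are assumptions chosen only to make the difference visible.

```cpp
#include <cassert>

// Hypothetical toy vector type; stands in for something like <8 x i32>.
struct VecTy {
  unsigned NumElts; // number of lanes
  unsigned EltBits; // width of each lane in bits
};

// Assumed register width of the toy target (not taken from the patch).
constexpr unsigned RegBits = 128;

// Toy cost of extracting one lane: 1 if the whole vector fits in one
// register, otherwise 2 (one extra step to reach the right register).
unsigned extractEltCost(VecTy SrcVec) {
  assert(SrcVec.NumElts > 0 && SrcVec.EltBits > 0);
  return SrcVec.NumElts * SrcVec.EltBits <= RegBits ? 1 : 2;
}

// Cost credited back when NumDead extractelement instructions become dead.
// The query is made with the type of the vector being extracted from.
unsigned deadExtractSavings(VecTy SrcVec, unsigned NumDead) {
  return NumDead * extractEltCost(SrcVec);
}
```

In this toy model, four dead extracts from an `<8 x i32>` operand that feed a `<4 x i32>` bundle are credited 8 when queried with the operand type, but only 4 if the resulting vector type were used instead.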
Why do we have both a v4i32 and v8i32 shl in here?