Don't you need to check for s_mul_lo as well here?
There is only 's_mul_i32' instead of s_mul_lo. `s_mul_i32' is already supported in the current code base.
There is no _lo, it's the same. More checks are better though
Update the test case following reviewers' comments.
Right. I just assume it is a good idea we have low multiplication check anyway.
The only thing I suggest is to use GFX9-DAG as they may be easily reordered.