Details
Diff Detail
- Repository
- rL LLVM
Event Timeline
Comment Actions
LGTM.
I have a patch that starts matching v_cvt_pk_u16_u32 in many of these cases, which conflicts with most of these test changes. In some cases, it's probably better to use SDWA, but that won't work if the instructions producing the inputs don't have SDWA forms (like fmas). Do you think it would make sense to merge these passes to handle bit-packing optimizations more broadly?
Comment Actions
Sorry, I had to revert this revision because ASan/MSan detected issues with the change:
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/3988
http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap/builds/1173