According to the spec, the cvtdq2pd and cvtps2pd instructions do not require their memory operand to be aligned to 16 bytes. This patch removes that requirement from the memory folding table.
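For context, a minimal C sketch of why no 16-byte alignment is needed: the non-VEX forms of both instructions read only 64 bits from memory (two i32 or two float values). The helper names `cvt_2xi32_to_2xf64` and `cvt_2xf32_to_2xf64` are hypothetical and not part of this patch; the scalar fallback is only there so the sketch builds on non-SSE2 targets. Whether a compiler actually folds the load into the convert instruction depends on the compiler and flags.

```c
#include <stdint.h>
#include <string.h>
#if defined(__SSE2__)
#include <emmintrin.h>
#endif

/* Hypothetical helper: convert two int32 values at a possibly unaligned
   address to two doubles (the operation cvtdq2pd performs). */
static void cvt_2xi32_to_2xf64(const void *src, double out[2]) {
#if defined(__SSE2__)
    /* 64-bit load; has no alignment requirement. */
    __m128i lo = _mm_loadl_epi64((const __m128i *)src);
    _mm_storeu_pd(out, _mm_cvtepi32_pd(lo));
#else
    /* Scalar fallback for targets without SSE2. */
    int32_t v[2];
    memcpy(v, src, sizeof v);
    out[0] = (double)v[0];
    out[1] = (double)v[1];
#endif
}

/* Hypothetical helper: convert two floats at a possibly unaligned
   address to two doubles (the operation cvtps2pd performs). */
static void cvt_2xf32_to_2xf64(const void *src, double out[2]) {
#if defined(__SSE2__)
    /* Same 64-bit load, reinterpreted as two packed floats. */
    __m128 lo = _mm_castsi128_ps(_mm_loadl_epi64((const __m128i *)src));
    _mm_storeu_pd(out, _mm_cvtps_pd(lo));
#else
    float v[2];
    memcpy(v, src, sizeof v);
    out[0] = (double)v[0];
    out[1] = (double)v[1];
#endif
}
```

Both helpers can be called with a pointer at any byte offset, for example `buf + 1` into a byte buffer.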
Details
Diff Detail
Event Timeline
test/CodeGen/X86/peephole-cvt-sse.ll | ||
---|---|---|
2 | Please can you test on a 32-bit target as well and use utils/update_llc_test_checks.py if you can. |
test/CodeGen/X86/peephole-cvt-sse.ll | ||
---|---|---|
3 | Thanks. It's also better to use -mattr=+sse4.2 (or similar) instead of a CPU target unless you are specifically testing for that CPU. |