According to spec cvtdq2pd and cvtps2pd instructions don't require memory operand to be aligned to 16 bytes. This patch removes this requirement from the memory folding table.
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM
Event Timeline
test/CodeGen/X86/peephole-cvt-sse.ll | ||
---|---|---|
1 ↗ | (On Diff #69346) | Please can you test on a 32-bit target as well and use utils/update_llc_test_checks.py if you can. |
Comment Actions
Hi Simon,
Thanks for the review.
test/CodeGen/X86/peephole-cvt-sse.ll | ||
---|---|---|
2 ↗ | (On Diff #69561) | Both are done. |
test/CodeGen/X86/peephole-cvt-sse.ll | ||
---|---|---|
2 ↗ | (On Diff #69561) | Thanks, its also better to use -mattr=+sse4.2 (or similar) instead of a cpu target unless you are specifically testings for that cpu. |