These would currently fail to select.
Diff Detail
Event Timeline
- Add S[SD]Zrr->rm folding table entries
- Cleanup patterns as rm variants aren't needed
lib/Target/X86/X86InstrAVX512.td | ||
---|---|---|
3365 | All AVX-512 intrinsics, even FP scalar, come with masks and with rounding mode. |
lib/Target/X86/X86InstrAVX512.td | ||
---|---|---|
3365 | These patterns are just for the SSE1/SSE2 intrinsics (_mm_add_ss, int_x86_sse_add_ss, ...; see tests), which will fail to select when used on AVX512, because the X86InstrSSE.td SI defs are all predicated on "UseAVX" rather than "HaveAVX". I went with patterns because for _ss, IntrinsicsInfo would turn something like _mm_add_ss to a v4f32 fadd and later addps. We could add an IntrinsicType for _ss _2OP, or maybe even, an additional field in the intrinsic data tables saying that even though an intrinsic has vector type, it's just a scalar op? |
All problems of sse intrinsics should be resolved in X86InstrSSE.td. Please change UseAVX to HasAVX for these intrinsics. I don't think that we should add all SSE intrinsics to AVX512 now.
I don't think that we should add all SSE intrinsics to AVX512 now.
I kind of disagree.
When AVX512 is available we have additional registers we can use for those. I believe we should do the same thing as we do for thumb, i.e., choose the most permissive instruction encoding then shorten the encoding if possible (like the Thumb2SizeReduction pass).
For the long term we may want to find a better solution for such case (same instruction available with different encoding based on extension) to avoid the explosion in size of the description.
What do you think?
Cheers,
-Quentin
I agree, this seems like the way to go; see http://llvm.org/bugs/show_bug.cgi?id=23376.
For the long term we may want to find a better solution for such case (same instruction available with different encoding based on extension) to avoid the explosion in size of the description.
I also agree. In fact I think there are two orthogonal issues here:
- like I proposed earlier, we should use IntrinsicsInfo for scalar intrinsics. I filed https://llvm.org/bugs/show_bug.cgi?id=23449
- in lots of cases, the main difference between SSE/AVX/AVX512 instructions is the encoding. Due to the way the backend evolved, they're mostly apart, with lots of duplications (now it's even worse because AVX512 is in a completely different file). There's room for refactoring, I think, but that's a big change.
I can repurpose this review to implement the first one, if people agree? Also, if the second makes sense, I'll file a PR as well.
-Ahmed
What do you think?
Cheers,
-Quentin
Hi,
We encountered with another problem - some sse intrinsics don't work on AVX-512 mode at all, due to UseAVX prefix.
I'm going to fix this, just to let these intrinsics be mapped to AVX instructions.
But I agree, as a long term solution, all these intrinsics should be mapped to AVX-512 instructions. And it is not only scalar FP. SKX covers all AVX and AVX2 instructions with EVEX version.
I'm against writing all this manually, it will be the last option. I'll think how all this mapping may be auto-generated.
- Elena
All AVX-512 intrinsics, even FP scalar, come with masks and with rounding mode.
I don't understand where these intrinsics _sse_avx512_ come from.
Due to very complex form of these inrinsics, we add them to X86IntrinsicsInfo.h