Currently any legal predicate types will be pattern-matched when
creating a PTEST instruction. This could be a problem in future since
PTEST always uses the .B specifier for the operand, but it is not
always guaranteed that the extra lanes of unpacked types (e.g. nxv4i1)
are zero. This patch ensures the operands of PTEST are type nxv16i1,
where the undef lanes are set to zero.
Details
- Reviewers
sdesmalen paulwalker-arm efriedma
Diff Detail
Event Timeline
Thanks @RosieSumpter for fixing this. Just left a new nits, but overall the changes look sensible to me.
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp | ||
---|---|---|
241 | nit: isZeroingInactiveLanes ? | |
16185 | nit: Can you propagate MVT::nxv16i1 to all uses of WidenVT, and remove WidenVT? | |
16186–16187 | nit: are ReinterpertOp and ReinterpretPg needed? I think you can just reassign Op and Pg? |
- Renamed hasZeroedOtherLanes -> isZeroingInactiveLanes
- Removed`WidenVT`
- Used existing Pg and Op instead of defining new ReinterpretPg and ReinterpretOp
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp | ||
---|---|---|
16192 | For the same reason as below I think this can be just if (isZeroingInactiveLanes(Op)) | |
16199–16202 | Isn't this equivalent to just Pg = getSVEPredicateBitCast(MVT::nxv16i1, Pg, DAG);? because getSVEPredicateBitCast already has the isZeroingInactiveLanes logic. Which means these two cases can be removed as the default case should just work? | |
llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td | ||
2134–2135 | Now that we'll only ever have a single pattern, can you do this as part of PTEST_PP's definition? |
llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td | ||
---|---|---|
782–783 | nit: We normally put the pattern in the class definition itself, which has most benefits if there are multiple instructions being instantiated from the same (multi)class, so that we only have to specify the pattern once. In this case, there's only one instantiation from sve_int_ptest, but for consistency and to keep this file tidy, I'd rather see the pattern living in that class. This means you'll need to make sve_int_ptest a multiclass and pass in AArch64ptest as a SDPatternOperator. You can then also use SVE_2_Op_Pat for the pattern itself. |
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp | ||
---|---|---|
16178 | Does adding an assert that Op->getValueType() == Pg->getValueTypes() cause any issues, because otherwise I think the AArch64CC::ANY_ACTIVE code in unsafe? | |
llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td | ||
782–783 | You can also do this within sve_int_ptest directly by just passing in AArch64ptest. See sve_int_wrffr for inspiration. |
- Added assert that PTEST operands have same type
- Moved PTEST pattern to class definition
llvm/lib/Target/AArch64/AArch64ISelLowering.cpp | ||
---|---|---|
16178 | Hey Paul, thanks for the comments. I've added the assert - doesn't seem to cause any problems. |
nit: isZeroingInactiveLanes ?