This is an archive of the discontinued LLVM Phabricator instance.

[AArch64][GlobalISel] Optimize conditional branches followed by unconditional branches
ClosedPublic

Authored by aemerson on Jul 8 2019, 9:45 AM.

Details

Summary

If we have an icmp->brcond->br sequence where the brcond just branches to the next block jumping over the br, while the br takes the false edge, then we can modify the conditional branch to jump to the br's target while inverting the condition of the incoming icmp. This means we can eliminate the br as an unconditional branch to the fallthrough block.

Diff Detail

Event Timeline

aemerson created this revision.Jul 8 2019, 9:45 AM
arsenm added a comment.Jul 8 2019, 9:52 AM

Why is this done in the target selector instead of in a generic combine?

arsenm added a comment.Jul 8 2019, 9:55 AM

Why is this done in the target selector instead of in a generic combine?

I'm pretty sure SelectionDAGBuilder does this today

I haven't read the patch yet, but just from reading the description I remembered that the DAG does this while building up the DAG. Maybe this needs to be done in the IRTranslator to keep this behavior? (This helps while comparing codegen with sdag/gisel).

qcolombet added inline comments.Jul 8 2019, 9:59 AM
llvm/lib/Target/AArch64/AArch64InstructionSelector.cpp
1171 ↗(On Diff #208448)

Taking advantage of fall-through sounds like something beneficial for all targets.
Could we turn this into a combiner helper?

(Then you can call this specific helper in here.)

FWIW I would like some control over what boolean sources are acceptable to invert. SelectionDAG inserting a condition invert between a control flow intrinsic and the branch required work to undo

This seemed a little presumptive to do for all targets for me. I can try to move it earlier into a Combiner helper, since I don't want to put optimizations into the IRTranslator unless we can't do it in other places.

FWIW I would like some control over what boolean sources are acceptable to invert. SelectionDAG inserting a condition invert between a control flow intrinsic and the branch required work to undo

In this case I'm just inverting the condition code, we shouldn't need to insert extra instructions.

Combiner helper makes sense here IMO.

llvm/lib/Target/AArch64/AArch64InstructionSelector.cpp
1186 ↗(On Diff #208448)

Would BrIt->isTerminator() work here?

1197 ↗(On Diff #208448)

Is trunc the only thing we should look through here? Can we also handle copies, for example?

aemerson marked 2 inline comments as done.Jul 8 2019, 11:15 AM
aemerson added inline comments.
llvm/lib/Target/AArch64/AArch64InstructionSelector.cpp
1186 ↗(On Diff #208448)

Branch is a terminator by nature, so it doesn't check the same thing, although the usefulness of this assert is debatable.

1197 ↗(On Diff #208448)

For arm64 it seems to be the only thing in between compares and branches in practice due to our legalization rules. This code will be removed anyway since I'm going to put this before the legalizer so we shouldn't see any artifacts in between the ops,

aemerson updated this revision to Diff 208507.Jul 8 2019, 2:23 PM
aemerson edited the summary of this revision. (Show Details)

New patch to do the optimization in the CombinerHelper and add it to the prelegalizer combiner for AArch64.

This doesn't get all the code size improvements of the last patch, I'll make that a separate patch. On it's own, this change gives a 2.5% geomean improvement on CTMark -O0.

aemerson updated this revision to Diff 208510.Jul 8 2019, 2:33 PM

Add missing test.

Unless others have comments, this looks good to go in to me.

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
392

Orthogonal to this patch - I've found that MachineOperand::getPredicate returning an unsigned and users having to explicitly cast to Predicate is a little annoying. What do you think of getPredicate returning Predicate type (in a separate patch)? In my use cases I've almost never used the unsigned value without casting to Predicate.

aemerson accepted this revision.Jul 9 2019, 8:50 AM
aemerson marked an inline comment as done.

Marking as accepted.

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
392

That sounds reasonable to me.

This revision is now accepted and ready to land.Jul 9 2019, 8:50 AM
This revision was automatically updated to reflect the committed changes.