This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
Analysis/
-
TargetTransformInfo.h
-
TargetTransformInfoImpl.h
-
IR/
-
IntrinsicsBPF.td
-
Transforms/
-
InstCombine/
-
InstCombiner.h
-
Utils/
-
LoopUtils.h
-
lib/
-
Analysis/
-
TargetTransformInfo.cpp
-
Target/BPF/
-
BPF/
-
BPF.h
-
BPFAdjustOpt.cpp
-
BPFCheckAndAdjustIR.cpp
-
BPFTargetMachine.cpp
-
BPFTargetTransformInfo.h
-
CMakeLists.txt
-
Transforms/
-
InstCombine/
-
InstCombineAndOrXor.cpp
-
Scalar/
-
LICM.cpp
-
Utils/
-
SimplifyCFG.cpp
-
test/CodeGen/BPF/
-
CodeGen/
-
BPF/
1
adjust-opt-icmp1.ll
-
adjust-opt-icmp2.ll
-
adjust-opt-icmp3.ll
-
adjust-opt-icmp4.ll
-
adjust-opt-speculative1.ll
-
adjust-opt-speculative2.ll

Differential D147968

[TTI][BPF]: Undo specific transform-preventing passes and add one TTI hook
Needs ReviewPublic

Authored by yonghong-song on Apr 10 2023, 1:18 PM.

Download Raw Diff

Details

Reviewers

ast
mkazantsev
nikic
chandlerc
lebedev.ri
spatel
RKSimon
reames
SjoerdMeijer
vdmitrie
davidxl

Summary

LLVM optimization may generate certain codes which cannot be
handled by kernel verifier, e.g., some optimizations in
InstCombine and SimplifyCFG as bpf verifier is implemented
in kernel and it cannot perform complicated code analysis like
llvm compiler. Memory, verification speed and verifier complexity
are all part of considerations when adding new analysis
in verifier.

To avoid such harmful transformation, BPF backend has implemented
some passes, esp. through 'barrier' builtin function to prevent
certain InstCombine and SimplifyCFG transformations.
In these BPF backend passes pattern matching are used
to capture some specific patterns to prevent some
llvm transformations. But such pattern matching may not be precise
and may prevent some useful transformations. It would be great
if we can directly disable llvm transformations and this will
also avoid bpf specific transformation-preventing passes.
The following is 'git show --stat' to show that we can remove
lots of bpf hacking codes by adding one TTI hook.

llvm/include/llvm/Analysis/TargetTransformInfo.h        |   9 +++
llvm/include/llvm/Analysis/TargetTransformInfoImpl.h    |   2 +
llvm/include/llvm/IR/IntrinsicsBPF.td                   |   3 -
llvm/include/llvm/Transforms/InstCombine/InstCombiner.h |   1 +
llvm/include/llvm/Transforms/Utils/LoopUtils.h          |   8 +-
llvm/lib/Analysis/TargetTransformInfo.cpp               |   4 +
llvm/lib/Target/BPF/BPF.h                               |   7 --
llvm/lib/Target/BPF/BPFAdjustOpt.cpp                    | 393 --------------------------------------------------------------------------------------------
llvm/lib/Target/BPF/BPFCheckAndAdjustIR.cpp             |  45 +----------
llvm/lib/Target/BPF/BPFTargetMachine.cpp                |   5 --
llvm/lib/Target/BPF/BPFTargetTransformInfo.h            |   3 +
llvm/lib/Target/BPF/CMakeLists.txt                      |   1 -
llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp |   4 +
llvm/lib/Transforms/Scalar/LICM.cpp                     |  27 ++++---
llvm/lib/Transforms/Utils/SimplifyCFG.cpp               |   3 +
llvm/test/CodeGen/BPF/adjust-opt-icmp1.ll               |  14 +---
llvm/test/CodeGen/BPF/adjust-opt-icmp2.ll               |  10 +--
llvm/test/CodeGen/BPF/adjust-opt-icmp3.ll               |  12 +--
llvm/test/CodeGen/BPF/adjust-opt-icmp4.ll               |  12 +--
llvm/test/CodeGen/BPF/adjust-opt-speculative1.ll        |  17 +---
llvm/test/CodeGen/BPF/adjust-opt-speculative2.ll        |  22 +-----
21 files changed, 69 insertions(+), 533 deletions(-)

Below are detailed explanations for three transformations
which hurts bpf verification.

FoldAndOrOfICmpsUsingRanges

The following is an example to show how FoldAndOrOfICmpsUsingRanges
transformation may generate codes which hurts bpf verifier.
For bpf prog in linux/tools/testing/selftests/bpf/progs/map_kptr_fail.c:

...
id = ctx->protocol;
if (id < 4 || id > 12)
  return 0;
*(u64 *)((void *)v + id) = 0;
...

With FoldAndOrOfICmpsUsingRanges, the find pseudo code looks like:

...
id = ctx->protocol;
tmp = id;
tmp += -13;
if (tmp < 0xfffffff7) goto next;
v += id;
*v = 0;
next:

In the above code, the verifier considers 'id' in 'v += id' as a arbitrary
unsigned integer so later '*v = 0' is considered as possible out-of-bound
memory access. This is because the verifier, as a post analysis tool,
does not know the relationship of tmp/id at 'v += id' point. Although it
is possible to improve verifier to track tmp/id relationship, this would
increase bpf verifier complexity a lot. llvm FoldAndOrOfICmpsUsingRanges
does the transformation based on pattern matching and it certainly aware
tmp/id relationship.

The actual verification failure looks like below:

; id = ctx->protocol;
9: (61) r1 = *(u32 *)(r6 +16)         ; R1_w=scalar(umax=4294967295,var_off=(0x0; 0xffffffff)) R6_w=ctx(off=0,imm=0)
; if (id < 4 || id > 12)
10: (bc) w2 = w1                      ; R1_w=scalar(umax=4294967295,var_off=(0x0; 0xffffffff)) R2_w=scalar(umax=4294967295,var_off=(0x0; 0xffffffff))
11: (04) w2 += -13                    ; R2=scalar(umax=4294967295,var_off=(0x0; 0xffffffff))
12: (a6) if w2 < 0xfffffff7 goto pc+3         ; R2=scalar(umin=4294967287,umax=4294967295,var_off=(0xfffffff0; 0xf),s32_min=-9,s32_max=-1)
; *(u64 *)((void *)v + id) = 0;
13: (0f) r0 += r1                     ; R0_w=map_value(off=0,ks=4,vs=32,umax=4294967295,var_off=(0x0; 0xffffffff)) R1=scalar(umax=4294967295,var_off=(0x0; 0xffffffff))
14: (b7) r1 = 0                       ; R1_w=0
; *(u64 *)((void *)v + id) = 0;
15: (7b) *(u64 *)(r0 +0) = r1
R0 unbounded memory access, make sure to bounds check any such access

FoldTwoEntryPHINode

The following is an example to show FoldTwoEntryPHINode transformation
may generate codes which hurts bpf verifier.
For bpf prog in linux/tools/testing/selftests/bpf/progs/test_tc_dtime.c:

static void inc_errs(__u32 idx)
{
      if (test < 9)
              errs[test][idx]++;
      else
              errs[UKN_TEST][idx]++;
}
...
if (skb->tstamp == 0xb9fbeef)
  inc_errs(2);
...

With FoldTwoEntryPHINode, the final generated code looks like

...
r1 = test;
r2 = skb->tstamp;
if (r2 != 0xb9fbeef) goto next;
w2 = w1; // w1/w2 are lower 32 bit values of r1/r2.
if (w1 >= 9)
  w2 = 9;
tmp = r2 * 28;
r3 = errs + tmp;
... *r3 ...
...

In the above code, for the case where 'w1 >= 9' is false, verifier
concludes that 'r2' at 'tmp = r2 * 28' as an arbitrary scalar which
caused verificaiton failure for later dereference of r3.

The actual verification failure looks like below:

8: (18) r1 = 0xffffc900001ca230       ; R1_w=map_value(off=560,ks=4,vs=564,imm=0)
10: (61) r1 = *(u32 *)(r1 +0)         ; R1_w=scalar(umax=4294967295,var_off=(0x0; 0xffffffff))
; if (skb->tstamp == EGRESS_ENDHOST_MAGIC)
11: (79) r2 = *(u64 *)(r6 +152)       ; R2_w=scalar() R6=ctx(off=0,imm=0)
; if (skb->tstamp == EGRESS_ENDHOST_MAGIC)
12: (55) if r2 != 0xb9fbeef goto pc+10        ; R2_w=195018479
13: (bc) w2 = w1                      ; R1_w=scalar(umax=4294967295,var_off=(0x0; 0xffffffff)) R2_w=scalar(umax=4294967295,var_off=(0x0; 0xffffffff))
; if (test < __NR_TESTS)
14: (a6) if w1 < 0x9 goto pc+1 16: R0=2 R1_w=scalar(umax=8,var_off=(0x0; 0xf)) R2_w=scalar(umax=4294967295,var_off=(0x0; 0xffffffff)) R6=ctx(off=0,imm=0) R10=fp0
;
16: (27) r2 *= 28                     ; R2_w=scalar(umax=120259084260,var_off=(0x0; 0x1ffffffffc),s32_max=2147483644,u32_max=-4)
17: (18) r3 = 0xffffc900001ca118      ; R3_w=map_value(off=280,ks=4,vs=564,imm=0)
19: (0f) r3 += r2                     ; R2_w=scalar(umax=120259084260,var_off=(0x0; 0x1ffffffffc),s32_max=2147483644,u32_max=-4) R3_w=map_value(off=280,ks=4,vs=564,umax=120259084260,var_off=(0x0; 0x1ffffffffc),s32_max=2147483644,u32_max=-4)
20: (61) r2 = *(u32 *)(r3 +0)
R3 unbounded memory access, make sure to bounds check any such access

The unit test adjust-opt-speculative2.ll also shows how FoldTwoEntryPHINode
might hurt verifier.

The original code:

unsigned foo();
void *test(void *p) {
  unsigned ret = foo();
  if (ret <= 7)
    p += ret;
  return p;
}

Compiled with clang -target bpf -O2 -S t.c, with FoldTwoEntryPHINode enabled,
the following code is generated:

1:    r6 = r1
2:    call foo
3:    r1 = r0
4:    r1 <<= 32
5:    r1 >>= 32
6:    r2 = 8
7:    if r2 > r1 goto LBB0_2
8:    r0 = 0
   LBB0_2:
9:    r0 <<= 32
10:   r0 >>= 32
11:   r6 += r0
12:   r0 = r6
13:   exit

In the above example, insn 3 establishes r1 and r0 equivalence. Insns 4-7
establishes r1 < 8 if branch is taken. However, with branch taken, later
verifier is not able to ensure r0 < 8 and 'r6 += r0' may have a verificaiton
error.

With FoldTwoEntryPHINode disabled, the following code is generated:

1:    r6 = r1
2:    call foo
3:    r0 <<= 32
4:    r0 >>= 32
5:    if r0 > 7 goto LBB0_2
6:    r6 += r0
   LBB0_2:
7:    r0 = r6
8:    exit

The 'r6 += r0' is safe as the verifier can deduce 'r0 <= 7' based on the branch.

MinMaxHoisting

Furthermore, recently we hit another issue related LICM
MinMaxHoisting transformation (https://reviews.llvm.org/D147078)
which also hurts verifier.
For bpf prog in linux/tools/testing/selftests/bpf/progs/loop6.c:

The original code:

for (i = 0; (i < VIRTIO_MAX_SGS) && (i < out_sgs); i++) {
        for (n = 0, sgp = get_sgp(sgs, i); sgp && (n < SG_MAX);
             sgp = __sg_next(sgp)) {
                bpf_probe_read_kernel(&len, sizeof(len), &sgp->length);
                length1 += len;
                n++;
        }
}

After MinMaxHoisting,

upper = MIN(VIRTIO_MAX_SGS, out_sgs);
for (i = 0; i < upper; i++) {
        for (n = 0, sgp = get_sgp(sgs, i); sgp && (n < SG_MAX);
             sgp = __sg_next(sgp)) {
                bpf_probe_read_kernel(&len, sizeof(len), &sgp->length);
                length1 += len;
                n++;
        }
}

The verifier is not able to verify properly since it assumes the loop upper
is a arbitrary scalar (up to 32bit integer). The actual verification failure
looks like:

...
119: (15) if r1 == 0x0 goto pc+1
The sequence of 8193 jumps is too complex.

We have a draft to show bpf backend
implementation to undo the transformation (https://reviews.llvm.org/D147990).

Proposal

We feel adding proper TTI hooks is a better solution.
TTI provides a mechanism so backend can influence the
transformation. This also helps remove the associated bpf backend
pattern matching transformations.

This patch undos previous InstCombine/SimplifyCFG
transformation-preventing passes and adds one TTI hook
TTI->needsPreserveRangeInfoInVerification() such that
the above mentioned transformations can be disabled
by the target. The hook name needsPreserveRangeInfoInVerification()
implies that the transformation is disabled due to
later downstrean code verification.

Another possible solution is to legalize IR for verificaiton requirement.
This may require to add the verification requirement to IR, or
establish certain illegal code patterns, etc. This approach
requires more thought as downstream verification capability and
new code pattern verification failure is also a moving target.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

yonghong-song created this revision.Apr 10 2023, 1:18 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 10 2023, 1:18 PM

Herald added a subscriber: hiraditya. · View Herald Transcript

yonghong-song requested review of this revision.Apr 10 2023, 1:18 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 10 2023, 1:18 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B224633: Diff 512236.Apr 10 2023, 2:05 PM

The previous patch caused several bpf selftest failures. Remove two flags and changed a few tests so bpf selftests can pass.

Harbormaster completed remote builds in B225115: Diff 512893.Apr 12 2023, 11:44 AM

Using TTI hooks instead of flags

Harbormaster completed remote builds in B225479: Diff 513411.Apr 13 2023, 7:55 PM

do not add TTI to InstCombinerImpl, use the existing one in InstCombiner
add a code example to illustrate the problem in Summary

Harbormaster completed remote builds in B225739: Diff 513759.Apr 14 2023, 5:16 PM

add TTI hook for LICM/HoistMinMax transformations.
add detailed code analysis in commit message.

Herald added subscribers: asbirlea, kristof.beyls. · View Herald TranscriptApr 14 2023, 10:52 PM

Harbormaster completed remote builds in B225803: Diff 513844.Apr 14 2023, 11:55 PM

yonghong-song edited the summary of this revision. (Show Details)Apr 16 2023, 9:30 AM

yonghong-song edited the summary of this revision. (Show Details)

yonghong-song edited the summary of this revision. (Show Details)Apr 16 2023, 4:08 PM

yonghong-song added reviewers: mkazantsev, nikic, chandlerc, lebedev.ri, spatel.Apr 17 2023, 2:26 PM

yonghong-song added subscribers: jemarch, dfaust.

Herald added a subscriber: StephenFan. · View Herald TranscriptApr 17 2023, 2:26 PM

@mkazantsev @nikic @chandlerc @lebedev.ri @spatel Ping. Could you help take a look at this patch? Thanks!

yonghong-song added reviewers: RKSimon, reames, SjoerdMeijer, vdmitrie.Apr 20 2023, 1:00 PM

Ping again. @mkazantsev @nikic @chandlerc @lebedev.ri @spatel Could you help review the patch and share your opinion? Thanks!

Herald added a subscriber: hoy. · View Herald TranscriptMay 1 2023, 11:06 AM

In my opinion this new approach of using target hooks is way better than trying to amend the effect of certain passes by adding "counter passes", which is not only a moving target, but also fragile and IMO a waste of effort. We do have the same problem with the GCC BPF backend, and will be using a similar strategy to what Yonghong is proposing here.

RKSimon added inline comments.May 14 2023, 6:43 AM

llvm/test/CodeGen/BPF/adjust-opt-icmp1.ll
36–37	You've left a lot of orphan CHECK-DISABLE in various files where you've removed the bpf-disable-serialize-icmp RUN

This has already been discussed in D147078. All the middle-end maintainers who chimed in on that one were opposed to the proposal, and I don't think their responses will be different this time. I don't really want to repeat that discussion one more time here, because it felt a bit like talking at a wall. If you want to pursue this further, I would suggest starting an RFC on Discourse, because it seems pretty clear that we don't have a consensus between BPF and middle-end maintainers here.

PS: I would recommend shifting your thinking about these undo transforms from "precisely undo what this pass did" to "legalize IR for BPF". The draft patch at D147990 is needlessly complex because it tries to precisely undo what LICM does (down to loop invariance checks!) instead of treating it as a generic legalization problem.

This revision now requires changes to proceed.May 14 2023, 7:32 AM

Can we have a middle ground here. Instead of having too many new TTI interfaces here, a single one can be used instead: TTI:needsPreserveRangeInfoInVerification(). If this returns true, some passes (or subpasses in instcombine) can be turned off for the target. Similar things are done in vectorizer.

Transformations that are guarded by this check also need to be checked case by case. If the verifier can be enhanced without too much compile time overhead, it should probably be done there -- it adds more benefit of allowing more flexibility in source patterns.

Legalization can be a longer term thing to to think about. Like undoing transformations (excluding pure canonicalization ones), it does add unnecessary compile time overhead.

@davidxl Thanks for your suggestions. Adding a single TTI->needsPreserveRangeInfoInVerification() sounds a reasonable idea. Legalization of IR for BPF verification is a great idea but it requires more thought, e.g., how IR will be enhanced to encode verification requirement and how middle end optimization reacts to such requirement as typically it is not clear whether a particular optimization will hurt bpf verification or not unless running though bpf verifier or having a deep knowledge of bpf verifier. We can continue to think and discuss 'legalization of IR for BPF verification' idea, but it could be great we can have current TTI->needsPreserveRangeInfoInVerification() approach which gcc community intends to do the same.

Also as you mentioned, bpf verifier is a moving target as well and we are constantly improving verifier as well. Yes, once we fixed verifier, after some times, old kernels are considered not important, we might undo some hooks in llvm.

I will post another version of the patch soon.

use one hook TTI->needsPreserveRangeInfoInVerification() instead of three hooks
remove unused checks in the tests

Harbormaster completed remote builds in B232787: Diff 523276.May 18 2023, 12:41 AM

Considering the overall longer term maintenance cost to middle-end maintainers and the cost the BPF backend developers, this approach seems like the least intrusive method to me. Middle end maintainers (nikic@) need to chime in if there are more concerns on the approach to unblock the progress. The assumption is that more longer term solution will be explored further (e.g. legalization, or maintaining range info etc).

wenlei added a subscriber: wenlei.May 24 2023, 12:36 PM

@nikic Could you comment on @davidxl suggestion? I would be great if we can find a path forward for this particular issue. Thanks!

My two cents:

I think that in general honoring canonicalization is good, but it's not always a clear cut. Legality and profitability are sometimes relative, and in the case of BPF, I do think there's case to be made for mid-end to look at TTI more then what is traditionally allowed. The failure mode here is different from profitability, where you may just end up with inefficient code if you don't undo later; for BPF though, it can generate program that will be rejected by verifier (not run slower). Technically BPF backend can define what is legal for that target, i.e. one could argue that BPF target requires preserving predicates in certain form (needs to be well defined though), which would then restrict certain optimization from the mid-end.

Perhaps BPF is an uncharted territory in the sense that it's so restrictive that its requirements are often at odds with more canonicalizations, and that requiring backend to undo everything is bordering impractical. A special case grant for untraditional use of TTI may be reasonable here - we can make the reasons clear so it won't set bad precedence for others.

In D147968#4416082, @wenlei wrote:

I think that in general honoring canonicalization is good, but it's not always a clear cut. Legality and profitability are sometimes relative, and in the case of BPF, I do think there's case to be made for mid-end to look at TTI more then what is traditionally allowed. The failure mode here is different from profitability, where you may just end up with inefficient code if you don't undo later; for BPF though, it can generate program that will be rejected by verifier (not run slower). Technically BPF backend can define what is legal for that target, i.e. one could argue that BPF target requires preserving predicates in certain form (needs to be well defined though), which would then restrict certain optimization from the mid-end.

I am, generally speaking, not opposed to doing TTI-based legality checks in special circumstances. I have approved exceptions for doing so in the past. A recent one was using TTI in InstCombine to check whether a certain address space cast is legal for the target. Without TTI, we assume that all address space casts are illegal. This exception made sense to me, because addrspacecast semantics are fundamentally target-dependent, and the legality check is compatible with the LangRef semantics.

The case here is very different, because the legality conditions for BPF are not well-defined and not compatible with IR semantics. BPF considers transforms "illegal" that are clearly legal under our operational semantics, and, to the best of my knowledge, there is no principled way a maintainer could determine whether or not a given transform would be "legal" for BPF or not.

We have other targets with somewhat "unusual" legality requirements, where the target does not accept arbitrary inputs. These include things like wasm (which has verifier requirements) and gpu targets like amdgpu and nvptx (which also have verifier requirements, as well as convergence restrictions). These additional legality requirements are handled in one of two ways: Either the backend takes the responsibility of converting IR into a legal form (e.g. CFG structurization or removal of irreducible cycles) or the additional legality requirements are encoded into the IR, e.g. through the use special intrinsics, operand bundles and/or token values. This makes the legality constraint part of the normal IR semantics, and we can use our normal reasoning and tools to determine whether transforms are legal or not.

I maintain my position that the BPF target should be using either of those approaches (or a combination of them). Otherwise we will end up with checks for the BPF target littered over random places in the code base, with a new check being added every time a commit breaks the BPF verifier.

@nikic Thanks for your comment and pointer! I will study Webassembly/AMDGPU/NVPTX to see how they resolve their respective 'legality' issues and will report back once I got some understanding about their strategy and commonality/difference w.r.t. BPF.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

TargetTransformInfo.h

9 lines

TargetTransformInfoImpl.h

2 lines

IR/

IntrinsicsBPF.td

3 lines

Transforms/

InstCombine/

InstCombiner.h

1 line

Utils/

LoopUtils.h

8 lines

lib/

Analysis/

TargetTransformInfo.cpp

4 lines

Target/

BPF/

BPF.h

7 lines

BPFAdjustOpt.cpp

BPFCheckAndAdjustIR.cpp

45 lines

BPFTargetMachine.cpp

5 lines

BPFTargetTransformInfo.h

3 lines

CMakeLists.txt

1 line

Transforms/

InstCombine/

InstCombineAndOrXor.cpp

4 lines

Scalar/

LICM.cpp

27 lines

Utils/

SimplifyCFG.cpp

3 lines

test/

CodeGen/

BPF/

14 lines

10 lines

12 lines

12 lines

adjust-opt-speculative1.ll

17 lines

adjust-opt-speculative2.ll

22 lines

Diff 523276

llvm/include/llvm/Analysis/TargetTransformInfo.h

Show First 20 Lines • Show All 1,634 Lines • ▼ Show 20 Lines
///		///
/// For non-Arm targets, this function isn't used. It defaults to returning		/// For non-Arm targets, this function isn't used. It defaults to returning
/// false, but it shouldn't matter what it returns anyway.		/// false, but it shouldn't matter what it returns anyway.
bool hasArmWideBranch(bool Thumb) const;		bool hasArmWideBranch(bool Thumb) const;

/// \return The maximum number of function arguments the target supports.		/// \return The maximum number of function arguments the target supports.
unsigned getMaxNumArgs() const;		unsigned getMaxNumArgs() const;

		/// \returns True if the target wants to preserve range information to make
		/// later code verification easier.
		bool needsPreserveRangeInfoInVerification() const;

/// @}		/// @}

private:		private:
/// The abstract base class used to type erase specific TTI		/// The abstract base class used to type erase specific TTI
/// implementations.		/// implementations.
class Concept;		class Concept;

/// The template model for the base class which wraps a concrete		/// The template model for the base class which wraps a concrete
▲ Show 20 Lines • Show All 345 Lines • ▼ Show 20 Lines	public:
virtual bool enableScalableVectorization() const = 0;		virtual bool enableScalableVectorization() const = 0;
virtual bool supportsScalableVectors() const = 0;		virtual bool supportsScalableVectors() const = 0;
virtual bool hasActiveVectorLength(unsigned Opcode, Type *DataType,		virtual bool hasActiveVectorLength(unsigned Opcode, Type *DataType,
Align Alignment) const = 0;		Align Alignment) const = 0;
virtual VPLegalization		virtual VPLegalization
getVPLegalizationStrategy(const VPIntrinsic &PI) const = 0;		getVPLegalizationStrategy(const VPIntrinsic &PI) const = 0;
virtual bool hasArmWideBranch(bool Thumb) const = 0;		virtual bool hasArmWideBranch(bool Thumb) const = 0;
virtual unsigned getMaxNumArgs() const = 0;		virtual unsigned getMaxNumArgs() const = 0;
		virtual bool needsPreserveRangeInfoInVerification() const = 0;
};		};

template <typename T>		template <typename T>
class TargetTransformInfo::Model final : public TargetTransformInfo::Concept {		class TargetTransformInfo::Model final : public TargetTransformInfo::Concept {
T Impl;		T Impl;

public:		public:
Model(T Impl) : Impl(std::move(Impl)) {}		Model(T Impl) : Impl(std::move(Impl)) {}
▲ Show 20 Lines • Show All 679 Lines • ▼ Show 20 Lines	public:

bool hasArmWideBranch(bool Thumb) const override {		bool hasArmWideBranch(bool Thumb) const override {
return Impl.hasArmWideBranch(Thumb);		return Impl.hasArmWideBranch(Thumb);
}		}

unsigned getMaxNumArgs() const override {		unsigned getMaxNumArgs() const override {
return Impl.getMaxNumArgs();		return Impl.getMaxNumArgs();
}		}

		bool needsPreserveRangeInfoInVerification() const override {
		return Impl.needsPreserveRangeInfoInVerification();
		}
};		};

template <typename T>		template <typename T>
TargetTransformInfo::TargetTransformInfo(T Impl)		TargetTransformInfo::TargetTransformInfo(T Impl)
: TTIImpl(new Model<T>(Impl)) {}		: TTIImpl(new Model<T>(Impl)) {}

/// Analysis pass providing the \c TargetTransformInfo.		/// Analysis pass providing the \c TargetTransformInfo.
///		///
▲ Show 20 Lines • Show All 94 Lines • Show Last 20 Lines

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

Show First 20 Lines • Show All 874 Lines • ▼ Show 20 Lines	return TargetTransformInfo::VPLegalization(
/* EVLParamStrategy */ TargetTransformInfo::VPLegalization::Discard,		/* EVLParamStrategy */ TargetTransformInfo::VPLegalization::Discard,
/* OperatorStrategy */ TargetTransformInfo::VPLegalization::Convert);		/* OperatorStrategy */ TargetTransformInfo::VPLegalization::Convert);
}		}

bool hasArmWideBranch(bool) const { return false; }		bool hasArmWideBranch(bool) const { return false; }

unsigned getMaxNumArgs() const { return UINT_MAX; }		unsigned getMaxNumArgs() const { return UINT_MAX; }

		bool needsPreserveRangeInfoInVerification() const { return false; }

protected:		protected:
// Obtain the minimum required size to hold the value (without the sign)		// Obtain the minimum required size to hold the value (without the sign)
// In case of a vector it returns the min required size for one element.		// In case of a vector it returns the min required size for one element.
unsigned minRequiredElementSize(const Value *Val, bool &isSigned) const {		unsigned minRequiredElementSize(const Value *Val, bool &isSigned) const {
if (isa<ConstantDataVector>(Val) \|\| isa<ConstantVector>(Val)) {		if (isa<ConstantDataVector>(Val) \|\| isa<ConstantVector>(Val)) {
const auto *VectorValue = cast<Constant>(Val);		const auto *VectorValue = cast<Constant>(Val);

// In case of a vector need to pick the max between the min		// In case of a vector need to pick the max between the min
▲ Show 20 Lines • Show All 464 Lines • Show Last 20 Lines

llvm/include/llvm/IR/IntrinsicsBPF.td

Show All 28 Lines	let TargetPrefix = "bpf" in { // All intrinsics start with "llvm.bpf."
def int_bpf_preserve_type_info : ClangBuiltin<"__builtin_bpf_preserve_type_info">,		def int_bpf_preserve_type_info : ClangBuiltin<"__builtin_bpf_preserve_type_info">,
Intrinsic<[llvm_i32_ty], [llvm_i32_ty, llvm_i64_ty],		Intrinsic<[llvm_i32_ty], [llvm_i32_ty, llvm_i64_ty],
[IntrNoMem]>;		[IntrNoMem]>;
def int_bpf_preserve_enum_value : ClangBuiltin<"__builtin_bpf_preserve_enum_value">,		def int_bpf_preserve_enum_value : ClangBuiltin<"__builtin_bpf_preserve_enum_value">,
Intrinsic<[llvm_i64_ty], [llvm_i32_ty, llvm_ptr_ty, llvm_i64_ty],		Intrinsic<[llvm_i64_ty], [llvm_i32_ty, llvm_ptr_ty, llvm_i64_ty],
[IntrNoMem]>;		[IntrNoMem]>;
def int_bpf_passthrough : ClangBuiltin<"__builtin_bpf_passthrough">,		def int_bpf_passthrough : ClangBuiltin<"__builtin_bpf_passthrough">,
Intrinsic<[llvm_any_ty], [llvm_i32_ty, llvm_any_ty], [IntrNoMem]>;		Intrinsic<[llvm_any_ty], [llvm_i32_ty, llvm_any_ty], [IntrNoMem]>;
def int_bpf_compare : ClangBuiltin<"__builtin_bpf_compare">,
Intrinsic<[llvm_i1_ty], [llvm_i32_ty, llvm_anyint_ty, llvm_anyint_ty],
[IntrNoMem]>;
}		}

llvm/include/llvm/Transforms/InstCombine/InstCombiner.h

	Show All 37 Lines
	class TargetLibraryInfo;			class TargetLibraryInfo;
	class TargetTransformInfo;			class TargetTransformInfo;

	/// The core instruction combiner logic.			/// The core instruction combiner logic.
	///			///
	/// This class provides both the logic to recursively visit instructions and			/// This class provides both the logic to recursively visit instructions and
	/// combine them.			/// combine them.
	class LLVM_LIBRARY_VISIBILITY InstCombiner {			class LLVM_LIBRARY_VISIBILITY InstCombiner {
				protected:
	/// Only used to call target specific intrinsic combining.			/// Only used to call target specific intrinsic combining.
	/// It must NOT be used for any other purpose, as InstCombine is a			/// It must NOT be used for any other purpose, as InstCombine is a
	/// target-independent canonicalization transform.			/// target-independent canonicalization transform.
	TargetTransformInfo &TTI;			TargetTransformInfo &TTI;

	public:			public:
	/// Maximum size of array considered when transforming.			/// Maximum size of array considered when transforming.
	uint64_t MaxArraySizeForCombine = 0;			uint64_t MaxArraySizeForCombine = 0;
	▲ Show 20 Lines • Show All 487 Lines • Show Last 20 Lines

llvm/include/llvm/Transforms/Utils/LoopUtils.h

	Show First 20 Lines • Show All 159 Lines • ▼ Show 20 Lines
	/// before uses, allowing us to hoist a loop body in one pass without iteration.			/// before uses, allowing us to hoist a loop body in one pass without iteration.
	/// Takes DomTreeNode, AAResults, LoopInfo, DominatorTree,			/// Takes DomTreeNode, AAResults, LoopInfo, DominatorTree,
	/// TargetLibraryInfo, Loop, AliasSet information for all			/// TargetLibraryInfo, Loop, AliasSet information for all
	/// instructions of the loop and loop safety information as arguments.			/// instructions of the loop and loop safety information as arguments.
	/// Diagnostics is emitted via \p ORE. It returns changed status.			/// Diagnostics is emitted via \p ORE. It returns changed status.
	/// \p AllowSpeculation is whether values should be hoisted even if they are not			/// \p AllowSpeculation is whether values should be hoisted even if they are not
	/// guaranteed to execute in the loop, but are safe to speculatively execute.			/// guaranteed to execute in the loop, but are safe to speculatively execute.
	bool hoistRegion(DomTreeNode , AAResults , LoopInfo , DominatorTree ,			bool hoistRegion(DomTreeNode , AAResults , LoopInfo , DominatorTree ,
	AssumptionCache , TargetLibraryInfo , Loop *,			AssumptionCache , TargetLibraryInfo , TargetTransformInfo *,
	MemorySSAUpdater &, ScalarEvolution , ICFLoopSafetyInfo ,			Loop , MemorySSAUpdater &, ScalarEvolution ,
	SinkAndHoistLICMFlags &, OptimizationRemarkEmitter *, bool,			ICFLoopSafetyInfo *, SinkAndHoistLICMFlags &,
	bool AllowSpeculation);			OptimizationRemarkEmitter *, bool, bool AllowSpeculation);

	/// Return true if the induction variable \p IV in a Loop whose latch is			/// Return true if the induction variable \p IV in a Loop whose latch is
	/// \p LatchBlock would become dead if the exit test \p Cond were removed.			/// \p LatchBlock would become dead if the exit test \p Cond were removed.
	/// Conservatively returns false if analysis is insufficient.			/// Conservatively returns false if analysis is insufficient.
	bool isAlmostDeadIV(PHINode IV, BasicBlock LatchBlock, Value *Cond);			bool isAlmostDeadIV(PHINode IV, BasicBlock LatchBlock, Value *Cond);

	/// This function deletes dead loops. The caller of this function needs to			/// This function deletes dead loops. The caller of this function needs to
	/// guarantee that the loop is infact dead.			/// guarantee that the loop is infact dead.
	▲ Show 20 Lines • Show All 382 Lines • Show Last 20 Lines

llvm/lib/Analysis/TargetTransformInfo.cpp

	Show First 20 Lines • Show All 1,192 Lines • ▼ Show 20 Lines
	bool TargetTransformInfo::hasArmWideBranch(bool Thumb) const {			bool TargetTransformInfo::hasArmWideBranch(bool Thumb) const {
	return TTIImpl->hasArmWideBranch(Thumb);			return TTIImpl->hasArmWideBranch(Thumb);
	}			}

	unsigned TargetTransformInfo::getMaxNumArgs() const {			unsigned TargetTransformInfo::getMaxNumArgs() const {
	return TTIImpl->getMaxNumArgs();			return TTIImpl->getMaxNumArgs();
	}			}

				bool TargetTransformInfo::needsPreserveRangeInfoInVerification() const {
				return TTIImpl->needsPreserveRangeInfoInVerification();
				}

	bool TargetTransformInfo::shouldExpandReduction(const IntrinsicInst *II) const {			bool TargetTransformInfo::shouldExpandReduction(const IntrinsicInst *II) const {
	return TTIImpl->shouldExpandReduction(II);			return TTIImpl->shouldExpandReduction(II);
	}			}

	unsigned TargetTransformInfo::getGISelRematGlobalCost() const {			unsigned TargetTransformInfo::getGISelRematGlobalCost() const {
	return TTIImpl->getGISelRematGlobalCost();			return TTIImpl->getGISelRematGlobalCost();
	}			}

	▲ Show 20 Lines • Show All 66 Lines • Show Last 20 Lines

llvm/lib/Target/BPF/BPF.h

	Show All 12 Lines
	#include "llvm/IR/PassManager.h"			#include "llvm/IR/PassManager.h"
	#include "llvm/Pass.h"			#include "llvm/Pass.h"
	#include "llvm/Target/TargetMachine.h"			#include "llvm/Target/TargetMachine.h"

	namespace llvm {			namespace llvm {
	class BPFTargetMachine;			class BPFTargetMachine;
	class PassRegistry;			class PassRegistry;

	ModulePass *createBPFAdjustOpt();
	ModulePass *createBPFCheckAndAdjustIR();			ModulePass *createBPFCheckAndAdjustIR();

	FunctionPass createBPFAbstractMemberAccess(BPFTargetMachine TM);			FunctionPass createBPFAbstractMemberAccess(BPFTargetMachine TM);
	FunctionPass *createBPFPreserveDIType();			FunctionPass *createBPFPreserveDIType();
	FunctionPass *createBPFIRPeephole();			FunctionPass *createBPFIRPeephole();
	FunctionPass *createBPFISelDag(BPFTargetMachine &TM);			FunctionPass *createBPFISelDag(BPFTargetMachine &TM);
	FunctionPass *createBPFMISimplifyPatchablePass();			FunctionPass *createBPFMISimplifyPatchablePass();
	FunctionPass *createBPFMIPeepholePass();			FunctionPass *createBPFMIPeepholePass();
	FunctionPass *createBPFMIPeepholeTruncElimPass();			FunctionPass *createBPFMIPeepholeTruncElimPass();
	FunctionPass *createBPFMIPreEmitPeepholePass();			FunctionPass *createBPFMIPreEmitPeepholePass();
	FunctionPass *createBPFMIPreEmitCheckingPass();			FunctionPass *createBPFMIPreEmitCheckingPass();

	void initializeBPFAbstractMemberAccessLegacyPassPass(PassRegistry &);			void initializeBPFAbstractMemberAccessLegacyPassPass(PassRegistry &);
	void initializeBPFAdjustOptPass(PassRegistry&);
	void initializeBPFCheckAndAdjustIRPass(PassRegistry&);			void initializeBPFCheckAndAdjustIRPass(PassRegistry&);
	void initializeBPFDAGToDAGISelPass(PassRegistry &);			void initializeBPFDAGToDAGISelPass(PassRegistry &);
	void initializeBPFIRPeepholePass(PassRegistry &);			void initializeBPFIRPeepholePass(PassRegistry &);
	void initializeBPFMIPeepholePass(PassRegistry&);			void initializeBPFMIPeepholePass(PassRegistry&);
	void initializeBPFMIPeepholeTruncElimPass(PassRegistry &);			void initializeBPFMIPeepholeTruncElimPass(PassRegistry &);
	void initializeBPFMIPreEmitCheckingPass(PassRegistry&);			void initializeBPFMIPreEmitCheckingPass(PassRegistry&);
	void initializeBPFMIPreEmitPeepholePass(PassRegistry &);			void initializeBPFMIPreEmitPeepholePass(PassRegistry &);
	void initializeBPFMISimplifyPatchablePass(PassRegistry &);			void initializeBPFMISimplifyPatchablePass(PassRegistry &);
	Show All 18 Lines
	};			};

	class BPFIRPeepholePass : public PassInfoMixin<BPFIRPeepholePass> {			class BPFIRPeepholePass : public PassInfoMixin<BPFIRPeepholePass> {
	public:			public:
	PreservedAnalyses run(Function &F, FunctionAnalysisManager &AM);			PreservedAnalyses run(Function &F, FunctionAnalysisManager &AM);

	static bool isRequired() { return true; }			static bool isRequired() { return true; }
	};			};

	class BPFAdjustOptPass : public PassInfoMixin<BPFAdjustOptPass> {
	public:
	PreservedAnalyses run(Module &M, ModuleAnalysisManager &AM);
	};
	} // namespace llvm			} // namespace llvm

	#endif			#endif

llvm/lib/Target/BPF/BPFAdjustOpt.cpp

This file was deleted.

	//===---------------- BPFAdjustOpt.cpp - Adjust Optimization --------------===//
	//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//
	//===----------------------------------------------------------------------===//
	//
	// Adjust optimization to make the code more kernel verifier friendly.
	//
	//===----------------------------------------------------------------------===//

	#include "BPF.h"
	#include "BPFCORE.h"
	#include "BPFTargetMachine.h"
	#include "llvm/IR/Instruction.h"
	#include "llvm/IR/Instructions.h"
	#include "llvm/IR/IntrinsicsBPF.h"
	#include "llvm/IR/Module.h"
	#include "llvm/IR/PatternMatch.h"
	#include "llvm/IR/Type.h"
	#include "llvm/IR/User.h"
	#include "llvm/IR/Value.h"
	#include "llvm/Pass.h"
	#include "llvm/Transforms/Utils/BasicBlockUtils.h"

	#define DEBUG_TYPE "bpf-adjust-opt"

	using namespace llvm;
	using namespace llvm::PatternMatch;

	static cl::opt<bool>
	DisableBPFserializeICMP("bpf-disable-serialize-icmp", cl::Hidden,
	cl::desc("BPF: Disable Serializing ICMP insns."),
	cl::init(false));

	static cl::opt<bool> DisableBPFavoidSpeculation(
	"bpf-disable-avoid-speculation", cl::Hidden,
	cl::desc("BPF: Disable Avoiding Speculative Code Motion."),
	cl::init(false));

	namespace {

	class BPFAdjustOpt final : public ModulePass {
	public:
	static char ID;

	BPFAdjustOpt() : ModulePass(ID) {}
	bool runOnModule(Module &M) override;
	};

	class BPFAdjustOptImpl {
	struct PassThroughInfo {
	Instruction *Input;
	Instruction *UsedInst;
	uint32_t OpIdx;
	PassThroughInfo(Instruction I, Instruction U, uint32_t Idx)
	: Input(I), UsedInst(U), OpIdx(Idx) {}
	};

	public:
	BPFAdjustOptImpl(Module *M) : M(M) {}

	bool run();

	private:
	Module *M;
	SmallVector<PassThroughInfo, 16> PassThroughs;

	bool adjustICmpToBuiltin();
	void adjustBasicBlock(BasicBlock &BB);
	bool serializeICMPCrossBB(BasicBlock &BB);
	void adjustInst(Instruction &I);
	bool serializeICMPInBB(Instruction &I);
	bool avoidSpeculation(Instruction &I);
	bool insertPassThrough();
	};

	} // End anonymous namespace

	char BPFAdjustOpt::ID = 0;
	INITIALIZE_PASS(BPFAdjustOpt, "bpf-adjust-opt", "BPF Adjust Optimization",
	false, false)

	ModulePass *llvm::createBPFAdjustOpt() { return new BPFAdjustOpt(); }

	bool BPFAdjustOpt::runOnModule(Module &M) { return BPFAdjustOptImpl(&M).run(); }

	bool BPFAdjustOptImpl::run() {
	bool Changed = adjustICmpToBuiltin();

	for (Function &F : *M)
	for (auto &BB : F) {
	adjustBasicBlock(BB);
	for (auto &I : BB)
	adjustInst(I);
	}
	return insertPassThrough() \|\| Changed;
	}

	// Commit acabad9ff6bf ("[InstCombine] try to canonicalize icmp with
	// trunc op into mask and cmp") added a transformation to
	// convert "(conv)a < power_2_const" to "a & <const>" in certain
	// cases and bpf kernel verifier has to handle the resulted code
	// conservatively and this may reject otherwise legitimate program.
	// Here, we change related icmp code to a builtin which will
	// be restored to original icmp code later to prevent that
	// InstCombine transformatin.
	bool BPFAdjustOptImpl::adjustICmpToBuiltin() {
	bool Changed = false;
	ICmpInst *ToBeDeleted = nullptr;
	for (Function &F : *M)
	for (auto &BB : F)
	for (auto &I : BB) {
	if (ToBeDeleted) {
	ToBeDeleted->eraseFromParent();
	ToBeDeleted = nullptr;
	}

	auto *Icmp = dyn_cast<ICmpInst>(&I);
	if (!Icmp)
	continue;

	Value *Op0 = Icmp->getOperand(0);
	if (!isa<TruncInst>(Op0))
	continue;

	auto ConstOp1 = dyn_cast<ConstantInt>(Icmp->getOperand(1));
	if (!ConstOp1)
	continue;

	auto ConstOp1Val = ConstOp1->getValue().getZExtValue();
	auto Op = Icmp->getPredicate();
	if (Op == ICmpInst::ICMP_ULT \|\| Op == ICmpInst::ICMP_UGE) {
	if ((ConstOp1Val - 1) & ConstOp1Val)
	continue;
	} else if (Op == ICmpInst::ICMP_ULE \|\| Op == ICmpInst::ICMP_UGT) {
	if (ConstOp1Val & (ConstOp1Val + 1))
	continue;
	} else {
	continue;
	}

	Constant *Opcode =
	ConstantInt::get(Type::getInt32Ty(BB.getContext()), Op);
	Function *Fn = Intrinsic::getDeclaration(
	M, Intrinsic::bpf_compare, {Op0->getType(), ConstOp1->getType()});
	auto *NewInst = CallInst::Create(Fn, {Opcode, Op0, ConstOp1});
	NewInst->insertBefore(&I);
	Icmp->replaceAllUsesWith(NewInst);
	Changed = true;
	ToBeDeleted = Icmp;
	}

	return Changed;
	}

	bool BPFAdjustOptImpl::insertPassThrough() {
	for (auto &Info : PassThroughs) {
	auto *CI = BPFCoreSharedInfo::insertPassThrough(
	M, Info.UsedInst->getParent(), Info.Input, Info.UsedInst);
	Info.UsedInst->setOperand(Info.OpIdx, CI);
	}

	return !PassThroughs.empty();
	}

	// To avoid combining conditionals in the same basic block by
	// instrcombine optimization.
	bool BPFAdjustOptImpl::serializeICMPInBB(Instruction &I) {
	// For:
	// comp1 = icmp <opcode> ...;
	// comp2 = icmp <opcode> ...;
	// ... or comp1 comp2 ...
	// changed to:
	// comp1 = icmp <opcode> ...;
	// comp2 = icmp <opcode> ...;
	// new_comp1 = __builtin_bpf_passthrough(seq_num, comp1)
	// ... or new_comp1 comp2 ...
	Value Op0, Op1;
	// Use LogicalOr (accept `or i1` as well as `select i1 Op0, true, Op1`)
	if (!match(&I, m_LogicalOr(m_Value(Op0), m_Value(Op1))))
	return false;
	auto *Icmp1 = dyn_cast<ICmpInst>(Op0);
	if (!Icmp1)
	return false;
	auto *Icmp2 = dyn_cast<ICmpInst>(Op1);
	if (!Icmp2)
	return false;

	Value *Icmp1Op0 = Icmp1->getOperand(0);
	Value *Icmp2Op0 = Icmp2->getOperand(0);
	if (Icmp1Op0 != Icmp2Op0)
	return false;

	// Now we got two icmp instructions which feed into
	// an "or" instruction.
	PassThroughInfo Info(Icmp1, &I, 0);
	PassThroughs.push_back(Info);
	return true;
	}

	// To avoid combining conditionals in the same basic block by
	// instrcombine optimization.
	bool BPFAdjustOptImpl::serializeICMPCrossBB(BasicBlock &BB) {
	// For:
	// B1:
	// comp1 = icmp <opcode> ...;
	// if (comp1) goto B2 else B3;
	// B2:
	// comp2 = icmp <opcode> ...;
	// if (comp2) goto B4 else B5;
	// B4:
	// ...
	// changed to:
	// B1:
	// comp1 = icmp <opcode> ...;
	// comp1 = __builtin_bpf_passthrough(seq_num, comp1);
	// if (comp1) goto B2 else B3;
	// B2:
	// comp2 = icmp <opcode> ...;
	// if (comp2) goto B4 else B5;
	// B4:
	// ...

	// Check basic predecessors, if two of them (say B1, B2) are using
	// icmp instructions to generate conditions and one is the predesessor
	// of another (e.g., B1 is the predecessor of B2). Add a passthrough
	// barrier after icmp inst of block B1.
	BasicBlock *B2 = BB.getSinglePredecessor();
	if (!B2)
	return false;

	BasicBlock *B1 = B2->getSinglePredecessor();
	if (!B1)
	return false;

	Instruction *TI = B2->getTerminator();
	auto *BI = dyn_cast<BranchInst>(TI);
	if (!BI \|\| !BI->isConditional())
	return false;
	auto *Cond = dyn_cast<ICmpInst>(BI->getCondition());
	if (!Cond \|\| B2->getFirstNonPHI() != Cond)
	return false;
	Value *B2Op0 = Cond->getOperand(0);
	auto Cond2Op = Cond->getPredicate();

	TI = B1->getTerminator();
	BI = dyn_cast<BranchInst>(TI);
	if (!BI \|\| !BI->isConditional())
	return false;
	Cond = dyn_cast<ICmpInst>(BI->getCondition());
	if (!Cond)
	return false;
	Value *B1Op0 = Cond->getOperand(0);
	auto Cond1Op = Cond->getPredicate();

	if (B1Op0 != B2Op0)
	return false;

	if (Cond1Op == ICmpInst::ICMP_SGT \|\| Cond1Op == ICmpInst::ICMP_SGE) {
	if (Cond2Op != ICmpInst::ICMP_SLT && Cond2Op != ICmpInst::ICMP_SLE)
	return false;
	} else if (Cond1Op == ICmpInst::ICMP_SLT \|\| Cond1Op == ICmpInst::ICMP_SLE) {
	if (Cond2Op != ICmpInst::ICMP_SGT && Cond2Op != ICmpInst::ICMP_SGE)
	return false;
	} else if (Cond1Op == ICmpInst::ICMP_ULT \|\| Cond1Op == ICmpInst::ICMP_ULE) {
	if (Cond2Op != ICmpInst::ICMP_UGT && Cond2Op != ICmpInst::ICMP_UGE)
	return false;
	} else if (Cond1Op == ICmpInst::ICMP_UGT \|\| Cond1Op == ICmpInst::ICMP_UGE) {
	if (Cond2Op != ICmpInst::ICMP_ULT && Cond2Op != ICmpInst::ICMP_ULE)
	return false;
	} else {
	return false;
	}

	PassThroughInfo Info(Cond, BI, 0);
	PassThroughs.push_back(Info);

	return true;
	}

	// To avoid speculative hoisting certain computations out of
	// a basic block.
	bool BPFAdjustOptImpl::avoidSpeculation(Instruction &I) {
	if (auto *LdInst = dyn_cast<LoadInst>(&I)) {
	if (auto *GV = dyn_cast<GlobalVariable>(LdInst->getOperand(0))) {
	if (GV->hasAttribute(BPFCoreSharedInfo::AmaAttr) \|\|
	GV->hasAttribute(BPFCoreSharedInfo::TypeIdAttr))
	return false;
	}
	}

	if (!isa<LoadInst>(&I) && !isa<CallInst>(&I))
	return false;

	// For:
	// B1:
	// var = ...
	// ...
	// /* icmp may not be in the same block as var = ... */
	// comp1 = icmp <opcode> var, <const>;
	// if (comp1) goto B2 else B3;
	// B2:
	// ... var ...
	// change to:
	// B1:
	// var = ...
	// ...
	// /* icmp may not be in the same block as var = ... */
	// comp1 = icmp <opcode> var, <const>;
	// if (comp1) goto B2 else B3;
	// B2:
	// var = __builtin_bpf_passthrough(seq_num, var);
	// ... var ...
	bool isCandidate = false;
	SmallVector<PassThroughInfo, 4> Candidates;
	for (User *U : I.users()) {
	Instruction *Inst = dyn_cast<Instruction>(U);
	if (!Inst)
	continue;

	// May cover a little bit more than the
	// above pattern.
	if (auto *Icmp1 = dyn_cast<ICmpInst>(Inst)) {
	Value *Icmp1Op1 = Icmp1->getOperand(1);
	if (!isa<Constant>(Icmp1Op1))
	return false;
	isCandidate = true;
	continue;
	}

	// Ignore the use in the same basic block as the definition.
	if (Inst->getParent() == I.getParent())
	continue;

	// use in a different basic block, If there is a call or
	// load/store insn before this instruction in this basic
	// block. Most likely it cannot be hoisted out. Skip it.
	for (auto &I2 : *Inst->getParent()) {
	if (isa<CallInst>(&I2))
	return false;
	if (isa<LoadInst>(&I2) \|\| isa<StoreInst>(&I2))
	return false;
	if (&I2 == Inst)
	break;
	}

	// It should be used in a GEP or a simple arithmetic like
	// ZEXT/SEXT which is used for GEP.
	if (Inst->getOpcode() == Instruction::ZExt \|\|
	Inst->getOpcode() == Instruction::SExt) {
	PassThroughInfo Info(&I, Inst, 0);
	Candidates.push_back(Info);
	} else if (auto *GI = dyn_cast<GetElementPtrInst>(Inst)) {
	// traverse GEP inst to find Use operand index
	unsigned i, e;
	for (i = 1, e = GI->getNumOperands(); i != e; ++i) {
	Value *V = GI->getOperand(i);
	if (V == &I)
	break;
	}
	if (i == e)
	continue;

	PassThroughInfo Info(&I, GI, i);
	Candidates.push_back(Info);
	}
	}

	if (!isCandidate \|\| Candidates.empty())
	return false;

	llvm::append_range(PassThroughs, Candidates);
	return true;
	}

	void BPFAdjustOptImpl::adjustBasicBlock(BasicBlock &BB) {
	if (!DisableBPFserializeICMP && serializeICMPCrossBB(BB))
	return;
	}

	void BPFAdjustOptImpl::adjustInst(Instruction &I) {
	if (!DisableBPFserializeICMP && serializeICMPInBB(I))
	return;
	if (!DisableBPFavoidSpeculation && avoidSpeculation(I))
	return;
	}

	PreservedAnalyses BPFAdjustOptPass::run(Module &M, ModuleAnalysisManager &AM) {
	return BPFAdjustOptImpl(&M).run() ? PreservedAnalyses::none()
	: PreservedAnalyses::all();
	}

llvm/lib/Target/BPF/BPFCheckAndAdjustIR.cpp

Show All 40 Lines
public:		public:
static char ID;		static char ID;
BPFCheckAndAdjustIR() : ModulePass(ID) {}		BPFCheckAndAdjustIR() : ModulePass(ID) {}

private:		private:
void checkIR(Module &M);		void checkIR(Module &M);
bool adjustIR(Module &M);		bool adjustIR(Module &M);
bool removePassThroughBuiltin(Module &M);		bool removePassThroughBuiltin(Module &M);
bool removeCompareBuiltin(Module &M);
};		};
} // End anonymous namespace		} // End anonymous namespace

char BPFCheckAndAdjustIR::ID = 0;		char BPFCheckAndAdjustIR::ID = 0;
INITIALIZE_PASS(BPFCheckAndAdjustIR, DEBUG_TYPE, "BPF Check And Adjust IR",		INITIALIZE_PASS(BPFCheckAndAdjustIR, DEBUG_TYPE, "BPF Check And Adjust IR",
false, false)		false, false)

ModulePass *llvm::createBPFCheckAndAdjustIR() {		ModulePass *llvm::createBPFCheckAndAdjustIR() {
▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines	for (auto &BB : F)
Changed = true;		Changed = true;
Value *Arg = Call->getArgOperand(1);		Value *Arg = Call->getArgOperand(1);
Call->replaceAllUsesWith(Arg);		Call->replaceAllUsesWith(Arg);
ToBeDeleted = Call;		ToBeDeleted = Call;
}		}
return Changed;		return Changed;
}		}

bool BPFCheckAndAdjustIR::removeCompareBuiltin(Module &M) {
// Remove __builtin_bpf_compare()'s which are used to prevent
// certain IR optimizations. Now major IR optimizations are done,
// remove them.
bool Changed = false;
CallInst *ToBeDeleted = nullptr;
for (Function &F : M)
for (auto &BB : F)
for (auto &I : BB) {
if (ToBeDeleted) {
ToBeDeleted->eraseFromParent();
ToBeDeleted = nullptr;
}

auto *Call = dyn_cast<CallInst>(&I);
if (!Call)
continue;
auto *GV = dyn_cast<GlobalValue>(Call->getCalledOperand());
if (!GV)
continue;
if (!GV->getName().startswith("llvm.bpf.compare"))
continue;

Changed = true;
Value *Arg0 = Call->getArgOperand(0);
Value *Arg1 = Call->getArgOperand(1);
Value *Arg2 = Call->getArgOperand(2);

auto OpVal = cast<ConstantInt>(Arg0)->getValue().getZExtValue();
CmpInst::Predicate Opcode = (CmpInst::Predicate)OpVal;

auto *ICmp = new ICmpInst(Opcode, Arg1, Arg2);
ICmp->insertBefore(Call);

Call->replaceAllUsesWith(ICmp);
ToBeDeleted = Call;
}
return Changed;
}

bool BPFCheckAndAdjustIR::adjustIR(Module &M) {		bool BPFCheckAndAdjustIR::adjustIR(Module &M) {
bool Changed = removePassThroughBuiltin(M);		return removePassThroughBuiltin(M);
Changed = removeCompareBuiltin(M) \|\| Changed;
return Changed;
}		}

bool BPFCheckAndAdjustIR::runOnModule(Module &M) {		bool BPFCheckAndAdjustIR::runOnModule(Module &M) {
checkIR(M);		checkIR(M);
return adjustIR(M);		return adjustIR(M);
}		}

llvm/lib/Target/BPF/BPFTargetMachine.cpp

Show All 37 Lines	extern "C" LLVM_EXTERNAL_VISIBILITY void LLVMInitializeBPFTarget() {
RegisterTargetMachine<BPFTargetMachine> X(getTheBPFleTarget());		RegisterTargetMachine<BPFTargetMachine> X(getTheBPFleTarget());
RegisterTargetMachine<BPFTargetMachine> Y(getTheBPFbeTarget());		RegisterTargetMachine<BPFTargetMachine> Y(getTheBPFbeTarget());
RegisterTargetMachine<BPFTargetMachine> Z(getTheBPFTarget());		RegisterTargetMachine<BPFTargetMachine> Z(getTheBPFTarget());

PassRegistry &PR = *PassRegistry::getPassRegistry();		PassRegistry &PR = *PassRegistry::getPassRegistry();
initializeBPFAbstractMemberAccessLegacyPassPass(PR);		initializeBPFAbstractMemberAccessLegacyPassPass(PR);
initializeBPFPreserveDITypePass(PR);		initializeBPFPreserveDITypePass(PR);
initializeBPFIRPeepholePass(PR);		initializeBPFIRPeepholePass(PR);
initializeBPFAdjustOptPass(PR);
initializeBPFCheckAndAdjustIRPass(PR);		initializeBPFCheckAndAdjustIRPass(PR);
initializeBPFMIPeepholePass(PR);		initializeBPFMIPeepholePass(PR);
initializeBPFMIPeepholeTruncElimPass(PR);		initializeBPFMIPeepholeTruncElimPass(PR);
initializeBPFDAGToDAGISelPass(PR);		initializeBPFDAGToDAGISelPass(PR);
}		}

// DataLayout: little or big endian		// DataLayout: little or big endian
static std::string computeDataLayout(const Triple &TT) {		static std::string computeDataLayout(const Triple &TT) {
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	PB.registerPipelineStartEPCallback(
FPM.addPass(BPFPreserveDITypePass());		FPM.addPass(BPFPreserveDITypePass());
FPM.addPass(BPFIRPeepholePass());		FPM.addPass(BPFIRPeepholePass());
MPM.addPass(createModuleToFunctionPassAdaptor(std::move(FPM)));		MPM.addPass(createModuleToFunctionPassAdaptor(std::move(FPM)));
});		});
PB.registerPeepholeEPCallback([=](FunctionPassManager &FPM,		PB.registerPeepholeEPCallback([=](FunctionPassManager &FPM,
OptimizationLevel Level) {		OptimizationLevel Level) {
FPM.addPass(SimplifyCFGPass(SimplifyCFGOptions().hoistCommonInsts(true)));		FPM.addPass(SimplifyCFGPass(SimplifyCFGOptions().hoistCommonInsts(true)));
});		});
PB.registerPipelineEarlySimplificationEPCallback(
[=](ModulePassManager &MPM, OptimizationLevel) {
MPM.addPass(BPFAdjustOptPass());
});
}		}

void BPFPassConfig::addIRPasses() {		void BPFPassConfig::addIRPasses() {
addPass(createBPFCheckAndAdjustIR());		addPass(createBPFCheckAndAdjustIR());
TargetPassConfig::addIRPasses();		TargetPassConfig::addIRPasses();
}		}

TargetTransformInfo		TargetTransformInfo
Show All 33 Lines

llvm/lib/Target/BPF/BPFTargetTransformInfo.h

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	TTI::MemCmpExpansionOptions enableMemCmpExpansion(bool OptSize,
Options.MaxNumLoads = TLI->getMaxExpandSizeMemcmp(OptSize);		Options.MaxNumLoads = TLI->getMaxExpandSizeMemcmp(OptSize);
return Options;		return Options;
}		}

unsigned getMaxNumArgs() const {		unsigned getMaxNumArgs() const {
return 5;		return 5;
}		}

		bool needsPreserveRangeInfoInVerification() const {
		return true;
		}
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_LIB_TARGET_BPF_BPFTARGETTRANSFORMINFO_H		#endif // LLVM_LIB_TARGET_BPF_BPFTARGETTRANSFORMINFO_H

llvm/lib/Target/BPF/CMakeLists.txt

	Show All 10 Lines
	tablegen(LLVM BPFGenMCCodeEmitter.inc -gen-emitter)			tablegen(LLVM BPFGenMCCodeEmitter.inc -gen-emitter)
	tablegen(LLVM BPFGenRegisterInfo.inc -gen-register-info)			tablegen(LLVM BPFGenRegisterInfo.inc -gen-register-info)
	tablegen(LLVM BPFGenSubtargetInfo.inc -gen-subtarget)			tablegen(LLVM BPFGenSubtargetInfo.inc -gen-subtarget)

	add_public_tablegen_target(BPFCommonTableGen)			add_public_tablegen_target(BPFCommonTableGen)

	add_llvm_target(BPFCodeGen			add_llvm_target(BPFCodeGen
	BPFAbstractMemberAccess.cpp			BPFAbstractMemberAccess.cpp
	BPFAdjustOpt.cpp
	BPFAsmPrinter.cpp			BPFAsmPrinter.cpp
	BPFCheckAndAdjustIR.cpp			BPFCheckAndAdjustIR.cpp
	BPFFrameLowering.cpp			BPFFrameLowering.cpp
	BPFInstrInfo.cpp			BPFInstrInfo.cpp
	BPFIRPeephole.cpp			BPFIRPeephole.cpp
	BPFISelDAGToDAG.cpp			BPFISelDAGToDAG.cpp
	BPFISelLowering.cpp			BPFISelLowering.cpp
	BPFMCInstLower.cpp			BPFMCInstLower.cpp
	Show All 35 Lines

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp

	//===- InstCombineAndOrXor.cpp --------------------------------------------===//			//===- InstCombineAndOrXor.cpp --------------------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file implements the visitAnd, visitOr, and visitXor functions.			// This file implements the visitAnd, visitOr, and visitXor functions.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "InstCombineInternal.h"			#include "InstCombineInternal.h"
	#include "llvm/Analysis/CmpInstAnalysis.h"			#include "llvm/Analysis/CmpInstAnalysis.h"
	#include "llvm/Analysis/InstructionSimplify.h"			#include "llvm/Analysis/InstructionSimplify.h"
				#include "llvm/Analysis/TargetTransformInfo.h"
	#include "llvm/IR/ConstantRange.h"			#include "llvm/IR/ConstantRange.h"
	#include "llvm/IR/Intrinsics.h"			#include "llvm/IR/Intrinsics.h"
	#include "llvm/IR/PatternMatch.h"			#include "llvm/IR/PatternMatch.h"
	#include "llvm/Transforms/InstCombine/InstCombiner.h"			#include "llvm/Transforms/InstCombine/InstCombiner.h"
	#include "llvm/Transforms/Utils/Local.h"			#include "llvm/Transforms/Utils/Local.h"

	using namespace llvm;			using namespace llvm;
	using namespace PatternMatch;			using namespace PatternMatch;
	▲ Show 20 Lines • Show All 1,139 Lines • ▼ Show 20 Lines

	/// Fold (icmp Pred1 V1, C1) & (icmp Pred2 V2, C2)			/// Fold (icmp Pred1 V1, C1) & (icmp Pred2 V2, C2)
	/// or (icmp Pred1 V1, C1) \| (icmp Pred2 V2, C2)			/// or (icmp Pred1 V1, C1) \| (icmp Pred2 V2, C2)
	/// into a single comparison using range-based reasoning.			/// into a single comparison using range-based reasoning.
	/// NOTE: This is also used for logical and/or, must be poison-safe!			/// NOTE: This is also used for logical and/or, must be poison-safe!
	Value InstCombinerImpl::foldAndOrOfICmpsUsingRanges(ICmpInst ICmp1,			Value InstCombinerImpl::foldAndOrOfICmpsUsingRanges(ICmpInst ICmp1,
	ICmpInst *ICmp2,			ICmpInst *ICmp2,
	bool IsAnd) {			bool IsAnd) {
				if (TTI.needsPreserveRangeInfoInVerification())
				return nullptr;

	ICmpInst::Predicate Pred1, Pred2;			ICmpInst::Predicate Pred1, Pred2;
	Value V1, V2;			Value V1, V2;
	const APInt C1, C2;			const APInt C1, C2;
	if (!match(ICmp1, m_ICmp(Pred1, m_Value(V1), m_APInt(C1))) \|\|			if (!match(ICmp1, m_ICmp(Pred1, m_Value(V1), m_APInt(C1))) \|\|
	!match(ICmp2, m_ICmp(Pred2, m_Value(V2), m_APInt(C2))))			!match(ICmp2, m_ICmp(Pred2, m_Value(V2), m_APInt(C2))))
	return nullptr;			return nullptr;

	// Look through add of a constant offset on V1, V2, or both operands. This			// Look through add of a constant offset on V1, V2, or both operands. This
	▲ Show 20 Lines • Show All 3,245 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/LICM.cpp

Show First 20 Lines • Show All 166 Lines • ▼ Show 20 Lines	static bool isSafeToExecuteUnconditionally(
AssumptionCache *AC, bool AllowSpeculation);		AssumptionCache *AC, bool AllowSpeculation);
static bool pointerInvalidatedByLoop(MemorySSA MSSA, MemoryUse MU,		static bool pointerInvalidatedByLoop(MemorySSA MSSA, MemoryUse MU,
Loop *CurLoop, Instruction &I,		Loop *CurLoop, Instruction &I,
SinkAndHoistLICMFlags &Flags,		SinkAndHoistLICMFlags &Flags,
bool InvariantGroup);		bool InvariantGroup);
static bool pointerInvalidatedByBlock(BasicBlock &BB, MemorySSA &MSSA,		static bool pointerInvalidatedByBlock(BasicBlock &BB, MemorySSA &MSSA,
MemoryUse &MU);		MemoryUse &MU);
/// Aggregates various functions for hoisting computations out of loop.		/// Aggregates various functions for hoisting computations out of loop.
static bool hoistArithmetics(Instruction &I, Loop &L,		static bool hoistArithmetics(TargetTransformInfo *TTI, Instruction &I, Loop &L,
ICFLoopSafetyInfo &SafetyInfo,		ICFLoopSafetyInfo &SafetyInfo,
MemorySSAUpdater &MSSAU, AssumptionCache *AC,		MemorySSAUpdater &MSSAU, AssumptionCache *AC,
DominatorTree *DT);		DominatorTree *DT);
static Instruction *cloneInstructionInExitBlock(		static Instruction *cloneInstructionInExitBlock(
Instruction &I, BasicBlock &ExitBlock, PHINode &PN, const LoopInfo *LI,		Instruction &I, BasicBlock &ExitBlock, PHINode &PN, const LoopInfo *LI,
const LoopSafetyInfo *SafetyInfo, MemorySSAUpdater &MSSAU);		const LoopSafetyInfo *SafetyInfo, MemorySSAUpdater &MSSAU);

static void eraseInstruction(Instruction &I, ICFLoopSafetyInfo &SafetyInfo,		static void eraseInstruction(Instruction &I, ICFLoopSafetyInfo &SafetyInfo,
▲ Show 20 Lines • Show All 265 Lines • ▼ Show 20 Lines	if (L->hasDedicatedExits())
Changed \|=		Changed \|=
LoopNestMode		LoopNestMode
? sinkRegionForLoopNest(DT->getNode(L->getHeader()), AA, LI, DT,		? sinkRegionForLoopNest(DT->getNode(L->getHeader()), AA, LI, DT,
TLI, TTI, L, MSSAU, &SafetyInfo, Flags, ORE)		TLI, TTI, L, MSSAU, &SafetyInfo, Flags, ORE)
: sinkRegion(DT->getNode(L->getHeader()), AA, LI, DT, TLI, TTI, L,		: sinkRegion(DT->getNode(L->getHeader()), AA, LI, DT, TLI, TTI, L,
MSSAU, &SafetyInfo, Flags, ORE);		MSSAU, &SafetyInfo, Flags, ORE);
Flags.setIsSink(false);		Flags.setIsSink(false);
if (Preheader)		if (Preheader)
Changed \|= hoistRegion(DT->getNode(L->getHeader()), AA, LI, DT, AC, TLI, L,		Changed \|= hoistRegion(DT->getNode(L->getHeader()), AA, LI, DT, AC, TLI,
MSSAU, SE, &SafetyInfo, Flags, ORE, LoopNestMode,		TTI, L, MSSAU, SE, &SafetyInfo, Flags, ORE,
LicmAllowSpeculation);		LoopNestMode, LicmAllowSpeculation);

// Now that all loop invariants have been removed from the loop, promote any		// Now that all loop invariants have been removed from the loop, promote any
// memory references to scalars that we can.		// memory references to scalars that we can.
// Don't sink stores from loops without dedicated block exits. Exits		// Don't sink stores from loops without dedicated block exits. Exits
// containing indirect branches are not transformed by loop simplify,		// containing indirect branches are not transformed by loop simplify,
// make sure we catch that. An additional load may be generated in the		// make sure we catch that. An additional load may be generated in the
// preheader for SSA updater, so also avoid sinking when no preheader		// preheader for SSA updater, so also avoid sinking when no preheader
// is available.		// is available.
▲ Show 20 Lines • Show All 387 Lines • ▼ Show 20 Lines

/// Walk the specified region of the CFG (defined by all blocks dominated by		/// Walk the specified region of the CFG (defined by all blocks dominated by
/// the specified block, and that are in the current loop) in depth first		/// the specified block, and that are in the current loop) in depth first
/// order w.r.t the DominatorTree. This allows us to visit definitions before		/// order w.r.t the DominatorTree. This allows us to visit definitions before
/// uses, allowing us to hoist a loop body in one pass without iteration.		/// uses, allowing us to hoist a loop body in one pass without iteration.
///		///
bool llvm::hoistRegion(DomTreeNode N, AAResults AA, LoopInfo *LI,		bool llvm::hoistRegion(DomTreeNode N, AAResults AA, LoopInfo *LI,
DominatorTree DT, AssumptionCache AC,		DominatorTree DT, AssumptionCache AC,
TargetLibraryInfo TLI, Loop CurLoop,		TargetLibraryInfo TLI, TargetTransformInfo TTI,
MemorySSAUpdater &MSSAU, ScalarEvolution *SE,		Loop *CurLoop, MemorySSAUpdater &MSSAU,
ICFLoopSafetyInfo *SafetyInfo,		ScalarEvolution SE, ICFLoopSafetyInfo SafetyInfo,
SinkAndHoistLICMFlags &Flags,		SinkAndHoistLICMFlags &Flags,
OptimizationRemarkEmitter *ORE, bool LoopNestMode,		OptimizationRemarkEmitter *ORE, bool LoopNestMode,
bool AllowSpeculation) {		bool AllowSpeculation) {
// Verify inputs.		// Verify inputs.
assert(N != nullptr && AA != nullptr && LI != nullptr && DT != nullptr &&		assert(N != nullptr && AA != nullptr && LI != nullptr && DT != nullptr &&
CurLoop != nullptr && SafetyInfo != nullptr &&		CurLoop != nullptr && SafetyInfo != nullptr &&
"Unexpected input to hoistRegion.");		"Unexpected input to hoistRegion.");

▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	for (Instruction &I : llvm::make_early_inc_range(*BB)) {
assert(DT->dominates(PN, BB) && "Conditional PHIs not expected");		assert(DT->dominates(PN, BB) && "Conditional PHIs not expected");
Changed = true;		Changed = true;
continue;		continue;
}		}
}		}

// Try to reassociate instructions so that part of computations can be		// Try to reassociate instructions so that part of computations can be
// done out of loop.		// done out of loop.
if (hoistArithmetics(I, CurLoop, SafetyInfo, MSSAU, AC, DT)) {		if (hoistArithmetics(TTI, I, CurLoop, SafetyInfo, MSSAU, AC, DT)) {
Changed = true;		Changed = true;
continue;		continue;
}		}

// Remember possibly hoistable branches so we can actually hoist them		// Remember possibly hoistable branches so we can actually hoist them
// later if needed.		// later if needed.
if (BranchInst *BI = dyn_cast<BranchInst>(&I))		if (BranchInst *BI = dyn_cast<BranchInst>(&I))
CFH.registerPossiblyHoistableBranch(BI);		CFH.registerPossiblyHoistableBranch(BI);
▲ Show 20 Lines • Show All 1,413 Lines • ▼ Show 20 Lines	for (const auto &MA : *Accesses)
if (MU.getBlock() != MD->getBlock() \|\| !MSSA.locallyDominates(MD, &MU))		if (MU.getBlock() != MD->getBlock() \|\| !MSSA.locallyDominates(MD, &MU))
return true;		return true;
return false;		return false;
}		}

/// Try to simplify things like (A < INV_1 AND icmp A < INV_2) into (A <		/// Try to simplify things like (A < INV_1 AND icmp A < INV_2) into (A <
/// min(INV_1, INV_2)), if INV_1 and INV_2 are both loop invariants and their		/// min(INV_1, INV_2)), if INV_1 and INV_2 are both loop invariants and their
/// minimun can be computed outside of loop, and X is not a loop-invariant.		/// minimun can be computed outside of loop, and X is not a loop-invariant.
static bool hoistMinMax(Instruction &I, Loop &L, ICFLoopSafetyInfo &SafetyInfo,		static bool hoistMinMax(TargetTransformInfo *TTI, Instruction &I, Loop &L,
MemorySSAUpdater &MSSAU) {		ICFLoopSafetyInfo &SafetyInfo, MemorySSAUpdater &MSSAU) {
		if (TTI->needsPreserveRangeInfoInVerification())
		return false;

bool Inverse = false;		bool Inverse = false;
using namespace PatternMatch;		using namespace PatternMatch;
Value Cond1, Cond2;		Value Cond1, Cond2;
if (match(&I, m_LogicalOr(m_Value(Cond1), m_Value(Cond2)))) {		if (match(&I, m_LogicalOr(m_Value(Cond1), m_Value(Cond2)))) {
Inverse = true;		Inverse = true;
} else if (match(&I, m_LogicalAnd(m_Value(Cond1), m_Value(Cond2)))) {		} else if (match(&I, m_LogicalAnd(m_Value(Cond1), m_Value(Cond2)))) {
// Do nothing		// Do nothing
} else		} else
▲ Show 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	Value *NewGEP = Builder.CreateGEP(Src->getSourceElementType(), NewSrc,
SmallVector<Value *>(Src->indices()), "gep",		SmallVector<Value *>(Src->indices()), "gep",
IsInBounds);		IsInBounds);
GEP->replaceAllUsesWith(NewGEP);		GEP->replaceAllUsesWith(NewGEP);
eraseInstruction(*GEP, SafetyInfo, MSSAU);		eraseInstruction(*GEP, SafetyInfo, MSSAU);
eraseInstruction(*Src, SafetyInfo, MSSAU);		eraseInstruction(*Src, SafetyInfo, MSSAU);
return true;		return true;
}		}

static bool hoistArithmetics(Instruction &I, Loop &L,		static bool hoistArithmetics(TargetTransformInfo *TTI, Instruction &I, Loop &L,
ICFLoopSafetyInfo &SafetyInfo,		ICFLoopSafetyInfo &SafetyInfo,
MemorySSAUpdater &MSSAU,		MemorySSAUpdater &MSSAU,
AssumptionCache AC, DominatorTree DT) {		AssumptionCache AC, DominatorTree DT) {
// Optimize complex patterns, such as (x < INV1 && x < INV2), turning them		// Optimize complex patterns, such as (x < INV1 && x < INV2), turning them
// into (x < min(INV1, INV2)), and hoisting the invariant part of this		// into (x < min(INV1, INV2)), and hoisting the invariant part of this
// expression out of the loop.		// expression out of the loop.
if (hoistMinMax(I, L, SafetyInfo, MSSAU)) {		if (hoistMinMax(TTI, I, L, SafetyInfo, MSSAU)) {
++NumHoisted;		++NumHoisted;
++NumMinMaxHoisted;		++NumMinMaxHoisted;
return true;		return true;
}		}

// Try to hoist GEPs by reassociation.		// Try to hoist GEPs by reassociation.
if (hoistGEP(I, L, SafetyInfo, MSSAU, AC, DT)) {		if (hoistGEP(I, L, SafetyInfo, MSSAU, AC, DT)) {
++NumHoisted;		++NumHoisted;
Show All 14 Lines

llvm/lib/Transforms/Utils/SimplifyCFG.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,281 Lines • ▼ Show 20 Lines	static bool FoldCondBranchOnValueKnownInPredecessor(BranchInst *BI,
} while (Result == std::nullopt);		} while (Result == std::nullopt);
return EverChanged;		return EverChanged;
}		}

/// Given a BB that starts with the specified two-entry PHI node,		/// Given a BB that starts with the specified two-entry PHI node,
/// see if we can eliminate it.		/// see if we can eliminate it.
static bool FoldTwoEntryPHINode(PHINode *PN, const TargetTransformInfo &TTI,		static bool FoldTwoEntryPHINode(PHINode *PN, const TargetTransformInfo &TTI,
DomTreeUpdater *DTU, const DataLayout &DL) {		DomTreeUpdater *DTU, const DataLayout &DL) {
		if (TTI.needsPreserveRangeInfoInVerification())
		return false;

// Ok, this is a two entry PHI node. Check to see if this is a simple "if		// Ok, this is a two entry PHI node. Check to see if this is a simple "if
// statement", which has a very simple dominance structure. Basically, we		// statement", which has a very simple dominance structure. Basically, we
// are trying to find the condition that is being branched on, which		// are trying to find the condition that is being branched on, which
// subsequently causes this merge to happen. We really want control		// subsequently causes this merge to happen. We really want control
// dependence information for this check, but simplifycfg can't keep it up		// dependence information for this check, but simplifycfg can't keep it up
// to date, and this catches most of the cases we care about anyway.		// to date, and this catches most of the cases we care about anyway.
BasicBlock *BB = PN->getParent();		BasicBlock *BB = PN->getParent();

▲ Show 20 Lines • Show All 4,024 Lines • Show Last 20 Lines

llvm/test/CodeGen/BPF/adjust-opt-icmp1.ll

; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1		; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK %s		; RUN: llc %t1 -o - \| FileCheck %s
; RUN: opt -passes='default<O2>' -mtriple=bpf-pc-linux %s \| llvm-dis > %t1		; RUN: opt -passes='default<O2>' -mtriple=bpf-pc-linux %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK %s		; RUN: llc %t1 -o - \| FileCheck %s
; RUN: opt -O2 -mtriple=bpf-pc-linux -bpf-disable-serialize-icmp %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK-DISABLE %s
; RUN: opt -passes='default<O2>' -mtriple=bpf-pc-linux -bpf-disable-serialize-icmp %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK-DISABLE %s
;		;
; Source:		; Source:
; int foo();		; int foo();
; int bar(int);		; int bar(int);
; int test() {		; int test() {
; int ret = foo();		; int ret = foo();
; if (ret <= 0 \|\| ret > 7)		; if (ret <= 0 \|\| ret > 7)
; return 0;		; return 0;
Show All 15 Lines	entry:
%cmp = icmp sle i32 %0, 0		%cmp = icmp sle i32 %0, 0
br i1 %cmp, label %if.then, label %lor.lhs.false		br i1 %cmp, label %if.then, label %lor.lhs.false

; CHECK: [[REG1:r[0-9]+]] <<= 32		; CHECK: [[REG1:r[0-9]+]] <<= 32
; CHECK: [[REG1]] s>>= 32		; CHECK: [[REG1]] s>>= 32
; CHECK: [[REG2:r[0-9]+]] = 1		; CHECK: [[REG2:r[0-9]+]] = 1
; CHECK: if [[REG2]] s> [[REG1]] goto		; CHECK: if [[REG2]] s> [[REG1]] goto
; CHECK: if [[REG1]] s> 7 goto		; CHECK: if [[REG1]] s> 7 goto

; CHECK-DISABLE: [[REG1:r[0-9]+]] += -8
; CHECK-DISABLE: [[REG1]] <<= 32
; CHECK-DISABLE: [[REG1]] >>= 32
; CHECK-DISABLE: [[REG2:r[0-9]+]] = 4294967289
; CHECK-DISABLE: if [[REG2]] > [[REG1]] goto

lor.lhs.false: ; preds = %entry		lor.lhs.false: ; preds = %entry
		RKSimonUnsubmitted Not Done Reply Inline Actions You've left a lot of orphan CHECK-DISABLE in various files where you've removed the bpf-disable-serialize-icmp RUN RKSimon: You've left a lot of orphan CHECK-DISABLE in various files where you've removed the bpf-disable…
%1 = load i32, ptr %ret, align 4, !tbaa !2		%1 = load i32, ptr %ret, align 4, !tbaa !2
%cmp1 = icmp sgt i32 %1, 7		%cmp1 = icmp sgt i32 %1, 7
br i1 %cmp1, label %if.then, label %if.end		br i1 %cmp1, label %if.then, label %if.end

if.then: ; preds = %lor.lhs.false, %entry		if.then: ; preds = %lor.lhs.false, %entry
store i32 0, ptr %retval, align 4		store i32 0, ptr %retval, align 4
store i32 1, ptr %cleanup.dest.slot, align 4		store i32 1, ptr %cleanup.dest.slot, align 4
br label %cleanup		br label %cleanup
Show All 38 Lines

llvm/test/CodeGen/BPF/adjust-opt-icmp2.ll

; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1		; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK %s		; RUN: llc %t1 -o - \| FileCheck %s
; RUN: opt -O2 -mtriple=bpf-pc-linux -bpf-disable-serialize-icmp %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK-DISABLE %s
;		;
; Source:		; Source:
; int foo();		; int foo();
; int bar(int);		; int bar(int);
; int test() {		; int test() {
; int ret = foo();		; int ret = foo();
; if (ret <= 0)		; if (ret <= 0)
; return 0;		; return 0;
Show All 18 Lines	entry:
br i1 %cmp, label %if.then, label %if.end		br i1 %cmp, label %if.then, label %if.end

; CHECK: [[REG1:r[0-9]+]] <<= 32		; CHECK: [[REG1:r[0-9]+]] <<= 32
; CHECK: [[REG1]] s>>= 32		; CHECK: [[REG1]] s>>= 32
; CHECK: [[REG2:r[0-9]+]] = 1		; CHECK: [[REG2:r[0-9]+]] = 1
; CHECK: if [[REG2]] s> [[REG1]] goto		; CHECK: if [[REG2]] s> [[REG1]] goto
; CHECK: if [[REG1]] s> 7 goto		; CHECK: if [[REG1]] s> 7 goto

; CHECK-DISABLE: [[REG1:r[0-9]+]] += -8
; CHECK-DISABLE: [[REG1]] <<= 32
; CHECK-DISABLE: [[REG1]] >>= 32
; CHECK-DISABLE: [[REG2:r[0-9]+]] = 4294967289
; CHECK-DISABLE: if [[REG2]] > [[REG1]] goto

if.then: ; preds = %entry		if.then: ; preds = %entry
store i32 0, ptr %retval, align 4		store i32 0, ptr %retval, align 4
store i32 1, ptr %cleanup.dest.slot, align 4		store i32 1, ptr %cleanup.dest.slot, align 4
br label %cleanup		br label %cleanup

if.end: ; preds = %entry		if.end: ; preds = %entry
%1 = load i32, ptr %ret, align 4, !tbaa !2		%1 = load i32, ptr %ret, align 4, !tbaa !2
%cmp1 = icmp sgt i32 %1, 7		%cmp1 = icmp sgt i32 %1, 7
▲ Show 20 Lines • Show All 44 Lines • Show Last 20 Lines

llvm/test/CodeGen/BPF/adjust-opt-icmp3.ll

; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1		; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK,CHECK-V1 %s		; RUN: llc %t1 -o - \| FileCheck %s
; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1		; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1
; RUN: llc %t1 -mcpu=v3 -o - \| FileCheck -check-prefixes=CHECK,CHECK-V3 %s		; RUN: llc %t1 -mcpu=v3 -o - \| FileCheck %s
;		;
; Source:		; Source:
; int test1(unsigned long a) {		; int test1(unsigned long a) {
; if ((unsigned)a <= 3) return 2;		; if ((unsigned)a <= 3) return 2;
; return 3;		; return 3;
; }		; }
; int test2(unsigned long a) {		; int test2(unsigned long a) {
; if ((unsigned)a < 4) return 2;		; if ((unsigned)a < 4) return 2;
Show All 22 Lines	if.end: ; preds = %entry
br label %return		br label %return

return: ; preds = %if.end, %if.then		return: ; preds = %if.end, %if.then
%1 = load i32, ptr %retval, align 4		%1 = load i32, ptr %retval, align 4
ret i32 %1		ret i32 %1
}		}

; CHECK-LABEL: test1		; CHECK-LABEL: test1
; CHECK-V1: if r[[#]] > r[[#]] goto		; CHECK: r[[#]] &= r[[#]]
; CHECK-V3: if w[[#]] < 4 goto		; CHECK: if r[[#]] == 0 goto

; Function Attrs: nounwind		; Function Attrs: nounwind
define dso_local i32 @test2(i64 %a) #0 {		define dso_local i32 @test2(i64 %a) #0 {
entry:		entry:
%retval = alloca i32, align 4		%retval = alloca i32, align 4
%a.addr = alloca i64, align 8		%a.addr = alloca i64, align 8
store i64 %a, ptr %a.addr, align 8, !tbaa !3		store i64 %a, ptr %a.addr, align 8, !tbaa !3
%0 = load i64, ptr %a.addr, align 8, !tbaa !3		%0 = load i64, ptr %a.addr, align 8, !tbaa !3
Show All 10 Lines	if.end: ; preds = %entry
br label %return		br label %return

return: ; preds = %if.end, %if.then		return: ; preds = %if.end, %if.then
%1 = load i32, ptr %retval, align 4		%1 = load i32, ptr %retval, align 4
ret i32 %1		ret i32 %1
}		}

; CHECK-LABEL: test2		; CHECK-LABEL: test2
; CHECK-V1: if r[[#]] > r[[#]] goto		; CHECK: r[[#]] &= r[[#]]
; CHECK-V3: if w[[#]] < 4 goto		; CHECK: if r[[#]] == 0 goto

attributes #0 = { nounwind "frame-pointer"="all" "min-legal-vector-width"="0" "no-trapping-math"="true" "stack-protector-buffer-size"="8" }		attributes #0 = { nounwind "frame-pointer"="all" "min-legal-vector-width"="0" "no-trapping-math"="true" "stack-protector-buffer-size"="8" }

!llvm.module.flags = !{!0, !1}		!llvm.module.flags = !{!0, !1}
!llvm.ident = !{!2}		!llvm.ident = !{!2}

!0 = !{i32 1, !"wchar_size", i32 4}		!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{i32 7, !"frame-pointer", i32 2}		!1 = !{i32 7, !"frame-pointer", i32 2}
!2 = !{!"clang version 14.0.0 (https://github.com/llvm/llvm-project.git b7892f95881c891032742e0cd81861b845512653)"}		!2 = !{!"clang version 14.0.0 (https://github.com/llvm/llvm-project.git b7892f95881c891032742e0cd81861b845512653)"}
!3 = !{!4, !4, i64 0}		!3 = !{!4, !4, i64 0}
!4 = !{!"long", !5, i64 0}		!4 = !{!"long", !5, i64 0}
!5 = !{!"omnipotent char", !6, i64 0}		!5 = !{!"omnipotent char", !6, i64 0}
!6 = !{!"Simple C/C++ TBAA"}		!6 = !{!"Simple C/C++ TBAA"}

llvm/test/CodeGen/BPF/adjust-opt-icmp4.ll

; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1		; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK,CHECK-V1 %s		; RUN: llc %t1 -o - \| FileCheck %s
; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1		; RUN: opt -O2 -S -mtriple=bpf-pc-linux %s -o %t1
; RUN: llc %t1 -mcpu=v3 -o - \| FileCheck -check-prefixes=CHECK,CHECK-V3 %s		; RUN: llc %t1 -mcpu=v3 -o - \| FileCheck %s
;		;
; Source:		; Source:
; int test1(unsigned long a) {		; int test1(unsigned long a) {
; if ((unsigned)a > 3) return 2;		; if ((unsigned)a > 3) return 2;
; return 3;		; return 3;
; }		; }
; int test2(unsigned long a) {		; int test2(unsigned long a) {
; if ((unsigned)a >= 4) return 2;		; if ((unsigned)a >= 4) return 2;
Show All 22 Lines	if.end: ; preds = %entry
br label %return		br label %return

return: ; preds = %if.end, %if.then		return: ; preds = %if.end, %if.then
%1 = load i32, ptr %retval, align 4		%1 = load i32, ptr %retval, align 4
ret i32 %1		ret i32 %1
}		}

; CHECK-LABEL: test1		; CHECK-LABEL: test1
; CHECK-V1: if r[[#]] > 3 goto		; CHECK: r[[#]] &= r[[#]]
; CHECK-V3: if w[[#]] > 3 goto		; CHECK: if r[[#]] == 0 goto

; Function Attrs: nounwind		; Function Attrs: nounwind
define dso_local i32 @test2(i64 %a) #0 {		define dso_local i32 @test2(i64 %a) #0 {
entry:		entry:
%retval = alloca i32, align 4		%retval = alloca i32, align 4
%a.addr = alloca i64, align 8		%a.addr = alloca i64, align 8
store i64 %a, ptr %a.addr, align 8, !tbaa !3		store i64 %a, ptr %a.addr, align 8, !tbaa !3
%0 = load i64, ptr %a.addr, align 8, !tbaa !3		%0 = load i64, ptr %a.addr, align 8, !tbaa !3
Show All 10 Lines	if.end: ; preds = %entry
br label %return		br label %return

return: ; preds = %if.end, %if.then		return: ; preds = %if.end, %if.then
%1 = load i32, ptr %retval, align 4		%1 = load i32, ptr %retval, align 4
ret i32 %1		ret i32 %1
}		}

; CHECK-LABEL: test2		; CHECK-LABEL: test2
; CHECK-V1: if r[[#]] > 3 goto		; CHECK: r[[#]] &= r[[#]]
; CHECK-V3: if w[[#]] > 3 goto		; CHECK: if r[[#]] == 0 goto

attributes #0 = { nounwind "frame-pointer"="all" "min-legal-vector-width"="0" "no-trapping-math"="true" "stack-protector-buffer-size"="8" }		attributes #0 = { nounwind "frame-pointer"="all" "min-legal-vector-width"="0" "no-trapping-math"="true" "stack-protector-buffer-size"="8" }

!llvm.module.flags = !{!0, !1}		!llvm.module.flags = !{!0, !1}
!llvm.ident = !{!2}		!llvm.ident = !{!2}

!0 = !{i32 1, !"wchar_size", i32 4}		!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{i32 7, !"frame-pointer", i32 2}		!1 = !{i32 7, !"frame-pointer", i32 2}
!2 = !{!"clang version 14.0.0 (https://github.com/llvm/llvm-project.git 930ccf0191b4a33332d924522e5676fff583f083)"}		!2 = !{!"clang version 14.0.0 (https://github.com/llvm/llvm-project.git 930ccf0191b4a33332d924522e5676fff583f083)"}
!3 = !{!4, !4, i64 0}		!3 = !{!4, !4, i64 0}
!4 = !{!"long", !5, i64 0}		!4 = !{!"long", !5, i64 0}
!5 = !{!"omnipotent char", !6, i64 0}		!5 = !{!"omnipotent char", !6, i64 0}
!6 = !{!"Simple C/C++ TBAA"}		!6 = !{!"Simple C/C++ TBAA"}

llvm/test/CodeGen/BPF/adjust-opt-speculative1.ll

; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1		; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK-COMMON,CHECK %s		; RUN: llc %t1 -o - \| FileCheck %s
; RUN: opt -O2 -mtriple=bpf-pc-linux -bpf-disable-avoid-speculation %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK-COMMON,CHECK-DISABLE %s
;		;
; Source:		; Source:
; unsigned long foo();		; unsigned long foo();
; ptr test(ptr p) {		; ptr test(ptr p) {
; unsigned long ret = foo();		; unsigned long ret = foo();
; if (ret <= 7)		; if (ret <= 7)
; p += ret;		; p += ret;
; return p;		; return p;
Show All 21 Lines	if.then: ; preds = %entry
store ptr %add.ptr, ptr %p.addr, align 8, !tbaa !2		store ptr %add.ptr, ptr %p.addr, align 8, !tbaa !2
br label %if.end		br label %if.end

if.end: ; preds = %if.then, %entry		if.end: ; preds = %if.then, %entry
%3 = load ptr, ptr %p.addr, align 8, !tbaa !2		%3 = load ptr, ptr %p.addr, align 8, !tbaa !2
call void @llvm.lifetime.end.p0(i64 8, ptr %ret) #3		call void @llvm.lifetime.end.p0(i64 8, ptr %ret) #3
ret ptr %3		ret ptr %3
}		}
; CHECK-COMMON: [[REG6:r[0-9]+]] = r1		; CHECK: [[REG6:r[0-9]+]] = r1
; CHECK-COMMON: call foo		; CHECK: call foo

; CHECK: if r0 > 7 goto [[LABEL:.*]]		; CHECK: if r0 > 7 goto [[LABEL:.*]]
; CHECK: [[REG6]] += r0		; CHECK: [[REG6]] += r0
; CHECK: [[LABEL]]:		; CHECK: [[LABEL]]:
; CHECK: r0 = [[REG6]]		; CHECK: r0 = [[REG6]]

; CHECK-DISABLE: [[REG1:r[0-9]+]] = 8		; CHECK: exit
; CHECK-DISABLE: if [[REG1]] > r0 goto [[LABEL:.*]]
; CHECK-DISABLE: r0 = 0
; CHECK-DISABLE: [[LABEL]]:
; CHECK-DISABLE: [[REG6]] += r0
; CHECK-DISABLE: r0 = [[REG6]]

; CHECK-COMMON: exit

; Function Attrs: argmemonly nounwind willreturn		; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.start.p0(i64 immarg, ptr nocapture) #1		declare void @llvm.lifetime.start.p0(i64 immarg, ptr nocapture) #1

declare dso_local i64 @foo(...) #2		declare dso_local i64 @foo(...) #2

; Function Attrs: argmemonly nounwind willreturn		; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.end.p0(i64 immarg, ptr nocapture) #1		declare void @llvm.lifetime.end.p0(i64 immarg, ptr nocapture) #1
Show All 17 Lines

llvm/test/CodeGen/BPF/adjust-opt-speculative2.ll

; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1		; RUN: opt -O2 -mtriple=bpf-pc-linux %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK-COMMON,CHECK %s		; RUN: llc %t1 -o - \| FileCheck %s
; RUN: opt -O2 -mtriple=bpf-pc-linux -bpf-disable-avoid-speculation %s \| llvm-dis > %t1
; RUN: llc %t1 -o - \| FileCheck -check-prefixes=CHECK-COMMON,CHECK-DISABLE %s
;		;
; Source:		; Source:
; unsigned foo();		; unsigned foo();
; ptr test(ptr p) {		; ptr test(ptr p) {
; unsigned ret = foo();		; unsigned ret = foo();
; if (ret <= 7)		; if (ret <= 7)
; p += ret;		; p += ret;
; return p;		; return p;
Show All 23 Lines	if.then: ; preds = %entry
br label %if.end		br label %if.end

if.end: ; preds = %if.then, %entry		if.end: ; preds = %if.then, %entry
%3 = load ptr, ptr %p.addr, align 8, !tbaa !2		%3 = load ptr, ptr %p.addr, align 8, !tbaa !2
call void @llvm.lifetime.end.p0(i64 4, ptr %ret) #3		call void @llvm.lifetime.end.p0(i64 4, ptr %ret) #3
ret ptr %3		ret ptr %3
}		}

; CHECK-COMMON: [[REG6:r[0-9]+]] = r1		; CHECK: [[REG6:r[0-9]+]] = r1
; CHECK-COMMON: call foo		; CHECK: call foo

; CHECK: r0 <<= 32		; CHECK: r0 <<= 32
; CHECK: r0 >>= 32		; CHECK: r0 >>= 32
; CHECK: if r0 > 7 goto [[LABEL:.*]]		; CHECK: if r0 > 7 goto [[LABEL:.*]]
; CHECK: [[REG6]] += r0		; CHECK: [[REG6]] += r0
; CHECK: [[LABEL]]:		; CHECK: [[LABEL]]:
; CHECK: r0 = [[REG6]]		; CHECK: r0 = [[REG6]]

; CHECK-DISABLE: [[REG1:r[0-9]+]] = r0		; CHECK: exit
; CHECK-DISABLE: [[REG1]] <<= 32
; CHECK-DISABLE: [[REG1]] >>= 32
; CHECK-DISABLE: [[REG2:r[0-9]+]] = 8
; CHECK-DISABLE: if [[REG2]] > [[REG1]] goto [[LABEL:.*]]
; CHECK-DISABLE: r0 = 0
; CHECK-DISABLE: [[LABEL]]:
; CHECK-DISABLE: r0 <<= 32
; CHECK-DISABLE: r0 >>= 32
; CHECK-DISABLE: [[REG6]] += r0
; CHECK-DISABLE: r0 = [[REG6]]

; CHECK-COMMON: exit

; Function Attrs: argmemonly nounwind willreturn		; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.start.p0(i64 immarg, ptr nocapture) #1		declare void @llvm.lifetime.start.p0(i64 immarg, ptr nocapture) #1

declare dso_local i32 @foo(...) #2		declare dso_local i32 @foo(...) #2

; Function Attrs: argmemonly nounwind willreturn		; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.end.p0(i64 immarg, ptr nocapture) #1		declare void @llvm.lifetime.end.p0(i64 immarg, ptr nocapture) #1
Show All 17 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[TTI][BPF]: Undo specific transform-preventing passes and add one TTI hookNeeds ReviewPublic

Details

FoldAndOrOfICmpsUsingRanges

FoldTwoEntryPHINode

MinMaxHoisting

Proposal

Diff Detail

Event Timeline

Revision Contents

Diff 523276

llvm/include/llvm/Analysis/TargetTransformInfo.h

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

llvm/include/llvm/IR/IntrinsicsBPF.td

llvm/include/llvm/Transforms/InstCombine/InstCombiner.h

llvm/include/llvm/Transforms/Utils/LoopUtils.h

llvm/lib/Analysis/TargetTransformInfo.cpp

llvm/lib/Target/BPF/BPF.h

llvm/lib/Target/BPF/BPFAdjustOpt.cpp

llvm/lib/Target/BPF/BPFCheckAndAdjustIR.cpp

llvm/lib/Target/BPF/BPFTargetMachine.cpp

llvm/lib/Target/BPF/BPFTargetTransformInfo.h

llvm/lib/Target/BPF/CMakeLists.txt

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp

llvm/lib/Transforms/Scalar/LICM.cpp

llvm/lib/Transforms/Utils/SimplifyCFG.cpp

llvm/test/CodeGen/BPF/adjust-opt-icmp1.ll

llvm/test/CodeGen/BPF/adjust-opt-icmp2.ll

llvm/test/CodeGen/BPF/adjust-opt-icmp3.ll

llvm/test/CodeGen/BPF/adjust-opt-icmp4.ll

llvm/test/CodeGen/BPF/adjust-opt-speculative1.ll

llvm/test/CodeGen/BPF/adjust-opt-speculative2.ll

[TTI][BPF]: Undo specific transform-preventing passes and add one TTI hook
Needs ReviewPublic