This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Disable insertRangeTest() optimization for BPF target
AbandonedPublic

Authored by yonghong-song on Nov 17 2019, 8:43 PM.

Download Raw Diff

Details

Reviewers

ast
spatel
rnk
greened

Summary

The LLVM generated BPF byte codes need go through kernel
verifier before being allowed to execute in kernel.

Kernel verifier

https://github.com/torvalds/linux/blob/master/kernel/bpf/verifier.c

performs path sensitive analysis to verify safety of the program.
The verification is done during bpf program loading time, and
typically right before the program starts to run.

Since verifier is executed in kernel space and it runs during program
loading time, there is a great effort to avoid introducing complexity
and running time overhead for it. Sometime, in order to add analysis
to verifier, user code hacking is conducted to workaround the issue.

Related to this patch, the following kernel patch is a workaround
for code generated by LLVM instcombine insertRangeTest().

https://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf-next.git/commit/?id=b7a0d65d80a0c5034b366392624397a0915b7556

   107:       w1 = w0
   108:       w1 += -1
   109:       if w1 > 6 goto -24 <LBB0_5>
   110:       w0 += w8

Basically since verifier does not record and propagate copy state
for performance and memory reasons. Register "w0" value range
will become conservative and later on may cause verification failure.

Another example is

https://lore.kernel.org/netdev/871019a0-71f8-c26d-0ae8-c7fd8c8867fc@fb.com/

People has to come up with weird ways to workaround this issue.
To improve user space usability, this patch proposed to disable
insertRangeTest() for bpf target. All other targets are not
affected.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

yonghong-song created this revision.Nov 17 2019, 8:43 PM

Herald added a project: Restricted Project. · View Herald TranscriptNov 17 2019, 8:43 PM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

I added a few reviewers based on the recent commits at InstCombine directory. Please let me know if I missed anybody who should review the patch. Thanks!

I can't tell exactly what the motivation is from the description, but there are 2 major problems with this patch:

Disabling a transform does not solve the problem completely. If the source was written in the form that you do not want, you would want the reverse transform to produce the desired IR/asm.
By design, instcombine is a target-independent canonicalization pass. Adding the use of TTI defeats that goal.

There are (at least) 2 possible alternatives if you are determined to change this in the compiler rather than change the source code:

Reverse the existing transform for all targets (this will be very difficult to justify if the transform that you are hoping to disable results in fewer IR instructions).
Reverse the existing transform for BPF in a later pass (most likely candidates are DAGCombiner or CodeGenPrepare).

This revision now requires changes to proceed.Nov 19 2019, 5:28 AM

@spatel Thanks for the comments. I agree with your assessment. Looks like the best place is BPF backend to undo this optimization. I will look into implementation in that direction.

Look into the implementation in BPF backend.

In D70372#1752914, @yonghong-song wrote:

Look into the implementation in BPF backend.

I was going to suggest that, but the problem description read to me
as pretty hopeless situation with no guaranteed way out.
Hopefully i'm just misreading that.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

TargetTransformInfo.h

9 lines

TargetTransformInfoImpl.h

4 lines

lib/

Analysis/

TargetTransformInfo.cpp

4 lines

Target/

BPF/

BPFTargetMachine.h

2 lines

BPFTargetMachine.cpp

8 lines

BPFTargetTransformInfo.h

45 lines

Transforms/

InstCombine/

InstCombineAndOrXor.cpp

6 lines

InstCombineInternal.h

8 lines

InstructionCombining.cpp

15 lines

test/

Transforms/

InstCombine/

BPF/

no-insert-range-test.ll

18 lines

Diff 229742

llvm/include/llvm/Analysis/TargetTransformInfo.h

Show First 20 Lines • Show All 1,140 Lines • ▼ Show 20 Lines
/// \returns True if the target wants to expand the given reduction intrinsic		/// \returns True if the target wants to expand the given reduction intrinsic
/// into a shuffle sequence.		/// into a shuffle sequence.
bool shouldExpandReduction(const IntrinsicInst *II) const;		bool shouldExpandReduction(const IntrinsicInst *II) const;

/// \returns the size cost of rematerializing a GlobalValue address relative		/// \returns the size cost of rematerializing a GlobalValue address relative
/// to a stack reload.		/// to a stack reload.
unsigned getGISelRematGlobalCost() const;		unsigned getGISelRematGlobalCost() const;

		/// \returns True if the target does not want to introduce a new variable
		/// with a better value range compared to existing variable.
		bool preferNoNewValueRange() const;

/// @}		/// @}

private:		private:
/// Estimate the latency of specified instruction.		/// Estimate the latency of specified instruction.
/// Returns 1 as the default value.		/// Returns 1 as the default value.
int getInstructionLatency(const Instruction *I) const;		int getInstructionLatency(const Instruction *I) const;

/// Returns the expected throughput cost of the instruction.		/// Returns the expected throughput cost of the instruction.
▲ Show 20 Lines • Show All 231 Lines • ▼ Show 20 Lines	virtual unsigned getLoadVectorFactor(unsigned VF, unsigned LoadSize,
VectorType *VecTy) const = 0;		VectorType *VecTy) const = 0;
virtual unsigned getStoreVectorFactor(unsigned VF, unsigned StoreSize,		virtual unsigned getStoreVectorFactor(unsigned VF, unsigned StoreSize,
unsigned ChainSizeInBytes,		unsigned ChainSizeInBytes,
VectorType *VecTy) const = 0;		VectorType *VecTy) const = 0;
virtual bool useReductionIntrinsic(unsigned Opcode, Type *Ty,		virtual bool useReductionIntrinsic(unsigned Opcode, Type *Ty,
ReductionFlags) const = 0;		ReductionFlags) const = 0;
virtual bool shouldExpandReduction(const IntrinsicInst *II) const = 0;		virtual bool shouldExpandReduction(const IntrinsicInst *II) const = 0;
virtual unsigned getGISelRematGlobalCost() const = 0;		virtual unsigned getGISelRematGlobalCost() const = 0;
		virtual bool preferNoNewValueRange() const = 0;
virtual int getInstructionLatency(const Instruction *I) = 0;		virtual int getInstructionLatency(const Instruction *I) = 0;
};		};

template <typename T>		template <typename T>
class TargetTransformInfo::Model final : public TargetTransformInfo::Concept {		class TargetTransformInfo::Model final : public TargetTransformInfo::Concept {
T Impl;		T Impl;

public:		public:
▲ Show 20 Lines • Show All 466 Lines • ▼ Show 20 Lines	public:
bool shouldExpandReduction(const IntrinsicInst *II) const override {		bool shouldExpandReduction(const IntrinsicInst *II) const override {
return Impl.shouldExpandReduction(II);		return Impl.shouldExpandReduction(II);
}		}

unsigned getGISelRematGlobalCost() const override {		unsigned getGISelRematGlobalCost() const override {
return Impl.getGISelRematGlobalCost();		return Impl.getGISelRematGlobalCost();
}		}

		bool preferNoNewValueRange () const override {
		return Impl.preferNoNewValueRange();
		}

int getInstructionLatency(const Instruction *I) override {		int getInstructionLatency(const Instruction *I) override {
return Impl.getInstructionLatency(I);		return Impl.getInstructionLatency(I);
}		}
};		};

template <typename T>		template <typename T>
TargetTransformInfo::TargetTransformInfo(T Impl)		TargetTransformInfo::TargetTransformInfo(T Impl)
: TTIImpl(new Model<T>(Impl)) {}		: TTIImpl(new Model<T>(Impl)) {}
▲ Show 20 Lines • Show All 97 Lines • Show Last 20 Lines

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

Show First 20 Lines • Show All 605 Lines • ▼ Show 20 Lines	public:
bool shouldExpandReduction(const IntrinsicInst *II) const {		bool shouldExpandReduction(const IntrinsicInst *II) const {
return true;		return true;
}		}

unsigned getGISelRematGlobalCost() const {		unsigned getGISelRematGlobalCost() const {
return 1;		return 1;
}		}

		bool preferNoNewValueRange() const {
		return false;
		}

protected:		protected:
// Obtain the minimum required size to hold the value (without the sign)		// Obtain the minimum required size to hold the value (without the sign)
// In case of a vector it returns the min required size for one element.		// In case of a vector it returns the min required size for one element.
unsigned minRequiredElementSize(const Value* Val, bool &isSigned) {		unsigned minRequiredElementSize(const Value* Val, bool &isSigned) {
if (isa<ConstantDataVector>(Val) \|\| isa<ConstantVector>(Val)) {		if (isa<ConstantDataVector>(Val) \|\| isa<ConstantVector>(Val)) {
const auto* VectorValue = cast<Constant>(Val);		const auto* VectorValue = cast<Constant>(Val);

// In case of a vector need to pick the max between the min		// In case of a vector need to pick the max between the min
▲ Show 20 Lines • Show All 311 Lines • Show Last 20 Lines

llvm/lib/Analysis/TargetTransformInfo.cpp

	Show First 20 Lines • Show All 844 Lines • ▼ Show 20 Lines
	bool TargetTransformInfo::shouldExpandReduction(const IntrinsicInst *II) const {			bool TargetTransformInfo::shouldExpandReduction(const IntrinsicInst *II) const {
	return TTIImpl->shouldExpandReduction(II);			return TTIImpl->shouldExpandReduction(II);
	}			}

	unsigned TargetTransformInfo::getGISelRematGlobalCost() const {			unsigned TargetTransformInfo::getGISelRematGlobalCost() const {
	return TTIImpl->getGISelRematGlobalCost();			return TTIImpl->getGISelRematGlobalCost();
	}			}

				bool TargetTransformInfo::preferNoNewValueRange() const {
				return TTIImpl->preferNoNewValueRange();
				}

	int TargetTransformInfo::getInstructionLatency(const Instruction *I) const {			int TargetTransformInfo::getInstructionLatency(const Instruction *I) const {
	return TTIImpl->getInstructionLatency(I);			return TTIImpl->getInstructionLatency(I);
	}			}

	static bool matchPairwiseShuffleMask(ShuffleVectorInst *SI, bool IsLeft,			static bool matchPairwiseShuffleMask(ShuffleVectorInst *SI, bool IsLeft,
	unsigned Level) {			unsigned Level) {
	// We don't need a shuffle if we just want to have element 0 in position 0 of			// We don't need a shuffle if we just want to have element 0 in position 0 of
	// the vector.			// the vector.
	▲ Show 20 Lines • Show All 534 Lines • Show Last 20 Lines

llvm/lib/Target/BPF/BPFTargetMachine.h

Show All 26 Lines	BPFTargetMachine(const Target &T, const Triple &TT, StringRef CPU,
Optional<Reloc::Model> RM, Optional<CodeModel::Model> CM,		Optional<Reloc::Model> RM, Optional<CodeModel::Model> CM,
CodeGenOpt::Level OL, bool JIT);		CodeGenOpt::Level OL, bool JIT);

const BPFSubtarget *getSubtargetImpl() const { return &Subtarget; }		const BPFSubtarget *getSubtargetImpl() const { return &Subtarget; }
const BPFSubtarget *getSubtargetImpl(const Function &) const override {		const BPFSubtarget *getSubtargetImpl(const Function &) const override {
return &Subtarget;		return &Subtarget;
}		}

		TargetTransformInfo getTargetTransformInfo(const Function &F) override;

TargetPassConfig *createPassConfig(PassManagerBase &PM) override;		TargetPassConfig *createPassConfig(PassManagerBase &PM) override;

TargetLoweringObjectFile *getObjFileLowering() const override {		TargetLoweringObjectFile *getObjFileLowering() const override {
return TLOF.get();		return TLOF.get();
}		}
};		};
}		}

#endif		#endif

llvm/lib/Target/BPF/BPFTargetMachine.cpp

//===-- BPFTargetMachine.cpp - Define TargetMachine for BPF ---------------===//		//===-- BPFTargetMachine.cpp - Define TargetMachine for BPF ---------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// Implements the info about BPF target spec.		// Implements the info about BPF target spec.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "BPFTargetMachine.h"		#include "BPFTargetMachine.h"
#include "BPF.h"		#include "BPF.h"
		#include "BPFTargetTransformInfo.h"
#include "MCTargetDesc/BPFMCAsmInfo.h"		#include "MCTargetDesc/BPFMCAsmInfo.h"
#include "TargetInfo/BPFTargetInfo.h"		#include "TargetInfo/BPFTargetInfo.h"
		#include "llvm/Analysis/TargetTransformInfo.h"
		#include "llvm/CodeGen/BasicTTIImpl.h"
#include "llvm/CodeGen/Passes.h"		#include "llvm/CodeGen/Passes.h"
#include "llvm/CodeGen/TargetLoweringObjectFileImpl.h"		#include "llvm/CodeGen/TargetLoweringObjectFileImpl.h"
#include "llvm/CodeGen/TargetPassConfig.h"		#include "llvm/CodeGen/TargetPassConfig.h"
#include "llvm/IR/LegacyPassManager.h"		#include "llvm/IR/LegacyPassManager.h"
#include "llvm/Support/FormattedStream.h"		#include "llvm/Support/FormattedStream.h"
#include "llvm/Support/TargetRegistry.h"		#include "llvm/Support/TargetRegistry.h"
#include "llvm/Target/TargetOptions.h"		#include "llvm/Target/TargetOptions.h"
using namespace llvm;		using namespace llvm;
▲ Show 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	public:
void addPreEmitPass() override;		void addPreEmitPass() override;
};		};
}		}

TargetPassConfig *BPFTargetMachine::createPassConfig(PassManagerBase &PM) {		TargetPassConfig *BPFTargetMachine::createPassConfig(PassManagerBase &PM) {
return new BPFPassConfig(*this, PM);		return new BPFPassConfig(*this, PM);
}		}

		TargetTransformInfo
		BPFTargetMachine::getTargetTransformInfo(const Function &F) {
		return TargetTransformInfo(BPFTTIImpl(this, F));
		}

void BPFPassConfig::addIRPasses() {		void BPFPassConfig::addIRPasses() {

addPass(createBPFAbstractMemberAccess(&getBPFTargetMachine()));		addPass(createBPFAbstractMemberAccess(&getBPFTargetMachine()));

TargetPassConfig::addIRPasses();		TargetPassConfig::addIRPasses();
}		}

// Install an instruction selector pass using		// Install an instruction selector pass using
Show All 28 Lines

llvm/lib/Target/BPF/BPFTargetTransformInfo.h

This file was added.

				//===------ BPFTargetTransformInfo.h - BPF specific TTI ---------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file a TargetTransformInfo::Concept conforming object specific to the
				// BPF target machine. It uses the target's detailed information to
				// provide more precise answers to certain TTI queries, while letting the
				// target independent and default TTI implementations handle the rest.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_LIB_TARGET_BPF_BPFTARGETTRANSFORMINFO_H
				#define LLVM_LIB_TARGET_BPF_BPFTARGETTRANSFORMINFO_H

				#include "BPFTargetMachine.h"
				#include "llvm/Analysis/TargetTransformInfo.h"
				#include "llvm/CodeGen/BasicTTIImpl.h"

				namespace llvm {
				class BPFTTIImpl : public BasicTTIImplBase<BPFTTIImpl> {
				typedef BasicTTIImplBase<BPFTTIImpl> BaseT;
				typedef TargetTransformInfo TTI;
				friend BaseT;

				const BPFSubtarget *ST;
				const BPFTargetLowering *TLI;

				const BPFSubtarget *getST() const { return ST; }
				const BPFTargetLowering *getTLI() const { return TLI; }

				public:
				explicit BPFTTIImpl(const BPFTargetMachine *TM, const Function &F)
				: BaseT(TM, F.getParent()->getDataLayout()), ST(TM->getSubtargetImpl(F)),
				TLI(ST->getTargetLowering()) {}

				bool preferNoNewValueRange() const { return true; }
				};

				} // end namespace llvm

				#endif // LLVM_LIB_TARGET_BPF_BPFTARGETTRANSFORMINFO_H

llvm/lib/Transforms/InstCombine/InstCombineAndOrXor.cpp

//===- InstCombineAndOrXor.cpp --------------------------------------------===//		//===- InstCombineAndOrXor.cpp --------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file implements the visitAnd, visitOr, and visitXor functions.		// This file implements the visitAnd, visitOr, and visitXor functions.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "InstCombineInternal.h"		#include "InstCombineInternal.h"
#include "llvm/Analysis/CmpInstAnalysis.h"		#include "llvm/Analysis/CmpInstAnalysis.h"
#include "llvm/Analysis/InstructionSimplify.h"		#include "llvm/Analysis/InstructionSimplify.h"
		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/Transforms/Utils/Local.h"		#include "llvm/Transforms/Utils/Local.h"
#include "llvm/IR/ConstantRange.h"		#include "llvm/IR/ConstantRange.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
#include "llvm/IR/PatternMatch.h"		#include "llvm/IR/PatternMatch.h"
using namespace llvm;		using namespace llvm;
using namespace PatternMatch;		using namespace PatternMatch;

#define DEBUG_TYPE "instcombine"		#define DEBUG_TYPE "instcombine"
▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines	Value InstCombiner::insertRangeTest(Value V, const APInt &Lo, const APInt &Hi,
// V >= Min && V < Hi --> V < Hi		// V >= Min && V < Hi --> V < Hi
// V < Min \|\| V >= Hi --> V >= Hi		// V < Min \|\| V >= Hi --> V >= Hi
ICmpInst::Predicate Pred = Inside ? ICmpInst::ICMP_ULT : ICmpInst::ICMP_UGE;		ICmpInst::Predicate Pred = Inside ? ICmpInst::ICMP_ULT : ICmpInst::ICMP_UGE;
if (isSigned ? Lo.isMinSignedValue() : Lo.isMinValue()) {		if (isSigned ? Lo.isMinSignedValue() : Lo.isMinValue()) {
Pred = isSigned ? ICmpInst::getSignedPredicate(Pred) : Pred;		Pred = isSigned ? ICmpInst::getSignedPredicate(Pred) : Pred;
return Builder.CreateICmp(Pred, V, ConstantInt::get(Ty, Hi));		return Builder.CreateICmp(Pred, V, ConstantInt::get(Ty, Hi));
}		}

		// Bail out if the target prefers not to create a new variable even with better
		// value range.
		if (TTI.preferNoNewValueRange())
		return nullptr;

// V >= Lo && V < Hi --> V - Lo u< Hi - Lo		// V >= Lo && V < Hi --> V - Lo u< Hi - Lo
// V < Lo \|\| V >= Hi --> V - Lo u>= Hi - Lo		// V < Lo \|\| V >= Hi --> V - Lo u>= Hi - Lo
Value *VMinusLo =		Value *VMinusLo =
Builder.CreateSub(V, ConstantInt::get(Ty, Lo), V->getName() + ".off");		Builder.CreateSub(V, ConstantInt::get(Ty, Lo), V->getName() + ".off");
Constant *HiMinusLo = ConstantInt::get(Ty, Hi - Lo);		Constant *HiMinusLo = ConstantInt::get(Ty, Hi - Lo);
return Builder.CreateICmp(Pred, VMinusLo, HiMinusLo);		return Builder.CreateICmp(Pred, VMinusLo, HiMinusLo);
}		}

▲ Show 20 Lines • Show All 3,101 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstCombineInternal.h

Show First 20 Lines • Show All 55 Lines • ▼ Show 20 Lines
class DataLayout;		class DataLayout;
class DominatorTree;		class DominatorTree;
class GEPOperator;		class GEPOperator;
class GlobalVariable;		class GlobalVariable;
class LoopInfo;		class LoopInfo;
class OptimizationRemarkEmitter;		class OptimizationRemarkEmitter;
class ProfileSummaryInfo;		class ProfileSummaryInfo;
class TargetLibraryInfo;		class TargetLibraryInfo;
		class TargetTransformInfo;
class User;		class User;

/// Assign a complexity or rank value to LLVM Values. This is used to reduce		/// Assign a complexity or rank value to LLVM Values. This is used to reduce
/// the amount of pattern matching needed for compares and commutative		/// the amount of pattern matching needed for compares and commutative
/// instructions. For example, if we have:		/// instructions. For example, if we have:
/// icmp ugt X, Constant		/// icmp ugt X, Constant
/// or		/// or
/// xor (add X, Constant), cast Z		/// xor (add X, Constant), cast Z
▲ Show 20 Lines • Show All 250 Lines • ▼ Show 20 Lines	private:
AssumptionCache &AC;		AssumptionCache &AC;
TargetLibraryInfo &TLI;		TargetLibraryInfo &TLI;
DominatorTree &DT;		DominatorTree &DT;
const DataLayout &DL;		const DataLayout &DL;
const SimplifyQuery SQ;		const SimplifyQuery SQ;
OptimizationRemarkEmitter &ORE;		OptimizationRemarkEmitter &ORE;
BlockFrequencyInfo *BFI;		BlockFrequencyInfo *BFI;
ProfileSummaryInfo *PSI;		ProfileSummaryInfo *PSI;
		TargetTransformInfo &TTI;

// Optional analyses. When non-null, these can both be used to do better		// Optional analyses. When non-null, these can both be used to do better
// combining and will be updated to reflect any changes.		// combining and will be updated to reflect any changes.
LoopInfo *LI;		LoopInfo *LI;

bool MadeIRChange = false;		bool MadeIRChange = false;

public:		public:
InstCombiner(InstCombineWorklist &Worklist, BuilderTy &Builder,		InstCombiner(InstCombineWorklist &Worklist, BuilderTy &Builder,
bool MinimizeSize, bool ExpensiveCombines, AliasAnalysis *AA,		bool MinimizeSize, bool ExpensiveCombines, AliasAnalysis *AA,
AssumptionCache &AC, TargetLibraryInfo &TLI, DominatorTree &DT,		AssumptionCache &AC, TargetLibraryInfo &TLI, DominatorTree &DT,
OptimizationRemarkEmitter &ORE, BlockFrequencyInfo *BFI,		OptimizationRemarkEmitter &ORE, BlockFrequencyInfo *BFI,
ProfileSummaryInfo PSI, const DataLayout &DL, LoopInfo LI)		ProfileSummaryInfo *PSI, const DataLayout &DL,
		TargetTransformInfo &TTI, LoopInfo *LI)
: Worklist(Worklist), Builder(Builder), MinimizeSize(MinimizeSize),		: Worklist(Worklist), Builder(Builder), MinimizeSize(MinimizeSize),
ExpensiveCombines(ExpensiveCombines), AA(AA), AC(AC), TLI(TLI), DT(DT),		ExpensiveCombines(ExpensiveCombines), AA(AA), AC(AC), TLI(TLI), DT(DT),
DL(DL), SQ(DL, &TLI, &DT, &AC), ORE(ORE), BFI(BFI), PSI(PSI), LI(LI) {}		DL(DL), SQ(DL, &TLI, &DT, &AC), ORE(ORE), BFI(BFI), PSI(PSI), TTI(TTI),
		LI(LI) {}

/// Run the combiner over the entire worklist until it is empty.		/// Run the combiner over the entire worklist until it is empty.
///		///
/// \returns true if the IR is changed.		/// \returns true if the IR is changed.
bool run();		bool run();

AssumptionCache &getAssumptionCache() const { return AC; }		AssumptionCache &getAssumptionCache() const { return AC; }

▲ Show 20 Lines • Show All 654 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

Show First 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
#include "llvm/Analysis/InstructionSimplify.h"		#include "llvm/Analysis/InstructionSimplify.h"
#include "llvm/Analysis/LazyBlockFrequencyInfo.h"		#include "llvm/Analysis/LazyBlockFrequencyInfo.h"
#include "llvm/Analysis/LoopInfo.h"		#include "llvm/Analysis/LoopInfo.h"
#include "llvm/Analysis/MemoryBuiltins.h"		#include "llvm/Analysis/MemoryBuiltins.h"
#include "llvm/Analysis/OptimizationRemarkEmitter.h"		#include "llvm/Analysis/OptimizationRemarkEmitter.h"
#include "llvm/Analysis/ProfileSummaryInfo.h"		#include "llvm/Analysis/ProfileSummaryInfo.h"
#include "llvm/Analysis/TargetFolder.h"		#include "llvm/Analysis/TargetFolder.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/CFG.h"		#include "llvm/IR/CFG.h"
#include "llvm/IR/Constant.h"		#include "llvm/IR/Constant.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DIBuilder.h"		#include "llvm/IR/DIBuilder.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
▲ Show 20 Lines • Show All 3,482 Lines • ▼ Show 20 Lines	static bool prepareICWorklistFromFunction(Function &F, const DataLayout &DL,

return MadeIRChange;		return MadeIRChange;
}		}

static bool combineInstructionsOverFunction(		static bool combineInstructionsOverFunction(
Function &F, InstCombineWorklist &Worklist, AliasAnalysis *AA,		Function &F, InstCombineWorklist &Worklist, AliasAnalysis *AA,
AssumptionCache &AC, TargetLibraryInfo &TLI, DominatorTree &DT,		AssumptionCache &AC, TargetLibraryInfo &TLI, DominatorTree &DT,
OptimizationRemarkEmitter &ORE, BlockFrequencyInfo *BFI,		OptimizationRemarkEmitter &ORE, BlockFrequencyInfo *BFI,
ProfileSummaryInfo *PSI, bool ExpensiveCombines = true,		ProfileSummaryInfo *PSI, TargetTransformInfo &TTI,
LoopInfo *LI = nullptr) {		bool ExpensiveCombines = true, LoopInfo *LI = nullptr) {
auto &DL = F.getParent()->getDataLayout();		auto &DL = F.getParent()->getDataLayout();
ExpensiveCombines \|= EnableExpensiveCombines;		ExpensiveCombines \|= EnableExpensiveCombines;

/// Builder - This is an IRBuilder that automatically inserts new		/// Builder - This is an IRBuilder that automatically inserts new
/// instructions into the worklist when they are created.		/// instructions into the worklist when they are created.
IRBuilder<TargetFolder, IRBuilderCallbackInserter> Builder(		IRBuilder<TargetFolder, IRBuilderCallbackInserter> Builder(
F.getContext(), TargetFolder(DL),		F.getContext(), TargetFolder(DL),
IRBuilderCallbackInserter([&Worklist, &AC](Instruction *I) {		IRBuilderCallbackInserter([&Worklist, &AC](Instruction *I) {
Show All 13 Lines	static bool combineInstructionsOverFunction(
while (true) {		while (true) {
++Iteration;		++Iteration;
LLVM_DEBUG(dbgs() << "\n\nINSTCOMBINE ITERATION #" << Iteration << " on "		LLVM_DEBUG(dbgs() << "\n\nINSTCOMBINE ITERATION #" << Iteration << " on "
<< F.getName() << "\n");		<< F.getName() << "\n");

MadeIRChange \|= prepareICWorklistFromFunction(F, DL, &TLI, Worklist);		MadeIRChange \|= prepareICWorklistFromFunction(F, DL, &TLI, Worklist);

InstCombiner IC(Worklist, Builder, F.hasMinSize(), ExpensiveCombines, AA,		InstCombiner IC(Worklist, Builder, F.hasMinSize(), ExpensiveCombines, AA,
AC, TLI, DT, ORE, BFI, PSI, DL, LI);		AC, TLI, DT, ORE, BFI, PSI, DL, TTI, LI);
IC.MaxArraySizeForCombine = MaxArraySize;		IC.MaxArraySizeForCombine = MaxArraySize;

if (!IC.run())		if (!IC.run())
break;		break;
}		}

return MadeIRChange \|\| Iteration > 1;		return MadeIRChange \|\| Iteration > 1;
}		}
Show All 9 Lines	PreservedAnalyses InstCombinePass::run(Function &F,

auto *AA = &AM.getResult<AAManager>(F);		auto *AA = &AM.getResult<AAManager>(F);
const ModuleAnalysisManager &MAM =		const ModuleAnalysisManager &MAM =
AM.getResult<ModuleAnalysisManagerFunctionProxy>(F).getManager();		AM.getResult<ModuleAnalysisManagerFunctionProxy>(F).getManager();
ProfileSummaryInfo *PSI =		ProfileSummaryInfo *PSI =
MAM.getCachedResult<ProfileSummaryAnalysis>(*F.getParent());		MAM.getCachedResult<ProfileSummaryAnalysis>(*F.getParent());
auto *BFI = (PSI && PSI->hasProfileSummary()) ?		auto *BFI = (PSI && PSI->hasProfileSummary()) ?
&AM.getResult<BlockFrequencyAnalysis>(F) : nullptr;		&AM.getResult<BlockFrequencyAnalysis>(F) : nullptr;
		TargetTransformInfo &TTI = AM.getResult<TargetIRAnalysis>(F);

if (!combineInstructionsOverFunction(F, Worklist, AA, AC, TLI, DT, ORE,		if (!combineInstructionsOverFunction(F, Worklist, AA, AC, TLI, DT, ORE,
BFI, PSI, ExpensiveCombines, LI))		BFI, PSI, TTI, ExpensiveCombines, LI))
// No changes, all analyses are preserved.		// No changes, all analyses are preserved.
return PreservedAnalyses::all();		return PreservedAnalyses::all();

// Mark all the analyses that instcombine updates as preserved.		// Mark all the analyses that instcombine updates as preserved.
PreservedAnalyses PA;		PreservedAnalyses PA;
PA.preserveSet<CFGAnalyses>();		PA.preserveSet<CFGAnalyses>();
PA.preserve<AAManager>();		PA.preserve<AAManager>();
PA.preserve<BasicAA>();		PA.preserve<BasicAA>();
PA.preserve<GlobalsAA>();		PA.preserve<GlobalsAA>();
return PA;		return PA;
}		}

void InstructionCombiningPass::getAnalysisUsage(AnalysisUsage &AU) const {		void InstructionCombiningPass::getAnalysisUsage(AnalysisUsage &AU) const {
AU.setPreservesCFG();		AU.setPreservesCFG();
AU.addRequired<AAResultsWrapperPass>();		AU.addRequired<AAResultsWrapperPass>();
AU.addRequired<AssumptionCacheTracker>();		AU.addRequired<AssumptionCacheTracker>();
AU.addRequired<TargetLibraryInfoWrapperPass>();		AU.addRequired<TargetLibraryInfoWrapperPass>();
AU.addRequired<DominatorTreeWrapperPass>();		AU.addRequired<DominatorTreeWrapperPass>();
AU.addRequired<OptimizationRemarkEmitterWrapperPass>();		AU.addRequired<OptimizationRemarkEmitterWrapperPass>();
AU.addPreserved<DominatorTreeWrapperPass>();		AU.addPreserved<DominatorTreeWrapperPass>();
AU.addPreserved<AAResultsWrapperPass>();		AU.addPreserved<AAResultsWrapperPass>();
AU.addPreserved<BasicAAWrapperPass>();		AU.addPreserved<BasicAAWrapperPass>();
AU.addPreserved<GlobalsAAWrapperPass>();		AU.addPreserved<GlobalsAAWrapperPass>();
AU.addRequired<ProfileSummaryInfoWrapperPass>();		AU.addRequired<ProfileSummaryInfoWrapperPass>();
		AU.addRequired<TargetTransformInfoWrapperPass>();
LazyBlockFrequencyInfoPass::getLazyBFIAnalysisUsage(AU);		LazyBlockFrequencyInfoPass::getLazyBFIAnalysisUsage(AU);
}		}

bool InstructionCombiningPass::runOnFunction(Function &F) {		bool InstructionCombiningPass::runOnFunction(Function &F) {
if (skipFunction(F))		if (skipFunction(F))
return false;		return false;

// Required analyses.		// Required analyses.
auto AA = &getAnalysis<AAResultsWrapperPass>().getAAResults();		auto AA = &getAnalysis<AAResultsWrapperPass>().getAAResults();
auto &AC = getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);		auto &AC = getAnalysis<AssumptionCacheTracker>().getAssumptionCache(F);
auto &TLI = getAnalysis<TargetLibraryInfoWrapperPass>().getTLI(F);		auto &TLI = getAnalysis<TargetLibraryInfoWrapperPass>().getTLI(F);
auto &DT = getAnalysis<DominatorTreeWrapperPass>().getDomTree();		auto &DT = getAnalysis<DominatorTreeWrapperPass>().getDomTree();
auto &ORE = getAnalysis<OptimizationRemarkEmitterWrapperPass>().getORE();		auto &ORE = getAnalysis<OptimizationRemarkEmitterWrapperPass>().getORE();

// Optional analyses.		// Optional analyses.
auto *LIWP = getAnalysisIfAvailable<LoopInfoWrapperPass>();		auto *LIWP = getAnalysisIfAvailable<LoopInfoWrapperPass>();
auto *LI = LIWP ? &LIWP->getLoopInfo() : nullptr;		auto *LI = LIWP ? &LIWP->getLoopInfo() : nullptr;
ProfileSummaryInfo *PSI =		ProfileSummaryInfo *PSI =
&getAnalysis<ProfileSummaryInfoWrapperPass>().getPSI();		&getAnalysis<ProfileSummaryInfoWrapperPass>().getPSI();
BlockFrequencyInfo *BFI =		BlockFrequencyInfo *BFI =
(PSI && PSI->hasProfileSummary()) ?		(PSI && PSI->hasProfileSummary()) ?
&getAnalysis<LazyBlockFrequencyInfoPass>().getBFI() :		&getAnalysis<LazyBlockFrequencyInfoPass>().getBFI() :
nullptr;		nullptr;
		auto &TTI = getAnalysis<TargetTransformInfoWrapperPass>().getTTI(F);

return combineInstructionsOverFunction(F, Worklist, AA, AC, TLI, DT, ORE,		return combineInstructionsOverFunction(F, Worklist, AA, AC, TLI, DT, ORE,
BFI, PSI, ExpensiveCombines, LI);		BFI, PSI, TTI, ExpensiveCombines, LI);
}		}

char InstructionCombiningPass::ID = 0;		char InstructionCombiningPass::ID = 0;

InstructionCombiningPass::InstructionCombiningPass(bool ExpensiveCombines)		InstructionCombiningPass::InstructionCombiningPass(bool ExpensiveCombines)
: FunctionPass(ID), ExpensiveCombines(ExpensiveCombines) {		: FunctionPass(ID), ExpensiveCombines(ExpensiveCombines) {
initializeInstructionCombiningPassPass(*PassRegistry::getPassRegistry());		initializeInstructionCombiningPassPass(*PassRegistry::getPassRegistry());
}		}

INITIALIZE_PASS_BEGIN(InstructionCombiningPass, "instcombine",		INITIALIZE_PASS_BEGIN(InstructionCombiningPass, "instcombine",
"Combine redundant instructions", false, false)		"Combine redundant instructions", false, false)
INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)		INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)
INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(TargetLibraryInfoWrapperPass)
INITIALIZE_PASS_DEPENDENCY(DominatorTreeWrapperPass)		INITIALIZE_PASS_DEPENDENCY(DominatorTreeWrapperPass)
INITIALIZE_PASS_DEPENDENCY(AAResultsWrapperPass)		INITIALIZE_PASS_DEPENDENCY(AAResultsWrapperPass)
INITIALIZE_PASS_DEPENDENCY(GlobalsAAWrapperPass)		INITIALIZE_PASS_DEPENDENCY(GlobalsAAWrapperPass)
INITIALIZE_PASS_DEPENDENCY(OptimizationRemarkEmitterWrapperPass)		INITIALIZE_PASS_DEPENDENCY(OptimizationRemarkEmitterWrapperPass)
INITIALIZE_PASS_DEPENDENCY(LazyBlockFrequencyInfoPass)		INITIALIZE_PASS_DEPENDENCY(LazyBlockFrequencyInfoPass)
INITIALIZE_PASS_DEPENDENCY(ProfileSummaryInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(ProfileSummaryInfoWrapperPass)
		INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)
INITIALIZE_PASS_END(InstructionCombiningPass, "instcombine",		INITIALIZE_PASS_END(InstructionCombiningPass, "instcombine",
"Combine redundant instructions", false, false)		"Combine redundant instructions", false, false)

// Initialization Routines		// Initialization Routines
void llvm::initializeInstCombine(PassRegistry &Registry) {		void llvm::initializeInstCombine(PassRegistry &Registry) {
initializeInstructionCombiningPassPass(Registry);		initializeInstructionCombiningPassPass(Registry);
}		}

Show All 11 Lines

llvm/test/Transforms/InstCombine/BPF/no-insert-range-test.ll

This file was added.

				; RUN: opt < %s -instcombine -S \| FileCheck %s
				; REQUIRES: bpf-registered-target

				; bpf target should not perform insertRangeTest.

				target datalayout = "e-m:e-p:64:64-i64:64-n32:64-S128"
				target triple = "bpf"

				define dso_local i32 @test(i32 %arg) local_unnamed_addr {
				entry:
				%cmp = icmp sgt i32 %arg, 0
				%cmp1 = icmp slt i32 %arg, 8
				; CHECK-NOT: add i32 %arg, -1
				%0 = and i1 %cmp, %cmp1
				; CHECK: %0 = and i1 %cmp, %cmp1
				%. = zext i1 %0 to i32
				ret i32 %.
				}