This is an archive of the discontinued LLVM Phabricator instance.

Differential D134006

Add an optional cache to computeKnownBits.
Needs Review · Public

Authored by jlebar on Sep 15 2022, 8:02 PM.

Details

Reviewers

asbirlea
nikic
fhahn

Summary

BasicAA (via MemoryDependenceResults, via GVN) may make many calls to
computeKnownBits. On my testcase (not able to share, sorry), this cache
reduces the time spent in computeKnownBits from 2s to 270ms, a 7.5x
speedup.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jlebar created this revision. · Sep 15 2022, 8:02 PM

Herald added a project: Restricted Project. · Sep 15 2022, 8:02 PM

Herald added subscribers: jeroen.dobbelaere, foad, hiraditya.

jlebar requested review of this revision. · Sep 15 2022, 8:02 PM

Herald added a project: Restricted Project. · Sep 15 2022, 8:02 PM

Herald added a subscriber: llvm-commits.

jlebar added a reviewer: asbirlea. · Sep 15 2022, 8:03 PM

jlebar added a subscriber: mkuper.

jlebar added a parent revision: D134008: Add Cleanup class. · Sep 15 2022, 8:05 PM

jlebar added a parent revision: D133996: Add a cache for DL.getTypeAllocSize() to BasicAA.

Harbormaster completed remote builds in B187032: Diff 460614. · Sep 15 2022, 8:09 PM

nikic added reviewers: nikic, fhahn. · Sep 16 2022, 12:53 AM

nikic added a subscriber: nikic.

nikic added inline comments.

llvm/include/llvm/Analysis/AliasAnalysis.h
41	Incorrect include paths
llvm/include/llvm/Analysis/ValueTracking.h
52	This is incorrect, the context instruction is used to check applicability of assumes for example, see isValidAssumeForContext().

> from 2 to 270ms

Makes it sound like a 135x slowdown.

foad added inline comments. · Sep 16 2022, 1:57 AM

llvm/include/llvm/Analysis/ValueTracking.h
49	If you want to reformat this file can you do it first as a separate NFC patch please?

jlebar added inline comments. · Sep 16 2022, 10:41 AM

llvm/include/llvm/Analysis/ValueTracking.h
52	`isGuaranteedToTransferExecutionToSuccessor`, wow, TIL. Thanks. But, very sad: With the change to look at the real CtxI rather than its BB, this no longer provides a significant speedup. This all seems so silly, because in 99.9% of cases the BB terminator really is sufficient. Any thoughts? Or do you think I should give up on this?
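
To make the objection in this thread concrete, here is a minimal toy sketch of why a cache keyed on the basic block alone can hand back a stale answer. It is not LLVM code and not part of the patch: `ToyInst`, `ToyBlock`, `assumeAppliesAt`, `knownNonZeroAt`, `cachedByBlock`, `cachedByContext`, and `makeExampleBlock` are all hypothetical names, and the same-block rule is only a loose approximation of what isValidAssumeForContext() checks, assuming a single value with a single non-zero assume.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <utility>
#include <vector>

// Toy model only: none of these types are LLVM's.
struct ToyInst {
  bool IsAssumeNonZero = false; // stands in for an assume that V != 0
  bool AlwaysTransfers = true;  // false for e.g. a call that may not return
};
using ToyBlock = std::vector<ToyInst>;

// Loose approximation of the same-block rule: an assume applies at context
// position Ctx if it comes before Ctx, or if it comes after Ctx and every
// instruction from Ctx up to the assume is guaranteed to pass control on.
bool assumeAppliesAt(const ToyBlock &BB, std::size_t AssumeIdx,
                     std::size_t Ctx) {
  assert(AssumeIdx < BB.size() && Ctx < BB.size());
  if (AssumeIdx == Ctx)
    return false; // an assume does not justify itself
  if (AssumeIdx < Ctx)
    return true;
  for (std::size_t I = Ctx; I < AssumeIdx; ++I)
    if (!BB[I].AlwaysTransfers)
      return false;
  return true;
}

// Uncached, context-sensitive query.
bool knownNonZeroAt(const ToyBlock &BB, std::size_t Ctx) {
  for (std::size_t I = 0; I < BB.size(); ++I)
    if (BB[I].IsAssumeNonZero && assumeAppliesAt(BB, I, Ctx))
      return true;
  return false;
}

// Cache keyed on the block only: the first answer is reused for every
// context position in that block.
bool cachedByBlock(std::map<const ToyBlock *, bool> &Cache, const ToyBlock &BB,
                   std::size_t Ctx) {
  auto It = Cache.find(&BB);
  if (It != Cache.end())
    return It->second;
  bool Result = knownNonZeroAt(BB, Ctx);
  Cache.emplace(&BB, Result);
  return Result;
}

// Cache keyed on (block, position), standing in for keying on the real
// context instruction.
using CtxKey = std::pair<const ToyBlock *, std::size_t>;
bool cachedByContext(std::map<CtxKey, bool> &Cache, const ToyBlock &BB,
                     std::size_t Ctx) {
  CtxKey Key(&BB, Ctx);
  auto It = Cache.find(Key);
  if (It != Cache.end())
    return It->second;
  bool Result = knownNonZeroAt(BB, Ctx);
  Cache.emplace(Key, Result);
  return Result;
}

// Block layout: [plain, may-not-return call, assume, plain].
ToyBlock makeExampleBlock() {
  ToyBlock BB(4);
  BB[1].AlwaysTransfers = false;
  BB[2].IsAssumeNonZero = true;
  return BB;
}
```

With the block from `makeExampleBlock()`, querying position 3 and then position 0 through `cachedByBlock` returns true both times, even though the uncached answer at position 0 is false because the call at position 1 may not return. `cachedByContext` keeps the two answers apart, at the cost of far fewer cache hits, which matches the lost speedup described above.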

jlebar edited the summary of this revision. · Sep 16 2022, 10:41 AM

jlebar added inline comments. · Sep 16 2022, 10:48 AM

llvm/include/llvm/Analysis/ValueTracking.h
49	I didn't really want to reformat the file, but arc was doing it for me because I touched these lines... :-/ Anyway, I pushed 8cc3bfd13f3135985e5b15ee65f2fc43239fb9fe.

RKSimon added a subscriber: RKSimon. · Sep 18 2022, 1:11 AM

> BasicAA (via MemoryDependenceResults, via GVN) may make many calls to
> computeKnownBits. On my testcase (not able to share, sorry), this cache
> reduces the time spent in computeKnownBits from 2s to 270ms, a 7.5x
> speedup.

MDA has a known problem with overly large recursion cutoffs. I'd be interested to know whether passing -memdep-block-number-limit=100 would improve compile-time for your case.

llvm/include/llvm/Analysis/ValueTracking.h
52	I don't have any great ideas on how to do more effective caching without affecting result quality. I guess we could decouple the context-independent from the context-dependent parts somehow, but given that they are currently interleaved, this doesn't sound easy.
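
One way to picture the decoupling idea mentioned here is the toy sketch below: context-independent facts are cached per value, and context-dependent facts are recomputed on every query and merged in. It is an illustration under strong assumptions, not a proposal for ValueTracking: `ToyKnownBits`, `unionKnown`, `ToyAnalysis`, and the hard-coded facts are all hypothetical, and the sketch applies context facts only at the root of the query, whereas the comment above notes that the real computation interleaves the two kinds of facts during the walk.

```cpp
#include <cassert>
#include <cstdint>
#include <unordered_map>

// Toy stand-in for known-bits information on 8-bit values; not llvm::KnownBits.
struct ToyKnownBits {
  std::uint8_t Zero = 0; // bits known to be 0
  std::uint8_t One = 0;  // bits known to be 1
};

// Merge two sets of facts about the same value.
ToyKnownBits unionKnown(ToyKnownBits A, ToyKnownBits B) {
  ToyKnownBits R;
  R.Zero = A.Zero | B.Zero;
  R.One = A.One | B.One;
  assert((R.Zero & R.One) == 0 && "conflicting facts");
  return R;
}

struct ToyAnalysis {
  std::unordered_map<int, ToyKnownBits> ContextFreeCache;
  int ExpensiveWalks = 0; // counts uncached context-free computations

  // Context-independent part, cached per value. The hard-coded result
  // pretends every value is "x << 2", so its low two bits are zero.
  ToyKnownBits contextFree(int ValueId) {
    auto It = ContextFreeCache.find(ValueId);
    if (It != ContextFreeCache.end())
      return It->second;
    ++ExpensiveWalks;
    ToyKnownBits K;
    K.Zero = 0x03;
    ContextFreeCache.emplace(ValueId, K);
    return K;
  }

  // Context-dependent part, never cached. Pretends an assume "top bit is
  // zero" becomes applicable from context position 5 onward.
  static ToyKnownBits fromAssumes(int ValueId, int CtxPos) {
    (void)ValueId;
    ToyKnownBits K;
    if (CtxPos >= 5)
      K.Zero = 0x80;
    return K;
  }

  ToyKnownBits query(int ValueId, int CtxPos) {
    return unionKnown(contextFree(ValueId), fromAssumes(ValueId, CtxPos));
  }
};
```

Two queries for the same value at different context positions share one context-free walk but can still return different results. The sketch sidesteps the hard part: it loses any precision that would come from applying assumes to operands deeper in the use/def graph.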

> MDA has a known problem with overly large recursion cutoffs. I'd be interested to know whether passing -memdep-block-number-limit=100 would improve compile-time for your case.

Turns out that tweaking the memdep limits did help in my case.

So I guess I'm happy dropping this patch. Though I *do* feel like there's still something here.

Revision Contents

Path                                          Size
llvm/include/llvm/Analysis/AliasAnalysis.h    4 lines
llvm/include/llvm/Analysis/ValueTracking.h    431 lines
llvm/lib/Analysis/BasicAliasAnalysis.cpp      5 lines
llvm/lib/Analysis/ValueTracking.cpp           65 lines

Diff 460614

llvm/include/llvm/Analysis/AliasAnalysis.h

[31 lines not shown]
// MustAlias at the same time. The current API can only return one result,		// MustAlias at the same time. The current API can only return one result,
// though this is rarely a problem in practice.		// though this is rarely a problem in practice.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_ANALYSIS_ALIASANALYSIS_H		#ifndef LLVM_ANALYSIS_ALIASANALYSIS_H
#define LLVM_ANALYSIS_ALIASANALYSIS_H		#define LLVM_ANALYSIS_ALIASANALYSIS_H

		#include "third_party/llvm/llvm-project/llvm/include/llvm/Analysis/ValueTracking.h"
		#include "third_party/llvm/llvm-project/llvm/include/llvm/Support/KnownBits.h"
		nikic: Incorrect include paths
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/Sequence.h"		#include "llvm/ADT/Sequence.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/Analysis/MemoryLocation.h"		#include "llvm/Analysis/MemoryLocation.h"
#include "llvm/IR/PassManager.h"		#include "llvm/IR/PassManager.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
#include <cstdint>		#include <cstdint>
[440 lines not shown]	public:
/// How many active NoAlias assumption uses there are.		/// How many active NoAlias assumption uses there are.
int NumAssumptionUses = 0;		int NumAssumptionUses = 0;

/// Location pairs for which an assumption based result is currently stored.		/// Location pairs for which an assumption based result is currently stored.
/// Used to remove all potentially incorrect results from the cache if an		/// Used to remove all potentially incorrect results from the cache if an
/// assumption is disproven.		/// assumption is disproven.
SmallVector<AAQueryInfo::LocPair, 4> AssumptionBasedResults;		SmallVector<AAQueryInfo::LocPair, 4> AssumptionBasedResults;

		KnownBitsCache KnownBitsCache;

AAQueryInfo(CaptureInfo *CI) : CI(CI) {}		AAQueryInfo(CaptureInfo *CI) : CI(CI) {}

/// Create a new AAQueryInfo based on this one, but with the cache cleared.		/// Create a new AAQueryInfo based on this one, but with the cache cleared.
/// This is used for recursive queries across phis, where cache results may		/// This is used for recursive queries across phis, where cache results may
/// not be valid.		/// not be valid.
AAQueryInfo withEmptyCache() {		AAQueryInfo withEmptyCache() {
AAQueryInfo NewAAQI(CI);		AAQueryInfo NewAAQI(CI);
NewAAQI.Depth = Depth;		NewAAQI.Depth = Depth;
[866 lines not shown]

llvm/include/llvm/Analysis/ValueTracking.h

	[40 lines not shown]
	class MDNode;			class MDNode;
	class OptimizationRemarkEmitter;			class OptimizationRemarkEmitter;
	class StringRef;			class StringRef;
	class TargetLibraryInfo;			class TargetLibraryInfo;
	class Value;			class Value;

	constexpr unsigned MaxAnalysisRecursionDepth = 6;			constexpr unsigned MaxAnalysisRecursionDepth = 6;

				// Optional cache used by KnownBits. Populated as we walk the use/def graph.
				//
				// We can key lookups on the basic block that contains CxtI, rather than on CxtI
				// itself, because the context instr itself is not significant, only its BB.
				nikic: This is incorrect, the context instruction is used to check applicability of assumes for example, see isValidAssumeForContext().
				jlebar: `isGuaranteedToTransferExecutionToSuccessor`, wow, TIL. Thanks. But, very sad: With the change to look at the real CtxI rather than its BB, this no longer provides a significant speedup. This all seems so silly, because in 99.9% of cases the BB terminator really is sufficient. Any thoughts? Or do you think I should give up on this?
				nikic: I don't have any great ideas on how to do more effective caching without affecting result quality. I guess we could decouple the context-independent from the context-dependent parts somehow, but given that they are currently interleaved, this doesn't sound easy.
				using KnownBitsCache = DenseMap<
				std::pair<const Value *, const BasicBlock * /*BB containing CxtI*/>,
				KnownBits>;

	/// Determine which bits of V are known to be either zero or one and return			/// Determine which bits of V are known to be either zero or one and return
	foad: If you want to reformat this file can you do it first as a separate NFC patch please?
	jlebar: I didn't really want to reformat the file, but arc was doing it for me because I touched these lines... :-/ Anyway, I pushed 8cc3bfd13f3135985e5b15ee65f2fc43239fb9fe.
	/// them in the KnownZero/KnownOne bit sets.			/// them in the KnownZero/KnownOne bit sets.
	///			///
	/// This function is defined on values with integer type, values with pointer			/// This function is defined on values with integer type, values with pointer
	/// type, and vectors of integers. In the case			/// type, and vectors of integers. In the case
	/// where V is a vector, the known zero and known one values are the			/// where V is a vector, the known zero and known one values are the
	/// same width as the vector element, and the bit is set only if it is true			/// same width as the vector element, and the bit is set only if it is true
	/// for all of the elements in the vector.			/// for all of the elements in the vector.
	void computeKnownBits(const Value *V, KnownBits &Known,			void computeKnownBits(const Value *V, KnownBits &Known, const DataLayout &DL,
	const DataLayout &DL, unsigned Depth = 0,			unsigned Depth = 0, AssumptionCache *AC = nullptr,
	AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	OptimizationRemarkEmitter *ORE = nullptr,			OptimizationRemarkEmitter *ORE = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true,
				KnownBitsCache *Cache = nullptr);

	/// Determine which bits of V are known to be either zero or one and return			/// Determine which bits of V are known to be either zero or one and return
	/// them in the KnownZero/KnownOne bit sets.			/// them in the KnownZero/KnownOne bit sets.
	///			///
	/// This function is defined on values with integer type, values with pointer			/// This function is defined on values with integer type, values with pointer
	/// type, and vectors of integers. In the case			/// type, and vectors of integers. In the case
	/// where V is a vector, the known zero and known one values are the			/// where V is a vector, the known zero and known one values are the
	/// same width as the vector element, and the bit is set only if it is true			/// same width as the vector element, and the bit is set only if it is true
	/// for all of the demanded elements in the vector.			/// for all of the demanded elements in the vector.
	void computeKnownBits(const Value *V, const APInt &DemandedElts,			void computeKnownBits(
	KnownBits &Known, const DataLayout &DL,			const Value *V, const APInt &DemandedElts, KnownBits &Known,
	unsigned Depth = 0, AssumptionCache *AC = nullptr,			const DataLayout &DL, unsigned Depth = 0, AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr, const DominatorTree *DT = nullptr,
	const DominatorTree *DT = nullptr,			OptimizationRemarkEmitter *ORE = nullptr, bool UseInstrInfo = true,
	OptimizationRemarkEmitter *ORE = nullptr,			KnownBitsCache *Cache = nullptr);
	bool UseInstrInfo = true);

	/// Returns the known bits rather than passing by reference.			/// Returns the known bits rather than passing by reference.
	KnownBits computeKnownBits(const Value *V, const DataLayout &DL,			KnownBits computeKnownBits(
	unsigned Depth = 0, AssumptionCache *AC = nullptr,			const Value *V, const DataLayout &DL, unsigned Depth = 0,
	const Instruction *CxtI = nullptr,			AssumptionCache *AC = nullptr, const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr, OptimizationRemarkEmitter *ORE = nullptr,
	OptimizationRemarkEmitter *ORE = nullptr,			bool UseInstrInfo = true,
	bool UseInstrInfo = true);			KnownBitsCache *Cache = nullptr);

	/// Returns the known bits rather than passing by reference.			/// Returns the known bits rather than passing by reference.
	KnownBits computeKnownBits(const Value *V, const APInt &DemandedElts,			KnownBits computeKnownBits(
	const DataLayout &DL, unsigned Depth = 0,			const Value *V, const APInt &DemandedElts, const DataLayout &DL,
	AssumptionCache *AC = nullptr,			unsigned Depth = 0, AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr, const DominatorTree *DT = nullptr,
	const DominatorTree *DT = nullptr,			OptimizationRemarkEmitter *ORE = nullptr, bool UseInstrInfo = true,
	OptimizationRemarkEmitter *ORE = nullptr,			KnownBitsCache *Cache = nullptr);
	bool UseInstrInfo = true);

	/// Compute known bits from the range metadata.			/// Compute known bits from the range metadata.
	/// \p KnownZero the set of bits that are known to be zero			/// \p KnownZero the set of bits that are known to be zero
	/// \p KnownOne the set of bits that are known to be one			/// \p KnownOne the set of bits that are known to be one
	void computeKnownBitsFromRangeMetadata(const MDNode &Ranges,			void computeKnownBitsFromRangeMetadata(const MDNode &Ranges, KnownBits &Known);
	KnownBits &Known);

	/// Return true if LHS and RHS have no common bits set.			/// Return true if LHS and RHS have no common bits set.
	bool haveNoCommonBitsSet(const Value *LHS, const Value *RHS,			bool haveNoCommonBitsSet(const Value *LHS, const Value *RHS,
	const DataLayout &DL,			const DataLayout &DL, AssumptionCache *AC = nullptr,
	AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Return true if the given value is known to have exactly one bit set when			/// Return true if the given value is known to have exactly one bit set when
	/// defined. For vectors return true if every element is known to be a power			/// defined. For vectors return true if every element is known to be a power
	/// of two when defined. Supports values with integer or pointer type and			/// of two when defined. Supports values with integer or pointer type and
	/// vectors of integers. If 'OrZero' is set, then return true if the given			/// vectors of integers. If 'OrZero' is set, then return true if the given
	/// value is either a power of two or zero.			/// value is either a power of two or zero.
	bool isKnownToBeAPowerOfTwo(const Value *V, const DataLayout &DL,			bool isKnownToBeAPowerOfTwo(const Value *V, const DataLayout &DL,
	bool OrZero = false, unsigned Depth = 0,			bool OrZero = false, unsigned Depth = 0,
	AssumptionCache *AC = nullptr,			AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	bool isOnlyUsedInZeroEqualityComparison(const Instruction *CxtI);			bool isOnlyUsedInZeroEqualityComparison(const Instruction *CxtI);

	/// Return true if the given value is known to be non-zero when defined. For			/// Return true if the given value is known to be non-zero when defined. For
	/// vectors, return true if every element is known to be non-zero when			/// vectors, return true if every element is known to be non-zero when
	/// defined. For pointers, if the context instruction and dominator tree are			/// defined. For pointers, if the context instruction and dominator tree are
	/// specified, perform context-sensitive analysis and return true if the			/// specified, perform context-sensitive analysis and return true if the
	/// pointer couldn't possibly be null at the specified instruction.			/// pointer couldn't possibly be null at the specified instruction.
	/// Supports values with integer or pointer type and vectors of integers.			/// Supports values with integer or pointer type and vectors of integers.
	bool isKnownNonZero(const Value *V, const DataLayout &DL, unsigned Depth = 0,			bool isKnownNonZero(const Value *V, const DataLayout &DL, unsigned Depth = 0,
	AssumptionCache *AC = nullptr,			AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Return true if the two given values are negation.			/// Return true if the two given values are negation.
	/// Currently can recoginze Value pair:			/// Currently can recoginze Value pair:
	/// 1: <X, Y> if X = sub (0, Y) or Y = sub (0, X)			/// 1: <X, Y> if X = sub (0, Y) or Y = sub (0, X)
	/// 2: <X, Y> if X = sub (A, B) and Y = sub (B, A)			/// 2: <X, Y> if X = sub (A, B) and Y = sub (B, A)
	bool isKnownNegation(const Value *X, const Value *Y, bool NeedNSW = false);			bool isKnownNegation(const Value *X, const Value *Y, bool NeedNSW = false);

	/// Returns true if the give value is known to be non-negative.			/// Returns true if the give value is known to be non-negative.
	bool isKnownNonNegative(const Value *V, const DataLayout &DL,			bool isKnownNonNegative(const Value *V, const DataLayout &DL,
	unsigned Depth = 0,			unsigned Depth = 0, AssumptionCache *AC = nullptr,
	AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Returns true if the given value is known be positive (i.e. non-negative			/// Returns true if the given value is known be positive (i.e. non-negative
	/// and non-zero).			/// and non-zero).
	bool isKnownPositive(const Value *V, const DataLayout &DL, unsigned Depth = 0,			bool isKnownPositive(const Value *V, const DataLayout &DL, unsigned Depth = 0,
	AssumptionCache *AC = nullptr,			AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Returns true if the given value is known be negative (i.e. non-positive			/// Returns true if the given value is known be negative (i.e. non-positive
	/// and non-zero).			/// and non-zero).
	bool isKnownNegative(const Value *V, const DataLayout &DL, unsigned Depth = 0,			bool isKnownNegative(const Value *V, const DataLayout &DL, unsigned Depth = 0,
	AssumptionCache *AC = nullptr,			AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Return true if the given values are known to be non-equal when defined.			/// Return true if the given values are known to be non-equal when defined.
	/// Supports scalar integer types only.			/// Supports scalar integer types only.
	bool isKnownNonEqual(const Value *V1, const Value *V2, const DataLayout &DL,			bool isKnownNonEqual(const Value *V1, const Value *V2, const DataLayout &DL,
	AssumptionCache *AC = nullptr,			AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Return true if 'V & Mask' is known to be zero. We use this predicate to			/// Return true if 'V & Mask' is known to be zero. We use this predicate to
	/// simplify operations downstream. Mask is known to be zero for bits that V			/// simplify operations downstream. Mask is known to be zero for bits that V
	/// cannot have.			/// cannot have.
	///			///
	/// This function is defined on values with integer type, values with pointer			/// This function is defined on values with integer type, values with pointer
	/// type, and vectors of integers. In the case			/// type, and vectors of integers. In the case
	/// where V is a vector, the mask, known zero, and known one values are the			/// where V is a vector, the mask, known zero, and known one values are the
	/// same width as the vector element, and the bit is set only if it is true			/// same width as the vector element, and the bit is set only if it is true
	/// for all of the elements in the vector.			/// for all of the elements in the vector.
	bool MaskedValueIsZero(const Value *V, const APInt &Mask,			bool MaskedValueIsZero(const Value *V, const APInt &Mask, const DataLayout &DL,
	const DataLayout &DL,
	unsigned Depth = 0, AssumptionCache *AC = nullptr,			unsigned Depth = 0, AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Return the number of times the sign bit of the register is replicated into			/// Return the number of times the sign bit of the register is replicated into
	/// the other bits. We know that at least 1 bit is always equal to the sign			/// the other bits. We know that at least 1 bit is always equal to the sign
	/// bit (itself), but other cases can give us information. For example,			/// bit (itself), but other cases can give us information. For example,
	/// immediately after an "ashr X, 2", we know that the top 3 bits are all			/// immediately after an "ashr X, 2", we know that the top 3 bits are all
	/// equal to each other, so we return 3. For vectors, return the number of			/// equal to each other, so we return 3. For vectors, return the number of
	/// sign bits for the vector element with the mininum number of known sign			/// sign bits for the vector element with the mininum number of known sign
	/// bits.			/// bits.
	unsigned ComputeNumSignBits(const Value *Op, const DataLayout &DL,			unsigned ComputeNumSignBits(const Value *Op, const DataLayout &DL,
	unsigned Depth = 0, AssumptionCache *AC = nullptr,			unsigned Depth = 0, AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr,			const DominatorTree *DT = nullptr,
	bool UseInstrInfo = true);			bool UseInstrInfo = true);

	/// Get the upper bound on bit size for this Value \p Op as a signed integer.			/// Get the upper bound on bit size for this Value \p Op as a signed integer.
	/// i.e. x == sext(trunc(x to MaxSignificantBits) to bitwidth(x)).			/// i.e. x == sext(trunc(x to MaxSignificantBits) to bitwidth(x)).
	/// Similar to the APInt::getSignificantBits function.			/// Similar to the APInt::getSignificantBits function.
	unsigned ComputeMaxSignificantBits(const Value *Op, const DataLayout &DL,			unsigned ComputeMaxSignificantBits(const Value *Op, const DataLayout &DL,
	unsigned Depth = 0,			unsigned Depth = 0,
	AssumptionCache *AC = nullptr,			AssumptionCache *AC = nullptr,
	const Instruction *CxtI = nullptr,			const Instruction *CxtI = nullptr,
	const DominatorTree *DT = nullptr);			const DominatorTree *DT = nullptr);

	/// Map a call instruction to an intrinsic ID. Libcalls which have equivalent			/// Map a call instruction to an intrinsic ID. Libcalls which have equivalent
	/// intrinsics are treated as-if they were intrinsics.			/// intrinsics are treated as-if they were intrinsics.
	Intrinsic::ID getIntrinsicForCallSite(const CallBase &CB,			Intrinsic::ID getIntrinsicForCallSite(const CallBase &CB,
	const TargetLibraryInfo *TLI);			const TargetLibraryInfo *TLI);

	/// Return true if we can prove that the specified FP value is never equal to			/// Return true if we can prove that the specified FP value is never equal to
	/// -0.0.			/// -0.0.
	bool CannotBeNegativeZero(const Value *V, const TargetLibraryInfo *TLI,			bool CannotBeNegativeZero(const Value *V, const TargetLibraryInfo *TLI,
	unsigned Depth = 0);			unsigned Depth = 0);

	/// Return true if we can prove that the specified FP value is either NaN or			/// Return true if we can prove that the specified FP value is either NaN or
	/// never less than -0.0.			/// never less than -0.0.
	///			///
	/// NaN --> true			/// NaN --> true
	/// +0 --> true			/// +0 --> true
	/// -0 --> true			/// -0 --> true
	/// x > +0 --> true			/// x > +0 --> true
	/// x < -0 --> false			/// x < -0 --> false
	bool CannotBeOrderedLessThanZero(const Value *V, const TargetLibraryInfo *TLI);			bool CannotBeOrderedLessThanZero(const Value *V, const TargetLibraryInfo *TLI);

	/// Return true if the floating-point scalar value is not an infinity or if			/// Return true if the floating-point scalar value is not an infinity or if
	/// the floating-point vector value has no infinities. Return false if a value			/// the floating-point vector value has no infinities. Return false if a value
	/// could ever be infinity.			/// could ever be infinity.
	bool isKnownNeverInfinity(const Value *V, const TargetLibraryInfo *TLI,			bool isKnownNeverInfinity(const Value *V, const TargetLibraryInfo *TLI,
	unsigned Depth = 0);			unsigned Depth = 0);

	/// Return true if the floating-point scalar value is not a NaN or if the			/// Return true if the floating-point scalar value is not a NaN or if the
	/// floating-point vector value has no NaN elements. Return false if a value			/// floating-point vector value has no NaN elements. Return false if a value
	/// could ever be NaN.			/// could ever be NaN.
	bool isKnownNeverNaN(const Value *V, const TargetLibraryInfo *TLI,			bool isKnownNeverNaN(const Value *V, const TargetLibraryInfo *TLI,
	unsigned Depth = 0);			unsigned Depth = 0);

	/// Return true if we can prove that the specified FP value's sign bit is 0.			/// Return true if we can prove that the specified FP value's sign bit is 0.
	///			///
	/// NaN --> true/false (depending on the NaN's sign bit)			/// NaN --> true/false (depending on the NaN's sign bit)
	/// +0 --> true			/// +0 --> true
	/// -0 --> false			/// -0 --> false
	/// x > +0 --> true			/// x > +0 --> true
	/// x < -0 --> false			/// x < -0 --> false
	bool SignBitMustBeZero(const Value *V, const TargetLibraryInfo *TLI);			bool SignBitMustBeZero(const Value *V, const TargetLibraryInfo *TLI);

	/// If the specified value can be set by repeating the same byte in memory,			/// If the specified value can be set by repeating the same byte in memory,
	/// return the i8 value that it is represented with. This is true for all i8			/// return the i8 value that it is represented with. This is true for all i8
	/// values obviously, but is also true for i32 0, i32 -1, i16 0xF0F0, double			/// values obviously, but is also true for i32 0, i32 -1, i16 0xF0F0, double
	/// 0.0 etc. If the value can't be handled with a repeated byte store (e.g.			/// 0.0 etc. If the value can't be handled with a repeated byte store (e.g.
	/// i16 0x1234), return null. If the value is entirely undef and padding,			/// i16 0x1234), return null. If the value is entirely undef and padding,
	/// return undef.			/// return undef.
	Value *isBytewiseValue(Value *V, const DataLayout &DL);			Value *isBytewiseValue(Value *V, const DataLayout &DL);

	/// Given an aggregate and an sequence of indices, see if the scalar value			/// Given an aggregate and an sequence of indices, see if the scalar value
	/// indexed is already around as a register, for example if it were inserted			/// indexed is already around as a register, for example if it were inserted
	/// directly into the aggregate.			/// directly into the aggregate.
	///			///
	/// If InsertBefore is not null, this function will duplicate (modified)			/// If InsertBefore is not null, this function will duplicate (modified)
	/// insertvalues when a part of a nested struct is extracted.			/// insertvalues when a part of a nested struct is extracted.
	Value *FindInsertedValue(Value *V,			Value *FindInsertedValue(Value *V, ArrayRef<unsigned> idx_range,
	ArrayRef<unsigned> idx_range,
	Instruction *InsertBefore = nullptr);			Instruction *InsertBefore = nullptr);

	/// Analyze the specified pointer to see if it can be expressed as a base			/// Analyze the specified pointer to see if it can be expressed as a base
	/// pointer plus a constant offset. Return the base and offset to the caller.			/// pointer plus a constant offset. Return the base and offset to the caller.
	///			///
	/// This is a wrapper around Value::stripAndAccumulateConstantOffsets that			/// This is a wrapper around Value::stripAndAccumulateConstantOffsets that
	/// creates and later unpacks the required APInt.			/// creates and later unpacks the required APInt.
	inline Value *GetPointerBaseWithConstantOffset(Value *Ptr, int64_t &Offset,			inline Value *GetPointerBaseWithConstantOffset(Value *Ptr, int64_t &Offset,
	const DataLayout &DL,			const DataLayout &DL,
	bool AllowNonInbounds = true) {			bool AllowNonInbounds = true) {
	APInt OffsetAPInt(DL.getIndexTypeSizeInBits(Ptr->getType()), 0);			APInt OffsetAPInt(DL.getIndexTypeSizeInBits(Ptr->getType()), 0);
	Value *Base =			Value *Base =
	Ptr->stripAndAccumulateConstantOffsets(DL, OffsetAPInt, AllowNonInbounds);			Ptr->stripAndAccumulateConstantOffsets(DL, OffsetAPInt, AllowNonInbounds);

	Offset = OffsetAPInt.getSExtValue();			Offset = OffsetAPInt.getSExtValue();
	return Base;			return Base;
	}			}
	inline const Value *			inline const Value *
	GetPointerBaseWithConstantOffset(const Value *Ptr, int64_t &Offset,			GetPointerBaseWithConstantOffset(const Value *Ptr, int64_t &Offset,
	const DataLayout &DL,			const DataLayout &DL,
	bool AllowNonInbounds = true) {			bool AllowNonInbounds = true) {
	return GetPointerBaseWithConstantOffset(const_cast<Value *>(Ptr), Offset, DL,			return GetPointerBaseWithConstantOffset(const_cast<Value *>(Ptr), Offset, DL,
	AllowNonInbounds);			AllowNonInbounds);
	}			}
	[573 lines not shown]

llvm/lib/Analysis/BasicAliasAnalysis.cpp

[1,221 lines not shown]	for (unsigned i = 0, e = DecompGEP1.VarIndices.size(); i != e; ++i) {

if (i == 0)		if (i == 0)
GCD = ScaleForGCD.abs();		GCD = ScaleForGCD.abs();
else		else
GCD = APIntOps::GreatestCommonDivisor(GCD, ScaleForGCD.abs());		GCD = APIntOps::GreatestCommonDivisor(GCD, ScaleForGCD.abs());

ConstantRange CR = computeConstantRange(Index.Val.V, /* ForSigned */ false,		ConstantRange CR = computeConstantRange(Index.Val.V, /* ForSigned */ false,
true, &AC, Index.CxtI);		true, &AC, Index.CxtI);
KnownBits Known =		KnownBits Known = computeKnownBits(Index.Val.V, DL, 0, &AC, Index.CxtI, DT,
computeKnownBits(Index.Val.V, DL, 0, &AC, Index.CxtI, DT);		/*ORE=*/nullptr, /*UseInstrInfo=*/true,
		&AAQI.KnownBitsCache);
CR = CR.intersectWith(		CR = CR.intersectWith(
ConstantRange::fromKnownBits(Known, /* Signed */ true),		ConstantRange::fromKnownBits(Known, /* Signed */ true),
ConstantRange::Signed);		ConstantRange::Signed);
CR = Index.Val.evaluateWith(CR).sextOrTrunc(OffsetRange.getBitWidth());		CR = Index.Val.evaluateWith(CR).sextOrTrunc(OffsetRange.getBitWidth());

assert(OffsetRange.getBitWidth() == Scale.getBitWidth() &&		assert(OffsetRange.getBitWidth() == Scale.getBitWidth() &&
"Bit widths are normalized to MaxIndexSize");		"Bit widths are normalized to MaxIndexSize");
if (Index.IsNSW)		if (Index.IsNSW)
[679 lines not shown]

llvm/lib/Analysis/ValueTracking.cpp


//===- ValueTracking.cpp - Walk computations to compute properties --------===//		//===- ValueTracking.cpp - Walk computations to compute properties --------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file contains routines that help analyze properties that chains of		// This file contains routines that help analyze properties that chains of
// computations have.		// computations have.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"

		#include <algorithm>
		#include <cassert>
		#include <cstdint>
		#include <utility>

#include "llvm/ADT/APFloat.h"		#include "llvm/ADT/APFloat.h"
#include "llvm/ADT/APInt.h"		#include "llvm/ADT/APInt.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
		#include "llvm/ADT/Cleanup.h"
#include "llvm/ADT/None.h"		#include "llvm/ADT/None.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallSet.h"		#include "llvm/ADT/SmallSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/iterator_range.h"		#include "llvm/ADT/iterator_range.h"
[39 lines not shown]
#include "llvm/IR/User.h"		#include "llvm/IR/User.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/KnownBits.h"		#include "llvm/Support/KnownBits.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <utility>

using namespace llvm;		using namespace llvm;
using namespace llvm::PatternMatch;		using namespace llvm::PatternMatch;

// Controls the number of uses of the value searched for possible		// Controls the number of uses of the value searched for possible
// dominating comparisons.		// dominating comparisons.
static cl::opt<unsigned> DomConditionsMaxUses("dom-conditions-max-uses",		static cl::opt<unsigned> DomConditionsMaxUses("dom-conditions-max-uses",
cl::Hidden, cl::init(20));		cl::Hidden, cl::init(20));
[28 lines not shown]
	struct Query {
AssumptionCache *AC;		AssumptionCache *AC;
const Instruction *CxtI;		const Instruction *CxtI;
const DominatorTree *DT;		const DominatorTree *DT;

// Unlike the other analyses, this may be a nullptr because not all clients		// Unlike the other analyses, this may be a nullptr because not all clients
// provide it currently.		// provide it currently.
OptimizationRemarkEmitter *ORE;		OptimizationRemarkEmitter *ORE;

		// Optional cache.
		KnownBitsCache *Cache = nullptr;

/// If true, it is safe to use metadata during simplification.		/// If true, it is safe to use metadata during simplification.
InstrInfoQuery IIQ;		InstrInfoQuery IIQ;

Query(const DataLayout &DL, AssumptionCache *AC, const Instruction *CxtI,		Query(const DataLayout &DL, AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo,		const DominatorTree *DT, bool UseInstrInfo,
OptimizationRemarkEmitter *ORE = nullptr)		OptimizationRemarkEmitter *ORE = nullptr,
: DL(DL), AC(AC), CxtI(CxtI), DT(DT), ORE(ORE), IIQ(UseInstrInfo) {}		KnownBitsCache *Cache = nullptr)
		: DL(DL),
		AC(AC),
		CxtI(CxtI),
		DT(DT),
		ORE(ORE),
		Cache(Cache),
		IIQ(UseInstrInfo) {}
};		};

} // end anonymous namespace		} // end anonymous namespace

// Given the provided Value and, potentially, a context instruction, return		// Given the provided Value and, potentially, a context instruction, return
// the preferred context instruction (if any).		// the preferred context instruction (if any).
static const Instruction *safeCxtI(const Value *V, const Instruction *CxtI) {		static const Instruction *safeCxtI(const Value *V, const Instruction *CxtI) {
// If we've been provided with a context instruction, then use that (provided		// If we've been provided with a context instruction, then use that (provided
[82 lines not shown]
	APInt DemandedElts =
FVTy ? APInt::getAllOnes(FVTy->getNumElements()) : APInt(1, 1);		FVTy ? APInt::getAllOnes(FVTy->getNumElements()) : APInt(1, 1);
computeKnownBits(V, DemandedElts, Known, Depth, Q);		computeKnownBits(V, DemandedElts, Known, Depth, Q);
}		}

void llvm::computeKnownBits(const Value *V, KnownBits &Known,		void llvm::computeKnownBits(const Value *V, KnownBits &Known,
const DataLayout &DL, unsigned Depth,		const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT,		const DominatorTree *DT,
OptimizationRemarkEmitter *ORE, bool UseInstrInfo) {		OptimizationRemarkEmitter *ORE, bool UseInstrInfo,
		KnownBitsCache *Cache) {
::computeKnownBits(V, Known, Depth,		::computeKnownBits(V, Known, Depth,
Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, Cache));
}		}

void llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,		void llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,
KnownBits &Known, const DataLayout &DL,		KnownBits &Known, const DataLayout &DL,
unsigned Depth, AssumptionCache *AC,		unsigned Depth, AssumptionCache *AC,
const Instruction *CxtI, const DominatorTree *DT,		const Instruction *CxtI, const DominatorTree *DT,
OptimizationRemarkEmitter *ORE, bool UseInstrInfo) {		OptimizationRemarkEmitter *ORE, bool UseInstrInfo,
		KnownBitsCache *Cache) {
::computeKnownBits(V, DemandedElts, Known, Depth,		::computeKnownBits(V, DemandedElts, Known, Depth,
Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, Cache));
}		}

static KnownBits computeKnownBits(const Value *V, const APInt &DemandedElts,		static KnownBits computeKnownBits(const Value *V, const APInt &DemandedElts,
unsigned Depth, const Query &Q);		unsigned Depth, const Query &Q);

static KnownBits computeKnownBits(const Value *V, unsigned Depth,		static KnownBits computeKnownBits(const Value *V, unsigned Depth,
const Query &Q);		const Query &Q);

KnownBits llvm::computeKnownBits(const Value *V, const DataLayout &DL,		KnownBits llvm::computeKnownBits(const Value *V, const DataLayout &DL,
unsigned Depth, AssumptionCache *AC,		unsigned Depth, AssumptionCache *AC,
const Instruction *CxtI,		const Instruction *CxtI,
const DominatorTree *DT,		const DominatorTree *DT,
OptimizationRemarkEmitter *ORE,		OptimizationRemarkEmitter *ORE,
bool UseInstrInfo) {		bool UseInstrInfo, KnownBitsCache *Cache) {
return ::computeKnownBits(		return ::computeKnownBits(
V, Depth, Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		V, Depth, Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, Cache));
}		}

KnownBits llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,		KnownBits llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,
const DataLayout &DL, unsigned Depth,		const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT,		const DominatorTree *DT,
OptimizationRemarkEmitter *ORE,		OptimizationRemarkEmitter *ORE,
bool UseInstrInfo) {		bool UseInstrInfo, KnownBitsCache* Cache) {
return ::computeKnownBits(		return ::computeKnownBits(
V, DemandedElts, Depth,		V, DemandedElts, Depth,
Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, Cache));
}		}

bool llvm::haveNoCommonBitsSet(const Value *LHS, const Value *RHS,		bool llvm::haveNoCommonBitsSet(const Value *LHS, const Value *RHS,
const DataLayout &DL, AssumptionCache *AC,		const DataLayout &DL, AssumptionCache *AC,
const Instruction *CxtI, const DominatorTree *DT,		const Instruction *CxtI, const DominatorTree *DT,
bool UseInstrInfo) {		bool UseInstrInfo) {
assert(LHS->getType() == RHS->getType() &&		assert(LHS->getType() == RHS->getType() &&
"LHS and RHS should have the same type");		"LHS and RHS should have the same type");
[1,706 lines not shown]
	if (ScalarTy->isPointerTy()) {
assert(BitWidth == Q.DL.getPointerTypeSizeInBits(ScalarTy) &&		assert(BitWidth == Q.DL.getPointerTypeSizeInBits(ScalarTy) &&
"V and Known should have same BitWidth");		"V and Known should have same BitWidth");
} else {		} else {
assert(BitWidth == Q.DL.getTypeSizeInBits(ScalarTy) &&		assert(BitWidth == Q.DL.getTypeSizeInBits(ScalarTy) &&
"V and Known should have same BitWidth");		"V and Known should have same BitWidth");
}		}
#endif		#endif

		bool UseCache = Q.Cache && DemandedElts.isAllOnes();
		auto CacheKey = std::make_pair(V, Q.CxtI ? Q.CxtI->getParent() : nullptr);
		if (UseCache) {
		auto it = Q.Cache->find(CacheKey);
		if (it != Q.Cache->end()) {
		Known = it->second;
		return;
		}
		}

		// Insert `Known` into the cache when this call to computeKnownBits returns.
		Cleanup InsertIntoCacheRAII = [&] {
		if (UseCache) {
		// We can't reuse the `it` that we looked up at the beginning of this
		// function, because Q.Cache is a DenseMap, which does not guarantee
		// iterator stability.
		auto [_, inserted] = Q.Cache->insert({CacheKey, Known});
		(void) inserted;
		assert(inserted);
		}
		};

const APInt *C;		const APInt *C;
if (match(V, m_APInt(C))) {		if (match(V, m_APInt(C))) {
// We know all of the bits for a scalar constant or a splat vector constant!		// We know all of the bits for a scalar constant or a splat vector constant!
Known = KnownBits::makeConstant(*C);		Known = KnownBits::makeConstant(*C);
return;		return;
}		}
// Null and aggregate-zero are all-zeros.		// Null and aggregate-zero are all-zeros.
if (isa<ConstantPointerNull>(V) || isa<ConstantAggregateZero>(V)) {		if (isa<ConstantPointerNull>(V) || isa<ConstantAggregateZero>(V)) {
[5,388 lines not shown]
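
For readers following the ValueTracking.cpp hunk: the new code looks up a (value, context block) key on entry, returns early on a hit, and otherwise registers a scope guard that stores `Known` when the function returns by any path. The standalone sketch below illustrates that pattern only. It is not the patch's code: `Bits`, `BitsCache`, `ScopeExit`, `computeBits`, and `NumComputations` are hypothetical stand-ins, `std::map` stands in for the DenseMap used in the patch, and `ScopeExit` is an assumed approximation of the `Cleanup` class from D134008.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <utility>

// Hypothetical stand-ins; none of these are LLVM types.
struct Bits {
  unsigned Zero = 0;
  unsigned One = 0;
};
using Key = std::pair<int, int>; // (value id, context block id)
using BitsCache = std::map<Key, Bits>;

// Minimal scope guard: runs the stored callable when it goes out of scope.
class ScopeExit {
  std::function<void()> F;

public:
  explicit ScopeExit(std::function<void()> Fn) : F(std::move(Fn)) {}
  ScopeExit(const ScopeExit &) = delete;
  ScopeExit &operator=(const ScopeExit &) = delete;
  ~ScopeExit() { F(); }
};

// Counts how often the "expensive" path runs, so cache hits are observable.
int NumComputations = 0;

void computeBits(int ValueId, int BlockId, Bits &Known, BitsCache *Cache) {
  Key K{ValueId, BlockId};
  if (Cache) {
    auto It = Cache->find(K);
    if (It != Cache->end()) {
      Known = It->second; // Cache hit: skip the computation entirely.
      return;
    }
  }

  // Store the result on every exit path below, including early returns.
  ScopeExit Store([&] {
    if (Cache)
      Cache->emplace(K, Known);
  });

  ++NumComputations;
  Known = Bits{};
  if (ValueId % 2 == 0) {
    Known.Zero = 1; // Toy rule: even ids have a known-zero low bit.
    return;         // Early return; the guard still stores the result.
  }
  Known.One = 1;
}
```

Calling `computeBits` twice with the same ids and the same cache should run the computation once; changing only the block id yields a separate entry. The sketch keys on the block rather than the exact context instruction, mirroring the patch, which is the granularity the inline review comment about isValidAssumeForContext() calls into question.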