This is an archive of the discontinued LLVM Phabricator instance.

Differential D87342

Allow targets to augment computeKnownBits with their analysis using TargetTransformInfo
Abandoned · Public

Authored by qcolombet on Sep 8 2020, 8:32 PM.

Details

Reviewers

fhahn
nikic
aqjune
spatel
lebedev.ri
RKSimon
efriedma

Summary

This is a proof of concept that shows how we could have the targets provide more information for computeKnownBits analysis.

This was motivated by the discussion in D86364, where we could make the computeKnownBits analysis smarter, but it seems that the compile-time cost would not be worth it for all the targets. In other words, this patch shows how we could allow the targets to put some extra effort into the computeKnownBits analysis.
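
The shape of the proposal can be sketched outside of LLVM. This is a minimal, self-contained illustration and not the code from the patch: `ToyKnownBits`, `ToyValue`, `TargetHook`, `AlignedTarget`, and `toyComputeKnownBits` are all hypothetical names. Only the overall idea is taken from the summary: a generic analysis that optionally consults a target-provided hook and behaves exactly as before when no hook is passed.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for llvm::KnownBits: a bit set in Zero is known to
// be 0, a bit set in One is known to be 1.
struct ToyKnownBits {
  uint64_t Zero = 0;
  uint64_t One = 0;
};

// Hypothetical stand-in for an IR value: either a constant or an opaque
// value that the generic analysis knows nothing about.
struct ToyValue {
  bool IsConstant;
  uint64_t Constant;  // Only meaningful when IsConstant is true.
  unsigned TargetTag; // Opaque tag that a target hook may recognise.
};

// Hypothetical hook playing the role of the proposed TTI entry point.
// Returns true if the target analysed V, false otherwise.
struct TargetHook {
  virtual ~TargetHook() = default;
  virtual bool computeKnownBits(const ToyValue &V, ToyKnownBits &Known) const {
    return false; // Default: the target adds nothing.
  }
};

// Example target (an assumption for this sketch): values tagged 1 are
// always multiples of 16, so their low four bits are known to be zero.
struct AlignedTarget : TargetHook {
  bool computeKnownBits(const ToyValue &V, ToyKnownBits &Known) const override {
    if (V.TargetTag != 1)
      return false;
    Known.Zero |= 0xF;
    return true;
  }
};

// Generic analysis. With TTI == nullptr the result is identical to an
// analysis that has no hook at all.
ToyKnownBits toyComputeKnownBits(const ToyValue &V,
                                 const TargetHook *TTI = nullptr) {
  ToyKnownBits Known;
  if (V.IsConstant) {
    Known.One = V.Constant;
    Known.Zero = ~V.Constant;
    return Known;
  }
  if (TTI)
    TTI->computeKnownBits(V, Known); // Let the target refine the result.
  assert((Known.Zero & Known.One) == 0 && "conflicting known bits");
  return Known;
}
```

Calling `toyComputeKnownBits(V)` without a hook leaves an opaque value fully unknown, while passing `&Target` for an `AlignedTarget` marks the low four bits of a value tagged 1 as zero.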

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	90 ms	linux > LLVM.CodeGen/AMDGPU::opt-pipeline.ll
	60 ms	windows > LLVM.CodeGen/AArch64/GlobalISel::combine-trunc.mir
	120 ms	windows > LLVM.CodeGen/AMDGPU::opt-pipeline.ll

Event Timeline

qcolombet created this revision. Sep 8 2020, 8:32 PM

Herald added a project: Restricted Project. Sep 8 2020, 8:32 PM

Herald added subscribers: kerbowa, hiraditya, nhaehnle and 2 others.

qcolombet requested review of this revision. Sep 8 2020, 8:32 PM

Harbormaster completed remote builds in B71033: Diff 290629. Sep 8 2020, 8:32 PM

The bulk of the changes are plumbing TTI into the computeKnownBits APIs. On top of that, clang-format did its magic and there are a lot of changes.

llvm/unittests/Analysis/ValueTrackingTest.cpp
1124	This illustrates how the target can put some extra effort into some instructions.
1248	Note: This test illustrates how it would work with a custom TTI but is not commit-able as is because it relies on `BasicTTIImplBase`, which is part of the CodeGen library.

qcolombet mentioned this in D86364: [ValueTracking] Interpret GEPs as a series of adds multiplied by the related scaling factor. Sep 8 2020, 8:39 PM

Use TargetTransformInfoImplCRTPBase instead of BasicTTIImpl to avoid requiring the CodeGen library for the unit tests

llvm/unittests/Analysis/ValueTrackingTest.cpp
1248	Fixed that part.

Harbormaster completed remote builds in B71037: Diff 290634. Sep 8 2020, 9:05 PM

Clean up the proof of concept:

Add the proper includes
Fixup clang-format acting up

Harbormaster completed remote builds in B71588: Diff 291634. Sep 14 2020, 12:54 PM

Gentle ping @nikic.

nikic added reviewers: spatel, lebedev.ri. Sep 15 2020, 11:43 AM

spatel added a reviewer: RKSimon. Sep 15 2020, 12:17 PM

Remove const_cast that was a left over from the proof of concept

Harbormaster completed remote builds in B71787: Diff 292026. Sep 15 2020, 3:27 PM

Fix a few call sites where I was passing TTI instead of the boolean for UseInstrInfo (it doesn't help that the compiler didn't warn on these).

Now, it should really be NFC, unless you set the TTI, like in the unit test.
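
The call-site mistake mentioned in this update compiles silently because a pointer converts implicitly to `bool`. The stand-alone sketch below reproduces the pitfall with hypothetical names (`ToyTTI`, `analyze`, `analyzeChecked`); none of them come from the patch. It also shows one possible guard, a deleted overload, which is offered purely as an illustration and is not something the patch does.

```cpp
#include <cassert>

struct ToyTTI {};

// Mirrors the shape of the extended API: a boolean followed by an optional
// pointer. The return value encodes how the arguments were bound:
// bit 0 = UseInstrInfo, bit 1 = TTI is non-null.
int analyze(bool UseInstrInfo = true, const ToyTTI *TTI = nullptr) {
  return (UseInstrInfo ? 1 : 0) | (TTI ? 2 : 0);
}

// Guarded variant with the same parameters.
int analyzeChecked(bool UseInstrInfo = true, const ToyTTI *TTI = nullptr) {
  return analyze(UseInstrInfo, TTI);
}

// Deleted overload: when the first argument is a ToyTTI pointer, this is a
// better match than the pointer-to-bool conversion, so the call is rejected
// at compile time instead of silently dropping the pointer.
int analyzeChecked(const ToyTTI *, const ToyTTI * = nullptr) = delete;
```

With a `const ToyTTI *P`, the call `analyze(P)` binds `P` to `UseInstrInfo` and leaves `TTI` null, whereas `analyzeChecked(P)` fails to compile; `analyzeChecked(true, P)` is accepted.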

Harbormaster completed remote builds in B71797: Diff 292053. Sep 15 2020, 5:24 PM

Ping!

Gentle ping @nikic, @aqjune, @spatel, @lebedev.ri, @RKSimon.

Being able to have the target provide more information is very useful to us.
This commit is NFC (or at least it should be if I didn't screw something up) when the target doesn't override the computeKnownBits method or when the TTI is not passed around.

Ping @fhahn, @nikic, @aqjune, @spatel, @lebedev.ri, @RKSimon

It's been almost a month and nobody commented on the approach.

Note again that the patch is NFC when the TTI is not set. Is it okay to proceed?

Sorry for the delay @qcolombet - I think the sheer size of the patch kept putting me off! But you're right, most of it is trivial plumbing. There are a few bits that it'd be great to (pre-)commit separately if possible.

llvm/include/llvm/Analysis/TargetTransformInfo.h
52	sorting
llvm/lib/Analysis/ValueTracking.cpp
468	Pull this out into its own commit
llvm/unittests/Analysis/ValueTrackingTest.cpp
1127	These 2 tests above look like they can be pre-committed?

This revision is now accepted and ready to land. Oct 5 2020, 11:54 AM

I do have a concern about the general direction here: When we allowed targets to hook into InstCombine, one of the hard design constraints was that the target is only allowed to affect combines for target intrinsics, and nothing else. This patch seems to go against that restriction by allowing to replace general analysis behavior and affect otherwise target-independent folds. Maybe @spatel and @lebedev.ri have something to say regarding that.

In D87342#2312464, @nikic wrote:

I do have a concern about the general direction here: When we allowed targets to hook into InstCombine, one of the hard design constraints was that the target is only allowed to affect combines for target intrinsics, and nothing else. This patch seems to go against that restriction by allowing to replace general analysis behavior and affect otherwise target-independent folds. Maybe @spatel and @lebedev.ri have something to say regarding that.

Yep, i don't really like that.

In D87342#2312464, @nikic wrote:

I do have a concern about the general direction here: When we allowed targets to hook into InstCombine, one of the hard design constraints was that the target is only allowed to affect combines for target intrinsics, and nothing else. This patch seems to go against that restriction by allowing to replace general analysis behavior and affect otherwise target-independent folds. Maybe @spatel and @lebedev.ri have something to say regarding that.

Sorry for the delayed reply. I skimmed the earlier patch and this one now, and I agree, this could open the door to unintended behavior for seemingly target-independent passes.
It's not clear to me if there's anything other than the GEP example planning to use an override. Can we create a dedicated GEP analysis function that would limit the compile-time impact? Or could we use/create some option that would enable the more expensive analysis selectively?

reopening

This revision now requires changes to proceed. Oct 5 2020, 1:51 PM

@nikic, @spatel, @lebedev.ri, @RKSimon, thank you for your feedback!

I do have a concern about the general direction here: When we allowed targets to hook into InstCombine, one of the hard design constraints was that the target is only allowed to affect combines for target intrinsics, and nothing else. This patch seems to go against that restriction by allowing to replace general analysis behavior and affect otherwise target-independent folds.

To be honest, I don't really like it either, but given that we seem to be the only ones who care about precise GEPs, I don't see how we can reconcile both constraints.

That said, this patch doesn't affect the combines per se, since it doesn't modify instcombine. It only affects the computeKnownBits analysis (i.e., it is a more complicated version of what @spatel is suggesting in spirit with a dedicated GEP analysis for the precise case), and I was not planning to have instcombine take advantage of this. But yes, this opens that door.

It's not clear to me if there's anything other than the GEP example planning to use an override.

I personally don't plan to have any other override here, but that doesn't mean that other needs won't arise in the future.

Can we create a dedicated GEP analysis function that would limit the compile-time impact?

What do you have in mind?

Or could we use/create some option that would enable the more expensive analysis selectively?

I have to admit I am not a fan of that because it opens the door to having to do the same for pretty much everything that people would consider too expensive for their target. Put differently, why are GEPs different?

Cheers,
-Quentin

qcolombet added inline comments. Oct 5 2020, 1:54 PM

llvm/unittests/Analysis/ValueTrackingTest.cpp
1127	Correct.

TargetTransformInfo exists because some IR transforms really don't work without the extra information. For example, vectorization doesn't make sense without some target-specific parameters. But the further we go down the path of target-specific tweaks, the harder it becomes to make any changes at all: you end up with bugs that can only be reproduced on specific targets, and the impacts on compile time and code quality become harder to predict. This is why, for example, it's really hard to make changes to DAGCombine.

I think changing computeKnownBits to allow arbitrary target-controlled computation is going too far on the side of unpredictability; it's used by a bunch of passes in contexts you wouldn't really expect to behave in target-specific ways.

In D87342#2312775, @qcolombet wrote:

Can we create a dedicated GEP analysis function that would limit the compile-time impact?

What do you have in mind?

I didn't check the details, so this might be DOA...but I see that we have things like llvm::computeOverflowForSignedAdd() that are specializations/wrappers around the generic computeKnownBits(). If the motivating cases all start with a gep, then could we add the extra logic to a wrapper around the regular computeKnownBits()? That wrapper is only called from places like InstCombinerImpl::visitGetElementPtrInst(), so it would limit the compile-time increase?
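
The wrapper idea in this comment can be sketched as follows. This is a toy model under stated assumptions, not LLVM code: `ToyGEP`, `toyCountTrailingZeros`, `genericKnownTrailingZeros`, and `gepKnownTrailingZeros` are hypothetical names, the only fact tracked is the number of low bits known to be zero, and the "generic" analysis is deliberately made to know nothing so that the extra work done by the wrapper is visible.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical toy GEP of the form: Base + Index * Scale.
struct ToyGEP {
  unsigned BaseTrailingZeros; // Low bits of Base known to be zero.
  unsigned Scale;             // Element size in bytes; must be non-zero.
};

// Number of trailing zero bits in a non-zero integer.
unsigned toyCountTrailingZeros(unsigned X) {
  assert(X != 0 && "undefined for zero");
  unsigned N = 0;
  while ((X & 1u) == 0) {
    X >>= 1;
    ++N;
  }
  return N;
}

// Stand-in for the cheap generic analysis used by every pass. In this toy
// model it does not look through the scaling at all.
unsigned genericKnownTrailingZeros(const ToyGEP &) { return 0; }

// Specialised wrapper, meant to be called only from the few places that
// can afford the extra work. Index * Scale has at least tz(Scale) low zero
// bits, so the sum has at least min(tz(Base), tz(Scale)) low zero bits.
unsigned gepKnownTrailingZeros(const ToyGEP &G) {
  unsigned Cheap = genericKnownTrailingZeros(G);
  unsigned Precise =
      std::min(G.BaseTrailingZeros, toyCountTrailingZeros(G.Scale));
  return std::max(Cheap, Precise);
}
```

The design point is that callers of `genericKnownTrailingZeros` pay nothing extra; only call sites that switch to `gepKnownTrailingZeros` incur the additional cost.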

If the motivating cases all start with a gep, then could we add the extra logic to a wrapper around the regular computeKnownBits()?

Unfortunately the motivating example doesn't start with a gep, or at least, there is no guarantee it would start with a gep.

Anyway, I can already push the NFC changes like the added test cases and refactoring of KnownBits.

In the meantime, any other idea on how we could have GEPs being more precise?
Should I resurrect https://reviews.llvm.org/D86364?

In D87342#2314787, @qcolombet wrote:

If the motivating cases all start with a gep, then could we add the extra logic to a wrapper around the regular computeKnownBits()?

Unfortunately the motivating example doesn't start with a gep, or at least, there is no guarantee it would start with a gep.

Anyway, I can already push the NFC changes like the added test cases and refactoring of KnownBits.

Sure, having those in place will help to focus on the real diffs.

In the meantime, any other idea on how we could have GEPs being more precise?
Should I resurrect https://reviews.llvm.org/D86364?

Ok - I think that's the theoretically/philosophically right way to proceed (target-independent). I don't have any ideas yet on how to limit the cost, but maybe we can collectively find some more savings in an update of that patch.

I don't have any ideas yet on how to limit the cost, but maybe we can collectively find some more savings in an update of that patch.

Sounds like a plan :).

Thanks!

Added the tests in https://reviews.llvm.org/D88934 (without the gep tests since they wouldn't produce the expected results.)

(I'll add the gep test in a separate PR to demonstrate the lack of precision for these.)

Refactored the compute known bits for mul in https://reviews.llvm.org/D88935
Added the sextOrTrunc method to KnownBits https://reviews.llvm.org/D88937

Abandoning this diff in favor of D86364 based on our discussion.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

TargetTransformInfo.h

24 lines

TargetTransformInfoImpl.h

9 lines

ValueTracking.h

109 lines

CodeGen/

BasicTTIImpl.h

8 lines

Support/

KnownBits.h

12 lines

lib/

Analysis/

TargetTransformInfo.cpp

8 lines

ValueTracking.cpp

306 lines

Support/

KnownBits.cpp

79 lines

Target/

AMDGPU/

AMDGPUTargetTransformInfo.cpp

2 lines

unittests/

Analysis/

ValueTrackingTest.cpp

228 lines

Diff 292053

llvm/include/llvm/Analysis/TargetTransformInfo.h

(43 lines not shown)
class GlobalValue;		class GlobalValue;
class InstCombiner;		class InstCombiner;
class IntrinsicInst;		class IntrinsicInst;
class LoadInst;		class LoadInst;
class LoopAccessInfo;		class LoopAccessInfo;
class Loop;		class Loop;
class LoopInfo;		class LoopInfo;
class ProfileSummaryInfo;		class ProfileSummaryInfo;
		class OptimizationRemarkEmitter;
		RKSimon (inline comment): sorting
class SCEV;		class SCEV;
class ScalarEvolution;		class ScalarEvolution;
class StoreInst;		class StoreInst;
class SwitchInst;		class SwitchInst;
class TargetLibraryInfo;		class TargetLibraryInfo;
class Type;		class Type;
class User;		class User;
class Value;		class Value;
(1,260 lines not shown)
/// in hardware. (see LLVM Language Reference - "Vector Predication		/// in hardware. (see LLVM Language Reference - "Vector Predication
/// Intrinsics") Use of %evl is discouraged when that is not the case.		/// Intrinsics") Use of %evl is discouraged when that is not the case.
bool hasActiveVectorLength() const;		bool hasActiveVectorLength() const;

/// @}		/// @}

/// @}		/// @}

		/// Improve \p Known for \p V with target specific known bits analysis.
		/// \returns true if the target analyzed \p V, false otherwise.
		bool computeKnownBits(const Value *V, KnownBits &Known, const DataLayout &DL,
		unsigned Depth, AssumptionCache *AC,
		const Instruction *CxtI, const DominatorTree *DT,
		OptimizationRemarkEmitter *ORE,
		bool UseInstrInfo) const;

private:		private:
/// Estimate the latency of specified instruction.		/// Estimate the latency of specified instruction.
/// Returns 1 as the default value.		/// Returns 1 as the default value.
int getInstructionLatency(const Instruction *I) const;		int getInstructionLatency(const Instruction *I) const;

/// Returns the expected throughput cost of the instruction.		/// Returns the expected throughput cost of the instruction.
/// Returns -1 if the cost is unknown.		/// Returns -1 if the cost is unknown.
int getInstructionThroughput(const Instruction *I) const;		int getInstructionThroughput(const Instruction *I) const;
(263 lines not shown; context: public:)
virtual bool preferInLoopReduction(unsigned Opcode, Type *Ty,		virtual bool preferInLoopReduction(unsigned Opcode, Type *Ty,
ReductionFlags) const = 0;		ReductionFlags) const = 0;
virtual bool preferPredicatedReductionSelect(unsigned Opcode, Type *Ty,		virtual bool preferPredicatedReductionSelect(unsigned Opcode, Type *Ty,
ReductionFlags) const = 0;		ReductionFlags) const = 0;
virtual bool shouldExpandReduction(const IntrinsicInst *II) const = 0;		virtual bool shouldExpandReduction(const IntrinsicInst *II) const = 0;
virtual unsigned getGISelRematGlobalCost() const = 0;		virtual unsigned getGISelRematGlobalCost() const = 0;
virtual bool hasActiveVectorLength() const = 0;		virtual bool hasActiveVectorLength() const = 0;
virtual int getInstructionLatency(const Instruction *I) = 0;		virtual int getInstructionLatency(const Instruction *I) = 0;
		virtual bool
		computeKnownBits(const TargetTransformInfo *TTI, const Value *V,
		KnownBits &Known, const DataLayout &DL, unsigned Depth,
		AssumptionCache *AC, const Instruction *CxtI,
		const DominatorTree *DT, OptimizationRemarkEmitter *ORE,
		bool UseInstrInfo) const = 0;
};		};

template <typename T>		template <typename T>
class TargetTransformInfo::Model final : public TargetTransformInfo::Concept {		class TargetTransformInfo::Model final : public TargetTransformInfo::Concept {
T Impl;		T Impl;

public:		public:
Model(T Impl) : Impl(std::move(Impl)) {}		Model(T Impl) : Impl(std::move(Impl)) {}
(503 lines not shown; context: public:)

bool hasActiveVectorLength() const override {		bool hasActiveVectorLength() const override {
return Impl.hasActiveVectorLength();		return Impl.hasActiveVectorLength();
}		}

int getInstructionLatency(const Instruction *I) override {		int getInstructionLatency(const Instruction *I) override {
return Impl.getInstructionLatency(I);		return Impl.getInstructionLatency(I);
}		}

		bool computeKnownBits(const TargetTransformInfo *TTI, const Value *V,
		KnownBits &Known, const DataLayout &DL, unsigned Depth,
		AssumptionCache *AC, const Instruction *CxtI,
		const DominatorTree *DT, OptimizationRemarkEmitter *ORE,
		bool UseInstrInfo) const override {
		return Impl.computeKnownBits(TTI, V, Known, DL, Depth, AC, CxtI, DT, ORE,
		UseInstrInfo);
		}
};		};

template <typename T>		template <typename T>
TargetTransformInfo::TargetTransformInfo(T Impl)		TargetTransformInfo::TargetTransformInfo(T Impl)
: TTIImpl(new Model<T>(Impl)) {}		: TTIImpl(new Model<T>(Impl)) {}

/// Analysis pass providing the \c TargetTransformInfo.		/// Analysis pass providing the \c TargetTransformInfo.
///		///
(94 lines not shown)
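
The `Concept`/`Model` forwarding added in this header can be illustrated with a reduced, stand-alone sketch. Everything below is hypothetical (`ToyTTI`, `ToyDefaultImpl`, `ToyEvenImpl`, and the simplified `computeKnownBits(int, unsigned &)` signature); it only mirrors the layering visible in the diff: a public wrapper forwards to a virtual `Concept`, whose templated `Model` forwards to the concrete implementation, and a default implementation returns `false`.

```cpp
#include <cassert>
#include <memory>
#include <utility>

class ToyTTI {
  // Abstract interface hidden behind the public wrapper.
  struct Concept {
    virtual ~Concept() = default;
    virtual bool computeKnownBits(int V, unsigned &KnownZero) const = 0;
  };

  // Adapter that forwards each virtual call to the concrete implementation.
  template <typename T> struct Model final : Concept {
    T Impl;
    explicit Model(T Impl) : Impl(std::move(Impl)) {}
    bool computeKnownBits(int V, unsigned &KnownZero) const override {
      return Impl.computeKnownBits(V, KnownZero);
    }
  };

  std::unique_ptr<Concept> TTIImpl;

public:
  template <typename T>
  explicit ToyTTI(T Impl) : TTIImpl(new Model<T>(std::move(Impl))) {}

  // Public entry point; returns true if the target analysed V.
  bool computeKnownBits(int V, unsigned &KnownZero) const {
    return TTIImpl->computeKnownBits(V, KnownZero);
  }
};

// Default implementation: the target provides no extra information.
struct ToyDefaultImpl {
  bool computeKnownBits(int, unsigned &) const { return false; }
};

// Hypothetical target that knows bit 0 of every even value is zero.
struct ToyEvenImpl : ToyDefaultImpl {
  bool computeKnownBits(int V, unsigned &KnownZero) const {
    if (V % 2 != 0)
      return false;
    KnownZero |= 1u;
    return true;
  }
};
```

Constructing `ToyTTI` from `ToyDefaultImpl{}` yields a hook that never reports anything, which corresponds to the NFC behaviour described in the review; constructing it from `ToyEvenImpl{}` lets the target refine the result.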

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

(14 lines not shown)
#define LLVM_ANALYSIS_TARGETTRANSFORMINFOIMPL_H		#define LLVM_ANALYSIS_TARGETTRANSFORMINFOIMPL_H

#include "llvm/Analysis/ScalarEvolutionExpressions.h"		#include "llvm/Analysis/ScalarEvolutionExpressions.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/Analysis/VectorUtils.h"		#include "llvm/Analysis/VectorUtils.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/GetElementPtrTypeIterator.h"		#include "llvm/IR/GetElementPtrTypeIterator.h"
		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
#include "llvm/IR/Operator.h"		#include "llvm/IR/Operator.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"

namespace llvm {		namespace llvm {

/// Base class for use as a mix-in that aids implementing		/// Base class for use as a mix-in that aids implementing
/// a TargetTransformInfo-compatible class.		/// a TargetTransformInfo-compatible class.
(1,044 lines not shown; context: int getInstructionLatency(const Instruction *I) {)

if (VectorType *VectorTy = dyn_cast<VectorType>(DstTy))		if (VectorType *VectorTy = dyn_cast<VectorType>(DstTy))
DstTy = VectorTy->getElementType();		DstTy = VectorTy->getElementType();
if (DstTy->isFloatingPointTy())		if (DstTy->isFloatingPointTy())
return 3;		return 3;

return 1;		return 1;
}		}

		bool computeKnownBits(const TargetTransformInfo *TTI, const Value *V,
		KnownBits &Known, const DataLayout &DL, unsigned Depth,
		AssumptionCache *AC, const Instruction *CxtI,
		const DominatorTree *DT, OptimizationRemarkEmitter *ORE,
		bool UseInstrInfo) const {
		return false;
		}
};		};
} // namespace llvm		} // namespace llvm

#endif		#endif

llvm/include/llvm/Analysis/ValueTracking.h

(37 lines not shown)
class WithOverflowInst;		class WithOverflowInst;
struct KnownBits;		struct KnownBits;
class Loop;		class Loop;
class LoopInfo;		class LoopInfo;
class MDNode;		class MDNode;
class OptimizationRemarkEmitter;		class OptimizationRemarkEmitter;
class StringRef;		class StringRef;
class TargetLibraryInfo;		class TargetLibraryInfo;
		class TargetTransformInfo;
class Value;		class Value;

constexpr unsigned MaxAnalysisRecursionDepth = 6;		constexpr unsigned MaxAnalysisRecursionDepth = 6;

/// Determine which bits of V are known to be either zero or one and return		/// Determine which bits of V are known to be either zero or one and return
/// them in the KnownZero/KnownOne bit sets.		/// them in the KnownZero/KnownOne bit sets.
///		///
/// This function is defined on values with integer type, values with pointer		/// This function is defined on values with integer type, values with pointer
/// type, and vectors of integers. In the case		/// type, and vectors of integers. In the case
/// where V is a vector, the known zero and known one values are the		/// where V is a vector, the known zero and known one values are the
/// same width as the vector element, and the bit is set only if it is true		/// same width as the vector element, and the bit is set only if it is true
/// for all of the elements in the vector.		/// for all of the elements in the vector.
void computeKnownBits(const Value *V, KnownBits &Known,		void computeKnownBits(const Value *V, KnownBits &Known,
const DataLayout &DL, unsigned Depth = 0,		const DataLayout &DL, unsigned Depth = 0,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
OptimizationRemarkEmitter *ORE = nullptr,		OptimizationRemarkEmitter *ORE = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Determine which bits of V are known to be either zero or one and return		/// Determine which bits of V are known to be either zero or one and return
/// them in the KnownZero/KnownOne bit sets.		/// them in the KnownZero/KnownOne bit sets.
///		///
/// This function is defined on values with integer type, values with pointer		/// This function is defined on values with integer type, values with pointer
/// type, and vectors of integers. In the case		/// type, and vectors of integers. In the case
/// where V is a vector, the known zero and known one values are the		/// where V is a vector, the known zero and known one values are the
/// same width as the vector element, and the bit is set only if it is true		/// same width as the vector element, and the bit is set only if it is true
/// for all of the demanded elements in the vector.		/// for all of the demanded elements in the vector.
void computeKnownBits(const Value *V, const APInt &DemandedElts,		void computeKnownBits(const Value *V, const APInt &DemandedElts,
KnownBits &Known, const DataLayout &DL,		KnownBits &Known, const DataLayout &DL,
unsigned Depth = 0, AssumptionCache *AC = nullptr,		unsigned Depth = 0, AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
OptimizationRemarkEmitter *ORE = nullptr,		OptimizationRemarkEmitter *ORE = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Returns the known bits rather than passing by reference.		/// Returns the known bits rather than passing by reference.
KnownBits computeKnownBits(const Value *V, const DataLayout &DL,		KnownBits computeKnownBits(const Value *V, const DataLayout &DL,
unsigned Depth = 0, AssumptionCache *AC = nullptr,		unsigned Depth = 0, AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
OptimizationRemarkEmitter *ORE = nullptr,		OptimizationRemarkEmitter *ORE = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Returns the known bits rather than passing by reference.		/// Returns the known bits rather than passing by reference.
KnownBits computeKnownBits(const Value *V, const APInt &DemandedElts,		KnownBits computeKnownBits(const Value *V, const APInt &DemandedElts,
const DataLayout &DL, unsigned Depth = 0,		const DataLayout &DL, unsigned Depth = 0,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
OptimizationRemarkEmitter *ORE = nullptr,		OptimizationRemarkEmitter *ORE = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Compute known bits from the range metadata.		/// Compute known bits from the range metadata.
/// \p KnownZero the set of bits that are known to be zero		/// \p KnownZero the set of bits that are known to be zero
/// \p KnownOne the set of bits that are known to be one		/// \p KnownOne the set of bits that are known to be one
void computeKnownBitsFromRangeMetadata(const MDNode &Ranges,		void computeKnownBitsFromRangeMetadata(const MDNode &Ranges,
KnownBits &Known);		KnownBits &Known);

/// Return true if LHS and RHS have no common bits set.		/// Return true if LHS and RHS have no common bits set.
bool haveNoCommonBitsSet(const Value *LHS, const Value *RHS,		bool haveNoCommonBitsSet(const Value *LHS, const Value *RHS,
const DataLayout &DL,		const DataLayout &DL,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Return true if the given value is known to have exactly one bit set when		/// Return true if the given value is known to have exactly one bit set when
/// defined. For vectors return true if every element is known to be a power		/// defined. For vectors return true if every element is known to be a power
/// of two when defined. Supports values with integer or pointer type and		/// of two when defined. Supports values with integer or pointer type and
/// vectors of integers. If 'OrZero' is set, then return true if the given		/// vectors of integers. If 'OrZero' is set, then return true if the given
/// value is either a power of two or zero.		/// value is either a power of two or zero.
bool isKnownToBeAPowerOfTwo(const Value *V, const DataLayout &DL,		bool isKnownToBeAPowerOfTwo(const Value *V, const DataLayout &DL,
bool OrZero = false, unsigned Depth = 0,		bool OrZero = false, unsigned Depth = 0,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

bool isOnlyUsedInZeroEqualityComparison(const Instruction *CxtI);		bool isOnlyUsedInZeroEqualityComparison(const Instruction *CxtI);

/// Return true if the given value is known to be non-zero when defined. For		/// Return true if the given value is known to be non-zero when defined. For
/// vectors, return true if every element is known to be non-zero when		/// vectors, return true if every element is known to be non-zero when
/// defined. For pointers, if the context instruction and dominator tree are		/// defined. For pointers, if the context instruction and dominator tree are
/// specified, perform context-sensitive analysis and return true if the		/// specified, perform context-sensitive analysis and return true if the
/// pointer couldn't possibly be null at the specified instruction.		/// pointer couldn't possibly be null at the specified instruction.
/// Supports values with integer or pointer type and vectors of integers.		/// Supports values with integer or pointer type and vectors of integers.
bool isKnownNonZero(const Value *V, const DataLayout &DL, unsigned Depth = 0,		bool isKnownNonZero(const Value *V, const DataLayout &DL, unsigned Depth = 0,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Return true if the two given values are negation.		/// Return true if the two given values are negation.
/// Currently can recognize Value pair:		/// Currently can recognize Value pair:
/// 1: <X, Y> if X = sub (0, Y) or Y = sub (0, X)		/// 1: <X, Y> if X = sub (0, Y) or Y = sub (0, X)
/// 2: <X, Y> if X = sub (A, B) and Y = sub (B, A)		/// 2: <X, Y> if X = sub (A, B) and Y = sub (B, A)
bool isKnownNegation(const Value *X, const Value *Y, bool NeedNSW = false);		bool isKnownNegation(const Value *X, const Value *Y, bool NeedNSW = false);

/// Returns true if the give value is known to be non-negative.		/// Returns true if the give value is known to be non-negative.
bool isKnownNonNegative(const Value *V, const DataLayout &DL,		bool isKnownNonNegative(const Value *V, const DataLayout &DL,
unsigned Depth = 0,		unsigned Depth = 0,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Returns true if the given value is known be positive (i.e. non-negative		/// Returns true if the given value is known be positive (i.e. non-negative
/// and non-zero).		/// and non-zero).
bool isKnownPositive(const Value *V, const DataLayout &DL, unsigned Depth = 0,		bool isKnownPositive(const Value *V, const DataLayout &DL, unsigned Depth = 0,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Returns true if the given value is known be negative (i.e. non-positive		/// Returns true if the given value is known be negative (i.e. non-positive
/// and non-zero).		/// and non-zero).
bool isKnownNegative(const Value *V, const DataLayout &DL, unsigned Depth = 0,		bool isKnownNegative(const Value *V, const DataLayout &DL, unsigned Depth = 0,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true);

/// Return true if the given values are known to be non-equal when defined.		/// Return true if the given values are known to be non-equal when defined.
/// Supports scalar integer types only.		/// Supports scalar integer types only.
bool isKnownNonEqual(const Value *V1, const Value *V2, const DataLayout &DL,		bool isKnownNonEqual(const Value *V1, const Value *V2, const DataLayout &DL,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Return true if 'V & Mask' is known to be zero. We use this predicate to		/// Return true if 'V & Mask' is known to be zero. We use this predicate to
/// simplify operations downstream. Mask is known to be zero for bits that V		/// simplify operations downstream. Mask is known to be zero for bits that V
/// cannot have.		/// cannot have.
///		///
/// This function is defined on values with integer type, values with pointer		/// This function is defined on values with integer type, values with pointer
/// type, and vectors of integers. In the case		/// type, and vectors of integers. In the case
/// where V is a vector, the mask, known zero, and known one values are the		/// where V is a vector, the mask, known zero, and known one values are the
/// same width as the vector element, and the bit is set only if it is true		/// same width as the vector element, and the bit is set only if it is true
/// for all of the elements in the vector.		/// for all of the elements in the vector.
bool MaskedValueIsZero(const Value *V, const APInt &Mask,		bool MaskedValueIsZero(const Value *V, const APInt &Mask,
const DataLayout &DL,		const DataLayout &DL,
unsigned Depth = 0, AssumptionCache *AC = nullptr,		unsigned Depth = 0, AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);

/// Return the number of times the sign bit of the register is replicated into		/// Return the number of times the sign bit of the register is replicated into
/// the other bits. We know that at least 1 bit is always equal to the sign		/// the other bits. We know that at least 1 bit is always equal to the sign
/// bit (itself), but other cases can give us information. For example,		/// bit (itself), but other cases can give us information. For example,
/// immediately after an "ashr X, 2", we know that the top 3 bits are all		/// immediately after an "ashr X, 2", we know that the top 3 bits are all
/// equal to each other, so we return 3. For vectors, return the number of		/// equal to each other, so we return 3. For vectors, return the number of
/// sign bits for the vector element with the minimum number of known sign		/// sign bits for the vector element with the minimum number of known sign
/// bits.		/// bits.
unsigned ComputeNumSignBits(const Value *Op, const DataLayout &DL,		unsigned ComputeNumSignBits(const Value *Op, const DataLayout &DL,
unsigned Depth = 0, AssumptionCache *AC = nullptr,		unsigned Depth = 0, AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
bool UseInstrInfo = true);		bool UseInstrInfo = true,
		const TargetTransformInfo *TTI = nullptr);
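
The sign-bit count described in the ComputeNumSignBits comment can be checked on concrete values. The following is a minimal sketch, not LLVM's implementation: `countSignBits` and `ashr2` are hypothetical helpers that work on a plain 32-bit integer rather than on an IR value.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper, not part of LLVM: counts how many of the top bits of a
// concrete 32-bit value are copies of the sign bit (the sign bit included).
unsigned countSignBits(int32_t X) {
  uint32_t Bits = static_cast<uint32_t>(X);
  uint32_t Sign = Bits >> 31;
  unsigned N = 1; // The sign bit always equals itself.
  // Walk down from bit 30 while each bit still equals the sign bit.
  for (int I = 30; I >= 0 && ((Bits >> I) & 1u) == Sign; --I)
    ++N;
  return N;
}

// Arithmetic shift right by two, written so that it does not rely on how the
// compiler shifts negative values.
int32_t ashr2(int32_t X) { return X < 0 ? ~(~X >> 2) : X >> 2; }
```

For any input, `countSignBits(ashr2(X))` is at least 3, which matches the "ashr X, 2" case in the comment.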

/// This function computes the integer multiple of Base that equals V. If		/// This function computes the integer multiple of Base that equals V. If
/// successful, it returns true and returns the multiple in Multiple. If		/// successful, it returns true and returns the multiple in Multiple. If
/// unsuccessful, it returns false. Also, if V can be simplified to an		/// unsuccessful, it returns false. Also, if V can be simplified to an
/// integer, then the simplified V is returned in Val. Look through sext only		/// integer, then the simplified V is returned in Val. Look through sext only
/// if LookThroughSExt=true.		/// if LookThroughSExt=true.
bool ComputeMultiple(Value *V, unsigned Base, Value *&Multiple,		bool ComputeMultiple(Value *V, unsigned Base, Value *&Multiple,
bool LookThroughSExt = false,		bool LookThroughSExt = false,
...	enum class OverflowResult {
/// Always overflows in the direction of signed/unsigned max value.		/// Always overflows in the direction of signed/unsigned max value.
AlwaysOverflowsHigh,		AlwaysOverflowsHigh,
/// May or may not overflow.		/// May or may not overflow.
MayOverflow,		MayOverflow,
/// Never overflows.		/// Never overflows.
NeverOverflows,		NeverOverflows,
};		};

OverflowResult computeOverflowForUnsignedMul(const Value *LHS,		OverflowResult computeOverflowForUnsignedMul(
const Value *RHS,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const DataLayout &DL,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
AssumptionCache *AC,		bool UseInstrInfo = true, const TargetTransformInfo *TTI = nullptr);
const Instruction *CxtI,		OverflowResult computeOverflowForSignedMul(
const DominatorTree *DT,		const Value *LHS, const Value *RHS, const DataLayout &DL,
bool UseInstrInfo = true);		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
OverflowResult computeOverflowForSignedMul(const Value *LHS, const Value *RHS,		bool UseInstrInfo = true, const TargetTransformInfo *TTI = nullptr);
const DataLayout &DL,		OverflowResult computeOverflowForUnsignedAdd(
AssumptionCache *AC,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
const DominatorTree *DT,		bool UseInstrInfo = true, const TargetTransformInfo *TTI = nullptr);
bool UseInstrInfo = true);		OverflowResult computeOverflowForSignedAdd(
OverflowResult computeOverflowForUnsignedAdd(const Value *LHS,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const Value *RHS,		AssumptionCache *AC = nullptr, const Instruction *CxtI = nullptr,
const DataLayout &DL,		const DominatorTree *DT = nullptr,
AssumptionCache *AC,		const TargetTransformInfo *TTI = nullptr);
const Instruction *CxtI,
const DominatorTree *DT,
bool UseInstrInfo = true);
OverflowResult computeOverflowForSignedAdd(const Value *LHS, const Value *RHS,
const DataLayout &DL,
AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr);
/// This version also leverages the sign bit of Add if known.		/// This version also leverages the sign bit of Add if known.
OverflowResult computeOverflowForSignedAdd(const AddOperator *Add,		OverflowResult
const DataLayout &DL,		computeOverflowForSignedAdd(const AddOperator *Add, const DataLayout &DL,
AssumptionCache *AC = nullptr,		AssumptionCache *AC = nullptr,
const Instruction *CxtI = nullptr,		const Instruction *CxtI = nullptr,
const DominatorTree *DT = nullptr);		const DominatorTree *DT = nullptr,
OverflowResult computeOverflowForUnsignedSub(const Value *LHS, const Value *RHS,		const TargetTransformInfo *TTI = nullptr);
const DataLayout &DL,		OverflowResult computeOverflowForUnsignedSub(
AssumptionCache *AC,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
const DominatorTree *DT);		const TargetTransformInfo *TTI = nullptr);
OverflowResult computeOverflowForSignedSub(const Value *LHS, const Value *RHS,		OverflowResult
const DataLayout &DL,		computeOverflowForSignedSub(const Value *LHS, const Value *RHS,
AssumptionCache *AC,		const DataLayout &DL, AssumptionCache *AC,
const Instruction *CxtI,		const Instruction *CxtI, const DominatorTree *DT,
const DominatorTree *DT);		const TargetTransformInfo *TTI = nullptr);

/// Returns true if the arithmetic part of the \p WO 's result is		/// Returns true if the arithmetic part of the \p WO 's result is
/// used only along the paths control dependent on the computation		/// used only along the paths control dependent on the computation
/// not overflowing, \p WO being an <op>.with.overflow intrinsic.		/// not overflowing, \p WO being an <op>.with.overflow intrinsic.
bool isOverflowIntrinsicNoWrap(const WithOverflowInst *WO,		bool isOverflowIntrinsicNoWrap(const WithOverflowInst *WO,
const DominatorTree &DT);		const DominatorTree &DT);
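
The header changes above all follow one pattern: a defaulted `const TargetTransformInfo *TTI = nullptr` is appended as the last parameter, so existing callers keep compiling and only callers that have a TTI pass it along. A minimal sketch of that plumbing follows. `ToyTTI`, `ToyQuery`, and both functions are hypothetical stand-ins, and the "extra bits" arithmetic is invented purely to make the effect observable.

```cpp
#include <cassert>

// Hypothetical stand-in for TargetTransformInfo.
struct ToyTTI {
  int ExtraBits;
};

// Hypothetical stand-in for the internal Query struct: it carries the optional
// target hook next to the other analysis settings.
struct ToyQuery {
  bool UseInstrInfo;
  const ToyTTI *TTI;
  ToyQuery(bool UseInstrInfo, const ToyTTI *TTI = nullptr)
      : UseInstrInfo(UseInstrInfo), TTI(TTI) {}
};

// Internal worker: consults the target only when one was supplied.
int knownBitCount(int Generic, const ToyQuery &Q) {
  return Q.TTI ? Generic + Q.TTI->ExtraBits : Generic;
}

// Public entry point: the new trailing parameter defaults to nullptr, so
// call sites written before the change behave exactly as before.
int publicKnownBitCount(int Generic, bool UseInstrInfo = true,
                        const ToyTTI *TTI = nullptr) {
  return knownBitCount(Generic, ToyQuery(UseInstrInfo, TTI));
}
```

A caller written before the change, `publicKnownBitCount(3)`, still compiles and takes the generic path; a caller with a target passes it explicitly as the last argument.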



llvm/include/llvm/CodeGen/BasicTTIImpl.h

...	unsigned getMinMaxReductionCost(VectorType *Ty, VectorType *CondTy,
// So just need a single extractelement.		// So just need a single extractelement.
return ShuffleCost + MinMaxCost +		return ShuffleCost + MinMaxCost +
thisT()->getVectorInstrCost(Instruction::ExtractElement, Ty, 0);		thisT()->getVectorInstrCost(Instruction::ExtractElement, Ty, 0);
}		}

unsigned getVectorSplitCost() { return 1; }		unsigned getVectorSplitCost() { return 1; }

/// @}		/// @}
		bool computeKnownBits(const TargetTransformInfo *TTI, const Value *V,
		KnownBits &Known, const DataLayout &DL, unsigned Depth,
		AssumptionCache *AC, const Instruction *CxtI,
		const DominatorTree *DT, OptimizationRemarkEmitter *ORE,
		bool UseInstrInfo) const {
		return BaseT::computeKnownBits(TTI, V, Known, DL, Depth, AC, CxtI, DT, ORE,
		UseInstrInfo);
		}
};		};

/// Concrete BasicTTIImpl that can be used if no further customization		/// Concrete BasicTTIImpl that can be used if no further customization
/// is needed.		/// is needed.
class BasicTTIImpl : public BasicTTIImplBase<BasicTTIImpl> {		class BasicTTIImpl : public BasicTTIImplBase<BasicTTIImpl> {
using BaseT = BasicTTIImplBase<BasicTTIImpl>;		using BaseT = BasicTTIImplBase<BasicTTIImpl>;

friend class BasicTTIImplBase<BasicTTIImpl>;		friend class BasicTTIImplBase<BasicTTIImpl>;
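
The `computeKnownBits` hook added to `BasicTTIImplBase` only forwards to `BaseT`, which leaves a target free to shadow it with its own version. A rough sketch of that layering follows. All `Toy*` names are hypothetical, and the meaning given to the `bool` result ("the target filled in `Known`") is an assumption, since the diff shown here does not spell it out.

```cpp
#include <cassert>

// Hypothetical 8-bit stand-in for llvm::KnownBits.
struct ToyKnownBits {
  unsigned Zero = 0; // Bits known to be 0.
  unsigned One = 0;  // Bits known to be 1.
};

// Bottom layer: the default hook knows nothing about any value.
struct ToyImplBase {
  bool computeKnownBits(int /*Opcode*/, ToyKnownBits & /*Known*/) const {
    return false;
  }
};

// Middle layer, mirroring the forwarding hook: it simply defers to BaseT.
template <typename T> struct ToyBasicImpl : ToyImplBase {
  using BaseT = ToyImplBase;
  bool computeKnownBits(int Opcode, ToyKnownBits &Known) const {
    return BaseT::computeKnownBits(Opcode, Known);
  }
};

// A target with no extra knowledge inherits the default behaviour.
struct ToyDefaultImpl : ToyBasicImpl<ToyDefaultImpl> {};

// A target that understands one made-up opcode (42) and defers otherwise.
struct ToyTargetImpl : ToyBasicImpl<ToyTargetImpl> {
  using BaseT = ToyBasicImpl<ToyTargetImpl>;
  bool computeKnownBits(int Opcode, ToyKnownBits &Known) const {
    if (Opcode == 42) {
      Known.Zero = 0xF0; // Assumed: the top four bits are known zero.
      return true;
    }
    return BaseT::computeKnownBits(Opcode, Known);
  }
};

// Generic analysis: asks the target first, falls back to "nothing known".
template <typename ImplT> ToyKnownBits analyze(const ImplT &Impl, int Opcode) {
  ToyKnownBits Known;
  if (!Impl.computeKnownBits(Opcode, Known))
    Known = ToyKnownBits();
  return Known;
}
```

With this layout, targets that do not define the hook pay only for one call that returns `false`, which is in line with the compile-time concern raised in the summary.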

llvm/include/llvm/Support/KnownBits.h

...	public:
KnownBits zextOrTrunc(unsigned BitWidth) const {		KnownBits zextOrTrunc(unsigned BitWidth) const {
if (BitWidth > getBitWidth())		if (BitWidth > getBitWidth())
return zext(BitWidth);		return zext(BitWidth);
if (BitWidth < getBitWidth())		if (BitWidth < getBitWidth())
return trunc(BitWidth);		return trunc(BitWidth);
return *this;		return *this;
}		}

		/// Return known bits for a sign extension or truncation of the value we're
		/// tracking.
		KnownBits sextOrTrunc(unsigned BitWidth) const {
		if (BitWidth > getBitWidth())
		return sext(BitWidth);
		if (BitWidth < getBitWidth())
		return trunc(BitWidth);
		return *this;
		}

/// Return a KnownBits with the extracted bits		/// Return a KnownBits with the extracted bits
/// [bitPosition,bitPosition+numBits).		/// [bitPosition,bitPosition+numBits).
KnownBits extractBits(unsigned NumBits, unsigned BitPosition) const {		KnownBits extractBits(unsigned NumBits, unsigned BitPosition) const {
return KnownBits(Zero.extractBits(NumBits, BitPosition),		return KnownBits(Zero.extractBits(NumBits, BitPosition),
One.extractBits(NumBits, BitPosition));		One.extractBits(NumBits, BitPosition));
}		}

/// Return KnownBits based on this, but updated given that the underlying		/// Return KnownBits based on this, but updated given that the underlying
...	public:
}		}

/// Return a subset of the known bits from [bitPosition,bitPosition+numBits).		/// Return a subset of the known bits from [bitPosition,bitPosition+numBits).
KnownBits extractBits(unsigned NumBits, unsigned BitPosition) {		KnownBits extractBits(unsigned NumBits, unsigned BitPosition) {
return KnownBits(Zero.extractBits(NumBits, BitPosition),		return KnownBits(Zero.extractBits(NumBits, BitPosition),
One.extractBits(NumBits, BitPosition));		One.extractBits(NumBits, BitPosition));
}		}

		static KnownBits computeForMul(const KnownBits &LHS, const KnownBits &RHS);

/// Update known bits based on ANDing with RHS.		/// Update known bits based on ANDing with RHS.
KnownBits &operator&=(const KnownBits &RHS);		KnownBits &operator&=(const KnownBits &RHS);

/// Update known bits based on ORing with RHS.		/// Update known bits based on ORing with RHS.
KnownBits &operator|=(const KnownBits &RHS);		KnownBits &operator|=(const KnownBits &RHS);

/// Update known bits based on XORing with RHS.		/// Update known bits based on XORing with RHS.
KnownBits &operator^=(const KnownBits &RHS);		KnownBits &operator^=(const KnownBits &RHS);
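
The `sextOrTrunc` helper added in this file picks `sext`, `trunc`, or the identity depending on the requested width. Its effect on the known-zero and known-one masks can be sketched with a toy type. `ToyKnownBits` is a hypothetical simplification that stores the masks in 64-bit integers instead of `APInt`, so it only covers widths from 1 to 64.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified model of KnownBits for widths 1..64.
struct ToyKnownBits {
  unsigned Width; // Number of tracked bits.
  uint64_t Zero;  // A set bit means the value bit is known to be 0.
  uint64_t One;   // A set bit means the value bit is known to be 1.

  static uint64_t lowMask(unsigned W) {
    return W >= 64 ? ~0ull : ((1ull << W) - 1);
  }

  // Truncation keeps what is known about the surviving low bits.
  ToyKnownBits trunc(unsigned W) const {
    return {W, Zero & lowMask(W), One & lowMask(W)};
  }

  // Sign extension copies what is known about the sign bit into every new
  // high bit; if the sign bit is unknown, the new bits stay unknown.
  ToyKnownBits sext(unsigned W) const {
    uint64_t High = lowMask(W) & ~lowMask(Width);
    uint64_t SignBit = 1ull << (Width - 1);
    ToyKnownBits R{W, Zero, One};
    if (Zero & SignBit)
      R.Zero |= High;
    if (One & SignBit)
      R.One |= High;
    return R;
  }

  // Same three-way dispatch as the helper in the diff.
  ToyKnownBits sextOrTrunc(unsigned W) const {
    if (W > Width)
      return sext(W);
    if (W < Width)
      return trunc(W);
    return *this;
  }
};
```

For instance, an 8-bit value whose sign bit is known zero becomes a 16-bit value whose top nine bits are known zero.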

llvm/lib/Analysis/TargetTransformInfo.cpp

...	int TargetTransformInfo::getInstructionThroughput(const Instruction *I) const {
case Instruction::Call:		case Instruction::Call:
return getUserCost(I, CostKind);		return getUserCost(I, CostKind);
default:		default:
// We don't have any information on this instruction.		// We don't have any information on this instruction.
return -1;		return -1;
}		}
}		}

		bool TargetTransformInfo::computeKnownBits(
		const Value *V, KnownBits &Known, const DataLayout &DL, unsigned Depth,
		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
		OptimizationRemarkEmitter *ORE, bool UseInstrInfo) const {
		return TTIImpl->computeKnownBits(this, V, Known, DL, Depth, AC, CxtI, DT, ORE,
		UseInstrInfo);
		}

TargetTransformInfo::Concept::~Concept() {}		TargetTransformInfo::Concept::~Concept() {}

TargetIRAnalysis::TargetIRAnalysis() : TTICallback(&getDefaultTTI) {}		TargetIRAnalysis::TargetIRAnalysis() : TTICallback(&getDefaultTTI) {}

TargetIRAnalysis::TargetIRAnalysis(		TargetIRAnalysis::TargetIRAnalysis(
std::function<Result(const Function &)> TTICallback)		std::function<Result(const Function &)> TTICallback)
: TTICallback(std::move(TTICallback)) {}		: TTICallback(std::move(TTICallback)) {}


llvm/lib/Analysis/ValueTracking.cpp

#include "llvm/Analysis/AssumeBundleQueries.h"		#include "llvm/Analysis/AssumeBundleQueries.h"
#include "llvm/Analysis/AssumptionCache.h"		#include "llvm/Analysis/AssumptionCache.h"
#include "llvm/Analysis/GuardUtils.h"		#include "llvm/Analysis/GuardUtils.h"
#include "llvm/Analysis/InstructionSimplify.h"		#include "llvm/Analysis/InstructionSimplify.h"
#include "llvm/Analysis/Loads.h"		#include "llvm/Analysis/Loads.h"
#include "llvm/Analysis/LoopInfo.h"		#include "llvm/Analysis/LoopInfo.h"
#include "llvm/Analysis/OptimizationRemarkEmitter.h"		#include "llvm/Analysis/OptimizationRemarkEmitter.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
		#include "llvm/Analysis/TargetTransformInfo.h"
#include "llvm/IR/Argument.h"		#include "llvm/IR/Argument.h"
#include "llvm/IR/Attributes.h"		#include "llvm/IR/Attributes.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/Constant.h"		#include "llvm/IR/Constant.h"
#include "llvm/IR/ConstantRange.h"		#include "llvm/IR/ConstantRange.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/DiagnosticInfo.h"		#include "llvm/IR/DiagnosticInfo.h"
...	struct Query {
/// bits in x, etc. Regarding the mutual recursion, computeKnownBits can call		/// bits in x, etc. Regarding the mutual recursion, computeKnownBits can call
/// isKnownNonZero, which calls computeKnownBits and isKnownToBeAPowerOfTwo		/// isKnownNonZero, which calls computeKnownBits and isKnownToBeAPowerOfTwo
/// (all of which can call computeKnownBits), and so on.		/// (all of which can call computeKnownBits), and so on.
std::array<const Value *, MaxAnalysisRecursionDepth> Excluded;		std::array<const Value *, MaxAnalysisRecursionDepth> Excluded;

/// If true, it is safe to use metadata during simplification.		/// If true, it is safe to use metadata during simplification.
InstrInfoQuery IIQ;		InstrInfoQuery IIQ;

		const TargetTransformInfo *TTI;

unsigned NumExcluded = 0;		unsigned NumExcluded = 0;

Query(const DataLayout &DL, AssumptionCache *AC, const Instruction *CxtI,		Query(const DataLayout &DL, AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo,		const DominatorTree *DT, bool UseInstrInfo,
OptimizationRemarkEmitter *ORE = nullptr)		OptimizationRemarkEmitter *ORE = nullptr,
: DL(DL), AC(AC), CxtI(CxtI), DT(DT), ORE(ORE), IIQ(UseInstrInfo) {}		const TargetTransformInfo *TTI = nullptr)
		: DL(DL), AC(AC), CxtI(CxtI), DT(DT), ORE(ORE), IIQ(UseInstrInfo),
		TTI(TTI) {}

Query(const Query &Q, const Value *NewExcl)		Query(const Query &Q, const Value *NewExcl)
: DL(Q.DL), AC(Q.AC), CxtI(Q.CxtI), DT(Q.DT), ORE(Q.ORE), IIQ(Q.IIQ),		: DL(Q.DL), AC(Q.AC), CxtI(Q.CxtI), DT(Q.DT), ORE(Q.ORE), IIQ(Q.IIQ),
NumExcluded(Q.NumExcluded) {		TTI(Q.TTI), NumExcluded(Q.NumExcluded) {
Excluded = Q.Excluded;		Excluded = Q.Excluded;
Excluded[NumExcluded++] = NewExcl;		Excluded[NumExcluded++] = NewExcl;
assert(NumExcluded <= Excluded.size());		assert(NumExcluded <= Excluded.size());
}		}

bool isExcluded(const Value *Value) const {		bool isExcluded(const Value *Value) const {
if (NumExcluded == 0)		if (NumExcluded == 0)
return false;		return false;
...	APInt DemandedElts =
FVTy ? APInt::getAllOnesValue(FVTy->getNumElements()) : APInt(1, 1);		FVTy ? APInt::getAllOnesValue(FVTy->getNumElements()) : APInt(1, 1);
computeKnownBits(V, DemandedElts, Known, Depth, Q);		computeKnownBits(V, DemandedElts, Known, Depth, Q);
}		}

void llvm::computeKnownBits(const Value *V, KnownBits &Known,		void llvm::computeKnownBits(const Value *V, KnownBits &Known,
const DataLayout &DL, unsigned Depth,		const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT,		const DominatorTree *DT,
OptimizationRemarkEmitter *ORE, bool UseInstrInfo) {		OptimizationRemarkEmitter *ORE, bool UseInstrInfo,
::computeKnownBits(V, Known, Depth,		const TargetTransformInfo *TTI) {
Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		::computeKnownBits(
		V, Known, Depth,
		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, TTI));
}		}

void llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,		void llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,
KnownBits &Known, const DataLayout &DL,		KnownBits &Known, const DataLayout &DL,
unsigned Depth, AssumptionCache *AC,		unsigned Depth, AssumptionCache *AC,
const Instruction *CxtI, const DominatorTree *DT,		const Instruction *CxtI, const DominatorTree *DT,
OptimizationRemarkEmitter *ORE, bool UseInstrInfo) {		OptimizationRemarkEmitter *ORE, bool UseInstrInfo,
::computeKnownBits(V, DemandedElts, Known, Depth,		const TargetTransformInfo *TTI) {
Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		::computeKnownBits(
		V, DemandedElts, Known, Depth,
		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, TTI));
}		}

static KnownBits computeKnownBits(const Value *V, const APInt &DemandedElts,		static KnownBits computeKnownBits(const Value *V, const APInt &DemandedElts,
unsigned Depth, const Query &Q);		unsigned Depth, const Query &Q);

static KnownBits computeKnownBits(const Value *V, unsigned Depth,		static KnownBits computeKnownBits(const Value *V, unsigned Depth,
const Query &Q);		const Query &Q);

KnownBits llvm::computeKnownBits(const Value *V, const DataLayout &DL,		KnownBits
unsigned Depth, AssumptionCache *AC,		llvm::computeKnownBits(const Value *V, const DataLayout &DL, unsigned Depth,
const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT,		const DominatorTree *DT, OptimizationRemarkEmitter *ORE,
OptimizationRemarkEmitter *ORE,		bool UseInstrInfo, const TargetTransformInfo *TTI) {
bool UseInstrInfo) {
return ::computeKnownBits(		return ::computeKnownBits(
V, Depth, Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		V, Depth, Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, TTI));
}		}

KnownBits llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,		KnownBits llvm::computeKnownBits(const Value *V, const APInt &DemandedElts,
const DataLayout &DL, unsigned Depth,		const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT,		const DominatorTree *DT,
OptimizationRemarkEmitter *ORE,		OptimizationRemarkEmitter *ORE,
bool UseInstrInfo) {		bool UseInstrInfo,
		const TargetTransformInfo *TTI) {
return ::computeKnownBits(		return ::computeKnownBits(
V, DemandedElts, Depth,		V, DemandedElts, Depth,
Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE));		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, ORE, TTI));
}		}

bool llvm::haveNoCommonBitsSet(const Value *LHS, const Value *RHS,		bool llvm::haveNoCommonBitsSet(const Value *LHS, const Value *RHS,
const DataLayout &DL, AssumptionCache *AC,		const DataLayout &DL, AssumptionCache *AC,
const Instruction *CxtI, const DominatorTree *DT,		const Instruction *CxtI, const DominatorTree *DT,
bool UseInstrInfo) {		bool UseInstrInfo,
		const TargetTransformInfo *TTI) {
assert(LHS->getType() == RHS->getType() &&		assert(LHS->getType() == RHS->getType() &&
"LHS and RHS should have the same type");		"LHS and RHS should have the same type");
assert(LHS->getType()->isIntOrIntVectorTy() &&		assert(LHS->getType()->isIntOrIntVectorTy() &&
"LHS and RHS should be integers");		"LHS and RHS should be integers");
// Look for an inverted mask: (X & ~M) op (Y & M).		// Look for an inverted mask: (X & ~M) op (Y & M).
Value *M;		Value *M;
if (match(LHS, m_c_And(m_Not(m_Value(M)), m_Value())) &&		if (match(LHS, m_c_And(m_Not(m_Value(M)), m_Value())) &&
match(RHS, m_c_And(m_Specific(M), m_Value())))		match(RHS, m_c_And(m_Specific(M), m_Value())))
return true;		return true;
if (match(RHS, m_c_And(m_Not(m_Value(M)), m_Value())) &&		if (match(RHS, m_c_And(m_Not(m_Value(M)), m_Value())) &&
match(LHS, m_c_And(m_Specific(M), m_Value())))		match(LHS, m_c_And(m_Specific(M), m_Value())))
return true;		return true;
IntegerType *IT = cast<IntegerType>(LHS->getType()->getScalarType());		IntegerType *IT = cast<IntegerType>(LHS->getType()->getScalarType());
KnownBits LHSKnown(IT->getBitWidth());		KnownBits LHSKnown(IT->getBitWidth());
KnownBits RHSKnown(IT->getBitWidth());		KnownBits RHSKnown(IT->getBitWidth());
computeKnownBits(LHS, LHSKnown, DL, 0, AC, CxtI, DT, nullptr, UseInstrInfo);		computeKnownBits(LHS, LHSKnown, DL, 0, AC, CxtI, DT, nullptr, UseInstrInfo,
computeKnownBits(RHS, RHSKnown, DL, 0, AC, CxtI, DT, nullptr, UseInstrInfo);		TTI);
		computeKnownBits(RHS, RHSKnown, DL, 0, AC, CxtI, DT, nullptr, UseInstrInfo,
		TTI);
return (LHSKnown.Zero | RHSKnown.Zero).isAllOnesValue();		return (LHSKnown.Zero | RHSKnown.Zero).isAllOnesValue();
}		}
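
The final check in `haveNoCommonBitsSet` says that two values share no set bits when every bit position is known zero in at least one of them. A small sketch on 8-bit masks follows. `noCommonBits` is a hypothetical helper that takes only the known-zero masks, standing in for `LHSKnown.Zero` and `RHSKnown.Zero`.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical 8-bit version of the final test: every bit position must be
// known zero on at least one side.
bool noCommonBits(uint8_t LHSZero, uint8_t RHSZero) {
  return static_cast<uint8_t>(LHSZero | RHSZero) == 0xFF;
}
```

For `X & 0xF0` the low four bits are known zero (mask `0x0F`), and for `Y & 0x0F` the high four bits are known zero (mask `0xF0`), so the helper reports that the two cannot overlap. Any extra known-zero bits a target could contribute through TTI can only make this check succeed more often.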

bool llvm::isOnlyUsedInZeroEqualityComparison(const Instruction *CxtI) {		bool llvm::isOnlyUsedInZeroEqualityComparison(const Instruction *CxtI) {
for (const User *U : CxtI->users()) {		for (const User *U : CxtI->users()) {
if (const ICmpInst *IC = dyn_cast<ICmpInst>(U))		if (const ICmpInst *IC = dyn_cast<ICmpInst>(U))
if (IC->isEquality())		if (IC->isEquality())
if (Constant *C = dyn_cast<Constant>(IC->getOperand(1)))		if (Constant *C = dyn_cast<Constant>(IC->getOperand(1)))
if (C->isNullValue())		if (C->isNullValue())
continue;		continue;
return false;		return false;
}		}
return true;		return true;
}		}

static bool isKnownToBeAPowerOfTwo(const Value *V, bool OrZero, unsigned Depth,		static bool isKnownToBeAPowerOfTwo(const Value *V, bool OrZero, unsigned Depth,
const Query &Q);		const Query &Q);

bool llvm::isKnownToBeAPowerOfTwo(const Value *V, const DataLayout &DL,		bool llvm::isKnownToBeAPowerOfTwo(const Value *V, const DataLayout &DL,
bool OrZero, unsigned Depth,		bool OrZero, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo) {		const DominatorTree *DT, bool UseInstrInfo,
		const TargetTransformInfo *TTI) {
return ::isKnownToBeAPowerOfTwo(		return ::isKnownToBeAPowerOfTwo(
V, OrZero, Depth, Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo));		V, OrZero, Depth,
		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, /*ORE=*/nullptr, TTI));
}		}

static bool isKnownNonZero(const Value *V, const APInt &DemandedElts,		static bool isKnownNonZero(const Value *V, const APInt &DemandedElts,
unsigned Depth, const Query &Q);		unsigned Depth, const Query &Q);

static bool isKnownNonZero(const Value *V, unsigned Depth, const Query &Q);		static bool isKnownNonZero(const Value *V, unsigned Depth, const Query &Q);

bool llvm::isKnownNonZero(const Value *V, const DataLayout &DL, unsigned Depth,		bool llvm::isKnownNonZero(const Value *V, const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo) {		const DominatorTree *DT, bool UseInstrInfo,
return ::isKnownNonZero(V, Depth,		const TargetTransformInfo *TTI) {
Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo));		return ::isKnownNonZero(
		V, Depth,
		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, /*ORE=*/nullptr, TTI));
}		}

bool llvm::isKnownNonNegative(const Value *V, const DataLayout &DL,		bool llvm::isKnownNonNegative(const Value *V, const DataLayout &DL,
unsigned Depth, AssumptionCache *AC,		unsigned Depth, AssumptionCache *AC,
const Instruction *CxtI, const DominatorTree *DT,		const Instruction *CxtI, const DominatorTree *DT,
bool UseInstrInfo) {		bool UseInstrInfo,
		const TargetTransformInfo *TTI) {
KnownBits Known =		KnownBits Known =
computeKnownBits(V, DL, Depth, AC, CxtI, DT, nullptr, UseInstrInfo);		computeKnownBits(V, DL, Depth, AC, CxtI, DT, nullptr, UseInstrInfo, TTI);
return Known.isNonNegative();		return Known.isNonNegative();
}		}

bool llvm::isKnownPositive(const Value *V, const DataLayout &DL, unsigned Depth,		bool llvm::isKnownPositive(const Value *V, const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo) {		const DominatorTree *DT, bool UseInstrInfo,
		const TargetTransformInfo *TTI) {
if (auto *CI = dyn_cast<ConstantInt>(V))		if (auto *CI = dyn_cast<ConstantInt>(V))
return CI->getValue().isStrictlyPositive();		return CI->getValue().isStrictlyPositive();

// TODO: We'd doing two recursive queries here. We should factor this such		// TODO: We'd doing two recursive queries here. We should factor this such
// that only a single query is needed.		// that only a single query is needed.
return isKnownNonNegative(V, DL, Depth, AC, CxtI, DT, UseInstrInfo) &&		return isKnownNonNegative(V, DL, Depth, AC, CxtI, DT, UseInstrInfo, TTI) &&
isKnownNonZero(V, DL, Depth, AC, CxtI, DT, UseInstrInfo);		isKnownNonZero(V, DL, Depth, AC, CxtI, DT, UseInstrInfo, TTI);
}		}

bool llvm::isKnownNegative(const Value *V, const DataLayout &DL, unsigned Depth,		bool llvm::isKnownNegative(const Value *V, const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo) {		const DominatorTree *DT, bool UseInstrInfo) {
KnownBits Known =		KnownBits Known =
computeKnownBits(V, DL, Depth, AC, CxtI, DT, nullptr, UseInstrInfo);		computeKnownBits(V, DL, Depth, AC, CxtI, DT, nullptr, UseInstrInfo);
return Known.isNegative();		return Known.isNegative();
}		}

static bool isKnownNonEqual(const Value *V1, const Value *V2, const Query &Q);		static bool isKnownNonEqual(const Value *V1, const Value *V2, const Query &Q);

bool llvm::isKnownNonEqual(const Value *V1, const Value *V2,		bool llvm::isKnownNonEqual(const Value *V1, const Value *V2,
const DataLayout &DL, AssumptionCache *AC,		const DataLayout &DL, AssumptionCache *AC,
const Instruction *CxtI, const DominatorTree *DT,		const Instruction *CxtI, const DominatorTree *DT,
bool UseInstrInfo) {		bool UseInstrInfo, const TargetTransformInfo *TTI) {
return ::isKnownNonEqual(V1, V2,		return ::isKnownNonEqual(V1, V2,
Query(DL, AC, safeCxtI(V1, safeCxtI(V2, CxtI)), DT,		Query(DL, AC, safeCxtI(V1, safeCxtI(V2, CxtI)), DT,
UseInstrInfo, /*ORE=*/nullptr));		UseInstrInfo, /*ORE=*/nullptr, TTI));
}		}

static bool MaskedValueIsZero(const Value *V, const APInt &Mask, unsigned Depth,		static bool MaskedValueIsZero(const Value *V, const APInt &Mask, unsigned Depth,
const Query &Q);		const Query &Q);

bool llvm::MaskedValueIsZero(const Value *V, const APInt &Mask,		bool llvm::MaskedValueIsZero(const Value *V, const APInt &Mask,
const DataLayout &DL, unsigned Depth,		const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI,		AssumptionCache *AC, const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo) {		const DominatorTree *DT, bool UseInstrInfo,
		const TargetTransformInfo *TTI) {
return ::MaskedValueIsZero(		return ::MaskedValueIsZero(
V, Mask, Depth, Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo));		V, Mask, Depth,
		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, /*ORE=*/nullptr, TTI));
}		}

static unsigned ComputeNumSignBits(const Value *V, const APInt &DemandedElts,		static unsigned ComputeNumSignBits(const Value *V, const APInt &DemandedElts,
unsigned Depth, const Query &Q);		unsigned Depth, const Query &Q);

static unsigned ComputeNumSignBits(const Value *V, unsigned Depth,		static unsigned ComputeNumSignBits(const Value *V, unsigned Depth,
const Query &Q) {		const Query &Q) {
// FIXME: We currently have no way to represent the DemandedElts of a scalable		// FIXME: We currently have no way to represent the DemandedElts of a scalable
// vector		// vector
if (isa<ScalableVectorType>(V->getType()))		if (isa<ScalableVectorType>(V->getType()))
return 1;		return 1;

auto *FVTy = dyn_cast<FixedVectorType>(V->getType());		auto *FVTy = dyn_cast<FixedVectorType>(V->getType());
APInt DemandedElts =		APInt DemandedElts =
FVTy ? APInt::getAllOnesValue(FVTy->getNumElements()) : APInt(1, 1);		FVTy ? APInt::getAllOnesValue(FVTy->getNumElements()) : APInt(1, 1);
return ComputeNumSignBits(V, DemandedElts, Depth, Q);		return ComputeNumSignBits(V, DemandedElts, Depth, Q);
}		}

unsigned llvm::ComputeNumSignBits(const Value *V, const DataLayout &DL,		unsigned llvm::ComputeNumSignBits(const Value *V, const DataLayout &DL,
unsigned Depth, AssumptionCache *AC,		unsigned Depth, AssumptionCache *AC,
const Instruction *CxtI,		const Instruction *CxtI,
const DominatorTree *DT, bool UseInstrInfo) {		const DominatorTree *DT, bool UseInstrInfo,
		const TargetTransformInfo *TTI) {
return ::ComputeNumSignBits(		return ::ComputeNumSignBits(
V, Depth, Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo));		V, Depth,
		Query(DL, AC, safeCxtI(V, CxtI), DT, UseInstrInfo, /*ORE=*/nullptr, TTI));
}		}

static void computeKnownBitsAddSub(bool Add, const Value *Op0, const Value *Op1,		static void computeKnownBitsAddSub(bool Add, const Value *Op0, const Value *Op1,
bool NSW, const APInt &DemandedElts,		bool NSW, const APInt &DemandedElts,
KnownBits &KnownOut, KnownBits &Known2,		KnownBits &KnownOut, KnownBits &Known2,
unsigned Depth, const Query &Q) {		unsigned Depth, const Query &Q) {
computeKnownBits(Op1, DemandedElts, KnownOut, Depth + 1, Q);		computeKnownBits(Op1, DemandedElts, KnownOut, Depth + 1, Q);

// If one operand is unknown and we have no nowrap information,		// If one operand is unknown and we have no nowrap information,
// the result will be unknown independently of the second operand.		// the result will be unknown independently of the second operand.
if (KnownOut.isUnknown() && !NSW)		if (KnownOut.isUnknown() && !NSW)
return;		return;

computeKnownBits(Op0, DemandedElts, Known2, Depth + 1, Q);		computeKnownBits(Op0, DemandedElts, Known2, Depth + 1, Q);
KnownOut = KnownBits::computeForAddSub(Add, NSW, Known2, KnownOut);		KnownOut = KnownBits::computeForAddSub(Add, NSW, Known2, KnownOut);
}		}

static void computeKnownBitsMul(const Value *Op0, const Value *Op1, bool NSW,		static void computeKnownBitsMul(const Value *Op0, const Value *Op1, bool NSW,
const APInt &DemandedElts, KnownBits &Known,		const APInt &DemandedElts, KnownBits &Known,
KnownBits &Known2, unsigned Depth,		KnownBits &Known2, unsigned Depth,
const Query &Q) {		const Query &Q) {
unsigned BitWidth = Known.getBitWidth();
computeKnownBits(Op1, DemandedElts, Known, Depth + 1, Q);		computeKnownBits(Op1, DemandedElts, Known, Depth + 1, Q);
computeKnownBits(Op0, DemandedElts, Known2, Depth + 1, Q);		computeKnownBits(Op0, DemandedElts, Known2, Depth + 1, Q);

bool isKnownNegative = false;		bool isKnownNegative = false;
bool isKnownNonNegative = false;		bool isKnownNonNegative = false;
// If the multiplication is known not to overflow, compute the sign bit.		// If the multiplication is known not to overflow, compute the sign bit.
if (NSW) {		if (NSW) {
if (Op0 == Op1) {		if (Op0 == Op1) {
// The product of a number with itself is non-negative.		// The product of a number with itself is non-negative.
isKnownNonNegative = true;		isKnownNonNegative = true;
} else {		} else {
bool isKnownNonNegativeOp1 = Known.isNonNegative();		bool isKnownNonNegativeOp1 = Known.isNonNegative();
bool isKnownNonNegativeOp0 = Known2.isNonNegative();		bool isKnownNonNegativeOp0 = Known2.isNonNegative();
bool isKnownNegativeOp1 = Known.isNegative();		bool isKnownNegativeOp1 = Known.isNegative();
bool isKnownNegativeOp0 = Known2.isNegative();		bool isKnownNegativeOp0 = Known2.isNegative();
// The product of two numbers with the same sign is non-negative.		// The product of two numbers with the same sign is non-negative.
isKnownNonNegative = (isKnownNegativeOp1 && isKnownNegativeOp0) ||		isKnownNonNegative = (isKnownNegativeOp1 && isKnownNegativeOp0) ||
(isKnownNonNegativeOp1 && isKnownNonNegativeOp0);		(isKnownNonNegativeOp1 && isKnownNonNegativeOp0);
// The product of a negative number and a non-negative number is either		// The product of a negative number and a non-negative number is either
// negative or zero.		// negative or zero.
if (!isKnownNonNegative)		if (!isKnownNonNegative)
isKnownNegative = (isKnownNegativeOp1 && isKnownNonNegativeOp0 &&		isKnownNegative = (isKnownNegativeOp1 && isKnownNonNegativeOp0 &&
isKnownNonZero(Op0, Depth, Q)) ||		isKnownNonZero(Op0, Depth, Q)) ||
(isKnownNegativeOp0 && isKnownNonNegativeOp1 &&		(isKnownNegativeOp0 && isKnownNonNegativeOp1 &&
isKnownNonZero(Op1, Depth, Q));		isKnownNonZero(Op1, Depth, Q));
}		}
}		}

assert(!Known.hasConflict() && !Known2.hasConflict());		Known = KnownBits::computeForMul(Known, Known2);
		RKSimon (inline comment): Pull this out into its own commit.
// Compute a conservative estimate for high known-0 bits.
unsigned LeadZ = std::max(Known.countMinLeadingZeros() +
Known2.countMinLeadingZeros(),
BitWidth) - BitWidth;
LeadZ = std::min(LeadZ, BitWidth);

// The result of the bottom bits of an integer multiply can be
// inferred by looking at the bottom bits of both operands and
// multiplying them together.
// We can infer at least the minimum number of known trailing bits
// of both operands. Depending on number of trailing zeros, we can
// infer more bits, because (a*b) <=> ((a/m) * (b/n)) * (m*n) assuming
// a and b are divisible by m and n respectively.
// We then calculate how many of those bits are inferrable and set
// the output. For example, the i8 mul:
// a = XXXX1100 (12)
// b = XXXX1110 (14)
// We know the bottom 3 bits are zero since the first can be divided by
// 4 and the second by 2, thus having ((12/4) * (14/2)) * (2*4).
// Applying the multiplication to the trimmed arguments gets:
// XX11 (3)
// X111 (7)
// -------
// XX11
// XX11
// XX11
// XX11
// -------
// XXXXX01
// Which allows us to infer the 2 LSBs. Since we're multiplying the result
// by 8, the bottom 3 bits will be 0, so we can infer a total of 5 bits.
// The proof for this can be described as:
// Pre: (C1 >= 0) && (C1 < (1 << C5)) && (C2 >= 0) && (C2 < (1 << C6)) &&
// (C7 == (1 << (umin(countTrailingZeros(C1), C5) +
// umin(countTrailingZeros(C2), C6) +
// umin(C5 - umin(countTrailingZeros(C1), C5),
// C6 - umin(countTrailingZeros(C2), C6)))) - 1)
// %aa = shl i8 %a, C5
// %bb = shl i8 %b, C6
// %aaa = or i8 %aa, C1
// %bbb = or i8 %bb, C2
// %mul = mul i8 %aaa, %bbb
// %mask = and i8 %mul, C7
// =>
// %mask = i8 ((C1*C2)&C7)
// Where C5, C6 describe the known bits of %a, %b
// C1, C2 describe the known bottom bits of %a, %b.
// C7 describes the mask of the known bits of the result.
APInt Bottom0 = Known.One;
APInt Bottom1 = Known2.One;

// How many times we'd be able to divide each argument by 2 (shr by 1).
// This gives us the number of trailing zeros on the multiplication result.
unsigned TrailBitsKnown0 = (Known.Zero | Known.One).countTrailingOnes();
unsigned TrailBitsKnown1 = (Known2.Zero | Known2.One).countTrailingOnes();
unsigned TrailZero0 = Known.countMinTrailingZeros();
unsigned TrailZero1 = Known2.countMinTrailingZeros();
unsigned TrailZ = TrailZero0 + TrailZero1;

// Figure out the fewest known-bits operand.
unsigned SmallestOperand = std::min(TrailBitsKnown0 - TrailZero0,
TrailBitsKnown1 - TrailZero1);
unsigned ResultBitsKnown = std::min(SmallestOperand + TrailZ, BitWidth);

APInt BottomKnown = Bottom0.getLoBits(TrailBitsKnown0) *
Bottom1.getLoBits(TrailBitsKnown1);

Known.resetAll();
Known.Zero.setHighBits(LeadZ);
Known.Zero |= (~BottomKnown).getLoBits(ResultBitsKnown);
Known.One |= BottomKnown.getLoBits(ResultBitsKnown);

// Only make use of no-wrap flags if we failed to compute the sign bit		// Only make use of no-wrap flags if we failed to compute the sign bit
// directly. This matters if the multiplication always overflows, in		// directly. This matters if the multiplication always overflows, in
// which case we prefer to follow the result of the direct computation,		// which case we prefer to follow the result of the direct computation,
// though as the program is invoking undefined behaviour we can choose		// though as the program is invoking undefined behaviour we can choose
// whatever we like here.		// whatever we like here.
if (isKnownNonNegative && !Known.isNegative())		if (isKnownNonNegative && !Known.isNegative())
Known.makeNonNegative();		Known.makeNonNegative();
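
The trailing-bits argument in the removed comment above can be tried on plain 8-bit integers. The sketch below is not LLVM code: `Known8`, `countTrailingOnes8`, `lowMask8` and `knownLowBitsOfMul` are hypothetical stand-ins for `KnownBits` and the `APInt` helpers, and only the low-bit half of the computation is modelled (the leading-zero estimate and the NSW sign handling are left out).

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical 8-bit stand-in for llvm::KnownBits: a bit set in Zero is known
// to be 0, a bit set in One is known to be 1.
struct Known8 {
  uint8_t Zero = 0;
  uint8_t One = 0;
};

// Number of consecutive set bits starting at bit 0.
static unsigned countTrailingOnes8(uint8_t X) {
  unsigned N = 0;
  while (N < 8 && ((X >> N) & 1u))
    ++N;
  return N;
}

// Mask with the low N bits set, for N in [0, 8].
static uint8_t lowMask8(unsigned N) {
  assert(N <= 8 && "mask wider than the value");
  return N == 8 ? 0xFF : static_cast<uint8_t>((1u << N) - 1u);
}

// Known low bits of A * B, following the trailing-bits reasoning.
Known8 knownLowBitsOfMul(Known8 A, Known8 B) {
  // How many low bits of each operand are fully known.
  unsigned TrailKnownA = countTrailingOnes8(A.Zero | A.One);
  unsigned TrailKnownB = countTrailingOnes8(B.Zero | B.One);
  // How many low bits of each operand are known to be zero.
  unsigned TrailZeroA = countTrailingOnes8(A.Zero);
  unsigned TrailZeroB = countTrailingOnes8(B.Zero);
  unsigned TrailZ = TrailZeroA + TrailZeroB;

  // The operand with fewer known bits above its trailing zeros limits the result.
  unsigned Smallest =
      std::min(TrailKnownA - TrailZeroA, TrailKnownB - TrailZeroB);
  unsigned ResultKnown = std::min(Smallest + TrailZ, 8u);

  // Multiply the known low parts; only the low ResultKnown bits are trusted.
  uint8_t Bottom = static_cast<uint8_t>((A.One & lowMask8(TrailKnownA)) *
                                        (B.One & lowMask8(TrailKnownB)));
  Known8 R;
  R.Zero = static_cast<uint8_t>(~Bottom & lowMask8(ResultKnown));
  R.One = static_cast<uint8_t>(Bottom & lowMask8(ResultKnown));
  return R;
}
```

For the comment's example (a = XXXX1100, b = XXXX1110), passing Zero = 0x03, One = 0x0C and Zero = 0x01, One = 0x0E yields One = 0x08 and Zero = 0x17, so the low five bits of the product are 01000.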
(1,480 lines not shown)	#endif
// There's no point in looking through other users of ConstantData for		// There's no point in looking through other users of ConstantData for
// assumptions. Confirm that we've handled them all.		// assumptions. Confirm that we've handled them all.
assert(!isa<ConstantData>(V) && "Unhandled constant data!");		assert(!isa<ConstantData>(V) && "Unhandled constant data!");

// All recursive calls that increase depth must come after this.		// All recursive calls that increase depth must come after this.
if (Depth == MaxAnalysisRecursionDepth)		if (Depth == MaxAnalysisRecursionDepth)
return;		return;

		if (Q.TTI &&
		Lint: Pre-merge checks (inline comment): clang-format: please reformat the code
		- if (Q.TTI &&
		- Q.TTI->computeKnownBits(V, Known, Q.DL, Depth + 1,
		- Q.AC, Q.CxtI, Q.DT, Q.ORE, true))
		+ if (Q.TTI && Q.TTI->computeKnownBits(V, Known, Q.DL, Depth + 1, Q.AC, Q.CxtI,
		+ Q.DT, Q.ORE, true))
		Q.TTI->computeKnownBits(V, Known, Q.DL, Depth + 1,
		Q.AC, Q.CxtI, Q.DT, Q.ORE, true))
		return;
// A weak GlobalAlias is totally unknown. A non-weak GlobalAlias has		// A weak GlobalAlias is totally unknown. A non-weak GlobalAlias has
// the bits of its aliasee.		// the bits of its aliasee.
if (const GlobalAlias *GA = dyn_cast<GlobalAlias>(V)) {		if (const GlobalAlias *GA = dyn_cast<GlobalAlias>(V)) {
if (!GA->isInterposable())		if (!GA->isInterposable())
computeKnownBits(GA->getAliasee(), Known, Depth + 1, Q);		computeKnownBits(GA->getAliasee(), Known, Depth + 1, Q);
return;		return;
}		}
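
The `Q.TTI` check above lets a target answer the query before the generic analysis runs, and returns early when the target reports success. The shape of that hook can be sketched with toy types. Everything here (`ToyKnownBits`, `ToyValue`, `ToyTargetInfo`, `ToyGpuTargetInfo`, `analyze`) is hypothetical and only models the control flow; it is not the `TargetTransformInfo` interface from this patch, and the upper-bound rule is an invented example of target-specific knowledge.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for illustration; none of these are LLVM types.
struct ToyKnownBits {
  uint32_t Zero = 0; // bits known to be 0
  uint32_t One = 0;  // bits known to be 1
};

struct ToyValue {
  bool IsTargetIntrinsic; // true if only the target understands this value
  uint32_t MaxValue;      // upper bound the target can prove for it
};

struct ToyTargetInfo {
  virtual ~ToyTargetInfo() = default;
  // Returns true when the target has fully computed Known for the value.
  // The default knows nothing and defers to the generic analysis.
  virtual bool computeKnownBits(const ToyValue &, ToyKnownBits &) const {
    return false;
  }
};

struct ToyGpuTargetInfo : ToyTargetInfo {
  bool computeKnownBits(const ToyValue &V, ToyKnownBits &Known) const override {
    if (!V.IsTargetIntrinsic)
      return false;
    // Smear the highest set bit of MaxValue downwards; every bit above the
    // resulting mask is known to be zero.
    uint32_t Mask = V.MaxValue;
    Mask |= Mask >> 1;
    Mask |= Mask >> 2;
    Mask |= Mask >> 4;
    Mask |= Mask >> 8;
    Mask |= Mask >> 16;
    Known.Zero = ~Mask;
    return true;
  }
};

// Generic entry point: ask the target first, fall back otherwise.
ToyKnownBits analyze(const ToyValue &V, const ToyTargetInfo *TTI) {
  ToyKnownBits Known;
  if (TTI && TTI->computeKnownBits(V, Known)) {
    assert((Known.Zero & Known.One) == 0 && "conflicting known bits");
    return Known;
  }
  // The generic analysis would run here; the toy version learns nothing.
  return Known;
}
```

Passing a null target pointer leaves the behaviour unchanged, which mirrors how the patch threads `TTI` through with a `nullptr` default.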

(2,501 lines not shown)	static OverflowResult mapOverflowResult(ConstantRange::OverflowResult OR) {
}		}
llvm_unreachable("Unknown OverflowResult");		llvm_unreachable("Unknown OverflowResult");
}		}

/// Combine constant ranges from computeConstantRange() and computeKnownBits().		/// Combine constant ranges from computeConstantRange() and computeKnownBits().
static ConstantRange computeConstantRangeIncludingKnownBits(		static ConstantRange computeConstantRangeIncludingKnownBits(
const Value *V, bool ForSigned, const DataLayout &DL, unsigned Depth,		const Value *V, bool ForSigned, const DataLayout &DL, unsigned Depth,
AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
OptimizationRemarkEmitter *ORE = nullptr, bool UseInstrInfo = true) {		OptimizationRemarkEmitter *ORE = nullptr, bool UseInstrInfo = true,
KnownBits Known = computeKnownBits(		const TargetTransformInfo *TTI = nullptr) {
V, DL, Depth, AC, CxtI, DT, ORE, UseInstrInfo);		KnownBits Known =
		computeKnownBits(V, DL, Depth, AC, CxtI, DT, ORE, UseInstrInfo, TTI);
ConstantRange CR1 = ConstantRange::fromKnownBits(Known, ForSigned);		ConstantRange CR1 = ConstantRange::fromKnownBits(Known, ForSigned);
ConstantRange CR2 = computeConstantRange(V, UseInstrInfo);		ConstantRange CR2 = computeConstantRange(V, UseInstrInfo);
ConstantRange::PreferredRangeType RangeType =		ConstantRange::PreferredRangeType RangeType =
ForSigned ? ConstantRange::Signed : ConstantRange::Unsigned;		ForSigned ? ConstantRange::Signed : ConstantRange::Unsigned;
return CR1.intersectWith(CR2, RangeType);		return CR1.intersectWith(CR2, RangeType);
}		}

OverflowResult llvm::computeOverflowForUnsignedMul(		OverflowResult llvm::computeOverflowForUnsignedMul(
const Value *LHS, const Value *RHS, const DataLayout &DL,		const Value *LHS, const Value *RHS, const DataLayout &DL,
AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
bool UseInstrInfo) {		bool UseInstrInfo, const TargetTransformInfo *TTI) {
KnownBits LHSKnown = computeKnownBits(LHS, DL, /*Depth=*/0, AC, CxtI, DT,		KnownBits LHSKnown = computeKnownBits(LHS, DL, /*Depth=*/0, AC, CxtI, DT,
nullptr, UseInstrInfo);		nullptr, UseInstrInfo, TTI);
KnownBits RHSKnown = computeKnownBits(RHS, DL, /*Depth=*/0, AC, CxtI, DT,		KnownBits RHSKnown = computeKnownBits(RHS, DL, /*Depth=*/0, AC, CxtI, DT,
nullptr, UseInstrInfo);		nullptr, UseInstrInfo, TTI);
ConstantRange LHSRange = ConstantRange::fromKnownBits(LHSKnown, false);		ConstantRange LHSRange = ConstantRange::fromKnownBits(LHSKnown, false);
ConstantRange RHSRange = ConstantRange::fromKnownBits(RHSKnown, false);		ConstantRange RHSRange = ConstantRange::fromKnownBits(RHSKnown, false);
return mapOverflowResult(LHSRange.unsignedMulMayOverflow(RHSRange));		return mapOverflowResult(LHSRange.unsignedMulMayOverflow(RHSRange));
}		}

OverflowResult		OverflowResult llvm::computeOverflowForSignedMul(
llvm::computeOverflowForSignedMul(const Value *LHS, const Value *RHS,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const DataLayout &DL, AssumptionCache *AC,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
const Instruction *CxtI,		bool UseInstrInfo, const TargetTransformInfo *TTI) {
const DominatorTree *DT, bool UseInstrInfo) {
// Multiplying n * m significant bits yields a result of n + m significant		// Multiplying n * m significant bits yields a result of n + m significant
// bits. If the total number of significant bits does not exceed the		// bits. If the total number of significant bits does not exceed the
// result bit width (minus 1), there is no overflow.		// result bit width (minus 1), there is no overflow.
// This means if we have enough leading sign bits in the operands		// This means if we have enough leading sign bits in the operands
// we can guarantee that the result does not overflow.		// we can guarantee that the result does not overflow.
// Ref: "Hacker's Delight" by Henry Warren		// Ref: "Hacker's Delight" by Henry Warren
unsigned BitWidth = LHS->getType()->getScalarSizeInBits();		unsigned BitWidth = LHS->getType()->getScalarSizeInBits();

// Note that underestimating the number of sign bits gives a more		// Note that underestimating the number of sign bits gives a more
// conservative answer.		// conservative answer.
unsigned SignBits = ComputeNumSignBits(LHS, DL, 0, AC, CxtI, DT) +		unsigned SignBits =
ComputeNumSignBits(RHS, DL, 0, AC, CxtI, DT);		ComputeNumSignBits(LHS, DL, 0, AC, CxtI, DT, /*UseInstrInfo=*/true, TTI) +
		ComputeNumSignBits(RHS, DL, 0, AC, CxtI, DT, /*UseInstrInfo=*/true, TTI);

// First handle the easy case: if we have enough sign bits there's		// First handle the easy case: if we have enough sign bits there's
// definitely no overflow.		// definitely no overflow.
if (SignBits > BitWidth + 1)		if (SignBits > BitWidth + 1)
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;

// There are two ambiguous cases where there can be no overflow:		// There are two ambiguous cases where there can be no overflow:
// SignBits == BitWidth + 1 and		// SignBits == BitWidth + 1 and
// SignBits == BitWidth		// SignBits == BitWidth
// The second case is difficult to check, therefore we only handle the		// The second case is difficult to check, therefore we only handle the
// first case.		// first case.
if (SignBits == BitWidth + 1) {		if (SignBits == BitWidth + 1) {
// It overflows only when both arguments are negative and the true		// It overflows only when both arguments are negative and the true
// product is exactly the minimum negative number.		// product is exactly the minimum negative number.
// E.g. mul i16 with 17 sign bits: 0xff00 * 0xff80 = 0x8000		// E.g. mul i16 with 17 sign bits: 0xff00 * 0xff80 = 0x8000
// For simplicity we just check if at least one side is not negative.		// For simplicity we just check if at least one side is not negative.
KnownBits LHSKnown = computeKnownBits(LHS, DL, /*Depth=*/0, AC, CxtI, DT,		KnownBits LHSKnown = computeKnownBits(LHS, DL, /*Depth=*/0, AC, CxtI, DT,
nullptr, UseInstrInfo);		nullptr, UseInstrInfo, TTI);
KnownBits RHSKnown = computeKnownBits(RHS, DL, /*Depth=*/0, AC, CxtI, DT,		KnownBits RHSKnown = computeKnownBits(RHS, DL, /*Depth=*/0, AC, CxtI, DT,
nullptr, UseInstrInfo);		nullptr, UseInstrInfo, TTI);
if (LHSKnown.isNonNegative() || RHSKnown.isNonNegative())		if (LHSKnown.isNonNegative() || RHSKnown.isNonNegative())
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;
}		}
return OverflowResult::MayOverflow;		return OverflowResult::MayOverflow;
}		}
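
The sign-bit rule used by `computeOverflowForSignedMul` above can be checked on 16-bit integers, including the `0xff00 * 0xff80` edge case from the comment. This is a standalone sketch under assumed names: `numSignBits16`, `signedMulOverflows16`, `ToyOverflow` and `classifyMul16` are hypothetical helpers working on concrete values, whereas the real analysis works on a conservative lower bound of the sign-bit count.

```cpp
#include <cassert>
#include <cstdint>

// Number of leading bits equal to the sign bit (always at least 1).
unsigned numSignBits16(int16_t V) {
  uint16_t U = static_cast<uint16_t>(V);
  unsigned Sign = (U >> 15) & 1u;
  unsigned N = 1;
  while (N < 16 && ((U >> (15 - N)) & 1u) == Sign)
    ++N;
  return N;
}

// Ground truth: does A * B fall outside the i16 range?
bool signedMulOverflows16(int16_t A, int16_t B) {
  int32_t P = static_cast<int32_t>(A) * static_cast<int32_t>(B);
  return P < INT16_MIN || P > INT16_MAX;
}

enum class ToyOverflow { Never, May };

// Mirrors the two cases handled above, with BitWidth fixed to 16.
ToyOverflow classifyMul16(int16_t A, int16_t B) {
  const unsigned BitWidth = 16;
  unsigned SignBits = numSignBits16(A) + numSignBits16(B);

  // Easy case: enough sign bits, so the product always fits.
  bool Never = SignBits > BitWidth + 1;
  // Ambiguous case: safe as long as one operand is non-negative.
  if (SignBits == BitWidth + 1 && (A >= 0 || B >= 0))
    Never = true;

  if (Never) {
    assert(!signedMulOverflows16(A, B) && "rule claimed too much");
    return ToyOverflow::Never;
  }
  return ToyOverflow::May;
}
```

With -256 (0xff00, 8 sign bits) and -128 (0xff80, 9 sign bits) the total is 17 and both operands are negative, so the sketch answers `May`; the true product 32768 indeed does not fit in i16.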

OverflowResult llvm::computeOverflowForUnsignedAdd(		OverflowResult llvm::computeOverflowForUnsignedAdd(
const Value *LHS, const Value *RHS, const DataLayout &DL,		const Value *LHS, const Value *RHS, const DataLayout &DL,
AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
bool UseInstrInfo) {		bool UseInstrInfo, const TargetTransformInfo *TTI) {
ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(
LHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT,		LHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT, nullptr,
nullptr, UseInstrInfo);		UseInstrInfo, TTI);
ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(
RHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT,		RHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT, nullptr,
nullptr, UseInstrInfo);		UseInstrInfo, TTI);
return mapOverflowResult(LHSRange.unsignedAddMayOverflow(RHSRange));		return mapOverflowResult(LHSRange.unsignedAddMayOverflow(RHSRange));
}		}

static OverflowResult computeOverflowForSignedAdd(const Value *LHS,		static OverflowResult computeOverflowForSignedAdd(
const Value *RHS,		const Value *LHS, const Value *RHS, const AddOperator *Add,
const AddOperator *Add,		const DataLayout &DL, AssumptionCache *AC, const Instruction *CxtI,
const DataLayout &DL,		const DominatorTree *DT, const TargetTransformInfo *TTI) {
AssumptionCache *AC,
const Instruction *CxtI,
const DominatorTree *DT) {
if (Add && Add->hasNoSignedWrap()) {		if (Add && Add->hasNoSignedWrap()) {
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;
}		}

// If LHS and RHS each have at least two sign bits, the addition will look		// If LHS and RHS each have at least two sign bits, the addition will look
// like		// like
//		//
// XX..... +		// XX..... +
// YY.....		// YY.....
//		//
// If the carry into the most significant position is 0, X and Y can't both		// If the carry into the most significant position is 0, X and Y can't both
// be 1 and therefore the carry out of the addition is also 0.		// be 1 and therefore the carry out of the addition is also 0.
//		//
// If the carry into the most significant position is 1, X and Y can't both		// If the carry into the most significant position is 1, X and Y can't both
// be 0 and therefore the carry out of the addition is also 1.		// be 0 and therefore the carry out of the addition is also 1.
//		//
// Since the carry into the most significant position is always equal to		// Since the carry into the most significant position is always equal to
// the carry out of the addition, there is no signed overflow.		// the carry out of the addition, there is no signed overflow.
if (ComputeNumSignBits(LHS, DL, 0, AC, CxtI, DT) > 1 &&		if (ComputeNumSignBits(LHS, DL, 0, AC, CxtI, DT, /*UseInstrInfo=*/true, TTI) >
ComputeNumSignBits(RHS, DL, 0, AC, CxtI, DT) > 1)		1 &&
		ComputeNumSignBits(RHS, DL, 0, AC, CxtI, DT, /*UseInstrInfo=*/true, TTI) >
		1)
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;

ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(
LHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT);		LHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT, /*ORE=*/nullptr,
		/*UseInstrInfo=*/true, TTI);
ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(
RHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT);		RHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT, /*ORE=*/nullptr,
		/*UseInstrInfo=*/true, TTI);
OverflowResult OR =		OverflowResult OR =
mapOverflowResult(LHSRange.signedAddMayOverflow(RHSRange));		mapOverflowResult(LHSRange.signedAddMayOverflow(RHSRange));
if (OR != OverflowResult::MayOverflow)		if (OR != OverflowResult::MayOverflow)
return OR;		return OR;

// The remaining code needs Add to be available. Early returns if not so.		// The remaining code needs Add to be available. Early returns if not so.
if (!Add)		if (!Add)
return OverflowResult::MayOverflow;		return OverflowResult::MayOverflow;

// If the sign of Add is the same as at least one of the operands, this add		// If the sign of Add is the same as at least one of the operands, this add
// CANNOT overflow. If this can be determined from the known bits of the		// CANNOT overflow. If this can be determined from the known bits of the
// operands the above signedAddMayOverflow() check will have already done so.		// operands the above signedAddMayOverflow() check will have already done so.
// The only other way to improve on the known bits is from an assumption, so		// The only other way to improve on the known bits is from an assumption, so
// call computeKnownBitsFromAssume() directly.		// call computeKnownBitsFromAssume() directly.
bool LHSOrRHSKnownNonNegative =		bool LHSOrRHSKnownNonNegative =
(LHSRange.isAllNonNegative() || RHSRange.isAllNonNegative());		(LHSRange.isAllNonNegative() || RHSRange.isAllNonNegative());
bool LHSOrRHSKnownNegative =		bool LHSOrRHSKnownNegative =
(LHSRange.isAllNegative() || RHSRange.isAllNegative());		(LHSRange.isAllNegative() || RHSRange.isAllNegative());
if (LHSOrRHSKnownNonNegative || LHSOrRHSKnownNegative) {		if (LHSOrRHSKnownNonNegative || LHSOrRHSKnownNegative) {
KnownBits AddKnown(LHSRange.getBitWidth());		KnownBits AddKnown(LHSRange.getBitWidth());
computeKnownBitsFromAssume(		computeKnownBitsFromAssume(
Add, AddKnown, /*Depth=*/0, Query(DL, AC, CxtI, DT, true));		Add, AddKnown, /*Depth=*/0,
		Query(DL, AC, CxtI, DT, true, /*ORE=*/nullptr, TTI));
if ((AddKnown.isNonNegative() && LHSOrRHSKnownNonNegative) ||		if ((AddKnown.isNonNegative() && LHSOrRHSKnownNonNegative) ||
(AddKnown.isNegative() && LHSOrRHSKnownNegative))		(AddKnown.isNegative() && LHSOrRHSKnownNegative))
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;
}		}

return OverflowResult::MayOverflow;		return OverflowResult::MayOverflow;
}		}
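
The "at least two sign bits each" shortcut in the static `computeOverflowForSignedAdd` above can be verified exhaustively at a small width. The sketch below does so for i8; `numSignBits8` and `checkTwoSignBitsRule8` are hypothetical helpers written for this check and are not part of the patch.

```cpp
#include <cassert>
#include <cstdint>

// Number of leading bits equal to the sign bit (always at least 1).
unsigned numSignBits8(int8_t V) {
  uint8_t U = static_cast<uint8_t>(V);
  unsigned Sign = (U >> 7) & 1u;
  unsigned N = 1;
  while (N < 8 && ((U >> (7 - N)) & 1u) == Sign)
    ++N;
  return N;
}

// Walks every pair of i8 values that each have at least two sign bits and
// asserts that their sum stays in range. Returns the number of pairs checked.
unsigned checkTwoSignBitsRule8() {
  unsigned Checked = 0;
  for (int A = -128; A <= 127; ++A) {
    for (int B = -128; B <= 127; ++B) {
      if (numSignBits8(static_cast<int8_t>(A)) < 2 ||
          numSignBits8(static_cast<int8_t>(B)) < 2)
        continue;
      int Sum = A + B; // computed in int, so the true sum is visible
      assert(Sum >= -128 && Sum <= 127 && "two sign bits should rule out overflow");
      ++Checked;
    }
  }
  return Checked;
}
```

Two sign bits restrict an i8 value to [-64, 63], so the sum lies in [-128, 126]; the loop covers the 128 x 128 qualifying pairs.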

OverflowResult llvm::computeOverflowForUnsignedSub(const Value *LHS,		OverflowResult llvm::computeOverflowForUnsignedSub(
const Value *RHS,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const DataLayout &DL,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
AssumptionCache *AC,		const TargetTransformInfo *TTI) {
const Instruction *CxtI,
const DominatorTree *DT) {
// Checking for conditions implied by dominating conditions may be expensive.		// Checking for conditions implied by dominating conditions may be expensive.
// Limit it to usub_with_overflow calls for now.		// Limit it to usub_with_overflow calls for now.
if (match(CxtI,		if (match(CxtI,
m_Intrinsic<Intrinsic::usub_with_overflow>(m_Value(), m_Value())))		m_Intrinsic<Intrinsic::usub_with_overflow>(m_Value(), m_Value())))
if (auto C =		if (auto C =
isImpliedByDomCondition(CmpInst::ICMP_UGE, LHS, RHS, CxtI, DL)) {		isImpliedByDomCondition(CmpInst::ICMP_UGE, LHS, RHS, CxtI, DL)) {
if (*C)		if (*C)
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;
return OverflowResult::AlwaysOverflowsLow;		return OverflowResult::AlwaysOverflowsLow;
}		}
ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(
LHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT);		LHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT, /*ORE=*/nullptr,
		/*UseInstrInfo=*/true, TTI);
ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(
RHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT);		RHS, /*ForSigned=*/false, DL, /*Depth=*/0, AC, CxtI, DT, /*ORE=*/nullptr,
		/*UseInstrInfo=*/true, TTI);
return mapOverflowResult(LHSRange.unsignedSubMayOverflow(RHSRange));		return mapOverflowResult(LHSRange.unsignedSubMayOverflow(RHSRange));
}		}

OverflowResult llvm::computeOverflowForSignedSub(const Value *LHS,		OverflowResult llvm::computeOverflowForSignedSub(
const Value *RHS,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const DataLayout &DL,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
AssumptionCache *AC,		const TargetTransformInfo *TTI) {
const Instruction *CxtI,
const DominatorTree *DT) {
// If LHS and RHS each have at least two sign bits, the subtraction		// If LHS and RHS each have at least two sign bits, the subtraction
// cannot overflow.		// cannot overflow.
if (ComputeNumSignBits(LHS, DL, 0, AC, CxtI, DT) > 1 &&		if (ComputeNumSignBits(LHS, DL, 0, AC, CxtI, DT, /*UseInstrInfo=*/true, TTI) >
ComputeNumSignBits(RHS, DL, 0, AC, CxtI, DT) > 1)		1 &&
		ComputeNumSignBits(RHS, DL, 0, AC, CxtI, DT, /*UseInstrInfo=*/true, TTI) >
		1)
return OverflowResult::NeverOverflows;		return OverflowResult::NeverOverflows;

ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange LHSRange = computeConstantRangeIncludingKnownBits(
LHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT);		LHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT, /*ORE=*/nullptr,
		/*UseInstrInfo=*/true, TTI);
ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(		ConstantRange RHSRange = computeConstantRangeIncludingKnownBits(
RHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT);		RHS, /*ForSigned=*/true, DL, /*Depth=*/0, AC, CxtI, DT, /*ORE=*/nullptr,
		/*UseInstrInfo=*/true, TTI);
return mapOverflowResult(LHSRange.signedSubMayOverflow(RHSRange));		return mapOverflowResult(LHSRange.signedSubMayOverflow(RHSRange));
}		}

bool llvm::isOverflowIntrinsicNoWrap(const WithOverflowInst *WO,		bool llvm::isOverflowIntrinsicNoWrap(const WithOverflowInst *WO,
const DominatorTree &DT) {		const DominatorTree &DT) {
SmallVector<const BranchInst *, 2> GuardingBranches;		SmallVector<const BranchInst *, 2> GuardingBranches;
SmallVector<const ExtractValueInst *, 2> Results;		SmallVector<const ExtractValueInst *, 2> Results;

(266 lines not shown)	bool llvm::isGuaranteedNotToBeUndefOrPoison(const Value *V,
return ::isGuaranteedNotToBeUndefOrPoison(V, CtxI, DT, Depth, false);		return ::isGuaranteedNotToBeUndefOrPoison(V, CtxI, DT, Depth, false);
}		}

bool llvm::isGuaranteedNotToBePoison(const Value *V, const Instruction *CtxI,		bool llvm::isGuaranteedNotToBePoison(const Value *V, const Instruction *CtxI,
const DominatorTree *DT, unsigned Depth) {		const DominatorTree *DT, unsigned Depth) {
return ::isGuaranteedNotToBeUndefOrPoison(V, CtxI, DT, Depth, true);		return ::isGuaranteedNotToBeUndefOrPoison(V, CtxI, DT, Depth, true);
}		}

OverflowResult llvm::computeOverflowForSignedAdd(const AddOperator *Add,		OverflowResult
const DataLayout &DL,		llvm::computeOverflowForSignedAdd(const AddOperator *Add, const DataLayout &DL,
AssumptionCache *AC,		AssumptionCache *AC, const Instruction *CxtI,
const Instruction *CxtI,		const DominatorTree *DT,
const DominatorTree *DT) {		const TargetTransformInfo *TTI) {
return ::computeOverflowForSignedAdd(Add->getOperand(0), Add->getOperand(1),		return ::computeOverflowForSignedAdd(Add->getOperand(0), Add->getOperand(1),
Add, DL, AC, CxtI, DT);		Add, DL, AC, CxtI, DT, TTI);
}		}

OverflowResult llvm::computeOverflowForSignedAdd(const Value *LHS,		OverflowResult llvm::computeOverflowForSignedAdd(
const Value *RHS,		const Value *LHS, const Value *RHS, const DataLayout &DL,
const DataLayout &DL,		AssumptionCache *AC, const Instruction *CxtI, const DominatorTree *DT,
AssumptionCache *AC,		const TargetTransformInfo *TTI) {
const Instruction *CxtI,		return ::computeOverflowForSignedAdd(LHS, RHS, nullptr, DL, AC, CxtI, DT,
const DominatorTree *DT) {		TTI);
return ::computeOverflowForSignedAdd(LHS, RHS, nullptr, DL, AC, CxtI, DT);
}		}

bool llvm::isGuaranteedToTransferExecutionToSuccessor(const Instruction *I) {		bool llvm::isGuaranteedToTransferExecutionToSuccessor(const Instruction *I) {
// Note: An atomic operation isn't guaranteed to return in a reasonable amount		// Note: An atomic operation isn't guaranteed to return in a reasonable amount
// of time because it's possible for another thread to interfere with it for an		// of time because it's possible for another thread to interfere with it for an
// arbitrary length of time, but programs aren't allowed to rely on that.		// arbitrary length of time, but programs aren't allowed to rely on that.

// If there is no successor, then execution can't transfer to it.		// If there is no successor, then execution can't transfer to it.

llvm/lib/Support/KnownBits.cpp

		KnownBits KnownBits::abs() const {
APInt Val = One;		APInt Val = One;
Val.clearSignBit();		Val.clearSignBit();
if (!Val.isNullValue())		if (!Val.isNullValue())
KnownAbs.Zero.setSignBit();		KnownAbs.Zero.setSignBit();

return KnownAbs;		return KnownAbs;
}		}

		KnownBits KnownBits::computeForMul(const KnownBits &LHS, const KnownBits &RHS) {
		unsigned BitWidth = LHS.getBitWidth();

		assert(!LHS.hasConflict() && !RHS.hasConflict());
		// Compute a conservative estimate for high known-0 bits.
		unsigned LeadZ =
		std::max(LHS.countMinLeadingZeros() + RHS.countMinLeadingZeros(),
		BitWidth) -
		BitWidth;
		LeadZ = std::min(LeadZ, BitWidth);

		// The result of the bottom bits of an integer multiply can be
		// inferred by looking at the bottom bits of both operands and
		// multiplying them together.
		// We can infer at least the minimum number of known trailing bits
		// of both operands. Depending on number of trailing zeros, we can
		// infer more bits, because (a*b) <=> ((a/m) * (b/n)) * (m*n) assuming
		// a and b are divisible by m and n respectively.
		// We then calculate how many of those bits are inferrable and set
		// the output. For example, the i8 mul:
		// a = XXXX1100 (12)
		// b = XXXX1110 (14)
		// We know the bottom 3 bits are zero since the first can be divided by
		// 4 and the second by 2, thus having ((12/4) * (14/2)) * (2*4).
		// Applying the multiplication to the trimmed arguments gets:
		// XX11 (3)
		// X111 (7)
		// -------
		// XX11
		// XX11
		// XX11
		// XX11
		// -------
		// XXXXX01
		// Which allows us to infer the 2 LSBs. Since we're multiplying the result
		// by 8, the bottom 3 bits will be 0, so we can infer a total of 5 bits.
		// The proof for this can be described as:
		// Pre: (C1 >= 0) && (C1 < (1 << C5)) && (C2 >= 0) && (C2 < (1 << C6)) &&
		// (C7 == (1 << (umin(countTrailingZeros(C1), C5) +
		// umin(countTrailingZeros(C2), C6) +
		// umin(C5 - umin(countTrailingZeros(C1), C5),
		// C6 - umin(countTrailingZeros(C2), C6)))) - 1)
		// %aa = shl i8 %a, C5
		// %bb = shl i8 %b, C6
		// %aaa = or i8 %aa, C1
		// %bbb = or i8 %bb, C2
		// %mul = mul i8 %aaa, %bbb
		// %mask = and i8 %mul, C7
		// =>
		// %mask = i8 ((C1*C2)&C7)
		// Where C5, C6 describe the known bits of %a, %b
		// C1, C2 describe the known bottom bits of %a, %b.
		// C7 describes the mask of the known bits of the result.
		APInt Bottom0 = LHS.One;
		APInt Bottom1 = RHS.One;

		// How many times we'd be able to divide each argument by 2 (shr by 1).
		// This gives us the number of trailing zeros on the multiplication result.
		unsigned TrailBitsKnown0 = (LHS.Zero | LHS.One).countTrailingOnes();
		unsigned TrailBitsKnown1 = (RHS.Zero | RHS.One).countTrailingOnes();
		unsigned TrailZero0 = LHS.countMinTrailingZeros();
		unsigned TrailZero1 = RHS.countMinTrailingZeros();
		unsigned TrailZ = TrailZero0 + TrailZero1;

		// Figure out the fewest known-bits operand.
		unsigned SmallestOperand =
		std::min(TrailBitsKnown0 - TrailZero0, TrailBitsKnown1 - TrailZero1);
		unsigned ResultBitsKnown = std::min(SmallestOperand + TrailZ, BitWidth);

		APInt BottomKnown =
		Bottom0.getLoBits(TrailBitsKnown0) * Bottom1.getLoBits(TrailBitsKnown1);

		KnownBits Res(BitWidth);
		Res.Zero.setHighBits(LeadZ);
		Res.Zero |= (~BottomKnown).getLoBits(ResultBitsKnown);
		Res.One = BottomKnown.getLoBits(ResultBitsKnown);
		return Res;
		}
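
The trailing-bit reasoning in the comment above can be tried out without LLVM. The following is only a sketch: `Known8`, `countTrailingOnes8`, `loBits8` and `knownLowBitsOfMul` are hypothetical stand-ins written for this illustration (8-bit only, no leading-zero handling), not part of the patch or of the `KnownBits` API.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical 8-bit stand-in for llvm::KnownBits (illustration only).
struct Known8 {
  uint8_t Zero; // bits known to be 0
  uint8_t One;  // bits known to be 1
};

// Number of consecutive set bits starting at bit 0.
static unsigned countTrailingOnes8(uint8_t V) {
  unsigned N = 0;
  while (N < 8 && ((V >> N) & 1u))
    ++N;
  return N;
}

// Keep only the N lowest bits of V, truncated to 8 bits.
static uint8_t loBits8(unsigned V, unsigned N) {
  return N >= 8 ? uint8_t(V) : uint8_t(V & ((1u << N) - 1u));
}

// Known low bits of LHS * RHS, following the trailing-bit part of the
// computeForMul comment. High known-zero bits are deliberately ignored.
Known8 knownLowBitsOfMul(Known8 LHS, Known8 RHS) {
  unsigned TrailBitsKnown0 = countTrailingOnes8(uint8_t(LHS.Zero | LHS.One));
  unsigned TrailBitsKnown1 = countTrailingOnes8(uint8_t(RHS.Zero | RHS.One));
  unsigned TrailZero0 = countTrailingOnes8(LHS.Zero);
  unsigned TrailZero1 = countTrailingOnes8(RHS.Zero);
  unsigned TrailZ = TrailZero0 + TrailZero1;

  // The operand with the fewest known bits above its trailing zeros
  // limits how much of the product can be inferred.
  unsigned SmallestOperand =
      std::min(TrailBitsKnown0 - TrailZero0, TrailBitsKnown1 - TrailZero1);
  unsigned ResultBitsKnown = std::min(SmallestOperand + TrailZ, 8u);

  // Multiply the known low parts of both operands.
  unsigned BottomKnown =
      unsigned(loBits8(LHS.One, TrailBitsKnown0)) *
      unsigned(loBits8(RHS.One, TrailBitsKnown1));

  Known8 Res;
  Res.Zero = loBits8(~BottomKnown, ResultBitsKnown);
  Res.One = loBits8(BottomKnown, ResultBitsKnown);
  return Res;
}
```

For the example in the comment, `a = XXXX1100` is `Known8{0x03, 0x0C}` and `b = XXXX1110` is `Known8{0x01, 0x0E}`; the sketch then reports five known low bits, `01000`.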

KnownBits &KnownBits::operator&=(const KnownBits &RHS) {		KnownBits &KnownBits::operator&=(const KnownBits &RHS) {
// Result bit is 0 if either operand bit is 0.		// Result bit is 0 if either operand bit is 0.
Zero |= RHS.Zero;		Zero |= RHS.Zero;
// Result bit is 1 if both operand bits are 1.		// Result bit is 1 if both operand bits are 1.
One &= RHS.One;		One &= RHS.One;
return *this;		return *this;
}		}


llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp

		if (!TM.isNoopAddrSpaceCast(OldAS, NewAS)) {
// All valid 64-bit to 32-bit casts work by chopping off the high		// All valid 64-bit to 32-bit casts work by chopping off the high
// bits. Any masking only clearing the low bits will also apply in the new		// bits. Any masking only clearing the low bits will also apply in the new
// address space.		// address space.
if (DL.getPointerSizeInBits(OldAS) != 64 ||		if (DL.getPointerSizeInBits(OldAS) != 64 ||
DL.getPointerSizeInBits(NewAS) != 32)		DL.getPointerSizeInBits(NewAS) != 32)
return nullptr;		return nullptr;

// TODO: Do we need to thread more context in here?		// TODO: Do we need to thread more context in here?
KnownBits Known = computeKnownBits(MaskOp, DL, 0, nullptr, II);		KnownBits Known = ::computeKnownBits(MaskOp, DL, 0, nullptr, II);
if (Known.countMinLeadingOnes() < 32)		if (Known.countMinLeadingOnes() < 32)
return nullptr;		return nullptr;

DoTruncate = true;		DoTruncate = true;
}		}

IRBuilder<> B(II);		IRBuilder<> B(II);
if (DoTruncate) {		if (DoTruncate) {

llvm/unittests/Analysis/ValueTrackingTest.cpp

//===- ValueTrackingTest.cpp - ValueTracking tests ------------------------===//		//===- ValueTrackingTest.cpp - ValueTracking tests ------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/Analysis/AssumptionCache.h"		#include "llvm/Analysis/AssumptionCache.h"
		#include "llvm/Analysis/TargetLibraryInfo.h"
		#include "llvm/Analysis/TargetTransformInfoImpl.h"
#include "llvm/AsmParser/Parser.h"		#include "llvm/AsmParser/Parser.h"
#include "llvm/IR/ConstantRange.h"		#include "llvm/IR/ConstantRange.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/InstIterator.h"		#include "llvm/IR/InstIterator.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/KnownBits.h"		#include "llvm/Support/KnownBits.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
		#include "llvm/Target/TargetMachine.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"

using namespace llvm;		using namespace llvm;

namespace {		namespace {

static Instruction &findInstructionByName(Function *F, StringRef Name) {		static Instruction &findInstructionByName(Function *F, StringRef Name) {
for (Instruction &I : instructions(F))		for (Instruction &I : instructions(F))
		parseAssembly("define void @test() {\n"
"declare i32 @any_num()\n");		"declare i32 @any_num()\n");
AssumptionCache AC(*F);		AssumptionCache AC(*F);
KnownBits Known = computeKnownBits(A, M->getDataLayout(), /* Depth */ 0, &AC,		KnownBits Known = computeKnownBits(A, M->getDataLayout(), /* Depth */ 0, &AC,
F->front().getTerminator());		F->front().getTerminator());
EXPECT_EQ(Known.Zero.getZExtValue(), 31u);		EXPECT_EQ(Known.Zero.getZExtValue(), 31u);
EXPECT_EQ(Known.One.getZExtValue(), 0u);		EXPECT_EQ(Known.One.getZExtValue(), 0u);
}		}

		TEST_F(ComputeKnownBitsTest, ComputeKnownBitsAddWithRange) {
		parseAssembly("define void @test(i64* %p) {\n"
		" %A = load i64, i64* %p, !range !{i64 64, i64 65536}\n"
		" %APlus512 = add i64 %A, 512\n"
		" %c = icmp ugt i64 %APlus512, 523\n"
		" call void @llvm.assume(i1 %c)\n"
		" ret void\n"
		"}\n"
		"declare void @llvm.assume(i1)\n");
		AssumptionCache AC(*F);
		KnownBits Known = computeKnownBits(A, M->getDataLayout(), /* Depth */ 0, &AC,
		F->front().getTerminator());
		EXPECT_EQ(Known.Zero.getZExtValue(), ~(65536llu - 1));
		EXPECT_EQ(Known.One.getZExtValue(), 0u);
		Instruction &APlus512 = findInstructionByName(F, "APlus512");
		Known = computeKnownBits(&APlus512, M->getDataLayout(), /* Depth */ 0, &AC,
		F->front().getTerminator());
		// We know of one less zero because 512 may have produced a 1 that
		// got carried all the way to the first trailing zero.
		EXPECT_EQ(Known.Zero.getZExtValue(), (~(65536llu - 1)) << 1);
		EXPECT_EQ(Known.One.getZExtValue(), 0u);
		}
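
The expectations in `ComputeKnownBitsAddWithRange` can be reproduced with a simplified model that looks only at the largest value a quantity can take. This is a sketch under that assumption; `knownZeroMaskForMax` is a hypothetical helper, not something the patch or ValueTracking provides.

```cpp
#include <cstdint>

// Mask of the high bits that are zero for every value in [0, MaxValue].
// Simplified model: only the upper bound of the range is considered.
uint64_t knownZeroMaskForMax(uint64_t MaxValue) {
  uint64_t Mask = ~uint64_t(0);
  // Drop low bits from the mask until it no longer overlaps MaxValue.
  while (Mask & MaxValue)
    Mask <<= 1;
  return Mask;
}
```

With `!range !{i64 64, i64 65536}` the largest value of `%A` is 65535, which gives `~(65536 - 1)`. After adding 512 the largest value is 66047, which needs one more bit, so the mask loses exactly one known zero.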

		// 512 + [32, 64) doesn't produce overlapping bits.
		// Make sure we get all the individual bits properly.
		TEST_F(ComputeKnownBitsTest, ComputeKnownBitsAddWithRangeNoOverlap) {
		parseAssembly("define void @test(i64* %p) {\n"
		" %A = load i64, i64* %p, !range !{i64 32, i64 64}\n"
		" %APlus512 = add i64 %A, 512\n"
		" %c = icmp ugt i64 %APlus512, 523\n"
		" call void @llvm.assume(i1 %c)\n"
		" ret void\n"
		"}\n"
		"declare void @llvm.assume(i1)\n");
		AssumptionCache AC(*F);
		KnownBits Known = computeKnownBits(A, M->getDataLayout(), /* Depth */ 0, &AC,
		F->front().getTerminator());
		EXPECT_EQ(Known.Zero.getZExtValue(), ~(64llu - 1));
		EXPECT_EQ(Known.One.getZExtValue(), 32u);
		Instruction &APlus512 = findInstructionByName(F, "APlus512");
		Known = computeKnownBits(&APlus512, M->getDataLayout(), /* Depth */ 0, &AC,
		F->front().getTerminator());
		qcolombet: This illustrates how the target can put some extra effort on some instructions.
		EXPECT_EQ(Known.Zero.getZExtValue(), ~512llu & ~(64llu - 1));
		EXPECT_EQ(Known.One.getZExtValue(), 512u | 32u);
		}
		RKSimon: These 2 tests above look like they can be pre-committed?
		qcolombet: Correct.
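
To see why the no-overlap test can expect every individual bit, here is a minimal sketch of a carry-based known-bits addition on plain 64-bit masks. It is written along the lines of what `KnownBits::computeForAddSub` is understood to do for a plain add, but `Known64` and `knownBitsOfAdd` are hypothetical names and this is not the code under review.

```cpp
#include <cstdint>

// Hypothetical 64-bit stand-in for llvm::KnownBits (illustration only).
struct Known64 {
  uint64_t Zero; // bits known to be 0
  uint64_t One;  // bits known to be 1
};

// Known bits of LHS + RHS (no carry-in, wrapping arithmetic).
Known64 knownBitsOfAdd(Known64 LHS, Known64 RHS) {
  // Sum of the largest possible operands and of the smallest possible ones.
  uint64_t PossibleSumZero = ~LHS.Zero + ~RHS.Zero;
  uint64_t PossibleSumOne = LHS.One + RHS.One;

  // Carry bits that are known to be 0 or known to be 1.
  uint64_t CarryKnownZero = ~(PossibleSumZero ^ LHS.Zero ^ RHS.Zero);
  uint64_t CarryKnownOne = PossibleSumOne ^ LHS.One ^ RHS.One;

  // A result bit is known only if both operand bits and the carry are known.
  uint64_t LHSKnown = LHS.Zero | LHS.One;
  uint64_t RHSKnown = RHS.Zero | RHS.One;
  uint64_t CarryKnown = CarryKnownZero | CarryKnownOne;
  uint64_t Known = LHSKnown & RHSKnown & CarryKnown;

  return Known64{~PossibleSumZero & Known, PossibleSumOne & Known};
}
```

For `%A` in `[32, 64)` (`Zero = ~63`, `One = 32`) plus the constant 512, no carry can reach bit 9, so the sketch yields `Zero = ~512 & ~63` and `One = 512 | 32`, matching the test. For `%A` in `[64, 65536)` it yields one fewer known zero and no known ones.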

		namespace {
		class TestGEPTTIImpl : public TargetTransformInfoImplCRTPBase<TestGEPTTIImpl> {
		typedef TargetTransformInfoImplCRTPBase<TestGEPTTIImpl> BaseT;
		friend BaseT;

		public:
		// Override the handling of GEPs to compute the full bits.
		// By default the generic analysis doesn't spend a lot of time
		// on these as the only interesting information for most
		// optimizations is the alignment.
		bool computeKnownBits(const TargetTransformInfo *TTI, const Value *V,
		KnownBits &Known, const DataLayout &DL, unsigned Depth,
		AssumptionCache *AC, const Instruction *CxtI,
		const DominatorTree *DT, OptimizationRemarkEmitter *ORE,
		bool UseInstrInfo) const {
		if (!isa<Operator>(V))
		return false;
		const Operator *I = cast<Operator>(V);
		unsigned BitWidth = Known.getBitWidth();

		switch (I->getOpcode()) {
		default:
		return false;
		case Instruction::GetElementPtr: {
		// Analyze all of the subscripts of this getelementptr instruction
		// to determine if we can prove known low zero bits.
		KnownBits LocalKnown(BitWidth);
		::computeKnownBits(I->getOperand(0), LocalKnown, DL, Depth + 1, AC, CxtI,
		DT, ORE, UseInstrInfo, TTI);
		KnownBits AddrKnownBits(LocalKnown);

		unsigned TrailZ = LocalKnown.countMinTrailingZeros();

		gep_type_iterator GTI = gep_type_begin(I);
		// If the inbounds keyword is not present, the offsets are added to the
		// base address with silently-wrapping two’s complement arithmetic.
		bool IsInBounds = cast<GEPOperator>(I)->isInBounds();
		for (unsigned i = 1, e = I->getNumOperands(); i != e; ++i, ++GTI) {
		Lint: Pre-merge checks: clang-tidy: warning: invalid case style for variable 'i' [readability-identifier-naming]; warning: invalid case style for variable 'e' [readability-identifier-naming]
		if (TrailZ == 0 && AddrKnownBits.isUnknown())
		break;
		Value *Index = I->getOperand(i);

		unsigned IndexBitWidth = Index->getType()->getScalarSizeInBits();
		KnownBits IndexBits(IndexBitWidth);
		if (StructType *STy = GTI.getStructTypeOrNull()) {
		// Handle struct member offset arithmetic.

		// Handle case when index is vector zeroinitializer
		Constant *CIndex = cast<Constant>(Index);
		if (CIndex->isZeroValue())
		continue;

		if (CIndex->getType()->isVectorTy())
		Index = CIndex->getSplatValue();

		unsigned Idx = cast<ConstantInt>(Index)->getZExtValue();
		const StructLayout *SL = DL.getStructLayout(STy);
		uint64_t Offset = SL->getElementOffset(Idx);
		if (!AddrKnownBits.isUnknown()) {
		IndexBits.Zero = ~Offset;
		IndexBits.One = Offset;
		}
		TrailZ = std::min<unsigned>(TrailZ, countTrailingZeros(Offset));
		} else {
		// Handle array index arithmetic.
		Type *IndexedTy = GTI.getIndexedType();
		if (!IndexedTy->isSized()) {
		TrailZ = 0;
		AddrKnownBits.resetAll();
		break;
		}
		::computeKnownBits(Index, IndexBits, DL, Depth + 1, AC, CxtI, DT, ORE,
		UseInstrInfo, TTI);
		TypeSize IndexTypeSize = DL.getTypeAllocSize(IndexedTy);
		uint64_t TypeSizeInBytes = IndexTypeSize.getKnownMinSize();
		if (IndexTypeSize.isScalable())
		AddrKnownBits.resetAll();
		if (!AddrKnownBits.isUnknown()) {
		// Multiply by current sizeof type.
		// &A[i] == A + i * sizeof(*A[i]).
		KnownBits ScalingFactor(IndexBitWidth);
		ScalingFactor.Zero = ~TypeSizeInBytes;
		ScalingFactor.One = TypeSizeInBytes;
		IndexBits = KnownBits::computeForMul(IndexBits, ScalingFactor);
		}
		TrailZ =
		std::min(TrailZ, unsigned(countTrailingZeros(TypeSizeInBytes) +
		IndexBits.countMinTrailingZeros()));
		}
		if (AddrKnownBits.isUnknown())
		continue;

		// If the offsets have a different width from the pointer, according
		// to the language reference we need to sign-extend or truncate them
		// to the width of the pointer.
		IndexBits = IndexBits.sextOrTrunc(BitWidth);

		AddrKnownBits = KnownBits::computeForAddSub(
		/*Add=*/true,
		/*NSW=*/IsInBounds, AddrKnownBits, IndexBits);
		}
		if (!AddrKnownBits.isUnknown())
		Known = AddrKnownBits;
		else
		Known.Zero.setLowBits(TrailZ);
		return true;
		}
		}
		}

		public:
		explicit TestGEPTTIImpl(const Function &F)
		: BaseT(F.getParent()->getDataLayout()) {}
		};
		} // End anonymous namespace.
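
The hook above returns `false` when the target has nothing to add and `true` once it has filled in `Known`. How the generic analysis consumes that result is not visible in this part of the diff, so the following is only a sketch of one plausible dispatch, assuming the hook is consulted first and the generic code is the fallback. All names (`Node`, `Op`, `TargetHook`, `genericKnownBits`, `computeKnownBitsWithHook`, `exampleGepHook`) are hypothetical.

```cpp
#include <cstdint>
#include <functional>

// Hypothetical stand-ins for KnownBits and for an IR value.
struct Known64 {
  uint64_t Zero = 0;
  uint64_t One = 0;
};
enum class Op { Constant, Gep };
struct Node {
  Op Opcode;
  uint64_t Imm;
};

// Hypothetical target hook: returns true if it computed Known itself.
using TargetHook = std::function<bool(const Node &, Known64 &)>;

// Generic analysis: only constants are understood in this toy model.
Known64 genericKnownBits(const Node &N) {
  Known64 K;
  if (N.Opcode == Op::Constant) {
    K.One = N.Imm;
    K.Zero = ~N.Imm;
  }
  return K;
}

// Ask the target first (assumption), fall back to the generic analysis.
Known64 computeKnownBitsWithHook(const Node &N, const TargetHook &Hook) {
  Known64 Known;
  if (Hook && Hook(N, Known))
    return Known;
  return genericKnownBits(N);
}

// Example hook: pretends the target knows GEP results are 4-byte aligned.
bool exampleGepHook(const Node &N, Known64 &Known) {
  if (N.Opcode != Op::Gep)
    return false; // not handled, let the generic code run
  Known.Zero = 0x3;
  Known.One = 0;
  return true;
}
```

Passing an empty `TargetHook` models a target that does not override the hook, which keeps the generic behavior and its compile-time cost unchanged.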

		TEST_F(ComputeKnownBitsTest, ComputeKnownBitsGEPWithRange) {
		parseAssembly(
		"define void @test(i64* %p) {\n"
		" %A = load i64, i64* %p, !range !{i64 64, i64 65536}\n"
		qcolombetAuthorUnsubmitted Done Reply Inline Actions Note: This test illustrates how it would work with a custom TTI but is not commit-able as is because it relies on `BasicTTIImplBase`, which is part of the CodeGen library. qcolombet: Note: This test illustrates how it would work with a custom TTI but is not commit-able as is…
		qcolombetAuthorUnsubmitted Done Reply Inline Actions Fixed that part. qcolombet: Fixed that part.
		" %APtr = inttoptr i64 %A to float*"
		" %APtrPlus512 = getelementptr float, float* %APtr, i32 128\n"
		" %c = icmp ugt float* %APtrPlus512, inttoptr (i32 523 to float*)\n"
		" call void @llvm.assume(i1 %c)\n"
		" ret void\n"
		"}\n"
		"declare void @llvm.assume(i1)\n");
		AssumptionCache AC(*F);
		TestGEPTTIImpl TTIImpl(*F);
		TargetTransformInfo TTI(TTIImpl);
		KnownBits Known = computeKnownBits(
		A, M->getDataLayout(), /* Depth */ 0, &AC, F->front().getTerminator(),
		/*DT=*/nullptr, /*ORE=*/nullptr, /*UseInstrInfo=*/true, &TTI);
		EXPECT_EQ(Known.Zero.getZExtValue(), ~(65536llu - 1));
		EXPECT_EQ(Known.One.getZExtValue(), 0u);
		Instruction &APtrPlus512 = findInstructionByName(F, "APtrPlus512");
		Known = computeKnownBits(&APtrPlus512, M->getDataLayout(), /* Depth */ 0, &AC,
		F->front().getTerminator(), /*DT=*/nullptr,
		/*ORE=*/nullptr, /*UseInstrInfo=*/true, &TTI);
		// We know of one less zero because 512 may have produced a 1 that
		// got carried all the way to the first trailing zero.
		EXPECT_EQ(Known.Zero.getZExtValue(), ~(65536llu - 1) << 1);
		EXPECT_EQ(Known.One.getZExtValue(), 0u);
		}

		// 4*128 + [32, 64) doesn't produce overlapping bits.
		// Make sure we get all the individual bits properly.
		// This test is useful to check that we account for the scaling factor
		// in the gep. Indeed, gep float, [32,64), 128 is not 128 + [32,64).
		TEST_F(ComputeKnownBitsTest, ComputeKnownBitsGEPWithRangeNoOverlap) {
		parseAssembly(
		"define void @test(i64* %p) {\n"
		" %A = load i64, i64* %p, !range !{i64 32, i64 64}\n"
		" %APtr = inttoptr i64 %A to float*"
		" %APtrPlus512 = getelementptr float, float* %APtr, i32 128\n"
		" %c = icmp ugt float* %APtrPlus512, inttoptr (i32 523 to float*)\n"
		" call void @llvm.assume(i1 %c)\n"
		" ret void\n"
		"}\n"
		"declare void @llvm.assume(i1)\n");
		AssumptionCache AC(*F);
		TestGEPTTIImpl TTIImpl(*F);
		TargetTransformInfo TTI(TTIImpl);
		KnownBits Known = computeKnownBits(
		A, M->getDataLayout(), /* Depth */ 0, &AC, F->front().getTerminator(),
		/*DT=*/nullptr, /*ORE=*/nullptr, /*UseInstrInfo=*/true, &TTI);
		EXPECT_EQ(Known.Zero.getZExtValue(), ~(64llu - 1));
		EXPECT_EQ(Known.One.getZExtValue(), 32u);
		Instruction &APtrPlus512 = findInstructionByName(F, "APtrPlus512");
		Known = computeKnownBits(&APtrPlus512, M->getDataLayout(), /* Depth */ 0, &AC,
		F->front().getTerminator(),
		/*DT=*/nullptr, /*ORE=*/nullptr,
		/*UseInstrInfo=*/true, &TTI);
		// We know of one less zero because 512 may have produced a 1 that
		// got carried all the way to the first trailing zero.
		EXPECT_EQ(Known.Zero.getZExtValue(), ~512llu & ~(64llu - 1));
		EXPECT_EQ(Known.One.getZExtValue(), 512u | 32u);
		}
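
The point about the scaling factor can be checked by brute force: a `getelementptr float` with index 128 adds `128 * sizeof(float) = 512` bytes, not 128. The sketch below enumerates every base address in a range and intersects the bits. `knownBitsOfGepOverRange` is a hypothetical helper for illustration, and the element size of 4 is the usual size of `float`, passed in explicitly.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical 64-bit stand-in for llvm::KnownBits (illustration only).
struct Known64 {
  uint64_t Zero; // bits that are 0 for every address seen
  uint64_t One;  // bits that are 1 for every address seen
};

// Bits shared by Base + Index * ElemSize for every Base in [Lo, Hi).
Known64 knownBitsOfGepOverRange(uint64_t Lo, uint64_t Hi, uint64_t Index,
                                uint64_t ElemSize) {
  assert(Lo < Hi && "range must not be empty");
  Known64 K{~uint64_t(0), ~uint64_t(0)};
  for (uint64_t Base = Lo; Base < Hi; ++Base) {
    uint64_t Addr = Base + Index * ElemSize; // byte offset is scaled
    K.Zero &= ~Addr;
    K.One &= Addr;
  }
  return K;
}
```

Calling `knownBitsOfGepOverRange(32, 64, 128, 4)` gives `Zero = ~512 & ~63` and `One = 512 | 32`, the values the test expects, whereas an element size of 1 would give `One = 128 | 32` instead.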

class IsBytewiseValueTest : public ValueTrackingTest,		class IsBytewiseValueTest : public ValueTrackingTest,
public ::testing::WithParamInterface<		public ::testing::WithParamInterface<
std::pair<const char *, const char *>> {		std::pair<const char *, const char *>> {
protected:		protected:
};		};

const std::pair<const char *, const char *> IsBytewiseValueTests[] = {		const std::pair<const char *, const char *> IsBytewiseValueTests[] = {
{		{

Revision Contents

Diff 292053
