This is an archive of the discontinued LLVM Phabricator instance.

Differential D63243

[WIP] Adjust the users of dereferenceable wrt. dereferenceable_globally
Needs Revision · Public

Authored by jdoerfert on Jun 12 2019, 11:02 PM.

Details

Reviewers

efriedma
hfinkel
chandlerc
nlopes
a.elovikov
reames
sanjoy

Summary

Prototype how we can make use of dereferenceable and
dereferenceable_globally at the same time. This depends on attributes
not in tree yet. This has not been tested nor have I updated the test
cases yet.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 33412
Build 33411: arc lint + arc unit

Event Timeline

jdoerfert created this revision. · Jun 12 2019, 11:02 PM

Herald added a project: Restricted Project. · Jun 12 2019, 11:02 PM

Herald added subscribers: bollu, hiraditya.

Harbormaster completed remote builds in B33309: Diff 204428. · Jun 12 2019, 11:04 PM

jdoerfert added parent revisions: D61652: [Attr] Introduce dereferenceable_globally, D49165: Add, and infer, a nofree function attribute, D62766: [Attributor] Deduce "nosync" function attribute. · Jun 12 2019, 11:05 PM

sstefan1 added a subscriber: sstefan1. · Jun 13 2019, 4:12 AM

reames added inline comments. · Jun 13 2019, 9:31 AM

llvm/include/llvm/IR/Value.h
583	An API suggestion. Instead of adding a context pointer here, add a boolean OnlyAtDef out param. Then introduce a helper function in ValueTracking which does the inference over the IR (i.e. leave this as returning basic facts about the value, and move the analysis into an analysis). Or if you want, call the param AssumeNoFree, or whatever.
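
For illustration only, a minimal sketch of the API shape suggested in this comment, written against toy types rather than the real LLVM classes. `ToyValue`, its two fields, and `isDereferenceableAtUse` are hypothetical names; only `OnlyAtDef` and `getPointerDereferenceableBytes` are taken from the discussion. The `MaybeFreedSinceDef` flag stands in for whatever IR walk the analysis-side helper would perform.

```cpp
#include <cassert>
#include <cstdint>

// Toy stand-in for llvm::Value (assumption: not the real class).
struct ToyValue {
  std::uint64_t DerefGloballyBytes = 0; // models dereferenceable_globally(N)
  std::uint64_t DerefBytes = 0;         // models dereferenceable(N)

  // "Raw" query: returns a basic fact about the value. OnlyAtDef is set when
  // the fact is only guaranteed at the definition of the value.
  std::uint64_t getPointerDereferenceableBytes(bool &OnlyAtDef) const {
    OnlyAtDef = false;
    if (DerefGloballyBytes != 0)
      return DerefGloballyBytes;
    OnlyAtDef = DerefBytes != 0;
    return DerefBytes;
  }
};

// Analysis-side helper (hypothetical): combines the raw fact with the result
// of a separate "could the memory have been freed since the def" analysis.
inline bool isDereferenceableAtUse(const ToyValue &V, std::uint64_t Size,
                                   bool MaybeFreedSinceDef) {
  assert(Size > 0 && "query must ask for at least one byte");
  bool OnlyAtDef = false;
  const std::uint64_t Bytes = V.getPointerDereferenceableBytes(OnlyAtDef);
  if (Bytes < Size)
    return false;
  // A globally known fact holds everywhere; an at-def fact only holds if
  // nothing could have freed the memory in between.
  return !OnlyAtDef || !MaybeFreedSinceDef;
}
```

In this shape the value-level query stays a plain property lookup, and everything that needs to look at surrounding IR lives in the helper.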

Update according to API suggestion

llvm/include/llvm/IR/Value.h
583	I like it, done. Though, I do need the location of the definition, so I made it an `Instruction *&` not a `bool &`, what do you think?

Harbormaster completed remote builds in B33410: Diff 204816. · Jun 14 2019, 11:06 AM

Update all uses of isDereferenceablePointer

Harbormaster completed remote builds in B33412: Diff 204818. · Jun 14 2019, 11:18 AM

courbet added a subscriber: courbet. · Jun 17 2019, 2:16 AM

courbet added inline comments.

llvm/lib/Transforms/Scalar/MergeICmps.cpp
165	I don't see where in the patch (or the rest of the stack of changes) the signature of `isDereferenceablePointer` is changed to non-const Instruction.

reames added inline comments. · Jun 17 2019, 2:10 PM

llvm/include/llvm/IR/Value.h
583	I'd prefer the boolean given there's only two options: 1) on def, and 2) globally. I'm willing to defer to you on this if you feel strongly though.

jdoerfert marked 2 inline comments as done. · Jun 18 2019, 1:18 AM

jdoerfert added inline comments.

llvm/include/llvm/IR/Value.h
583	My problem is that for option 1) "on def", the information that is returned would not be sufficient for most users. In the "on def" case you also need the "definition" to query `maybeFreedInBetween`. If I returned a boolean, I would need to implement a second traversal that does the same thing just to find the definition. Does this make sense?
llvm/lib/Transforms/Scalar/MergeICmps.cpp
165	True, I removed the `const_cast`. That was my bad.

reames added inline comments. · Jun 18 2019, 8:10 AM

llvm/include/llvm/IR/Value.h
583	No, not really. You can only ask this question on a particular value. (i.e. argument or return value or load) That is the definition. If you have the instruction to query the property, you have the def, so I really don't understand your point.

jdoerfert marked an inline comment as done. · Jun 19 2019, 3:06 AM

jdoerfert added inline comments.

llvm/include/llvm/IR/Value.h
583	> No, not really. You can only ask this question on a particular value. (i.e. argument or return value or load) That is the definition. If you have the instruction to query the property, you have the def, so I really don't understand your point.

I'm also confused. Let me try to explain again what I think we should do/need and why:

We call `getPointerDereferenceableBytes` on a pointer (`llvm::Value`) to get the "known dereferenceable bytes". This method determines that by traversing the operands until a "definition" is found.

If the definition is natively dereferenceable, e.g., stack or global allocations, `Instruction *&DerefKnownBefore` is `null` and there is no change to the behavior we had.

If the definition is "known dereferenceable" because of attributes or metadata that says `_globally`, `Instruction *&DerefKnownBefore` is `null` and there is no change to the behavior we had.

Finally, if the definition is "known dereferenceable" because of attributes or metadata that does not end in `_globally`, `Instruction *&DerefKnownBefore` is set to the `llvm::Instruction` right after the "definition" with the attribute/metadata. This means that the number of dereferenceable bytes is known for that position but not necessarily any other position in the CFG. The caller can then use `maybeFreedInBetween` to determine if it holds for the position it is interested in.

To have a tight "is potentially freed", one needs to know where it is "known dereferenceable" and where we are interested in "is dereferenceable".
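
The three cases described in this comment can be modelled with a small, self-contained sketch. This is a toy model under stated assumptions, not the patch itself: instructions are reduced to positions in a straight-line function, and `ToyInst`, `ToyPointer`, `DefKind`, `FreePosition`, and `isDereferenceableAt` are hypothetical names. Only `getPointerDereferenceableBytes`, `DerefKnownBefore`, and `maybeFreedInBetween` come from the discussion.

```cpp
#include <cassert>
#include <cstdint>

// Toy instruction: just its position in a straight-line function.
struct ToyInst { int Position; };

// How the pointer's definition makes it dereferenceable (hypothetical enum).
enum class DefKind { NativeAllocation, GloballyAttr, AtDefAttr };

struct ToyPointer {
  DefKind Kind;
  std::uint64_t Bytes;
  const ToyInst *AfterDef; // instruction right after the definition
};

// Mirrors the described contract: DerefKnownBefore stays null unless the
// fact comes from a non-"_globally" attribute or metadata.
inline std::uint64_t
getPointerDereferenceableBytes(const ToyPointer &P,
                               const ToyInst *&DerefKnownBefore) {
  DerefKnownBefore = (P.Kind == DefKind::AtDefAttr) ? P.AfterDef : nullptr;
  return P.Bytes;
}

// Toy replacement for the IR walk: a single possible free at FreePosition
// (-1 means there is none) lies between Src (inclusive) and Dst (exclusive).
inline bool maybeFreedInBetween(const ToyInst &Src, const ToyInst &Dst,
                                int FreePosition) {
  return FreePosition >= Src.Position && FreePosition < Dst.Position;
}

// Caller side: only consult maybeFreedInBetween when a position was returned.
inline bool isDereferenceableAt(const ToyPointer &P, std::uint64_t Size,
                                const ToyInst &Use, int FreePosition) {
  const ToyInst *DerefKnownBefore = nullptr;
  if (getPointerDereferenceableBytes(P, DerefKnownBefore) < Size)
    return false;
  return !DerefKnownBefore ||
         !maybeFreedInBetween(*DerefKnownBefore, Use, FreePosition);
}
```

The point of returning a position rather than a flag is visible in `isDereferenceableAt`: the caller can hand the returned instruction straight to the in-between check without locating the definition again.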

@reames ping?

Happy to talk offline (call, chat, etc..) since we seem to be talking past each other again.

llvm/lib/IR/Value.cpp
689	Same point as before, but explained differently in the hopes this makes sense. DerefKnownBefore only ever takes one of two values: the first instruction within the function, or the one after the instruction we invoked this function on. As such, the return value adds no information which can't be easily determined from a single boolean.

This revision now requires changes to proceed. · Jun 28 2019, 4:37 PM

jdoerfert marked an inline comment as done. · Jul 1 2019, 1:52 PM

jdoerfert added inline comments.

llvm/lib/IR/Value.cpp
689	I think I get it now, and I hope that means I can convince you we need more than a boolean, at least in the future and if we want to make it easier for the clients.

First, in the future we could handle any subset of the following cases, which would make the boolean insufficient: casts; select/phi; gep with constant offsets and dereferenceable base pointer; inbounds geps with positive offsets w/ or w/o dereferenceable base pointer.

Second, the clients would need to do "more" in the boolean case, as they have to check, cast, and advance the pointer value before they can call `maybeFreedInBetween`. Checked means `isa<Argument>(ptr) || isa<Instruction>(ptr)`, as we otherwise have an implicit contract that the bool is only set for instructions and arguments. Cast into the right thing, instruction or argument, and advanced in case of the former. I'd argue that will more easily lead to errors or wrong usage.
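
A minimal sketch of the "check, cast, advance" steps a client would need in the boolean design, again with toy types instead of the LLVM class hierarchy. `ToyInst`, `ToyFunction`, `ToyValue`, and `findKnownDerefPoint` are hypothetical names introduced only for this illustration; the kind tag stands in for the `isa<Argument>` / `isa<Instruction>` checks.

```cpp
#include <cassert>

// Toy instruction with a link to the instruction that follows it.
struct ToyInst { const ToyInst *Next = nullptr; };

// Toy function: only its first instruction matters here.
struct ToyFunction { const ToyInst *Entry = nullptr; };

// Toy value with a kind tag in place of isa<>/cast<>.
struct ToyValue {
  enum Kind { Argument, Instruction, Constant };
  Kind K = Constant;
  const ToyFunction *Parent = nullptr; // set for arguments
  const ToyInst *Inst = nullptr;       // set for instructions
};

// What each client would have to do after receiving OnlyAtDef == true:
// recover the position from which the dereferenceability fact is known.
inline const ToyInst *findKnownDerefPoint(const ToyValue &V) {
  switch (V.K) {
  case ToyValue::Argument:
    // Check + cast: the fact holds from the first instruction of the function.
    assert(V.Parent && "argument without parent function");
    return V.Parent->Entry;
  case ToyValue::Instruction:
    // Check + cast + advance: the fact holds right after the defining
    // instruction, not at it.
    assert(V.Inst && "instruction value without instruction");
    return V.Inst->Next;
  case ToyValue::Constant:
    // Implicit contract: the flag is never set for other kinds of values.
    return nullptr;
  }
  return nullptr;
}
```

The helper is short, but in the boolean design every caller either duplicates it or has to know it exists, which is the usability concern raised in the comment.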

reames added inline comments. · Jul 1 2019, 4:02 PM

llvm/lib/IR/Value.cpp
689	No, I'm not convinced. I think what you're describing is a need for two layers of APIs. There's the "raw" API on value which returns properties of that value, and then there's a more holistic API which returns analysis results and should live in Analysis/Something.h. I'll also point out that future proofing code is a specific anti-pattern that we try very strongly to avoid. Choosing good abstractions is encouraged, but writing code which is not required by the current patch is not.

jdoerfert mentioned this in D64258: [InferFuncAttributes] extend 'dereferenceable' attribute based on loads. · Jul 5 2019, 3:06 PM

uenoku added a subscriber: uenoku. · Jul 13 2019, 1:17 AM

spatel mentioned this in D64551: [X86] EltsFromConsecutiveLoads - support common source loads. · Jul 18 2019, 5:41 AM

dtemirbulatov added a subscriber: dtemirbulatov. · Sep 2 2019, 5:08 AM

sanjoy resigned from this revision. · Jan 29 2022, 5:40 PM

Herald added subscribers: ormris, dexonsmith. · Jan 29 2022, 5:40 PM

Revision Contents

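Path    Size
llvm/include/llvm/Analysis/ValueTracking.h    13 lines
llvm/include/llvm/IR/Value.h    8 lines
llvm/lib/Analysis/CaptureTracking.cpp    16 lines
llvm/lib/Analysis/Loads.cpp    15 lines
llvm/lib/Analysis/MemDerefPrinter.cpp    4 lines
llvm/lib/Analysis/ValueTracking.cpp    59 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp    5 lines
llvm/lib/IR/Value.cpp    68 lines
llvm/lib/Transforms/IPO/ArgumentPromotion.cpp    3 lines
llvm/lib/Transforms/Scalar/MergeICmps.cpp    2 lines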

Diff 204818

llvm/include/llvm/Analysis/ValueTracking.h

[12 lines not shown]

#ifndef LLVM_ANALYSIS_VALUETRACKING_H		#ifndef LLVM_ANALYSIS_VALUETRACKING_H
#define LLVM_ANALYSIS_VALUETRACKING_H		#define LLVM_ANALYSIS_VALUETRACKING_H

#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/ADT/SmallSet.h"		#include "llvm/ADT/SmallSet.h"
#include "llvm/IR/CallSite.h"		#include "llvm/IR/CallSite.h"
		#include "llvm/IR/CFG.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Instruction.h"		#include "llvm/IR/Instruction.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
#include <cassert>		#include <cassert>
#include <cstdint>		#include <cstdint>

namespace llvm {		namespace llvm {

[610 lines not shown]	Optional<bool> isImpliedCondition(const Value *LHS, const Value *RHS,
const DataLayout &DL, bool LHSIsTrue = true,		const DataLayout &DL, bool LHSIsTrue = true,
unsigned Depth = 0);		unsigned Depth = 0);

/// Return the boolean condition value in the context of the given instruction		/// Return the boolean condition value in the context of the given instruction
/// if it is known based on dominating conditions.		/// if it is known based on dominating conditions.
Optional<bool> isImpliedByDomCondition(const Value *Cond,		Optional<bool> isImpliedByDomCondition(const Value *Cond,
const Instruction *ContextI,		const Instruction *ContextI,
const DataLayout &DL);		const DataLayout &DL);

		/// Return true if there cannot be a memory deallocation (aka. free)
		/// in-between \p SrcI and \p DstI. Thus, check if there is no direct or
		/// transitive "free" call in-between. If \p IsGloballyKnown is true, this
		/// method will additionally check if there is the potential for
		/// synchronization in-between \p SrcI and \p DstI as a globally known pointer
		/// could be passed to a memory deallocation function by another thread.
		/// The \p MaxCheckedInstructions limits the number of instructions that are
		/// inspected to prove the absence of a memory deallocation.
		bool maybeFreedInBetween(const Instruction *SrcI, const Instruction *DstI,
		unsigned MaxCheckedInstructions,
		bool IsGloballyKnown);
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_ANALYSIS_VALUETRACKING_H		#endif // LLVM_ANALYSIS_VALUETRACKING_H

llvm/include/llvm/IR/Value.h

[570 lines not shown]	return const_cast<Value *>(
static_cast<const Value *>(this)->stripInBoundsOffsets());		static_cast<const Value *>(this)->stripInBoundsOffsets());
}		}

/// Returns the number of bytes known to be dereferenceable for the		/// Returns the number of bytes known to be dereferenceable for the
/// pointer value.		/// pointer value.
///		///
/// If CanBeNull is set by this function the pointer can either be null or be		/// If CanBeNull is set by this function the pointer can either be null or be
/// dereferenceable up to the returned number of bytes.		/// dereferenceable up to the returned number of bytes.
uint64_t getPointerDereferenceableBytes(const DataLayout &DL,		///
bool &CanBeNull) const;		/// If \p DerefKnownBefore is set by this function the dereferenceability
		/// information is only known to be valid prior to the \p DerefKnownBefore
		/// instructions.
		uint64_t getPointerDereferenceableBytes(const DataLayout &DL, bool &CanBeNull,
		Instruction *&DerefKnownBefore) const;

/// Returns an alignment of the pointer value.		/// Returns an alignment of the pointer value.
///		///
/// Returns an alignment which is either specified explicitly, e.g. via		/// Returns an alignment which is either specified explicitly, e.g. via
/// align attribute of a function argument, or guaranteed by DataLayout.		/// align attribute of a function argument, or guaranteed by DataLayout.
unsigned getPointerAlignment(const DataLayout &DL) const;		unsigned getPointerAlignment(const DataLayout &DL) const;

/// Translate PHI node to its predecessor from the given basic block.		/// Translate PHI node to its predecessor from the given basic block.
[290 lines not shown]

llvm/lib/Analysis/CaptureTracking.cpp

[344 lines not shown]	case Instruction::ICmp: {
// the pointer escape using clever GEP hacking because doing so		// the pointer escape using clever GEP hacking because doing so
// would make the pointer point outside of the allocated object		// would make the pointer point outside of the allocated object
// and thus make the GEP result a poison value.		// and thus make the GEP result a poison value.
if (auto *GEP = dyn_cast<GetElementPtrInst>(O))		if (auto *GEP = dyn_cast<GetElementPtrInst>(O))
if (GEP->isInBounds())		if (GEP->isInBounds())
break;		break;
// Comparing a dereferenceable_or_null argument against null		// Comparing a dereferenceable_or_null argument against null
// cannot lead to pointer escapes, because if it is not null it		// cannot lead to pointer escapes, because if it is not null it
// must be a valid (in-bounds) pointer.		// must be a valid (in-bounds) pointer. Note that we do care if
		// the pointer was freed.
bool CanBeNull;		bool CanBeNull;
if (O->getPointerDereferenceableBytes(I->getModule()->getDataLayout(), CanBeNull))		Instruction *DerefKnownBefore;
		// TODO: Determine IsGloballyKnown for the maybeFreedInBetween call.
		// TODO: Determine a good value for MaxCheckedInstructions in the
		// maybeFreedInBetween call.
		if (O->getPointerDereferenceableBytes(I->getModule()->getDataLayout(),
		CanBeNull, DerefKnownBefore))
		if (!DerefKnownBefore ||
		!maybeFreedInBetween(DerefKnownBefore, I,
		/* MaxCheckedInstructions */ 0,
		/* IsGloballyKnown */ true))
break;		break;
}		}
}		}
// Comparison against value stored in global variable. Given the pointer		// Comparison against value stored in global variable. Given the pointer
// does not escape, its value cannot be guessed and stored separately in a		// does not escape, its value cannot be guessed and stored separately in a
// global variable.		// global variable.
unsigned OtherIndex = (I->getOperand(0) == V) ? 1 : 0;		unsigned OtherIndex = (I->getOperand(0) == V) ? 1 : 0;
auto *LI = dyn_cast<LoadInst>(I->getOperand(OtherIndex));		auto *LI = dyn_cast<LoadInst>(I->getOperand(OtherIndex));
if (LI && isa<GlobalVariable>(LI->getPointerOperand()))		if (LI && isa<GlobalVariable>(LI->getPointerOperand()))
[17 lines not shown]

llvm/lib/Analysis/Loads.cpp

[60 lines not shown]	static bool isDereferenceableAndAlignedPointer(
// Note that it is not safe to speculate into a malloc'd region because		// Note that it is not safe to speculate into a malloc'd region because
// malloc may return null.		// malloc may return null.

// bitcast instructions are no-ops as far as dereferenceability is concerned.		// bitcast instructions are no-ops as far as dereferenceability is concerned.
if (const BitCastOperator *BC = dyn_cast<BitCastOperator>(V))		if (const BitCastOperator *BC = dyn_cast<BitCastOperator>(V))
return isDereferenceableAndAlignedPointer(BC->getOperand(0), Align, Size,		return isDereferenceableAndAlignedPointer(BC->getOperand(0), Align, Size,
DL, CtxI, DT, Visited);		DL, CtxI, DT, Visited);

		// TODO: Determine IsGloballyKnown for the maybeFreedInBetween call.
		// TODO: Determine a good value for MaxCheckedInstructions in the
		// maybeFreedInBetween call.
		Instruction *DerefKnownBefore;
bool CheckForNonNull = false;		bool CheckForNonNull = false;
APInt KnownDerefBytes(Size.getBitWidth(),		APInt KnownDerefBytes(
V->getPointerDereferenceableBytes(DL, CheckForNonNull));		Size.getBitWidth(),
		V->getPointerDereferenceableBytes(DL, CheckForNonNull, DerefKnownBefore));
if (KnownDerefBytes.getBoolValue()) {		if (KnownDerefBytes.getBoolValue()) {
if (KnownDerefBytes.uge(Size))		if (KnownDerefBytes.uge(Size))
if (!CheckForNonNull || isKnownNonZero(V, DL, 0, nullptr, CtxI, DT))		if (!CheckForNonNull || isKnownNonZero(V, DL, 0, nullptr, CtxI, DT))
		if (!DerefKnownBefore ||
		(CtxI && !maybeFreedInBetween(DerefKnownBefore, CtxI,
		/* MaxCheckedInstructions */ 0,
		/* IsGloballyKnown */ true)))
return isAligned(V, Align, DL);		return isAligned(V, Align, DL);
}		}

// For GEPs, determine if the indexing lands within the allocated object.		// For GEPs, determine if the indexing lands within the allocated object.
if (const GEPOperator *GEP = dyn_cast<GEPOperator>(V)) {		if (const GEPOperator *GEP = dyn_cast<GEPOperator>(V)) {
const Value *Base = GEP->getPointerOperand();		const Value *Base = GEP->getPointerOperand();

APInt Offset(DL.getIndexTypeSizeInBits(GEP->getType()), 0);		APInt Offset(DL.getIndexTypeSizeInBits(GEP->getType()), 0);
if (!GEP->accumulateConstantOffset(DL, Offset) || Offset.isNegative() ||		if (!GEP->accumulateConstantOffset(DL, Offset) || Offset.isNegative() ||
[365 lines not shown]

llvm/lib/Analysis/MemDerefPrinter.cpp

[47 lines not shown]	FunctionPass *llvm::createMemDerefPrinter() {
return new MemDerefPrinter();		return new MemDerefPrinter();
}		}

bool MemDerefPrinter::runOnFunction(Function &F) {		bool MemDerefPrinter::runOnFunction(Function &F) {
const DataLayout &DL = F.getParent()->getDataLayout();		const DataLayout &DL = F.getParent()->getDataLayout();
for (auto &I: instructions(F)) {		for (auto &I: instructions(F)) {
if (LoadInst *LI = dyn_cast<LoadInst>(&I)) {		if (LoadInst *LI = dyn_cast<LoadInst>(&I)) {
Value *PO = LI->getPointerOperand();		Value *PO = LI->getPointerOperand();
if (isDereferenceablePointer(PO, DL))		if (isDereferenceablePointer(PO, DL, LI))
Deref.push_back(PO);		Deref.push_back(PO);
if (isDereferenceableAndAlignedPointer(PO, LI->getAlignment(), DL))		if (isDereferenceableAndAlignedPointer(PO, LI->getAlignment(), DL, LI))
DerefAndAligned.insert(PO);		DerefAndAligned.insert(PO);
}		}
}		}
return false;		return false;
}		}

void MemDerefPrinter::print(raw_ostream &OS, const Module *M) const {		void MemDerefPrinter::print(raw_ostream &OS, const Module *M) const {
OS << "The following are dereferenceable:\n";		OS << "The following are dereferenceable:\n";
[9 lines not shown]

llvm/lib/Analysis/ValueTracking.cpp

[4,387 lines not shown]	switch (I->getOpcode()) {
case Instruction::URem:		case Instruction::URem:
case Instruction::SRem:		case Instruction::SRem:
return I->getOperand(1);		return I->getOperand(1);

default:		default:
// Note: It's really tempting to think that a conditional branch or		// Note: It's really tempting to think that a conditional branch or
// switch should be listed here, but that's incorrect. It's not		// switch should be listed here, but that's incorrect. It's not
// branching off of poison which is UB, it is executing a side effecting		// branching off of poison which is UB, it is executing a side effecting
// instruction which follows the branch.		// instruction which follows the branch.
return nullptr;		return nullptr;
}		}
}		}

bool llvm::mustTriggerUB(const Instruction *I,		bool llvm::mustTriggerUB(const Instruction *I,
const SmallSet<const Value *, 16>& KnownPoison) {		const SmallSet<const Value *, 16>& KnownPoison) {
auto *NotPoison = getGuaranteedNonFullPoisonOp(I);		auto *NotPoison = getGuaranteedNonFullPoisonOp(I);
return (NotPoison && KnownPoison.count(NotPoison));		return (NotPoison && KnownPoison.count(NotPoison));
[1,318 lines not shown]	ConstantRange llvm::computeConstantRange(const Value *V, bool UseInstrInfo) {
ConstantRange CR = ConstantRange::getNonEmpty(Lower, Upper);		ConstantRange CR = ConstantRange::getNonEmpty(Lower, Upper);

if (auto *I = dyn_cast<Instruction>(V))		if (auto *I = dyn_cast<Instruction>(V))
if (auto *Range = IIQ.getMetadata(I, LLVMContext::MD_range))		if (auto *Range = IIQ.getMetadata(I, LLVMContext::MD_range))
CR = CR.intersectWith(getConstantRangeFromMetadata(*Range));		CR = CR.intersectWith(getConstantRangeFromMetadata(*Range));

return CR;		return CR;
}		}

		bool llvm::maybeFreedInBetween(const Instruction *SrcI, const Instruction *DstI,
		unsigned MaxCheckedInstructions,
		bool IsGloballyKnown) {
		const Function *F = SrcI->getFunction();
		// If the function is no-free, and no-sync if necessary, there cannot be a
		// deallocation.
		if (F->hasFnAttribute(Attribute::NoFree) &&
		(!IsGloballyKnown || F->hasFnAttribute(Attribute::NoSync)))
		return false;

		// If we do not want to check any instructions we give up now.
		if (MaxCheckedInstructions == 0)
		return true;

		SmallVector<const Instruction *, 32> Worklist;
		SmallPtrSet<const Instruction *, 32> Visited;
		Worklist.push_back(DstI);

		// Lookup all instructions on all paths from SrcI to DstI and
		// determine if there is a conflicting call in-between or not.
		// We do so by exploring the paths in reverse order from DstI.
		do {
		const Instruction *CurI = Worklist.pop_back_val();

		// Never visit an instruction twice.
		if (!Visited.insert(CurI).second)
		continue;

		// Make sure we do not waste too much time trying to prove this.
		if (Visited.size() > MaxCheckedInstructions)
		return true;

		// Only calls can deallocate, aka. free, memory or synchronize.
		if (ImmutableCallSite ICS = ImmutableCallSite(CurI)) {
		if (!ICS.hasFnAttr(Attribute::NoFree) ||
		(IsGloballyKnown && !ICS.hasFnAttr(Attribute::NoSync)))
		return true;
		}

		// Once SrcI is reached we are done traversing for this instruction.
		if (CurI == SrcI)
		continue;

		// If we reached the beginning of a block, look at the predecessors.
		if (!CurI->getPrevNode()) {
		const BasicBlock *CurBB = CurI->getParent();
		for (const BasicBlock *PredBB : predecessors(CurBB))
		Worklist.push_back(&PredBB->back());
		}

		} while (!Worklist.empty());

		// No possible free found.
		return false;
		}
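
The backward worklist traversal used by `maybeFreedInBetween` can be tried outside of LLVM with a simplified model. This is a sketch under stated assumptions, not the code from the patch: `ToyInst`, its `MayFree` flag, and its `Preds` list are hypothetical, each toy instruction directly lists the instructions that can execute immediately before it, and the `NoFree`/`NoSync` attribute checks are collapsed into the single `MayFree` flag.

```cpp
#include <cassert>
#include <cstddef>
#include <set>
#include <vector>

// Toy instruction (assumption: not llvm::Instruction).
struct ToyInst {
  bool MayFree = false;               // models a call that is not nofree
  std::vector<const ToyInst *> Preds; // instructions that may run just before
};

// Returns true if a possible free is found on some path from Src to Dst, or
// if the budget MaxChecked is exhausted before its absence is proven.
inline bool maybeFreedInBetween(const ToyInst *Src, const ToyInst *Dst,
                                std::size_t MaxChecked) {
  assert(Src && Dst && "both endpoints are required");
  // With a budget of zero nothing can be proven, so answer conservatively.
  if (MaxChecked == 0)
    return true;

  std::vector<const ToyInst *> Worklist{Dst};
  std::set<const ToyInst *> Visited;

  // Explore the paths in reverse, starting at Dst.
  while (!Worklist.empty()) {
    const ToyInst *Cur = Worklist.back();
    Worklist.pop_back();

    // Never visit an instruction twice (this also terminates loops).
    if (!Visited.insert(Cur).second)
      continue;

    // Give up conservatively once the budget is exceeded.
    if (Visited.size() > MaxChecked)
      return true;

    if (Cur->MayFree)
      return true;

    // Reaching Src ends the traversal along this path.
    if (Cur == Src)
      continue;

    for (const ToyInst *Pred : Cur->Preds)
      Worklist.push_back(Pred);
  }

  // No possible free found on any path.
  return false;
}
```

As in the patch, both a found free and an exhausted budget produce `true`, so callers can only rely on a `false` result.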

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

[3,998 lines not shown]	void SelectionDAGBuilder::visitLoad(const LoadInst &I) {

SDValue Ptr = getValue(SV);		SDValue Ptr = getValue(SV);

Type *Ty = I.getType();		Type *Ty = I.getType();

bool isVolatile = I.isVolatile();		bool isVolatile = I.isVolatile();
bool isNonTemporal = I.getMetadata(LLVMContext::MD_nontemporal) != nullptr;		bool isNonTemporal = I.getMetadata(LLVMContext::MD_nontemporal) != nullptr;
bool isInvariant = I.getMetadata(LLVMContext::MD_invariant_load) != nullptr;		bool isInvariant = I.getMetadata(LLVMContext::MD_invariant_load) != nullptr;
bool isDereferenceable = isDereferenceablePointer(SV, DAG.getDataLayout());		bool isDereferenceable =
		isDereferenceablePointer(SV, DAG.getDataLayout(), &I);
unsigned Alignment = I.getAlignment();		unsigned Alignment = I.getAlignment();

AAMDNodes AAInfo;		AAMDNodes AAInfo;
I.getAAMetadata(AAInfo);		I.getAAMetadata(AAInfo);
const MDNode *Ranges = I.getMetadata(LLVMContext::MD_range);		const MDNode *Ranges = I.getMetadata(LLVMContext::MD_range);

SmallVector<EVT, 4> ValueVTs, MemVTs;		SmallVector<EVT, 4> ValueVTs, MemVTs;
SmallVector<uint64_t, 4> Offsets;		SmallVector<uint64_t, 4> Offsets;
[608 lines not shown]	if (!TLI.supportsUnalignedAtomics() &&
I.getAlignment() < MemVT.getSizeInBits() / 8)		I.getAlignment() < MemVT.getSizeInBits() / 8)
report_fatal_error("Cannot generate unaligned atomic load");		report_fatal_error("Cannot generate unaligned atomic load");

auto Flags = MachineMemOperand::MOLoad;		auto Flags = MachineMemOperand::MOLoad;
if (I.isVolatile())		if (I.isVolatile())
Flags |= MachineMemOperand::MOVolatile;		Flags |= MachineMemOperand::MOVolatile;
if (I.getMetadata(LLVMContext::MD_invariant_load) != nullptr)		if (I.getMetadata(LLVMContext::MD_invariant_load) != nullptr)
Flags |= MachineMemOperand::MOInvariant;		Flags |= MachineMemOperand::MOInvariant;
if (isDereferenceablePointer(I.getPointerOperand(), DAG.getDataLayout()))		if (isDereferenceablePointer(I.getPointerOperand(), DAG.getDataLayout(), &I))
Flags |= MachineMemOperand::MODereferenceable;		Flags |= MachineMemOperand::MODereferenceable;

Flags |= TLI.getMMOFlags(I);		Flags |= TLI.getMMOFlags(I);

MachineMemOperand *MMO =		MachineMemOperand *MMO =
DAG.getMachineFunction().		DAG.getMachineFunction().
getMachineMemOperand(MachinePointerInfo(I.getPointerOperand()),		getMachineMemOperand(MachinePointerInfo(I.getPointerOperand()),
Flags, MemVT.getStoreSize(),		Flags, MemVT.getStoreSize(),
[5,812 lines not shown]

llvm/lib/IR/Value.cpp

[595 lines not shown]	Value::stripAndAccumulateInBoundsConstantOffsets(const DataLayout &DL,

return V;		return V;
}		}

const Value *Value::stripInBoundsOffsets() const {		const Value *Value::stripInBoundsOffsets() const {
return stripPointerCastsAndOffsets<PSK_InBounds>(this);		return stripPointerCastsAndOffsets<PSK_InBounds>(this);
}		}

uint64_t Value::getPointerDereferenceableBytes(const DataLayout &DL,		uint64_t
bool &CanBeNull) const {		Value::getPointerDereferenceableBytes(const DataLayout &DL, bool &CanBeNull,
		Instruction *&DerefKnownBefore) const {
assert(getType()->isPointerTy() && "must be pointer");		assert(getType()->isPointerTy() && "must be pointer");

uint64_t DerefBytes = 0;		uint64_t DerefBytes = 0;
CanBeNull = false;		CanBeNull = false;
		DerefKnownBefore = nullptr;
if (const Argument *A = dyn_cast<Argument>(this)) {		if (const Argument *A = dyn_cast<Argument>(this)) {
DerefBytes = A->getDereferenceableBytes();		DerefBytes = A->getDereferenceableGloballyBytes();
if (DerefBytes == 0 && (A->hasByValAttr() || A->hasStructRetAttr())) {		if (DerefBytes == 0 && (A->hasByValAttr() || A->hasStructRetAttr())) {
Type *PT = cast<PointerType>(A->getType())->getElementType();		Type *PT = cast<PointerType>(A->getType())->getElementType();
if (PT->isSized())		if (PT->isSized())
DerefBytes = DL.getTypeStoreSize(PT);		DerefBytes = DL.getTypeStoreSize(PT);
}		}
		if (CtxI && DerefBytes == 0) {
		uint64_t DerefBytesAtDef = A->getDereferenceableBytes();
		if (DerefBytesAtDef) {
		DerefBytes = DerefBytesAtDef;
		DerefKnownBefore = &A->getParent()->getEntryBlock().front();
		}
		}
if (DerefBytes == 0) {		if (DerefBytes == 0) {
DerefBytes = A->getDereferenceableOrNullBytes();		DerefBytes = A->getDereferenceableOrNullBytesGlobally();
		CanBeNull = true;
		}
		if (CtxI && DerefBytes == 0) {
		uint64_t DerefBytesAtDef = A->getDereferenceableOrNullBytes();
		if (DerefBytesAtDef) {
		DerefBytes = DerefBytesAtDef;
		DerefKnownBefore = &A->getParent()->getEntryBlock().front();
CanBeNull = true;		CanBeNull = true;
}		}
		}
} else if (const auto *Call = dyn_cast<CallBase>(this)) {		} else if (const auto *Call = dyn_cast<CallBase>(this)) {
DerefBytes = Call->getDereferenceableBytes(AttributeList::ReturnIndex);
if (DerefBytes == 0) {
DerefBytes =		DerefBytes =
		Call->getDereferenceableGloballyBytes(AttributeList::ReturnIndex);
		if (CtxI && DerefBytes == 0) {
		uint64_t DerefBytesAtDef =
		Call->getDereferenceableBytes(AttributeList::ReturnIndex);
		if (DerefBytesAtDef) {
		DerefBytes = DerefBytesAtDef;
		DerefKnownBefore = Call->getNextNode();
		}
		}
		if (DerefBytes == 0) {
		DerefBytes = Call->getDereferenceableOrNullBytesGlobally(
		AttributeList::ReturnIndex);
		CanBeNull = true;
		}
		if (CtxI && DerefBytes == 0) {
		uint64_t DerefBytesAtDef =
Call->getDereferenceableOrNullBytes(AttributeList::ReturnIndex);		Call->getDereferenceableOrNullBytes(AttributeList::ReturnIndex);
		if (DerefBytesAtDef) {
		DerefBytes = DerefBytesAtDef;
		DerefKnownBefore = Call->getNextNode();
CanBeNull = true;		CanBeNull = true;
}		}
		}
} else if (const LoadInst *LI = dyn_cast<LoadInst>(this)) {		} else if (const LoadInst *LI = dyn_cast<LoadInst>(this)) {
		if (MDNode *MD =
		LI->getMetadata(LLVMContext::MD_dereferenceable_globally)) {
		ConstantInt *CI = mdconst::extract<ConstantInt>(MD->getOperand(0));
		DerefBytes = CI->getLimitedValue();
		}
		if (CtxI && DerefBytes == 0) {
if (MDNode *MD = LI->getMetadata(LLVMContext::MD_dereferenceable)) {		if (MDNode *MD = LI->getMetadata(LLVMContext::MD_dereferenceable)) {
ConstantInt *CI = mdconst::extract<ConstantInt>(MD->getOperand(0));		ConstantInt *CI = mdconst::extract<ConstantInt>(MD->getOperand(0));
DerefBytes = CI->getLimitedValue();		DerefBytes = CI->getLimitedValue();
		DerefKnownBefore = LI->getNextNode();
		}
}		}
if (DerefBytes == 0) {		if (DerefBytes == 0) {
		if (MDNode *MD = LI->getMetadata(
		LLVMContext::MD_dereferenceable_or_null_globally)) {
		ConstantInt *CI = mdconst::extract<ConstantInt>(MD->getOperand(0));
		DerefBytes = CI->getLimitedValue();
		}
		CanBeNull = true;
		}
		if (CtxI && DerefBytes == 0) {
if (MDNode *MD =		if (MDNode *MD =
LI->getMetadata(LLVMContext::MD_dereferenceable_or_null)) {		LI->getMetadata(LLVMContext::MD_dereferenceable_or_null)) {
ConstantInt *CI = mdconst::extract<ConstantInt>(MD->getOperand(0));		ConstantInt *CI = mdconst::extract<ConstantInt>(MD->getOperand(0));
DerefBytes = CI->getLimitedValue();		DerefBytes = CI->getLimitedValue();
		DerefKnownBefore = LI->getNextNode();

reames (Not Done): Same point as before, but explained differently in the hopes this makes sense. DerefKnownBefore only ever takes one of two values: the first instruction within the function, or the one after the instruction we invoked this function on. As such, the return value adds no information which can't be easily determined from a single boolean.

jdoerfert (Author, Done): I think I get it now, and I hope that means I can convince you we need more than a boolean, at least in the future and if we want to make it easier for the clients.

First, in the future we could handle any subset of the following cases, which would make the boolean insufficient:
- casts
- select/phi
- gep with constant offsets and dereferenceable base pointer
- inbound geps with positive offsets w/ or w/o dereferenceable base pointer

Second, the clients would need to do "more" in the boolean case, as they have to check, cast, and advance the pointer value before they can call `maybeFreedInbetween`. Checked means `isa<Argument>(ptr) || isa<Instruction>(ptr)`, as we otherwise have an implicit contract that the bool is only set for instructions and arguments. Casted means cast into the right thing, instruction or argument, and advanced in case of the former. I'd argue that will more easily lead to errors or wrong usage.

reames (Not Done): No, I'm not convinced. I think what you're describing is a need for two layers of APIs. There's the "raw" API on Value which returns properties of that value, and then there's a more holistic API which returns analysis results and should live in Analysis/Something.h. I'll also point out that future-proofing code is a specific anti-pattern that we try very strongly to avoid. Choosing good abstractions is encouraged, but writing code which is not required by the current patch is not.

       }
       CanBeNull = true;
     }
   } else if (auto *AI = dyn_cast<AllocaInst>(this)) {
     if (!AI->isArrayAllocation()) {
       DerefBytes = DL.getTypeStoreSize(AI->getAllocatedType());
       CanBeNull = false;
     }
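
The two-layer split that reames argues for in the inline thread above could look roughly like the following. This is only a toy model under stated assumptions: `ToyValue`, `getDerefBytes`, `knownFromPosition`, and `isDereferenceableAt` are hypothetical names, instructions are modeled as integer positions, and the list of may-free positions merely stands in for the `maybeFreedInbetween` query mentioned in the thread. None of it is LLVM's actual API.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Toy stand-ins, NOT LLVM classes. A value is either an argument or an
// instruction and carries two hypothetical attribute payloads.
enum class Kind { Argument, Instruction };

struct ToyValue {
  Kind K;
  uint64_t DerefGloballyBytes; // models dereferenceable_globally(N)
  uint64_t DerefAtDefBytes;    // models dereferenceable(N)
  int Position;                // index of the defining instruction (unused for arguments)
};

// "Raw" layer: reports basic facts about the value and nothing else.
// OnlyAtDef is set when the fact holds at the definition point but could be
// invalidated afterwards, for example by a free.
uint64_t getDerefBytes(const ToyValue &V, bool &OnlyAtDef) {
  OnlyAtDef = false;
  if (V.DerefGloballyBytes != 0)
    return V.DerefGloballyBytes;
  if (V.DerefAtDefBytes != 0) {
    OnlyAtDef = true;
    return V.DerefAtDefBytes;
  }
  return 0;
}

// "Analysis" layer helper: derive the point from which the fact is known.
// Arguments: function entry (position 0). Instructions: the position right
// after the defining instruction.
int knownFromPosition(const ToyValue &V) {
  return V.K == Kind::Argument ? 0 : V.Position + 1;
}

// "Analysis" layer query: is V dereferenceable for Size bytes at CtxPos?
// MayFreePositions is a hypothetical list of positions that may free memory.
bool isDereferenceableAt(const ToyValue &V, uint64_t Size, int CtxPos,
                         const std::vector<int> &MayFreePositions) {
  assert(Size > 0 && "query must ask for at least one byte");
  bool OnlyAtDef = false;
  uint64_t Bytes = getDerefBytes(V, OnlyAtDef);
  if (Bytes < Size)
    return false;
  if (!OnlyAtDef)
    return true; // the global fact needs no context reasoning
  int From = knownFromPosition(V);
  if (CtxPos < From)
    return false; // context precedes the point where the fact is known
  for (int P : MayFreePositions)
    if (P >= From && P < CtxPos)
      return false; // something between definition and use may free
  return true;
}
```

In this sketch the client only ever sees `isDereferenceableAt`; the check, cast, and advance steps that jdoerfert is worried about are confined to `knownFromPosition`. Whether that trade-off holds once casts, select/phi, or GEPs are handled is exactly the open question in the thread.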

llvm/lib/Transforms/IPO/ArgumentPromotion.cpp

@@ ... @@ static bool allCallersPassInValidPointerForArgument(Argument *Arg) {
   unsigned ArgNo = Arg->getArgNo();

   // Look at all call sites of the function. At this point we know we only have
   // direct callees.
   for (User *U : Callee->users()) {
     CallSite CS(U);
     assert(CS && "Should only have direct calls!");

-    if (!isDereferenceablePointer(CS.getArgument(ArgNo), DL))
+    if (!isDereferenceablePointer(CS.getArgument(ArgNo), DL,
+                                  CS.getInstruction()))
       return false;
   }
   return true;
 }

 /// Returns true if Prefix is a prefix of longer. That means, Longer has a size
 /// that is greater than or equal to the size of prefix, and each of the
 /// elements in Prefix is the same as the corresponding elements in Longer.

llvm/lib/Transforms/Scalar/MergeICmps.cpp

@@ ... @@ BCEAtom visitICmpLoadOperand(Value *const Val, BaseIdentifier &BaseId) {
   if (!GEP)
     return {};
   LLVM_DEBUG(dbgs() << "GEP\n");
   if (GEP->isUsedOutsideOfBlock(LoadI->getParent())) {
     LLVM_DEBUG(dbgs() << "used outside of block\n");
     return {};
   }
   const auto &DL = GEP->getModule()->getDataLayout();
-  if (!isDereferenceablePointer(GEP, DL)) {
+  if (!isDereferenceablePointer(GEP, DL, const_cast<LoadInst *>(LoadI))) {

courbet (Not Done): I don't see where in the patch (or the rest of the stack of changes) the signature of `isDereferenceablePointer` is changed to take a non-const Instruction.

jdoerfert (Author, Done): True, I'll remove the `const_cast`. That was my bad.

     LLVM_DEBUG(dbgs() << "not dereferenceable\n");
     // We need to make sure that we can do comparison in any order, so we
     // require memory to be unconditionnally dereferencable.
     return {};
   }
   APInt Offset = APInt(DL.getPointerTypeSizeInBits(GEP->getType()), 0);
   if (!GEP->accumulateConstantOffset(DL, Offset))
     return {};
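
courbet's point about the `const_cast` can be illustrated in isolation. The sketch below uses toy stand-ins (`Instruction`, `LoadInst`, `hasContext`, and `queryWithLoad` are hypothetical, not the LLVM classes or functions) and assumes, as the thread suggests, that the context parameter is a pointer-to-const: in that case a `const LoadInst *` converts implicitly and no cast is needed.

```cpp
// Toy stand-ins, NOT llvm::Instruction / llvm::LoadInst.
struct Instruction {
  int Id = 0;
};
struct LoadInst : Instruction {};

// Hypothetical query whose context parameter is pointer-to-const, which is
// the assumption behind the review comment.
bool hasContext(const Instruction *CtxI) { return CtxI != nullptr; }

bool queryWithLoad(const LoadInst *LoadI) {
  // The derived-to-base conversion keeps constness, so no const_cast is
  // required to forward LoadI as the context instruction.
  return hasContext(LoadI);
}
```

A `const_cast` would only become necessary if the callee took a non-const `Instruction *`, which is the signature change courbet could not find in the patch.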

Revision Contents

Diff 204818

llvm/include/llvm/Analysis/ValueTracking.h

llvm/include/llvm/IR/Value.h

llvm/lib/Analysis/CaptureTracking.cpp

llvm/lib/Analysis/Loads.cpp

llvm/lib/Analysis/MemDerefPrinter.cpp

llvm/lib/Analysis/ValueTracking.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/IR/Value.cpp

llvm/lib/Transforms/IPO/ArgumentPromotion.cpp

llvm/lib/Transforms/Scalar/MergeICmps.cpp
