This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
Analysis/
-
TargetTransformInfoImpl.h
-
CodeGen/
-
BasicTTIImpl.h
-
IR/
3/4
BasicBlock.h
-
Instruction.h
-
IntrinsicInst.h
-
Intrinsics.td
-
lib/
-
Analysis/
-
AliasSetTracker.cpp
-
InlineCost.cpp
-
ValueTracking.cpp
-
VectorUtils.cpp
-
CodeGen/
-
Analysis.cpp
2/3
CodeGenPrepare.cpp
-
IR/
1/1
BasicBlock.cpp
-
Instruction.cpp
-
Transforms/
-
Scalar/
-
JumpThreading.cpp
1/2
TailRecursionElimination.cpp
-
Utils/
-
Evaluator.cpp
4/6
SimplifyCFG.cpp
-
Vectorize/
-
LoadStoreVectorizer.cpp
-
LoopVectorize.cpp
-
SLPVectorizer.cpp

Differential D86490

[CSSPGO] IR intrinsic for pseudo-probe block instrumentation
ClosedPublic

Authored by hoy on Aug 24 2020, 2:52 PM.

Download Raw Diff

Details

Reviewers

davidxl
wmi
wenlei

Commits

rGf3c445697d23: [CSSPGO] IR intrinsic for pseudo-probe block instrumentation

Summary

This change introduces a new IR intrinsic named llvm.pseudoprobe for pseudo-probe block instrumentation. Please refer to https://reviews.llvm.org/D86193 for the whole story.

A pseudo probe is used to collect the execution count of the block where the probe is instrumented. This requires a pseudo probe to be persisting. The LLVM PGO instrumentation also instruments in similar places by placing a counter in the form of atomic read/write operations or runtime helper calls. While these operations are very persisting or optimization-resilient, in theory we can borrow the atomic read/write implementation from PGO counters and cut it off at the end of compilation with all the atomics converted into binary data. This was our initial design and we’ve seen promising sample correlation quality with it. However, the atomics approach has a couple issues:

IR Optimizations are blocked unexpectedly. Those atomic instructions are not going to be physically present in the binary code, but since they are on the IR till very end of compilation, they can still prevent certain IR optimizations and result in lower code quality.
The counter atomics may not be fully cleaned up from the code stream eventually.
Extra work is needed for re-targeting.

We choose to implement pseudo probes based on a special LLVM intrinsic, which is expected to have most of the semantics that comes with an atomic operation but does not block desired optimizations as much as possible. More specifically the semantics associated with the new intrinsic enforces a pseudo probe to be virtually executed exactly the same number of times before and after an IR optimization. The intrinsic also comes with certain flags that are carefully chosen so that the places they are probing are not going to be messed up by the optimizer while most of the IR optimizations still work. The core flags given to the special intrinsic is IntrInaccessibleMemOnly, which means the intrinsic accesses memory and does have a side effect so that it is not removable, but is does not access memory locations that are accessible by any original instructions. This way the intrinsic does not alias with any original instruction and thus it does not block optimizations as much as an atomic operation does. We also assign a function GUID and a block index to an intrinsic so that they are uniquely identified and not merged in order to achieve good correlation quality.

Let's now look at an example. Given the following LLVM IR:

define internal void @foo2(i32 %x, void (i32)* %f) !dbg !4 {
bb0:
  %cmp = icmp eq i32 %x, 0
   br i1 %cmp, label %bb1, label %bb2
bb1:
   br label %bb3
bb2:
   br label %bb3
bb3:
   ret void
}

The instrumented IR will look like below. Note that each llvm.pseudoprobe intrinsic call represents a pseudo probe at a block, of which the first parameter is the GUID of the probe’s owner function and the second parameter is the probe’s ID.

define internal void @foo2(i32 %x, void (i32)* %f) !dbg !4 {
bb0:
   %cmp = icmp eq i32 %x, 0
   call void @llvm.pseudoprobe(i64 837061429793323041, i64 1)
   br i1 %cmp, label %bb1, label %bb2
bb1:                                             
   call void @llvm.pseudoprobe(i64 837061429793323041, i64 2)
   br label %bb3
bb2:                                              
   call void @llvm.pseudoprobe(i64 837061429793323041, i64 3)
   br label %bb3
bb3:                                              
   call void @llvm.pseudoprobe(i64 837061429793323041, i64 4)
   ret void
}

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	340 ms	linux > HWAddressSanitizer-x86_64.TestCases::sizes.cpp

Event Timeline

hoy created this revision.Aug 24 2020, 2:52 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 24 2020, 2:52 PM

Herald added subscribers: llvm-commits, wenlei, laytonio and 2 others. · View Herald Transcript

hoy requested review of this revision.Aug 24 2020, 2:52 PM

Herald added a subscriber: jdoerfert. · View Herald TranscriptAug 24 2020, 2:52 PM

hoy edited the summary of this revision. (Show Details)Aug 24 2020, 2:59 PM

Herald added a subscriber: jfb. · View Herald TranscriptAug 24 2020, 2:59 PM

Harbormaster completed remote builds in B69368: Diff 287502.Aug 24 2020, 3:33 PM

hoy edited the summary of this revision. (Show Details)Aug 24 2020, 3:54 PM

hoy added reviewers: davidxl, wmi, wenlei.

hoy added a parent revision: D86193: [CSSPGO] Pseudo probe instrumentation for basic blocks..Aug 24 2020, 3:57 PM

hoy added a parent revision: D86495: [CSSPGO] MIR target-independent pseudo instruction for pseudo-probe intrinsic.Aug 24 2020, 5:23 PM

hoy removed a parent revision: D86495: [CSSPGO] MIR target-independent pseudo instruction for pseudo-probe intrinsic.

hoy added a child revision: D86495: [CSSPGO] MIR target-independent pseudo instruction for pseudo-probe intrinsic.

hoy removed a child revision: D86495: [CSSPGO] MIR target-independent pseudo instruction for pseudo-probe intrinsic.

hoy mentioned this in D86495: [CSSPGO] MIR target-independent pseudo instruction for pseudo-probe intrinsic.Aug 24 2020, 5:26 PM

hoy added a child revision: D86495: [CSSPGO] MIR target-independent pseudo instruction for pseudo-probe intrinsic.Aug 25 2020, 10:47 AM

hoy removed a parent revision: D86193: [CSSPGO] Pseudo probe instrumentation for basic blocks..Aug 25 2020, 10:51 AM

Thanks for splitting the patch into smaller ones. Can you have a separate test just to check the pseudo probe can be parsed and not deleted by any pass?

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
1995	Can we check CallBase::onlyAccessesInaccessibleMemory instead of checking PseudoProbeInst here?

hoy added inline comments.Aug 26 2020, 4:08 PM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
1995	Good question. Checking against `PseudoProbeInst` is more than `onlyAccessesInaccessibleMemory`. The instruction identified here will be either converted to a `select` or deleted. I probably should move the `PseudoProbeInst` check into `BrBB->instructionsWithoutDebug()`.

hoy added inline comments.Aug 26 2020, 4:56 PM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
1995	On the second thought, it might not be a good idea to fuse the check into `instructionsWithoutDebug` since API is used in quite some places. We sometimes like a `PseudoProbeInst` treated like a debug intrinsic but sometimes not. It is specific to the transform we'd like to not block.

wmi added inline comments.Aug 28 2020, 2:40 PM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
1995	I think it is a good idea to fuse the check into instructionsWithoutDebug. That will make psuedoProbe intrinsic behave more like debug intrinsic and block less transformations, like we discussed in D86193. What is the case you have concern about?

Herald added a subscriber: danielkiss. · View Herald TranscriptAug 28 2020, 2:40 PM

hoy added inline comments.Aug 28 2020, 3:27 PM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
1995	I was thinking that some passes may just use `instructionsWithoutDebug` to ignore and remove debug instructions. We may want that happening to pseudo probe selectively. By searching where the API is used, it looks like it's fine have them handle pseudo probes as well. I'm testing to see if there's a regression to the profile quality.

hoy added inline comments.Aug 29 2020, 11:10 PM

llvm/lib/Transforms/Utils/SimplifyCFG.cpp
1995	Test result looked OK. Moved the check into `instructionsWithoutDebug` .

Updating D86490: [CSSPGO] IR instrinsic for pseudo-probe block instrumentation

How about inline cost analysis? It needs to skip the new instructions. Similarly for the Partial inliner, the static cost of this should be set to zero.

llvm/include/llvm/IR/BasicBlock.h
190	Is it possible to also need to skip both PseudoProbe and Lifetime Markers?
llvm/lib/CodeGen/CodeGenPrepare.cpp
2248–2249	Perhaps introduce a helper function to skip non-code instructions
llvm/lib/Transforms/Scalar/TailRecursionElimination.cpp
243–247	why does it access memory?

Herald added a subscriber: dexonsmith. · View Herald TranscriptOct 28 2020, 2:51 PM

In D86490#2360454, @davidxl wrote:

How about inline cost analysis? It needs to skip the new instructions. Similarly for the Partial inliner, the static cost of this should be set to zero.

Good point. Yes, pseudo probes should be excluded from inline cost analysis. We were planning to include the change in upcoming patches. Now I'm moving it here.

llvm/include/llvm/IR/BasicBlock.h
190	Good point. I just checked some uses of `getFirstNonPHIOrDbgOrLifetime` where pseudo probe should also be needed. I'm making a helper for that. I'm thinking about deferring the actual replacement work to when pseudoprobe is used in combination with those cases so that we have good testing there. What do you think?
llvm/lib/CodeGen/CodeGenPrepare.cpp
2248–2249	Changed to using `getFirstNonPHIOrDbgOrPseudoProbe`.
llvm/lib/Transforms/Scalar/TailRecursionElimination.cpp
243–247	Because it has the `IntrInaccessibleMemOnly` flag. I changed the comment to be less confusing.

Updating D86490: [CSSPGO] IR instrinsic for pseudo-probe block instrumentation

Herald added subscribers: haicheng, eraman. · View Herald TranscriptOct 28 2020, 5:19 PM

Added a new Instruction::getNextNonDebugOrPseudoProbeInstruction() helper.

Updating D86490: [CSSPGO] IR instrinsic for pseudo-probe block instrumentation

Harbormaster completed remote builds in B76839: Diff 301487.Oct 28 2020, 7:13 PM

davidxl added inline comments.Oct 28 2020, 8:19 PM

llvm/include/llvm/IR/BasicBlock.h
190	I have concerns on the proliferation of interfaces. How about extending exiting two interfaces (NonDebug, NonDebugOrLifemaker) with an optional argument 'bool SkipPseudoOp = true'?
llvm/lib/IR/BasicBlock.cpp
100–103	There are still quite a few such predicates (debug \|\| pseudoop), or !(debug\|\|pseudoop) in the patch. Perhaps commonize them.

Harbormaster completed remote builds in B76851: Diff 301501.Oct 28 2020, 8:29 PM

hoy marked 2 inline comments as done.Oct 28 2020, 9:38 PM

hoy added inline comments.

llvm/include/llvm/IR/BasicBlock.h
190	Good idea, thanks for the suggestion! `SkipPseudoOp` is set to false by default. The APIs are called with `SkipPseudoOp=true` where we are sure to skip pseudo probes.

Updating D86490: [CSSPGO] IR instrinsic for pseudo-probe block instrumentation

Harbormaster completed remote builds in B76862: Diff 301513.Oct 28 2020, 10:27 PM

hoy retitled this revision from [CSSPGO] IR instrinsic for pseudo-probe block instrumentation to [CSSPGO] IR intrinsic for pseudo-probe block instrumentation.Oct 29 2020, 3:20 PM

asbirlea removed a subscriber: asbirlea.Oct 30 2020, 3:06 PM

@davidxl I'm wondering if the current changes look good to you. Please let me know if you have more comments. Thanks!

Updating D86490: [CSSPGO] IR intrinsic for pseudo-probe block instrumentation

Adding an attribute field for use later.

Harbormaster completed remote builds in B79481: Diff 306443.Nov 19 2020, 10:34 AM

LGTM.

llvm/lib/CodeGen/CodeGenPrepare.cpp
2249–2250	Nit: if (BB->getFirstNonPHIOrDbg(true) != RetI) return false;

This revision is now accepted and ready to land.Nov 19 2020, 10:03 PM

By the way, David has gone through the patches and talked to me offline saying he is ok with the patches generally. Thanks for your great work and patience!

In D86490#2407293, @wmi wrote:

By the way, David has gone through the patches and talked to me offline saying he is ok with the patches generally. Thanks for your great work and patience!

A lot thanks to you and David for reviewing CSSPGO patches and being supportive to us all the time!

Updating D86490: [CSSPGO] IR intrinsic for pseudo-probe block instrumentation

Addressing Wei's feedback.

hoy marked an inline comment as done.Nov 20 2020, 8:49 AM

Harbormaster completed remote builds in B79617: Diff 306705.Nov 20 2020, 9:36 AM

This revision was landed with ongoing or failed builds.Nov 20 2020, 10:40 AM

Closed by commit rGf3c445697d23: [CSSPGO] IR intrinsic for pseudo-probe block instrumentation (authored by hoy). · Explain Why

This revision was automatically updated to reflect the committed changes.

hoy added a commit: rGf3c445697d23: [CSSPGO] IR intrinsic for pseudo-probe block instrumentation.

maksfb mentioned this in rG8a919593c784: [BOLT][CSSPGO] Pseudo probe decoding.Jan 11 2022, 1:32 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

TargetTransformInfoImpl.h

1 line

CodeGen/

BasicTTIImpl.h

1 line

IR/

35 lines

22 lines

22 lines

7 lines

lib/

Analysis/

1 line

4 lines

1 line

2 lines

CodeGen/

Analysis.cpp

3 lines

CodeGenPrepare.cpp

25 lines

IR/

BasicBlock.cpp

38 lines

Instruction.cpp

10 lines

Transforms/

Scalar/

JumpThreading.cpp

4 lines

TailRecursionElimination.cpp

8 lines

Utils/

Evaluator.cpp

4 lines

SimplifyCFG.cpp

18 lines

Vectorize/

LoadStoreVectorizer.cpp

4 lines

LoopVectorize.cpp

3 lines

SLPVectorizer.cpp

4 lines

Diff 306705

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

Show First 20 Lines • Show All 520 Lines • ▼ Show 20 Lines	public:
unsigned getIntrinsicInstrCost(const IntrinsicCostAttributes &ICA,		unsigned getIntrinsicInstrCost(const IntrinsicCostAttributes &ICA,
TTI::TargetCostKind CostKind) {		TTI::TargetCostKind CostKind) {
switch (ICA.getID()) {		switch (ICA.getID()) {
default:		default:
break;		break;
case Intrinsic::annotation:		case Intrinsic::annotation:
case Intrinsic::assume:		case Intrinsic::assume:
case Intrinsic::sideeffect:		case Intrinsic::sideeffect:
		case Intrinsic::pseudoprobe:
case Intrinsic::dbg_declare:		case Intrinsic::dbg_declare:
case Intrinsic::dbg_value:		case Intrinsic::dbg_value:
case Intrinsic::dbg_label:		case Intrinsic::dbg_label:
case Intrinsic::invariant_start:		case Intrinsic::invariant_start:
case Intrinsic::invariant_end:		case Intrinsic::invariant_end:
case Intrinsic::launder_invariant_group:		case Intrinsic::launder_invariant_group:
case Intrinsic::strip_invariant_group:		case Intrinsic::strip_invariant_group:
case Intrinsic::is_constant:		case Intrinsic::is_constant:
▲ Show 20 Lines • Show All 542 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/BasicTTIImpl.h

Show First 20 Lines • Show All 1,424 Lines • ▼ Show 20 Lines	case Intrinsic::fmuladd:
break;		break;
case Intrinsic::experimental_constrained_fmuladd:		case Intrinsic::experimental_constrained_fmuladd:
ISDs.push_back(ISD::STRICT_FMA);		ISDs.push_back(ISD::STRICT_FMA);
break;		break;
// FIXME: We should return 0 whenever getIntrinsicCost == TCC_Free.		// FIXME: We should return 0 whenever getIntrinsicCost == TCC_Free.
case Intrinsic::lifetime_start:		case Intrinsic::lifetime_start:
case Intrinsic::lifetime_end:		case Intrinsic::lifetime_end:
case Intrinsic::sideeffect:		case Intrinsic::sideeffect:
		case Intrinsic::pseudoprobe:
return 0;		return 0;
case Intrinsic::masked_store: {		case Intrinsic::masked_store: {
Type *Ty = Tys[0];		Type *Ty = Tys[0];
Align TyAlign = thisT()->DL.getABITypeAlign(Ty);		Align TyAlign = thisT()->DL.getABITypeAlign(Ty);
return thisT()->getMaskedMemoryOpCost(Instruction::Store, Ty, TyAlign, 0,		return thisT()->getMaskedMemoryOpCost(Instruction::Store, Ty, TyAlign, 0,
CostKind);		CostKind);
}		}
case Intrinsic::masked_load: {		case Intrinsic::masked_load: {
▲ Show 20 Lines • Show All 509 Lines • Show Last 20 Lines

llvm/include/llvm/IR/BasicBlock.h

Show First 20 Lines • Show All 159 Lines • ▼ Show 20 Lines	public:
/// which might be PHI. Returns 0 is there's no non-PHI instruction.		/// which might be PHI. Returns 0 is there's no non-PHI instruction.
const Instruction* getFirstNonPHI() const;		const Instruction* getFirstNonPHI() const;
Instruction* getFirstNonPHI() {		Instruction* getFirstNonPHI() {
return const_cast<Instruction *>(		return const_cast<Instruction *>(
static_cast<const BasicBlock *>(this)->getFirstNonPHI());		static_cast<const BasicBlock *>(this)->getFirstNonPHI());
}		}

/// Returns a pointer to the first instruction in this block that is not a		/// Returns a pointer to the first instruction in this block that is not a
/// PHINode or a debug intrinsic.		/// PHINode or a debug intrinsic, or any pseudo operation if \c SkipPseudoOp
const Instruction* getFirstNonPHIOrDbg() const;		/// is true.
Instruction* getFirstNonPHIOrDbg() {		const Instruction *getFirstNonPHIOrDbg(bool SkipPseudoOp = false) const;
		Instruction *getFirstNonPHIOrDbg(bool SkipPseudoOp = false) {
return const_cast<Instruction *>(		return const_cast<Instruction *>(
static_cast<const BasicBlock *>(this)->getFirstNonPHIOrDbg());		static_cast<const BasicBlock *>(this)->getFirstNonPHIOrDbg(
		SkipPseudoOp));
}		}

/// Returns a pointer to the first instruction in this block that is not a		/// Returns a pointer to the first instruction in this block that is not a
/// PHINode, a debug intrinsic, or a lifetime intrinsic.		/// PHINode, a debug intrinsic, or a lifetime intrinsic, or any pseudo
const Instruction* getFirstNonPHIOrDbgOrLifetime() const;		/// operation if \c SkipPseudoOp is true.
Instruction* getFirstNonPHIOrDbgOrLifetime() {		const Instruction *
		getFirstNonPHIOrDbgOrLifetime(bool SkipPseudoOp = false) const;
		Instruction *getFirstNonPHIOrDbgOrLifetime(bool SkipPseudoOp = false) {
return const_cast<Instruction *>(		return const_cast<Instruction *>(
static_cast<const BasicBlock *>(this)->getFirstNonPHIOrDbgOrLifetime());		static_cast<const BasicBlock *>(this)->getFirstNonPHIOrDbgOrLifetime(
		SkipPseudoOp));
}		}

/// Returns an iterator to the first instruction in this block that is		/// Returns an iterator to the first instruction in this block that is
/// suitable for inserting a non-PHI instruction.		/// suitable for inserting a non-PHI instruction.
///		///
		davidxlUnsubmitted Not Done Reply Inline Actions Is it possible to also need to skip both PseudoProbe and Lifetime Markers? davidxl: Is it possible to also need to skip both PseudoProbe and Lifetime Markers?
		hoyAuthorUnsubmitted Done Reply Inline Actions Good point. I just checked some uses of `getFirstNonPHIOrDbgOrLifetime` where pseudo probe should also be needed. I'm making a helper for that. I'm thinking about deferring the actual replacement work to when pseudoprobe is used in combination with those cases so that we have good testing there. What do you think? hoy: Good point. I just checked some uses of `getFirstNonPHIOrDbgOrLifetime` where pseudo probe…
		davidxlUnsubmitted Done Reply Inline Actions I have concerns on the proliferation of interfaces. How about extending exiting two interfaces (NonDebug, NonDebugOrLifemaker) with an optional argument 'bool SkipPseudoOp = true'? davidxl: I have concerns on the proliferation of interfaces. How about extending exiting two interfaces…
		hoyAuthorUnsubmitted Done Reply Inline Actions Good idea, thanks for the suggestion! `SkipPseudoOp` is set to false by default. The APIs are called with `SkipPseudoOp=true` where we are sure to skip pseudo probes. hoy: Good idea, thanks for the suggestion! `SkipPseudoOp` is set to false by default. The APIs are…
/// In particular, it skips all PHIs and LandingPad instructions.		/// In particular, it skips all PHIs and LandingPad instructions.
const_iterator getFirstInsertionPt() const;		const_iterator getFirstInsertionPt() const;
iterator getFirstInsertionPt() {		iterator getFirstInsertionPt() {
return static_cast<const BasicBlock *>(this)		return static_cast<const BasicBlock *>(this)
->getFirstInsertionPt().getNonConst();		->getFirstInsertionPt().getNonConst();
}		}

/// Return a const iterator range over the instructions in the block, skipping		/// Return a const iterator range over the instructions in the block, skipping
/// any debug instructions.		/// any debug instructions. Skip any pseudo operations as well if \c
		/// SkipPseudoOp is true.
iterator_range<filter_iterator<BasicBlock::const_iterator,		iterator_range<filter_iterator<BasicBlock::const_iterator,
std::function<bool(const Instruction &)>>>		std::function<bool(const Instruction &)>>>
instructionsWithoutDebug() const;		instructionsWithoutDebug(bool SkipPseudoOp = false) const;

/// Return an iterator range over the instructions in the block, skipping any		/// Return an iterator range over the instructions in the block, skipping any
/// debug instructions.		/// debug instructions. Skip and any pseudo operations as well if \c
iterator_range<filter_iterator<BasicBlock::iterator,		/// SkipPseudoOp is true.
std::function<bool(Instruction &)>>>		iterator_range<
instructionsWithoutDebug();		filter_iterator<BasicBlock::iterator, std::function<bool(Instruction &)>>>
		instructionsWithoutDebug(bool SkipPseudoOp = false);

/// Return the size of the basic block ignoring debug instructions		/// Return the size of the basic block ignoring debug instructions
filter_iterator<BasicBlock::const_iterator,		filter_iterator<BasicBlock::const_iterator,
std::function<bool(const Instruction &)>>::difference_type		std::function<bool(const Instruction &)>>::difference_type
sizeWithoutDebug() const;		sizeWithoutDebug() const;

/// Unlink 'this' from the containing function, but do not delete it.		/// Unlink 'this' from the containing function, but do not delete it.
void removeFromParent();		void removeFromParent();
▲ Show 20 Lines • Show All 337 Lines • Show Last 20 Lines

llvm/include/llvm/IR/Instruction.h

Show First 20 Lines • Show All 645 Lines • ▼ Show 20 Lines	bool isEHPad() const {
}		}
}		}

/// Return true if the instruction is a llvm.lifetime.start or		/// Return true if the instruction is a llvm.lifetime.start or
/// llvm.lifetime.end marker.		/// llvm.lifetime.end marker.
bool isLifetimeStartOrEnd() const;		bool isLifetimeStartOrEnd() const;

/// Return a pointer to the next non-debug instruction in the same basic		/// Return a pointer to the next non-debug instruction in the same basic
/// block as 'this', or nullptr if no such instruction exists.		/// block as 'this', or nullptr if no such instruction exists. Skip any pseudo
const Instruction *getNextNonDebugInstruction() const;		/// operations if \c SkipPseudoOp is true.
Instruction *getNextNonDebugInstruction() {		const Instruction *
		getNextNonDebugInstruction(bool SkipPseudoOp = false) const;
		Instruction *getNextNonDebugInstruction(bool SkipPseudoOp = false) {
return const_cast<Instruction *>(		return const_cast<Instruction *>(
static_cast<const Instruction *>(this)->getNextNonDebugInstruction());		static_cast<const Instruction *>(this)->getNextNonDebugInstruction(
		SkipPseudoOp));
}		}

/// Return a pointer to the previous non-debug instruction in the same basic		/// Return a pointer to the previous non-debug instruction in the same basic
/// block as 'this', or nullptr if no such instruction exists.		/// block as 'this', or nullptr if no such instruction exists. Skip any pseudo
const Instruction *getPrevNonDebugInstruction() const;		/// operations if \c SkipPseudoOp is true.
Instruction *getPrevNonDebugInstruction() {		const Instruction *
		getPrevNonDebugInstruction(bool SkipPseudoOp = false) const;
		Instruction *getPrevNonDebugInstruction(bool SkipPseudoOp = false) {
return const_cast<Instruction *>(		return const_cast<Instruction *>(
static_cast<const Instruction *>(this)->getPrevNonDebugInstruction());		static_cast<const Instruction *>(this)->getPrevNonDebugInstruction(
		SkipPseudoOp));
}		}

/// Create a copy of 'this' instruction that is identical in all ways except		/// Create a copy of 'this' instruction that is identical in all ways except
/// the following:		/// the following:
/// * The instruction has no parent		/// * The instruction has no parent
/// * The instruction has no name		/// * The instruction has no name
///		///
Instruction *clone() const;		Instruction *clone() const;
▲ Show 20 Lines • Show All 165 Lines • Show Last 20 Lines

llvm/include/llvm/IR/IntrinsicInst.h

Show First 20 Lines • Show All 961 Lines • ▼ Show 20 Lines	public:
}		}

// Returns the value site index.		// Returns the value site index.
ConstantInt *getIndex() const {		ConstantInt *getIndex() const {
return cast<ConstantInt>(const_cast<Value *>(getArgOperand(4)));		return cast<ConstantInt>(const_cast<Value *>(getArgOperand(4)));
}		}
};		};

		class PseudoProbeInst : public IntrinsicInst {
		public:
		static bool classof(const IntrinsicInst *I) {
		return I->getIntrinsicID() == Intrinsic::pseudoprobe;
		}

		static bool classof(const Value *V) {
		return isa<IntrinsicInst>(V) && classof(cast<IntrinsicInst>(V));
		}

		ConstantInt *getFuncGuid() const {
		return cast<ConstantInt>(const_cast<Value *>(getArgOperand(0)));
		}

		ConstantInt *getAttributes() const {
		return cast<ConstantInt>(const_cast<Value *>(getArgOperand(2)));
		}

		ConstantInt *getIndex() const {
		return cast<ConstantInt>(const_cast<Value *>(getArgOperand(1)));
		}
		};
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_IR_INTRINSICINST_H		#endif // LLVM_IR_INTRINSICINST_H

llvm/include/llvm/IR/Intrinsics.td

	Show First 20 Lines • Show All 1,271 Lines • ▼ Show 20 Lines
	def int_donothing : DefaultAttrsIntrinsic<[], [], [IntrNoMem, IntrWillReturn]>;			def int_donothing : DefaultAttrsIntrinsic<[], [], [IntrNoMem, IntrWillReturn]>;

	// This instruction has no actual effect, though it is treated by the optimizer			// This instruction has no actual effect, though it is treated by the optimizer
	// has having opaque side effects. This may be inserted into loops to ensure			// has having opaque side effects. This may be inserted into loops to ensure
	// that they are not removed even if they turn out to be empty, for languages			// that they are not removed even if they turn out to be empty, for languages
	// which specify that infinite loops must be preserved.			// which specify that infinite loops must be preserved.
	def int_sideeffect : DefaultAttrsIntrinsic<[], [], [IntrInaccessibleMemOnly, IntrWillReturn]>;			def int_sideeffect : DefaultAttrsIntrinsic<[], [], [IntrInaccessibleMemOnly, IntrWillReturn]>;

				// The pseudoprobe intrinsic works as a place holder to the block it probes.
				// Like the sideeffect intrinsic defined above, this intrinsic is treated by the
				// optimizer as having opaque side effects so that it won't be get rid of or moved
				// out of the block it probes.
				def int_pseudoprobe : Intrinsic<[], [llvm_i64_ty, llvm_i64_ty, llvm_i32_ty],
				[IntrInaccessibleMemOnly, IntrWillReturn]>;

	// Intrinsics to support half precision floating point format			// Intrinsics to support half precision floating point format
	let IntrProperties = [IntrNoMem, IntrWillReturn] in {			let IntrProperties = [IntrNoMem, IntrWillReturn] in {
	def int_convert_to_fp16 : DefaultAttrsIntrinsic<[llvm_i16_ty], [llvm_anyfloat_ty]>;			def int_convert_to_fp16 : DefaultAttrsIntrinsic<[llvm_i16_ty], [llvm_anyfloat_ty]>;
	def int_convert_from_fp16 : DefaultAttrsIntrinsic<[llvm_anyfloat_ty], [llvm_i16_ty]>;			def int_convert_from_fp16 : DefaultAttrsIntrinsic<[llvm_anyfloat_ty], [llvm_i16_ty]>;
	}			}

	// Clear cache intrinsic, default to ignore (ie. emit nothing)			// Clear cache intrinsic, default to ignore (ie. emit nothing)
	// maps to void __clear_cache() on supporting platforms			// maps to void __clear_cache() on supporting platforms
	▲ Show 20 Lines • Show All 368 Lines • Show Last 20 Lines

llvm/lib/Analysis/AliasSetTracker.cpp

Show First 20 Lines • Show All 433 Lines • ▼ Show 20 Lines	if (auto *II = dyn_cast<IntrinsicInst>(Inst)) {
// These intrinsics will show up as affecting memory, but they are just		// These intrinsics will show up as affecting memory, but they are just
// markers.		// markers.
switch (II->getIntrinsicID()) {		switch (II->getIntrinsicID()) {
default:		default:
break;		break;
// FIXME: Add lifetime/invariant intrinsics (See: PR30807).		// FIXME: Add lifetime/invariant intrinsics (See: PR30807).
case Intrinsic::assume:		case Intrinsic::assume:
case Intrinsic::sideeffect:		case Intrinsic::sideeffect:
		case Intrinsic::pseudoprobe:
return;		return;
}		}
}		}
if (!Inst->mayReadOrWriteMemory())		if (!Inst->mayReadOrWriteMemory())
return; // doesn't alias anything		return; // doesn't alias anything

if (AliasSet *AS = findAliasSetForUnknownInst(Inst)) {		if (AliasSet *AS = findAliasSetForUnknownInst(Inst)) {
AS->addUnknownInst(Inst, AA);		AS->addUnknownInst(Inst, AA);
▲ Show 20 Lines • Show All 333 Lines • Show Last 20 Lines

llvm/lib/Analysis/InlineCost.cpp

Show First 20 Lines • Show All 1,905 Lines • ▼ Show 20 Lines	for (BasicBlock::iterator I = BB->begin(), E = BB->end(); I != E; ++I) {
// are actually used by the vector bonus heuristic. As long as that's true,		// are actually used by the vector bonus heuristic. As long as that's true,
// we have to special case debug intrinsics here to prevent differences in		// we have to special case debug intrinsics here to prevent differences in
// inlining due to debug symbols. Eventually, the number of unsimplified		// inlining due to debug symbols. Eventually, the number of unsimplified
// instructions shouldn't factor into the cost computation, but until then,		// instructions shouldn't factor into the cost computation, but until then,
// hack around it here.		// hack around it here.
if (isa<DbgInfoIntrinsic>(I))		if (isa<DbgInfoIntrinsic>(I))
continue;		continue;

		// Skip pseudo-probes.
		if (isa<PseudoProbeInst>(I))
		continue;

// Skip ephemeral values.		// Skip ephemeral values.
if (EphValues.count(&*I))		if (EphValues.count(&*I))
continue;		continue;

++NumInstructions;		++NumInstructions;
if (isa<ExtractElementInst>(I) \|\| I->getType()->isVectorTy())		if (isa<ExtractElementInst>(I) \|\| I->getType()->isVectorTy())
++NumVectorInstructions;		++NumVectorInstructions;

▲ Show 20 Lines • Show All 673 Lines • Show Last 20 Lines

llvm/lib/Analysis/ValueTracking.cpp

	Show First 20 Lines • Show All 521 Lines • ▼ Show 20 Lines
	bool llvm::isAssumeLikeIntrinsic(const Instruction *I) {			bool llvm::isAssumeLikeIntrinsic(const Instruction *I) {
	if (const CallInst *CI = dyn_cast<CallInst>(I))			if (const CallInst *CI = dyn_cast<CallInst>(I))
	if (Function *F = CI->getCalledFunction())			if (Function *F = CI->getCalledFunction())
	switch (F->getIntrinsicID()) {			switch (F->getIntrinsicID()) {
	default: break;			default: break;
	// FIXME: This list is repeated from NoTTI::getIntrinsicCost.			// FIXME: This list is repeated from NoTTI::getIntrinsicCost.
	case Intrinsic::assume:			case Intrinsic::assume:
	case Intrinsic::sideeffect:			case Intrinsic::sideeffect:
				case Intrinsic::pseudoprobe:
	case Intrinsic::dbg_declare:			case Intrinsic::dbg_declare:
	case Intrinsic::dbg_value:			case Intrinsic::dbg_value:
	case Intrinsic::dbg_label:			case Intrinsic::dbg_label:
	case Intrinsic::invariant_start:			case Intrinsic::invariant_start:
	case Intrinsic::invariant_end:			case Intrinsic::invariant_end:
	case Intrinsic::lifetime_start:			case Intrinsic::lifetime_start:
	case Intrinsic::lifetime_end:			case Intrinsic::lifetime_end:
	case Intrinsic::objectsize:			case Intrinsic::objectsize:
	▲ Show 20 Lines • Show All 6,186 Lines • Show Last 20 Lines

llvm/lib/Analysis/VectorUtils.cpp

	Show First 20 Lines • Show All 119 Lines • ▼ Show 20 Lines
	Intrinsic::ID llvm::getVectorIntrinsicIDForCall(const CallInst *CI,			Intrinsic::ID llvm::getVectorIntrinsicIDForCall(const CallInst *CI,
	const TargetLibraryInfo *TLI) {			const TargetLibraryInfo *TLI) {
	Intrinsic::ID ID = getIntrinsicForCallSite(*CI, TLI);			Intrinsic::ID ID = getIntrinsicForCallSite(*CI, TLI);
	if (ID == Intrinsic::not_intrinsic)			if (ID == Intrinsic::not_intrinsic)
	return Intrinsic::not_intrinsic;			return Intrinsic::not_intrinsic;

	if (isTriviallyVectorizable(ID) \|\| ID == Intrinsic::lifetime_start \|\|			if (isTriviallyVectorizable(ID) \|\| ID == Intrinsic::lifetime_start \|\|
	ID == Intrinsic::lifetime_end \|\| ID == Intrinsic::assume \|\|			ID == Intrinsic::lifetime_end \|\| ID == Intrinsic::assume \|\|
	ID == Intrinsic::sideeffect)			ID == Intrinsic::sideeffect \|\| ID == Intrinsic::pseudoprobe)
	return ID;			return ID;
	return Intrinsic::not_intrinsic;			return Intrinsic::not_intrinsic;
	}			}

	/// Find the operand of the GEP that should be checked for consecutive			/// Find the operand of the GEP that should be checked for consecutive
	/// stores. This ignores trailing indices that have no effect on the final			/// stores. This ignores trailing indices that have no effect on the final
	/// pointer.			/// pointer.
	unsigned llvm::getGEPInductionOperand(const GetElementPtrInst *Gep) {			unsigned llvm::getGEPInductionOperand(const GetElementPtrInst *Gep) {
	▲ Show 20 Lines • Show All 1,238 Lines • Show Last 20 Lines

llvm/lib/CodeGen/Analysis.cpp

Show First 20 Lines • Show All 531 Lines • ▼ Show 20 Lines	bool llvm::isInTailCallPosition(const CallBase &Call, const TargetMachine &TM) {
// chain interposes between I and the return.		// chain interposes between I and the return.
// Check for all calls including speculatable functions.		// Check for all calls including speculatable functions.
for (BasicBlock::const_iterator BBI = std::prev(ExitBB->end(), 2);; --BBI) {		for (BasicBlock::const_iterator BBI = std::prev(ExitBB->end(), 2);; --BBI) {
if (&*BBI == &Call)		if (&*BBI == &Call)
break;		break;
// Debug info intrinsics do not get in the way of tail call optimization.		// Debug info intrinsics do not get in the way of tail call optimization.
if (isa<DbgInfoIntrinsic>(BBI))		if (isa<DbgInfoIntrinsic>(BBI))
continue;		continue;
		// Pseudo probe intrinsics do not block tail call optimization either.
		if (isa<PseudoProbeInst>(BBI))
		continue;
// A lifetime end or assume intrinsic should not stop tail call		// A lifetime end or assume intrinsic should not stop tail call
// optimization.		// optimization.
if (const IntrinsicInst *II = dyn_cast<IntrinsicInst>(BBI))		if (const IntrinsicInst *II = dyn_cast<IntrinsicInst>(BBI))
if (II->getIntrinsicID() == Intrinsic::lifetime_end \|\|		if (II->getIntrinsicID() == Intrinsic::lifetime_end \|\|
II->getIntrinsicID() == Intrinsic::assume)		II->getIntrinsicID() == Intrinsic::assume)
continue;		continue;
if (BBI->mayHaveSideEffects() \|\| BBI->mayReadFromMemory() \|\|		if (BBI->mayHaveSideEffects() \|\| BBI->mayReadFromMemory() \|\|
!isSafeToSpeculativelyExecute(&*BBI))		!isSafeToSpeculativelyExecute(&*BBI))
▲ Show 20 Lines • Show All 262 Lines • Show Last 20 Lines

llvm/lib/CodeGen/CodeGenPrepare.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,235 Lines • ▼ Show 20 Lines	bool CodeGenPrepare::dupRetToEnableTailCallOpts(BasicBlock *BB, bool &ModifiedDT) {

// Make sure there are no instructions between the PHI and return, or that the		// Make sure there are no instructions between the PHI and return, or that the
// return is the first instruction in the block.		// return is the first instruction in the block.
if (PN) {		if (PN) {
BasicBlock::iterator BI = BB->begin();		BasicBlock::iterator BI = BB->begin();
// Skip over debug and the bitcast.		// Skip over debug and the bitcast.
do {		do {
++BI;		++BI;
} while (isa<DbgInfoIntrinsic>(BI) \|\| &BI == BCI \|\| &BI == EVI);		} while (isa<DbgInfoIntrinsic>(BI) \|\| &BI == BCI \|\| &BI == EVI \|\|
		isa<PseudoProbeInst>(BI));
if (&*BI != RetI)		if (&*BI != RetI)
return false;		return false;
} else {		} else {
BasicBlock::iterator BI = BB->begin();		if (BB->getFirstNonPHIOrDbg(true) != RetI)
		davidxlUnsubmitted Not Done Reply Inline Actions Perhaps introduce a helper function to skip non-code instructions davidxl: Perhaps introduce a helper function to skip non-code instructions
		hoyAuthorUnsubmitted Done Reply Inline Actions Changed to using `getFirstNonPHIOrDbgOrPseudoProbe`. hoy: Changed to using `getFirstNonPHIOrDbgOrPseudoProbe`.
while (isa<DbgInfoIntrinsic>(BI)) ++BI;
if (&*BI != RetI)
return false;		return false;
		wmiUnsubmitted Done Reply Inline Actions Nit: if (BB->getFirstNonPHIOrDbg(true) != RetI) return false; wmi: Nit: ``` if (BB->getFirstNonPHIOrDbg(true) != RetI) return false; ```
}		}

/// Only dup the ReturnInst if the CallInst is likely to be emitted as a tail		/// Only dup the ReturnInst if the CallInst is likely to be emitted as a tail
/// call.		/// call.
const Function *F = BB->getParent();		const Function *F = BB->getParent();
SmallVector<BasicBlock*, 4> TailCallBBs;		SmallVector<BasicBlock*, 4> TailCallBBs;
if (PN) {		if (PN) {
for (unsigned I = 0, E = PN->getNumIncomingValues(); I != E; ++I) {		for (unsigned I = 0, E = PN->getNumIncomingValues(); I != E; ++I) {
// Look through bitcasts.		// Look through bitcasts.
Value *IncomingVal = PN->getIncomingValue(I)->stripPointerCasts();		Value *IncomingVal = PN->getIncomingValue(I)->stripPointerCasts();
CallInst *CI = dyn_cast<CallInst>(IncomingVal);		CallInst *CI = dyn_cast<CallInst>(IncomingVal);
BasicBlock *PredBB = PN->getIncomingBlock(I);		BasicBlock *PredBB = PN->getIncomingBlock(I);
// Make sure the phi value is indeed produced by the tail call.		// Make sure the phi value is indeed produced by the tail call.
if (CI && CI->hasOneUse() && CI->getParent() == PredBB &&		if (CI && CI->hasOneUse() && CI->getParent() == PredBB &&
TLI->mayBeEmittedAsTailCall(CI) &&		TLI->mayBeEmittedAsTailCall(CI) &&
attributesPermitTailCall(F, CI, RetI, *TLI))		attributesPermitTailCall(F, CI, RetI, *TLI))
TailCallBBs.push_back(PredBB);		TailCallBBs.push_back(PredBB);
}		}
} else {		} else {
SmallPtrSet<BasicBlock*, 4> VisitedBBs;		SmallPtrSet<BasicBlock*, 4> VisitedBBs;
for (pred_iterator PI = pred_begin(BB), PE = pred_end(BB); PI != PE; ++PI) {		for (pred_iterator PI = pred_begin(BB), PE = pred_end(BB); PI != PE; ++PI) {
if (!VisitedBBs.insert(*PI).second)		if (!VisitedBBs.insert(*PI).second)
continue;		continue;
		if (Instruction I = (PI)->rbegin()->getPrevNonDebugInstruction(true)) {
BasicBlock::InstListType &InstList = (*PI)->getInstList();		CallInst *CI = dyn_cast<CallInst>(I);
BasicBlock::InstListType::reverse_iterator RI = InstList.rbegin();
BasicBlock::InstListType::reverse_iterator RE = InstList.rend();
do { ++RI; } while (RI != RE && isa<DbgInfoIntrinsic>(&*RI));
if (RI == RE)
continue;

CallInst CI = dyn_cast<CallInst>(&RI);
if (CI && CI->use_empty() && TLI->mayBeEmittedAsTailCall(CI) &&		if (CI && CI->use_empty() && TLI->mayBeEmittedAsTailCall(CI) &&
attributesPermitTailCall(F, CI, RetI, *TLI))		attributesPermitTailCall(F, CI, RetI, *TLI))
TailCallBBs.push_back(*PI);		TailCallBBs.push_back(*PI);
}		}
}		}
		}

bool Changed = false;		bool Changed = false;
for (auto const &TailCallBB : TailCallBBs) {		for (auto const &TailCallBB : TailCallBBs) {
// Make sure the call instruction is followed by an unconditional branch to		// Make sure the call instruction is followed by an unconditional branch to
// the return block.		// the return block.
BranchInst *BI = dyn_cast<BranchInst>(TailCallBB->getTerminator());		BranchInst *BI = dyn_cast<BranchInst>(TailCallBB->getTerminator());
if (!BI \|\| !BI->isUnconditional() \|\| BI->getSuccessor(0) != BB)		if (!BI \|\| !BI->isUnconditional() \|\| BI->getSuccessor(0) != BB)
continue;		continue;
▲ Show 20 Lines • Show All 5,701 Lines • Show Last 20 Lines

llvm/lib/IR/BasicBlock.cpp

	Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines

	void BasicBlock::setParent(Function *parent) {			void BasicBlock::setParent(Function *parent) {
	// Set Parent=parent, updating instruction symtab entries as appropriate.			// Set Parent=parent, updating instruction symtab entries as appropriate.
	InstList.setSymTabObject(&Parent, parent);			InstList.setSymTabObject(&Parent, parent);
	}			}

	iterator_range<filter_iterator<BasicBlock::const_iterator,			iterator_range<filter_iterator<BasicBlock::const_iterator,
	std::function<bool(const Instruction &)>>>			std::function<bool(const Instruction &)>>>
	BasicBlock::instructionsWithoutDebug() const {			BasicBlock::instructionsWithoutDebug(bool SkipPseudoOp) const {
	std::function<bool(const Instruction &)> Fn = [](const Instruction &I) {			std::function<bool(const Instruction &)> Fn = [=](const Instruction &I) {
	return !isa<DbgInfoIntrinsic>(I);			return !isa<DbgInfoIntrinsic>(I) &&
				!(SkipPseudoOp && isa<PseudoProbeInst>(I));
				davidxlUnsubmitted Done Reply Inline Actions There are still quite a few such predicates (debug \|\| pseudoop), or !(debug\|\|pseudoop) in the patch. Perhaps commonize them. davidxl: There are still quite a few such predicates (debug \|\| pseudoop), or !(debug\|\|pseudoop) in the…
	};			};
	return make_filter_range(*this, Fn);			return make_filter_range(*this, Fn);
	}			}

	iterator_range<filter_iterator<BasicBlock::iterator,			iterator_range<
	std::function<bool(Instruction &)>>>			filter_iterator<BasicBlock::iterator, std::function<bool(Instruction &)>>>
	BasicBlock::instructionsWithoutDebug() {			BasicBlock::instructionsWithoutDebug(bool SkipPseudoOp) {
	std::function<bool(Instruction &)> Fn = [](Instruction &I) {			std::function<bool(Instruction &)> Fn = [=](Instruction &I) {
	return !isa<DbgInfoIntrinsic>(I);			return !isa<DbgInfoIntrinsic>(I) &&
				!(SkipPseudoOp && isa<PseudoProbeInst>(I));
	};			};
	return make_filter_range(*this, Fn);			return make_filter_range(*this, Fn);
	}			}

	filter_iterator<BasicBlock::const_iterator,			filter_iterator<BasicBlock::const_iterator,
	std::function<bool(const Instruction &)>>::difference_type			std::function<bool(const Instruction &)>>::difference_type
	BasicBlock::sizeWithoutDebug() const {			BasicBlock::sizeWithoutDebug() const {
	return std::distance(instructionsWithoutDebug().begin(),			return std::distance(instructionsWithoutDebug().begin(),
	▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines

	const Instruction* BasicBlock::getFirstNonPHI() const {			const Instruction* BasicBlock::getFirstNonPHI() const {
	for (const Instruction &I : *this)			for (const Instruction &I : *this)
	if (!isa<PHINode>(I))			if (!isa<PHINode>(I))
	return &I;			return &I;
	return nullptr;			return nullptr;
	}			}

	const Instruction* BasicBlock::getFirstNonPHIOrDbg() const {			const Instruction *BasicBlock::getFirstNonPHIOrDbg(bool SkipPseudoOp) const {
	for (const Instruction &I : *this)			for (const Instruction &I : *this) {
	if (!isa<PHINode>(I) && !isa<DbgInfoIntrinsic>(I))			if (isa<PHINode>(I) \|\| isa<DbgInfoIntrinsic>(I))
				continue;

				if (SkipPseudoOp && isa<PseudoProbeInst>(I))
				continue;

	return &I;			return &I;
				}
	return nullptr;			return nullptr;
	}			}

	const Instruction* BasicBlock::getFirstNonPHIOrDbgOrLifetime() const {			const Instruction *
				BasicBlock::getFirstNonPHIOrDbgOrLifetime(bool SkipPseudoOp) const {
	for (const Instruction &I : *this) {			for (const Instruction &I : *this) {
	if (isa<PHINode>(I) \|\| isa<DbgInfoIntrinsic>(I))			if (isa<PHINode>(I) \|\| isa<DbgInfoIntrinsic>(I))
	continue;			continue;

	if (I.isLifetimeStartOrEnd())			if (I.isLifetimeStartOrEnd())
	continue;			continue;

				if (SkipPseudoOp && isa<PseudoProbeInst>(I))
				continue;

	return &I;			return &I;
	}			}
	return nullptr;			return nullptr;
	}			}

	BasicBlock::const_iterator BasicBlock::getFirstInsertionPt() const {			BasicBlock::const_iterator BasicBlock::getFirstInsertionPt() const {
	const Instruction *FirstNonPHI = getFirstNonPHI();			const Instruction *FirstNonPHI = getFirstNonPHI();
	if (!FirstNonPHI)			if (!FirstNonPHI)
	▲ Show 20 Lines • Show All 256 Lines • Show Last 20 Lines

llvm/lib/IR/Instruction.cpp

	Show First 20 Lines • Show All 635 Lines • ▼ Show 20 Lines
	bool Instruction::isLifetimeStartOrEnd() const {			bool Instruction::isLifetimeStartOrEnd() const {
	auto II = dyn_cast<IntrinsicInst>(this);			auto II = dyn_cast<IntrinsicInst>(this);
	if (!II)			if (!II)
	return false;			return false;
	Intrinsic::ID ID = II->getIntrinsicID();			Intrinsic::ID ID = II->getIntrinsicID();
	return ID == Intrinsic::lifetime_start \|\| ID == Intrinsic::lifetime_end;			return ID == Intrinsic::lifetime_start \|\| ID == Intrinsic::lifetime_end;
	}			}

	const Instruction *Instruction::getNextNonDebugInstruction() const {			const Instruction *
				Instruction::getNextNonDebugInstruction(bool SkipPseudoOp) const {
	for (const Instruction *I = getNextNode(); I; I = I->getNextNode())			for (const Instruction *I = getNextNode(); I; I = I->getNextNode())
	if (!isa<DbgInfoIntrinsic>(I))			if (!isa<DbgInfoIntrinsic>(I) && !(SkipPseudoOp && isa<PseudoProbeInst>(I)))
	return I;			return I;
	return nullptr;			return nullptr;
	}			}

	const Instruction *Instruction::getPrevNonDebugInstruction() const {			const Instruction *
				Instruction::getPrevNonDebugInstruction(bool SkipPseudoOp) const {
	for (const Instruction *I = getPrevNode(); I; I = I->getPrevNode())			for (const Instruction *I = getPrevNode(); I; I = I->getPrevNode())
	if (!isa<DbgInfoIntrinsic>(I))			if (!isa<DbgInfoIntrinsic>(I) && !(SkipPseudoOp && isa<PseudoProbeInst>(I)))
	return I;			return I;
	return nullptr;			return nullptr;
	}			}

	bool Instruction::isAssociative() const {			bool Instruction::isAssociative() const {
	unsigned Opcode = getOpcode();			unsigned Opcode = getOpcode();
	if (isAssociative(Opcode))			if (isAssociative(Opcode))
	return true;			return true;
	▲ Show 20 Lines • Show All 120 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/JumpThreading.cpp

Show First 20 Lines • Show All 537 Lines • ▼ Show 20 Lines	for (; &*I != StopAt; ++I) {

// Stop scanning the block if we've reached the threshold.		// Stop scanning the block if we've reached the threshold.
if (Size > Threshold)		if (Size > Threshold)
return Size;		return Size;

// Debugger intrinsics don't incur code size.		// Debugger intrinsics don't incur code size.
if (isa<DbgInfoIntrinsic>(I)) continue;		if (isa<DbgInfoIntrinsic>(I)) continue;

		// Pseudo-probes don't incur code size.
		if (isa<PseudoProbeInst>(I))
		continue;

// If this is a pointer->pointer bitcast, it is free.		// If this is a pointer->pointer bitcast, it is free.
if (isa<BitCastInst>(I) && I->getType()->isPointerTy())		if (isa<BitCastInst>(I) && I->getType()->isPointerTy())
continue;		continue;

// Freeze instruction is free, too.		// Freeze instruction is free, too.
if (isa<FreezeInst>(I))		if (isa<FreezeInst>(I))
continue;		continue;

▲ Show 20 Lines • Show All 2,475 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/TailRecursionElimination.cpp

Show First 20 Lines • Show All 234 Lines • ▼ Show 20 Lines	static bool markTails(Function &F, bool &AllCallsAreTailCalls,
BasicBlock *BB = &F.getEntryBlock();		BasicBlock *BB = &F.getEntryBlock();
VisitType Escaped = UNESCAPED;		VisitType Escaped = UNESCAPED;
do {		do {
for (auto &I : *BB) {		for (auto &I : *BB) {
if (Tracker.EscapePoints.count(&I))		if (Tracker.EscapePoints.count(&I))
Escaped = ESCAPED;		Escaped = ESCAPED;

CallInst *CI = dyn_cast<CallInst>(&I);		CallInst *CI = dyn_cast<CallInst>(&I);
if (!CI \|\| CI->isTailCall() \|\| isa<DbgInfoIntrinsic>(&I))		// A PseudoProbeInst has the IntrInaccessibleMemOnly tag hence it is
		// considered accessing memory and will be marked as a tail call if we
		// don't bail out here.
		if (!CI \|\| CI->isTailCall() \|\| isa<DbgInfoIntrinsic>(&I) \|\|
		isa<PseudoProbeInst>(&I))
		davidxlUnsubmitted Not Done Reply Inline Actions why does it access memory? davidxl: why does it access memory?
		hoyAuthorUnsubmitted Done Reply Inline Actions Because it has the `IntrInaccessibleMemOnly` flag. I changed the comment to be less confusing. hoy: Because it has the `IntrInaccessibleMemOnly` flag. I changed the comment to be less confusing.
continue;		continue;

bool IsNoTail = CI->isNoTailCall() \|\| CI->hasOperandBundles();		bool IsNoTail = CI->isNoTailCall() \|\| CI->hasOperandBundles();

if (!IsNoTail && CI->doesNotAccessMemory()) {		if (!IsNoTail && CI->doesNotAccessMemory()) {
// A call to a readnone function whose arguments are all things computed		// A call to a readnone function whose arguments are all things computed
// outside this function can be marked tail. Even if you stored the		// outside this function can be marked tail. Even if you stored the
// alloca address into a global, a readnone function can't load the		// alloca address into a global, a readnone function can't load the
▲ Show 20 Lines • Show All 495 Lines • ▼ Show 20 Lines	bool TailRecursionEliminator::processBlock(
BasicBlock &BB, bool CannotTailCallElimCallsMarkedTail) {		BasicBlock &BB, bool CannotTailCallElimCallsMarkedTail) {
Instruction *TI = BB.getTerminator();		Instruction *TI = BB.getTerminator();

if (BranchInst *BI = dyn_cast<BranchInst>(TI)) {		if (BranchInst *BI = dyn_cast<BranchInst>(TI)) {
if (BI->isConditional())		if (BI->isConditional())
return false;		return false;

BasicBlock *Succ = BI->getSuccessor(0);		BasicBlock *Succ = BI->getSuccessor(0);
ReturnInst *Ret = dyn_cast<ReturnInst>(Succ->getFirstNonPHIOrDbg());		ReturnInst *Ret = dyn_cast<ReturnInst>(Succ->getFirstNonPHIOrDbg(true));

if (!Ret)		if (!Ret)
return false;		return false;

CallInst *CI = findTRECandidate(&BB, CannotTailCallElimCallsMarkedTail);		CallInst *CI = findTRECandidate(&BB, CannotTailCallElimCallsMarkedTail);

if (!CI)		if (!CI)
return false;		return false;
▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/Evaluator.cpp

Show First 20 Lines • Show All 545 Lines • ▼ Show 20 Lines	if (StoreInst *SI = dyn_cast<StoreInst>(CurInst)) {
} else if (II->getIntrinsicID() == Intrinsic::assume) {		} else if (II->getIntrinsicID() == Intrinsic::assume) {
LLVM_DEBUG(dbgs() << "Skipping assume intrinsic.\n");		LLVM_DEBUG(dbgs() << "Skipping assume intrinsic.\n");
++CurInst;		++CurInst;
continue;		continue;
} else if (II->getIntrinsicID() == Intrinsic::sideeffect) {		} else if (II->getIntrinsicID() == Intrinsic::sideeffect) {
LLVM_DEBUG(dbgs() << "Skipping sideeffect intrinsic.\n");		LLVM_DEBUG(dbgs() << "Skipping sideeffect intrinsic.\n");
++CurInst;		++CurInst;
continue;		continue;
		} else if (II->getIntrinsicID() == Intrinsic::pseudoprobe) {
		LLVM_DEBUG(dbgs() << "Skipping pseudoprobe intrinsic.\n");
		++CurInst;
		continue;
}		}

LLVM_DEBUG(dbgs() << "Unknown intrinsic. Can not evaluate.\n");		LLVM_DEBUG(dbgs() << "Unknown intrinsic. Can not evaluate.\n");
return false;		return false;
}		}

// Resolve function pointers.		// Resolve function pointers.
SmallVector<Constant *, 8> Formals;		SmallVector<Constant *, 8> Formals;
▲ Show 20 Lines • Show All 167 Lines • Show Last 20 Lines

llvm/lib/Transforms/Utils/SimplifyCFG.cpp

Show First 20 Lines • Show All 1,975 Lines • ▼ Show 20 Lines	static Value isSafeToSpeculateStore(Instruction I, BasicBlock *BrBB,
// Volatile or atomic.		// Volatile or atomic.
if (!StoreToHoist->isSimple())		if (!StoreToHoist->isSimple())
return nullptr;		return nullptr;

Value *StorePtr = StoreToHoist->getPointerOperand();		Value *StorePtr = StoreToHoist->getPointerOperand();

// Look for a store to the same pointer in BrBB.		// Look for a store to the same pointer in BrBB.
unsigned MaxNumInstToLookAt = 9;		unsigned MaxNumInstToLookAt = 9;
for (Instruction &CurI : reverse(BrBB->instructionsWithoutDebug())) {		// Skip pseudo probe intrinsic calls which are not really killing any memory
		// accesses.
		for (Instruction &CurI : reverse(BrBB->instructionsWithoutDebug(true))) {
if (!MaxNumInstToLookAt)		if (!MaxNumInstToLookAt)
break;		break;
--MaxNumInstToLookAt;		--MaxNumInstToLookAt;

// Could be calling an instruction that affects memory like free().		// Could be calling an instruction that affects memory like free().
if (CurI.mayHaveSideEffects() && !isa<StoreInst>(CurI))		if (CurI.mayHaveSideEffects() && !isa<StoreInst>(CurI))
return nullptr;		return nullptr;

if (auto *SI = dyn_cast<StoreInst>(&CurI)) {		if (auto *SI = dyn_cast<StoreInst>(&CurI)) {
		wmiUnsubmitted Not Done Reply Inline Actions Can we check CallBase::onlyAccessesInaccessibleMemory instead of checking PseudoProbeInst here? wmi: Can we check CallBase::onlyAccessesInaccessibleMemory instead of checking PseudoProbeInst here?
		hoyAuthorUnsubmitted Done Reply Inline Actions Good question. Checking against `PseudoProbeInst` is more than `onlyAccessesInaccessibleMemory`. The instruction identified here will be either converted to a `select` or deleted. I probably should move the `PseudoProbeInst` check into `BrBB->instructionsWithoutDebug()`. hoy: Good question. Checking against `PseudoProbeInst` is more than `onlyAccessesInaccessibleMemory`.
		hoyAuthorUnsubmitted Done Reply Inline Actions On the second thought, it might not be a good idea to fuse the check into `instructionsWithoutDebug` since API is used in quite some places. We sometimes like a `PseudoProbeInst` treated like a debug intrinsic but sometimes not. It is specific to the transform we'd like to not block. hoy: On the second thought, it might not be a good idea to fuse the check into…
		wmiUnsubmitted Not Done Reply Inline Actions I think it is a good idea to fuse the check into instructionsWithoutDebug. That will make psuedoProbe intrinsic behave more like debug intrinsic and block less transformations, like we discussed in D86193. What is the case you have concern about? wmi: I think it is a good idea to fuse the check into instructionsWithoutDebug. That will make…
		hoyAuthorUnsubmitted Done Reply Inline Actions I was thinking that some passes may just use `instructionsWithoutDebug` to ignore and remove debug instructions. We may want that happening to pseudo probe selectively. By searching where the API is used, it looks like it's fine have them handle pseudo probes as well. I'm testing to see if there's a regression to the profile quality. hoy: I was thinking that some passes may just use `instructionsWithoutDebug` to ignore and remove…
		hoyAuthorUnsubmitted Done Reply Inline Actions Test result looked OK. Moved the check into `instructionsWithoutDebug` . hoy: Test result looked OK. Moved the check into `instructionsWithoutDebug` .
// Found the previous store make sure it stores to the same location.		// Found the previous store make sure it stores to the same location.
if (SI->getPointerOperand() == StorePtr)		if (SI->getPointerOperand() == StorePtr)
// Found the previous store, return its value operand.		// Found the previous store, return its value operand.
return SI->getValueOperand();		return SI->getValueOperand();
return nullptr; // Unknown store.		return nullptr; // Unknown store.
}		}
}		}

▲ Show 20 Lines • Show All 134 Lines • ▼ Show 20 Lines	for (BasicBlock::iterator BBI = ThenBB->begin(),
BBI != BBE; ++BBI) {		BBI != BBE; ++BBI) {
Instruction I = &BBI;		Instruction I = &BBI;
// Skip debug info.		// Skip debug info.
if (isa<DbgInfoIntrinsic>(I)) {		if (isa<DbgInfoIntrinsic>(I)) {
SpeculatedDbgIntrinsics.push_back(I);		SpeculatedDbgIntrinsics.push_back(I);
continue;		continue;
}		}

		// Skip pseudo probes. The consequence is we lose track of the branch
		// probability for ThenBB, which is fine since the optimization here takes
		// place regardless of the branch probability.
		if (isa<PseudoProbeInst>(I)) {
		SpeculatedDbgIntrinsics.push_back(I);
		continue;
		}

// Only speculatively execute a single instruction (not counting the		// Only speculatively execute a single instruction (not counting the
// terminator) for now.		// terminator) for now.
++SpeculatedInstructions;		++SpeculatedInstructions;
if (SpeculatedInstructions > 1)		if (SpeculatedInstructions > 1)
return false;		return false;

// Don't hoist the instruction if it's unsafe or expensive.		// Don't hoist the instruction if it's unsafe or expensive.
if (!isSafeToSpeculativelyExecute(I) &&		if (!isSafeToSpeculativelyExecute(I) &&
▲ Show 20 Lines • Show All 338 Lines • ▼ Show 20 Lines	static bool FoldTwoEntryPHINode(PHINode *PN, const TargetTransformInfo &TTI,
BasicBlock *DomBlock = nullptr;		BasicBlock *DomBlock = nullptr;
BasicBlock *IfBlock1 = PN->getIncomingBlock(0);		BasicBlock *IfBlock1 = PN->getIncomingBlock(0);
BasicBlock *IfBlock2 = PN->getIncomingBlock(1);		BasicBlock *IfBlock2 = PN->getIncomingBlock(1);
if (cast<BranchInst>(IfBlock1->getTerminator())->isConditional()) {		if (cast<BranchInst>(IfBlock1->getTerminator())->isConditional()) {
IfBlock1 = nullptr;		IfBlock1 = nullptr;
} else {		} else {
DomBlock = *pred_begin(IfBlock1);		DomBlock = *pred_begin(IfBlock1);
for (BasicBlock::iterator I = IfBlock1->begin(); !I->isTerminator(); ++I)		for (BasicBlock::iterator I = IfBlock1->begin(); !I->isTerminator(); ++I)
if (!AggressiveInsts.count(&*I) && !isa<DbgInfoIntrinsic>(I)) {		if (!AggressiveInsts.count(&*I) && !isa<DbgInfoIntrinsic>(I) &&
		!isa<PseudoProbeInst>(I)) {
// This is not an aggressive instruction that we can promote.		// This is not an aggressive instruction that we can promote.
// Because of this, we won't be able to get rid of the control flow, so		// Because of this, we won't be able to get rid of the control flow, so
// the xform is not worth it.		// the xform is not worth it.
return Changed;		return Changed;
}		}
}		}

if (cast<BranchInst>(IfBlock2->getTerminator())->isConditional()) {		if (cast<BranchInst>(IfBlock2->getTerminator())->isConditional()) {
IfBlock2 = nullptr;		IfBlock2 = nullptr;
} else {		} else {
DomBlock = *pred_begin(IfBlock2);		DomBlock = *pred_begin(IfBlock2);
for (BasicBlock::iterator I = IfBlock2->begin(); !I->isTerminator(); ++I)		for (BasicBlock::iterator I = IfBlock2->begin(); !I->isTerminator(); ++I)
if (!AggressiveInsts.count(&*I) && !isa<DbgInfoIntrinsic>(I)) {		if (!AggressiveInsts.count(&*I) && !isa<DbgInfoIntrinsic>(I) &&
		!isa<PseudoProbeInst>(I)) {
// This is not an aggressive instruction that we can promote.		// This is not an aggressive instruction that we can promote.
// Because of this, we won't be able to get rid of the control flow, so		// Because of this, we won't be able to get rid of the control flow, so
// the xform is not worth it.		// the xform is not worth it.
return Changed;		return Changed;
}		}
}		}
assert(DomBlock && "Failed to find root DomBlock");		assert(DomBlock && "Failed to find root DomBlock");

▲ Show 20 Lines • Show All 3,812 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp

Show First 20 Lines • Show All 660 Lines • ▼ Show 20 Lines	if (isa<LoadInst>(I) \|\| isa<StoreInst>(I)) {
if (!is_contained(Chain, &I))		if (!is_contained(Chain, &I))
MemoryInstrs.push_back(&I);		MemoryInstrs.push_back(&I);
else		else
ChainInstrs.push_back(&I);		ChainInstrs.push_back(&I);
} else if (isa<IntrinsicInst>(&I) &&		} else if (isa<IntrinsicInst>(&I) &&
cast<IntrinsicInst>(&I)->getIntrinsicID() ==		cast<IntrinsicInst>(&I)->getIntrinsicID() ==
Intrinsic::sideeffect) {		Intrinsic::sideeffect) {
// Ignore llvm.sideeffect calls.		// Ignore llvm.sideeffect calls.
		} else if (isa<IntrinsicInst>(&I) &&
		cast<IntrinsicInst>(&I)->getIntrinsicID() ==
		Intrinsic::pseudoprobe) {
		// Ignore llvm.pseudoprobe calls.
} else if (IsLoadChain && (I.mayWriteToMemory() \|\| I.mayThrow())) {		} else if (IsLoadChain && (I.mayWriteToMemory() \|\| I.mayThrow())) {
LLVM_DEBUG(dbgs() << "LSV: Found may-write/throw operation: " << I		LLVM_DEBUG(dbgs() << "LSV: Found may-write/throw operation: " << I
<< '\n');		<< '\n');
break;		break;
} else if (!IsLoadChain && (I.mayReadOrWriteMemory() \|\| I.mayThrow())) {		} else if (!IsLoadChain && (I.mayReadOrWriteMemory() \|\| I.mayThrow())) {
LLVM_DEBUG(dbgs() << "LSV: Found may-read/write/throw operation: " << I		LLVM_DEBUG(dbgs() << "LSV: Found may-read/write/throw operation: " << I
<< '\n');		<< '\n');
break;		break;
▲ Show 20 Lines • Show All 635 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,406 Lines • ▼ Show 20 Lines	bool IsPredicated = LoopVectorizationPlanner::getDecisionAndClampRange(
},		},
Range);		Range);

if (IsPredicated)		if (IsPredicated)
return nullptr;		return nullptr;

Intrinsic::ID ID = getVectorIntrinsicIDForCall(CI, TLI);		Intrinsic::ID ID = getVectorIntrinsicIDForCall(CI, TLI);
if (ID && (ID == Intrinsic::assume \|\| ID == Intrinsic::lifetime_end \|\|		if (ID && (ID == Intrinsic::assume \|\| ID == Intrinsic::lifetime_end \|\|
ID == Intrinsic::lifetime_start \|\| ID == Intrinsic::sideeffect))		ID == Intrinsic::lifetime_start \|\| ID == Intrinsic::sideeffect \|\|
		ID == Intrinsic::pseudoprobe))
return nullptr;		return nullptr;

auto willWiden = [&](ElementCount VF) -> bool {		auto willWiden = [&](ElementCount VF) -> bool {
Intrinsic::ID ID = getVectorIntrinsicIDForCall(CI, TLI);		Intrinsic::ID ID = getVectorIntrinsicIDForCall(CI, TLI);
// The following case may be scalarized depending on the VF.		// The following case may be scalarized depending on the VF.
// The flag shows whether we use Intrinsic or a usual Call for vectorized		// The flag shows whether we use Intrinsic or a usual Call for vectorized
// version of the instruction.		// version of the instruction.
// Is it beneficial to perform intrinsic call compared to lib call?		// Is it beneficial to perform intrinsic call compared to lib call?
▲ Show 20 Lines • Show All 1,279 Lines • Show Last 20 Lines

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 5,255 Lines • ▼ Show 20 Lines	if (!SD) {
SD->Inst = I;		SD->Inst = I;
}		}
assert(!isInSchedulingRegion(SD) &&		assert(!isInSchedulingRegion(SD) &&
"new ScheduleData already in scheduling region");		"new ScheduleData already in scheduling region");
SD->init(SchedulingRegionID, I);		SD->init(SchedulingRegionID, I);

if (I->mayReadOrWriteMemory() &&		if (I->mayReadOrWriteMemory() &&
(!isa<IntrinsicInst>(I) \|\|		(!isa<IntrinsicInst>(I) \|\|
cast<IntrinsicInst>(I)->getIntrinsicID() != Intrinsic::sideeffect)) {		(cast<IntrinsicInst>(I)->getIntrinsicID() != Intrinsic::sideeffect &&
		cast<IntrinsicInst>(I)->getIntrinsicID() !=
		Intrinsic::pseudoprobe))) {
// Update the linked list of memory accessing instructions.		// Update the linked list of memory accessing instructions.
if (CurrentLoadStore) {		if (CurrentLoadStore) {
CurrentLoadStore->NextLoadStore = SD;		CurrentLoadStore->NextLoadStore = SD;
} else {		} else {
FirstLoadStoreInRegion = SD;		FirstLoadStoreInRegion = SD;
}		}
CurrentLoadStore = SD;		CurrentLoadStore = SD;
}		}
▲ Show 20 Lines • Show All 2,594 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[CSSPGO] IR intrinsic for pseudo-probe block instrumentationClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 306705

llvm/include/llvm/Analysis/TargetTransformInfoImpl.h

llvm/include/llvm/CodeGen/BasicTTIImpl.h

llvm/include/llvm/IR/BasicBlock.h

llvm/include/llvm/IR/Instruction.h

llvm/include/llvm/IR/IntrinsicInst.h

llvm/include/llvm/IR/Intrinsics.td

llvm/lib/Analysis/AliasSetTracker.cpp

llvm/lib/Analysis/InlineCost.cpp

llvm/lib/Analysis/ValueTracking.cpp

llvm/lib/Analysis/VectorUtils.cpp

llvm/lib/CodeGen/Analysis.cpp

llvm/lib/CodeGen/CodeGenPrepare.cpp

llvm/lib/IR/BasicBlock.cpp

llvm/lib/IR/Instruction.cpp

llvm/lib/Transforms/Scalar/JumpThreading.cpp

llvm/lib/Transforms/Scalar/TailRecursionElimination.cpp

llvm/lib/Transforms/Utils/Evaluator.cpp

llvm/lib/Transforms/Utils/SimplifyCFG.cpp

llvm/lib/Transforms/Vectorize/LoadStoreVectorizer.cpp

llvm/lib/Transforms/Vectorize/LoopVectorize.cpp

llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp

[CSSPGO] IR intrinsic for pseudo-probe block instrumentation
ClosedPublic