This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
bindings/ocaml/transforms/scalar_opts/
-
ocaml/
-
transforms/
-
scalar_opts/
-
llvm_scalar_opts.mli
-
scalar_opts_ocaml.c
-
include/
-
llvm-c/Transforms/
-
Transforms/
-
Scalar.h
-
llvm/
-
Analysis/
3/3
InstructionSimplify.h
-
InitializePasses.h
-
LinkAllPasses.h
-
Transforms/
-
Scalar.h
-
Scalar/
-
LowerIsConstantIntrinsic.h
-
lib/
-
Analysis/
3/4
InstructionSimplify.cpp
-
CodeGen/
-
TargetPassConfig.cpp
-
Passes/
-
PassBuilder.cpp
1/1
PassRegistry.def
-
Transforms/Scalar/
-
Scalar/
-
CMakeLists.txt
2
LowerIsConstantIntrinsic.cpp
-
Scalar.cpp
-
test/
-
CodeGen/
-
AArch64/
-
O0-pipeline.ll
-
O3-pipeline.ll
-
ARM/
-
O3-pipeline.ll
-
X86/
-
O0-pipeline.ll
-
O3-pipeline.ll
-
is-constant.ll
-
Transforms/LowerIsConstant/
-
LowerIsConstant/
1
is-constant.ll
-
utils/gn/secondary/llvm/lib/Transforms/Scalar/
-
gn/
-
secondary/
-
llvm/
-
lib/
-
Transforms/
-
Scalar/
-
BUILD.gn

Differential D65280

Add a pass to lower is.constant and objectsize intrinsics
ClosedPublic

Authored by joerg on Jul 25 2019, 6:51 AM.

Download Raw Diff

Details

Reviewers

chandlerc
whitequark
void
deadalnix

Commits

rG9681ea9560a0: Reapply r374743 with a fix for the ocaml binding
rL374784: Reapply r374743 with a fix for the ocaml binding
rGe4300c392de2: Add a pass to lower is.constant and objectsize intrinsics
rL374743: Add a pass to lower is.constant and objectsize intrinsics

Summary

This pass lowers the remaining is.constant and objectsize intrinsics. This moves the lowering from the codegen-prepare pass and the fallback in the SDAG/GlobalISel/FastISel layers.

The API for replaceAndRecursivelySimplify is extended to provide a list of un-modified instructions. Those are scanned for conditional branches with now invariant condition.

Diff Detail

Event Timeline

joerg created this revision.Jul 25 2019, 6:51 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 25 2019, 6:51 AM

Herald added subscribers: hiraditya, mgorny. · View Herald Transcript

Avoid goto. Create new BranchInst instead of modifying in-place. Update tests to reflect changes. Move most of the x86 is-constant test to generic.

Herald added a subscriber: javed.absar. · View Herald TranscriptJul 27 2019, 1:22 PM

Couple of random comments, haven't looked at most of it yet.

include/llvm/Analysis/InstructionSimplify.h
258	Really would like to avoid boolean arguments like this.
lib/Analysis/InstructionSimplify.cpp
5047	This seems to return nullptr everywhere?

arsenm added a subscriber: arsenm.Jul 29 2019, 1:39 PM

arsenm added inline comments.

lib/Transforms/Scalar/LowerIsConstantIntrinsic.cpp
51	Should early exit if there are is no is_constant declaration, or just visit uses of it rather than search the whole function for a rarely used intrinsic?

Feel free to hijack my revision. :-) I may not get back to my change soon.

joerg mentioned this in D64963: Add a pass to lower is.constant intrinsics.Jul 29 2019, 2:44 PM

joerg marked 3 inline comments as done.Jul 29 2019, 3:01 PM

joerg added inline comments.

include/llvm/Analysis/InstructionSimplify.h
258	Right, this is the one big part of the change I am absolutely not sure about. I was considering passing a pointer to a vector down to replaceAndRecursivelySimplify. The contract would be to return the work list of all modified uses. The caller can then iterate the list and see if any branch instructions are affected.
lib/Analysis/InstructionSimplify.cpp
5047	It does, trying to preserve the style of the other functions here. The question might become irrelevant when going with the work list idea.
lib/Transforms/Scalar/LowerIsConstantIntrinsic.cpp
51	As mentioned on IRC, llvm.is.constant is a family of intrinsics depending on the type of the argument. There doesn't seem to be an easy way to check if a given template-like intrinsic is used.

Replace boolean argument to replaceAndRecursivelySimplify with optional vector of un-modified instructions. This simplifies the API change significantly and allows other potential use cases. Redo the restart handling. After a successful simplification step, restart the current BB only, but always do another full pass of the function.

joerg marked 2 inline comments as done and an inline comment as not done.Aug 1 2019, 6:49 AM

I think the pass needs to handle the removal of any remaining llvm.objectsize, as well, so that llvm.is.constant(llvm.objectsize(...)) continues to return true -- even if the object size cannot be determined.

With this new pass, I think we should be able to remove the code handling the objectsize and is.constant instrinsics in CodegenPrepare, FastISel, SelectionDAGBuilder.cpp, and IRTranslator. Which sure is a nice cleanup.

test/Transforms/LowerIsConstant/is-constant.ll
1	While this is a more targeted run, the old asserts are checking what _actually_ needs to continue to work. E.g. I'd be more confident that my suggested removal of the handling code elsewhere was correct if this test still ran the original 4 tests.

Update PHI nodes in disconnected block.

joerg mentioned this in D60943: Delay diagnosing asm constraints that require immediates until after inlining.Aug 2 2019, 5:30 AM

Generalize slightly to also cover llvm.objectsize which has very similar constraints.

Generally, I do like the approach. Two high level comments:

First, I don't think we can crash if these things reach lower layers. I think we should retain the logic to just fold them to a constant if somehow they show up. Anything else makes things weird -- the IR is valid, passes the verifier, but crashes? We want to support fuzzing and bisecting and such and so it should be possible to rip this pass out and still have things lower successfully. This pass should just be a much more *advanced* way of lowering these intrinsics. And yes, I understand that in some bizarre cases failing to do this "optimization" will result in inline assembly that cannot be emitted or some such. While I find that really frustrating as well, I still think it is better than making even more boring uses of these intrinsics hostile to things like fuzzers.

Second, I don't think you need the fancy approach you're taking to find the branches that need simplification. How about the following approach:

Walk every instruction and if it is one of these instrinsics, fold it and then recursively simplify.

If we simplify any intrinsics, but after simplifying *all* of them found in the function, walk every terminator in every block and if it is a conditional branch with a constant condition, fold the branch.

I don't think this will be too wasteful, and seems much simpler.

This revision now requires changes to proceed.Aug 3 2019, 6:54 PM

For the first part, I was actually asked to do that. I don't mind either way.

For the second part, it seems to lead to inconsistent behavior. If we want to go down that road, it seems to me that this behavior should either be a separate branch or be folded into the existing unreachable block removal pass.

Looking a bit more into the details. Chandler, you've originally suggested going with the LowerAtomic route and that actually does create code that fails the SDAG lowering if the pass is skipped, e.g. on ARM.
The second part is currently overlapping with the CodeGenPrepare pass. I can cleanup the implementation somewhat by reusing the same functionality as that pass is using OR I could factor out a minimal branch for doing the constant folding optimization from CodeGenPrepare as a general branch that is included in the pass chain for -O0, e.g. instead of the more general CodeGenPrepare pass. The main difference is that the non-optimized build would not get any recursive folding from PHI-simplification, but I think that's fine for the original use case. It would also not get the block merging, but again, that seems to be fine for the constraints.

In D65280#1614752, @joerg wrote:

Looking a bit more into the details. Chandler, you've originally suggested going with the LowerAtomic route and that actually does create code that fails the SDAG lowering if the pass is skipped, e.g. on ARM.

Sorry, I had forgotten about that discussion. Thank you for digging it up and reminding me to be consistent with my past self.

If this pass is going into the llc pipeline like the lower atomic pass, you're right it totally makes sense to fail hard in SDAG lowering. Sorry for the confusion again.

The second part is currently overlapping with the CodeGenPrepare pass. I can cleanup the implementation somewhat by reusing the same functionality as that pass is using OR I could factor out a minimal branch for doing the constant folding optimization from CodeGenPrepare as a general branch that is included in the pass chain for -O0, e.g. instead of the more general CodeGenPrepare pass. The main difference is that the non-optimized build would not get any recursive folding from PHI-simplification, but I think that's fine for the original use case. It would also not get the block merging, but again, that seems to be fine for the constraints.

I'm not really sure I understand the problem?

This pass needs to run (at least) in the same general area as the atomics lowering for the same reasons (it would be part of SDAG but it can't be due to CFG mutation). It might optionally be run in other places of course.

I think you have all the code you need in this patch already. I'm just suggesting a different strategy for applying it to the IR that I think will be simpler (and not require any changes to instsimplify). Maybe my suggestion isn't clear? If so I can try to clarify....

This PR is possibly related to the original issue: https://bugs.llvm.org/show_bug.cgi?id=42956

Thanks to Joerg for some useful discussion on IRC -- there was a concern I hadn't thought about that is exactly right: we somewhat want this pass to minimally disrupt things but also to be reasonably self contained.

Based on the discussion, I think the patch can still be simplified a bit though (although not as much as I had originally suggested). Notably, the additional API for inst simplify makes a lot more sense to me now.

I'd suggest thinking about this in three phases.

First, simplify the instructions to constants. Here, you would keep the new API to extract a list of potentially relevant instructions. After each simplification, scan this and append any branches or switches to a list of terminators needing an update. (May want it to be a set vector so you don't duplicate.) But don't actually update the CFG in any way during this phase.

Second, rewrite the branches and switches collected in the first phase, collecting basic blocks that might be made unreachable in another SetVector. This just mutates the CFG but doesn't delete any blocks and so doesn't make any of the instructions invalid to visit. You should be able to collect a batch of domtree updates during this phase as well without applying them eagerly.

Third, apply the domtree updates (so you have precise reachability) and remove any blocks from the list that are still reachable. Hand the remaining blocks to the DeleteBasicBlocks utility to completely remove them. This will remove the implicit dependence between this pass and the later codegen passes which I think is much cleaner.

I think that will still remove the need to iterate, and will enhance this to actually fully remove the blocks orphaned by the folding of these branches (but *only* those blocks). Thoughts?

Simplify the pass logic. First round will update the predecessor links and note if any block is orphaned, a second round will remove unreachable blocks if necessary.
The pass can be further refined to use and update DominatorTree and AssumptionCache incrementally, but this should be functionally complete now. It will not handle more complex cases like orphaned loops, but I don't think those are commonly used with is.constant or objectsize conditions either.
The pass will now scan every BB once, but fall back to the start of a BB of recursive removal invalidates the iterator. This seems to be the strictest form I can manage.

Hook up a second instance of the pass after the Float To Int pass for optimized builds. This is after the initial loop transforms, so it can profit from some unrolling, but it is before vectorization. The late run of the pass should is kept for now and ensures any potentially added variants are still dropped before SDAG.

Herald added subscribers: dexonsmith, steven_wu, mehdi_amini. · View Herald TranscriptAug 26 2019, 8:22 AM

Chandler, are you OK with getting the InstructionSimplify.h part in now, so that it can be merged into 9.0 and the rest follow separately?

In D65280#1647026, @joerg wrote:

Chandler, are you OK with getting the InstructionSimplify.h part in now, so that it can be merged into 9.0 and the rest follow separately?

Can we not get the entire thing merged? I'd really like that... I think the patch is actually really close. I have a bunch of comments below but they're all pretty boring in reality.

include/llvm/Analysis/InstructionSimplify.h
265–266	I'd call this `UnsimplifiedUsers`.
lib/Analysis/InstructionSimplify.cpp
5242–5243	I'd make this a bit more precise... "Recursively visited users which do not themselves simplify are ..."
5251	If this is what `clang-format` did with this... I am sad. If not, clang-format?
lib/Transforms/Scalar/LowerConstantIntrinsics.cpp
41 ↗	(On Diff #217160)	No #include after usings and statistics and such please.
45–51 ↗	(On Diff #217160)	Seems simpler to write as: return isa<Constant>(Op) ? ConstantInt::getTrue(II->getType()) : ConstantInt::getFalse(II->getType());
54–55 ↗	(On Diff #217160)	clang-format please. I'd also call this `replaceConditionalBranchesOnConstant`.
113–117 ↗	(On Diff #217160)	As I tried to indicate before, this can be even simpler AFAICT. I think you should keep a single worklist of unsimplified users. Then you should just do the recursive inst simplify here using the same worklist each time. You'll want to change the loop to go in RPO over the function rather than in the naive order. Then below this loop you will have a single worklist of unsimplified users that you can walk with the exact code you have above AFAICT.

Can we not get the entire thing merged? I'd really like that... I think the patch is actually really close. I have a bunch of comments below but they're all pretty boring in reality.

It's really late in the process -- as in I'm only really waiting for this and one other bug -- so I'm hesitant to take a large change, but I also don't really understand exactly what's at stake here.

In D65280#1649062, @hans wrote:

Can we not get the entire thing merged? I'd really like that... I think the patch is actually really close. I have a bunch of comments below but they're all pretty boring in reality.

It's really late in the process -- as in I'm only really waiting for this and one other bug -- so I'm hesitant to take a large change, but I also don't really understand exactly what's at stake here.

I'm also against putting it into 9.0, which is supposed to have a final release Real Soon Now. This is not a obviously-correct change, and it should bake in trunk for at least a couple weeks before going into the release, to shake out any unexpected problems.

I think we should target this for the 9.0.1 patch though, which is why joerg wants to merge the API change to 9.0 now.

I'm also against putting it into 9.0, which is supposed to have a final release Real Soon Now. This is not a obviously-correct change, and it should bake in trunk for at least a couple weeks before going into the release, to shake out any unexpected problems.

I think we should target this for the 9.0.1 patch though, which is why joerg wants to merge the API change to 9.0 now.

This sounds good to me. From my point of view, ideally the API change would be broken out and landed on trunk as soon as possible, and then I'd merge that change to the 9 branch.

Apply feedback.

joerg marked an inline comment as not done.Aug 28 2019, 8:02 AM

joerg added inline comments.

lib/Transforms/Scalar/LowerConstantIntrinsics.cpp
113–117 ↗	(On Diff #217160)	Inside a BB, we have to use forward order for the processing so that the intrinsic calls are removed with the expected results. This means that the recursive simplification can remove the next instruction (InstPtr) or unhook it (no parent). Both cases are covered by the test cases. I don't see how a different iteration order inside the BB can work or avoid the problem. Updating the CFG directly can help in those cases were PHIs are removed by removePredecessor. That seems generally desirable as property. So the question is which iteration order for the BBs to use. BB iteration order can change the result, but I'm not sure computing RPO is worth the hassle here? I'm leaving the actual removing of the dead block to removeUnreachableBlocks, especially since I don't want to open-code recursive removal and other edge cases.

joerg mentioned this in rL370355: Allow replaceAndRecursivelySimplify to list unsimplified visitees..Aug 29 2019, 6:22 AM

joerg mentioned this in rG799c96693f68: Allow replaceAndRecursivelySimplify to list unsimplified visitees..

hans mentioned this in rL370447: Merging r370355:.Aug 30 2019, 2:05 AM

hansw mentioned this in rG892dfd7d4e3b: Merging r370355: --------------------------------------------------------------….Aug 30 2019, 2:07 AM

I've merged the InstructionSimplify.h part of this (r370355) to release_90 in r370447.

Switch the pass to use two rounds. The first round will collect all relevant intrinsics in RPO, the second one will translate them accordingly.

joerg marked 2 inline comments as done.Sep 9 2019, 3:40 PM

Ping

2nd ping. Chandler, care to check this please?

(Tried to get this out last weekend, but was blocked by the Phab down time... Sorry about that ...)

Mostly nits around the exact code here. The approach looks really nice now (and sorry it took so many iterations to get there).

lib/Passes/PassRegistry.def
189	Maybe `lower-constant-intrinsics` as a name? (Since it handles `objectsize` as well.
lib/Transforms/Scalar/LowerConstantIntrinsics.cpp
93 ↗	(On Diff #219444)	Use an early continue to reduce indentation?
96 ↗	(On Diff #219444)	Odd to continue here but break below. Doesn't matter in this case of course, but just seemed surprising.
105–106 ↗	(On Diff #219444)	FWIW, this doesn't skip anything, the loop has the same behavior.
112–117 ↗	(On Diff #219444)	For both the `II` thing and the `default` case -- do we really expect these to ever fail? I would expect either the VH to be null, or for it to definitively be one of the two intrinsics we added. Maybe switch to `cast_or_null` above with `VN.get()` or some such, and llvm_unreachable on the default case.

Adjust based on comments.

joerg added inline comments.Oct 11 2019, 4:55 AM

lib/Transforms/Scalar/LowerConstantIntrinsics.cpp
105–106 ↗	(On Diff #219444)	It was primarily to get the correct return value, but I'm changing it to push the check to the final return.
112–117 ↗	(On Diff #219444)	Yes, the same concerns as with the earlier version still apply. The recursive simplification can change the instruction type in place or remove it. The logic is still simpler since no new instructions can appear.

FWIW, the adjustments I'm suggesting around tightening the logic can easily be in a follow-up patch if you like. I think generally the code LGTM and I'd just like us to pin down exactly what changes we expect to happen w/ the handles as much as possible to avoid subtle latent bugs creeping in and never getting noticed.

The other two are trivial, feel free to land w/ those fixed.

lib/Transforms/Scalar/LowerConstantIntrinsics.cpp
112–117 ↗	(On Diff #219444)	I'm really surprised that it can change the value handle in this way. I guess because we're using a tracking value handle (is that really necessary?) they may be moved onto the constant, but IMO that'd be more cleanly handled by checking for the value handle being either null or a non-instruction value. If its an instruction, it should really only be one of these two intrinsics or something deeply wrong has happened elsewhere, no? I'm mostly suggesting we assert on that to track down the strange behavior and make sure the overall logic is actually still correct if it comes up rather than potentially hiding a deeper bug.
test/Transforms/LowerConstantIntrinsics/crash-on-large-allocas.ll
1 ↗	(On Diff #224563)	Probable just one `-` is fine?
test/Transforms/LowerConstantIntrinsics/objectsize_basic.ll
1 ↗	(On Diff #224563)	Probably just one `-` is fine?

This revision is now accepted and ready to land.Oct 13 2019, 12:42 AM

Closed by commit rGe4300c392de2: Add a pass to lower is.constant and objectsize intrinsics (authored by joerg). · Explain WhyOct 13 2019, 3:59 PM

This revision was automatically updated to reflect the committed changes.

Sorry, this change broke the build (http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/19218) and I reverted it in r374768.

FYI: https://bugs.llvm.org/show_bug.cgi?id=48472

Herald added subscribers: nikic, pengfei. · View Herald TranscriptDec 21 2020, 1:47 AM

Revision Contents

Path

Size

bindings/

ocaml/

transforms/

scalar_opts/

llvm_scalar_opts.mli

5 lines

scalar_opts_ocaml.c

6 lines

include/

llvm-c/

Transforms/

Scalar.h

3 lines

llvm/

Analysis/

InstructionSimplify.h

11 lines

InitializePasses.h

1 line

LinkAllPasses.h

1 line

Transforms/

Scalar.h

7 lines

Scalar/

LowerIsConstantIntrinsic.h

40 lines

lib/

Analysis/

InstructionSimplify.cpp

39 lines

CodeGen/

TargetPassConfig.cpp

1 line

Passes/

PassBuilder.cpp

1 line

PassRegistry.def

1 line

Transforms/

Scalar/

CMakeLists.txt

1 line

LowerIsConstantIntrinsic.cpp

111 lines

Scalar.cpp

5 lines

test/

CodeGen/

AArch64/

O0-pipeline.ll

1 line

O3-pipeline.ll

1 line

ARM/

O3-pipeline.ll

1 line

X86/

O0-pipeline.ll

1 line

O3-pipeline.ll

1 line

is-constant.ll

50 lines

Transforms/

LowerIsConstant/

is-constant.ll

30 lines

utils/

gn/

secondary/

llvm/

lib/

Transforms/

Scalar/

BUILD.gn

1 line

Diff 212070

bindings/ocaml/transforms/scalar_opts/llvm_scalar_opts.mli

Show First 20 Lines • Show All 180 Lines • ▼ Show 20 Lines	external add_early_cse
: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit		: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit
= "llvm_add_early_cse"		= "llvm_add_early_cse"

(** See the [llvm::createLowerExpectIntrinsicPass] function. *)		(** See the [llvm::createLowerExpectIntrinsicPass] function. *)
external add_lower_expect_intrinsic		external add_lower_expect_intrinsic
: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit		: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit
= "llvm_add_lower_expect_intrinsic"		= "llvm_add_lower_expect_intrinsic"

		(** See the [llvm::createLowerIsConstantIntrinsicPass] function. *)
		external add_lower_is_constant_intrinsic
		: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit
		= "llvm_add_lower_is_constant_intrinsic"

(** See the [llvm::createTypeBasedAliasAnalysisPass] function. *)		(** See the [llvm::createTypeBasedAliasAnalysisPass] function. *)
external add_type_based_alias_analysis		external add_type_based_alias_analysis
: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit		: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit
= "llvm_add_type_based_alias_analysis"		= "llvm_add_type_based_alias_analysis"

(** See the [llvm::createScopedNoAliasAAPass] function. *)		(** See the [llvm::createScopedNoAliasAAPass] function. *)
external add_scoped_no_alias_alias_analysis		external add_scoped_no_alias_alias_analysis
: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit		: [< Llvm.PassManager.any ] Llvm.PassManager.t -> unit
Show All 11 Lines

bindings/ocaml/transforms/scalar_opts/scalar_opts_ocaml.c

	Show First 20 Lines • Show All 226 Lines • ▼ Show 20 Lines

	/* [<Llvm.PassManager.any] Llvm.PassManager.t -> unit */			/* [<Llvm.PassManager.any] Llvm.PassManager.t -> unit */
	CAMLprim value llvm_add_lower_expect_intrinsic(LLVMPassManagerRef PM) {			CAMLprim value llvm_add_lower_expect_intrinsic(LLVMPassManagerRef PM) {
	LLVMAddLowerExpectIntrinsicPass(PM);			LLVMAddLowerExpectIntrinsicPass(PM);
	return Val_unit;			return Val_unit;
	}			}

	/* [<Llvm.PassManager.any] Llvm.PassManager.t -> unit */			/* [<Llvm.PassManager.any] Llvm.PassManager.t -> unit */
				CAMLprim value llvm_add_lower_is_constant_intrinsic(LLVMPassManagerRef PM) {
				LLVMAddLowerIsConstantIntrinsicPass(PM);
				return Val_unit;
				}

				/* [<Llvm.PassManager.any] Llvm.PassManager.t -> unit */
	CAMLprim value llvm_add_type_based_alias_analysis(LLVMPassManagerRef PM) {			CAMLprim value llvm_add_type_based_alias_analysis(LLVMPassManagerRef PM) {
	LLVMAddTypeBasedAliasAnalysisPass(PM);			LLVMAddTypeBasedAliasAnalysisPass(PM);
	return Val_unit;			return Val_unit;
	}			}

	/* [<Llvm.PassManager.any] Llvm.PassManager.t -> unit */			/* [<Llvm.PassManager.any] Llvm.PassManager.t -> unit */
	CAMLprim value llvm_add_scoped_no_alias_aa(LLVMPassManagerRef PM) {			CAMLprim value llvm_add_scoped_no_alias_aa(LLVMPassManagerRef PM) {
	LLVMAddScopedNoAliasAAPass(PM);			LLVMAddScopedNoAliasAAPass(PM);
	Show All 14 Lines

include/llvm-c/Transforms/Scalar.h

	Show First 20 Lines • Show All 138 Lines • ▼ Show 20 Lines
	void LLVMAddEarlyCSEPass(LLVMPassManagerRef PM);			void LLVMAddEarlyCSEPass(LLVMPassManagerRef PM);

	/** See llvm::createEarlyCSEPass function */			/** See llvm::createEarlyCSEPass function */
	void LLVMAddEarlyCSEMemSSAPass(LLVMPassManagerRef PM);			void LLVMAddEarlyCSEMemSSAPass(LLVMPassManagerRef PM);

	/** See llvm::createLowerExpectIntrinsicPass function */			/** See llvm::createLowerExpectIntrinsicPass function */
	void LLVMAddLowerExpectIntrinsicPass(LLVMPassManagerRef PM);			void LLVMAddLowerExpectIntrinsicPass(LLVMPassManagerRef PM);

				/** See llvm::createLowerIsConstantIntrinsicPass function */
				void LLVMAddLowerIsConstantIntrinsicPass(LLVMPassManagerRef PM);

	/** See llvm::createTypeBasedAliasAnalysisPass function */			/** See llvm::createTypeBasedAliasAnalysisPass function */
	void LLVMAddTypeBasedAliasAnalysisPass(LLVMPassManagerRef PM);			void LLVMAddTypeBasedAliasAnalysisPass(LLVMPassManagerRef PM);

	/** See llvm::createScopedNoAliasAAPass function */			/** See llvm::createScopedNoAliasAAPass function */
	void LLVMAddScopedNoAliasAAPass(LLVMPassManagerRef PM);			void LLVMAddScopedNoAliasAAPass(LLVMPassManagerRef PM);

	/** See llvm::createBasicAliasAnalysisPass function */			/** See llvm::createBasicAliasAnalysisPass function */
	void LLVMAddBasicAliasAnalysisPass(LLVMPassManagerRef PM);			void LLVMAddBasicAliasAnalysisPass(LLVMPassManagerRef PM);
	Show All 13 Lines

include/llvm/Analysis/InstructionSimplify.h

Show First 20 Lines • Show All 248 Lines • ▼ Show 20 Lines	Value SimplifyBinOp(unsigned Opcode, Value LHS, Value *RHS,
FastMathFlags FMF, const SimplifyQuery &Q);		FastMathFlags FMF, const SimplifyQuery &Q);

/// Given a callsite, fold the result or return null.		/// Given a callsite, fold the result or return null.
Value SimplifyCall(CallBase Call, const SimplifyQuery &Q);		Value SimplifyCall(CallBase Call, const SimplifyQuery &Q);

/// See if we can compute a simplified version of this instruction. If not,		/// See if we can compute a simplified version of this instruction. If not,
/// return null.		/// return null.
Value SimplifyInstruction(Instruction I, const SimplifyQuery &Q,		Value SimplifyInstruction(Instruction I, const SimplifyQuery &Q,
OptimizationRemarkEmitter *ORE = nullptr);		OptimizationRemarkEmitter *ORE = nullptr,
		bool SimplifyCFG = false);
		echristoUnsubmitted Done Reply Inline Actions Really would like to avoid boolean arguments like this. echristo: Really would like to avoid boolean arguments like this.
		joergAuthorUnsubmitted Done Reply Inline Actions Right, this is the one big part of the change I am absolutely not sure about. I was considering passing a pointer to a vector down to replaceAndRecursivelySimplify. The contract would be to return the work list of all modified uses. The caller can then iterate the list and see if any branch instructions are affected. joerg: Right, this is the one big part of the change I am absolutely not sure about. I was considering…

/// Replace all uses of 'I' with 'SimpleV' and simplify the uses recursively.		/// Replace all uses of 'I' with 'SimpleV' and simplify the uses recursively.
///		///
/// This first performs a normal RAUW of I with SimpleV. It then recursively		/// This first performs a normal RAUW of I with SimpleV. It then recursively
/// attempts to simplify those users updated by the operation. The 'I'		/// attempts to simplify those users updated by the operation. The 'I'
/// instruction must not be equal to the simplified value 'SimpleV'.		/// instruction must not be equal to the simplified value 'SimpleV'.
		/// If SimplifyCFG is set, it will also adjust no-longer conditional branches.
///		///
		chandlercUnsubmitted Done Reply Inline Actions I'd call this `UnsimplifiedUsers`. chandlerc: I'd call this `UnsimplifiedUsers`.
/// The function returns true if any simplifications were performed.		/// The function returns true if any simplifications were performed.
bool replaceAndRecursivelySimplify(Instruction I, Value SimpleV,		bool replaceAndRecursivelySimplify(Instruction I, Value SimpleV,
const TargetLibraryInfo *TLI = nullptr,		const TargetLibraryInfo *TLI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
AssumptionCache *AC = nullptr);		AssumptionCache *AC = nullptr,
		bool SimplifyCFG = false);

/// Recursively attempt to simplify an instruction.		/// Recursively attempt to simplify an instruction.
///		///
/// This routine uses SimplifyInstruction to simplify 'I', and if successful		/// This routine uses SimplifyInstruction to simplify 'I', and if successful
/// replaces uses of 'I' with the simplified value. It then recurses on each		/// replaces uses of 'I' with the simplified value. It then recurses on each
/// of the users impacted. It returns true if any simplifications were		/// of the users impacted. It returns true if any simplifications were
/// performed.		/// performed.
		/// If SimplifyCFG is set, it will also adjust no-longer conditional branches.
bool recursivelySimplifyInstruction(Instruction *I,		bool recursivelySimplifyInstruction(Instruction *I,
const TargetLibraryInfo *TLI = nullptr,		const TargetLibraryInfo *TLI = nullptr,
const DominatorTree *DT = nullptr,		const DominatorTree *DT = nullptr,
AssumptionCache *AC = nullptr);		AssumptionCache *AC = nullptr,
		bool SimplifyCFG = false);

// These helper functions return a SimplifyQuery structure that contains as		// These helper functions return a SimplifyQuery structure that contains as
// many of the optional analysis we use as are currently valid. This is the		// many of the optional analysis we use as are currently valid. This is the
// strongly preferred way of constructing SimplifyQuery in passes.		// strongly preferred way of constructing SimplifyQuery in passes.
const SimplifyQuery getBestSimplifyQuery(Pass &, Function &);		const SimplifyQuery getBestSimplifyQuery(Pass &, Function &);
template <class T, class... TArgs>		template <class T, class... TArgs>
const SimplifyQuery getBestSimplifyQuery(AnalysisManager<T, TArgs...> &,		const SimplifyQuery getBestSimplifyQuery(AnalysisManager<T, TArgs...> &,
Function &);		Function &);
const SimplifyQuery getBestSimplifyQuery(LoopStandardAnalysisResults &,		const SimplifyQuery getBestSimplifyQuery(LoopStandardAnalysisResults &,
const DataLayout &);		const DataLayout &);
} // end namespace llvm		} // end namespace llvm

#endif		#endif

include/llvm/InitializePasses.h

	Show First 20 Lines • Show All 238 Lines • ▼ Show 20 Lines
	void initializeLoopUnswitchPass(PassRegistry&);			void initializeLoopUnswitchPass(PassRegistry&);
	void initializeLoopVectorizePass(PassRegistry&);			void initializeLoopVectorizePass(PassRegistry&);
	void initializeLoopVersioningLICMPass(PassRegistry&);			void initializeLoopVersioningLICMPass(PassRegistry&);
	void initializeLoopVersioningPassPass(PassRegistry&);			void initializeLoopVersioningPassPass(PassRegistry&);
	void initializeLowerAtomicLegacyPassPass(PassRegistry&);			void initializeLowerAtomicLegacyPassPass(PassRegistry&);
	void initializeLowerEmuTLSPass(PassRegistry&);			void initializeLowerEmuTLSPass(PassRegistry&);
	void initializeLowerExpectIntrinsicPass(PassRegistry&);			void initializeLowerExpectIntrinsicPass(PassRegistry&);
	void initializeLowerGuardIntrinsicLegacyPassPass(PassRegistry&);			void initializeLowerGuardIntrinsicLegacyPassPass(PassRegistry&);
				void initializeLowerIsConstantIntrinsicPass(PassRegistry&);
	void initializeLowerWidenableConditionLegacyPassPass(PassRegistry&);			void initializeLowerWidenableConditionLegacyPassPass(PassRegistry&);
	void initializeLowerIntrinsicsPass(PassRegistry&);			void initializeLowerIntrinsicsPass(PassRegistry&);
	void initializeLowerInvokeLegacyPassPass(PassRegistry&);			void initializeLowerInvokeLegacyPassPass(PassRegistry&);
	void initializeLowerSwitchPass(PassRegistry&);			void initializeLowerSwitchPass(PassRegistry&);
	void initializeLowerTypeTestsPass(PassRegistry&);			void initializeLowerTypeTestsPass(PassRegistry&);
	void initializeMIRCanonicalizerPass(PassRegistry &);			void initializeMIRCanonicalizerPass(PassRegistry &);
	void initializeMIRPrintingPassPass(PassRegistry&);			void initializeMIRPrintingPassPass(PassRegistry&);
	void initializeMachineBlockFrequencyInfoPass(PassRegistry&);			void initializeMachineBlockFrequencyInfoPass(PassRegistry&);
	▲ Show 20 Lines • Show All 168 Lines • Show Last 20 Lines

include/llvm/LinkAllPasses.h

Show First 20 Lines • Show All 135 Lines • ▼ Show 20 Lines	ForcePassLinking() {
(void) llvm::createLoopRerollPass();		(void) llvm::createLoopRerollPass();
(void) llvm::createLoopUnrollPass();		(void) llvm::createLoopUnrollPass();
(void) llvm::createLoopUnrollAndJamPass();		(void) llvm::createLoopUnrollAndJamPass();
(void) llvm::createLoopUnswitchPass();		(void) llvm::createLoopUnswitchPass();
(void) llvm::createLoopVersioningLICMPass();		(void) llvm::createLoopVersioningLICMPass();
(void) llvm::createLoopIdiomPass();		(void) llvm::createLoopIdiomPass();
(void) llvm::createLoopRotatePass();		(void) llvm::createLoopRotatePass();
(void) llvm::createLowerExpectIntrinsicPass();		(void) llvm::createLowerExpectIntrinsicPass();
		(void) llvm::createLowerIsConstantIntrinsicPass();
(void) llvm::createLowerInvokePass();		(void) llvm::createLowerInvokePass();
(void) llvm::createLowerSwitchPass();		(void) llvm::createLowerSwitchPass();
(void) llvm::createNaryReassociatePass();		(void) llvm::createNaryReassociatePass();
(void) llvm::createObjCARCAAWrapperPass();		(void) llvm::createObjCARCAAWrapperPass();
(void) llvm::createObjCARCAPElimPass();		(void) llvm::createObjCARCAPElimPass();
(void) llvm::createObjCARCExpandPass();		(void) llvm::createObjCARCExpandPass();
(void) llvm::createObjCARCContractPass();		(void) llvm::createObjCARCContractPass();
(void) llvm::createObjCARCOptPass();		(void) llvm::createObjCARCOptPass();
▲ Show 20 Lines • Show All 93 Lines • Show Last 20 Lines

include/llvm/Transforms/Scalar.h

	Show First 20 Lines • Show All 391 Lines • ▼ Show 20 Lines
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// LowerExpectIntrinsics - Removes llvm.expect intrinsics and creates			// LowerExpectIntrinsics - Removes llvm.expect intrinsics and creates
	// "block_weights" metadata.			// "block_weights" metadata.
	FunctionPass *createLowerExpectIntrinsicPass();			FunctionPass *createLowerExpectIntrinsicPass();

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
				// LowerIsConstantIntrinsics - Converts llvm.is.constant intrinsics into 'true'
				// or 'false'.
				//
				FunctionPass *createLowerIsConstantIntrinsicPass();

				//===----------------------------------------------------------------------===//
				//
	// PartiallyInlineLibCalls - Tries to inline the fast path of library			// PartiallyInlineLibCalls - Tries to inline the fast path of library
	// calls such as sqrt.			// calls such as sqrt.
	//			//
	FunctionPass *createPartiallyInlineLibCallsPass();			FunctionPass *createPartiallyInlineLibCallsPass();

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// SeparateConstOffsetFromGEP - Split GEPs for better CSE			// SeparateConstOffsetFromGEP - Split GEPs for better CSE
	▲ Show 20 Lines • Show All 107 Lines • Show Last 20 Lines

include/llvm/Transforms/Scalar/LowerIsConstantIntrinsic.h

				//===- LowerIsConstantIntrinsic.h - Lower is.constant int. pass -- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				/// \file
				///
				/// The header file for the LowerIsConstantIntrinsic pass as used by the new pass
				/// manager.
				///
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_TRANSFORMS_SCALAR_LOWERISCONSTANTINTRINSIC_H
				#define LLVM_TRANSFORMS_SCALAR_LOWERISCONSTANTINTRINSIC_H

				#include "llvm/IR/Function.h"
				#include "llvm/IR/PassManager.h"

				namespace llvm {

				struct LowerIsConstantIntrinsicPass :
				PassInfoMixin<LowerIsConstantIntrinsicPass> {
				public:
				explicit LowerIsConstantIntrinsicPass() {}

				/// Run the pass over the function.
				///
				/// This will lower all remaining 'is.constant'` intrinsic calls in this
				/// function into 'true' or 'false', propagate the constants and just
				/// conditional branches.
				/// The pass complements the normal Instruction Simplification for folding
				/// 'is.constant' to true in the optimized pass chain.
				PreservedAnalyses run(Function &F, FunctionAnalysisManager &);
				};

				}

				#endif

lib/Analysis/InstructionSimplify.cpp

Show First 20 Lines • Show All 5,038 Lines • ▼ Show 20 Lines	for (auto &Arg : Call->args()) {
if (!C)		if (!C)
return nullptr;		return nullptr;
ConstantArgs.push_back(C);		ConstantArgs.push_back(C);
}		}

return ConstantFoldCall(Call, F, ConstantArgs, Q.TLI);		return ConstantFoldCall(Call, F, ConstantArgs, Q.TLI);
}		}

		static Value simplifyBranch(BranchInst BI, bool SimplifyCFG) {
		echristoUnsubmitted Done Reply Inline Actions This seems to return nullptr everywhere? echristo: This seems to return nullptr everywhere?
		joergAuthorUnsubmitted Done Reply Inline Actions It does, trying to preserve the style of the other functions here. The question might become irrelevant when going with the work list idea. joerg: It does, trying to preserve the style of the other functions here. The question might become…
		if (!SimplifyCFG \|\| BI->isUnconditional())
		return nullptr;
		if (match(BI->getOperand(0), m_Zero())) {
		BranchInst::Create(BI->getSuccessor(1), BI->getParent());
		BI->eraseFromParent();
		return nullptr;
		}
		if (match(BI->getOperand(0), m_One())) {
		BranchInst::Create(BI->getSuccessor(0), BI->getParent());
		BI->eraseFromParent();
		return nullptr;
		}
		return nullptr;
		}

/// See if we can compute a simplified version of this instruction.		/// See if we can compute a simplified version of this instruction.
/// If not, this returns null.		/// If not, this returns null.

Value llvm::SimplifyInstruction(Instruction I, const SimplifyQuery &SQ,		Value llvm::SimplifyInstruction(Instruction I, const SimplifyQuery &SQ,
OptimizationRemarkEmitter *ORE) {		OptimizationRemarkEmitter *ORE,
		bool SimplifyCFG) {
const SimplifyQuery Q = SQ.CxtI ? SQ : SQ.getWithInstruction(I);		const SimplifyQuery Q = SQ.CxtI ? SQ : SQ.getWithInstruction(I);
Value *Result;		Value *Result;

switch (I->getOpcode()) {		switch (I->getOpcode()) {
default:		default:
Result = ConstantFoldInstruction(I, Q.DL, Q.TLI);		Result = ConstantFoldInstruction(I, Q.DL, Q.TLI);
break;		break;
		case Instruction::Br:
		Result = simplifyBranch(dyn_cast<BranchInst>(I), SimplifyCFG);
		break;
case Instruction::FNeg:		case Instruction::FNeg:
Result = SimplifyFNegInst(I->getOperand(0), I->getFastMathFlags(), Q);		Result = SimplifyFNegInst(I->getOperand(0), I->getFastMathFlags(), Q);
break;		break;
case Instruction::FAdd:		case Instruction::FAdd:
Result = SimplifyFAddInst(I->getOperand(0), I->getOperand(1),		Result = SimplifyFAddInst(I->getOperand(0), I->getOperand(1),
I->getFastMathFlags(), Q);		I->getFastMathFlags(), Q);
break;		break;
case Instruction::Add:		case Instruction::Add:
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines

/// Implementation of recursive simplification through an instruction's		/// Implementation of recursive simplification through an instruction's
/// uses.		/// uses.
///		///
/// This is the common implementation of the recursive simplification routines.		/// This is the common implementation of the recursive simplification routines.
/// If we have a pre-simplified value in 'SimpleV', that is forcibly used to		/// If we have a pre-simplified value in 'SimpleV', that is forcibly used to
/// replace the instruction 'I'. Otherwise, we simply add 'I' to the list of		/// replace the instruction 'I'. Otherwise, we simply add 'I' to the list of
/// instructions to process and attempt to simplify it using		/// instructions to process and attempt to simplify it using
/// InstructionSimplify.		/// InstructionSimplify.
///		///
		chandlercUnsubmitted Not Done Reply Inline Actions I'd make this a bit more precise... "Recursively visited users which do not themselves simplify are ..." chandlerc: I'd make this a bit more precise... "Recursively visited users which do not themselves…
/// This routine returns 'true' only when it simplifies something. The passed		/// This routine returns 'true' only when it simplifies something. The passed
/// in simplified value does not count toward this.		/// in simplified value does not count toward this.
static bool replaceAndRecursivelySimplifyImpl(Instruction I, Value SimpleV,		static bool replaceAndRecursivelySimplifyImpl(Instruction I, Value SimpleV,
const TargetLibraryInfo *TLI,		const TargetLibraryInfo *TLI,
const DominatorTree *DT,		const DominatorTree *DT,
AssumptionCache *AC) {		AssumptionCache *AC,
		bool SimplifyCFG) {
bool Simplified = false;		bool Simplified = false;
		chandlercUnsubmitted Done Reply Inline Actions If this is what `clang-format` did with this... I am sad. If not, clang-format? chandlerc: If this is what `clang-format` did with this... I am sad. If not, clang-format?
SmallSetVector<Instruction *, 8> Worklist;		SmallSetVector<Instruction *, 8> Worklist;
const DataLayout &DL = I->getModule()->getDataLayout();		const DataLayout &DL = I->getModule()->getDataLayout();

// If we have an explicit value to collapse to, do that round of the		// If we have an explicit value to collapse to, do that round of the
// simplification loop by hand initially.		// simplification loop by hand initially.
if (SimpleV) {		if (SimpleV) {
for (User *U : I->users())		for (User *U : I->users())
if (U != I)		if (U != I)
Show All 11 Lines	if (SimpleV) {
Worklist.insert(I);		Worklist.insert(I);
}		}

// Note that we must test the size on each iteration, the worklist can grow.		// Note that we must test the size on each iteration, the worklist can grow.
for (unsigned Idx = 0; Idx != Worklist.size(); ++Idx) {		for (unsigned Idx = 0; Idx != Worklist.size(); ++Idx) {
I = Worklist[Idx];		I = Worklist[Idx];

// See if this instruction simplifies.		// See if this instruction simplifies.
SimpleV = SimplifyInstruction(I, {DL, TLI, DT, AC});		SimpleV = SimplifyInstruction(I, {DL, TLI, DT, AC}, nullptr, SimplifyCFG);
if (!SimpleV)		if (!SimpleV)
continue;		continue;

Simplified = true;		Simplified = true;

// Stash away all the uses of the old instruction so we can check them for		// Stash away all the uses of the old instruction so we can check them for
// recursive simplifications after a RAUW. This is cheaper than checking all		// recursive simplifications after a RAUW. This is cheaper than checking all
// uses of To on the recursive step in most cases.		// uses of To on the recursive step in most cases.
Show All 10 Lines	if (I->getParent() && !I->isEHPad() && !I->isTerminator() &&
I->eraseFromParent();		I->eraseFromParent();
}		}
return Simplified;		return Simplified;
}		}

bool llvm::recursivelySimplifyInstruction(Instruction *I,		bool llvm::recursivelySimplifyInstruction(Instruction *I,
const TargetLibraryInfo *TLI,		const TargetLibraryInfo *TLI,
const DominatorTree *DT,		const DominatorTree *DT,
AssumptionCache *AC) {		AssumptionCache *AC,
return replaceAndRecursivelySimplifyImpl(I, nullptr, TLI, DT, AC);		bool SimplifyCFG) {
		return replaceAndRecursivelySimplifyImpl(I, nullptr, TLI, DT, AC,
		SimplifyCFG);
}		}

bool llvm::replaceAndRecursivelySimplify(Instruction I, Value SimpleV,		bool llvm::replaceAndRecursivelySimplify(Instruction I, Value SimpleV,
const TargetLibraryInfo *TLI,		const TargetLibraryInfo *TLI,
const DominatorTree *DT,		const DominatorTree *DT,
AssumptionCache *AC) {		AssumptionCache *AC,
		bool SimplifyCFG) {
assert(I != SimpleV && "replaceAndRecursivelySimplify(X,X) is not valid!");		assert(I != SimpleV && "replaceAndRecursivelySimplify(X,X) is not valid!");
assert(SimpleV && "Must provide a simplified value.");		assert(SimpleV && "Must provide a simplified value.");
return replaceAndRecursivelySimplifyImpl(I, SimpleV, TLI, DT, AC);		return replaceAndRecursivelySimplifyImpl(I, SimpleV, TLI, DT, AC,
		SimplifyCFG);
}		}

namespace llvm {		namespace llvm {
const SimplifyQuery getBestSimplifyQuery(Pass &P, Function &F) {		const SimplifyQuery getBestSimplifyQuery(Pass &P, Function &F) {
auto *DTWP = P.getAnalysisIfAvailable<DominatorTreeWrapperPass>();		auto *DTWP = P.getAnalysisIfAvailable<DominatorTreeWrapperPass>();
auto *DT = DTWP ? &DTWP->getDomTree() : nullptr;		auto *DT = DTWP ? &DTWP->getDomTree() : nullptr;
auto *TLIWP = P.getAnalysisIfAvailable<TargetLibraryInfoWrapperPass>();		auto *TLIWP = P.getAnalysisIfAvailable<TargetLibraryInfoWrapperPass>();
auto *TLI = TLIWP ? &TLIWP->getTLI() : nullptr;		auto *TLI = TLIWP ? &TLIWP->getTLI() : nullptr;
Show All 21 Lines

lib/CodeGen/TargetPassConfig.cpp

Show First 20 Lines • Show All 648 Lines • ▼ Show 20 Lines	if (!DisableMergeICmps)
addPass(createMergeICmpsLegacyPass());		addPass(createMergeICmpsLegacyPass());
addPass(createExpandMemCmpPass());		addPass(createExpandMemCmpPass());
}		}

// Run GC lowering passes for builtin collectors		// Run GC lowering passes for builtin collectors
// TODO: add a pass insertion point here		// TODO: add a pass insertion point here
addPass(createGCLoweringPass());		addPass(createGCLoweringPass());
addPass(createShadowStackGCLoweringPass());		addPass(createShadowStackGCLoweringPass());
		addPass(createLowerIsConstantIntrinsicPass());

// Make sure that no unreachable blocks are instruction selected.		// Make sure that no unreachable blocks are instruction selected.
addPass(createUnreachableBlockEliminationPass());		addPass(createUnreachableBlockEliminationPass());

// Prepare expensive constants for SelectionDAG.		// Prepare expensive constants for SelectionDAG.
if (getOptLevel() != CodeGenOpt::None && !DisableConstantHoisting)		if (getOptLevel() != CodeGenOpt::None && !DisableConstantHoisting)
addPass(createConstantHoistingPass());		addPass(createConstantHoistingPass());

▲ Show 20 Lines • Show All 571 Lines • Show Last 20 Lines

lib/Passes/PassBuilder.cpp

	Show First 20 Lines • Show All 135 Lines • ▼ Show 20 Lines
	#include "llvm/Transforms/Scalar/LoopSimplifyCFG.h"			#include "llvm/Transforms/Scalar/LoopSimplifyCFG.h"
	#include "llvm/Transforms/Scalar/LoopSink.h"			#include "llvm/Transforms/Scalar/LoopSink.h"
	#include "llvm/Transforms/Scalar/LoopStrengthReduce.h"			#include "llvm/Transforms/Scalar/LoopStrengthReduce.h"
	#include "llvm/Transforms/Scalar/LoopUnrollAndJamPass.h"			#include "llvm/Transforms/Scalar/LoopUnrollAndJamPass.h"
	#include "llvm/Transforms/Scalar/LoopUnrollPass.h"			#include "llvm/Transforms/Scalar/LoopUnrollPass.h"
	#include "llvm/Transforms/Scalar/LowerAtomic.h"			#include "llvm/Transforms/Scalar/LowerAtomic.h"
	#include "llvm/Transforms/Scalar/LowerExpectIntrinsic.h"			#include "llvm/Transforms/Scalar/LowerExpectIntrinsic.h"
	#include "llvm/Transforms/Scalar/LowerGuardIntrinsic.h"			#include "llvm/Transforms/Scalar/LowerGuardIntrinsic.h"
				#include "llvm/Transforms/Scalar/LowerIsConstantIntrinsic.h"
	#include "llvm/Transforms/Scalar/LowerWidenableCondition.h"			#include "llvm/Transforms/Scalar/LowerWidenableCondition.h"
	#include "llvm/Transforms/Scalar/MakeGuardsExplicit.h"			#include "llvm/Transforms/Scalar/MakeGuardsExplicit.h"
	#include "llvm/Transforms/Scalar/MemCpyOptimizer.h"			#include "llvm/Transforms/Scalar/MemCpyOptimizer.h"
	#include "llvm/Transforms/Scalar/MergeICmps.h"			#include "llvm/Transforms/Scalar/MergeICmps.h"
	#include "llvm/Transforms/Scalar/MergedLoadStoreMotion.h"			#include "llvm/Transforms/Scalar/MergedLoadStoreMotion.h"
	#include "llvm/Transforms/Scalar/NaryReassociate.h"			#include "llvm/Transforms/Scalar/NaryReassociate.h"
	#include "llvm/Transforms/Scalar/NewGVN.h"			#include "llvm/Transforms/Scalar/NewGVN.h"
	#include "llvm/Transforms/Scalar/PartiallyInlineLibCalls.h"			#include "llvm/Transforms/Scalar/PartiallyInlineLibCalls.h"
	▲ Show 20 Lines • Show All 2,150 Lines • Show Last 20 Lines

lib/Passes/PassRegistry.def

	Show First 20 Lines • Show All 180 Lines • ▼ Show 20 Lines
	FUNCTION_PASS("instsimplify", InstSimplifyPass())			FUNCTION_PASS("instsimplify", InstSimplifyPass())
	FUNCTION_PASS("invalidate<all>", InvalidateAllAnalysesPass())			FUNCTION_PASS("invalidate<all>", InvalidateAllAnalysesPass())
	FUNCTION_PASS("float2int", Float2IntPass())			FUNCTION_PASS("float2int", Float2IntPass())
	FUNCTION_PASS("no-op-function", NoOpFunctionPass())			FUNCTION_PASS("no-op-function", NoOpFunctionPass())
	FUNCTION_PASS("libcalls-shrinkwrap", LibCallsShrinkWrapPass())			FUNCTION_PASS("libcalls-shrinkwrap", LibCallsShrinkWrapPass())
	FUNCTION_PASS("loweratomic", LowerAtomicPass())			FUNCTION_PASS("loweratomic", LowerAtomicPass())
	FUNCTION_PASS("lower-expect", LowerExpectIntrinsicPass())			FUNCTION_PASS("lower-expect", LowerExpectIntrinsicPass())
	FUNCTION_PASS("lower-guard-intrinsic", LowerGuardIntrinsicPass())			FUNCTION_PASS("lower-guard-intrinsic", LowerGuardIntrinsicPass())
				FUNCTION_PASS("lower-is-constant", LowerIsConstantIntrinsicPass())
				chandlercUnsubmitted Done Reply Inline Actions Maybe `lower-constant-intrinsics` as a name? (Since it handles `objectsize` as well. chandlerc: Maybe `lower-constant-intrinsics` as a name? (Since it handles `objectsize` as well.
	FUNCTION_PASS("lower-widenable-condition", LowerWidenableConditionPass())			FUNCTION_PASS("lower-widenable-condition", LowerWidenableConditionPass())
	FUNCTION_PASS("guard-widening", GuardWideningPass())			FUNCTION_PASS("guard-widening", GuardWideningPass())
	FUNCTION_PASS("gvn", GVN())			FUNCTION_PASS("gvn", GVN())
	FUNCTION_PASS("load-store-vectorizer", LoadStoreVectorizerPass())			FUNCTION_PASS("load-store-vectorizer", LoadStoreVectorizerPass())
	FUNCTION_PASS("loop-simplify", LoopSimplifyPass())			FUNCTION_PASS("loop-simplify", LoopSimplifyPass())
	FUNCTION_PASS("loop-sink", LoopSinkPass())			FUNCTION_PASS("loop-sink", LoopSinkPass())
	FUNCTION_PASS("lowerinvoke", LowerInvokePass())			FUNCTION_PASS("lowerinvoke", LowerInvokePass())
	FUNCTION_PASS("mem2reg", PromotePass())			FUNCTION_PASS("mem2reg", PromotePass())
	▲ Show 20 Lines • Show All 121 Lines • Show Last 20 Lines

lib/Transforms/Scalar/CMakeLists.txt

Show All 40 Lines	add_llvm_library(LLVMScalarOpts
LoopStrengthReduce.cpp		LoopStrengthReduce.cpp
LoopUnrollPass.cpp		LoopUnrollPass.cpp
LoopUnrollAndJamPass.cpp		LoopUnrollAndJamPass.cpp
LoopUnswitch.cpp		LoopUnswitch.cpp
LoopVersioningLICM.cpp		LoopVersioningLICM.cpp
LowerAtomic.cpp		LowerAtomic.cpp
LowerExpectIntrinsic.cpp		LowerExpectIntrinsic.cpp
LowerGuardIntrinsic.cpp		LowerGuardIntrinsic.cpp
		LowerIsConstantIntrinsic.cpp
LowerWidenableCondition.cpp		LowerWidenableCondition.cpp
MakeGuardsExplicit.cpp		MakeGuardsExplicit.cpp
MemCpyOptimizer.cpp		MemCpyOptimizer.cpp
MergeICmps.cpp		MergeICmps.cpp
MergedLoadStoreMotion.cpp		MergedLoadStoreMotion.cpp
NaryReassociate.cpp		NaryReassociate.cpp
NewGVN.cpp		NewGVN.cpp
PartiallyInlineLibCalls.cpp		PartiallyInlineLibCalls.cpp
Show All 26 Lines

lib/Transforms/Scalar/LowerIsConstantIntrinsic.cpp

				//===- LowerIsConstantIntrinsic.cpp - Lower is.constant intrinsic ---------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This pass lowers the 'is.constant' intrinsic to 'true' or 'false'.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Transforms/Scalar/LowerIsConstantIntrinsic.h"
				#include "llvm/ADT/Statistic.h"
				#include "llvm/Analysis/InstructionSimplify.h"
				#include "llvm/Analysis/TargetLibraryInfo.h"
				#include "llvm/IR/BasicBlock.h"
				#include "llvm/IR/Constants.h"
				#include "llvm/IR/Function.h"
				#include "llvm/IR/Instructions.h"
				#include "llvm/IR/IntrinsicInst.h"
				#include "llvm/IR/Intrinsics.h"
				#include "llvm/Pass.h"
				#include "llvm/Support/Debug.h"
				#include "llvm/Transforms/Scalar.h"
				#include "llvm/Transforms/Utils/Local.h"

				using namespace llvm;

				#define DEBUG_TYPE "lower-is-constant-intrinsic"

				STATISTIC(IsConstantIntrinsicsHandled,
				"Number of 'is.constant' intrinsic instructions handled");

				static bool handleIsConstantIntrinsic(IntrinsicInst *II) {
				Value *Op = II->getOperand(0);
				Value *Exp = nullptr;

				if (isa<Constant>(Op))
				Exp = ConstantInt::getTrue(II->getType());
				else
				Exp = ConstantInt::getFalse(II->getType());

				replaceAndRecursivelySimplify(II, Exp, nullptr, nullptr, nullptr, true);
				return true;
				}

				static bool lowerIsConstantIntrinsic(Function &F,
				const TargetLibraryInfo *TLI) {
				bool NewChange, Changed = false;

				arsenmUnsubmitted Not Done Reply Inline Actions Should early exit if there are is no is_constant declaration, or just visit uses of it rather than search the whole function for a rarely used intrinsic? arsenm: Should early exit if there are is no is_constant declaration, or just visit uses of it rather…
				joergAuthorUnsubmitted Not Done Reply Inline Actions As mentioned on IRC, llvm.is.constant is a family of intrinsics depending on the type of the argument. There doesn't seem to be an easy way to check if a given template-like intrinsic is used. joerg: As mentioned on IRC, llvm.is.constant is a family of intrinsics depending on the type of the…
				do {
				NewChange = false;
				for (BasicBlock &BB : F) {
				for (auto &I : BB) {
				if (IntrinsicInst *II = dyn_cast<IntrinsicInst>(&I)) {
				if (II->getIntrinsicID() != Intrinsic::is_constant)
				continue;
				if (handleIsConstantIntrinsic(II)) {
				IsConstantIntrinsicsHandled++;
				Changed = true;
				NewChange = true;
				break;
				}
				}
				}
				if (NewChange)
				break;
				}
				} while (NewChange);
				return Changed;
				}

				PreservedAnalyses LowerIsConstantIntrinsicPass::run(
				Function &F, FunctionAnalysisManager &AM) {
				if (lowerIsConstantIntrinsic(F, AM.getCachedResult<TargetLibraryAnalysis>(F)))
				return PreservedAnalyses::none();

				return PreservedAnalyses::all();
				}


				namespace {
				/// Legacy pass for lowering is.constant intrinsics out of the IR.
				///
				/// When this pass is run over a function it converts is.constant intrinsics
				/// into 'true' or 'false'. This is completements the normal constand folding
				/// to 'true' as part of Instruction Simplify passes.
				class LowerIsConstantIntrinsic : public FunctionPass {
				public:
				static char ID;
				LowerIsConstantIntrinsic()
				: FunctionPass(ID) {
				initializeLowerIsConstantIntrinsicPass(*PassRegistry::getPassRegistry());
				}

				bool runOnFunction(Function &F) override {
				auto *TLIP = getAnalysisIfAvailable<TargetLibraryInfoWrapperPass>();
				const TargetLibraryInfo *TLI = TLIP ? &TLIP->getTLI() : nullptr;
				return lowerIsConstantIntrinsic(F, TLI);
				}
				};
				}

				char LowerIsConstantIntrinsic::ID = 0;
				INITIALIZE_PASS(LowerIsConstantIntrinsic, "lower-is-constant",
				"Lower 'is.constant' Intrinsics", false, false)

				FunctionPass *llvm::createLowerIsConstantIntrinsicPass() {
				return new LowerIsConstantIntrinsic();
				}

lib/Transforms/Scalar/Scalar.cpp

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	void llvm::initializeScalarOpts(PassRegistry &Registry) {
initializeLoopUnrollAndJamPass(Registry);		initializeLoopUnrollAndJamPass(Registry);
initializeLoopUnswitchPass(Registry);		initializeLoopUnswitchPass(Registry);
initializeWarnMissedTransformationsLegacyPass(Registry);		initializeWarnMissedTransformationsLegacyPass(Registry);
initializeLoopVersioningLICMPass(Registry);		initializeLoopVersioningLICMPass(Registry);
initializeLoopIdiomRecognizeLegacyPassPass(Registry);		initializeLoopIdiomRecognizeLegacyPassPass(Registry);
initializeLowerAtomicLegacyPassPass(Registry);		initializeLowerAtomicLegacyPassPass(Registry);
initializeLowerExpectIntrinsicPass(Registry);		initializeLowerExpectIntrinsicPass(Registry);
initializeLowerGuardIntrinsicLegacyPassPass(Registry);		initializeLowerGuardIntrinsicLegacyPassPass(Registry);
		initializeLowerIsConstantIntrinsicPass(Registry);
initializeLowerWidenableConditionLegacyPassPass(Registry);		initializeLowerWidenableConditionLegacyPassPass(Registry);
initializeMemCpyOptLegacyPassPass(Registry);		initializeMemCpyOptLegacyPassPass(Registry);
initializeMergeICmpsLegacyPassPass(Registry);		initializeMergeICmpsLegacyPassPass(Registry);
initializeMergedLoadStoreMotionLegacyPassPass(Registry);		initializeMergedLoadStoreMotionLegacyPassPass(Registry);
initializeNaryReassociateLegacyPassPass(Registry);		initializeNaryReassociateLegacyPassPass(Registry);
initializePartiallyInlineLibCallsLegacyPassPass(Registry);		initializePartiallyInlineLibCallsLegacyPassPass(Registry);
initializeReassociateLegacyPassPass(Registry);		initializeReassociateLegacyPassPass(Registry);
initializeRegToMemPass(Registry);		initializeRegToMemPass(Registry);
▲ Show 20 Lines • Show All 187 Lines • ▼ Show 20 Lines
void LLVMAddBasicAliasAnalysisPass(LLVMPassManagerRef PM) {		void LLVMAddBasicAliasAnalysisPass(LLVMPassManagerRef PM) {
unwrap(PM)->add(createBasicAAWrapperPass());		unwrap(PM)->add(createBasicAAWrapperPass());
}		}

void LLVMAddLowerExpectIntrinsicPass(LLVMPassManagerRef PM) {		void LLVMAddLowerExpectIntrinsicPass(LLVMPassManagerRef PM) {
unwrap(PM)->add(createLowerExpectIntrinsicPass());		unwrap(PM)->add(createLowerExpectIntrinsicPass());
}		}

		void LLVMAddLowerIsConstantIntrinsicPass(LLVMPassManagerRef PM) {
		unwrap(PM)->add(createLowerIsConstantIntrinsicPass());
		}

void LLVMAddUnifyFunctionExitNodesPass(LLVMPassManagerRef PM) {		void LLVMAddUnifyFunctionExitNodesPass(LLVMPassManagerRef PM) {
unwrap(PM)->add(createUnifyFunctionExitNodesPass());		unwrap(PM)->add(createUnifyFunctionExitNodesPass());
}		}

test/CodeGen/AArch64/O0-pipeline.ll

	Show All 15 Lines
	; CHECK-NEXT: Pre-ISel Intrinsic Lowering			; CHECK-NEXT: Pre-ISel Intrinsic Lowering
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
	; CHECK-NEXT: Expand Atomic instructions			; CHECK-NEXT: Expand Atomic instructions
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)			; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
	; CHECK-NEXT: Module Verifier			; CHECK-NEXT: Module Verifier
	; CHECK-NEXT: Lower Garbage Collection Instructions			; CHECK-NEXT: Lower Garbage Collection Instructions
	; CHECK-NEXT: Shadow Stack GC Lowering			; CHECK-NEXT: Shadow Stack GC Lowering
				; CHECK-NEXT: Lower 'is.constant' Intrinsics
	; CHECK-NEXT: Remove unreachable blocks from the CFG			; CHECK-NEXT: Remove unreachable blocks from the CFG
	; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)			; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)
	; CHECK-NEXT: Scalarize Masked Memory Intrinsics			; CHECK-NEXT: Scalarize Masked Memory Intrinsics
	; CHECK-NEXT: Expand reduction intrinsics			; CHECK-NEXT: Expand reduction intrinsics
	; CHECK-NEXT: AArch64 Stack Tagging			; CHECK-NEXT: AArch64 Stack Tagging
	; CHECK-NEXT: Rewrite Symbols			; CHECK-NEXT: Rewrite Symbols
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

test/CodeGen/AArch64/O3-pipeline.ll

	Show All 32 Lines
	; CHECK-NEXT: Induction Variable Users			; CHECK-NEXT: Induction Variable Users
	; CHECK-NEXT: Loop Strength Reduction			; CHECK-NEXT: Loop Strength Reduction
	; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)			; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
	; CHECK-NEXT: Function Alias Analysis Results			; CHECK-NEXT: Function Alias Analysis Results
	; CHECK-NEXT: Merge contiguous icmps into a memcmp			; CHECK-NEXT: Merge contiguous icmps into a memcmp
	; CHECK-NEXT: Expand memcmp() to load/stores			; CHECK-NEXT: Expand memcmp() to load/stores
	; CHECK-NEXT: Lower Garbage Collection Instructions			; CHECK-NEXT: Lower Garbage Collection Instructions
	; CHECK-NEXT: Shadow Stack GC Lowering			; CHECK-NEXT: Shadow Stack GC Lowering
				; CHECK-NEXT: Lower 'is.constant' Intrinsics
	; CHECK-NEXT: Remove unreachable blocks from the CFG			; CHECK-NEXT: Remove unreachable blocks from the CFG
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Natural Loop Information			; CHECK-NEXT: Natural Loop Information
	; CHECK-NEXT: Branch Probability Analysis			; CHECK-NEXT: Branch Probability Analysis
	; CHECK-NEXT: Block Frequency Analysis			; CHECK-NEXT: Block Frequency Analysis
	; CHECK-NEXT: Constant Hoisting			; CHECK-NEXT: Constant Hoisting
	; CHECK-NEXT: Partially inline calls to library functions			; CHECK-NEXT: Partially inline calls to library functions
	; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)			; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)
	▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

test/CodeGen/ARM/O3-pipeline.ll

	Show All 16 Lines
	; CHECK-NEXT: Induction Variable Users			; CHECK-NEXT: Induction Variable Users
	; CHECK-NEXT: Loop Strength Reduction			; CHECK-NEXT: Loop Strength Reduction
	; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)			; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
	; CHECK-NEXT: Function Alias Analysis Results			; CHECK-NEXT: Function Alias Analysis Results
	; CHECK-NEXT: Merge contiguous icmps into a memcmp			; CHECK-NEXT: Merge contiguous icmps into a memcmp
	; CHECK-NEXT: Expand memcmp() to load/stores			; CHECK-NEXT: Expand memcmp() to load/stores
	; CHECK-NEXT: Lower Garbage Collection Instructions			; CHECK-NEXT: Lower Garbage Collection Instructions
	; CHECK-NEXT: Shadow Stack GC Lowering			; CHECK-NEXT: Shadow Stack GC Lowering
				; CHECK-NEXT: Lower 'is.constant' Intrinsics
	; CHECK-NEXT: Remove unreachable blocks from the CFG			; CHECK-NEXT: Remove unreachable blocks from the CFG
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Natural Loop Information			; CHECK-NEXT: Natural Loop Information
	; CHECK-NEXT: Branch Probability Analysis			; CHECK-NEXT: Branch Probability Analysis
	; CHECK-NEXT: Block Frequency Analysis			; CHECK-NEXT: Block Frequency Analysis
	; CHECK-NEXT: Constant Hoisting			; CHECK-NEXT: Constant Hoisting
	; CHECK-NEXT: Partially inline calls to library functions			; CHECK-NEXT: Partially inline calls to library functions
	; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)			; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)
	▲ Show 20 Lines • Show All 125 Lines • Show Last 20 Lines

test/CodeGen/X86/O0-pipeline.ll

	Show All 18 Lines
	; CHECK-NEXT: Pre-ISel Intrinsic Lowering			; CHECK-NEXT: Pre-ISel Intrinsic Lowering
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
	; CHECK-NEXT: Expand Atomic instructions			; CHECK-NEXT: Expand Atomic instructions
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)			; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
	; CHECK-NEXT: Module Verifier			; CHECK-NEXT: Module Verifier
	; CHECK-NEXT: Lower Garbage Collection Instructions			; CHECK-NEXT: Lower Garbage Collection Instructions
	; CHECK-NEXT: Shadow Stack GC Lowering			; CHECK-NEXT: Shadow Stack GC Lowering
				; CHECK-NEXT: Lower 'is.constant' Intrinsics
	; CHECK-NEXT: Remove unreachable blocks from the CFG			; CHECK-NEXT: Remove unreachable blocks from the CFG
	; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)			; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)
	; CHECK-NEXT: Scalarize Masked Memory Intrinsics			; CHECK-NEXT: Scalarize Masked Memory Intrinsics
	; CHECK-NEXT: Expand reduction intrinsics			; CHECK-NEXT: Expand reduction intrinsics
	; CHECK-NEXT: Expand indirectbr instructions			; CHECK-NEXT: Expand indirectbr instructions
	; CHECK-NEXT: Rewrite Symbols			; CHECK-NEXT: Rewrite Symbols
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

test/CodeGen/X86/O3-pipeline.ll

	Show All 29 Lines
	; CHECK-NEXT: Induction Variable Users			; CHECK-NEXT: Induction Variable Users
	; CHECK-NEXT: Loop Strength Reduction			; CHECK-NEXT: Loop Strength Reduction
	; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)			; CHECK-NEXT: Basic Alias Analysis (stateless AA impl)
	; CHECK-NEXT: Function Alias Analysis Results			; CHECK-NEXT: Function Alias Analysis Results
	; CHECK-NEXT: Merge contiguous icmps into a memcmp			; CHECK-NEXT: Merge contiguous icmps into a memcmp
	; CHECK-NEXT: Expand memcmp() to load/stores			; CHECK-NEXT: Expand memcmp() to load/stores
	; CHECK-NEXT: Lower Garbage Collection Instructions			; CHECK-NEXT: Lower Garbage Collection Instructions
	; CHECK-NEXT: Shadow Stack GC Lowering			; CHECK-NEXT: Shadow Stack GC Lowering
				; CHECK-NEXT: Lower 'is.constant' Intrinsics
	; CHECK-NEXT: Remove unreachable blocks from the CFG			; CHECK-NEXT: Remove unreachable blocks from the CFG
	; CHECK-NEXT: Dominator Tree Construction			; CHECK-NEXT: Dominator Tree Construction
	; CHECK-NEXT: Natural Loop Information			; CHECK-NEXT: Natural Loop Information
	; CHECK-NEXT: Branch Probability Analysis			; CHECK-NEXT: Branch Probability Analysis
	; CHECK-NEXT: Block Frequency Analysis			; CHECK-NEXT: Block Frequency Analysis
	; CHECK-NEXT: Constant Hoisting			; CHECK-NEXT: Constant Hoisting
	; CHECK-NEXT: Partially inline calls to library functions			; CHECK-NEXT: Partially inline calls to library functions
	; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)			; CHECK-NEXT: Instrument function entry/exit with calls to e.g. mcount() (post inlining)
	▲ Show 20 Lines • Show All 133 Lines • Show Last 20 Lines

test/CodeGen/X86/is-constant.ll

	; RUN: llc -O2 < %s \| FileCheck %s --check-prefix=CHECK-O2 --check-prefix=CHECK
	; RUN: llc -O0 -fast-isel < %s \| FileCheck %s --check-prefix=CHECK-O0 --check-prefix=CHECK
	; RUN: llc -O0 -fast-isel=0 < %s \| FileCheck %s --check-prefix=CHECK-O0 --check-prefix=CHECK
	; RUN: llc -O0 -global-isel < %s \| FileCheck %s --check-prefix=CHECK-O0 --check-prefix=CHECK

	;; Ensure that an unfoldable is.constant gets lowered reasonably in
	;; optimized codegen, in particular, that the "true" branch is
	;; eliminated.
	;;
	;; This isn't asserting any specific output from non-optimized runs,
	;; (e.g., currently the not-taken branch does not get eliminated). But
	;; it does ensure that lowering succeeds in all 3 codegen paths.

	target triple = "x86_64-unknown-linux-gnu"

	declare i1 @llvm.is.constant.i32(i32 %a) nounwind readnone
	declare i1 @llvm.is.constant.i64(i64 %a) nounwind readnone
	declare i64 @llvm.objectsize.i64.p0i8(i8*, i1, i1, i1) nounwind readnone

	declare i32 @subfun_1()
	declare i32 @subfun_2()

	define i32 @test_branch(i32 %in) nounwind {
	; CHECK-LABEL: test_branch:
	; CHECK-O2: %bb.0:
	; CHECK-O2-NEXT: jmp subfun_2
	%v = call i1 @llvm.is.constant.i32(i32 %in)
	br i1 %v, label %True, label %False

	True:
	%call1 = tail call i32 @subfun_1()
	ret i32 %call1

	False:
	%call2 = tail call i32 @subfun_2()
	ret i32 %call2
	}

	;; llvm.objectsize is another tricky case which gets folded to -1 very
	;; late in the game. We'd like to ensure that llvm.is.constant of
	;; llvm.objectsize is true.
	define i1 @test_objectsize(i8* %obj) nounwind {
	; CHECK-LABEL: test_objectsize:
	; CHECK-O2: %bb.0:
	; CHECK-O2: movb $1, %al
	; CHECK-O2-NEXT: retq
	%os = call i64 @llvm.objectsize.i64.p0i8(i8* %obj, i1 false, i1 false, i1 false)
	%v = call i1 @llvm.is.constant.i64(i64 %os)
	ret i1 %v
	}

test/Transforms/LowerIsConstant/is-constant.ll

	; RUN: llc -O2 < %s \| FileCheck %s --check-prefix=CHECK-O2 --check-prefix=CHECK			; RUN: opt --lower-is-constant --unreachableblockelim -S < %s \| FileCheck %s
				jyknightUnsubmitted Not Done Reply Inline Actions While this is a more targeted run, the old asserts are checking what _actually_ needs to continue to work. E.g. I'd be more confident that my suggested removal of the handling code elsewhere was correct if this test still ran the original 4 tests. jyknight: While this is a more targeted run, the old asserts are checking what _actually_ needs to…
	; RUN: llc -O0 -fast-isel < %s \| FileCheck %s --check-prefix=CHECK-O0 --check-prefix=CHECK
	; RUN: llc -O0 -fast-isel=0 < %s \| FileCheck %s --check-prefix=CHECK-O0 --check-prefix=CHECK
	; RUN: llc -O0 -global-isel < %s \| FileCheck %s --check-prefix=CHECK-O0 --check-prefix=CHECK

	;; Ensure that an unfoldable is.constant gets lowered reasonably in			;; Ensure that an unfoldable is.constant gets lowered reasonably in
	;; optimized codegen, in particular, that the "true" branch is			;; optimized codegen, in particular, that the "true" branch is
	;; eliminated.			;; eliminated.
	;;
	;; This isn't asserting any specific output from non-optimized runs,
	;; (e.g., currently the not-taken branch does not get eliminated). But
	;; it does ensure that lowering succeeds in all 3 codegen paths.

	target triple = "x86_64-unknown-linux-gnu"			;; CHECK-NOT: tail call i32 @subfun_1()
				;; CHECK: tail call i32 @subfun_2()
				;; CHECK-NOT: tail call i32 @subfun_1()

	declare i1 @llvm.is.constant.i32(i32 %a) nounwind readnone			declare i1 @llvm.is.constant.i32(i32 %a) nounwind readnone
	declare i1 @llvm.is.constant.i64(i64 %a) nounwind readnone			declare i1 @llvm.is.constant.i64(i64 %a) nounwind readnone
	declare i64 @llvm.objectsize.i64.p0i8(i8*, i1, i1, i1) nounwind readnone

	declare i32 @subfun_1()			declare i32 @subfun_1()
	declare i32 @subfun_2()			declare i32 @subfun_2()

	define i32 @test_branch(i32 %in) nounwind {			define i32 @test_branch(i32 %in) nounwind {
	; CHECK-LABEL: test_branch:
	; CHECK-O2: %bb.0:
	; CHECK-O2-NEXT: jmp subfun_2
	%v = call i1 @llvm.is.constant.i32(i32 %in)			%v = call i1 @llvm.is.constant.i32(i32 %in)
	br i1 %v, label %True, label %False			br i1 %v, label %True, label %False

	True:			True:
	%call1 = tail call i32 @subfun_1()			%call1 = tail call i32 @subfun_1()
	ret i32 %call1			ret i32 %call1

	False:			False:
	%call2 = tail call i32 @subfun_2()			%call2 = tail call i32 @subfun_2()
	ret i32 %call2			ret i32 %call2
	}			}

	;; llvm.objectsize is another tricky case which gets folded to -1 very
	;; late in the game. We'd like to ensure that llvm.is.constant of
	;; llvm.objectsize is true.
	define i1 @test_objectsize(i8* %obj) nounwind {
	; CHECK-LABEL: test_objectsize:
	; CHECK-O2: %bb.0:
	; CHECK-O2: movb $1, %al
	; CHECK-O2-NEXT: retq
	%os = call i64 @llvm.objectsize.i64.p0i8(i8* %obj, i1 false, i1 false, i1 false)
	%v = call i1 @llvm.is.constant.i64(i64 %os)
	ret i1 %v
	}

utils/gn/secondary/llvm/lib/Transforms/Scalar/BUILD.gn

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	sources = [
"LoopStrengthReduce.cpp",		"LoopStrengthReduce.cpp",
"LoopUnrollAndJamPass.cpp",		"LoopUnrollAndJamPass.cpp",
"LoopUnrollPass.cpp",		"LoopUnrollPass.cpp",
"LoopUnswitch.cpp",		"LoopUnswitch.cpp",
"LoopVersioningLICM.cpp",		"LoopVersioningLICM.cpp",
"LowerAtomic.cpp",		"LowerAtomic.cpp",
"LowerExpectIntrinsic.cpp",		"LowerExpectIntrinsic.cpp",
"LowerGuardIntrinsic.cpp",		"LowerGuardIntrinsic.cpp",
		"LowerIsConstantIntrinsic.cpp",
"LowerWidenableCondition.cpp",		"LowerWidenableCondition.cpp",
"MakeGuardsExplicit.cpp",		"MakeGuardsExplicit.cpp",
"MemCpyOptimizer.cpp",		"MemCpyOptimizer.cpp",
"MergeICmps.cpp",		"MergeICmps.cpp",
"MergedLoadStoreMotion.cpp",		"MergedLoadStoreMotion.cpp",
"NaryReassociate.cpp",		"NaryReassociate.cpp",
"NewGVN.cpp",		"NewGVN.cpp",
"PartiallyInlineLibCalls.cpp",		"PartiallyInlineLibCalls.cpp",
Show All 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Add a pass to lower is.constant and objectsize intrinsicsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 212070

bindings/ocaml/transforms/scalar_opts/llvm_scalar_opts.mli

bindings/ocaml/transforms/scalar_opts/scalar_opts_ocaml.c

include/llvm-c/Transforms/Scalar.h

include/llvm/Analysis/InstructionSimplify.h

include/llvm/InitializePasses.h

include/llvm/LinkAllPasses.h

include/llvm/Transforms/Scalar.h

include/llvm/Transforms/Scalar/LowerIsConstantIntrinsic.h

lib/Analysis/InstructionSimplify.cpp

lib/CodeGen/TargetPassConfig.cpp

lib/Passes/PassBuilder.cpp

lib/Passes/PassRegistry.def

lib/Transforms/Scalar/CMakeLists.txt

lib/Transforms/Scalar/LowerIsConstantIntrinsic.cpp

lib/Transforms/Scalar/Scalar.cpp

test/CodeGen/AArch64/O0-pipeline.ll

test/CodeGen/AArch64/O3-pipeline.ll

test/CodeGen/ARM/O3-pipeline.ll

test/CodeGen/X86/O0-pipeline.ll

test/CodeGen/X86/O3-pipeline.ll

test/CodeGen/X86/is-constant.ll

test/Transforms/LowerIsConstant/is-constant.ll

utils/gn/secondary/llvm/lib/Transforms/Scalar/BUILD.gn

Add a pass to lower is.constant and objectsize intrinsics
ClosedPublic