This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/test/CodeGen/
-
test/
-
CodeGen/
-
thinlto-distributed-newpm.ll
-
llvm/
-
lib/Passes/
-
Passes/
7/14
PassBuilderPipelines.cpp
-
test/
-
Other/
-
new-pm-defaults.ll
-
new-pm-lto-defaults.ll
-
new-pm-thinlto-defaults.ll
-
new-pm-thinlto-postlink-pgo-defaults.ll
-
new-pm-thinlto-postlink-samplepgo-defaults.ll
-
new-pm-thinlto-prelink-pgo-defaults.ll
-
new-pm-thinlto-prelink-samplepgo-defaults.ll
-
Transforms/
-
InstCombine/
-
unused-nonnull.ll
-
PhaseOrdering/
-
dce-after-argument-promotion.ll

Differential D128830

[Pipelines] Introduce DAE after ArgumentPromotion
ClosedPublic

Authored by psamolysov on Jun 29 2022, 8:25 AM.

Download Raw Diff

Details

Reviewers

aeubanks
mtrofin
fhahn
yln
serge-sans-paille
nikic
jdoerfert

Commits

rG1c530500ab86: [Pipelines] Introduce DAE after ArgumentPromotion
rGb10a341aa5b0: [Pipelines] Introduce DAE after ArgumentPromotion
rG879f5118fc74: [Pipelines] Introduce DAE after ArgumentPromotion
rG3f20dcbf708c: [Pipelines] Introduce DAE after ArgumentPromotion

Summary

The ArgumentPromotion pass uses Mem2Reg promotion at the end to cutting
down generated alloca instructions as well as meaningless stores and
this behavior can leave unused (dead) arguments. To eliminate the dead
arguments and therefore let the DeadCodeElimination remove becoming dead
inserted GEPs as well as loads and casts in the callers, the
DeadArgumentElimination pass should be run after the ArgumentPromotion
one.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	60,060 ms	x64 debian > ThreadSanitizer-x86_64.ThreadSanitizer-x86_64::restore_stack.cpp

Event Timeline

psamolysov created this revision.Jun 29 2022, 8:25 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 29 2022, 8:25 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

psamolysov requested review of this revision.Jun 29 2022, 8:25 AM

Herald added a reviewer: jdoerfert. · View Herald TranscriptJun 29 2022, 8:25 AM

Herald added subscribers: llvm-commits, sstefan1. · View Herald Transcript

Harbormaster completed remote builds in B172773: Diff 441025.Jun 29 2022, 9:54 AM

There should be a PhaseOrdering test that shows that the argument is not removed in the current opimization pipeline.

llvm/lib/Passes/PassBuilderPipelines.cpp
775	I don't think we need a late module pass for this, it can be part of the normal module optimization pipeline (post module simplification).

In D128830#3621522, @nikic wrote:

There should be a PhaseOrdering test that shows that the argument is not removed in the current opimization pipeline.

Thank you for the suggestion. I've added the test (3b7650da725) and rebased the current patch onto main to modify the test here. Also, I've removed the example from the patch's description because it has actually been moved into the test.

llvm/lib/Passes/PassBuilderPipelines.cpp
775	I'm afraid I didn't get your comment. What I tried is to add `DAE` with `MIWP.addModulePass` but such module passes run before call graph passes.

Add a PhaseOrdering test to show how the patch affects dead argument elimination after argument promotion.

Harbormaster completed remote builds in B172995: Diff 441334.Jun 30 2022, 4:33 AM

psamolysov planned changes to this revision.Jun 30 2022, 4:54 AM

A number of tests failed, I'm going to fix them ASAP.

Actualize the new PM tests.

Herald added subscribers: ormris, wenlei, steven_wu. · View Herald TranscriptJun 30 2022, 5:17 AM

Harbormaster completed remote builds in B173016: Diff 441364.Jun 30 2022, 6:06 AM

mtrofin added inline comments.Jun 30 2022, 7:50 AM

llvm/lib/Passes/PassBuilderPipelines.cpp
731–733	are these changes due to running clang-format (i.e. the previous change didn't?)
1611	same clang-format q
1689	spurious change?
1712	was this clang-format?

fhahn added inline comments.Jun 30 2022, 7:52 AM

llvm/lib/Passes/PassBuilderPipelines.cpp
731–733	looks like it, usually it's better to use https://github.com/llvm-mirror/clang/blob/master/tools/clang-format/clang-format-diff.py

Do we need to retain the run of DeadArgumentEliminationPass in the original position or is a single run at the new position sufficient?

psamolysov added inline comments.Jun 30 2022, 7:57 AM

llvm/lib/Passes/PassBuilderPipelines.cpp
731–733	@mtrofin Yes, this change is the result of a run of clang-format as well as the previous one. @fhahn Thank you for the suggestion.
1611	Should I revert the changes introduced by clang-format?
1689	This change is introduced by clang-format. Should there be two empty lines between `MainFPM.addPass(MergedLoadStoreMotionPass());` and `if (EnableConstraintElimination)`?
1712	Yes, it was.

psamolysov added inline comments.Jun 30 2022, 8:03 AM

llvm/lib/Passes/PassBuilderPipelines.cpp
1611	If these formatting changes make sense, I can introduce them in a separate patch. Should I?

mtrofin added inline comments.Jun 30 2022, 8:05 AM

llvm/lib/Passes/PassBuilderPipelines.cpp
1611	They are fine here, I was just checking. Thanks!

In D128830#3622467, @fhahn wrote:

Do we need to retain the run of DeadArgumentEliminationPass in the original position or is a single run at the new position sufficient?

Good point! I tried and also removed the guard that the DAE pass should run with O3 only (currently it run exactly as the pass in the original position did: with any O > 0). I have a chance to run the LLVM :: Transforms tests only, one test failed - LLVM :: Transforms\InstCombine\unused-nonnull.ll:

error: CHECK-SAME: expected string not found in input
; CHECK-SAME: (i32 [[ARGC:%.*]], i8** nocapture readnone [[ARGV:%.*]]) local_unnamed_addr #[[ATTR0:[0-9]+]] {
              ^
<stdin>:7:17: note: scanning from here
define i32 @main(i32 %argc, i8** nocapture readonly %argv) local_unnamed_addr #0 {

The difference is that the %argv argument has the readonly attribute, not readnone. I'm not sure whether this makes much sense (I guess this could because readnone can theoretically open a door for some optimizations for which readonly cannot).

I've submitted this change to let the buildbot run all the tests and see the difference.

llvm/lib/Passes/PassBuilderPipelines.cpp
775	Oh, I think I have got the idea: instead of adding the `DAE` with `MIWP.addLateModulePass()` just after adding the `ArgumentPromotion` in the `buildInlinerPipeline`, to add `DAE` after this line with usual `addPass`: MPM.addPass(buildInlinerPipeline(Level, Phase)); This makes sense when we removing the original point of adding the `DAE` pass, because the `buildInlinerPipeline()` function is called not in every case. Actually, we just moving `DAE` after inlining.

Try to remove the DAE from the original point. Also, apply the suggestion from @nikic - make the DAE a part of the normal module optimization pipeline (post module simplification).

Harbormaster completed remote builds in B173063: Diff 441426.Jun 30 2022, 10:01 AM

Fix Clang :: CodeGen/thinlto-distributed-newpm.ll

Herald added a project: Restricted Project. · View Herald TranscriptJun 30 2022, 11:13 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Harbormaster completed remote builds in B173084: Diff 441462.Jun 30 2022, 12:09 PM

moving DAE after the function simplification pipeline makes sense

In D128830#3622736, @psamolysov wrote:
In D128830#3622467, @fhahn wrote:

Do we need to retain the run of DeadArgumentEliminationPass in the original position or is a single run at the new position sufficient?

Good point! I tried and also removed the guard that the DAE pass should run with O3 only (currently it run exactly as the pass in the original position did: with any O > 0). I have a chance to run the LLVM :: Transforms tests only, one test failed - LLVM :: Transforms\InstCombine\unused-nonnull.ll:
error: CHECK-SAME: expected string not found in input
; CHECK-SAME: (i32 [[ARGC:%.*]], i8** nocapture readnone [[ARGV:%.*]]) local_unnamed_addr #[[ATTR0:[0-9]+]] {
              ^
<stdin>:7:17: note: scanning from here
define i32 @main(i32 %argc, i8** nocapture readonly %argv) local_unnamed_addr #0 {
The difference is that the %argv argument has the readonly attribute, not readnone. I'm not sure whether this makes much sense (I guess this could because readnone can theoretically open a door for some optimizations for which readonly cannot).

this would be fixed by running PostOrderFunctionAttrsPass after the function simplification pipeline which is something I've wanted to do but never got around to testing. we should be inferring more precise attributes after fully simplifying functions. the only case that might regress is recursive functions since when visiting the recursive calls we haven't computed attributes for the recursive functions yet

this test is regressing with this patch because previously DAE would replace passed arguments with poison if the argument isn't used in the function, removing the use of %argv in @main and func-attrs would mark %argv as readnone, but with this patch func-attrs runs on @main before the use of %argv is eliminated. the call to @compute is eliminated in all cases in the function simplification pipeline due to inferring the returned attribute on %x, which is why running func-attrs after the function simplification pipeline fixes this issue

@@ -759,9 +758,6 @@ PassBuilder::buildInlinerPipeline(OptimizationLevel Level,
   if (AttributorRun & AttributorRunOption::CGSCC)
     MainCGPipeline.addPass(AttributorCGSCCPass());
 
-  // Now deduce any function attributes based in the current code.
-  MainCGPipeline.addPass(PostOrderFunctionAttrsPass());
-
   // When at O3 add argument promotion to the pass pipeline.
   // FIXME: It isn't at all clear why this should be limited to O3.
   if (Level == OptimizationLevel::O3)
@@ -781,6 +777,9 @@ PassBuilder::buildInlinerPipeline(OptimizationLevel Level,
       buildFunctionSimplificationPipeline(Level, Phase),
       PTO.EagerlyInvalidateAnalyses, EnableNoRerunSimplificationPipeline));
 
+  // Now deduce any function attributes based in the current code.
+  MainCGPipeline.addPass(PostOrderFunctionAttrsPass());
+
   MainCGPipeline.addPass(CoroSplitPass(Level != OptimizationLevel::O0));

@aeubanks Thank you for the great explanation.

I've applied your suggestion and re-uploaded the patch. The test unused-nonnull.ll has been fixed with keeping the readnone attribute.

Harbormaster completed remote builds in B174662: Diff 443631.Jul 11 2022, 8:03 AM

[Pipelines] Fix the Clang :: CodeGenCoroutines/coro-elide.cpp

Now, the %_Z5task1v.Frame type doesn't contain a field of the
%_Z5task0v.Frame one.

I'd tried moving func-attrs after the function simplification pipeline and perf-wise on internal benchmarks it looks fine, but it causes some compile time regressions: https://llvm-compile-time-tracker.com/compare.php?from=d3dd6e57fe84e90cadcdc78fa71d632f6573f156&to=592b8acec55f577713a5e1fb610a36c8742c682b&stat=instructions
My guess is something to do with caching/invalidating analyses but I'm not 100% sure

(and we should definitely separate that out into its own patch first)

Harbormaster completed remote builds in B174687: Diff 443661.Jul 11 2022, 10:02 AM

@aeubanks Hmm, if I correctly get your comment, I should revert this patch to the state before the proposed solution with moving the PostOrderFunctionAttrsPass at the end of the buildInlinerPipeline function regardless of the readonly instead of readnone regression. Personally along with your concern about compilation time, I have a concern about some changing in coroutines compilation, the Clang :: CodeGenCoroutines/coro-elide.cpp test demonstrates them:

// CHECK-NOT: %_Z5task1v.Frame = type {{.*}}%_Z5task0v.Frame

instead of

// CHECK: %_Z5task1v.Frame = type {{.*}}%_Z5task0v.Frame

I've reverted the latest changes because they require more investigation and, as you said, should be introduced in a separate patch.

Return the PostOrderFunctionAttrsPass pass back on its original place in the pipeline.

Harbormaster completed remote builds in B174813: Diff 443860.Jul 12 2022, 2:57 AM

I think even with the readnone -> readonly change this patch should be fine, but let me run some internal benchmarks on this patch
the func-attrs change can come later

In D128830#3647168, @aeubanks wrote:

... but let me run some internal benchmarks on this patch

@aeubanks Sorry for the late answer. Did you have a chance to run the benchmarks? If so, could you share the results?

sorry for the big delay, I did run the benchmarks and it looks fine

ran this through llvm-compile-time-tracker, looks good https://llvm-compile-time-tracker.com/compare.php?from=552b59b9e69fe1cb2b1ee0cb49cf8376a3dc0869&to=7af184006a9fd9d8deca5b7ae3625127fbb42535&stat=instructions

so I think we're good to go

This revision is now accepted and ready to land.Aug 23 2022, 1:35 PM

This revision was landed with ongoing or failed builds.Aug 24 2022, 12:38 AM

Closed by commit rG3f20dcbf708c: [Pipelines] Introduce DAE after ArgumentPromotion (authored by psamolysov). · Explain Why

This revision was automatically updated to reflect the committed changes.

psamolysov added a commit: rG3f20dcbf708c: [Pipelines] Introduce DAE after ArgumentPromotion.

@aeubanks Thank you very much for the benchmark results and patch review. I've landed the patch.

FYI, this broke the LLDB build bot: https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/46324/execution/node/74/log/

Looks like we're testing that inlined unused parameters display correctly...

AssertionError: '(void *) unused1 = <no location, value may have been optimized out>' not found in '(void *) unused1 = 0x000000016fdff4d0\n'

But with this patch DWARF contains this extra entry for the unused parameter:

0x00000045:     DW_TAG_formal_parameter                                                                             
                  DW_AT_location    (0x00000000:                                                                    
                     [0x0000000100003f1c, 0x0000000100003f20): DW_OP_reg0 W0                                        
                     [0x0000000100003f20, 0x0000000100003f24): DW_OP_entry_value(DW_OP_reg0 W0), DW_OP_stack_value) 
                  DW_AT_abstract_origin (0x00000067 "unused1")

whereas previously it was,

0x00000045:     DW_TAG_formal_parameter                       
                  DW_AT_abstract_origin (0x00000061 "unused1")

Maybe a flaw in the test? Any idea if this is a debug-info regression?

psamolysov added a reverting change: rG6703ad1e0c2a: Revert "[Pipelines] Introduce DAE after ArgumentPromotion".Aug 24 2022, 2:44 AM

@Michael137 Thank you very much for the information!

I'm not sure, but it looks like the introduced change of the readnone attribute to readonly might make impact on DWARF. Unfortunately, I have no idea should this changes in DWARF be fixed or just it is enough to actualize the test.

I've reverted the patch to give our time to make the decision about DWARF generation.

In D128830#3745069, @psamolysov wrote:

@Michael137 Thank you very much for the information!

I'm not sure, but it looks like the introduced change of the readnone attribute to readonly might make impact on DWARF. Unfortunately, I have no idea should this changes in DWARF be fixed or just it is enough to actualize the test.

I've reverted the patch to give our time to make the decision about DWARF generation.

Thanks!

@aprantl @dblaikie Looks like this needs to accommodate existing DWARF generation behaviour?

FYI, this test compiles with -O1, presumably expecting unused params to get optimised out (and not having a location attr in DWARF).

I tried to triage a bit. The test lldb\test\API\functionalities\unused-inlined-parameters\TestUnusedInlinedParameters.py compiles the code in main.c with -O1 and generates the following IR for the @f function:

; Function Attrs: alwaysinline nounwind uwtable
define dso_local void @f(ptr nocapture noundef readnone %unused1, i32 noundef %used, i32 noundef %unused2) local_unnamed_addr #1 {
entry:
  tail call void @use(i32 noundef %used)
  ret void
}

With the reverted patch, the IR looks like the follow:

; Function Attrs: alwaysinline nounwind uwtable
define dso_local void @f(ptr nocapture readnone %unused1, i32 noundef %used, i32 %unused2) local_unnamed_addr #1 {
entry:
  tail call void @use(i32 noundef %used)
  ret void
}

So, as we can see, the attribute readnone is present for the first unused argument for which an additional piece of the DWARF code is generated in both IRs but the noundef attribute is omitted in the second case (so, this attribute is introduces by some changes in the patch and this is "documented" by the changes in the llvm/test/Transforms/InstCombine/unused-nonnull.ll test). I believe this comment from @aeubanks can describe what happens on the IR level: D128830#3639373

I'd try compiling with clang -g2 to see the effects of this patch on the debug info in the IR

In D128830#3745031, @Michael137 wrote:

FYI, this broke the LLDB build bot: https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/46324/execution/node/74/log/

Looks like we're testing that inlined unused parameters display correctly...

AssertionError: '(void *) unused1 = <no location, value may have been optimized out>' not found in '(void *) unused1 = 0x000000016fdff4d0\n'

But with this patch DWARF contains this extra entry for the unused parameter:

0x00000045:     DW_TAG_formal_parameter                                                                             
                  DW_AT_location    (0x00000000:                                                                    
                     [0x0000000100003f1c, 0x0000000100003f20): DW_OP_reg0 W0                                        
                     [0x0000000100003f20, 0x0000000100003f24): DW_OP_entry_value(DW_OP_reg0 W0), DW_OP_stack_value) 
                  DW_AT_abstract_origin (0x00000067 "unused1")

whereas previously it was,

0x00000045:     DW_TAG_formal_parameter                       
                  DW_AT_abstract_origin (0x00000061 "unused1")

Based on that debug info it looks like the patch might've improved things - the 'previous' description has no location, the new one has a location (if it's correct - is there evidence it's incorrect?)

What was the expected behavior of the test? What's the new behavior? Oh, I can read the assertion now.

The assertion was that there is no location - but now there is a location. That looks like a good thing?

Maybe a flaw in the test? Any idea if this is a debug-info regression?

I think we can "fix" the test with the following patch:

diff --git a/lldb/test/API/functionalities/unused-inlined-parameters/main.c b/lldb/test/API/functionalities/unused-inlined-parameters/main.c
index f2ef5dcc213d..9b9f95f6c946 100644
--- a/lldb/test/API/functionalities/unused-inlined-parameters/main.c
+++ b/lldb/test/API/functionalities/unused-inlined-parameters/main.c
@@ -7,6 +7,7 @@ __attribute__((always_inline)) void f(void *unused1, int used, int unused2) {
 }
 
 int main(int argc, char **argv) {
-  f(argv, 42, 1);
+  char *undefined;
+  f(undefined, 42, 1);
   return 0;
-}

In D128830#3746153, @aprantl wrote:

I think we can "fix" the test with the following patch:

diff --git a/lldb/test/API/functionalities/unused-inlined-parameters/main.c b/lldb/test/API/functionalities/unused-inlined-parameters/main.c
index f2ef5dcc213d..9b9f95f6c946 100644
--- a/lldb/test/API/functionalities/unused-inlined-parameters/main.c
+++ b/lldb/test/API/functionalities/unused-inlined-parameters/main.c
@@ -7,6 +7,7 @@ __attribute__((always_inline)) void f(void *unused1, int used, int unused2) {
 }
 
 int main(int argc, char **argv) {
-  f(argv, 42, 1);
+  char *undefined;
+  f(undefined, 42, 1);
   return 0;
-}

Made the change to the test. Confirmed it passes with and without the patch. Feel free to push again.

Thanks!

psamolysov added a commit: rG879f5118fc74: [Pipelines] Introduce DAE after ArgumentPromotion.Aug 25 2022, 12:56 AM

Colleagues, thank you for the discussion.

@aprantl @Michael137 Thank you very much for the patch and the test changes and confirmation. I've pushed the patch again.

Michael137 mentioned this in D132664: [debuginfo-tests] Un-XFAIL now passing unused-merged-value.c test.Aug 25 2022, 8:25 AM

This triggers failed asserts for me:

$ cat repro.c 
static float strtof(char *, char *) {}
void a() { strtof(a, 0); }
$ clang -target x86_64-w64-mingw32 -w -c repro.c -O3
clang: ../lib/Analysis/CGSCCPassManager.cpp:958: updateCGAndAnalysisManagerForPass(llvm::LazyCallGraph&, llvm::LazyCallGraph::SCC&, llvm::LazyCallGraph::Node&, llvm::CGSCCAnalysisManager&, llvm::CGSCCUpdateResult&, llvm::FunctionAnalysisManager&, bool)::<lambda(llvm::Function&)>: Assertion `RefereeN && "Visited function should already have an associated node"' failed.

@mstorsjo Thank you very much for the information. Unfortunately, our tests didn't catch this problem.

I've reproduced this on Windows even w/o mingw. Some time is required for triaging.

We also saw this assert on our Windows build, and it also can be reproduced in Linux:

$ cat test2.c
static char *getenv(char *) {}
void foo() { getenv(""); }
$ ~/src/upstream/879f5118fc74657e4a5c4eff6810098e1eed75ac-linux/bin/clang -c -O3 test2.c                                  
test2.c:1:27: warning: omitting the parameter name in a function definition is a C2x extension [-Wc2x-extensions]                                                                                                                                   
static char *getenv(char *) {}                                                                                                                                                                                                                      
                          ^                                                                                                                                                                                                                         
test2.c:1:30: warning: non-void function does not return a value [-Wreturn-type]                                                                                                                                                                    
static char *getenv(char *) {}                                                                                                                                                                                                                      
                             ^                                                                                            
clang: /home/dyung/src/upstream/llvm_clean_git/llvm/lib/Analysis/CGSCCPassManager.cpp:958: updateCGAndAnalysisManagerForPass(llvm::LazyCallGraph&, llvm::LazyCallGraph::SCC&, llvm::LazyCallGraph::Node&, llvm::CGSCCAnalysisManager&, llvm::CGSCCUpdateResult&, llvm::FunctionAnalysisManager&, bool)::<lambda(llvm::Function&)>: Assertion `RefereeN && "Visited function should already have an associated node"' failed.

The key seems to be the special function getenv(). If I rename the function, the crash does not occur.

In D128830#3751368, @dyung wrote:

We also saw this assert on our Windows build, and it also can be reproduced in Linux:

$ cat test2.c
static char *getenv(char *) {}
void foo() { getenv(""); }
$ ~/src/upstream/879f5118fc74657e4a5c4eff6810098e1eed75ac-linux/bin/clang -c -O3 test2.c                                  
test2.c:1:27: warning: omitting the parameter name in a function definition is a C2x extension [-Wc2x-extensions]                                                                                                                                   
static char *getenv(char *) {}                                                                                                                                                                                                                      
                          ^                                                                                                                                                                                                                         
test2.c:1:30: warning: non-void function does not return a value [-Wreturn-type]                                                                                                                                                                    
static char *getenv(char *) {}                                                                                                                                                                                                                      
                             ^                                                                                            
clang: /home/dyung/src/upstream/llvm_clean_git/llvm/lib/Analysis/CGSCCPassManager.cpp:958: updateCGAndAnalysisManagerForPass(llvm::LazyCallGraph&, llvm::LazyCallGraph::SCC&, llvm::LazyCallGraph::Node&, llvm::CGSCCAnalysisManager&, llvm::CGSCCUpdateResult&, llvm::FunctionAnalysisManager&, bool)::<lambda(llvm::Function&)>: Assertion `RefereeN && "Visited function should already have an associated node"' failed.

The key seems to be the special function getenv(). If I rename the function, the crash does not occur.

Note that our internal builder found this while trying to build compiler-rt/lib/profile/InstrProfilingFile.c using the newly built compiler. If you think this might take a while to debug, could you please revert the change while you investigate?

psamolysov added a reverting change: rGf964417c32d0: Revert "[Pipelines] Introduce DAE after ArgumentPromotion".Aug 26 2022, 3:43 AM

No problem, I've reverted the commit while I need some time to build clang with the reverted commit even to make it clear the commit is guilty.

I'm sorry. It's very interesting, in @mstorsjo case, a function from the standard C library is used: strtof. When I renamed it in the test into strtof2, the problem has becomes not reproducible.

@Michael137 It looks like some DWARF generation/debugger test can fail after the revertion.

Noted, thanks for the heads up

AFAIK, the only test that'd now fail on the debug-info side is: https://reviews.llvm.org/D132664

reduced:

define void @a() {
entry:
  %call = call float @strtof(ptr noundef null, ptr noundef null)
  ret void
}

define internal float @strtof(ptr noundef %0, ptr noundef %1) nounwind {
entry:
  ret float 0.0
}

./build/rel/bin/opt -passes='inline,argpromotion' -disable-output /tmp/b.ll

likely something to do with how the CGSCC pass manager handles lib function (see isKnownLibFunction in LazyCallGraph.cpp)

aeubanks reopened this revision.Aug 26 2022, 2:20 PM

This revision is now accepted and ready to land.Aug 26 2022, 2:20 PM

aeubanks mentioned this in D132764: [LazyCallGraph] Update libcall list when replacing a libcall node's function.Aug 26 2022, 3:10 PM

sent out https://reviews.llvm.org/D132764 to fix the CGSCC crash

@aeubanks Thank you for the investigation! I believe this patch can be re-landed after D132764 is committed.

aeubanks mentioned this in rG7a94d189ad1a: [LazyCallGraph] Update libcall list when replacing a libcall node's function.Aug 27 2022, 11:08 AM

Closed by commit rGb10a341aa5b0: [Pipelines] Introduce DAE after ArgumentPromotion (authored by psamolysov). · Explain WhyAug 28 2022, 12:51 AM

This revision was automatically updated to reflect the committed changes.

psamolysov added a commit: rGb10a341aa5b0: [Pipelines] Introduce DAE after ArgumentPromotion.

zequanwu added a subscriber: zequanwu.Aug 29 2022, 1:48 PM

This comment was removed by zequanwu.

aeubanks added a reverting change: rG9599393eebf7: Revert "[Pipelines] Introduce DAE after ArgumentPromotion".Sep 1 2022, 8:52 AM

I've run into https://github.com/llvm/llvm-project/issues/56503 after this patch so I've reverted this for now while I'm fixing that. Sorry for this patch exposing so many pre-existing issues. Will reland after fixing.

aprantl mentioned this in rG5203168f91f3: Revert "[debuginfo-tests] Un-XFAIL no passing unused-merged-value.c test".Sep 2 2022, 8:47 AM

aeubanks added a commit: rG1c530500ab86: [Pipelines] Introduce DAE after ArgumentPromotion.Sep 22 2022, 3:34 PM

that was fixed and I've relanded the patch

hopefully there aren't any more CGSCC issues this uncovers

@aeubanks Thank you very much for the re-landing.

aeubanks mentioned this in D145210: [Pipeline] Adjust PostOrderFunctionAttrs placement in simplification pipeline.Mar 2 2023, 7:01 PM

aeubanks mentioned this in rG0d4a709bb876: [Pipeline] Adjust PostOrderFunctionAttrs placement in simplification pipeline.Mar 6 2023, 9:02 AM

nikic mentioned this in D146051: [Pipelines] Restore old DAE position in LTO pipeline.Mar 14 2023, 8:03 AM

nikic mentioned this in rGfb5683449e97: [Pipelines] Restore old DAE position in LTO pipeline.Mar 14 2023, 9:00 AM

Revision Contents

Path

Size

clang/

test/

CodeGen/

thinlto-distributed-newpm.ll

2 lines

llvm/

lib/

Passes/

PassBuilderPipelines.cpp

29 lines

test/

Other/

new-pm-defaults.ll

2 lines

new-pm-lto-defaults.ll

2 lines

new-pm-thinlto-defaults.ll

2 lines

new-pm-thinlto-postlink-pgo-defaults.ll

2 lines

new-pm-thinlto-postlink-samplepgo-defaults.ll

2 lines

new-pm-thinlto-prelink-pgo-defaults.ll

2 lines

new-pm-thinlto-prelink-samplepgo-defaults.ll

2 lines

Transforms/

InstCombine/

unused-nonnull.ll

5 lines

PhaseOrdering/

dce-after-argument-promotion.ll

5 lines

Diff 443860

clang/test/CodeGen/thinlto-distributed-newpm.ll

	Show All 28 Lines
	; CHECK-O: Running pass: SROAPass on main			; CHECK-O: Running pass: SROAPass on main
	; CHECK-O: Running pass: EarlyCSEPass on main			; CHECK-O: Running pass: EarlyCSEPass on main
	; CHECK-O3: Running pass: CallSiteSplittingPass on main			; CHECK-O3: Running pass: CallSiteSplittingPass on main
	; CHECK-O: Running pass: LowerTypeTestsPass			; CHECK-O: Running pass: LowerTypeTestsPass
	; CHECK-O: Running pass: IPSCCPPass			; CHECK-O: Running pass: IPSCCPPass
	; CHECK-O: Running pass: CalledValuePropagationPass			; CHECK-O: Running pass: CalledValuePropagationPass
	; CHECK-O: Running pass: GlobalOptPass			; CHECK-O: Running pass: GlobalOptPass
	; CHECK-O: Running pass: PromotePass			; CHECK-O: Running pass: PromotePass
	; CHECK-O: Running pass: DeadArgumentEliminationPass
	; CHECK-O: Running pass: InstCombinePass on main			; CHECK-O: Running pass: InstCombinePass on main
	; CHECK-O: Running pass: SimplifyCFGPass on main			; CHECK-O: Running pass: SimplifyCFGPass on main
	; CHECK-O: Running pass: InlinerPass on (main)			; CHECK-O: Running pass: InlinerPass on (main)
	; CHECK-O: Running pass: PostOrderFunctionAttrsPass on (main)			; CHECK-O: Running pass: PostOrderFunctionAttrsPass on (main)
	; CHECK-O3: Running pass: ArgumentPromotionPass on (main)			; CHECK-O3: Running pass: ArgumentPromotionPass on (main)
	; CHECK-O: Running pass: SROAPass on main			; CHECK-O: Running pass: SROAPass on main
	; CHECK-O: Running pass: EarlyCSEPass on main			; CHECK-O: Running pass: EarlyCSEPass on main
	; CHECK-O: Running pass: SpeculativeExecutionPass on main			; CHECK-O: Running pass: SpeculativeExecutionPass on main
	Show All 23 Lines
	; CHECK-O: Running pass: CorrelatedValuePropagationPass on main			; CHECK-O: Running pass: CorrelatedValuePropagationPass on main
	; CHECK-O: Running pass: ADCEPass on main			; CHECK-O: Running pass: ADCEPass on main
	; CHECK-O: Running pass: MemCpyOptPass on main			; CHECK-O: Running pass: MemCpyOptPass on main
	; CHECK-O: Running pass: DSEPass on main			; CHECK-O: Running pass: DSEPass on main
	; CHECK-O: Running pass: LoopSimplifyPass on main			; CHECK-O: Running pass: LoopSimplifyPass on main
	; CHECK-O: Running pass: LCSSAPass on main			; CHECK-O: Running pass: LCSSAPass on main
	; CHECK-O: Running pass: SimplifyCFGPass on main			; CHECK-O: Running pass: SimplifyCFGPass on main
	; CHECK-O: Running pass: InstCombinePass on main			; CHECK-O: Running pass: InstCombinePass on main
				; CHECK-O: Running pass: DeadArgumentEliminationPass
	; CHECK-O: Running pass: GlobalOptPass			; CHECK-O: Running pass: GlobalOptPass
	; CHECK-O: Running pass: GlobalDCEPass			; CHECK-O: Running pass: GlobalDCEPass
	; CHECK-O: Running pass: EliminateAvailableExternallyPass			; CHECK-O: Running pass: EliminateAvailableExternallyPass
	; CHECK-O: Running pass: ReversePostOrderFunctionAttrsPass			; CHECK-O: Running pass: ReversePostOrderFunctionAttrsPass
	; CHECK-O: Running pass: RecomputeGlobalsAAPass			; CHECK-O: Running pass: RecomputeGlobalsAAPass
	; CHECK-O: Running pass: Float2IntPass on main			; CHECK-O: Running pass: Float2IntPass on main
	; CHECK-O: Running pass: LowerConstantIntrinsicsPass on main			; CHECK-O: Running pass: LowerConstantIntrinsicsPass on main
	; CHECK-O: Running pass: LoopSimplifyPass on main			; CHECK-O: Running pass: LoopSimplifyPass on main
	Show All 36 Lines

llvm/lib/Passes/PassBuilderPipelines.cpp

Show First 20 Lines • Show All 629 Lines • ▼ Show 20 Lines	if (!IsCS && !DisablePreInliner) {
IP.HintThreshold = Level.isOptimizingForSize() ? PreInlineThreshold : 325;		IP.HintThreshold = Level.isOptimizingForSize() ? PreInlineThreshold : 325;
ModuleInlinerWrapperPass MIWP(		ModuleInlinerWrapperPass MIWP(
IP, /* MandatoryFirst */ true,		IP, /* MandatoryFirst */ true,
InlineContext{LTOPhase, InlinePass::EarlyInliner});		InlineContext{LTOPhase, InlinePass::EarlyInliner});
CGSCCPassManager &CGPipeline = MIWP.getPM();		CGSCCPassManager &CGPipeline = MIWP.getPM();

FunctionPassManager FPM;		FunctionPassManager FPM;
FPM.addPass(SROAPass());		FPM.addPass(SROAPass());
FPM.addPass(EarlyCSEPass()); // Catch trivial redundancies.		FPM.addPass(EarlyCSEPass()); // Catch trivial redundancies.
FPM.addPass(SimplifyCFGPass(SimplifyCFGOptions().convertSwitchRangeToICmp(		FPM.addPass(SimplifyCFGPass(SimplifyCFGOptions().convertSwitchRangeToICmp(
true))); // Merge & remove basic blocks.		true))); // Merge & remove basic blocks.
FPM.addPass(InstCombinePass()); // Combine silly sequences.		FPM.addPass(InstCombinePass()); // Combine silly sequences.
invokePeepholeEPCallbacks(FPM, Level);		invokePeepholeEPCallbacks(FPM, Level);

CGPipeline.addPass(createCGSCCToFunctionPassAdaptor(		CGPipeline.addPass(createCGSCCToFunctionPassAdaptor(
std::move(FPM), PTO.EagerlyInvalidateAnalyses));		std::move(FPM), PTO.EagerlyInvalidateAnalyses));

▲ Show 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	PassBuilder::buildInlinerPipeline(OptimizationLevel Level,
// prologue / epilogue.		// prologue / epilogue.
if (Phase == ThinOrFullLTOPhase::ThinLTOPreLink && PGOOpt &&		if (Phase == ThinOrFullLTOPhase::ThinLTOPreLink && PGOOpt &&
PGOOpt->Action == PGOOptions::SampleUse)		PGOOpt->Action == PGOOptions::SampleUse)
IP.HotCallSiteThreshold = 0;		IP.HotCallSiteThreshold = 0;

if (PGOOpt)		if (PGOOpt)
IP.EnableDeferral = EnablePGOInlineDeferral;		IP.EnableDeferral = EnablePGOInlineDeferral;

ModuleInlinerWrapperPass MIWP(		ModuleInlinerWrapperPass MIWP(IP, PerformMandatoryInliningsFirst,
IP, PerformMandatoryInliningsFirst,
InlineContext{Phase, InlinePass::CGSCCInliner},		InlineContext{Phase, InlinePass::CGSCCInliner},
UseInlineAdvisor, MaxDevirtIterations);		UseInlineAdvisor, MaxDevirtIterations);
		mtrofinUnsubmitted Not Done Reply Inline Actions are these changes due to running clang-format (i.e. the previous change didn't?) mtrofin: are these changes due to running clang-format (i.e. the previous change didn't?)
		fhahnUnsubmitted Not Done Reply Inline Actions looks like it, usually it's better to use https://github.com/llvm-mirror/clang/blob/master/tools/clang-format/clang-format-diff.py fhahn: looks like it, usually it's better to use https://github.com/llvm…
		psamolysovAuthorUnsubmitted Done Reply Inline Actions @mtrofin Yes, this change is the result of a run of clang-format as well as the previous one. @fhahn Thank you for the suggestion. psamolysov: @mtrofin Yes, this change is the result of a run of clang-format as well as the previous one.

// Require the GlobalsAA analysis for the module so we can query it within		// Require the GlobalsAA analysis for the module so we can query it within
// the CGSCC pipeline.		// the CGSCC pipeline.
MIWP.addModulePass(RequireAnalysisPass<GlobalsAA, Module>());		MIWP.addModulePass(RequireAnalysisPass<GlobalsAA, Module>());
// Invalidate AAManager so it can be recreated and pick up the newly available		// Invalidate AAManager so it can be recreated and pick up the newly available
// GlobalsAA.		// GlobalsAA.
MIWP.addModulePass(		MIWP.addModulePass(
createModuleToFunctionPassAdaptor(InvalidateAnalysisPass<AAManager>()));		createModuleToFunctionPassAdaptor(InvalidateAnalysisPass<AAManager>()));
Show All 25 Lines	if (Level == OptimizationLevel::O3)
MainCGPipeline.addPass(ArgumentPromotionPass());		MainCGPipeline.addPass(ArgumentPromotionPass());

// Try to perform OpenMP specific optimizations. This is a (quick!) no-op if		// Try to perform OpenMP specific optimizations. This is a (quick!) no-op if
// there are no OpenMP runtime calls present in the module.		// there are no OpenMP runtime calls present in the module.
if (Level == OptimizationLevel::O2 \|\| Level == OptimizationLevel::O3)		if (Level == OptimizationLevel::O2 \|\| Level == OptimizationLevel::O3)
MainCGPipeline.addPass(OpenMPOptCGSCCPass());		MainCGPipeline.addPass(OpenMPOptCGSCCPass());

for (auto &C : CGSCCOptimizerLateEPCallbacks)		for (auto &C : CGSCCOptimizerLateEPCallbacks)
C(MainCGPipeline, Level);		C(MainCGPipeline, Level);
		nikicUnsubmitted Not Done Reply Inline Actions I don't think we need a late module pass for this, it can be part of the normal module optimization pipeline (post module simplification). nikic: I don't think we need a late module pass for this, it can be part of the normal module…
		psamolysovAuthorUnsubmitted Done Reply Inline Actions I'm afraid I didn't get your comment. What I tried is to add `DAE` with `MIWP.addModulePass` but such module passes run before call graph passes. psamolysov: I'm afraid I didn't get your comment. What I tried is to add `DAE` with `MIWP.addModulePass`…
		psamolysovAuthorUnsubmitted Done Reply Inline Actions Oh, I think I have got the idea: instead of adding the `DAE` with `MIWP.addLateModulePass()` just after adding the `ArgumentPromotion` in the `buildInlinerPipeline`, to add `DAE` after this line with usual `addPass`: MPM.addPass(buildInlinerPipeline(Level, Phase)); This makes sense when we removing the original point of adding the `DAE` pass, because the `buildInlinerPipeline()` function is called not in every case. Actually, we just moving `DAE` after inlining. psamolysov: Oh, I think I have got the idea: instead of adding the `DAE` with `MIWP.addLateModulePass()`…

// Lastly, add the core function simplification pipeline nested inside the		// Lastly, add the core function simplification pipeline nested inside the
// CGSCC walk.		// CGSCC walk.
MainCGPipeline.addPass(createCGSCCToFunctionPassAdaptor(		MainCGPipeline.addPass(createCGSCCToFunctionPassAdaptor(
buildFunctionSimplificationPipeline(Level, Phase),		buildFunctionSimplificationPipeline(Level, Phase),
PTO.EagerlyInvalidateAnalyses, EnableNoRerunSimplificationPipeline));		PTO.EagerlyInvalidateAnalyses, EnableNoRerunSimplificationPipeline));

MainCGPipeline.addPass(CoroSplitPass(Level != OptimizationLevel::O0));		MainCGPipeline.addPass(CoroSplitPass(Level != OptimizationLevel::O0));
▲ Show 20 Lines • Show All 165 Lines • ▼ Show 20 Lines	PassBuilder::buildModuleSimplificationPipeline(OptimizationLevel Level,

// Promote any localized globals to SSA registers.		// Promote any localized globals to SSA registers.
// FIXME: Should this instead by a run of SROA?		// FIXME: Should this instead by a run of SROA?
// FIXME: We should probably run instcombine and simplifycfg afterward to		// FIXME: We should probably run instcombine and simplifycfg afterward to
// delete control flows that are dead once globals have been folded to		// delete control flows that are dead once globals have been folded to
// constants.		// constants.
MPM.addPass(createModuleToFunctionPassAdaptor(PromotePass()));		MPM.addPass(createModuleToFunctionPassAdaptor(PromotePass()));

// Remove any dead arguments exposed by cleanups and constant folding
// globals.
MPM.addPass(DeadArgumentEliminationPass());

// Create a small function pass pipeline to cleanup after all the global		// Create a small function pass pipeline to cleanup after all the global
// optimizations.		// optimizations.
FunctionPassManager GlobalCleanupPM;		FunctionPassManager GlobalCleanupPM;
GlobalCleanupPM.addPass(InstCombinePass());		GlobalCleanupPM.addPass(InstCombinePass());
invokePeepholeEPCallbacks(GlobalCleanupPM, Level);		invokePeepholeEPCallbacks(GlobalCleanupPM, Level);

GlobalCleanupPM.addPass(		GlobalCleanupPM.addPass(
SimplifyCFGPass(SimplifyCFGOptions().convertSwitchRangeToICmp(true)));		SimplifyCFGPass(SimplifyCFGOptions().convertSwitchRangeToICmp(true)));
Show All 18 Lines	PassBuilder::buildModuleSimplificationPipeline(OptimizationLevel Level,
if (EnableSyntheticCounts && !PGOOpt)		if (EnableSyntheticCounts && !PGOOpt)
MPM.addPass(SyntheticCountsPropagation());		MPM.addPass(SyntheticCountsPropagation());

if (EnableModuleInliner)		if (EnableModuleInliner)
MPM.addPass(buildModuleInlinerPipeline(Level, Phase));		MPM.addPass(buildModuleInlinerPipeline(Level, Phase));
else		else
MPM.addPass(buildInlinerPipeline(Level, Phase));		MPM.addPass(buildInlinerPipeline(Level, Phase));

		// Remove any dead arguments exposed by cleanups, constant folding globals,
		// and argument promotion.
		MPM.addPass(DeadArgumentEliminationPass());

MPM.addPass(CoroCleanupPass());		MPM.addPass(CoroCleanupPass());

if (EnableMemProfiler && Phase != ThinOrFullLTOPhase::ThinLTOPreLink) {		if (EnableMemProfiler && Phase != ThinOrFullLTOPhase::ThinLTOPreLink) {
MPM.addPass(createModuleToFunctionPassAdaptor(MemProfilerPass()));		MPM.addPass(createModuleToFunctionPassAdaptor(MemProfilerPass()));
MPM.addPass(ModuleMemProfilerPass());		MPM.addPass(ModuleMemProfilerPass());
}		}

return MPM;		return MPM;
▲ Show 20 Lines • Show All 578 Lines • ▼ Show 20 Lines	PassBuilder::buildLTODefaultPipeline(OptimizationLevel Level,

// Promote any localized globals to SSA registers.		// Promote any localized globals to SSA registers.
MPM.addPass(createModuleToFunctionPassAdaptor(PromotePass()));		MPM.addPass(createModuleToFunctionPassAdaptor(PromotePass()));

// Linking modules together can lead to duplicate global constant, only		// Linking modules together can lead to duplicate global constant, only
// keep one copy of each constant.		// keep one copy of each constant.
MPM.addPass(ConstantMergePass());		MPM.addPass(ConstantMergePass());

// Remove unused arguments from functions.
MPM.addPass(DeadArgumentEliminationPass());

// Reduce the code after globalopt and ipsccp. Both can open up significant		// Reduce the code after globalopt and ipsccp. Both can open up significant
// simplification opportunities, and both can propagate functions through		// simplification opportunities, and both can propagate functions through
// function pointers. When this happens, we often have to resolve varargs		// function pointers. When this happens, we often have to resolve varargs
// calls, etc, so let instcombine do this.		// calls, etc, so let instcombine do this.
FunctionPassManager PeepholeFPM;		FunctionPassManager PeepholeFPM;
PeepholeFPM.addPass(InstCombinePass());		PeepholeFPM.addPass(InstCombinePass());
if (Level == OptimizationLevel::O3)		if (Level == OptimizationLevel::O3)
PeepholeFPM.addPass(AggressiveInstCombinePass());		PeepholeFPM.addPass(AggressiveInstCombinePass());
invokePeepholeEPCallbacks(PeepholeFPM, Level);		invokePeepholeEPCallbacks(PeepholeFPM, Level);

MPM.addPass(createModuleToFunctionPassAdaptor(std::move(PeepholeFPM),		MPM.addPass(createModuleToFunctionPassAdaptor(std::move(PeepholeFPM),
PTO.EagerlyInvalidateAnalyses));		PTO.EagerlyInvalidateAnalyses));

// Note: historically, the PruneEH pass was run first to deduce nounwind and		// Note: historically, the PruneEH pass was run first to deduce nounwind and
// generally clean up exception handling overhead. It isn't clear this is		// generally clean up exception handling overhead. It isn't clear this is
// valuable as the inliner doesn't currently care whether it is inlining an		// valuable as the inliner doesn't currently care whether it is inlining an
// invoke or a call.		// invoke or a call.
// Run the inliner now.		// Run the inliner now.
MPM.addPass(ModuleInlinerWrapperPass(		MPM.addPass(ModuleInlinerWrapperPass(
getInlineParamsFromOptLevel(Level),		getInlineParamsFromOptLevel(Level),
/* MandatoryFirst */ true,		/* MandatoryFirst */ true,
InlineContext{ThinOrFullLTOPhase::FullLTOPostLink,		InlineContext{ThinOrFullLTOPhase::FullLTOPostLink,
InlinePass::CGSCCInliner}));		InlinePass::CGSCCInliner}));
		mtrofinUnsubmitted Not Done Reply Inline Actions same clang-format q mtrofin: same clang-format q
		psamolysovAuthorUnsubmitted Done Reply Inline Actions Should I revert the changes introduced by clang-format? psamolysov: Should I revert the changes introduced by clang-format?
		psamolysovAuthorUnsubmitted Done Reply Inline Actions If these formatting changes make sense, I can introduce them in a separate patch. Should I? psamolysov: If these formatting changes make sense, I can introduce them in a separate patch. Should I?
		mtrofinUnsubmitted Not Done Reply Inline Actions They are fine here, I was just checking. Thanks! mtrofin: They are fine here, I was just checking. Thanks!

// Optimize globals again after we ran the inliner.		// Optimize globals again after we ran the inliner.
MPM.addPass(GlobalOptPass());		MPM.addPass(GlobalOptPass());

// Garbage collect dead functions.		// Garbage collect dead functions.
MPM.addPass(GlobalDCEPass());		MPM.addPass(GlobalDCEPass());

// If we didn't decide to inline a function, check to see if we can		// If we didn't decide to inline a function, check to see if we can
// transform it to pass arguments by value instead of by reference.		// transform it to pass arguments by value instead of by reference.
MPM.addPass(createModuleToPostOrderCGSCCPassAdaptor(ArgumentPromotionPass()));		MPM.addPass(createModuleToPostOrderCGSCCPassAdaptor(ArgumentPromotionPass()));

		// Remove unused arguments from functions.
		MPM.addPass(DeadArgumentEliminationPass());

FunctionPassManager FPM;		FunctionPassManager FPM;
// The IPO Passes may leave cruft around. Clean up after them.		// The IPO Passes may leave cruft around. Clean up after them.
FPM.addPass(InstCombinePass());		FPM.addPass(InstCombinePass());
invokePeepholeEPCallbacks(FPM, Level);		invokePeepholeEPCallbacks(FPM, Level);

FPM.addPass(JumpThreadingPass());		FPM.addPass(JumpThreadingPass());

// Do a post inline PGO instrumentation and use pass. This is a context		// Do a post inline PGO instrumentation and use pass. This is a context
▲ Show 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	PassBuilder::buildLTODefaultPipeline(OptimizationLevel Level,

// Remove dead memcpy()'s.		// Remove dead memcpy()'s.
MainFPM.addPass(MemCpyOptPass());		MainFPM.addPass(MemCpyOptPass());

// Nuke dead stores.		// Nuke dead stores.
MainFPM.addPass(DSEPass());		MainFPM.addPass(DSEPass());
MainFPM.addPass(MergedLoadStoreMotionPass());		MainFPM.addPass(MergedLoadStoreMotionPass());


mtrofinUnsubmitted Not Done Reply Inline Actions spurious change? mtrofin: spurious change?
psamolysovAuthorUnsubmitted Done Reply Inline Actions This change is introduced by clang-format. Should there be two empty lines between `MainFPM.addPass(MergedLoadStoreMotionPass());` and `if (EnableConstraintElimination)`? psamolysov: This change is introduced by clang-format. Should there be two empty lines between `MainFPM.
if (EnableConstraintElimination)		if (EnableConstraintElimination)
MainFPM.addPass(ConstraintEliminationPass());		MainFPM.addPass(ConstraintEliminationPass());

LoopPassManager LPM;		LoopPassManager LPM;
if (EnableLoopFlatten && Level.getSpeedupLevel() > 1)		if (EnableLoopFlatten && Level.getSpeedupLevel() > 1)
LPM.addPass(LoopFlattenPass());		LPM.addPass(LoopFlattenPass());
LPM.addPass(IndVarSimplifyPass());		LPM.addPass(IndVarSimplifyPass());
LPM.addPass(LoopDeletionPass());		LPM.addPass(LoopDeletionPass());
// FIXME: Add loop interchange.		// FIXME: Add loop interchange.

// Unroll small loops and perform peeling.		// Unroll small loops and perform peeling.
LPM.addPass(LoopFullUnrollPass(Level.getSpeedupLevel(),		LPM.addPass(LoopFullUnrollPass(Level.getSpeedupLevel(),
/* OnlyWhenForced= */ !PTO.LoopUnrolling,		/* OnlyWhenForced= */ !PTO.LoopUnrolling,
PTO.ForgetAllSCEVInLoopUnroll));		PTO.ForgetAllSCEVInLoopUnroll));
// The loop passes in LPM (LoopFullUnrollPass) do not preserve MemorySSA.		// The loop passes in LPM (LoopFullUnrollPass) do not preserve MemorySSA.
// All loop passes must preserve it, in order to be able to use it.		// All loop passes must preserve it, in order to be able to use it.
MainFPM.addPass(createFunctionToLoopPassAdaptor(		MainFPM.addPass(createFunctionToLoopPassAdaptor(
std::move(LPM), /UseMemorySSA=/false, /UseBlockFrequencyInfo=/true));		std::move(LPM), /UseMemorySSA=/false, /UseBlockFrequencyInfo=/true));

MainFPM.addPass(LoopDistributePass());		MainFPM.addPass(LoopDistributePass());

addVectorPasses(Level, MainFPM, /* IsFullLTO */ true);		addVectorPasses(Level, MainFPM, /* IsFullLTO */ true);

// Run the OpenMPOpt CGSCC pass again late.		// Run the OpenMPOpt CGSCC pass again late.
MPM.addPass(		MPM.addPass(createModuleToPostOrderCGSCCPassAdaptor(OpenMPOptCGSCCPass()));
		mtrofinUnsubmitted Not Done Reply Inline Actions was this clang-format? mtrofin: was this clang-format?
		psamolysovAuthorUnsubmitted Done Reply Inline Actions Yes, it was. psamolysov: Yes, it was.
createModuleToPostOrderCGSCCPassAdaptor(OpenMPOptCGSCCPass()));

invokePeepholeEPCallbacks(MainFPM, Level);		invokePeepholeEPCallbacks(MainFPM, Level);
MainFPM.addPass(JumpThreadingPass());		MainFPM.addPass(JumpThreadingPass());
MPM.addPass(createModuleToFunctionPassAdaptor(std::move(MainFPM),		MPM.addPass(createModuleToFunctionPassAdaptor(std::move(MainFPM),
PTO.EagerlyInvalidateAnalyses));		PTO.EagerlyInvalidateAnalyses));

// Lower type metadata and the type.test intrinsic. This pass supports		// Lower type metadata and the type.test intrinsic. This pass supports
// clang's control flow integrity mechanisms (-fsanitize=cfi*) and needs		// clang's control flow integrity mechanisms (-fsanitize=cfi*) and needs
▲ Show 20 Lines • Show All 175 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-defaults.ll

	Show First 20 Lines • Show All 106 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis			; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis
	; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass			; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass
	; CHECK-O-NEXT: Running pass: OpenMPOptPass			; CHECK-O-NEXT: Running pass: OpenMPOptPass
	; CHECK-EP-PIPELINE-EARLY-SIMPLIFICATION-NEXT: Running pass: NoOpModulePass			; CHECK-EP-PIPELINE-EARLY-SIMPLIFICATION-NEXT: Running pass: NoOpModulePass
	; CHECK-O-NEXT: Running pass: IPSCCPPass			; CHECK-O-NEXT: Running pass: IPSCCPPass
	; CHECK-O-NEXT: Running pass: CalledValuePropagationPass			; CHECK-O-NEXT: Running pass: CalledValuePropagationPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: PromotePass			; CHECK-O-NEXT: Running pass: PromotePass
	; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running analysis: OptimizationRemarkEmitterAnalysis			; CHECK-O-NEXT: Running analysis: OptimizationRemarkEmitterAnalysis
	; CHECK-O-NEXT: Running analysis: AAManager			; CHECK-O-NEXT: Running analysis: AAManager
	; CHECK-O-NEXT: Running analysis: BasicAA			; CHECK-O-NEXT: Running analysis: BasicAA
	; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA			; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA
	; CHECK-O-NEXT: Running analysis: TypeBasedAA			; CHECK-O-NEXT: Running analysis: TypeBasedAA
	; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-EP-PEEPHOLE-NEXT: Running pass: NoOpFunctionPass			; CHECK-EP-PEEPHOLE-NEXT: Running pass: NoOpFunctionPass
	▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines
	; CHECK-O23SZ-NEXT: Running pass: LICMPass			; CHECK-O23SZ-NEXT: Running pass: LICMPass
	; CHECK-O23SZ-NEXT: Running pass: CoroElidePass			; CHECK-O23SZ-NEXT: Running pass: CoroElidePass
	; CHECK-EP-SCALAR-LATE-NEXT: Running pass: NoOpFunctionPass			; CHECK-EP-SCALAR-LATE-NEXT: Running pass: NoOpFunctionPass
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-EP-PEEPHOLE-NEXT: Running pass: NoOpFunctionPass			; CHECK-EP-PEEPHOLE-NEXT: Running pass: NoOpFunctionPass
	; CHECK-O-NEXT: Running pass: CoroSplitPass			; CHECK-O-NEXT: Running pass: CoroSplitPass
	; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis			; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis
				; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: CoroCleanupPass			; CHECK-O-NEXT: Running pass: CoroCleanupPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: GlobalDCEPass			; CHECK-O-NEXT: Running pass: GlobalDCEPass
	; CHECK-DEFAULT-NEXT: Running pass: EliminateAvailableExternallyPass			; CHECK-DEFAULT-NEXT: Running pass: EliminateAvailableExternallyPass
	; CHECK-LTO-NOT: Running pass: EliminateAvailableExternallyPass			; CHECK-LTO-NOT: Running pass: EliminateAvailableExternallyPass
	; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass			; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass
	; CHECK-O-NEXT: Running pass: RecomputeGlobalsAAPass			; CHECK-O-NEXT: Running pass: RecomputeGlobalsAAPass
	; CHECK-EP-OPTIMIZER-EARLY: Running pass: NoOpModulePass			; CHECK-EP-OPTIMIZER-EARLY: Running pass: NoOpModulePass
	▲ Show 20 Lines • Show All 80 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-lto-defaults.ll

	Show First 20 Lines • Show All 69 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass			; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass
	; CHECK-O-NEXT: Running analysis: CallGraphAnalysis			; CHECK-O-NEXT: Running analysis: CallGraphAnalysis
	; CHECK-O-NEXT: Running pass: GlobalSplitPass			; CHECK-O-NEXT: Running pass: GlobalSplitPass
	; CHECK-O-NEXT: Running pass: WholeProgramDevirtPass			; CHECK-O-NEXT: Running pass: WholeProgramDevirtPass
	; CHECK-O1-NEXT: Running pass: LowerTypeTestsPass			; CHECK-O1-NEXT: Running pass: LowerTypeTestsPass
	; CHECK-O23SZ-NEXT: Running pass: GlobalOptPass			; CHECK-O23SZ-NEXT: Running pass: GlobalOptPass
	; CHECK-O23SZ-NEXT: Running pass: PromotePass			; CHECK-O23SZ-NEXT: Running pass: PromotePass
	; CHECK-O23SZ-NEXT: Running pass: ConstantMergePass			; CHECK-O23SZ-NEXT: Running pass: ConstantMergePass
	; CHECK-O23SZ-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O23SZ-NEXT: Running pass: InstCombinePass			; CHECK-O23SZ-NEXT: Running pass: InstCombinePass
	; CHECK-O3-NEXT: Running pass: AggressiveInstCombinePass			; CHECK-O3-NEXT: Running pass: AggressiveInstCombinePass
	; CHECK-EP-Peephole-NEXT: Running pass: NoOpFunctionPass			; CHECK-EP-Peephole-NEXT: Running pass: NoOpFunctionPass
	; CHECK-O23SZ-NEXT: Running pass: ModuleInlinerWrapperPass			; CHECK-O23SZ-NEXT: Running pass: ModuleInlinerWrapperPass
	; CHECK-O23SZ-NEXT: Running analysis: InlineAdvisorAnalysis			; CHECK-O23SZ-NEXT: Running analysis: InlineAdvisorAnalysis
	; CHECK-O23SZ-NEXT: Running pass: InlinerPass			; CHECK-O23SZ-NEXT: Running pass: InlinerPass
	; CHECK-O23SZ-NEXT: Running pass: InlinerPass			; CHECK-O23SZ-NEXT: Running pass: InlinerPass
	; CHECK-O23SZ-NEXT: Invalidating analysis: InlineAdvisorAnalysis			; CHECK-O23SZ-NEXT: Invalidating analysis: InlineAdvisorAnalysis
	; CHECK-O23SZ-NEXT: Running pass: GlobalOptPass			; CHECK-O23SZ-NEXT: Running pass: GlobalOptPass
	; CHECK-O23SZ-NEXT: Running pass: GlobalDCEPass			; CHECK-O23SZ-NEXT: Running pass: GlobalDCEPass
	; CHECK-O23SZ-NEXT: Running pass: ArgumentPromotionPass			; CHECK-O23SZ-NEXT: Running pass: ArgumentPromotionPass
				; CHECK-O23SZ-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O23SZ-NEXT: Running pass: InstCombinePass			; CHECK-O23SZ-NEXT: Running pass: InstCombinePass
	; CHECK-EP-Peephole-NEXT: Running pass: NoOpFunctionPass			; CHECK-EP-Peephole-NEXT: Running pass: NoOpFunctionPass
	; CHECK-O23SZ-NEXT: Running pass: JumpThreadingPass			; CHECK-O23SZ-NEXT: Running pass: JumpThreadingPass
	; CHECK-O23SZ-NEXT: Running analysis: LazyValueAnalysis			; CHECK-O23SZ-NEXT: Running analysis: LazyValueAnalysis
	; CHECK-O23SZ-NEXT: Running pass: SROAPass on foo			; CHECK-O23SZ-NEXT: Running pass: SROAPass on foo
	; CHECK-O23SZ-NEXT: Running pass: TailCallElimPass on foo			; CHECK-O23SZ-NEXT: Running pass: TailCallElimPass on foo
	; CHECK-O23SZ-NEXT: Running pass: PostOrderFunctionAttrsPass on (foo)			; CHECK-O23SZ-NEXT: Running pass: PostOrderFunctionAttrsPass on (foo)
	; CHECK-O23SZ-NEXT: Running pass: RequireAnalysisPass<{{.*}}GlobalsAA			; CHECK-O23SZ-NEXT: Running pass: RequireAnalysisPass<{{.*}}GlobalsAA
	▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-thinlto-defaults.ll

	Show First 20 Lines • Show All 72 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis			; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis
	; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass			; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass
	; CHECK-O-NEXT: Running pass: OpenMPOptPass			; CHECK-O-NEXT: Running pass: OpenMPOptPass
	; CHECK-POSTLINK-O-NEXT: Running pass: LowerTypeTestsPass			; CHECK-POSTLINK-O-NEXT: Running pass: LowerTypeTestsPass
	; CHECK-O-NEXT: Running pass: IPSCCPPass			; CHECK-O-NEXT: Running pass: IPSCCPPass
	; CHECK-O-NEXT: Running pass: CalledValuePropagationPass			; CHECK-O-NEXT: Running pass: CalledValuePropagationPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: PromotePass			; CHECK-O-NEXT: Running pass: PromotePass
	; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-PRELINK-O-NEXT: Running analysis: OptimizationRemarkEmitterAnalysis			; CHECK-PRELINK-O-NEXT: Running analysis: OptimizationRemarkEmitterAnalysis
	; CHECK-O-NEXT: Running analysis: AAManager			; CHECK-O-NEXT: Running analysis: AAManager
	; CHECK-O-NEXT: Running analysis: BasicAA			; CHECK-O-NEXT: Running analysis: BasicAA
	; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA			; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA
	; CHECK-O-NEXT: Running analysis: TypeBasedAA			; CHECK-O-NEXT: Running analysis: TypeBasedAA
	; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	▲ Show 20 Lines • Show All 89 Lines • ▼ Show 20 Lines
	; CHECK-O23SZ-NEXT: Running pass: LoopSimplifyPass			; CHECK-O23SZ-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O23SZ-NEXT: Running pass: LCSSAPass			; CHECK-O23SZ-NEXT: Running pass: LCSSAPass
	; CHECK-O23SZ-NEXT: Running pass: LICMPass on Loop at depth 1 containing: %loop			; CHECK-O23SZ-NEXT: Running pass: LICMPass on Loop at depth 1 containing: %loop
	; CHECK-O23SZ-NEXT: Running pass: CoroElidePass			; CHECK-O23SZ-NEXT: Running pass: CoroElidePass
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: CoroSplitPass			; CHECK-O-NEXT: Running pass: CoroSplitPass
	; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis			; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis
				; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: CoroCleanupPass			; CHECK-O-NEXT: Running pass: CoroCleanupPass
	; CHECK-PRELINK-O-NEXT: Running pass: GlobalOptPass			; CHECK-PRELINK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-POSTLINK-O-NEXT: Running pass: GlobalOptPass			; CHECK-POSTLINK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-POSTLINK-O-NEXT: Running pass: GlobalDCEPass			; CHECK-POSTLINK-O-NEXT: Running pass: GlobalDCEPass
	; CHECK-POSTLINK-O-NEXT: Running pass: EliminateAvailableExternallyPass			; CHECK-POSTLINK-O-NEXT: Running pass: EliminateAvailableExternallyPass
	; CHECK-POSTLINK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass			; CHECK-POSTLINK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass
	; CHECK-POSTLINK-O-NEXT: Running pass: RecomputeGlobalsAAPass			; CHECK-POSTLINK-O-NEXT: Running pass: RecomputeGlobalsAAPass
	; CHECK-POSTLINK-O-NEXT: Running pass: Float2IntPass			; CHECK-POSTLINK-O-NEXT: Running pass: Float2IntPass
	▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-thinlto-postlink-pgo-defaults.ll

	Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis			; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis
	; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass			; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass
	; CHECK-O-NEXT: Running pass: OpenMPOptPass			; CHECK-O-NEXT: Running pass: OpenMPOptPass
	; CHECK-O-NEXT: Running pass: LowerTypeTestsPass			; CHECK-O-NEXT: Running pass: LowerTypeTestsPass
	; CHECK-O-NEXT: Running pass: IPSCCPPass			; CHECK-O-NEXT: Running pass: IPSCCPPass
	; CHECK-O-NEXT: Running pass: CalledValuePropagationPass			; CHECK-O-NEXT: Running pass: CalledValuePropagationPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: PromotePass			; CHECK-O-NEXT: Running pass: PromotePass
	; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running analysis: AAManager			; CHECK-O-NEXT: Running analysis: AAManager
	; CHECK-O-NEXT: Running analysis: BasicAA			; CHECK-O-NEXT: Running analysis: BasicAA
	; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA			; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA
	; CHECK-O-NEXT: Running analysis: TypeBasedAA			; CHECK-O-NEXT: Running analysis: TypeBasedAA
	; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running analysis: BlockFrequencyAnalysis on foo			; CHECK-O-NEXT: Running analysis: BlockFrequencyAnalysis on foo
	; These next two can appear in any order since they are accessed as parameters			; These next two can appear in any order since they are accessed as parameters
	▲ Show 20 Lines • Show All 91 Lines • ▼ Show 20 Lines
	; CHECK-O23SZ-NEXT: Running pass: LoopSimplifyPass			; CHECK-O23SZ-NEXT: Running pass: LoopSimplifyPass
	; CHECK-O23SZ-NEXT: Running pass: LCSSAPass			; CHECK-O23SZ-NEXT: Running pass: LCSSAPass
	; CHECK-O23SZ-NEXT: Running pass: LICMPass			; CHECK-O23SZ-NEXT: Running pass: LICMPass
	; CHECK-O23SZ-NEXT: Running pass: CoroElidePass			; CHECK-O23SZ-NEXT: Running pass: CoroElidePass
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running pass: CoroSplitPass			; CHECK-O-NEXT: Running pass: CoroSplitPass
	; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis			; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis
				; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: CoroCleanupPass			; CHECK-O-NEXT: Running pass: CoroCleanupPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: GlobalDCEPass			; CHECK-O-NEXT: Running pass: GlobalDCEPass
	; CHECK-O-NEXT: Running pass: EliminateAvailableExternallyPass			; CHECK-O-NEXT: Running pass: EliminateAvailableExternallyPass
	; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass			; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass
	; CHECK-O-NEXT: Running pass: RecomputeGlobalsAAPass			; CHECK-O-NEXT: Running pass: RecomputeGlobalsAAPass
	; CHECK-O-NEXT: Running pass: Float2IntPass			; CHECK-O-NEXT: Running pass: Float2IntPass
	; CHECK-O-NEXT: Running pass: LowerConstantIntrinsicsPass			; CHECK-O-NEXT: Running pass: LowerConstantIntrinsicsPass
	▲ Show 20 Lines • Show All 97 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-thinlto-postlink-samplepgo-defaults.ll

	Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running pass: RequireAnalysisPass<{{.*}}ProfileSummaryAnalysis			; CHECK-O-NEXT: Running pass: RequireAnalysisPass<{{.*}}ProfileSummaryAnalysis
	; CHECK-O-NEXT: Running pass: PGOIndirectCallPromotion			; CHECK-O-NEXT: Running pass: PGOIndirectCallPromotion
	; CHECK-O-NEXT: Running pass: OpenMPOptPass			; CHECK-O-NEXT: Running pass: OpenMPOptPass
	; CHECK-O-NEXT: Running pass: LowerTypeTestsPass			; CHECK-O-NEXT: Running pass: LowerTypeTestsPass
	; CHECK-O-NEXT: Running pass: IPSCCPPass			; CHECK-O-NEXT: Running pass: IPSCCPPass
	; CHECK-O-NEXT: Running pass: CalledValuePropagationPass			; CHECK-O-NEXT: Running pass: CalledValuePropagationPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: PromotePass			; CHECK-O-NEXT: Running pass: PromotePass
	; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running analysis: BlockFrequencyAnalysis on foo			; CHECK-O-NEXT: Running analysis: BlockFrequencyAnalysis on foo
	; These next two can appear in any order since they are accessed as parameters			; These next two can appear in any order since they are accessed as parameters
	; on the same call to BlockFrequencyInfo::calculate.			; on the same call to BlockFrequencyInfo::calculate.
	; CHECK-O-DAG: Running analysis: LoopAnalysis on foo			; CHECK-O-DAG: Running analysis: LoopAnalysis on foo
	; CHECK-O-DAG: Running analysis: BranchProbabilityAnalysis on foo			; CHECK-O-DAG: Running analysis: BranchProbabilityAnalysis on foo
	; CHECK-O-NEXT: Running analysis: PostDominatorTreeAnalysis on foo			; CHECK-O-NEXT: Running analysis: PostDominatorTreeAnalysis on foo
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass on foo			; CHECK-O-NEXT: Running pass: SimplifyCFGPass on foo
	▲ Show 20 Lines • Show All 89 Lines • ▼ Show 20 Lines
	; CHECK-O23SZ-NEXT: Running pass: CoroElidePass			; CHECK-O23SZ-NEXT: Running pass: CoroElidePass
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O3-NEXT: Running pass: ControlHeightReductionPass on foo			; CHECK-O3-NEXT: Running pass: ControlHeightReductionPass on foo
	; CHECK-O3-NEXT: Running analysis: RegionInfoAnalysis on foo			; CHECK-O3-NEXT: Running analysis: RegionInfoAnalysis on foo
	; CHECK-O3-NEXT: Running analysis: DominanceFrontierAnalysis on foo			; CHECK-O3-NEXT: Running analysis: DominanceFrontierAnalysis on foo
	; CHECK-O-NEXT: Running pass: CoroSplitPass			; CHECK-O-NEXT: Running pass: CoroSplitPass
	; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis			; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis
				; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: CoroCleanupPass			; CHECK-O-NEXT: Running pass: CoroCleanupPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: GlobalDCEPass			; CHECK-O-NEXT: Running pass: GlobalDCEPass
	; CHECK-O-NEXT: Running pass: EliminateAvailableExternallyPass			; CHECK-O-NEXT: Running pass: EliminateAvailableExternallyPass
	; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass			; CHECK-O-NEXT: Running pass: ReversePostOrderFunctionAttrsPass
	; CHECK-O-NEXT: Running pass: RecomputeGlobalsAAPass			; CHECK-O-NEXT: Running pass: RecomputeGlobalsAAPass
	; CHECK-O-NEXT: Running pass: Float2IntPass			; CHECK-O-NEXT: Running pass: Float2IntPass
	; CHECK-O-NEXT: Running pass: LowerConstantIntrinsicsPass			; CHECK-O-NEXT: Running pass: LowerConstantIntrinsicsPass
	▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

llvm/test/Other/new-pm-thinlto-prelink-pgo-defaults.ll

	Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running pass: EarlyCSEPass			; CHECK-O-NEXT: Running pass: EarlyCSEPass
	; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis			; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis
	; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass			; CHECK-O3-NEXT: Running pass: CallSiteSplittingPass
	; CHECK-O-NEXT: Running pass: OpenMPOptPass			; CHECK-O-NEXT: Running pass: OpenMPOptPass
	; CHECK-O-NEXT: Running pass: IPSCCPPass			; CHECK-O-NEXT: Running pass: IPSCCPPass
	; CHECK-O-NEXT: Running pass: CalledValuePropagationPass			; CHECK-O-NEXT: Running pass: CalledValuePropagationPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: PromotePass			; CHECK-O-NEXT: Running pass: PromotePass
	; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running analysis: OptimizationRemarkEmitterAnalysis			; CHECK-O-NEXT: Running analysis: OptimizationRemarkEmitterAnalysis
	; CHECK-O-NEXT: Running analysis: AAManager			; CHECK-O-NEXT: Running analysis: AAManager
	; CHECK-O-NEXT: Running analysis: BasicAA			; CHECK-O-NEXT: Running analysis: BasicAA
	; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA			; CHECK-O-NEXT: Running analysis: ScopedNoAliasAA
	; CHECK-O-NEXT: Running analysis: TypeBasedAA			; CHECK-O-NEXT: Running analysis: TypeBasedAA
	; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy			; CHECK-O-NEXT: Running analysis: OuterAnalysisManagerProxy
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines
	; CHECK-O23SZ-NEXT: Running pass: CoroElidePass			; CHECK-O23SZ-NEXT: Running pass: CoroElidePass
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O3-NEXT: Running pass: ControlHeightReductionPass on foo			; CHECK-O3-NEXT: Running pass: ControlHeightReductionPass on foo
	; CHECK-O3-NEXT: Running analysis: RegionInfoAnalysis on foo			; CHECK-O3-NEXT: Running analysis: RegionInfoAnalysis on foo
	; CHECK-O3-NEXT: Running analysis: DominanceFrontierAnalysis on foo			; CHECK-O3-NEXT: Running analysis: DominanceFrontierAnalysis on foo
	; CHECK-O-NEXT: Running pass: CoroSplitPass			; CHECK-O-NEXT: Running pass: CoroSplitPass
	; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis			; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis
				; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: CoroCleanupPass			; CHECK-O-NEXT: Running pass: CoroCleanupPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis on bar			; CHECK-O-NEXT: Running analysis: TargetLibraryAnalysis on bar
	; CHECK-EXT: Running pass: {{.*}}::Bye			; CHECK-EXT: Running pass: {{.*}}::Bye
	; CHECK-O-NEXT: Running pass: AnnotationRemarksPass on foo			; CHECK-O-NEXT: Running pass: AnnotationRemarksPass on foo
	; CHECK-O-NEXT: Running pass: CanonicalizeAliasesPass			; CHECK-O-NEXT: Running pass: CanonicalizeAliasesPass
	; CHECK-O-NEXT: Running pass: NameAnonGlobalPass			; CHECK-O-NEXT: Running pass: NameAnonGlobalPass
	; CHECK-O-NEXT: Running pass: PrintModulePass			; CHECK-O-NEXT: Running pass: PrintModulePass
	Show All 32 Lines

llvm/test/Other/new-pm-thinlto-prelink-samplepgo-defaults.ll

	Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines
	; CHECK-O-NEXT: Running analysis: ProfileSummaryAnalysis			; CHECK-O-NEXT: Running analysis: ProfileSummaryAnalysis
	; CHECK-O-NEXT: Running analysis: CallGraphAnalysis			; CHECK-O-NEXT: Running analysis: CallGraphAnalysis
	; CHECK-O-NEXT: Running pass: RequireAnalysisPass<{{.*}}ProfileSummaryAnalysis			; CHECK-O-NEXT: Running pass: RequireAnalysisPass<{{.*}}ProfileSummaryAnalysis
	; CHECK-O-NEXT: Running pass: OpenMPOptPass			; CHECK-O-NEXT: Running pass: OpenMPOptPass
	; CHECK-O-NEXT: Running pass: IPSCCPPass			; CHECK-O-NEXT: Running pass: IPSCCPPass
	; CHECK-O-NEXT: Running pass: CalledValuePropagationPass			; CHECK-O-NEXT: Running pass: CalledValuePropagationPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: PromotePass			; CHECK-O-NEXT: Running pass: PromotePass
	; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O-NEXT: Running analysis: BlockFrequencyAnalysis on foo			; CHECK-O-NEXT: Running analysis: BlockFrequencyAnalysis on foo
	; These next two can appear in any order since they are accessed as parameters			; These next two can appear in any order since they are accessed as parameters
	; on the same call to BlockFrequencyInfo::calculate.			; on the same call to BlockFrequencyInfo::calculate.
	; CHECK-O-DAG: Running analysis: LoopAnalysis on foo			; CHECK-O-DAG: Running analysis: LoopAnalysis on foo
	; CHECK-O-DAG: Running analysis: BranchProbabilityAnalysis on foo			; CHECK-O-DAG: Running analysis: BranchProbabilityAnalysis on foo
	; CHECK-O-NEXT: Running analysis: PostDominatorTreeAnalysis on foo			; CHECK-O-NEXT: Running analysis: PostDominatorTreeAnalysis on foo
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass on foo			; CHECK-O-NEXT: Running pass: SimplifyCFGPass on foo
	▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines
	; CHECK-O23SZ-NEXT: Running pass: CoroElidePass			; CHECK-O23SZ-NEXT: Running pass: CoroElidePass
	; CHECK-O-NEXT: Running pass: SimplifyCFGPass			; CHECK-O-NEXT: Running pass: SimplifyCFGPass
	; CHECK-O-NEXT: Running pass: InstCombinePass			; CHECK-O-NEXT: Running pass: InstCombinePass
	; CHECK-O3-NEXT: Running pass: ControlHeightReductionPass on foo			; CHECK-O3-NEXT: Running pass: ControlHeightReductionPass on foo
	; CHECK-O3-NEXT: Running analysis: RegionInfoAnalysis on foo			; CHECK-O3-NEXT: Running analysis: RegionInfoAnalysis on foo
	; CHECK-O3-NEXT: Running analysis: DominanceFrontierAnalysis on foo			; CHECK-O3-NEXT: Running analysis: DominanceFrontierAnalysis on foo
	; CHECK-O-NEXT: Running pass: CoroSplitPass			; CHECK-O-NEXT: Running pass: CoroSplitPass
	; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis			; CHECK-O-NEXT: Invalidating analysis: InlineAdvisorAnalysis
				; CHECK-O-NEXT: Running pass: DeadArgumentEliminationPass
	; CHECK-O-NEXT: Running pass: CoroCleanupPass			; CHECK-O-NEXT: Running pass: CoroCleanupPass
	; CHECK-O-NEXT: Running pass: GlobalOptPass			; CHECK-O-NEXT: Running pass: GlobalOptPass
	; CHECK-O-NEXT: Running pass: AnnotationRemarksPass on foo			; CHECK-O-NEXT: Running pass: AnnotationRemarksPass on foo
	; CHECK-O-NEXT: Running pass: CanonicalizeAliasesPass			; CHECK-O-NEXT: Running pass: CanonicalizeAliasesPass
	; CHECK-O-NEXT: Running pass: NameAnonGlobalPass			; CHECK-O-NEXT: Running pass: NameAnonGlobalPass
	; CHECK-O-NEXT: Running pass: PrintModulePass			; CHECK-O-NEXT: Running pass: PrintModulePass

	; Make sure we get the IR back out without changes when we print the module.			; Make sure we get the IR back out without changes when we print the module.
	Show All 30 Lines

llvm/test/Transforms/InstCombine/unused-nonnull.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature
	; RUN: opt -S -O3 -o - %s \| FileCheck %s			; RUN: opt -S -O3 -o - %s \| FileCheck %s

	; PR44154: LLVM c3b06d0c393e caused the body of @main to be replaced with			; PR44154: LLVM c3b06d0c393e caused the body of @main to be replaced with
	; unreachable. Check that we perform the expected calls and optimizations.			; unreachable. Check that we perform the expected calls and optimizations.

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define i32 @main(i32 %argc, i8** %argv) #0 {			define i32 @main(i32 %argc, i8** %argv) #0 {
	; CHECK-LABEL: define {{[^@]+}}@main			; CHECK-LABEL: define {{[^@]+}}@main
	; CHECK-SAME: (i32 [[ARGC:%.]], i8* nocapture readnone [[ARGV:%.*]]) local_unnamed_addr #[[ATTR0:[0-9]+]] {			; CHECK-SAME: (i32 [[ARGC:%.]], i8* nocapture readonly [[ARGV:%.*]]) local_unnamed_addr #[[ATTR0:[0-9]+]] {
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.*]] = icmp slt i32 [[ARGC]], 2			; CHECK-NEXT: [[TMP0:%.*]] = icmp slt i32 [[ARGC]], 2
	; CHECK-NEXT: [[SPEC_SELECT:%.*]] = select i1 [[TMP0]], i32 0, i32 [[ARGC]]			; CHECK-NEXT: [[SPEC_SELECT:%.*]] = select i1 [[TMP0]], i32 0, i32 [[ARGC]]
	; CHECK-NEXT: ret i32 [[SPEC_SELECT]]			; CHECK-NEXT: ret i32 [[SPEC_SELECT]]
	;			;
	entry:			entry:
	%0 = getelementptr inbounds i8, i8* %argv, i32 0			%0 = getelementptr inbounds i8, i8* %argv, i32 0
	%ptr = load i8, i8* %0			%ptr = load i8, i8* %0
	Show All 11 Lines

	done:			done:
	%retval = phi i32 [0, %entry], [%1, %do_work], [%1, %null]			%retval = phi i32 [0, %entry], [%1, %do_work], [%1, %null]
	ret i32 %retval			ret i32 %retval
	}			}

	define i32 @compute(i8* noundef nonnull %ptr, i32 %x) #1 {			define i32 @compute(i8* noundef nonnull %ptr, i32 %x) #1 {
	; CHECK-LABEL: define {{[^@]+}}@compute			; CHECK-LABEL: define {{[^@]+}}@compute
	; CHECK-SAME: (i8* nocapture nonnull readnone [[PTR:%.]], i32 returned [[X:%.]]) local_unnamed_addr #[[ATTR1:[0-9]+]] {			; CHECK-SAME: (i8* nocapture noundef nonnull readnone [[PTR:%.]], i32 returned [[X:%.]])
				; CHECK-SAME: local_unnamed_addr #[[ATTR1:[0-9]+]] {
	; CHECK-NEXT: ret i32 [[X]]			; CHECK-NEXT: ret i32 [[X]]
	;			;
	ret i32 %x			ret i32 %x
	}			}

	declare void @call_if_null(i8* %ptr) #0			declare void @call_if_null(i8* %ptr) #0

	attributes #0 = { nounwind }			attributes #0 = { nounwind }
	attributes #1 = { noinline nounwind readonly }			attributes #1 = { noinline nounwind readonly }

llvm/test/Transforms/PhaseOrdering/dce-after-argument-promotion.ll

	; RUN: opt -O3 -S < %s \| FileCheck %s			; RUN: opt -O3 -S < %s \| FileCheck %s

	; Arg promotion eliminates the struct argument but may leave dead arguments after its work			; Arg promotion eliminates the struct argument but may leave dead arguments after its work

	%struct.ss = type { i32, i64 }			%struct.ss = type { i32, i64 }

	@dummy = global i32 0			@dummy = global i32 0
	; CHECK: [[DUMMY:@.*]] = local_unnamed_addr global i32 0			; CHECK: [[DUMMY:@.*]] = local_unnamed_addr global i32 0

	define internal void @f(%struct.ss* byval(%struct.ss) align 8 %b, i32* byval(i32) align 4 %X) noinline nounwind {			define internal void @f(%struct.ss* byval(%struct.ss) align 8 %b, i32* byval(i32) align 4 %X) noinline nounwind {
	; CHECK-LABEL: define {{[^@]+}}@f			; CHECK-LABEL: define {{[^@]+}}@f
	; CHECK-SAME: (i32 [[B_0:%.]], i32 [[X:%.]]){{[^#]*}} #[[ATTR0:[0-9]+]] {			; CHECK-SAME: (i32 [[B_0:%.]]){{[^#]}} #[[ATTR0:[0-9]+]] {
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TEMP:%.*]] = add i32 [[B_0]], 1			; CHECK-NEXT: [[TEMP:%.*]] = add i32 [[B_0]], 1
	; CHECK-NEXT: store i32 [[TEMP]], i32* [[DUMMY]], align 4			; CHECK-NEXT: store i32 [[TEMP]], i32* [[DUMMY]], align 4
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%temp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0			%temp = getelementptr %struct.ss, %struct.ss* %b, i32 0, i32 0
	%temp1 = load i32, i32* %temp, align 4			%temp1 = load i32, i32* %temp, align 4
	%temp2 = add i32 %temp1, 1			%temp2 = add i32 %temp1, 1
	store i32 %temp2, i32* @dummy			store i32 %temp2, i32* @dummy
	store i32 %temp2, i32* %X			store i32 %temp2, i32* %X
	ret void			ret void
	}			}

	define i32 @test(i32* %X) {			define i32 @test(i32* %X) {
	; CHECK-LABEL: define {{[^@]+}}@test			; CHECK-LABEL: define {{[^@]+}}@test
	; CHECK-SAME: (i32* {{[^%]}} [[X:%.]]){{[^#]*}} #[[ATTR1:[0-9]+]] {			; CHECK-SAME: (i32* {{[^%]}} [[X:%.]]){{[^#]*}} #[[ATTR1:[0-9]+]] {
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[X_VAL:%.]] = load i32, i32 [[X]], align 4			; CHECK-NEXT: tail call {{.*}}void @f(i32 1)
	; CHECK-NEXT: tail call {{.*}}void @f(i32 1, i32 [[X_VAL]])
	; CHECK-NEXT: ret i32 0			; CHECK-NEXT: ret i32 0
	;			;
	entry:			entry:
	%S = alloca %struct.ss, align 8			%S = alloca %struct.ss, align 8
	%temp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0			%temp1 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 0
	store i32 1, i32* %temp1, align 8			store i32 1, i32* %temp1, align 8
	%temp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1			%temp4 = getelementptr %struct.ss, %struct.ss* %S, i32 0, i32 1
	store i64 2, i64* %temp4, align 4			store i64 2, i64* %temp4, align 4
	call void @f( %struct.ss* byval(%struct.ss) align 8 %S, i32* byval(i32) align 4 %X)			call void @f( %struct.ss* byval(%struct.ss) align 8 %S, i32* byval(i32) align 4 %X)
	ret i32 0			ret i32 0
	}			}

This is an archive of the discontinued LLVM Phabricator instance.

[Pipelines] Introduce DAE after ArgumentPromotionClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 443860

clang/test/CodeGen/thinlto-distributed-newpm.ll

llvm/lib/Passes/PassBuilderPipelines.cpp

llvm/test/Other/new-pm-defaults.ll

llvm/test/Other/new-pm-lto-defaults.ll

llvm/test/Other/new-pm-thinlto-defaults.ll

llvm/test/Other/new-pm-thinlto-postlink-pgo-defaults.ll

llvm/test/Other/new-pm-thinlto-postlink-samplepgo-defaults.ll

llvm/test/Other/new-pm-thinlto-prelink-pgo-defaults.ll

llvm/test/Other/new-pm-thinlto-prelink-samplepgo-defaults.ll

llvm/test/Transforms/InstCombine/unused-nonnull.ll

llvm/test/Transforms/PhaseOrdering/dce-after-argument-promotion.ll

[Pipelines] Introduce DAE after ArgumentPromotion
ClosedPublic