This is an archive of the discontinued LLVM Phabricator instance.

[CodeGen] Disable tail calls at -O0/-O1
Changes Planned · Public

Authored by aeubanks on Aug 24 2022, 4:41 PM.

Details

Summary

-O1 is designed to impact debuggability as little as possible and tail
calls hurt debuggability, so turn off tail calls at -O0/-O1.

Add a new pass that adds the "disable-tail-calls"="true" function
attribute, which disables generating (non-musttail) tail calls in that
function. Add this pass to the -O0/-O1 codegen pipeline.

Motivation: D130374 inferred more tail calls even in -O1, causing
various internal symbolizers to regress.
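For illustration, here is a minimal IR sketch (function names invented) of what a function looks like once it carries the attribute; "disable-tail-calls" is the existing string attribute that instruction selection already honors:

```llvm
; After the new pass, the function carries the attribute.
declare void @callee()

define void @caller() "disable-tail-calls"="true" {
  ; Even with the `tail` marker, ISel emits a regular call here, keeping
  ; @caller's frame on the stack (musttail would still be honored).
  tail call void @callee()
  ret void
}
```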

Diff Detail

Event Timeline

aeubanks created this revision. Aug 24 2022, 4:41 PM
Herald added a project: Restricted Project. Aug 24 2022, 4:41 PM
aeubanks requested review of this revision. Aug 24 2022, 4:41 PM

The goal clearly makes sense to me, and I think this is a reasonable way to do it. Another option is to implement this in the frontend the way that we implement optnone at -O0.

+@probinson @dblaikie, this is kind of -Og / -O1 related if you have thoughts on how that should be handled.

> The goal clearly makes sense to me, and I think this is a reasonable way to do it. Another option is to implement this in the frontend the way that we implement optnone at -O0.
>
> +@probinson @dblaikie, this is kind of -Og / -O1 related if you have thoughts on how that should be handled.

I think it'd be good to have some more words about how exactly this improves debuggability - because at least at a cursory reading I was/am confused - DWARFv5 has debug info to more accurately model tail calls so back traces can be provided. Does that not work in some cases? If this is about debuggability in the absence of debug info - I'm not sure I'm in support of this direction. Tail calls lose stack trace info in a similar way to inlining, and we don't disable inlining at -O1 - so is there some reasoning that differentiates the impact of these cases & justifies treating one differently than the other?

> causing various internal symbolizers to regress.

Regress in what way?

> DWARFv5 has debug info to more accurately model tail calls so back traces can be provided. Does that not work in some cases?

If you're three levels down in a tail-call chain, I can imagine it would take a lot of effort to reconstruct the call sequence, even with the tail-call info.

nikic added a subscriber: nikic. Aug 29 2022, 11:51 AM

Why does this need to use an attribute rather than querying optimization level in the backend?

> Why does this need to use an attribute rather than querying optimization level in the backend?

The attribute already exists and is used in several places. This patch just adds the attribute to more functions.

rnk added a comment. Aug 31 2022, 11:02 AM

> Why does this need to use an attribute rather than querying optimization level in the backend?

It's also generally consistent with how LLVM has handled O0, Os, and Oz to date, as attributes: optnone, minsize, optsize. That was in turn driven by a desire to allow mixing O[2zs0] objects during classic (fat?) LTO, and have things work as they did in traditional compilation.

> Why does this need to use an attribute rather than querying optimization level in the backend?
>
> The attribute already exists and is used in several places. This patch just adds the attribute to more functions.

Well, really, this is querying the optimization level in the backend; that's what the code in TargetPassConfig::addISelPrepare does. The reason it feels weird to me (and probably to @nikic) is that it's querying it in a slightly weird place. If we're going to check it in the backend anyway, we might as well just check it in the same place we'd check for the attribute, instead of adding the attribute, then checking for it a couple passes later.

rnk added a comment. Sep 6 2022, 12:51 PM

> Well, really, this is querying the optimization level in the backend; that's what the code in TargetPassConfig::addISelPrepare does. The reason it feels weird to me (and probably to @nikic) is that it's querying it in a slightly weird place. If we're going to check it in the backend anyway, we might as well just check it in the same place we'd check for the attribute, instead of adding the attribute, then checking for it a couple passes later.

I see, sorry, I didn't read the patch closely. I think we should set this in the frontend, or check the optimization level in the code generator. Using an extra pass seems heavyweight.

> Well, really, this is querying the optimization level in the backend; that's what the code in TargetPassConfig::addISelPrepare does. The reason it feels weird to me (and probably to @nikic) is that it's querying it in a slightly weird place. If we're going to check it in the backend anyway, we might as well just check it in the same place we'd check for the attribute, instead of adding the attribute, then checking for it a couple passes later.
>
> I see, sorry, I didn't read the patch closely. I think we should set this in the frontend, or check the optimization level in the code generator. Using an extra pass seems heavyweight.

I think we are better off not having two ways to spell "don't tail-call here" (attribute & opt-level) that have to be checked every place we might tail-call. So, if doing it in the pass is weird, it should be set in the frontend.

The contract we have with the -O1 pipeline is to attempt to generate debuggable code as much as possible. For that reason I don't think the frontend should be responsible for adding the attribute.

As for a separate pass adding the attribute vs checking the opt level in the various isels, I like a separate pass better because it's more explicit and less magical, plus I'd have to go and touch all the various isels. Touching all the isels isn't a huge deal but it makes it easier to regress with future changes ("forgot to check opt level on top of disable-tail-calls"), and I'd have to make sure to not miss anything in the first place. Having only one way for a backend to check for disabling tail calls seems nicer.
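As a rough illustration of the kind of pass being discussed, here is a sketch against the new pass manager. The class name is invented and this is not the patch's actual code (which targets the codegen pipeline); the "disable-tail-calls" string attribute is the existing one that ISel already checks:

```cpp
#include "llvm/IR/Function.h"
#include "llvm/IR/PassManager.h"

using namespace llvm;

// Sketch: mark a function so ISel will not emit (non-musttail) tail
// calls in it. Intended to be scheduled only at -O0/-O1.
struct DisableTailCallsSketchPass
    : PassInfoMixin<DisableTailCallsSketchPass> {
  PreservedAnalyses run(Function &F, FunctionAnalysisManager &) {
    // Don't clobber an explicit setting that is already present.
    if (!F.hasFnAttribute("disable-tail-calls"))
      F.addFnAttr("disable-tail-calls", "true");
    // Only an attribute is added; no analyses are invalidated.
    return PreservedAnalyses::all();
  }
};
```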

jrtc27 added a comment. Sep 7 2022, 2:02 PM

> The contract we have with the -O1 pipeline is to attempt to generate debuggable code as much as possible

No it's not. What we say is:

    :option:`-O0` Means "no optimization": this level compiles the fastest and
    generates the most debuggable code.

    :option:`-O1` Somewhere between :option:`-O0` and :option:`-O2`.

    :option:`-O2` Moderate level of optimization which enables most
    optimizations.

...


    :option:`-Og` Like :option:`-O1`. In future versions, this option might
    disable different optimizations in order to improve debuggability.

> The contract we have with the -O1 pipeline is to attempt to generate debuggable code as much as possible
>
> No it's not. What we say is:
>
>     :option:`-O0` Means "no optimization": this level compiles the fastest and
>     generates the most debuggable code.
>
>     :option:`-O1` Somewhere between :option:`-O0` and :option:`-O2`.
>
>     :option:`-O2` Moderate level of optimization which enables most
>     optimizations.
>
> ...
>
>     :option:`-Og` Like :option:`-O1`. In future versions, this option might
>     disable different optimizations in order to improve debuggability.

The -O1 pipeline has been tuned in the past to attempt to keep debuggability, and in OptimizationLevel.h we say

/// Optimize quickly without destroying debuggability.
///
/// This level is tuned to produce a result from the optimizer as quickly
/// as possible and to avoid destroying debuggability. This tends to result
/// in a very good development mode where the compiled code will be
/// immediately executed as part of testing. As a consequence, where
/// possible, we would like to produce efficient-to-execute code, but not
/// if it significantly slows down compilation or would prevent even basic
/// debugging of the resulting binary.
///
/// As an example, complex loop transformations such as versioning,
/// vectorization, or fusion don't make sense here due to the degree to
/// which the executed code differs from the source code, and the compile
/// time cost.
static const OptimizationLevel O1;

So I'd say that currently we do expect -O1 to generate debuggable code. If there's ever a push for a separate -Og pipeline that's fine, but we don't have that right now.

> For that reason I don't think the frontend should be responsible for adding the attribute.

Not sure I follow - the "optnone at -O0" is implemented in the frontend, seems suitable to implement this one the same way, especially if we already have the attribute and infrastructure for it?

> Why does this need to use an attribute rather than querying optimization level in the backend?
>
> The attribute already exists and is used in several places. This patch just adds the attribute to more functions.

This seems less magical and more explicit. I also didn't want to make sure I had found every place that could possibly check the attribute and have potential future changes miss disabling tail calls for -O1.

And +1 to rnk's comment, although we only LTO at an IR level, not at an MIR/object file level.

Still trying to understand exactly what's going on with the symbolizer (which is just llvm-symbolizer)

> For that reason I don't think the frontend should be responsible for adding the attribute.
>
> Not sure I follow - the "optnone at -O0" is implemented in the frontend, seems suitable to implement this one the same way, especially if we already have the attribute and infrastructure for it?

This actually came up in a talk with some people recently.
It would be interesting to have some sort of monolithic pipeline where we embed -O0/1/2/3/s/z attributes per-function in the frontend and don't have per-optimization-level pipelines. Currently we have function attributes for -O0/s/z but not the others. -O1 is fairly distinct from -O2/3 in that we want to preserve debuggability. However, we don't have an -O1 function attribute yet, so I don't think what you're proposing makes sense with the current state of things where we express our intent via the optimization level parameter to the optimization pipeline.

> It would be interesting to have some sort of monolithic pipeline where we embed -O0/1/2/3/s/z attributes per-function in the frontend and don't have per-optimization-level pipelines.

I would balk at this. A single generic pipeline at -O0 would be a compile-time performance hit, compared to the separate pipeline, and one of the explicit goals of -O0 is fast compilation. I'd expect a hit also at -O1 although probably less noticeable.
(You could experiment with this today, by compiling -emit-llvm -O0 to get IR with optnone everywhere, then compare compiling that IR at O0 vs O3.)
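That experiment could be sketched as something like the following (the exact commands are an assumption, not taken from the thread; note that clang's -O0 adds optnone to every function unless the cc1 flag -disable-O0-optnone is passed):

```shell
# Emit IR at -O0: every function gets the optnone attribute.
clang -O0 -emit-llvm -c input.c -o input.bc

# Compare the dedicated -O0 pipeline against the full -O3 pipeline
# running over the same optnone-everywhere IR.
time clang -O0 -c input.bc -o input-O0.o
time clang -O3 -c input.bc -o input-O3.o
```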

> For that reason I don't think the frontend should be responsible for adding the attribute.
>
> Not sure I follow - the "optnone at -O0" is implemented in the frontend, seems suitable to implement this one the same way, especially if we already have the attribute and infrastructure for it?
>
> This actually came up in a talk with some people recently.
> It would be interesting to have some sort of monolithic pipeline where we embed -O0/1/2/3/s/z attributes per-function in the frontend and don't have per-optimization-level pipelines. Currently we have function attributes for -O0/s/z but not the others. -O1 is fairly distinct from -O2/3 in that we want to preserve debuggability. However, we don't have an -O1 function attribute yet, so I don't think what you're proposing makes sense with the current state of things where we express our intent via the optimization level parameter to the optimization pipeline.

At least when I think optnone (or maybe optsize, I forget - I remember it was @chandlerc that was expressing these opinions most vocally) was proposed there was some distinction drawn between semantically meaningful differences (that "optimizing for size" and "not optimizing at all" were observable changes in behavior, in some sense - for debugging, for fitting in certain embedded devices, etc) and "varying shades of optimization" (essentially between -O2 and -O3) - and that the latter didn't seem suitable for per-function granularity expression (& so should be expressed only by the optimization pipeline, not embedded in the IR), but the former did.

As -O1 becomes/meanders near -Og, I guess we might end up with another of these "but this needs to be carried in the IR/is semantically meaningful to the end user" attributes ("optdebug"?) - but I don't know that we're there yet. I think if -O1/-Og's desire to be debuggable can be expressed by the frontend as much as possible with existing attributes in a similar way to -O0/optnone, that's a good thing - makes it more compatible with LTO, at least?

> It would be interesting to have some sort of monolithic pipeline where we embed -O0/1/2/3/s/z attributes per-function in the frontend and don't have per-optimization-level pipelines.
>
> I would balk at this. A single generic pipeline at -O0 would be a compile-time performance hit, compared to the separate pipeline, and one of the explicit goals of -O0 is fast compilation. I'd expect a hit also at -O1 although probably less noticeable.
> (You could experiment with this today, by compiling -emit-llvm -O0 to get IR with optnone everywhere, then compare compiling that IR at O0 vs O3.)

+1
I would be very much against a single pipeline that would run a bunch of passes that check an attribute to decide whether they should do anything. Sounds like it would also prevent building dynamic pass pipelines which ORC JIT currently allows the user to do IIRC.

aeubanks planned changes to this revision. Oct 3 2022, 10:09 PM

> At least when I think optnone (or maybe optsize, I forget - I remember it was @chandlerc that was expressing these opinions most vocally) was proposed there was some distinction drawn between semantically meaningful differences (that "optimizing for size" and "not optimizing at all" were observable changes in behavior, in some sense - for debugging, for fitting in certain embedded devices, etc) and "varying shades of optimization" (essentially between -O2 and -O3) - and that the latter didn't seem suitable for per-function granularity expression (& so should be expressed only by the optimization pipeline, not embedded in the IR), but the former did.

The talk I had was actually with Chandler, who liked the idea of a unified pipeline that looked at function attributes. (I remembered you bringing up before that Chandler was against this, and mentioned that to him, but he had no recollection of being against something like this.)

But anyway, we're getting off topic, I'm not actually proposing a unified optimization pipeline. I'll need to find time to understand the tail call vs inlining hurting stack traces issue.

> At least when I think optnone (or maybe optsize, I forget - I remember it was @chandlerc that was expressing these opinions most vocally) was proposed there was some distinction drawn between semantically meaningful differences (that "optimizing for size" and "not optimizing at all" were observable changes in behavior, in some sense - for debugging, for fitting in certain embedded devices, etc) and "varying shades of optimization" (essentially between -O2 and -O3) - and that the latter didn't seem suitable for per-function granularity expression (& so should be expressed only by the optimization pipeline, not embedded in the IR), but the former did.
>
> The talk I had was actually with Chandler who liked the idea of a unified pipeline that looked at function attributes. (I seemed to remember that you had brought up Chandler being against this before and mentioned it but he had no recollection of being against something like this).

Hmm - perhaps I'm misremembering and/or things just got jumbled up over the years and the people involved in different parts of the conversation, etc. :/

> But anyway, we're getting off topic, I'm not actually proposing a unified optimization pipeline. I'll need to find time to understand the tail call vs inlining hurting stack traces issue.

*nod* Sounds good - basically, I think the issue is that DWARFv5-or-gcc-extension call site descriptions could solve some amount of tail call lossage in backtraces, but wouldn't guarantee all tail calls could get reconstructed - but it'd be nice to see how much we could get covered with that debug info. Maybe it'd be "enough"?