This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
docs/
-
LangRef.rst
-
include/llvm/IR/
-
llvm/
-
IR/
-
Instructions.h
-
lib/
-
AsmParser/
-
LLLexer.cpp
-
LLParser.cpp
-
LLToken.h
-
Bitcode/
-
Reader/
-
BitcodeReader.cpp
-
Writer/
-
BitcodeWriter.cpp
-
IR/
-
AsmWriter.cpp
-
Transforms/Scalar/
-
Scalar/
-
TailRecursionElimination.cpp
-
test/
-
Bitcode/
-
compatibility.ll
-
Transforms/TailCallElim/
-
TailCallElim/
-
notail.ll

Differential D12923

Add support for function attribute "notail"
ClosedPublic

Authored by ahatanak on Sep 16 2015, 7:27 PM.

Download Raw Diff

Details

Reviewers

rnk

Commits

rG5cfcce12eb04: Add 'notail' marker for call instructions.
rL252368: Add 'notail' marker for call instructions.

Summary

This patch adds support for a new IR function attribute "notail". The attribute is used to disable tail call optimization on calls to functions marked with the attribute.

This attribute is different from the existing attribute "disable-tail-calls", which disables tail call optimizations on all call sites within the marked function.

The patch to add support for the corresponding source-level function attribute is here:
http://reviews.llvm.org/D12922

Diff Detail

Repository: rL LLVM

Event Timeline

ahatanak updated this revision to Diff 34961.Sep 16 2015, 7:27 PM

ahatanak retitled this revision from to Add support for function attribute "notail".

ahatanak updated this object.

ahatanak added a subscriber: llvm-commits.

Does this mean LLVM will not longer be able to generate indirect tail calls (tail calls when the target function is not known statically); because the target function /could/ have been marked notail?

For instance, clang 3.7.0 optimizes

typedef int (g)(void *, int);

int f(g* ptr, int x) {
  return ptr(ptr, x);
}

into

	.section	__TEXT,__text,regular,pure_instructions
	.macosx_version_min 10, 10
	.globl	_f
	.align	4, 0x90
_f:                                     ## @f
	.cfi_startproc
## BB#0:
	pushq	%rbp
Ltmp0:
	.cfi_def_cfa_offset 16
Ltmp1:
	.cfi_offset %rbp, -16
	movq	%rsp, %rbp
Ltmp2:
	.cfi_def_cfa_register %rbp
	popq	%rbp
	jmpq	*%rdi                   ## TAILCALL
	.cfi_endproc


.subsections_via_symbols

[Edit] And this is now illegal since ptr can be a function in another translation unit marked notail.

Could you add a small test to Bitcode/compatibility.ll in the function attributes section? E.g

; CHECK: define void @f.notail() #35
define void @f.notail() notail {
  ret void
}
; CHECK: attributes #35 = { notail }

No, tail call optimization doesn't have to be disabled on indirect calls.

In D12923#247639, @ahatanak wrote:

No, tail call optimization doesn't have to be disabled on indirect calls.

What if the indirect callee is marked with notail? Isn't the whole point of notail that functions marked notail won't have tail calls to them?

Perhaps I'm missing something here, so it helps to be concrete -- in the C example I gave, what's the expected behavior of the program if the function pointer passed to f via ptr is a notail function?

I think the notail should be on the call instruction, not on the callee; as some sort of "inverse" of musttail. Then we can teach the optimizer to never codegen a notail call as a tail call.

Added a small test to Bitcode/compatibility.ll.
Changed attr-notail.ll to demonstrate attribute "notail" overrides the "tail" marker but doesn't override the "musttail" marker.

In D12923#247639, @ahatanak wrote:

No, tail call optimization doesn't have to be disabled on indirect calls.

An IR example of the issue I'm trying to point out is this:

define i32 @caller(i32 %a) {
entry:
  %p = alloca i32(i32)*
  store i32(i32)* @callee, i32(i32)** %p

  %ptr = load i32(i32)*, i32(i32)** %p
  %call = call i32 %ptr(i32 %a)
  ret i32 %call
}

declare i32 @callee(i32) #0

attributes #0 = { notail }

If you pass the above through opt -tailcallelim -mem2reg (opt built with this patch) you'll get

define i32 @caller(i32 %a) {
entry:
  %call = tail call i32 @callee(i32 %a)
  ret i32 %call
}

; Function Attrs: notail
declare i32 @callee(i32) #0

attributes #0 = { notail }

which is what you're trying to avoid with notail, if I understand this change correctly.

In D12923#247641, @sanjoy wrote:

In D12923#247639, @ahatanak wrote:

No, tail call optimization doesn't have to be disabled on indirect calls.

What if the indirect callee is marked with notail? Isn't the whole point of notail that functions marked notail won't have tail calls to them?

Yes, that's correct. We want to avoid doing tail call optimization on a call site if the compiler knows it is a call to a notail function. If the call is an indirect call, it's not always possible to know that.

Perhaps I'm missing something here, so it helps to be concrete -- in the C example I gave, what's the expected behavior of the program if the function pointer passed to f via ptr is a notail function?

It should have no effect. If the compiler cannot determine a call site is a call to a nontail function, it doesn't block tail call optimization.

I think the notail should be on the call instruction, not on the callee; as some sort of "inverse" of musttail. Then we can teach the optimizer to never codegen a notail call as a tail call.

I think attaching notail to the call instruction is more limiting than attaching it to the callee function in some cases. Suppose we are compiling a code like this with -O3:

int f(g* ptr, int x) {

return ptr(ptr, x);

}

int __attribute((notail)) foo2(void*, int);

int foo1(int a) {

return f(foo2, a);

}

After all the optimization passes (including the inliner) are run, the IR will look like this:

define i32 @foo1(i32 %a) #0 {
entry:

%call.i = call i32 @foo2(i8* bitcast (i32 (i8*, i32)* @foo2 to i8*), i32 %a) #2
ret i32 %call.i

}

If the notail attribute was on the call instruction, there would be no way to tell tail call optimization shouldn't be done on the call to foo2.

So, in your example, you wouldn't be able to block tail call optimization if the attribute was on the call site. Is that correct?

To elaborate on my previous comment, the notail attributes should block tail call optimization on as many call sites as possible, but it's okay if it can't block some of the indirect call sites if the compiler cannot determine which function is being called.

In D12923#247647, @ahatanak wrote:

It should have no effect. If the compiler cannot determine a call site is a call to a nontail function, it doesn't block tail call optimization.

So this (and what you said below) is fine as long as not obeying notail won't result in an
incorrect program; and notail on a function only prevents tail calls
to that function on a best effort basis. Is that the direction of
this patch? If so, then please clarify that on the langref.

Otherwise (i.e. if you *require* notail to prevent tail calls for
correctness) I don't see how you can get avoid making all indirect
calls non-tail calls.

I think the notail should be on the call instruction, not on the callee; as some sort of "inverse" of musttail. Then we can teach the optimizer to never codegen a notail call as a tail call.

I think attaching notail to the call instruction is more limiting than attaching it to the callee function in some cases. Suppose we are compiling a code like this with -O3:

int f(g* ptr, int x) {
return ptr(ptr, x);
}

int __attribute((notail)) foo2(void*, int);

int foo1(int a) {
return f(foo2, a);
}

After all the optimization passes (including the inliner) are run, the IR will look like this:

define i32 @foo1(i32 %a) #0 {
entry:
%call.i = call i32 @foo2(i8* bitcast (i32 (i8*, i32)* @foo2 to i8*), i32 %a) #2
ret i32 %call.i
}

If the notail attribute was on the call instruction, there would be no way to tell tail call optimization shouldn't be done on the call to foo2.

That's correct. Preventing tail call on a best effort basis is good enough for the use cases we care about. I'll update the langref to clarify that.

This patch makes changes to add the "notail" marker to call instructions instead of using a function attribute.

Seems reasonable. Where is the LangRef change for this?

Added description in LangRef.

Let me know if you think there are changes I can make to improve it.

lgtm

This revision is now accepted and ready to land.Nov 4 2015, 4:05 PM

spatel added a subscriber: spatel.Nov 5 2015, 8:56 AM

spatel added inline comments.

lib/Bitcode/Writer/BitcodeWriter.cpp
2132–2134 ↗	(On Diff #39283)	Hi Akira - Can you give these bitfields proper names in a struct or enum in LLVMBitCodes.h? It took me a while to understand why we have this encoding (no code comments...). The other reason I ask is because I was about to swipe bit 16 myself. :) I think that's the only backwards-compatible way to add fast-math-flags to a call ( PR21290 ). We can't use the usual method of tacking an optional field to the end of the record because the record length is unknown for a call with varargs.

ahatanak added inline comments.Nov 5 2015, 7:04 PM

lib/Bitcode/Writer/BitcodeWriter.cpp
2132–2134 ↗	(On Diff #39283)	Hi Sanjay. I've made the changes you suggested in my local branch, but I think they should be in a follow-up patch to separate the changes related to notail from the changes related to the bitfield names. These are the enums I defined in LLVMBitCodes.h: enum CallMarkersFlags { CALL_TAIL = 0, CALL_CCONV = 1, CALL_MUSTTAIL = 14, CALL_EXPLICIT_TYPE = 15, CALL_NOTAIL = 16 };

spatel added inline comments.Nov 6 2015, 8:28 AM

lib/Bitcode/Writer/BitcodeWriter.cpp
2132–2134 ↗	(On Diff #39283)	Yes, a follow-up NFC patch sounds right. Thanks!

Closed by commit rL252368: Add 'notail' marker for call instructions. (authored by ahatanak). · Explain WhyNov 6 2015, 3:58 PM

This revision was automatically updated to reflect the committed changes.

jevinskie added a subscriber: jevinskie.Jan 19 2016, 6:07 PM

Revision Contents

Path

Size

llvm/

trunk/

docs/

LangRef.rst

6 lines

include/

llvm/

IR/

Instructions.h

9 lines

lib/

AsmParser/

LLLexer.cpp

1 line

LLParser.cpp

1 line

LLToken.h

1 line

Bitcode/

Reader/

BitcodeReader.cpp

2 lines

Writer/

BitcodeWriter.cpp

3 lines

IR/

AsmWriter.cpp

2 lines

Transforms/

Scalar/

TailRecursionElimination.cpp

6 lines

test/

Bitcode/

compatibility.ll

7 lines

Transforms/

TailCallElim/

notail.ll

24 lines

Diff 39606

llvm/trunk/docs/LangRef.rst

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 8,380 Lines • ▼ Show 20 Lines
'``call``' Instruction		'``call``' Instruction
^^^^^^^^^^^^^^^^^^^^^^		^^^^^^^^^^^^^^^^^^^^^^

Syntax:		Syntax:
"""""""		"""""""

::		::

<result> = [tail \| musttail] call [cconv] [ret attrs] <ty> [<fnty>*] <fnptrval>(<function args>) [fn attrs]		<result> = [tail \| musttail \| notail ] call [cconv] [ret attrs] <ty> [<fnty>*] <fnptrval>(<function args>) [fn attrs]
[ operand bundles ]		[ operand bundles ]

Overview:		Overview:
"""""""""		"""""""""

The '``call``' instruction represents a simple function call.		The '``call``' instruction represents a simple function call.

Arguments:		Arguments:
Show All 36 Lines	#. The optional ``tail`` and ``musttail`` markers indicate that the optimizers
- Caller and callee both have the calling convention ``fastcc``.		- Caller and callee both have the calling convention ``fastcc``.
- The call is in tail position (ret immediately follows call and ret		- The call is in tail position (ret immediately follows call and ret
uses value of call or is void).		uses value of call or is void).
- Option ``-tailcallopt`` is enabled, or		- Option ``-tailcallopt`` is enabled, or
``llvm::GuaranteedTailCallOpt`` is ``true``.		``llvm::GuaranteedTailCallOpt`` is ``true``.
- `Platform-specific constraints are		- `Platform-specific constraints are
met. <CodeGenerator.html#tailcallopt>`_		met. <CodeGenerator.html#tailcallopt>`_

		#. The optional ``notail`` marker indicates that the optimizers should not add
		``tail`` or ``musttail`` markers to the call. It is used to prevent tail
		call optimization from being performed on the call.

#. The optional "cconv" marker indicates which :ref:`calling		#. The optional "cconv" marker indicates which :ref:`calling
convention <callingconv>` the call should use. If none is		convention <callingconv>` the call should use. If none is
specified, the call defaults to using C calling conventions. The		specified, the call defaults to using C calling conventions. The
calling convention of the call must match the calling convention of		calling convention of the call must match the calling convention of
the target function, or else the behavior is undefined.		the target function, or else the behavior is undefined.
#. The optional :ref:`Parameter Attributes <paramattrs>` list for return		#. The optional :ref:`Parameter Attributes <paramattrs>` list for return
values. Only '``zeroext``', '``signext``', and '``inreg``' attributes		values. Only '``zeroext``', '``signext``', and '``inreg``' attributes
are valid here.		are valid here.
▲ Show 20 Lines • Show All 3,595 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/IR/Instructions.h

Show First 20 Lines • Show All 1,483 Lines • ▼ Show 20 Lines	public:
FunctionType *getFunctionType() const { return FTy; }		FunctionType *getFunctionType() const { return FTy; }

void mutateFunctionType(FunctionType *FTy) {		void mutateFunctionType(FunctionType *FTy) {
mutateType(FTy->getReturnType());		mutateType(FTy->getReturnType());
this->FTy = FTy;		this->FTy = FTy;
}		}

// Note that 'musttail' implies 'tail'.		// Note that 'musttail' implies 'tail'.
enum TailCallKind { TCK_None = 0, TCK_Tail = 1, TCK_MustTail = 2 };		enum TailCallKind { TCK_None = 0, TCK_Tail = 1, TCK_MustTail = 2,
		TCK_NoTail = 3 };
TailCallKind getTailCallKind() const {		TailCallKind getTailCallKind() const {
return TailCallKind(getSubclassDataFromInstruction() & 3);		return TailCallKind(getSubclassDataFromInstruction() & 3);
}		}
bool isTailCall() const {		bool isTailCall() const {
return (getSubclassDataFromInstruction() & 3) != TCK_None;		unsigned Kind = getSubclassDataFromInstruction() & 3;
		return Kind == TCK_Tail \|\| Kind == TCK_MustTail;
}		}
bool isMustTailCall() const {		bool isMustTailCall() const {
return (getSubclassDataFromInstruction() & 3) == TCK_MustTail;		return (getSubclassDataFromInstruction() & 3) == TCK_MustTail;
}		}
		bool isNoTailCall() const {
		return (getSubclassDataFromInstruction() & 3) == TCK_NoTail;
		}
void setTailCall(bool isTC = true) {		void setTailCall(bool isTC = true) {
setInstructionSubclassData((getSubclassDataFromInstruction() & ~3) \|		setInstructionSubclassData((getSubclassDataFromInstruction() & ~3) \|
unsigned(isTC ? TCK_Tail : TCK_None));		unsigned(isTC ? TCK_Tail : TCK_None));
}		}
void setTailCallKind(TailCallKind TCK) {		void setTailCallKind(TailCallKind TCK) {
setInstructionSubclassData((getSubclassDataFromInstruction() & ~3) \|		setInstructionSubclassData((getSubclassDataFromInstruction() & ~3) \|
unsigned(TCK));		unsigned(TCK));
}		}
▲ Show 20 Lines • Show All 3,406 Lines • Show Last 20 Lines

llvm/trunk/lib/AsmParser/LLLexer.cpp

Show First 20 Lines • Show All 521 Lines • ▼ Show 20 Lines	#define KEYWORD(STR) \
KEYWORD(localexec);		KEYWORD(localexec);
KEYWORD(zeroinitializer);		KEYWORD(zeroinitializer);
KEYWORD(undef);		KEYWORD(undef);
KEYWORD(null);		KEYWORD(null);
KEYWORD(to);		KEYWORD(to);
KEYWORD(caller);		KEYWORD(caller);
KEYWORD(tail);		KEYWORD(tail);
KEYWORD(musttail);		KEYWORD(musttail);
		KEYWORD(notail);
KEYWORD(target);		KEYWORD(target);
KEYWORD(triple);		KEYWORD(triple);
KEYWORD(unwind);		KEYWORD(unwind);
KEYWORD(deplibs); // FIXME: Remove in 4.0.		KEYWORD(deplibs); // FIXME: Remove in 4.0.
KEYWORD(datalayout);		KEYWORD(datalayout);
KEYWORD(volatile);		KEYWORD(volatile);
KEYWORD(atomic);		KEYWORD(atomic);
KEYWORD(unordered);		KEYWORD(unordered);
▲ Show 20 Lines • Show All 438 Lines • Show Last 20 Lines

llvm/trunk/lib/AsmParser/LLParser.cpp

Show First 20 Lines • Show All 4,824 Lines • ▼ Show 20 Lines	int LLParser::ParseInstruction(Instruction &Inst, BasicBlock BB,
case lltok::kw_insertelement: return ParseInsertElement(Inst, PFS);		case lltok::kw_insertelement: return ParseInsertElement(Inst, PFS);
case lltok::kw_shufflevector: return ParseShuffleVector(Inst, PFS);		case lltok::kw_shufflevector: return ParseShuffleVector(Inst, PFS);
case lltok::kw_phi: return ParsePHI(Inst, PFS);		case lltok::kw_phi: return ParsePHI(Inst, PFS);
case lltok::kw_landingpad: return ParseLandingPad(Inst, PFS);		case lltok::kw_landingpad: return ParseLandingPad(Inst, PFS);
// Call.		// Call.
case lltok::kw_call: return ParseCall(Inst, PFS, CallInst::TCK_None);		case lltok::kw_call: return ParseCall(Inst, PFS, CallInst::TCK_None);
case lltok::kw_tail: return ParseCall(Inst, PFS, CallInst::TCK_Tail);		case lltok::kw_tail: return ParseCall(Inst, PFS, CallInst::TCK_Tail);
case lltok::kw_musttail: return ParseCall(Inst, PFS, CallInst::TCK_MustTail);		case lltok::kw_musttail: return ParseCall(Inst, PFS, CallInst::TCK_MustTail);
		case lltok::kw_notail: return ParseCall(Inst, PFS, CallInst::TCK_NoTail);
// Memory.		// Memory.
case lltok::kw_alloca: return ParseAlloc(Inst, PFS);		case lltok::kw_alloca: return ParseAlloc(Inst, PFS);
case lltok::kw_load: return ParseLoad(Inst, PFS);		case lltok::kw_load: return ParseLoad(Inst, PFS);
case lltok::kw_store: return ParseStore(Inst, PFS);		case lltok::kw_store: return ParseStore(Inst, PFS);
case lltok::kw_cmpxchg: return ParseCmpXchg(Inst, PFS);		case lltok::kw_cmpxchg: return ParseCmpXchg(Inst, PFS);
case lltok::kw_atomicrmw: return ParseAtomicRMW(Inst, PFS);		case lltok::kw_atomicrmw: return ParseAtomicRMW(Inst, PFS);
case lltok::kw_fence: return ParseFence(Inst, PFS);		case lltok::kw_fence: return ParseFence(Inst, PFS);
case lltok::kw_getelementptr: return ParseGetElementPtr(Inst, PFS);		case lltok::kw_getelementptr: return ParseGetElementPtr(Inst, PFS);
▲ Show 20 Lines • Show All 1,433 Lines • Show Last 20 Lines

llvm/trunk/lib/AsmParser/LLToken.h

Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	enum Kind {
kw_external, kw_thread_local,		kw_external, kw_thread_local,
kw_localdynamic, kw_initialexec, kw_localexec,		kw_localdynamic, kw_initialexec, kw_localexec,
kw_zeroinitializer,		kw_zeroinitializer,
kw_undef, kw_null,		kw_undef, kw_null,
kw_to,		kw_to,
kw_caller,		kw_caller,
kw_tail,		kw_tail,
kw_musttail,		kw_musttail,
		kw_notail,
kw_target,		kw_target,
kw_triple,		kw_triple,
kw_unwind,		kw_unwind,
kw_deplibs, // FIXME: Remove in 4.0		kw_deplibs, // FIXME: Remove in 4.0
kw_datalayout,		kw_datalayout,
kw_volatile,		kw_volatile,
kw_atomic,		kw_atomic,
kw_unordered, kw_monotonic, kw_acquire, kw_release, kw_acq_rel, kw_seq_cst,		kw_unordered, kw_monotonic, kw_acquire, kw_release, kw_acq_rel, kw_seq_cst,
▲ Show 20 Lines • Show All 162 Lines • Show Last 20 Lines

llvm/trunk/lib/Bitcode/Reader/BitcodeReader.cpp

Show First 20 Lines • Show All 4,996 Lines • ▼ Show 20 Lines	case bitc::FUNC_CODE_INST_CALL: {
InstructionList.push_back(I);		InstructionList.push_back(I);
cast<CallInst>(I)->setCallingConv(		cast<CallInst>(I)->setCallingConv(
static_cast<CallingConv::ID>((0x7ff & CCInfo) >> 1));		static_cast<CallingConv::ID>((0x7ff & CCInfo) >> 1));
CallInst::TailCallKind TCK = CallInst::TCK_None;		CallInst::TailCallKind TCK = CallInst::TCK_None;
if (CCInfo & 1)		if (CCInfo & 1)
TCK = CallInst::TCK_Tail;		TCK = CallInst::TCK_Tail;
if (CCInfo & (1 << 14))		if (CCInfo & (1 << 14))
TCK = CallInst::TCK_MustTail;		TCK = CallInst::TCK_MustTail;
		if (CCInfo & (1 << 16))
		TCK = CallInst::TCK_NoTail;
cast<CallInst>(I)->setTailCallKind(TCK);		cast<CallInst>(I)->setTailCallKind(TCK);
cast<CallInst>(I)->setAttributes(PAL);		cast<CallInst>(I)->setAttributes(PAL);
break;		break;
}		}
case bitc::FUNC_CODE_INST_VAARG: { // VAARG: [valistty, valist, instty]		case bitc::FUNC_CODE_INST_VAARG: { // VAARG: [valistty, valist, instty]
if (Record.size() < 3)		if (Record.size() < 3)
return error("Invalid record");		return error("Invalid record");
Type *OpTy = getTypeByID(Record[0]);		Type *OpTy = getTypeByID(Record[0]);
▲ Show 20 Lines • Show All 902 Lines • Show Last 20 Lines

llvm/trunk/lib/Bitcode/Writer/BitcodeWriter.cpp

Show First 20 Lines • Show All 2,125 Lines • ▼ Show 20 Lines	case Instruction::Call: {

if (CI.hasOperandBundles())		if (CI.hasOperandBundles())
WriteOperandBundles(Stream, &CI, InstID, VE);		WriteOperandBundles(Stream, &CI, InstID, VE);

Code = bitc::FUNC_CODE_INST_CALL;		Code = bitc::FUNC_CODE_INST_CALL;

Vals.push_back(VE.getAttributeID(CI.getAttributes()));		Vals.push_back(VE.getAttributeID(CI.getAttributes()));
Vals.push_back((CI.getCallingConv() << 1) \| unsigned(CI.isTailCall()) \|		Vals.push_back((CI.getCallingConv() << 1) \| unsigned(CI.isTailCall()) \|
unsigned(CI.isMustTailCall()) << 14 \| 1 << 15);		unsigned(CI.isMustTailCall()) << 14 \| 1 << 15 \|
		unsigned(CI.isNoTailCall()) << 16);
Vals.push_back(VE.getTypeID(FTy));		Vals.push_back(VE.getTypeID(FTy));
PushValueAndType(CI.getCalledValue(), InstID, Vals, VE); // Callee		PushValueAndType(CI.getCalledValue(), InstID, Vals, VE); // Callee

// Emit value #'s for the fixed parameters.		// Emit value #'s for the fixed parameters.
for (unsigned i = 0, e = FTy->getNumParams(); i != e; ++i) {		for (unsigned i = 0, e = FTy->getNumParams(); i != e; ++i) {
// Check for labels (can happen with asm labels).		// Check for labels (can happen with asm labels).
if (FTy->getParamType(i)->isLabelTy())		if (FTy->getParamType(i)->isLabelTy())
Vals.push_back(VE.getValueID(CI.getArgOperand(i)));		Vals.push_back(VE.getValueID(CI.getArgOperand(i)));
▲ Show 20 Lines • Show All 941 Lines • Show Last 20 Lines

llvm/trunk/lib/IR/AsmWriter.cpp

Show First 20 Lines • Show All 2,762 Lines • ▼ Show 20 Lines	else
Out << '%' << SlotNum << " = ";		Out << '%' << SlotNum << " = ";
}		}

if (const CallInst *CI = dyn_cast<CallInst>(&I)) {		if (const CallInst *CI = dyn_cast<CallInst>(&I)) {
if (CI->isMustTailCall())		if (CI->isMustTailCall())
Out << "musttail ";		Out << "musttail ";
else if (CI->isTailCall())		else if (CI->isTailCall())
Out << "tail ";		Out << "tail ";
		else if (CI->isNoTailCall())
		Out << "notail ";
}		}

// Print out the opcode...		// Print out the opcode...
Out << I.getOpcodeName();		Out << I.getOpcodeName();

// If this is an atomic load or store, print out the atomic marker.		// If this is an atomic load or store, print out the atomic marker.
if ((isa<LoadInst>(I) && cast<LoadInst>(I).isAtomic()) \|\|		if ((isa<LoadInst>(I) && cast<LoadInst>(I).isAtomic()) \|\|
(isa<StoreInst>(I) && cast<StoreInst>(I).isAtomic()))		(isa<StoreInst>(I) && cast<StoreInst>(I).isAtomic()))
▲ Show 20 Lines • Show All 699 Lines • Show Last 20 Lines

llvm/trunk/lib/Transforms/Scalar/TailRecursionElimination.cpp

Show First 20 Lines • Show All 298 Lines • ▼ Show 20 Lines	do {
for (auto &I : *BB) {		for (auto &I : *BB) {
if (Tracker.EscapePoints.count(&I))		if (Tracker.EscapePoints.count(&I))
Escaped = ESCAPED;		Escaped = ESCAPED;

CallInst *CI = dyn_cast<CallInst>(&I);		CallInst *CI = dyn_cast<CallInst>(&I);
if (!CI \|\| CI->isTailCall())		if (!CI \|\| CI->isTailCall())
continue;		continue;

if (CI->doesNotAccessMemory()) {		bool IsNoTail = CI->isNoTailCall();

		if (!IsNoTail && CI->doesNotAccessMemory()) {
// A call to a readnone function whose arguments are all things computed		// A call to a readnone function whose arguments are all things computed
// outside this function can be marked tail. Even if you stored the		// outside this function can be marked tail. Even if you stored the
// alloca address into a global, a readnone function can't load the		// alloca address into a global, a readnone function can't load the
// global anyhow.		// global anyhow.
//		//
// Note that this runs whether we know an alloca has escaped or not. If		// Note that this runs whether we know an alloca has escaped or not. If
// it has, then we can't trust Tracker.AllocaUsers to be accurate.		// it has, then we can't trust Tracker.AllocaUsers to be accurate.
bool SafeToTail = true;		bool SafeToTail = true;
Show All 11 Lines	for (auto &I : *BB) {
F.getContext(), "tailcallelim", F, CI->getDebugLoc(),		F.getContext(), "tailcallelim", F, CI->getDebugLoc(),
"marked this readnone call a tail call candidate");		"marked this readnone call a tail call candidate");
CI->setTailCall();		CI->setTailCall();
Modified = true;		Modified = true;
continue;		continue;
}		}
}		}

if (Escaped == UNESCAPED && !Tracker.AllocaUsers.count(CI)) {		if (!IsNoTail && Escaped == UNESCAPED && !Tracker.AllocaUsers.count(CI)) {
DeferredTails.push_back(CI);		DeferredTails.push_back(CI);
} else {		} else {
AllCallsAreTailCalls = false;		AllCallsAreTailCalls = false;
}		}
}		}

for (auto *SuccBB : make_range(succ_begin(BB), succ_end(BB))) {		for (auto *SuccBB : make_range(succ_begin(BB), succ_end(BB))) {
auto &State = Visited[SuccBB];		auto &State = Visited[SuccBB];
▲ Show 20 Lines • Show All 511 Lines • Show Last 20 Lines

llvm/trunk/test/Bitcode/compatibility.ll

	Show First 20 Lines • Show All 1,138 Lines • ▼ Show 20 Lines

	define void @instructions.call_musttail(i8* inalloca %val) {			define void @instructions.call_musttail(i8* inalloca %val) {
	musttail call void @f.param.inalloca(i8* inalloca %val)			musttail call void @f.param.inalloca(i8* inalloca %val)
	; CHECK: musttail call void @f.param.inalloca(i8* inalloca %val)			; CHECK: musttail call void @f.param.inalloca(i8* inalloca %val)

	ret void			ret void
	}			}

				define void @instructions.call_notail() {
				notail call void @f1()
				; CHECK: notail call void @f1()

				ret void
				}

	define void @instructions.landingpad() personality i32 -2 {			define void @instructions.landingpad() personality i32 -2 {
	invoke void @llvm.donothing() to label %proceed unwind label %catch1			invoke void @llvm.donothing() to label %proceed unwind label %catch1
	invoke void @llvm.donothing() to label %proceed unwind label %catch2			invoke void @llvm.donothing() to label %proceed unwind label %catch2
	invoke void @llvm.donothing() to label %proceed unwind label %catch3			invoke void @llvm.donothing() to label %proceed unwind label %catch3
	invoke void @llvm.donothing() to label %proceed unwind label %catch4			invoke void @llvm.donothing() to label %proceed unwind label %catch4

	catch1:			catch1:
	landingpad i32			landingpad i32
	▲ Show 20 Lines • Show All 384 Lines • Show Last 20 Lines

llvm/trunk/test/Transforms/TailCallElim/notail.ll

				; RUN: opt < %s -tailcallelim -S \| FileCheck %s

				; CHECK: tail call void @callee0()
				; CHECK: notail call void @callee1()

				define void @foo1(i32 %a) {
				entry:
				%tobool = icmp eq i32 %a, 0
				br i1 %tobool, label %if.else, label %if.then

				if.then:
				call void @callee0()
				br label %if.end

				if.else:
				notail call void @callee1()
				br label %if.end

				if.end:
				ret void
				}

				declare void @callee0()
				declare void @callee1()

This is an archive of the discontinued LLVM Phabricator instance.

Add support for function attribute "notail"ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 39606

llvm/trunk/docs/LangRef.rst

llvm/trunk/include/llvm/IR/Instructions.h

llvm/trunk/lib/AsmParser/LLLexer.cpp

llvm/trunk/lib/AsmParser/LLParser.cpp

llvm/trunk/lib/AsmParser/LLToken.h

llvm/trunk/lib/Bitcode/Reader/BitcodeReader.cpp

llvm/trunk/lib/Bitcode/Writer/BitcodeWriter.cpp

llvm/trunk/lib/IR/AsmWriter.cpp

llvm/trunk/lib/Transforms/Scalar/TailRecursionElimination.cpp

llvm/trunk/test/Bitcode/compatibility.ll

llvm/trunk/test/Transforms/TailCallElim/notail.ll

Add support for function attribute "notail"
ClosedPublic